Ollama API
Copy Page
Ollama API
Endpoints
Conventions
Generate a completion
Overview
Generate request (Streaming)
POST
Request (No streaming)
POST
Request (with suffix)
POST
Request (Structured outputs)
POST
Request (JSON mode)
POST
Request (with images)
POST
Request (Raw Mode)
POST
Request (Reproducible outputs)
POST
Generate request (With options)
POST
Load a model
POST
Unload a model
POST
Generate a chat completion
Overview
Chat Request (Streaming)
POST
Chat request (No streaming)
POST
Chat request (Structured outputs)
POST
Chat request (With History)
POST
Chat request (with images)
POST
Chat request (Reproducible outputs)
POST
Chat request (with tools)
POST
Load a model
POST
Unload a model
POST
Create a Model
Overview
Create a new model
POST
Quantize a model
POST
Create a model from GGUF
POST
Create a model from a Safetensors directory
POST
Check if a Blob Exists
Overview
Push a Blob
Overview
List Local Models
Overview
Examples
Show Model Information
Overview
Examples
Copy a Model
Overview
Examples
Delete a Model
Overview
Examples
Pull a Model
Overview
Examples
Push a Model
Overview
Generate Embeddings
Overview
Examples
Request (Multiple input)
List Running Models
Overview
Examples
Generate Embedding
Overview
Examples
Version
Overview
Endpoints
Copy Page
Generate a completion
Generate a chat completion
Create a Model
List Local Models
Show Model Information
Copy a Model
Delete a Model
Pull a Model
Push a Model
Generate Embeddings
List Running Models
Version
Modified at
2025-03-14 07:50:56
Next
Conventions