Ollama

Run large language models with llama.cpp.

Note: certain models that can be run with this project are intended for research and/or non-commercial use only.

Features

  • Download and run popular large language models
  • Switch between multiple models on the fly
  • Hardware acceleration where available (Metal, CUDA)
  • Fast inference server written in Go, powered by llama.cpp
  • REST API to use from your application (Python and TypeScript SDKs coming soon)

Install

  • Download for macOS
  • Download for Windows (coming soon)
  • Docker: docker run -p 11434:11434 ollama/ollama

You can also build the binary from source.

Quickstart

Run the model that started it all.

ollama run llama

Example models

💬 Chat

Have a conversation.

ollama run vicuna "Why is the sky blue?"

🗺️ Instructions

Ask questions. Get answers.

ollama run orca "Write an email to my boss."

👩‍💻 Code completion

Sometimes you just need a little help writing code.

ollama run replit "Give me react code to render a button"

📖 Storytelling

Venture into the unknown.

ollama run nous-hermes "Once upon a time"

Advanced usage

Run a local model

ollama run ~/Downloads/vicuna-7b-v1.3.ggmlv3.q4_1.bin

Building

make

To run it, start the server:

./ollama server &

Finally, run a model!

./ollama run ~/Downloads/vicuna-7b-v1.3.ggmlv3.q4_1.bin

API Reference

POST /api/pull

Download a model

curl -X POST http://localhost:11434/api/pull -d '{"model": "orca"}'

POST /api/generate

Complete a prompt

curl -X POST http://localhost:11434/api/generate -d '{"model": "orca", "prompt": "hello!", "stream": true}'
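Until the SDKs ship, calling the API from Python takes only the standard library. Below is a minimal sketch, assuming the server streams newline-delimited JSON objects and that each object carries a piece of the completion in a `response` field; the field name and framing are assumptions for illustration, not confirmed by this README.

```python
import json
import urllib.request


def generate(model, prompt, host="http://localhost:11434"):
    """POST to /api/generate and yield streamed completion chunks.

    Assumes a newline-delimited JSON stream where each line is an
    object with a `response` field (an assumption, see lead-in).
    """
    req = urllib.request.Request(
        f"{host}/api/generate",
        data=json.dumps({"model": model, "prompt": prompt, "stream": True}).encode(),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        for line in resp:
            yield json.loads(line).get("response", "")


def join_stream(lines):
    """Concatenate the `response` fields of received NDJSON lines."""
    return "".join(
        json.loads(line).get("response", "") for line in lines if line.strip()
    )
```

With the server running, something like `print("".join(generate("orca", "hello!")))` would print the full completion; `join_stream` does the same reassembly on lines you have already collected.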