145 Commits (4e1ff6dcbbc61b0cd266885d50faaaf8c7f667de)

Author SHA1 Message Date
Jeffrey Morgan 993cf8bf55
llm: limit generation to 10x context size to avoid run on generations (#3918) 2 years ago
Daniel Hiltgen 34b9db5afc Request and model concurrency 2 years ago
Cheng 62be2050dd
chore: use errors.New to replace fmt.Errorf will much better (#3789) 2 years ago
Eli Bendersky ad90b9ab3d
api: start adding documentation to package api (#2878) 2 years ago
Michael Yang 01114b4526 fix: rope 2 years ago
Michael Yang 9502e5661f cgo quantize 2 years ago
Michael Yang e1c9a2a00f no blob create if already exists 2 years ago
Michael Yang be517e491c no rope parameters 2 years ago
Patrick Devine 1b272d5bcd
change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347) 2 years ago
Patrick Devine 47cfe58af5
Default Keep Alive environment variable (#3094) 2 years ago
Jeffrey Morgan 3b4bab3dc5
Fix embeddings load model behavior (#2848) 2 years ago
Ikko Eltociear Ashimine e95b896790
Update types.go (#2744) 2 years ago
Michael Yang 897b213468
use http.DefaultClient (#2530) 2 years ago
bnorick caf2b13c10
Fix infinite keep_alive (#2480) 2 years ago
Patrick Devine b5cf31b460
add keep_alive to generate/chat/embedding api endpoints (#2146) 2 years ago
Patrick Devine 7c40a67841
Save and load sessions (#2063) 2 years ago
Michael Yang 745b5934fa add model to ModelResponse 2 years ago
Michael Yang a38d88d828 api: add model for all requests 2 years ago
Michael Yang 5ffbbea1d7 remove client.py 2 years ago
Patrick Devine 22e93efa41 add show info command and fix the modelfile 2 years ago
Brian Murray 0d6e3565ae
Add embeddings to API (#1773) 2 years ago
Jeffrey Morgan 55978c1dc9 clean up cache api option 2 years ago
Jeffrey Morgan d4ebdadbe7 enable `cache_prompt` by default 2 years ago
K0IN 10da41d677
Add Cache flag to api (#1642) 2 years ago
Bruce MacDonald d99fa6ce0a
send empty messages on last chat response (#1530) 2 years ago
Patrick Devine d9e60f634b
add image support to the chat api (#1490) 2 years ago
Patrick Devine 910e9401d0
Multimodal support (#1216) 2 years ago
Jeffrey Morgan 9e1406e4ed Don't expose model information in `/api/generate` 2 years ago
Michael Yang 5d75505ebd return model configuration in generate 2 years ago
Bruce MacDonald 195e3d9dbd
chat api endpoint (#1392) 2 years ago
Michael Yang 0db4706ec2 api: add version api handler 2 years ago
Jeffrey Morgan 00d06619a1 Revert "chat api (#991)" while context variable is fixed 2 years ago
Bruce MacDonald 7a0899d62d
chat api (#991) 2 years ago
Patrick Devine cde31cb220
Allow setting parameters in the REPL (#1294) 2 years ago
Bruce MacDonald 928950fcc6
update python client create example (#1227) 2 years ago
Michael Yang bc22d5a38b no blob response 2 years ago
Michael Yang 1901044b07 use checksum reference 2 years ago
Michael Yang 1552cee59f client create modelfile 2 years ago
Michael Yang 3ca56b5ada add create modelfile field 2 years ago
Jeffrey Morgan cdddd3df65 add `format` to example python client 2 years ago
Jeffrey Morgan 5cba29b9d6
JSON mode: add `"format" as an api parameter (#1051) 2 years ago
Bruce MacDonald a49d6acc1e
add a complete /generate options example (#1035) 2 years ago
Bruce MacDonald ec2a31e9b3
support raw generation requests (#952) 2 years ago
Jeffrey Morgan 17678b7225 Restore system prompt on requests and default `num_keep` to `0` 2 years ago
Jeffrey Morgan 06589a3b30
Set `NumKeep` to `4` by default (#982) 2 years ago
Michael Yang 6db3691b8f update default NumKeep 2 years ago
Michael Yang 60bb3c03a1 use http.Method 2 years ago
Bruce MacDonald 5c3491f425
allow for a configurable ollama model storage directory (#897) 2 years ago
Michael Yang 28c3f288e2 client: fix trailing slash 2 years ago
Michael Yang 459f4a7889 fix: ollama host for hostname 2 years ago