ollama

Commit Graph

Author	SHA1	Message	Date
Jeffrey Morgan	993cf8bf55	llm: limit generation to 10x context size to avoid run on generations (#3918) * llm: limit generation to 10x context size to avoid run on generations * add comment * simplify condition statement	2 years ago
Daniel Hiltgen	34b9db5afc	Request and model concurrency This change adds support for multiple concurrent requests, as well as loading multiple models by spawning multiple runners. The default settings are currently set at 1 concurrent request per model and only 1 loaded model at a time, but these can be adjusted by setting OLLAMA_NUM_PARALLEL and OLLAMA_MAX_LOADED_MODELS.	2 years ago
Cheng	62be2050dd	chore: use errors.New to replace fmt.Errorf will much better (#3789)	2 years ago
Eli Bendersky	ad90b9ab3d	api: start adding documentation to package api (#2878) * api: start adding documentation to package api Updates #2840 * Fix lint typo report	2 years ago
Michael Yang	01114b4526	fix: rope	2 years ago
Michael Yang	9502e5661f	cgo quantize	2 years ago
Michael Yang	e1c9a2a00f	no blob create if already exists	2 years ago
Michael Yang	be517e491c	no rope parameters	2 years ago
Patrick Devine	1b272d5bcd	change `github.com/jmorganca/ollama` to `github.com/ollama/ollama` (#3347)	2 years ago
Patrick Devine	47cfe58af5	Default Keep Alive environment variable (#3094) --------- Co-authored-by: Chris-AS1 <8493773+Chris-AS1@users.noreply.github.com>	2 years ago
Jeffrey Morgan	3b4bab3dc5	Fix embeddings load model behavior (#2848)	2 years ago
Ikko Eltociear Ashimine	e95b896790	Update types.go (#2744) specfied -> specified	2 years ago
Michael Yang	897b213468	use http.DefaultClient (#2530) default client already handles proxy	2 years ago
bnorick	caf2b13c10	Fix infinite keep_alive (#2480)	2 years ago
Patrick Devine	b5cf31b460	add keep_alive to generate/chat/embedding api endpoints (#2146)	2 years ago
Patrick Devine	7c40a67841	Save and load sessions (#2063)	2 years ago
Michael Yang	745b5934fa	add model to ModelResponse	2 years ago
Michael Yang	a38d88d828	api: add model for all requests prefer using req.Model and fallback to req.Name	2 years ago
Michael Yang	5ffbbea1d7	remove client.py	2 years ago
Patrick Devine	22e93efa41	add show info command and fix the modelfile	2 years ago
Brian Murray	0d6e3565ae	Add embeddings to API (#1773)	2 years ago
Jeffrey Morgan	55978c1dc9	clean up cache api option	2 years ago
Jeffrey Morgan	d4ebdadbe7	enable `cache_prompt` by default	2 years ago
K0IN	10da41d677	Add Cache flag to api (#1642)	2 years ago
Bruce MacDonald	d99fa6ce0a	send empty messages on last chat response (#1530)	2 years ago
Patrick Devine	d9e60f634b	add image support to the chat api (#1490)	2 years ago
Patrick Devine	910e9401d0	Multimodal support (#1216) --------- Co-authored-by: Matt Apperson <mattapperson@Matts-MacBook-Pro.local>	2 years ago
Jeffrey Morgan	9e1406e4ed	Don't expose model information in `/api/generate`	2 years ago
Michael Yang	5d75505ebd	return model configuration in generate	2 years ago
Bruce MacDonald	195e3d9dbd	chat api endpoint (#1392)	2 years ago
Michael Yang	0db4706ec2	api: add version api handler	2 years ago
Jeffrey Morgan	00d06619a1	Revert "chat api (#991)" while context variable is fixed This reverts commit `7a0899d62d`.	2 years ago
Bruce MacDonald	7a0899d62d	chat api (#991) - update chat docs - add messages chat endpoint - remove deprecated context and template generate parameters from docs - context and template are still supported for the time being and will continue to work as expected - add partial response to chat history	2 years ago
Patrick Devine	cde31cb220	Allow setting parameters in the REPL (#1294)	2 years ago
Bruce MacDonald	928950fcc6	update python client create example (#1227) * add remote create to python example client	2 years ago
Michael Yang	bc22d5a38b	no blob response	2 years ago
Michael Yang	1901044b07	use checksum reference	2 years ago
Michael Yang	1552cee59f	client create modelfile	2 years ago
Michael Yang	3ca56b5ada	add create modelfile field	2 years ago
Jeffrey Morgan	cdddd3df65	add `format` to example python client	2 years ago
Jeffrey Morgan	5cba29b9d6	JSON mode: add `"format"` as an api parameter (#1051) * add `"format": "json"` as an API parameter --------- Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com>	2 years ago
Bruce MacDonald	a49d6acc1e	add a complete /generate options example (#1035)	2 years ago
Bruce MacDonald	ec2a31e9b3	support raw generation requests (#952) - add the optional `raw` generate request parameter to bypass prompt formatting and response context - add raw request to docs	2 years ago
Jeffrey Morgan	17678b7225	Restore system prompt on requests and default `num_keep` to `0`	2 years ago
Jeffrey Morgan	06589a3b30	Set `NumKeep` to `4` by default (#982)	2 years ago
Michael Yang	6db3691b8f	update default NumKeep	2 years ago
Michael Yang	60bb3c03a1	use http.Method	2 years ago
Bruce MacDonald	5c3491f425	allow for a configurable ollama model storage directory (#897) * allow for a configurable ollama models directory - set OLLAMA_MODELS in the environment that ollama is running in to change where model files are stored - update docs Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com> Co-Authored-By: Jay Nakrani <dhananjaynakrani@gmail.com> Co-Authored-By: Akhil Acharya <akhilcacharya@gmail.com> Co-Authored-By: Sasha Devol <sasha.devol@protonmail.com>	2 years ago
Michael Yang	28c3f288e2	client: fix trailing slash	2 years ago
Michael Yang	459f4a7889	fix: ollama host for hostname	2 years ago


145 Commits (4e1ff6dcbbc61b0cd266885d50faaaf8c7f667de)