3299 Commits (f84cc9939c85319d0e6b17543bdb74b04d0df2c7)
 

Author SHA1 Message Date
Bruce MacDonald f84cc9939c allow ollama.com to call inference and info endpoints 2 years ago
royjhan 5b3a21b578
add metrics to docs (#6079) 2 years ago
Kyle Kelley ad0c19dde4
Use llama3.1 in tools example (#5985) 2 years ago
Jesse Gross 69eb06c40e
Merge pull request #6145 from ollama/jessegross/bug5840 2 years ago
Jesse Gross 1829fb61bd manifest: Fix crash on startup when trying to clean up unused files (#5840) 2 years ago
Jesse Gross 685a53534b manifest: Don't prune layers if we can't open a manifest file 2 years ago
Jeffrey Morgan de4fc29773
llm: reserve required number of slots for embeddings (#6219) 2 years ago
Jeffrey Morgan e04c7012c2
update llama.cpp submodule to `1e6f6554` (#6208) 2 years ago
Chua Chee Seng d4a7216c82
Fixed invalid option provided not displaying the invalid option name problem. (#6202) 2 years ago
Daniel Hiltgen a4fdd03c3b
Merge pull request #6207 from dhiltgen/sparse_win 2 years ago
Daniel Hiltgen fc85f50a2b Ensure sparse files on windows during download 2 years ago
royjhan 86b907f82a
sort batch results (#6189) 2 years ago
Michael Yang 10d49bce70
Merge pull request #6190 from ollama/mxyng/fix-integration 2 years ago
Michael Yang 7ed367419e fix concurrency test 2 years ago
Daniel Hiltgen 50ee8b5f56
Merge pull request #6186 from dhiltgen/numa 2 years ago
Michael Yang 03bdac0595
Merge pull request #6146 from ollama/mxyng/testing 2 years ago
Daniel Hiltgen f457d63400 Implement linux NUMA detection 2 years ago
Michael Yang 39f2bc6bfc
Merge pull request #6167 from ollama/mxyng/line-feed 2 years ago
frob b73b0940ef
Disable paging for journalctl (#6154) 2 years ago
Michael Yang 6a07344786 line feed 2 years ago
sryu1 8b920f35a4
Add Gemma 2 2b (#6151) 2 years ago
Ivan Charapanau 4221e39867
Reference ollama integration with Harbor (#6147) 2 years ago
Michael Yang a091fadfda use testing tempdirs 2 years ago
Michael Yang 77ccbf04dc
Merge pull request #6128 from ollama/mxyng/lint 2 years ago
royjhan 4addf6b587
Update OpenAI Compatibility Docs with /v1/completions (#5311) 2 years ago
royjhan 85c7f11170
Update docs (#5310) 2 years ago
Michael Yang b732beba6a lint 2 years ago
Kim Hallberg ce1fb4447e
Fix models/{model} URL (#6132) 2 years ago
royjhan 558a54b098
Update OpenAI Compatibility Docs with /v1/embeddings (#5470) 2 years ago
royjhan ed52833bb1
Add to docs (#5309) 2 years ago
royjhan 6f133a0bdd
OpenAI: Add Usage to `v1/embeddings` (#5886) 2 years ago
royjhan f561eecfb8
Update OpenAI Compatibility Docs with /v1/models (#5151) 2 years ago
Michael Yang ff7c9060ec
Merge pull request #6115 from slouffka/fix-context 2 years ago
Michael Yang 0ff42e84b0
Merge pull request #4756 from ollama/mxyng/convert2 2 years ago
Vyacheslav Moskalev 8a9f946ca7 Refactor and format code. 2 years ago
Vyacheslav Moskalev 3b5210548e Refactor code. Remove extra variable. 2 years ago
Vyacheslav Moskalev b0c216584c Better types and naming closer to style. 2 years ago
Vyacheslav Moskalev 49a5483139 Change the order of context and prompt. 2 years ago
Vyacheslav Moskalev 6bc5c13758 Fix extra context concatenation in generate handler (#5980). 2 years ago
Michael Yang 3e614260af
Merge pull request #6109 from ollama/mxyng/fix-modelfile 2 years ago
Michael Yang d87b4a488e fix modelfile message quotes 2 years ago
Michael Yang 4c14855ad7
Merge pull request #6106 from ollama/mxyng/default-sliding-window-attention 2 years ago
Blake Mizerany dc77bbcfa4
server: fix json marshalling of downloadBlobPart (#6108) 2 years ago
Michael Yang d8e2664c33 convert: fix parse functions 2 years ago
Michael Yang eafc607abb convert: only extract large files 2 years ago
Michael Yang 781fc2d576 Update convert/reader_safetensors.go 2 years ago
Michael Yang df993fa37b comments 2 years ago
Michael Yang 5e9db9fb0b refactor convert 2 years ago
Michael Yang 0f3271db88 patches: phi3 default sliding window attention 2 years ago
Michael Yang 6b252918fb update convert test to check result data 2 years ago