3054 Commits (8f8e736b131510c8707bed5886b343906cb74a24)
 

Author SHA1 Message Date
Jeffrey Morgan 8f8e736b13 update llama.cpp submodule to `d7fd29f` (#5475) 2 years ago
Jeffrey Morgan d89454de80 Use slot with cached prompt instead of least recently used (#5492) 2 years ago
Daniel Hiltgen af28b94533 Merge pull request #5469 from dhiltgen/prevent_system_oom 2 years ago
Jeffrey Morgan e9188e971a Fix assert on small embedding inputs (#5491) 2 years ago
Daniel Hiltgen 78eddfc068 Merge pull request #4412 from dhiltgen/win_docs 2 years ago
Daniel Hiltgen 02c24d3d01 Merge pull request #5466 from dhiltgen/fix_clip_unicode 2 years ago
Daniel Hiltgen 52abc8acb7 Document older win10 terminal problems 2 years ago
Jeffrey Morgan 4d71c559b2 fix error detection by limiting model loading error parsing (#5472) 2 years ago
Anatoli Babenia 0d16eb310e fix: use `envconfig.ModelsDir` directly (#4821) 2 years ago
Daniel Hiltgen 8072e205ff Merge pull request #5447 from dhiltgen/fix_keepalive 2 years ago
Daniel Hiltgen 955f2a4e03 Only set default keep_alive on initial model load 2 years ago
Daniel Hiltgen 3c75113e37 Prevent loading models larger than total memory 2 years ago
Daniel Hiltgen ccd7785859 Merge pull request #5243 from dhiltgen/modelfile_use_mmap 2 years ago
royjhan 3b5a4a77f3 Return Correct Prompt Eval Count Regardless of Cache Prompt (#5371) 2 years ago
Daniel Hiltgen daed0634a9 Merge pull request #5467 from dhiltgen/bogus_cpu_mac_error 2 years ago
Daniel Hiltgen 0d4dd707bc Merge pull request #5465 from dhiltgen/better_cuda_logging 2 years ago
Daniel Hiltgen 0e982bc1f4 Fix corner cases on tmp cleaner on mac 2 years ago
Daniel Hiltgen 6298f49816 Fix clip model loading with unicode paths 2 years ago
Daniel Hiltgen ef757da2c9 Better nvidia GPU discovery logging 2 years ago
Michael Yang e5352297d9 Merge pull request #5448 from ollama/mxyng/fix-generate 2 years ago
Michael Yang 65a5040e09 fix generate template 2 years ago
royjhan d626b99b54 OpenAI: v1/completions compatibility (#5209) 2 years ago
Michael Yang dddb58a38b Merge pull request #5051 from ollama/mxyng/capabilities 2 years ago
Michael Yang 400056e154 Merge pull request #5420 from ollama/mxyng/insecure-path 2 years ago
Daniel Hiltgen d2f19024d0 Merge pull request #5442 from dhiltgen/concurrency_docs 2 years ago
Daniel Hiltgen 69c04eecc4 Add windows radeon concurrency note 2 years ago
royjhan 996bb1b85e OpenAI: /v1/models and /v1/models/{model} compatibility (#5007) 2 years ago
Daniel Hiltgen 422dcc3856 Merge pull request #5439 from dhiltgen/fix_centos_7_build 2 years ago
Daniel Hiltgen 020bd60ab2 Switch amd container image base to rocky 8 2 years ago
Daniel Hiltgen 8e277b72bb Merge pull request #5438 from dhiltgen/fix_centos_7_build 2 years ago
Daniel Hiltgen 4f67b39d26 Centos 7 EOL broke mirrors 2 years ago
Josh 2425281317 Merge pull request #5336 from ollama/jyan/from-errors 2 years ago
Josh 0403e9860e Merge pull request #5421 from ollama/jyan/ver 2 years ago
Josh Yan 33a65e3ba3 error 2 years ago
Michael Yang 88bcd79bb9 err on insecure path 2 years ago
Josh Yan 7e571f95f0 trimspace test case 2 years ago
Michael Yang da8e2a0447 use kvs to detect embedding models 2 years ago
Michael Yang a30915bde1 add capabilities 2 years ago
Michael Yang 58e3fff311 rename templates to template 2 years ago
Michael Yang 3f0b309ad4 remove ManifestV2 2 years ago
Daniel Hiltgen e70610ef06 Merge pull request #5410 from dhiltgen/ctx_cleanup 2 years ago
Daniel Hiltgen dfded7e075 Merge pull request #5364 from dhiltgen/concurrency_docs 2 years ago
Daniel Hiltgen 173b550438 Remove default auto from help message 2 years ago
Daniel Hiltgen cff3f44f4a Fix case for NumCtx 2 years ago
Josh Yan 26e4e66faf updated parsefile test 2 years ago
Daniel Hiltgen 97c9e11768 Switch use_mmap to a pointer type 2 years ago
Daniel Hiltgen 3518aaef33 Merge pull request #4218 from dhiltgen/auto_parallel 2 years ago
RAPID ARCHITECT 1963c00201 Update README.md (#5214) 2 years ago
Eduard 27402cb7a2 Update gpu.md (#5382) 2 years ago
Jeffrey Morgan c1218199cf Update api.md 2 years ago