4249 Commits (6e9a7a256856bf1119c992cf7da39c05276f386c)
 

Author SHA1 Message Date
Michael Yang 6e9a7a2568 lint: enable usetesting, disable tenv (#10594) 11 months ago
Michael Yang b585a58121 chore: remove unused ZipReader type (#10621) 11 months ago
Jeffrey Morgan fa9973cd7f api: remove unused sampling parameters (#10581) 11 months ago
Jesse Gross 3d9498a425 ollamarunner: Use correct constant to remove cache entries 11 months ago
Daniel Hiltgen 3098c8b29b CI: trigger downstream release process (#10508) 11 months ago
Daniel Hiltgen 5e380c3b42 sched: fix race leading to orphaned runners (#10599) 11 months ago
Jeffrey Morgan 392de84031 api: remove unused RetrieveModelResponse type (#10603) 11 months ago
Daniel Hiltgen af31ccefc0 fix data race in WriteGGUF (#10598) 11 months ago
Daniel Hiltgen fa393554b9 remove cuda v11 (#10569) 11 months ago
Aharon Bensadoun 307e3b3e1d readme: add Flufy to community integrations (#9719) 11 months ago
Devon Rifkin 4090aca97b server: send 405 instead of 404 for unallowed methods (#10275) 11 months ago
Michael Yang 92ce438de0 server: remove internal cmd (#10595) 11 months ago
Daniel Hiltgen 424810450f Move quantization to new backend (#10363) 11 months ago
Michael Yang 95e744beeb discover: fix compiler warnings (#10572) 11 months ago
Jeffrey Morgan 3b2d2c8326 api: remove unused or unsupported api options (#10574) 11 months ago
Michael Yang d931ee8f22 create blobs in parallel (#10135) 11 months ago
Jesse Gross 7073600797 ggml: Reduce log level of "key not found" 11 months ago
Daniel Hiltgen b1c40138da win: lint fix (#10571) 11 months ago
Ashok Gelal 17466217e5 Hide empty terminal window (#8668) 11 months ago
Jeffrey Morgan 1703d1472e server: fix panic when runner.Options is nil (#10566) 11 months ago
Jeffrey Morgan 913905028b all: fix cgo compiler warnings on windows (#10563) 11 months ago
湛露先生 7e5c8eee5c file close check and close. (#10554) 11 months ago
Daniel Hiltgen 6a74bba7e7 win: ensure ollama paths come first (#10549) 11 months ago
Daniel Hiltgen 76ea735aaf sched: logging improvements (#10550) 11 months ago
aritra saha dd1d4e99e7 readme: add llama 4 models (#10530) 11 months ago
Jesse Gross a6ef73f4f2 ggml: Fix race that resulted in "context canceled" when loading 11 months ago
Jesse Gross c2f5d6662b ollamarunner: Re-enable worst case graph preallocation. 11 months ago
Harsh Nevse 57fb759f3c readme: update link to langchain in community integrations (#10465) 11 months ago
Jeffrey Morgan 8dd12c873d llama: update to commit e1e8e099 (#10513) 11 months ago
frob e6d2d04121 image: add vision capability for projector-based models (#10509) 11 months ago
Jesse Gross 074bac8447 kvcache: Log batch size if we can't find a slot 11 months ago
Jesse Gross 8e8f2c6d67 ollamarunner: Fix memory leak when processing images 11 months ago
AliAhmedNada 938e8447e8 readme: add Jirapt project to community integrations (#10522) 11 months ago
aritra saha d5d5f0c445 readme: change granite3.2 to granite3.3 (#10525) 11 months ago
Michael Yang a7835c6716 fix: write gguf padding (#10510) 11 months ago
Devon Rifkin ad3c7c9bda strip out thinking tags in message history for qwen3 & r1 (#10490) 11 months ago
Daniel Hiltgen 415c8fcc3d Fix "Stopping..." scheduler hang (#10487) 11 months ago
Daniel Hiltgen 718eda1b3e Narrow set of paths we load GGML from (#10485) 11 months ago
Shahin R 421b7edeb4 readme: add link to lumina, a lightweight React frontend client (#10378) 11 months ago
batuhankadioglu 7b68e254c2 all: update several golang.org/x packages (#10436) 11 months ago
Daniel Hiltgen 7bec2724a5 integration: fix embedding tests error handling (#10478) 11 months ago
Jesse Gross a27462b708 ollamarunner: Temporarily disable worst case graph preallocation 11 months ago
crStiv 6bf0b8193a readme: fix typos (#10399) 11 months ago
Devon Rifkin db428adbb8 Merge pull request #10468 from ollama/drifkin/num-parallel-1 11 months ago
Devon Rifkin fe5b9bb21b lower default num parallel to 2 11 months ago
Devon Rifkin 6ec71d8fb6 Merge pull request #10452 from ollama/drifkin/4096-context-length 11 months ago
Devon Rifkin 44b466eeb2 config: update default context length to 4096 11 months ago
Devon Rifkin a25f3f8260 Merge pull request #10451 from ollama/revert-10364-drifkin/context-length 11 months ago
Devon Rifkin dd93e1af85 Revert "increase default context length to 4096 (#10364)" 11 months ago
Michael Yang 5cfc1c39f3 model: fix build (#10416) 11 months ago