4366 Commits (a3f7dd3e98df803695f1ae165bc61a1b52142449)
 

Author SHA1 Message Date
Devon Rifkin a3f7dd3e98 Merge branch 'main' into drifkin/array-head-count-simple 9 months ago
Daniel Hiltgen c85c0ebf89 CI: switch windows to vs 2022 (#11184) 9 months ago
Daniel Hiltgen 10a8e04a8d avoid context overflow (#11175) 9 months ago
Daniel Hiltgen 1c6669e64c Re-remove cuda v11 (#10694) 9 months ago
Devon Rifkin b2b270ad5d Merge branch 'main' into drifkin/array-head-count-simple 9 months ago
AJ 2bb69b40c7 readme: add ai-hub to community integrations (#11169) 9 months ago
Daniel Hiltgen 65bff664cb build speedups (#11142) 9 months ago
Michael Yang c088ac0e79 convert: utility for merging tensors (#11069) 9 months ago
Michael Yang 0a066cfd91 Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119) 9 months ago
Jesse Gross 87b7af6cee ggml: Check return status for computation. 9 months ago
Daniel Hiltgen f2527b08fb int: add coverage for older models (#11137) 9 months ago
Jeffrey Morgan 8bcb3125c1 benchmark: remove unused benchmark test (#11120) 10 months ago
Jeffrey Morgan 6baf1e31e2 Revert "Revert "ggml: Export GPU UUIDs" (#11115)" (#11117) 10 months ago
Jeffrey Morgan ed567ef43b Revert "ggml: Export GPU UUIDs" (#11115) 10 months ago
Jeffrey Morgan a6e64fbdf2 Revert "feat: incremental gguf parser (#10822)" (#11114) 10 months ago
曹家巧 60cfa2a203 cache: fix comment function name in cache.go (#11110) 10 months ago
Jeffrey Morgan 55bbf3b4a1 tools: return empty arguments object instead of null (#11113) 10 months ago
Jeffrey Morgan 6bda1d2479 tools: fix parsing tool calls without any parameters (#11101) 10 months ago
Jeffrey Morgan 9e125d884c model: treat 'user defined' tokens as special tokens (#11077) 10 months ago
Michael Yang a6fbfc880c gguf: fix write order (#11068) 10 months ago
NGC13009 502028968d readme: add ollama-launcher to community integrations (#11080) 10 months ago
Phil 5a8eb0e151 readme: add GPTranslate to community integrations (#11071) 10 months ago
Jeffrey Morgan 9f8a18ec05 tools: loosen tool parsing to allow for more formats (#11030) 10 months ago
Michael Yang 6b04cad7e8 feat: incremental gguf parser (#10822) 10 months ago
Michael Yang 45f56355d5 feat: uneven splits (#11048) 10 months ago
Michael Yang 0dabb4ef6a skip tokenizer.model if possible (#11050) 10 months ago
Michael Yang 2e77aa1ae7 use nn.Linear in place of ml.Tensor (#11049) 10 months ago
Attogram Project deaabe292d readme: add ollama-multirun to community integrations (#11038) 10 months ago
Jeffrey Morgan af21a5ac39 readme: update quickstart link text to Gemma 3 10 months ago
Jeffrey Morgan f63d7f68eb readme: update quickstart example to Gemma 3 10 months ago
Daniel Hiltgen 82ad1dbc07 mac: handle "keep" named apps (#11031) 10 months ago
Daniel Hiltgen feeabdadd2 spawn desktop quickly (#11011) 10 months ago
Krzysztof Jeziorny fc0309615e docs: update link to AMD drivers in linux.md (#10973) 10 months ago
Jeffrey Morgan 09d308d6b6 Revert "server: add model capabilities to the list endpoint (#10174)" (#11004) 10 months ago
Daniel Hiltgen a8ed68bd93 launch app hidden (#10962) 10 months ago
Daniel Hiltgen 2ae65ae471 win: handle more than 2048 processes (#10997) 10 months ago
Devon Rifkin a3b6886b7d move thinking logic into its own package (#10990) 10 months ago
Hunter Wittenborn c6a6d7294d docs: fix typo in development.md (#10998) 10 months ago
Devon Rifkin 2cf007c9d1 Merge pull request #10987 from ollama/drifkin/export-thinking-parser 10 months ago
Devon Rifkin 0683efa637 export ThinkingParser 10 months ago
JasonHonKL 0943001193 server: add model capabilities to the list endpoint (#10174) 10 months ago
HardCodeDev 5c42800fca readme: add SimpleOllamaUnity to community integrations (#10817) 10 months ago
Parth Sareen 65f10c2823 tools: resiliency upgrade to name and arg extraction from template (#10917) 10 months ago
Jesse Gross aaa7818000 ggml: Export GPU UUIDs 11 months ago
Jesse Gross f15ffc4320 llm: Make "POST predict" error message more informative 11 months ago
Devon Rifkin 5f57b0ef42 add thinking support to the api and cli (#10584) 10 months ago
Patrick Devine aa25aff10d client: add request signing to the client (#10881) 10 months ago
Jesse Gross ea79003180 kvcache: Skip computing causal mask for worst case graph reservation 10 months ago
Kyle Steere 9239a254e0 server: abort download on empty digest 10 months ago
Parth Sareen 066d0f4746 tools: relax JSON parse constraints for tool calling (#10872) 10 months ago