4433 Commits (30f8a68c4cc55e0f3a717b891931847c97190843)
 

Author SHA1 Message Date
Daniel Hiltgen 2c4ce40334
mimic logs for layers on new engine (#11278) 9 months ago
XuKecheng 5d8c173529
readme: add NativeMind to community integrations (#11242) 9 months ago
Jeffrey Morgan 44b17d2bfa
tools: fix parsing tool calls with empty arguments, missing required fields (#11233) 9 months ago
Attogram Project 3b8b692218
readme: add ollama-bash-toolshed to community integrations (#11224) 9 months ago
Michael Yang 4129af9205
chore: cleanup comments + unused vars (#11225) 9 months ago
Jesse Gross 45f216a9c7 ggml: Temporarily disable reporting UUIDs 9 months ago
Michael Yang d0b32def60
skip quantizing per_layer_token_embd (#11207) 9 months ago
Daniel Hiltgen 11ffc36157
ci: multi-stage release process (#11001) 9 months ago
Jeffrey Morgan ba04902670
fs/ggml: add multiplier in graph estimates (#11208) 9 months ago
Jeffrey Morgan 3944602f51
fs/ggml: add missing architecture to OllamaEngineRequired() (#11206) 9 months ago
Michael Yang 73b642e6f3
add new gemma model (#11204) 9 months ago
Daniel Hiltgen ad118d8b13
ci: arm sbsa fixes (#11194) 9 months ago
Daniel Hiltgen f08534137b ci: include dependencies 9 months ago
Daniel Hiltgen 4b4a90f233
ci: pick up arm sbsa cuda libs (#11192) 9 months ago
Daniel Hiltgen 03274a6b2f
ci: recombine linux amd64 binaries (#11188) 9 months ago
Devon Rifkin cc6463ebca
Merge pull request #10238 from ollama/drifkin/array-head-count-simple 9 months ago
Daniel Hiltgen 405d2f628f
ci: rocm parallel builds on windows (#11187) 9 months ago
Devon Rifkin a3f7dd3e98 Merge branch 'main' into drifkin/array-head-count-simple 9 months ago
Daniel Hiltgen c85c0ebf89
CI: switch windows to vs 2022 (#11184) 9 months ago
Daniel Hiltgen 10a8e04a8d
avoid context overflow (#11175) 9 months ago
Daniel Hiltgen 1c6669e64c
Re-remove cuda v11 (#10694) 9 months ago
Devon Rifkin b2b270ad5d Merge branch 'main' into drifkin/array-head-count-simple 9 months ago
AJ 2bb69b40c7
readme: add ai-hub to community integrations (#11169) 9 months ago
Daniel Hiltgen 65bff664cb
build speedups (#11142) 9 months ago
Michael Yang c088ac0e79
convert: utility for merging tensors (#11069) 9 months ago
Michael Yang 0a066cfd91
Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119) 9 months ago
Jesse Gross 87b7af6cee ggml: Check return status for computation. 9 months ago
Daniel Hiltgen f2527b08fb
int: add coverage for older models (#11137) 9 months ago
Jeffrey Morgan 8bcb3125c1
benchmark: remove unused benchmark test (#11120) 10 months ago
Jeffrey Morgan 6baf1e31e2
Revert "Revert "ggml: Export GPU UUIDs" (#11115)" (#11117) 10 months ago
Jeffrey Morgan ed567ef43b
Revert "ggml: Export GPU UUIDs" (#11115) 10 months ago
Jeffrey Morgan a6e64fbdf2
Revert "feat: incremental gguf parser (#10822)" (#11114) 10 months ago
曹家巧 60cfa2a203
cache: fix comment function name in cache.go (#11110) 10 months ago
Jeffrey Morgan 55bbf3b4a1
tools: return empty arguments object instead of null (#11113) 10 months ago
Jeffrey Morgan 6bda1d2479
tools: fix parsing tool calls without any parameters (#11101) 10 months ago
Jeffrey Morgan 9e125d884c
model: treat 'user defined' tokens as special tokens (#11077) 10 months ago
Michael Yang a6fbfc880c
gguf: fix write order (#11068) 10 months ago
NGC13009 502028968d
readme: add ollama-launcher to community integrations (#11080) 10 months ago
Phil 5a8eb0e151
readme: add GPTranslate to community integrations (#11071) 10 months ago
Jeffrey Morgan 9f8a18ec05
tools: loosen tool parsing to allow for more formats (#11030) 10 months ago
Michael Yang 6b04cad7e8
feat: incremental gguf parser (#10822) 10 months ago
Michael Yang 45f56355d5
feat: uneven splits (#11048) 10 months ago
Michael Yang 0dabb4ef6a
skip tokenizer.model if possible (#11050) 10 months ago
Michael Yang 2e77aa1ae7
use nn.Linear in place of ml.Tensor (#11049) 10 months ago
Attogram Project deaabe292d
readme: add ollama-multirun to community integrations (#11038) 10 months ago
Jeffrey Morgan af21a5ac39
readme: update quickstart link text to Gemma 3 10 months ago
Jeffrey Morgan f63d7f68eb
readme: update quickstart example to Gemma 3 10 months ago
Daniel Hiltgen 82ad1dbc07
mac: handle "keep" named apps (#11031) 10 months ago
Daniel Hiltgen feeabdadd2
spawn desktop quickly (#11011) 10 months ago
Krzysztof Jeziorny fc0309615e
docs: update link to AMD drivers in linux.md (#10973) 10 months ago