4338 Commits (0dabb4ef6a1aab240a59b6bb4ef82372d335e3a9)
 

Author SHA1 Message Date
Michael Yang 0dabb4ef6a
skip tokenizer.model if possible (#11050) 10 months ago
Michael Yang 2e77aa1ae7
use nn.Linear in place of ml.Tensor (#11049) 10 months ago
Attogram Project deaabe292d
readme: add ollama-multirun to community integrations (#11038) 10 months ago
Jeffrey Morgan af21a5ac39
readme: update quickstart link text to Gemma 3 10 months ago
Jeffrey Morgan f63d7f68eb
readme: update quickstart example to Gemma 3 10 months ago
Daniel Hiltgen 82ad1dbc07
mac: handle "keep" named apps (#11031) 10 months ago
Daniel Hiltgen feeabdadd2
spawn desktop quickly (#11011) 10 months ago
Krzysztof Jeziorny fc0309615e
docs: update link to AMD drivers in linux.md (#10973) 10 months ago
Jeffrey Morgan 09d308d6b6
Revert "server: add model capabilities to the list endpoint (#10174)" (#11004) 10 months ago
Daniel Hiltgen a8ed68bd93
launch app hidden (#10962) 10 months ago
Daniel Hiltgen 2ae65ae471
win: handle more than 2048 processes (#10997) 10 months ago
Devon Rifkin a3b6886b7d
move thinking logic into its own package (#10990) 10 months ago
Hunter Wittenborn c6a6d7294d
docs: fix typo in development.md (#10998) 10 months ago
Devon Rifkin 2cf007c9d1
Merge pull request #10987 from ollama/drifkin/export-thinking-parser 10 months ago
Devon Rifkin 0683efa637
export ThinkingParser 10 months ago
JasonHonKL 0943001193
server: add model capabilities to the list endpoint (#10174) 10 months ago
HardCodeDev 5c42800fca
readme: add SimpleOllamaUnity to community integrations (#10817) 10 months ago
Parth Sareen 65f10c2823
tools: resiliency upgrade to name and arg extraction from template (#10917) 10 months ago
Jesse Gross aaa7818000
ggml: Export GPU UUIDs 11 months ago
Jesse Gross f15ffc4320
llm: Make "POST predict" error message more informative 11 months ago
Devon Rifkin 5f57b0ef42
add thinking support to the api and cli (#10584) 10 months ago
Patrick Devine aa25aff10d
client: add request signing to the client (#10881) 10 months ago
Jesse Gross ea79003180
kvcache: Skip computing causal mask for worst case graph reservation 10 months ago
Kyle Steere 9239a254e0
server: abort download on empty digest 10 months ago
Parth Sareen 066d0f4746
tools: relax JSON parse constraints for tool calling (#10872) 10 months ago
Parth Sareen aea6fb9b58
tools: remove newline stripping (#10869) 10 months ago
RAPID ARCHITECT 012cf65340
readme: add AWS Strands Agents SDK example to community integrations (#10865) 10 months ago
Min Yoo a45231af47
readme: Add macLlama to community integrations (#10790) 10 months ago
Daniel Hiltgen 2307fc2bcd
tests: drop llama3.2-vision embedding tests (#10837) 10 months ago
frob 6623898198
docs: remove unsupported quantizations (#10842) 10 months ago
frob eda472df1b
server: add hint to the error message when model path access fails (#10843) 10 months ago
Jesse Gross f18e0cb550
ml: Improve slog formatting for BackendMemory 10 months ago
Parth Sareen e8b981fa5d
tools: refactor tool call parsing and enable streaming (#10415) 10 months ago
Parth Sareen 884d26093c
llama: add minimum memory for grammar (#10820) 10 months ago
Jesse Gross 1f371ea92f
ml: Panic rather than return error on tensor allocation failure 11 months ago
Jesse Gross 73d6a82cce
ollamarunner: Memory usage reporting 12 months ago
Jesse Gross 6db8a3771c
ggml: Report graph memory for failed allocations 11 months ago
Daniel Hiltgen d950ff12c0
sched: fix runner leak during reloading unload (#10819) 10 months ago
Michael Yang adff143bcd
fix: mllama quality (#10807) 10 months ago
Bruce MacDonald fbe6ae285a
server: improve tensor quantization fallback logic (#10806) 10 months ago
Daniel Hiltgen fdd4d479a3
integration: add qwen2.5-vl (#10815) 10 months ago
Michael Yang 61aeaf7e81
remove support for multiple ggufs in a single file (#10722) 10 months ago
Daniel Hiltgen 7359b02707
win: detect background upgrade in progress (#10785) 10 months ago
Michael Yang c890011322
feat: port qwen2 model (#10782) 10 months ago
Michael Yang e0ed984cde
feat: qwen3 dense and sparse models (#10708) 10 months ago
Michael Yang 139f84cf21
fix cmakelists (#10804) 10 months ago
Michael Yang 375839ea2d
chore: disable debug in binary libraries (#10788) 10 months ago
Michael Yang 69b2fe9282
fix: qwen25vl assign samebatch in multimodal input (#10789) 10 months ago
Michael Yang 9ed8bf14cb
ml: add more rope options (#10775) 10 months ago
DarkCaster e6a800ca11
llama: fix incorrect initialization of C.struct_common_sampler_cparams.penalty_present (#10779) 10 months ago