4461 Commits (a24f90604f883df01500b62992fb80b242022510)
 

Author SHA1 Message Date
Daniel Hiltgen a24f90604f
int: adjust a few models for integration tests (#11872) 8 months ago
Daniel Hiltgen dc5a645434
cuda: leverage JIT for smaller footprint (#11635) 8 months ago
youzichuan bb71654ebe chore: fix some inconsistent function name in comment 8 months ago
Jesse Gross a343ae53a4 ggml: Use ordinal IDs for AMD GPUs on Linux when UUID is unavailable 8 months ago
Michael Yang d0cf6c8281
fix(openai): handle reasoning_effort (#11868) 8 months ago
Jesse Gross 8f4ec9ab28 discover: CPU supports flash attention 8 months ago
Devon Rifkin dbfd7bd027
Merge pull request #11861 from ollama/drifkin/fix-parsing-error 8 months ago
Devon Rifkin ee04dbba51 server: fix error when parsing bad harmony tool calls 8 months ago
Daniel Andersen ea7657b54a
sched: Add support for grouping GPUs (#10678) 8 months ago
Michael Vorburger 2c776f0780
CONTRIBUTING: Explicitly note docs:... as a good example (#11755) 8 months ago
Jesse Gross 79f6376f5b ggml: No-alloc mode 8 months ago
Jesse Gross 756c78cfc7 ggml: Support closing backends 12 months ago
Jesse Gross d7f4f788d1 ggml: Use GGML's typedef'ed pointer types 8 months ago
Daniel Hiltgen 114c3f2265
tests: add integration coverage for oss-gpt (#11696) 8 months ago
Jesse Gross f2e9c9aff5 server: Reduce gpt-oss context length for small VRAM GPUs 8 months ago
Devon Rifkin aa9d889522
Merge pull request #11765 from ollama/drifkin/thinking-without-content 8 months ago
Devon Rifkin 735c41f9ca openai: always provide reasoning 8 months ago
Devon Rifkin 223a619468
Merge pull request #11761 from ollama/drifkin/openai-tool-names 8 months ago
Devon Rifkin 759dd78dd6 openai: when converting role=tool messages, propagate the tool name 8 months ago
Patrick Devine 44bc36d063
docs: update the faq (#11760) 8 months ago
Devon Rifkin 8f14e1f5f6
Merge pull request #11759 from ollama/drifkin/oai-tool-calling 8 months ago
Devon Rifkin 203c137810 openai: allow for content _and_ tool calls in the same message 8 months ago
Daniel Hiltgen fa8be9e35c
clean up debugging (#11756) 8 months ago
Gao feng 8a75e9ee15
Update downloading to pulling in api.md (#11170) 8 months ago
Parth Sareen 4742e12c23
docs: update turbo model name (#11707) 8 months ago
Devon Rifkin 2d06977ade
Merge pull request #11705 from ollama/drifkin/fn-schema 8 months ago
Devon Rifkin 30f8a68c4c tools: support anyOf types 8 months ago
Daniel Hiltgen e378e33421
win: static link msvc libs (#11612) 8 months ago
Michael Yang fcec04bf42
gptoss: fix memory calc (#11700) 8 months ago
Jeffrey Morgan ee92ca3e1d
docs: add docs for Ollama Turbo (#11687) 8 months ago
Jesse Gross 8253ad4d2b ggml: Prevent kv cache quanitization on gpt-oss 8 months ago
Michael Yang fa7776fd24
gpt-oss (#11672) 8 months ago
Jesse Gross 0d38b66502 kvcache: Log contents of cache when unable to find a slot 8 months ago
Jesse Gross 4183bb0574 kvcache: Enable SWA to retain additional entries 8 months ago
Sajal Kulshreshtha ff89ba90bc
fixing broken AMD driver link (#11579) 8 months ago
Daniel Hiltgen 6dcc5dfb9c
Revert "CI: switch back to x86 macos builder" (#11588) 8 months ago
Daniel Hiltgen 25911a6e6b
mac: disable bf16 on unsupported OS versions (#11585) 8 months ago
Daniel Hiltgen 8afa6e83f2
CI: switch back to x86 macos builder (#11572) 8 months ago
Oliver Simons ea85e27bbd
Increase performance for Gemma3n models on NVGPUs by enabling CUDA Graph execution (#11525) 8 months ago
Jesse Gross c116a7523d kvcache: Don't shift empty batches 8 months ago
Yoshi 3515cc377c
docs: fix typos and remove trailing whitespaces (#11554) 8 months ago
Mayan EDMS bbf66c0b96
readme: add Mayan EDMS to community integrations (#11543) 8 months ago
Jesse Gross 764be7480f kvcache: Group shift operations into batches 8 months ago
Ruyut b72e5adb14
CONTRIBUTING: fix typo in commit message example (#11528) 8 months ago
Patrick Devine 80b538e312
cli: catch upstream errors gracefully (#11512) 8 months ago
Jeffrey Morgan 4f8a0166cc
tools: loosen tool argument parsing (#11509) 8 months ago
minxinyi 1e6eab5c33
server: use slices.Equal to simplify code (#11502) 8 months ago
Michael Yang 6c733bf0a6
s#x/exp/maps#maps# (#11506) 8 months ago
Patrick Devine 3bac5cba60
Fix GetModelInfo (#11496) 8 months ago
ycomiti 4151ef8cf7
Update linux.md (#11462) 8 months ago