4366 Commits (a3f7dd3e98df803695f1ae165bc61a1b52142449)
 

Author SHA1 Message Date
Devon Rifkin ad3c7c9bda strip out thinking tags in message history for qwen3 & r1 (#10490) 11 months ago
Daniel Hiltgen 415c8fcc3d Fix "Stopping..." scheduler hang (#10487) 11 months ago
Daniel Hiltgen 718eda1b3e Narrow set of paths we load GGML from (#10485) 11 months ago
Shahin R 421b7edeb4 readme: add link to lumina, a lightweight React frontend client (#10378) 11 months ago
batuhankadioglu 7b68e254c2 all: update several golang.org/x packages (#10436) 11 months ago
Daniel Hiltgen 7bec2724a5 integration: fix embedding tests error handling (#10478) 11 months ago
Jesse Gross a27462b708 ollamarunner: Temporarily disable worst case graph preallocation 11 months ago
crStiv 6bf0b8193a readme: fix typos (#10399) 11 months ago
Devon Rifkin db428adbb8 Merge pull request #10468 from ollama/drifkin/num-parallel-1 11 months ago
Devon Rifkin fe5b9bb21b lower default num parallel to 2 11 months ago
Devon Rifkin 6ec71d8fb6 Merge pull request #10452 from ollama/drifkin/4096-context-length 11 months ago
Devon Rifkin 44b466eeb2 config: update default context length to 4096 11 months ago
Devon Rifkin a25f3f8260 Merge pull request #10451 from ollama/revert-10364-drifkin/context-length 11 months ago
Devon Rifkin dd93e1af85 Revert "increase default context length to 4096 (#10364)" 11 months ago
Devon Rifkin d2ee599dcf load arrays with up to 1024 elements when estimating 11 months ago
Devon Rifkin 6ed8898590 ggml: fix crash for array head counts 11 months ago
Michael Yang 5cfc1c39f3 model: fix build (#10416) 11 months ago
Michael Yang f0ad49ea17 memory 11 months ago
Michael Yang 7ba9fa9c7d fixes for maverick 11 months ago
Michael Yang 8bf11b84c1 chunked attention 12 months ago
Michael Yang 470af8ab89 connect vision to text 12 months ago
Michael Yang 178761aef3 image processing 12 months ago
Michael Yang f0c66e6dea llama4 1 year ago
Michael Yang 54055a6dae fix test 11 months ago
Michael Yang 340448d2d1 explicitly decode maxarraysize 1024 11 months ago
Michael Yang ced7d0e53d fix parameter count 11 months ago
Michael Yang a0dba0f8ae default slice values 11 months ago
Michael Yang 5e20b170a7 update comment 11 months ago
Michael Yang d26c18e25c fix token type 11 months ago
Michael Yang 8d376acc9b zero means zero 11 months ago
Michael Yang dc1e81f027 convert: use -1 for read all 11 months ago
Michael Yang 5d0279164c generic ggml.array 11 months ago
Michael Yang 214a7678ea fix superfluous call to WriteHeader 11 months ago
Michael Yang 4892872c18 convert: change to colmajor 11 months ago
Michael Yang 0b9198bf47 ci: silence deprecated gpu targets warning 1 year ago
Jeffrey Morgan e9e5f61c45 llama: update to commit 2016f07b (#10352) 11 months ago
Parth Sareen 11dde41824 server: improve spacing for JSON grammar (#10131) 11 months ago
Parth Sareen a53d744b01 llama: remove model loading for grammar (#10096) 11 months ago
Adrien Duermael 40b10eee6d api: fix ImageData struct comment to expect raw image bytes (#10386) 11 months ago
Devon Rifkin 424f648632 increase default context length to 4096 (#10364) 11 months ago
Richard Shiue 2eb1fb3231 readme: add AppFlowy to community integrations (#10335) 11 months ago
greengrass821 0806521642 cmd: add support for escaping ~ in filepath (#10339) 11 months ago
Michael Yang 88738b357b create tempdir in models directory 12 months ago
Blake Mizerany 4e535e6188 server/internal/registry: make pull send errors with Error field (#10326) 12 months ago
Michael Yang 40b8fdbdca arange 1 year ago
Blake Mizerany 1d99451ad7 server/internal/client/ollama: handle some network errors gracefully (#10317) 12 months ago
Jeffrey Morgan 09bb2e30f6 ml/backend/ggml: use default CUDA compression mode (#10314) 12 months ago
Jeffrey Morgan dc264be6ff ml: add missing cmake property and remove additional CMakeLists.txt (#10310) 12 months ago
Devon Rifkin fbe7039618 Merge pull request #10290 from ollama/drifkin/template-highlighting 12 months ago
Jeffrey Morgan 943464ccb8 llama: update to commit 71e90e88 (#10192) 12 months ago