4433 Commits (30f8a68c4cc55e0f3a717b891931847c97190843)
 

Author SHA1 Message Date
Daniel Hiltgen 27da2cddc5
Fix lingering Q4_0 help reference (#10720) 11 months ago
Bruce MacDonald feb8923ada
cmd: add ellipses to truncated show metadata (#10717) 11 months ago
Jesse Gross fe623c2cf4
ollamarunner: Multi-modal worst case graph 12 months ago
Jesse Gross 3c14461d5d
ollamarunner: Separate text and multimodal graphs 11 months ago
Jesse Gross 499ae7311f
ollamarunner: Base cached tokens on current prompt 11 months ago
Michael Yang ef202789fa
fix pixel values padding (#10718) 11 months ago
Michael Yang 55760195e6
fix mllama conversion (#10716) 11 months ago
Bruce MacDonald bd68d3ae50
ggml: update qwen25vl vision size estimate (#10711) 11 months ago
Daniel Hiltgen ff80718e9c
fix crash in old clients with quantization progress (#10710) 11 months ago
Bruce MacDonald 0aa8b371dd
model: add Qwen2.5-VL support (#10385) 11 months ago
Michael Yang 23125648b8
chore: update mllama to use ollama engine (#10637) 11 months ago
tej 0478d440f0
Fixed over vram allocation due to small initial layer sizes. 11 months ago
Parth Sareen 8cc33f4c2b
llama: fix memory leak for grammar (#10696) 11 months ago
Jeffrey Morgan f46df4e5d2
llama: fix defrag patch to defragment when no slots are available (#10695) 11 months ago
Daniel Hiltgen c6bcdc4223
Revert "remove cuda v11 (#10569)" (#10692) 11 months ago
Jeffrey Morgan 4b903f088a
llama: fix crash on snowflake embedding model (#10690) 11 months ago
Jeffrey Morgan c7f4ae7b9c
server: add webp image input support (#10653) 11 months ago
Michael Yang 526b2ed102
fix vocabulary (#10679) 11 months ago
Bruce MacDonald a7240c6d63
models: remove unused qwen2vl processing (#10677) 11 months ago
Daniel Hiltgen 9d6df90805
Follow up to #10363 (#10647) 11 months ago
Jeffrey Morgan 0cefd46f23
llama: update to commit de4c07f93 (#10655) 11 months ago
Bruce MacDonald ad035ad595
convert: quantize from safetensors needs kv (#10675) 11 months ago
Michael Yang f95a1f2bef
feat: add trace log level (#10650) 11 months ago
HardCodeDev 82a9e9462a
readme: add UnityCodeLama to community integrations (#10665) 11 months ago
HardCodeDev 76724e2f29
readme: add OllamaPlusPlus C++ library to community integrations (#10664) 11 months ago
frob ecf14a220f
llama: allocate grammar buffer based on schema length (#10649) 11 months ago
frob 69ce44b33c
envconfig: Remove no longer supported max vram var (#10623) 11 months ago
Michael Yang 5969674cf1
feat: add threshold to dump options (#10639) 11 months ago
AliAhmedNada 867d75b21e
readme: add ojira to community integrations (#10648) 11 months ago
Bruce MacDonald 3fa78598a1
cmd: strip single quotes from image page (#10636) 11 months ago
Michael Yang 0d6e35d3c6
fix: stream accumulator exits early (#10593) 11 months ago
Devon Rifkin 20c5fd39c8
Merge branch 'main' into drifkin/array-head-count-simple 11 months ago
Michael Yang 6e9a7a2568
lint: enable usetesting, disable tenv (#10594) 11 months ago
Michael Yang b585a58121
chore: remove unused ZipReader type (#10621) 11 months ago
Jeffrey Morgan fa9973cd7f
api: remove unused sampling parameters (#10581) 11 months ago
Jesse Gross 3d9498a425
ollamarunner: Use correct constant to remove cache entries 11 months ago
Daniel Hiltgen 3098c8b29b
CI: trigger downstream release process (#10508) 11 months ago
Daniel Hiltgen 5e380c3b42
sched: fix race leading to orphaned runners (#10599) 11 months ago
Jeffrey Morgan 392de84031
api: remove unused RetrieveModelResponse type (#10603) 11 months ago
Daniel Hiltgen af31ccefc0
fix data race in WriteGGUF (#10598) 11 months ago
Daniel Hiltgen fa393554b9
remove cuda v11 (#10569) 11 months ago
Aharon Bensadoun 307e3b3e1d
readme: add Flufy to community integrations (#9719) 11 months ago
Devon Rifkin 4090aca97b
server: send 405 instead of 404 for unallowed methods (#10275) 11 months ago
Michael Yang 92ce438de0
server: remove internal cmd (#10595) 11 months ago
Daniel Hiltgen 424810450f
Move quantization to new backend (#10363) 11 months ago
Michael Yang 95e744beeb
discover: fix compiler warnings (#10572) 11 months ago
Jeffrey Morgan 3b2d2c8326
api: remove unused or unsupported api options (#10574) 11 months ago
Michael Yang d931ee8f22
create blobs in parallel (#10135) 11 months ago
Jesse Gross 7073600797
ggml: Reduce log level of "key not found" 11 months ago
Daniel Hiltgen b1c40138da
win: lint fix (#10571) 11 months ago