4366 Commits (a3f7dd3e98df803695f1ae165bc61a1b52142449)
 

Author SHA1 Message Date
Michael Yang 526b2ed102
fix vocabulary (#10679) 11 months ago
Bruce MacDonald a7240c6d63
models: remove unused qwen2vl processing (#10677) 11 months ago
Daniel Hiltgen 9d6df90805
Follow up to #10363 (#10647) 11 months ago
Jeffrey Morgan 0cefd46f23
llama: update to commit de4c07f93 (#10655) 11 months ago
Bruce MacDonald ad035ad595
convert: quantize from safetensors needs kv (#10675) 11 months ago
Michael Yang f95a1f2bef
feat: add trace log level (#10650) 11 months ago
HardCodeDev 82a9e9462a
readme: add UnityCodeLama to community integrations (#10665) 11 months ago
HardCodeDev 76724e2f29
readme: add OllamaPlusPlus C++ library to community integrations (#10664) 11 months ago
frob ecf14a220f
llama: allocate grammar buffer based on schema length (#10649) 11 months ago
frob 69ce44b33c
envconfig: Remove no longer supported max vram var (#10623) 11 months ago
Michael Yang 5969674cf1
feat: add threshold to dump options (#10639) 11 months ago
AliAhmedNada 867d75b21e
readme: add ojira to community integrations (#10648) 11 months ago
Bruce MacDonald 3fa78598a1
cmd: strip single quotes from image page (#10636) 11 months ago
Michael Yang 0d6e35d3c6
fix: stream accumulator exits early (#10593) 11 months ago
Devon Rifkin 20c5fd39c8
Merge branch 'main' into drifkin/array-head-count-simple 11 months ago
Michael Yang 6e9a7a2568
lint: enable usetesting, disable tenv (#10594) 11 months ago
Michael Yang b585a58121
chore: remove unused ZipReader type (#10621) 11 months ago
Jeffrey Morgan fa9973cd7f
api: remove unused sampling parameters (#10581) 11 months ago
Jesse Gross 3d9498a425
ollamarunner: Use correct constant to remove cache entries 11 months ago
Daniel Hiltgen 3098c8b29b
CI: trigger downstream release process (#10508) 11 months ago
Daniel Hiltgen 5e380c3b42
sched: fix race leading to orphaned runners (#10599) 11 months ago
Jeffrey Morgan 392de84031
api: remove unused RetrieveModelResponse type (#10603) 11 months ago
Daniel Hiltgen af31ccefc0
fix data race in WriteGGUF (#10598) 11 months ago
Daniel Hiltgen fa393554b9
remove cuda v11 (#10569) 11 months ago
Aharon Bensadoun 307e3b3e1d
readme: add Flufy to community integrations (#9719) 11 months ago
Devon Rifkin 4090aca97b
server: send 405 instead of 404 for unallowed methods (#10275) 11 months ago
Michael Yang 92ce438de0
server: remove internal cmd (#10595) 11 months ago
Daniel Hiltgen 424810450f
Move quantization to new backend (#10363) 11 months ago
Michael Yang 95e744beeb
discover: fix compiler warnings (#10572) 11 months ago
Jeffrey Morgan 3b2d2c8326
api: remove unused or unsupported api options (#10574) 11 months ago
Michael Yang d931ee8f22
create blobs in parallel (#10135) 11 months ago
Jesse Gross 7073600797
ggml: Reduce log level of "key not found" 11 months ago
Daniel Hiltgen b1c40138da
win: lint fix (#10571) 11 months ago
Ashok Gelal 17466217e5
Hide empty terminal window (#8668) 11 months ago
Jeffrey Morgan 1703d1472e
server: fix panic when runner.Options is nil (#10566) 11 months ago
Jeffrey Morgan 913905028b
all: fix cgo compiler warnings on windows (#10563) 11 months ago
湛露先生 7e5c8eee5c
file close check and close. (#10554) 11 months ago
Daniel Hiltgen 6a74bba7e7
win: ensure ollama paths come first (#10549) 11 months ago
Daniel Hiltgen 76ea735aaf
sched: logging improvements (#10550) 11 months ago
aritra saha dd1d4e99e7
readme: add llama 4 models (#10530) 11 months ago
Jesse Gross a6ef73f4f2
ggml: Fix race that resulted in "context canceled" when loading 11 months ago
Jesse Gross c2f5d6662b
ollamarunner: Re-enable worst case graph preallocation. 11 months ago
Harsh Nevse 57fb759f3c
readme: update link to langchain in community integrations (#10465) 11 months ago
Jeffrey Morgan 8dd12c873d
llama: update to commit e1e8e099 (#10513) 11 months ago
frob e6d2d04121
image: add vision capability for projector-based models (#10509) 11 months ago
Jesse Gross 074bac8447
kvcache: Log batch size if we can't find a slot 11 months ago
Jesse Gross 8e8f2c6d67
ollamarunner: Fix memory leak when processing images 11 months ago
AliAhmedNada 938e8447e8
readme: add Jirapt project to community integrations (#10522) 11 months ago
aritra saha d5d5f0c445
readme: change granite3.2 to granite3.3 (#10525) 11 months ago
Michael Yang a7835c6716
fix: write gguf padding (#10510) 11 months ago