92 Commits (main)

Author SHA1 Message Date
Michael Yang fa7776fd24 gpt-oss (#11672) 8 months ago
Michael Yang 6c733bf0a6 s#x/exp/maps#maps# (#11506) 8 months ago
Michael Yang 4129af9205 chore: cleanup comments + unused vars (#11225) 9 months ago
Michael Yang 73b642e6f3 add new gemma model (#11204) 9 months ago
Michael Yang c088ac0e79 convert: utility for merging tensors (#11069) 9 months ago
Michael Yang 45f56355d5 feat: uneven splits (#11048) 10 months ago
Michael Yang adff143bcd fix: mllama quality (#10807) 10 months ago
Jesse Gross 94ab428e3f ggml: Seperate tensor load from backend creation 12 months ago
Michael Yang 333e360422 model: handle multiple eos tokens (#10577) 11 months ago
Michael Yang 55760195e6 fix mllama conversion (#10716) 11 months ago
Bruce MacDonald 0aa8b371dd model: add Qwen2.5-VL support (#10385) 11 months ago
Michael Yang 23125648b8 chore: update mllama to use ollama engine (#10637) 11 months ago
Michael Yang b585a58121 chore: remove unused ZipReader type (#10621) 11 months ago
Daniel Hiltgen 424810450f Move quantization to new backend (#10363) 11 months ago
湛露先生 7e5c8eee5c file close check and close. (#10554) 11 months ago
Michael Yang 7ba9fa9c7d fixes for maverick 11 months ago
Michael Yang 8bf11b84c1 chunked attention 12 months ago
Michael Yang f0c66e6dea llama4 1 year ago
Michael Yang dc1e81f027 convert: use -1 for read all 11 months ago
Michael Yang 4892872c18 convert: change to colmajor 11 months ago
Michael Yang 2fec73eef6 fix write gguf padding 12 months ago
Bruce MacDonald 6bd0a983cd model: support for mistral-small in the ollama runner 1 year ago
Bruce MacDonald 9876c9faa4 chore(all): replace instances of interface with any (#10067) 1 year ago
Bruce MacDonald 61a8825216 convert: return name of unsupported architecture (#9862) 1 year ago
Patrick Devine 80c7ce381b fix: change default context size for gemma3 (#9744) 1 year ago
jmorganca 83f0ec8269 all: address linter errors 1 year ago
Michael Yang 63a394068c use 2d pooling 1 year ago
Patrick Devine 2e54d72fc3 fix gemma3 1b conversion 1 year ago
Michael Yang 6b32a2d549 compat with upstream gguf 1 year ago
Michael Yang d368c039f0 skip repacking vision tensors 1 year ago
Patrick Devine 9b54267e69 fix configs 1 year ago
Michael Yang 46bb0169c4 update model 1 year ago
Patrick Devine c62861f4fa fix conversion 1 year ago
Michael Yang 0df1800436 set non-causal attention 1 year ago
Patrick Devine 631fecc6d9 temporary work around for converting spm 1 year ago
Michael Yang 4b037a97dc add gemma vision encoder 1 year ago
Patrick Devine 5f74d1fd47 gemma2 impl 1 year ago
Michael Yang 58245413f4 next ollama runner (#7913) 1 year ago
Josh 93a8daf285 convert: import support for command-r models from safetensors (#6063) 1 year ago
Bruce MacDonald f6f3713001 convert: qwen2 from safetensors (#8408) 1 year ago
Stefan Weil abfdc4710f all: fix typos in documentation, code, and comments (#7021) 1 year ago
Michael Yang 4456012956 fix unmarshaling merges 1 year ago
Patrick Devine c7cb0f0602 image processing for llama3.2 (#6963) 1 year ago
Patrick Devine 84b84ce2db catch when model vocab size is set correctly (#6714) 2 years ago
Patrick Devine 608e87bf87 Fix gemma2 2b conversion (#6645) 2 years ago
Patrick Devine 6c1c1ad6a9 throw an error when encountering unsupport tensor sizes (#6538) 2 years ago
Michael Yang 60e47573a6 more tokenizer tests 2 years ago
Michael Yang eae3af6807 clean up convert tokenizer 2 years ago
Michael Yang 3eb08377f8 detect chat template from configs that contain lists 2 years ago
Patrick Devine 0c819e167b convert safetensor adapters into GGUF (#6327) 2 years ago