4460 Commits (dc5a645434f0ea6364c426c6ba112da1afa40cb2)
 

Author SHA1 Message Date
Jeffrey Morgan bdd9d22dfd
tools: fix parsing issue when a tool name is a substring of another (#11456) 8 months ago
zmldndx 5fc38d042f
readme: update argo description to support deep research (#11455) 8 months ago
Daniel Hiltgen 191d94289d
ci: switch mac builder to arm64 (#11379) 9 months ago
frob 802ad16ce4
docs: add the no-Modelfile function of `ollama create` (#9077) 9 months ago
frob 5e67f4f90e
openai: allow openai endpoint to accept webp images (#11412) 9 months ago
Haiyue Wang e840ccb523
readme: update the llama.cpp github link (#11427) 9 months ago
Michael Yang b4fe3adc0a
compile bf16 support into ggml-metal (#11430) 9 months ago
Parth Sareen d73f8aa8c3
cmd: add default assistant role to message construction (#11431) 9 months ago
Bruce MacDonald 92c2e8a56c
api: fix unreachable status err (#11423) 9 months ago
Marcelo Fornet 2e3fd86d48
docs: fix typo in macos.md (#11425) 9 months ago
先知 4261a3b0b2
docs: update modelfile.md to reflect current default num_ctx (#11189) 9 months ago
Jesse Gross acef9b4c1b ggml: Use assigned layers when reporting loading stats 9 months ago
Jesse Gross 9a43994c45 ggml: Disable unused pipeline parallelism 9 months ago
Daniel Hiltgen f8a6e88819
Only load supported models on new engine (#11362) 9 months ago
Jesse Gross 35fda7b4af ggml: Report ordinal IDs for AMD GPUs on Windows 9 months ago
Daniel Hiltgen 66fb8575ce
doc: add MacOS docs (#11334) 9 months ago
Daniel Hiltgen 20c3266e94
Reduce default parallelism to 1 (#11330) 9 months ago
Daniel Hiltgen 34088dbcfb
API/CLI context enhancements (#11331) 9 months ago
Parth Sareen 43107b15b9
add `tool_name` to api.md (#11326) 9 months ago
Parth Sareen 1f91cb0c8c
template: add tool result compatibility (#11294) 9 months ago
Daniel Hiltgen 12d8ad0d38
ci: modularization (#11324) 9 months ago
Jesse Gross 592d21e7db Revert "ggml: Temporarily disable reporting UUIDs" 9 months ago
Jeffrey Morgan 5a08b01f5b
readme: update Ollama icon size 9 months ago
Daniel Hiltgen 4f473e224c
int: add performance integration tests (#11173) 9 months ago
Daniel Hiltgen 9d60bb44cf
doc: add NVIDIA blackwell to supported list (#11307) 9 months ago
Vincent RAMPAL f371260e75
Update base image to Ubuntu 24.04 LTS (#9681) 9 months ago
Daniel Hiltgen c9e6d7719e
doc: Update link for mac install (#11288) 9 months ago
Daniel Hiltgen 2c4ce40334
mimic logs for layers on new engine (#11278) 9 months ago
XuKecheng 5d8c173529
readme: add NativeMind to community integrations (#11242) 9 months ago
Jeffrey Morgan 44b17d2bfa
tools: fix parsing tool calls with empty arguments, missing required fields (#11233) 9 months ago
Attogram Project 3b8b692218
readme: add ollama-bash-toolshed to community integrations (#11224) 9 months ago
Michael Yang 4129af9205
chore: cleanup comments + unused vars (#11225) 9 months ago
Jesse Gross 45f216a9c7 ggml: Temporarily disable reporting UUIDs 9 months ago
Michael Yang d0b32def60
skip quantizing per_layer_token_embd (#11207) 9 months ago
Daniel Hiltgen 11ffc36157
ci: multi-stage release process (#11001) 9 months ago
Jeffrey Morgan ba04902670
fs/ggml: add multiplier in graph estimates (#11208) 9 months ago
Jeffrey Morgan 3944602f51
fs/ggml: add missing architecture to OllamaEngineRequired() (#11206) 9 months ago
Michael Yang 73b642e6f3
add new gemma model (#11204) 9 months ago
Daniel Hiltgen ad118d8b13
ci: arm sbsa fixes (#11194) 9 months ago
Daniel Hiltgen f08534137b ci: include dependencies 9 months ago
Daniel Hiltgen 4b4a90f233
ci: pick up arm sbsa cuda libs (#11192) 9 months ago
Daniel Hiltgen 03274a6b2f
ci: recombine linux amd64 binaries (#11188) 9 months ago
Devon Rifkin cc6463ebca
Merge pull request #10238 from ollama/drifkin/array-head-count-simple 9 months ago
Daniel Hiltgen 405d2f628f
ci: rocm parallel builds on windows (#11187) 9 months ago
Devon Rifkin a3f7dd3e98 Merge branch 'main' into drifkin/array-head-count-simple 9 months ago
Daniel Hiltgen c85c0ebf89
CI: switch windows to vs 2022 (#11184) 9 months ago
Daniel Hiltgen 10a8e04a8d
avoid context overflow (#11175) 9 months ago
Daniel Hiltgen 1c6669e64c
Re-remove cuda v11 (#10694) 9 months ago
Devon Rifkin b2b270ad5d Merge branch 'main' into drifkin/array-head-count-simple 9 months ago
AJ 2bb69b40c7
readme: add ai-hub to community integrations (#11169) 9 months ago