1745 Commits (acfc376efda39919baeca9a8b68992b7fb74bcc0)
 

Author SHA1 Message Date
Michael Yang acfc376efd add .golangci.yaml 2 years ago
Michael Yang 997253143f add lint and test on pull_request 2 years ago
Michael Yang 62023177f6
Merge pull request #1614 from jmorganca/mxyng/fix-set-template 2 years ago
Jeffrey Morgan 6164f378f2 revert cuda overhead to 20% 2 years ago
Jeffrey Morgan f387e9631b use runner if cuda alloc won't fit 2 years ago
Jeffrey Morgan 6566387ae3 add `TODO` for cuda overhead 2 years ago
Jeffrey Morgan 37708931fb update cuda overhead to 20% to fix crashes when switching between models and large context sizes 2 years ago
Jeffrey Morgan f6cb0a553c update cuda overhead to 15% or 400MiB 2 years ago
Jeffrey Morgan 2680078c13 fix build on linux 2 years ago
Jeffrey Morgan f1b7e5f560 update overhead to 15% 2 years ago
Jeffrey Morgan cb534e6ac2 use 10% vram overhead for cuda 2 years ago
Jeffrey Morgan 58ce2d8273 better estimate scratch buffer size 2 years ago
Jeffrey Morgan 18ddf6d57d fix windows build 2 years ago
Michael Yang 61e6502449
Merge pull request #1818 from jmorganca/mxyng/fix-alt-prompt 2 years ago
Jeffrey Morgan 08f1e18965
Offload layers to GPU based on new model size estimates (#1850) 2 years ago
Bruce MacDonald 7e8f7c8358
remove ggml automatic re-pull (#1856) 2 years ago
Bruce MacDonald 3f3eb19a3b
document response in modelfile template variables (#1428) 2 years ago
Daniel Hiltgen 059ae4585e
Merge pull request #1834 from dhiltgen/old_cuda 2 years ago
Daniel Hiltgen 6347f501ca
Merge pull request #1828 from dhiltgen/fix_llava 2 years ago
Jeffrey Morgan 5feec959ad
dont use `-Wall` in static build (#1833) 2 years ago
Jeffrey Morgan dbdd50b283
add `-DCMAKE_SYSTEM_NAME=Darwin` cmake flag (#1832) 2 years ago
Daniel Hiltgen d74ce6bd4f Detect very old CUDA GPUs and fall back to CPU 2 years ago
Guilherme Baptista 57942b4676
Update README.md - Community Integrations - Ollama for Ruby (#1830) 2 years ago
Daniel Hiltgen e0d05b0f1e Accept windows paths for image processing 2 years ago
Daniel Hiltgen 2d9dd14f27
Merge pull request #1697 from dhiltgen/win_docs 2 years ago
Jeffrey Morgan 1caa56128f add cuda lib path for nvidia container toolkit 2 years ago
Michael Yang 0101e76dbe
Merge pull request #1797 from sublimator/nd-allow-extension-origins-still-needs-explicit-listing-2024-01-05 2 years ago
Michael Yang 2ef9352b94 fix(cmd): history in alt mode 2 years ago
Michael Yang 5580ae2472 fix: set template without triple quotes 2 years ago
Bruce MacDonald 3a9f447141
only pull gguf model if already exists (#1817) 2 years ago
Patrick Devine 9c2941e61b
switch api for ShowRequest to use the name field (#1816) 2 years ago
Patrick Devine 238ac5e765
Add unit tests for Parser (#1815) 2 years ago
Bruce MacDonald 4f4980b66b
simplify ggml update logic (#1814) 2 years ago
Patrick Devine 22e93efa41 add show info command and fix the modelfile 2 years ago
Patrick Devine 2909dce894 split up interactive generation 2 years ago
Jeffrey Morgan df32537312
gpu: read memory info from all cuda devices (#1802) 2 years ago
Bruce MacDonald 3367b5f3df
remove unused generate patches (#1810) 2 years ago
Matt Williams 46edbbc518
Merge pull request #1801 from jmorganca/mattw/correctdockerlink 2 years ago
Michael Yang d2ff18cd6b
Merge pull request #1791 from jmorganca/mxyng/update-build 2 years ago
Matt Williams df086d3c8c fix docker doc to point to hub 2 years ago
Nicholas Dudfield 8baaaa39c0 Allow extension origins (still needs explicit listing), fixes #1686 2 years ago
Michael Yang f9961c70ae update build 2 years ago
Daniel Hiltgen cd8fad3398
Merge pull request #1790 from dhiltgen/llm_code_shuffle 2 years ago
Daniel Hiltgen 9983fa5f4e Cleaup stale submodule 2 years ago
Daniel Hiltgen dfda91c2ee
Merge pull request #1788 from dhiltgen/llm_code_shuffle 2 years ago
Daniel Hiltgen fac9060da5 Init submodule with new path 2 years ago
Daniel Hiltgen a554616f8e remove old llama.cpp submodule path 2 years ago
Daniel Hiltgen 77d96da94b Code shuffle to clean up the llm dir 2 years ago
Brian Murray 0d6e3565ae
Add embeddings to API (#1773) 2 years ago
Daniel Hiltgen b5939008a1
Merge pull request #1785 from dhiltgen/win_native_cli 2 years ago