You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Michael Yang de4ded68b0
Merge pull request #3923 from ollama/mxyng/mem
2 years ago
..
ext_server Support unicode characters in model path (#3681) 2 years ago
generate Remove trailing spaces (#3889) 2 years ago
llama.cpp@7593639ce3 update llama.cpp submodule to `7593639` (#3665) 2 years ago
patches use matrix multiplcation kernels in more cases 2 years ago
ggla.go refactor tensor query 2 years ago
ggml.go fix: mixtral graph 2 years ago
gguf.go fixes for gguf (#3863) 2 years ago
llm.go cgo quantize 2 years ago
llm_darwin_amd64.go Switch back to subprocessing for llama.cpp 2 years ago
llm_darwin_arm64.go Switch back to subprocessing for llama.cpp 2 years ago
llm_linux.go Switch back to subprocessing for llama.cpp 2 years ago
llm_windows.go Move nested payloads to installer and zip file on windows 2 years ago
memory.go only count output tensors 2 years ago
payload.go Move nested payloads to installer and zip file on windows 2 years ago
server.go llm: limit generation to 10x context size to avoid run on generations (#3918) 2 years ago
status.go Switch back to subprocessing for llama.cpp 2 years ago