You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Michael Yang 6297f85606 gofmt, goimports 2 years ago
..
ext_server revert tokenize ffi (#4761) 2 years ago
generate speed up tests by only building static lib (#4740) 2 years ago
llama.cpp@5921b8f089 Update llama.cpp submodule to `5921b8f0` (#4731) 2 years ago
patches Update llama.cpp submodule to `5921b8f0` (#4731) 2 years ago
filetype.go Add support for IQ1_S, IQ3_S, IQ2_S, IQ4_XS. IQ4_NL (#4322) 2 years ago
ggla.go simplify safetensors reading 2 years ago
ggml.go Update llm/ggml.go 2 years ago
gguf.go lint 2 years ago
llm.go revert tokenize ffi (#4761) 2 years ago
llm_darwin_amd64.go Switch back to subprocessing for llama.cpp 2 years ago
llm_darwin_arm64.go Switch back to subprocessing for llama.cpp 2 years ago
llm_linux.go Switch back to subprocessing for llama.cpp 2 years ago
llm_windows.go Move nested payloads to installer and zip file on windows 2 years ago
memory.go gofmt, goimports 2 years ago
payload.go replace x/exp/slices with slices 2 years ago
server.go lint 2 years ago
status.go Switch back to subprocessing for llama.cpp 2 years ago