10 Commits (69be940bf6d2816f61c79facfa336183bc882720)

Author SHA1 Message Date
Daniel Hiltgen 69be940bf6
gpu: Group GPU Library sets by variant (#6483) 2 years ago
Michael Yang e40145a39d lint 2 years ago
Daniel Hiltgen 34b9db5afc Request and model concurrency 2 years ago
Daniel Hiltgen 39928a42e8 Always dynamically load the llm server library 2 years ago
Fabian Preiß 3bc8b9832b
fix gpu_test.go Error (same type) uint64->uint32 (#1921) 2 years ago
Jeffrey Morgan c336693f07
calculate overhead based number of gpu devices (#1875) 2 years ago
Daniel Hiltgen a2ad952440 Fix windows system memory lookup 2 years ago
Daniel Hiltgen d966b730ac Switch windows build to fully dynamic 2 years ago
Daniel Hiltgen 35934b2e05 Adapted rocm support to cgo based llama.cpp 2 years ago