Jesse Gross f2e9c9aff5 server: Reduce gpt-oss context length for small VRAM GPUs 8 months ago
internal cache: fix comment function name in cache.go (#11110) 10 months ago
auth.go fix nil deref in auth.go 2 years ago
create.go remove support for multiple ggufs in a single file (#10722) 10 months ago
create_test.go server: validate local path on safetensor create (#9379) 1 year ago
download.go server: abort download on empty digest 10 months ago
fixblobs.go server: replace blob prefix separator from ':' to '-' (#3146) 2 years ago
fixblobs_test.go server: replace blob prefix separator from ':' to '-' (#3146) 2 years ago
harmonyparser.go gpt-oss (#11672) 8 months ago
harmonyparser_test.go gpt-oss (#11672) 8 months ago
images.go gpt-oss (#11672) 8 months ago
images_test.go Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119) 9 months ago
layer.go One corrupt manifest should not wedge model operations (#7515) 1 year ago
manifest.go One corrupt manifest should not wedge model operations (#7515) 1 year ago
manifest_test.go One corrupt manifest should not wedge model operations (#7515) 1 year ago
model.go tools: refactor tool call parsing and enable streaming (#10415) 10 months ago
modelpath.go server: add hint to the error message when model path access fails (#10843) 10 months ago
modelpath_test.go lint: enable usetesting, disable tenv (#10594) 11 months ago
prompt.go gpt-oss (#11672) 8 months ago
prompt_test.go gpt-oss (#11672) 8 months ago
quantization.go skip quantizing per_layer_token_embd (#11207) 9 months ago
quantization_test.go Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119) 9 months ago
routes.go server: Reduce gpt-oss context length for small VRAM GPUs 8 months ago
routes_create_test.go Move quantization to new backend (#10363) 11 months ago
routes_delete_test.go Update the /api/create endpoint to use JSON (#7935) 1 year ago
routes_generate_test.go tools: support anyOf types 8 months ago
routes_harmony_streaming_test.go tools: support anyOf types 8 months ago
routes_list_test.go Update the /api/create endpoint to use JSON (#7935) 1 year ago
routes_test.go server: use slices.Equal to simplify code (#11502) 8 months ago
sched.go Reduce default parallelism to 1 (#11330) 9 months ago
sched_test.go Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119) 9 months ago
sparse_common.go Don't hard fail on sparse setup error 2 years ago
sparse_windows.go Don't hard fail on sparse setup error 2 years ago
upload.go server: always print upload/download part info (#8832) 1 year ago