ollama

mirror of https://github.com/ollama/ollama synced 2026-04-23 08:45:14 +00:00

History

Bruce MacDonald 3980c0217d server: decompress zstd request bodies in cloud passthrough middleware (#14827 ) When a zstd-compressed request (e.g. from Codex CLI) hits /v1/responses with a cloud model the request failed. Fix by decompressing zstd bodies before model extraction, so cloud models are detected and proxied directly without the writer being wrapped.		2026-03-13 15:06:47 -07:00
..
internal	docs: fix typos in repository documentation (#10683 )	2025-11-15 20:22:29 -08:00
auth.go	server: reject unexpected auth hosts (#13738 )	2026-01-16 14:10:36 -05:00
auth_test.go	server: reject unexpected auth hosts (#13738 )	2026-01-16 14:10:36 -05:00
cloud_proxy.go	server: decompress zstd request bodies in cloud passthrough middleware (#14827 )	2026-03-13 15:06:47 -07:00
cloud_proxy_test.go	server: decompress zstd request bodies in cloud passthrough middleware (#14827 )	2026-03-13 15:06:47 -07:00
create.go	Reapply "don't require pulling stubs for cloud models" again (#14608 )	2026-03-06 14:27:47 -08:00
create_test.go	Clean up the manifest and modelpath (#13807 )	2026-01-21 11:46:17 -08:00
download.go	Clean up the manifest and modelpath (#13807 )	2026-01-21 11:46:17 -08:00
fixblobs.go
fixblobs_test.go
images.go	mlxrunner: Enforce model context limit	2026-02-27 17:29:47 -08:00
images_test.go	x/imagegen: add image edit capabilities (#13846 )	2026-01-22 20:35:08 -08:00
logprob.go	logprob: add bytes to logprobs (#13068 )	2025-11-13 13:49:25 -08:00
model.go	Clean up the manifest and modelpath (#13807 )	2026-01-21 11:46:17 -08:00
model_resolver.go	Reapply "don't require pulling stubs for cloud models" again (#14608 )	2026-03-06 14:27:47 -08:00
model_resolver_test.go	Reapply "don't require pulling stubs for cloud models" again (#14608 )	2026-03-06 14:27:47 -08:00
prompt.go	mlxrunner: Enforce model context limit	2026-02-27 17:29:47 -08:00
prompt_test.go	model/renderers: fix glm-ocr image tags in renderer prompts (#14584 )	2026-03-03 12:51:34 -08:00
quantization.go	model: support for qwen3.5 architecture (#14378 )	2026-02-24 20:08:05 -08:00
quantization_test.go	model: support for qwen3.5 architecture (#14378 )	2026-02-24 20:08:05 -08:00
routes.go	server: remove experimental aliases support (#14810 )	2026-03-12 20:27:24 -07:00
routes_cloud_test.go	cloud_proxy: send ollama client version (#14769 )	2026-03-10 15:53:25 -07:00
routes_create_test.go	Reapply "don't require pulling stubs for cloud models" again (#14608 )	2026-03-06 14:27:47 -08:00
routes_debug_test.go	server: use tiered VRAM-based default context length	2026-02-02 10:47:09 -08:00
routes_delete_test.go	Reapply "don't require pulling stubs for cloud models" again (#14608 )	2026-03-06 14:27:47 -08:00
routes_generate_renderer_test.go	server: use tiered VRAM-based default context length	2026-02-02 10:47:09 -08:00
routes_generate_test.go	bugfix: better mlx model scheduling (#14290 )	2026-02-17 13:57:05 -08:00
routes_harmony_streaming_test.go	preserve tool definition and call JSON ordering (#13525 )	2026-01-05 18:03:36 -08:00
routes_list_test.go	Update the /api/create endpoint to use JSON (#7935 )	2024-12-31 18:02:30 -08:00
routes_options_test.go	server: use tiered VRAM-based default context length	2026-02-02 10:47:09 -08:00
routes_test.go	server: return error when embedding contains NaN or Inf values (#13599 )	2026-01-03 02:20:12 -05:00
routes_web_experimental_test.go	cloud_proxy: send ollama client version (#14769 )	2026-03-10 15:53:25 -07:00
sched.go	chore: remove old imagegen LLMs models (#14597 )	2026-03-03 13:23:40 -08:00
sched_test.go	chore: remove old imagegen LLMs models (#14597 )	2026-03-03 13:23:40 -08:00
sparse_common.go
sparse_windows.go
test_home_test.go	add ability to disable cloud (#14221 )	2026-02-12 15:47:00 -08:00
upload.go	Clean up the manifest and modelpath (#13807 )	2026-01-21 11:46:17 -08:00