ollama

mirror of https://github.com/ollama/ollama synced 2026-04-30 04:04:23 +00:00

History

Jesse Gross f20dc6b698 mlx: don't default to affine quantization for unquantized models Otherwise the BF16 version of models trigger segfaults when they call into quantized kernels.		2026-02-23 15:03:53 -08:00
..
base	mlxrunner: Fix memory leaks with pin/sweep lifecycle management	2026-02-23 09:50:07 -08:00
linear.go	bugfix: add missing linear layer factory (#14289 )	2026-02-16 17:22:20 -08:00
quant.go	mlx: don't default to affine quantization for unquantized models	2026-02-23 15:03:53 -08:00
root.go	model: add gemma3 to the mlxrunner (#14276 )	2026-02-15 22:47:59 -08:00
root_stub.go	mlxrunner fixes (#14247 )	2026-02-13 22:30:42 -08:00