ollama/x/mlxrunner/model
Jesse Gross f20dc6b698 mlx: don't default to affine quantization for unquantized models
Otherwise the BF16 version of models trigger segfaults when they
call into quantized kernels.
2026-02-23 15:03:53 -08:00
..
base mlxrunner: Fix memory leaks with pin/sweep lifecycle management 2026-02-23 09:50:07 -08:00
linear.go bugfix: add missing linear layer factory (#14289) 2026-02-16 17:22:20 -08:00
quant.go mlx: don't default to affine quantization for unquantized models 2026-02-23 15:03:53 -08:00
root.go model: add gemma3 to the mlxrunner (#14276) 2026-02-15 22:47:59 -08:00
root_stub.go mlxrunner fixes (#14247) 2026-02-13 22:30:42 -08:00