ollama/model
Jeffrey Morgan 3d01f2aa34
parsers: refactor Nemotron parser to reuse Qwen3Coder for tool calls (#13764)
Simplify Nemotron3NanoParser by delegating tool call parsing to
Qwen3CoderParser instead of duplicating the parsing logic. The
Nemotron parser now only handles the thinking state machine and
transitions to Qwen3CoderParser for content and tool call parsing.

This also fixes an issue where tool calls without </think> would
cause the parser to get stuck in thinking mode.
2026-01-17 18:28:52 -08:00
..
imageproc deepseekocr 2025-11-18 16:11:37 -08:00
input batch: use tensors for outputs (#12185) 2025-09-15 14:33:06 -07:00
models revert granite-embedding (#13505) 2025-12-16 15:44:52 -08:00
parsers parsers: refactor Nemotron parser to reuse Qwen3Coder for tool calls (#13764) 2026-01-17 18:28:52 -08:00
renderers olmo3: fix flaky test (#13629) 2026-01-05 22:37:20 -08:00
testdata gemma2 impl 2025-03-11 14:35:08 -07:00
bytepairencoding.go remove unnecessary code (#13502) 2025-12-16 15:11:26 -08:00
bytepairencoding_test.go refactor: using testing.B.Loop 2025-10-10 13:25:29 -07:00
model.go fix: leaf alt name (#12390) 2025-09-23 17:50:53 -07:00
model_test.go fix: leaf alt name (#12390) 2025-09-23 17:50:53 -07:00
sentencepiece.go fix(tokenizer): add special tokens to empty inputs (#13091) 2025-11-18 11:16:56 -08:00
sentencepiece_test.go model: implement bert in ollama engine (#9080) 2025-09-15 15:35:59 -07:00
textprocessor.go model: handle multiple eos tokens (#10577) 2025-05-16 13:40:23 -07:00
vocabulary.go fix(tokenizer): add special tokens to empty inputs (#13091) 2025-11-18 11:16:56 -08:00
vocabulary_test.go fix(tokenizer): add special tokens to empty inputs (#13091) 2025-11-18 11:16:56 -08:00
wordpiece.go nomic-embed-text model implementation (#13071) 2025-11-18 18:28:10 -08:00
wordpiece_test.go nomic-embed-text model implementation (#13071) 2025-11-18 18:28:10 -08:00