Ollama v0.17.3 fixes tool-calling bug in Qwen 3 and 3.5 models//Ollama released v0.17.3, a patch release that fixes a parsing issue with tool calls in the Qwen 3 and Qwen 3.5 model families. When a model emitted a tool call during its thinking phase, the call was not parsed correctly, so the function call was never executed. Tags: release, bugfix
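To make the failure mode concrete, here is a minimal Python sketch of a parser that still picks up tool calls when they appear inside the thinking block. It is not Ollama's parser: the `<think>` and `<tool_call>` tag names, the JSON payload shape, and the `parse_output` helper are assumptions chosen for illustration.

```python
import json
import re

# Hypothetical tag conventions, assumed for this sketch only.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)
TOOL_RE = re.compile(r"<tool_call>(.*?)</tool_call>", re.DOTALL)


def parse_output(raw: str) -> dict:
    """Split raw model text into thinking, tool calls, and visible content."""
    # Extract tool calls from the whole output first, so a call emitted
    # inside the thinking block is not swallowed as thinking text.
    tool_calls = [json.loads(body.strip()) for body in TOOL_RE.findall(raw)]
    without_tools = TOOL_RE.sub("", raw)

    # Whatever remains inside <think>...</think> is reasoning text.
    thinking = "\n".join(t.strip() for t in THINK_RE.findall(without_tools))

    # Everything outside the thinking block is user-visible content.
    content = THINK_RE.sub("", without_tools).strip()
    return {"thinking": thinking, "tool_calls": tool_calls, "content": content}


raw = (
    "<think>I need the weather."
    '<tool_call>{"name": "get_weather", "arguments": {"city": "Paris"}}</tool_call>'
    "</think>"
)
result = parse_output(raw)
```

A parser that strips the thinking block before looking for tool calls would return an empty `tool_calls` list for this input, which matches the symptom described above.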
Ollama v0.17.1-rc0 updates mlx-c bindings to 0.5.0//Ollama released v0.17.1-rc0, a release candidate that updates the mlx-c bindings to version 0.5.0 and switches Linux builds to GCC 11. The update is incremental and mainly affects compatibility and compiler support for users running Ollama on Linux. Tags: release, bugfix
Ollama v0.16.2-rc0 fixes mlxrunner model loading and image generation//This release candidate fixes several issues in the mlxrunner component: loading GLM4 MOE Lite models, loading diffusion models, and the --imagegen flag. These issues had prevented the affected models from running on the mlxrunner backend. Tags: release, bugfix
Ollama v0.16.1 improves installer UX and image generation timeouts//Ollama released v0.16.1 with quality-of-life improvements to installation and model configuration. The macOS curl installer now prompts for a password more selectively, the Windows installer shows installation progress, and image generation models now respect the OLLAMA_LOAD_TIMEOUT environment variable. Tags: release, bugfix
Ollama v0.16.0 adds GLM-5 and MiniMax-M2.5 models, introduces app launcher//Ollama released v0.16.0 with support for two new models: GLM-5, a reasoning model with 40B active parameters designed for complex systems tasks, and MiniMax-M2.5, optimized for productivity and coding. The release also introduces a new `ollama launch` command for launching applications with Ollama models. Tags: release, feature, model
Ollama v0.15.5-rc3 fixes Qwen3 delta net tensor broadcasting issue//This release candidate fixes a critical broadcasting issue in the Qwen3 model's delta network computation where gradient tensors were being multiplied along the wrong axis. The fix reshapes the gradient difference tensor to the correct dimensions for proper multiplication with the key tensor. Tags: bugfix, release
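The class of bug described above can be illustrated with a small Python sketch. It uses plain nested lists as a stand-in for an array library's broadcasting and is not Ollama's code: the names `k`, `g_diff`, `scale_cols`, and `scale_rows` are hypothetical, and the shapes are chosen only to show why the wrong axis can go unnoticed.

```python
def scale_cols(matrix, weights):
    """Multiply column j by weights[j] (weights treated as shape [1, D])."""
    return [[w * x for x, w in zip(row, weights)] for row in matrix]


def scale_rows(matrix, weights):
    """Multiply row i by weights[i] (weights treated as shape [T, 1])."""
    return [[w * x for x in row] for row, w in zip(matrix, weights)]


# Hypothetical key tensor with T=2 rows (positions) and D=2 columns (features).
k = [[1.0, 2.0],
     [3.0, 4.0]]

# Hypothetical per-position factors: one value per row of k.
g_diff = [10.0, 100.0]

# Because T == D here, both variants run without a shape error,
# but only the row-wise one applies each factor to its own position.
wrong = scale_cols(k, g_diff)  # [[10.0, 200.0], [30.0, 400.0]]
right = scale_rows(k, g_diff)  # [[10.0, 20.0], [300.0, 400.0]]
```

In an array library, the equivalent fix is usually to reshape the per-position vector to a column (shape `[T, 1]`) before multiplying, so that the intended axis is explicit rather than left to default broadcasting.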