Ollama

Ollama — AI Infrastructure

Changelogs

Ollama v0.17.5 adds GGUF model compatibility support//Ollama releases v0.17.5 with improved compatibility for imported GGUF models. The update enables better integration with external model formats, expanding the range of models users can run locally.

releasefeatureopen-source

Ollama v0.17.3 fixes tool calling bug in Qwen 3 and 3.5 models//Ollama released v0.17.3, a patch version that addresses a critical parsing issue with tool calls in the Qwen 3 and Qwen 3.5 model families. The bug occurred when tool calls were emitted during the model's thinking phase, preventing proper execution of function calls.

releasebugfix

Ollama v0.17.2 fixes Windows app crash on update downloads//Ollama releases v0.17.2, a patch release addressing a critical stability issue affecting Windows users. The update resolves a crash that occurred when the application detected and downloaded new updates.

releasebugfix

v0.17.1-rc2

Ollama v0.17.1-rc1 adds support for Qwen3.5 architecture//Ollama releases v0.17.1-rc1, a release candidate that introduces support for the Qwen3.5 model architecture. This expansion allows users to run Qwen3.5 models locally through Ollama's platform.

releasefeaturemodel

Ollama v0.17.1-rc0 updates mlx-c bindings to 0.5.0//Ollama releases v0.17.1-rc0, a release candidate that updates mlx-c bindings to version 0.5.0 and switches Linux builds to use GCC 11. This incremental update improves compatibility and compiler support for users running Ollama on Linux systems.

releasebugfix

Ollama v0.17.0-rc1 removes noisy error output from MLX library loading//Ollama's v0.17.0-rc1 release candidate addresses misleading error messages that appeared during dynamic library loading for MLX on systems without rpath set. The fix silences expected fallback failures while preserving proper error logging for actual failures.

releasebugfix

Ollama v0.16.3 adds Cline CLI integration and support for Gemma 3, Llama 3, Qwen 3//Ollama releases v0.16.3 with new CLI integration for Cline and expanded model architecture support in the MLX runner. The model picker now displays consistently across all launch integrations.

releasefeatureintegration

Ollama v0.16.3-rc2 prevents partial script execution during installation//Ollama's installation process now wraps the download script in a main function to prevent truncated partial downloads from executing incomplete code. This security-focused fix ensures that interrupted installations won't leave the system in a compromised state.

releasebugfixsecurity

v0.16.3-rc0

Ollama v0.16.2 adds web search for Claude cloud models, fixes image generation//Ollama's latest patch release enhances Claude cloud model integration with web search capabilities and fixes a critical issue preventing experimental image generation models from running. The update also adds privacy controls to disable cloud models entirely for sensitive workloads.

releasebugfixfeature

Ollama v0.16.2-rc0 fixes mlxrunner model loading and image generation//This release candidate addresses several issues with the mlxrunner component, including fixes for loading GLM4 MOE Lite models, diffusion model loading, and the --imagegen flag. The update resolves functionality gaps that were preventing proper model execution in the mlxrunner backend.

releasebugfix

Ollama v0.16.1 improves installer UX and image generation timeouts//Ollama releases v0.16.1 with quality-of-life improvements to installation and model configuration. Changes include smarter password prompts for macOS curl installations, installation progress visibility on Windows, and respect for the OLLAMA_LOAD_TIMEOUT variable in image generation models.

releasebugfix

Ollama v0.16.0 adds GLM-5 and MiniMax-M2.5 models, introduces app launcher//Ollama releases v0.16.0 with support for two new state-of-the-art models: GLM-5, a 40B-parameter reasoning model designed for complex systems tasks, and MiniMax-M2.5, optimized for productivity and coding. The release also introduces a new `ollama launch` command for seamlessly integrating models with applications.

releasefeaturemodel

Ollama v0.15.6 fixes context limits and adds automatic model downloads//Ollama releases v0.15.6 with three key bugfixes addressing context handling and model management. The update improves reliability when launching AI models and enhances the user experience by automatically downloading missing models instead of failing.

releasebugfix

Ollama v0.15.5 adds two new models and improves agentic coding support//Ollama releases v0.15.5 with two new models for coding and document understanding, plus improved `ollama launch` support for sub-agents and context length auto-tuning based on available VRAM. The release also fixes token handling bugs in the API.

releasefeatureapimodelbugfix

Ollama v0.15.5-rc4 fixes off-by-one error in token prediction limits//An off-by-one bug in Ollama's numPredict parameter caused users to receive one fewer token than requested and incorrect token statistics. The fix ensures token limits are properly enforced at prediction time rather than batch setup.

releasebugfix

Ollama v0.15.5-rc3 fixes Qwen3 delta net tensor broadcasting issue//This release candidate fixes a critical broadcasting issue in the Qwen3 model's delta network computation where gradient tensors were being multiplied along the wrong axis. The fix reshapes the gradient difference tensor to the correct dimensions for proper multiplication with the key tensor.

bugfixrelease