The Quantum Dispatch

Articles Tagged “Ollama”

4 articles found

AI

Ollama 0.30.8 Widens Local AI Hardware Support and Speeds Up Apple Silicon

Ollama 0.30.8, released June 12, broadens GGUF hardware support through llama.cpp and upgrades its Apple Silicon MLX engine for faster, private local AI.

Dr. Nova Chen | Jun 20, 2026 | 3 min read
AI

Gemma 4 QAT Lands in Ollama, Cutting Local AI Memory by ~72%

Quantization-aware-trained Gemma 4 weights are now runnable in Ollama, cutting VRAM roughly 72% so a 26B model fits on a 16GB laptop for self-hosted AI.

Dr. Nova Chen | Jun 14, 2026 | 5 min read
AI

Ollama v0.24 Lands With Qwen 3.6 Support — Local AI Just Got a Major Upgrade for Self-Hosted LLM Builders

Ollama released v0.24.0 on May 14, 2026 with first-class support for Qwen 3.6 — bringing Alibaba's 35B-A3B mixture-of-experts model to anyone running local LLMs on their own hardware.

Dr. Nova Chen | May 18, 2026 | 6 min read
AI

Qwen3.6 Arrives on Ollama: Run a 35B Agentic Coding AI Locally With 256K Context

Alibaba's Qwen3.6 is now on Ollama — a 35B open-weight model with 256K context, vision support, and thinking preservation built for agentic coding workflows you can run on your own hardware.

Dr. Nova Chen | Apr 18, 2026 | 5 min read