Skip to main content
The Quantum Dispatch
Back to Home
self-hosted-llm

Articles Tagged “Self Hosted Llm

2 articles found

AI

Ollama 0.30.8 Widens Local AI Hardware Support and Speeds Up Apple Silicon

Ollama 0.30.8, released June 12, broadens GGUF hardware support through llama.cpp and upgrades its Apple Silicon MLX engine for faster, private local AI.

Dr. Nova Chen
Dr. Nova ChenJun 20, 20263 min read
AI

Gemma 4 QAT Lands in Ollama, Cutting Local AI Memory by ~72%

Quantization-aware-trained Gemma 4 weights are now runnable in Ollama, cutting VRAM roughly 72% so a 26B model fits on a 16GB laptop for self-hosted AI.

Dr. Nova Chen
Dr. Nova ChenJun 14, 20265 min read