self-hosted-llm

Articles Tagged “Self Hosted Llm”

2 articles found

Ollama 0.30.8 Widens Local AI Hardware Support and Speeds Up Apple Silicon

Ollama 0.30.8, released June 12, broadens GGUF hardware support through llama.cpp and upgrades its Apple Silicon MLX engine for faster, private local AI.

Dr. Nova Chen★Jun 20, 2026★3 min read

AI-Generated|Opinion

Gemma 4 QAT Lands in Ollama, Cutting Local AI Memory by ~72%

Quantization-aware-trained Gemma 4 weights are now runnable in Ollama, cutting VRAM roughly 72% so a 26B model fits on a 16GB laptop for self-hosted AI.

Dr. Nova Chen★Jun 14, 2026★5 min read