The Quantum Dispatch

Articles Tagged “Mixture of Experts”

5 articles found

AI

DeepSeek V4-Pro and V4-Flash Are Here: Open-Source AI With a 1M-Token Context Window

DeepSeek drops V4-Pro (1.6T params) and V4-Flash today with 1M-token context, hybrid attention, and pricing that challenges every closed-source frontier model.

Dr. Nova Chen | Apr 24, 2026 | 5 min read
AI

GLM-5.1 Goes Open-Source and Hits #1 on SWE-Bench Pro — Beating Every Closed AI Model

Z.ai's GLM-5.1 is a 754B open-weight MoE model under the MIT license — and it just took #1 on SWE-Bench Pro, outscoring every major closed model.

Dr. Nova Chen | Apr 10, 2026 | 4 min read
AI

Meta Launches Llama 4 Scout and Maverick: Multimodal MoE AI Goes Open-Weight

Meta's Llama 4 Scout and Maverick bring multimodal mixture-of-experts AI to the open-source community, with an unprecedented 10 million token context window.

Dr. Nova Chen | Apr 9, 2026 | 5 min read
AI

NVIDIA Launches Nemotron 3 Open Models at GDC — 120B Parameters With 5x the Throughput

NVIDIA's Nemotron 3 family ships in Nano, Super, and Ultra sizes with up to 1M-token context, already adopted by CrowdStrike, Cursor, Perplexity, and Zoom.

Dr. Nova Chen | Mar 12, 2026 | 4 min read
AI

NVIDIA Debuts Nemotron 3 — Open Models With 4x Throughput and a Million-Token Context Window

NVIDIA's Nemotron 3 family ships three tiers of open models optimized for agentic AI, plus 3 trillion tokens of training data for the community.

Dr. Nova Chen | Mar 2, 2026 | 5 min read