NVIDIA enters the open-weights frontier with a 550B-parameter MoE running 55B active parameters per …
releases, ideas, and moments that changed how everyone’s work gets done. model deep dives in the wiki.
The administration signs 'Promoting Advanced Artificial Intelligence Innovation and Security' — the …
Microsoft ships MAI-Thinking-1 at Build 2026 — its first in-house reasoning model, a sparse MoE with …
Alphabet prices an $84.75B equity capital raise on June 2 — upsized from $80B after investor demand …
MiniMax releases M3 on June 1 with a sparse attention architecture (MSA — MiniMax Sparse Attention) …
Anthropic confidentially submits a draft S-1 to the SEC on June 1, 2026 — three days after closing t …
Anthropic's next flagship ships with a deliberately quiet announcement, but the improvements are str …
Anthropic closes a $65B Series H on the same day Opus 4.8 ships — the timing is deliberate: close th …
Gemini 3.5 Flash goes GA on May 19 and immediately reshapes how the tier system is supposed to work. …
Anthropic ships Opus 4.7, the first model in the Claude 4 family designed explicitly for sustained, …
Google's annual developer conference bets the keynote on AI and for once the products justify it. …
Build goes deep on 'AI-native Windows': Copilot+ PCs now run Phi-4 locally with a new Windows AI API …
Jensen Huang keynotes Computex with Blackwell Ultra: 1.5× the dense FLOPS of standard Blackwell at t …
Mistral releases Large 3 — a 200B+ dense model under a non-commercial research license, their heavie …
The EU AI Act's high-risk category prohibitions go live — the first binding AI regulation with real …
Perplexity ships Comet, a standalone browser built around an AI agent that operates the web on your …
OpenAI ships GPT-5.5 on April 23 with an explicit focus on agentic work: coding, computer use, and k …
DeepSeek drops V4-Flash and V4-Pro on April 24 under MIT — 1M-token context, dual Thinking/Non-Think …
Jeff Bezos's physical-AI lab closes a $10B round at a $38B valuation, led by BlackRock and JPMorgan. …
Cognition — maker of Devin, the first broadly-deployed AI software engineer — enters funding talks a …
Anthropic launches a cybersecurity coalition with AWS, Apple, Google, Microsoft, and others — backed …
Google DeepMind ships Gemma 4 in four sizes — E2B, E4B, 26B MoE, and 31B Dense — distilled from Gemi …
OpenAI closes the largest private funding round in history: $122B with Amazon ($50B), Nvidia ($30B), …
OpenAI deprecates GPT-5.1 and ships GPT-5.4, GPT-5.4 Thinking, and GPT-5.4 mini. …
Meta's first major model release since acquiring Scale AI's Alexandr Wang scores #4 on the Artificia …
xAI ships Grok 4.20 with the strongest current-events accuracy of any frontier model at release — a …
Anthropic spins out a dedicated research organization led by co-founder Jack Clark — who spent five …
Karpathy releases a 630-line open-source script that lets an AI agent autonomously run ML experiment …
Within days of Karpathy's release, Tobi Lütke adapts autoresearch for a Shopify model training run — …
Anthropic's flagship at release, later succeeded by Opus 4.7. …
The first Sonnet-class model to hit 1M token context, which matters because previous Sonnet models t …
Released February 27, 2025, GPT-4.5 is OpenAI's largest non-reasoning model — the explicit bet that …
Open-sourced March 4, 2026 under Apache 2.0 at github.com/openai/symphony, Symphony is a Codex App S …
Huntley's thesis: stop building brick by brick, start programming the loop. …
OpenAI takes the Realtime API from beta to general availability on August 28, paired with gpt-realti …
Mistral ships Medium 3.1 on August 12 — a multimodal proprietary model with a custom-trained vision …
OpenAI ships GPT-5 on August 7 — not as a single new model but as a unified system: a fast tier for …
NVIDIA closes at a $4 trillion market capitalization on July 10 — the first publicly traded company …
xAI ships Grok 4 and Grok 4 Heavy on July 9, unveiled via a livestream that drew 1.5 million concurr …
Google moves Gemini 2.5 Flash to general availability on June 17, completing the transition from exp …
Anthropic's most capable agentic model at time of release. …
Google's Gemini 2.5 Pro moves from experimental to general availability, bringing its top-ranked lon …
Cursor reaches $100M ARR — the fastest SaaS product to that milestone in history. …
Anthropic's Claude 4 generation ships May 22, 2025. …
Launched as a limited research preview in February 2025, Claude Code went generally available on May …
OpenAI open-sources Codex CLI under Apache 2.0 in May 2025 — a terminal-native coding agent that acc …
Lütke's internal memo goes wide: AI usage is now a baseline expectation at Shopify, not a differenti …
Released March 2025 as an experimental preview — meaning rate-limited, no SLA, and subject to change …
DeepSeek releases the first open-source reasoning model trained via pure reinforcement learning to m …
Karpathy coins the term in a tweet: describe a project in natural language, accept AI-generated code …
Anthropic's first hybrid reasoning model: standard fast responses for most tasks, extended thinking …
xAI ships Grok 3, trained on 10x the compute of Grok 2. …
OpenAI open-sources a multi-agent orchestration framework with first-class primitives for handoffs, …
Anthropic ships Claude 3.5 computer use in public beta: Claude can see your screen, move the cursor, …
Anthropic's fastest model in the 3.5 family ships in November 2024 and immediately reframes what 'sm …
OpenAI opens the Realtime API to public beta in October 2024, giving developers a persistent WebSock …
Apple's redesigned Mac Mini ships with M4 and M4 Pro chips at $600 and $1,400. …
Anthropic open-sources MCP — a universal connector standard for AI agents and external systems. …
Alibaba releases Qwen 2.5-Coder in 7B (Sep), 32B, and 72B (Nov) variants. …
OpenAI ships the full o1 on December 5, 2024 — not a preview, a production model — and the gap over …
Meta closes out 2024 on December 6 with Llama 3.3 70B Instruct — and the headline is efficiency, not …
Google announces Gemini 2.0 Flash: native image and audio output, a Multimodal Live API for real-tim …
A 671B MoE model trained for $5.58M that matches Claude 3.5 Sonnet and o1 on most benchmarks and run …
Meta ships Llama 3.1 with a 405B parameter flagship, 128K context, and support for eight additional …
OpenAI releases GPT-4o mini on July 18, 2024 at $0.15/$0.60 per million input/output tokens — more t …
Mistral releases Large 2 on July 24, 2024 — 123B parameters, 128K context, MIT license — their most …
OpenAI ships o1-preview and o1-mini: models that 'think before they answer' via long internal chain- …
Alibaba releases Qwen 2.5 across dense and MoE variants, describing it as 'perhaps the largest open- …
Google ships Audio Overviews in NotebookLM — converts any uploaded document into a two-host podcast. …
Meta releases Llama 3.2 in four sizes: 1B and 3B text-only models optimized for edge and mobile, plu …
OpenAI ships GPT-4 Omni: natively multimodal across text, audio, image, and video — no separate enco …
Anthropic releases Claude 3.5 Sonnet on June 20, 2024 — at 80% lower cost than Claude 3 Opus, while …
Meta releases Llama 3 in 8B and 70B sizes, trained on 15 trillion tokens — 7x more data than Llama 2 …
Apple announces Apple Intelligence at WWDC 2024: on-device models across iPhone, iPad, and Mac — wri …
Google's Gemini 1.5 Pro becomes generally available with a 1-million-token context window. …
Anthropic launches Haiku, Sonnet, and Opus — the first model family to give developers meaningful sp …
Google announces Gemini 1.5 Pro with a breakthrough 1-million-token context window using a sparse Mo …
OpenAI announces Sora: a text-to-video diffusion model that generates 60-second photorealistic clips …
xAI open-sources Grok-1 — the full 314B MoE base model — under Apache 2.0. …
OpenAI hosts its first developer conference: GPT-4 Turbo with 128K context, GPTs (custom instruction …
Mistral AI drops Mixtral 8x7B without announcement — a torrent link on X with no blog post. …
Google announces Gemini 1.0 — its multimodal response to GPT-4. …
Elon Musk's xAI launches Grok to X Premium+ subscribers — a 314B MoE model trained from scratch with …
Meta and Microsoft release Llama 2 with 7B, 13B, and 70B variants, trained on 2 trillion tokens — do …
Meta releases Code Llama (7B, 13B, 34B) — Llama 2 fine-tuned on 500B tokens of code with fill-in-the …
Mistral AI, founded in April by ex-Meta and Google researchers, releases its first model — Mistral 7 …
The Technology Innovation Institute in Abu Dhabi releases Falcon 180B — a 180B parameter model train …
Auto-GPT — an open-source experiment that chains GPT-4 calls into a goal-directed autonomous agent — …
LangChain consolidates its position as the default framework for building LLM applications — chains, …
GitHub launches Copilot Chat in general availability — a conversational interface for code in VS Cod …
ChatGPT reaches 100 million monthly active users two months after launch — the fastest consumer appl …
Microsoft ships Bing Chat (later Bing AI) — a GPT-4-powered search engine before GPT-4 was publicly …
OpenAI releases GPT-4: the first multimodal frontier model, passing the bar exam in the 90th percent …
Anthropic releases its first production model. …
The meaningful unit of AI integration has shifted from the API call to the context window. …
For most tasks, the bottleneck has inverted: generating tokens is cheap, sampling enough to find a g …
The major benchmarks that defined frontier capability for 2022–2024 are saturated or compromised. …
The bottleneck in deploying AI agents is not model capability — it's context management, tool reliab …
The gap between open-weight and frontier closed models closed in 2023–2024, then widened again in 20 …
Richard Sutton's March 2019 essay distills 70 years of AI research into a single uncomfortable obser …
Ilya Sutskever's 2023 observation — most clearly stated at NVIDIA GTC in March of that year — refram …
Sam Altman first used this framing in July 2024 when announcing GPT-4o mini pricing, echoing Lewis S …
Vaswani et al. …
Kaplan et al. …
Lewis et al. …
The GPT-3 paper. …
Wei et al. …
The InstructGPT paper. …
Bai et al. …
Rafailov et al. …
Dao et al. …
Bubeck et al. …
Templeton et al. …
Marks et al. …
The DeepSeek team trains a frontier reasoning model using pure RL — no supervised fine-tuning on cha …