releases, ideas, and moments that changed how everyone’s work gets done. model deep dives in the wiki.

2026 · Q2 (26)
model · NVIDIA Nemotron 3 Ultra

NVIDIA enters the open-weights frontier with a 550B-parameter MoE running 55B active parameters per …

signal · Trump AI Executive Order

The administration signs 'Promoting Advanced Artificial Intelligence Innovation and Security' — the …

model · MiniMax M3

MiniMax releases M3 on June 1 with a sparse attention architecture (MSA — MiniMax Sparse Attention) …

model · Claude Opus 4.8

Anthropic's next flagship ships with a deliberately quiet announcement, but the improvements are str …

model · Gemini 3.5 Flash

Gemini 3.5 Flash goes GA on May 19 and immediately reshapes how the tier system is supposed to work. …

model · Claude Opus 4.7

Anthropic ships Opus 4.7, the first model in the Claude 4 family designed explicitly for sustained, …

moment · Google I/O 2026

Google's annual developer conference bets the keynote on AI and for once the products justify it. …

moment · Microsoft Build 2026

Build goes deep on 'AI-native Windows': Copilot+ PCs now run Phi-4 locally with a new Windows AI API …

model · Mistral Large 3

Mistral releases Large 3 — a 200B+ dense model under a non-commercial research license, their heavie …

tool · Perplexity Comet

Perplexity ships Comet, a standalone browser built around an AI agent that operates the web on your …

model · GPT-5.5

OpenAI ships GPT-5.5 on April 23 with an explicit focus on agentic work: coding, computer use, and k …

model · DeepSeek V4 preview

DeepSeek drops V4-Flash and V4-Pro on April 24 under MIT — 1M-token context, dual Thinking/Non-Think …

funding · Project Prometheus closes $10B

Jeff Bezos's physical-AI lab closes a $10B round at a $38B valuation, led by BlackRock and JPMorgan. …

funding · Cognition AI in talks at $25B

Cognition — maker of Devin, the first broadly-deployed AI software engineer — enters funding talks a …

moment · Project Glasswing

Anthropic launches a cybersecurity coalition with AWS, Apple, Google, Microsoft, and others — backed …

model · Gemma 4

Google DeepMind ships Gemma 4 in four sizes — E2B, E4B, 26B MoE, and 31B Dense — distilled from Gemi …

model · GPT-5.4

OpenAI deprecates GPT-5.1 and ships GPT-5.4, GPT-5.4 Thinking, and GPT-5.4 mini. …

model · Meta Muse Spark

Meta's first major model release since acquiring Scale AI's Alexandr Wang scores #4 on the Artificia …

model · Grok 4.20 + xAI Series E

xAI ships Grok 4.20 with the strongest current-events accuracy of any frontier model at release — a …

2026 · Q1 (8)
moment · The Anthropic Institute

Anthropic spins out a dedicated research organization led by co-founder Jack Clark — who spent five …

moment · Karpathy's autoresearch

Karpathy releases a 630-line open-source script that lets an AI agent autonomously run ML experiment …

moment · Tobi adapts autoresearch

Within days of Karpathy's release, Tobi Lütke adapts autoresearch for a Shopify model training run — …

model · Claude Opus 4.6

Anthropic's flagship at release, later succeeded by Opus 4.7. …

model · Claude Sonnet 4.6

The first Sonnet-class model to hit 1M token context, which matters because previous Sonnet models t …

model · GPT-4.5

Released February 27, 2025, GPT-4.5 is OpenAI's largest non-reasoning model — the explicit bet that …

tool · Symphony

Open-sourced March 4, 2026 under Apache 2.0 at github.com/openai/symphony, Symphony is a Codex App S …

2025 · Q3 (9)
model · Mistral Medium 3.1

Mistral ships Medium 3.1 on August 12 — a multimodal proprietary model with a custom-trained vision …

model · GPT-5

OpenAI ships GPT-5 on August 7 — not as a single new model but as a unified system: a fast tier for …

signal · NVIDIA hits $4T

NVIDIA closes at a $4 trillion market capitalization on July 10 — the first publicly traded company …

model · Grok 4

xAI ships Grok 4 and Grok 4 Heavy on July 9, unveiled via a livestream that drew 1.5 million concurr …

model · Gemini 2.5 Flash GA

Google moves Gemini 2.5 Flash to general availability on June 17, completing the transition from exp …

model · Claude Sonnet 4.5

Anthropic's most capable agentic model at time of release. …

model · Gemini 2.5 Pro GA

Google's Gemini 2.5 Pro moves from experimental to general availability, bringing its top-ranked lon …

2025 · Q2 (5)
tool · Claude Code GA

Launched as a limited research preview in February 2025, Claude Code went generally available on May …

tool · Codex CLI

OpenAI open-sources Codex CLI under Apache 2.0 in April 2025: a terminal-native coding agent that acc …

moment · Tobi on AI at Shopify

Lütke's internal memo goes wide: AI usage is now a baseline expectation at Shopify, not a differenti …

model · Gemini 2.5 Pro Experimental

Released March 2025 as an experimental preview — meaning rate-limited, no SLA, and subject to change …

2025 · Q1 (5)
model · DeepSeek R1

DeepSeek releases the first open-source reasoning model trained via pure reinforcement learning to m …

moment · Vibe coding

Karpathy coins the term in a tweet: describe a project in natural language, accept AI-generated code …

model · Claude 3.7 Sonnet

Anthropic's first hybrid reasoning model: standard fast responses for most tasks, extended thinking …

model · Grok 3

xAI ships Grok 3, trained on 10x the compute of Grok 2. …

tool · OpenAI Agents SDK

OpenAI open-sources a multi-agent orchestration framework with first-class primitives for handoffs, …

2024 · Q4 (10)
tool · Computer use

Anthropic ships Claude 3.5 computer use in public beta: Claude can see your screen, move the cursor, …

model · Claude 3.5 Haiku

Anthropic's fastest model in the 3.5 family ships in November 2024 and immediately reframes what 'sm …

tool · GPT-4o Realtime API

OpenAI opens the Realtime API to public beta in October 2024, giving developers a persistent WebSock …

moment · Mac Mini M4

Apple's redesigned Mac Mini ships with M4 and M4 Pro chips at $600 and $1,400. …

tool · Model Context Protocol

Anthropic open-sources MCP — a universal connector standard for AI agents and external systems. …

model · Qwen 2.5 Coder

Alibaba releases Qwen 2.5-Coder in 1.5B and 7B (Sep), then 0.5B, 3B, 14B, and 32B (Nov) variants. …

model · OpenAI o1

OpenAI ships the full o1 on December 5, 2024 — not a preview, a production model — and the gap over …

model · Llama 3.3 70B

Meta closes out 2024 on December 6 with Llama 3.3 70B Instruct — and the headline is efficiency, not …

model · Gemini 2.0 Flash

Google announces Gemini 2.0 Flash: native image and audio output, a Multimodal Live API for real-tim …

model · DeepSeek V3

A 671B MoE model trained for $5.58M that matches Claude 3.5 Sonnet and GPT-4o on most benchmarks and run …

2024 · Q3 (7)
model · Llama 3.1 405B

Meta ships Llama 3.1 with a 405B-parameter flagship, 128K context, and support for eight languages …

model · GPT-4o mini

OpenAI releases GPT-4o mini on July 18, 2024 at $0.15/$0.60 per million input/output tokens — more t …

model · Mistral Large 2

Mistral releases Large 2 on July 24, 2024, with 123B parameters, 128K context, and the non-commercial Mistral Research License: their most …

model · OpenAI o1 preview

OpenAI ships o1-preview and o1-mini: models that 'think before they answer' via long internal chain- …

model · Qwen 2.5

Alibaba releases Qwen 2.5 across dense and MoE variants, describing it as 'perhaps the largest open- …

tool · NotebookLM Audio Overviews

Google ships Audio Overviews in NotebookLM — converts any uploaded document into a two-host podcast. …

model · Llama 3.2

Meta releases Llama 3.2 in four sizes: 1B and 3B text-only models optimized for edge and mobile, plu …

2024 · Q2 (5)
model · GPT-4o

OpenAI ships GPT-4 Omni: natively multimodal across text, audio, image, and video — no separate enco …

model · Claude 3.5 Sonnet

Anthropic releases Claude 3.5 Sonnet on June 20, 2024 — at 80% lower cost than Claude 3 Opus, while …

model · Llama 3

Meta releases Llama 3 in 8B and 70B sizes, trained on 15 trillion tokens — 7x more data than Llama 2 …

tool · Apple Intelligence

Apple announces Apple Intelligence at WWDC 2024: on-device models across iPhone, iPad, and Mac — wri …

model · Gemini 1.5 Pro GA

Google's Gemini 1.5 Pro becomes generally available with a 1-million-token context window. …

2024 · Q1 (4)
model · Claude 3 family

Anthropic launches Haiku, Sonnet, and Opus — the first model family to give developers meaningful sp …

model · Gemini 1.5 Pro announced

Google announces Gemini 1.5 Pro with a breakthrough 1-million-token context window using a sparse Mo …

model · Sora

OpenAI announces Sora: a text-to-video diffusion model that generates 60-second photorealistic clips …

model · Grok-1 open source

xAI open-sources Grok-1 — the full 314B MoE base model — under Apache 2.0. …

2023 · Q4 (4)
moment · OpenAI DevDay

OpenAI hosts its first developer conference: GPT-4 Turbo with 128K context, GPTs (custom instruction …

model · Mixtral 8x7B

Mistral AI drops Mixtral 8x7B without announcement — a torrent link on X with no blog post. …

model · Gemini 1.0

Google announces Gemini 1.0 — its multimodal response to GPT-4. …

model · Grok (xAI)

Elon Musk's xAI launches Grok to X Premium+ subscribers — a 314B MoE model trained from scratch with …

2023 · Q3 (4)
model · Llama 2

Meta and Microsoft release Llama 2 with 7B, 13B, and 70B variants, trained on 2 trillion tokens — do …

model · Code Llama

Meta releases Code Llama (7B, 13B, 34B) — Llama 2 fine-tuned on 500B tokens of code with fill-in-the …

model · Mistral 7B

Mistral AI, founded in April by ex-Meta and Google researchers, releases its first model — Mistral 7 …

model · Falcon 180B

The Technology Innovation Institute in Abu Dhabi releases Falcon 180B — a 180B parameter model train …

2023 · Q2 (3)
moment · Auto-GPT goes viral

Auto-GPT — an open-source experiment that chains GPT-4 calls into a goal-directed autonomous agent — …

tool · LangChain hits 1.0

LangChain consolidates its position as the default framework for building LLM applications — chains, …

tool · GitHub Copilot Chat GA

GitHub launches Copilot Chat in general availability — a conversational interface for code in VS Cod …

2023 · Q1 (4)
moment · ChatGPT hits 100M users

ChatGPT reaches 100 million monthly active users two months after launch — the fastest consumer appl …

tool · Bing AI

Microsoft ships Bing Chat (later Bing AI) — a GPT-4-powered search engine before GPT-4 was publicly …

model · GPT-4

OpenAI releases GPT-4: the first multimodal frontier model, passing the bar exam in the 90th percent …

model · Claude 1.0

Anthropic releases its first production model. …

Signals (8)
signal · Context is the new API

The meaningful unit of AI integration has shifted from the API call to the context window. …

signal · Inference cost inversion

For most tasks, the bottleneck has inverted: generating tokens is cheap, sampling enough to find a g …

signal · The eval collapse

The major benchmarks that defined frontier capability for 2022–2024 are saturated or compromised. …

signal · Agent infrastructure, not agent models

The bottleneck in deploying AI agents is not model capability — it's context management, tool reliab …

signal · The open-weight treadmill

The gap between open-weight and frontier closed models closed in 2023–2024, then widened again in 20 …

signal · The Bitter Lesson

Richard Sutton's March 2019 essay distills 70 years of AI research into a single uncomfortable obser …

signal · Intelligence too cheap to meter

Sam Altman first used this framing in July 2024 when announcing GPT-4o mini pricing, echoing Lewis S …

Papers · Foundations (13)
paper · DeepSeek-R1

The DeepSeek team trains a frontier reasoning model using pure RL — no supervised fine-tuning on cha …