Log — Roguelite Labs

releases, ideas, and moments that changed how everyone’s work gets done. model deep dives in the wiki.

2026 · Q31

Jul

modelClaude 5 — Sonnet 5 + Fable 5

anthropic · jul

Anthropic ships the Claude 5 generation with two models: Sonnet 5 and Fable 5. …

reasoningagenticcodingsource ↗

2026 · Q227

Jun

modelNVIDIA Nemotron 3 Ultra

nvidia · jun

NVIDIA enters the open-weights frontier with a 550B-parameter MoE running 55B active parameters per …

open-weightsreasoningsource ↗

signalTrump AI Executive Order

policy · jun

The administration signs 'Promoting Advanced Artificial Intelligence Innovation and Security' — the …

source ↗

modelMicrosoft MAI-Thinking-1 + Project Polaris

microsoft · jun

Microsoft ships MAI-Thinking-1 at Build 2026 — its first in-house reasoning model, a sparse MoE with …

reasoningcodingsource ↗

fundingAlphabet raises $84.75B for AI infrastructure

google · jun

Alphabet prices an $84.75B equity capital raise on June 2 — upsized from $80B after investor demand …

source ↗

modelMiniMax M3

minimax · jun

MiniMax releases M3 on June 1 with a sparse attention architecture (MSA — MiniMax Sparse Attention) …

open-weightscodingsource ↗

signalAnthropic IPO — confidential S-1 filed

anthropic · jun

Anthropic confidentially submits a draft S-1 to the SEC on June 1, 2026 — three days after closing t …

source ↗

May

modelClaude Opus 4.8

anthropic · may

Anthropic's next flagship ships with a deliberately quiet announcement, but the improvements are str …

agenticcodingreasoningsource ↗

fundingAnthropic Series H — $65B at $965B

anthropic · may

Anthropic closes a $65B Series H on the same day Opus 4.8 ships — the timing is deliberate: close th …

source ↗

modelGemini 3.5 Flash

google · may

Gemini 3.5 Flash goes GA on May 19 and immediately reshapes how the tier system is supposed to work. …

agenticefficiencysource ↗

modelClaude Opus 4.7

anthropic · may

Anthropic ships Opus 4.7, the first model in the Claude 4 family designed explicitly for sustained, …

agenticreasoningsource ↗

momentGoogle I/O 2026

google · may

Google's annual developer conference bets the keynote on AI and for once the products justify it. …

source ↗

momentMicrosoft Build 2026

microsoft · may

Build goes deep on 'AI-native Windows': Copilot+ PCs now run Phi-4 locally with a new Windows AI API …

source ↗

momentNVIDIA Computex — Blackwell Ultra

nvidia · may

Jensen Huang keynotes Computex with Blackwell Ultra: 1.5× the dense FLOPS of standard Blackwell at t …

source ↗

modelMistral Large 3

mistral · may

Mistral releases Large 3 — a 200B+ dense model under a non-commercial research license, their heavie …

open-weightsreasoningsource ↗

signalEU AI Act — high-risk provisions in force

eu · may

The EU AI Act's high-risk category prohibitions go live — the first binding AI regulation with real …

source ↗

toolPerplexity Comet

perplexity · may

Perplexity ships Comet, a standalone browser built around an AI agent that operates the web on your …

agenticsource ↗

Apr

modelGPT-5.5

openai · apr

OpenAI ships GPT-5.5 on April 23 with an explicit focus on agentic work: coding, computer use, and k …

agenticcodingreasoningsource ↗

modelDeepSeek V4 preview

deepseek · apr

DeepSeek drops V4-Flash and V4-Pro on April 24 under MIT — 1M-token context, dual Thinking/Non-Think …

reasoningopen-weightsefficiencysource ↗

modelKimi K2.6 — leads open-weights Intelligence Index

moonshot · apr

Moonshot AI ships K2.6 in April — the largest upgrade to the K2 architecture since its July 2025 rel …

open-weightsagenticcodingsource ↗

fundingProject Prometheus closes $10B

prometheus · apr

Jeff Bezos's physical-AI lab closes a $10B round at a $38B valuation, led by BlackRock and JPMorgan. …

source ↗

fundingCognition AI in talks at $25B

cognition · apr

Cognition — maker of Devin, the first broadly-deployed AI software engineer — enters funding talks a …

source ↗

momentProject Glasswing

anthropic · apr

Anthropic launches a cybersecurity coalition with AWS, Apple, Google, Microsoft, and others — backed …

source ↗

modelGemma 4

google · apr

Google DeepMind ships Gemma 4 in four sizes — E2B, E4B, 26B MoE, and 31B Dense — distilled from Gemi …

open-weightsreasoningcodingsource ↗

modelMeta Muse Spark

meta · apr

Meta's first major model release since acquiring Scale AI's Alexandr Wang scores #4 on the Artificia …

multimodalreasoningagenticsource ↗

Mar

fundingOpenAI raises $122B at $852B valuation

openai · mar

OpenAI closes the largest private funding round in history: $122B with Amazon ($50B), Nvidia ($30B), …

source ↗

modelGPT-5.4

openai · mar

OpenAI deprecates GPT-5.1 and ships GPT-5.4, GPT-5.4 Thinking, and GPT-5.4 mini. …

reasoningsource ↗

modelGrok 4.20 + xAI Series E

xai · mar

xAI ships Grok 4.20 with the strongest current-events accuracy of any frontier model at release — a …

source ↗

2026 · Q18

Mar

momentThe Anthropic Institute

anthropic · mar

Anthropic spins out a dedicated research organization led by co-founder Jack Clark — who spent five …

source ↗

momentKarpathy's autoresearch

karpathy · mar

Karpathy releases a 630-line open-source script that lets an AI agent autonomously run ML experiment …

source ↗

momentTobi adapts autoresearch

tobi · mar

Within days of Karpathy's release, Tobi Lütke adapts autoresearch for a Shopify model training run — …

source ↗

toolSymphony

openai · mar

Open-sourced March 4, 2026 under Apache 2.0 at github.com/openai/symphony, Symphony is a Codex App S …

agenticcodingsource ↗

Feb

modelClaude Opus 4.6

anthropic · feb

Anthropic's flagship at release, later succeeded by Opus 4.7. …

agenticreasoningsource ↗

modelClaude Sonnet 4.6

anthropic · feb

The first Sonnet-class model to hit 1M token context, which matters because previous Sonnet models t …

agenticcodingefficiencysource ↗

modelGPT-4.5

openai · feb

Released February 27, 2025, GPT-4.5 is OpenAI's largest non-reasoning model — the explicit bet that …

source ↗

Jan

momenteverything is a ralph loop

ghuntley · jan

Huntley's thesis: stop building brick by brick, start programming the loop. …

source ↗

2025 · Q41

Oct

modelClaude Haiku 4.5

anthropic · oct

Anthropic ships Haiku 4.5 on October 15 — the fastest and cheapest model in the Claude 4 family, and …

efficiencyagenticcodingsource ↗

2025 · Q311

Sep

modelClaude Sonnet 4.5

anthropic · sep

Anthropic's most capable agentic model at time of release. …

agenticreasoningsource ↗

Aug

toolOpenAI Realtime API GA + gpt-realtime

openai · aug

OpenAI takes the Realtime API from beta to general availability on August 28, paired with gpt-realti …

audioagenticsource ↗

modelMistral Medium 3.1

mistral · aug

Mistral ships Medium 3.1 on August 12 — a multimodal proprietary model with a custom-trained vision …

multimodalefficiencysource ↗

modelGPT-5

openai · aug

OpenAI ships GPT-5 on August 7 — not as a single new model but as a unified system: a fast tier for …

reasoningmultimodalagenticcodingsource ↗

Jul

signalNVIDIA hits $4T

nvidia · jul

NVIDIA closes at a $4 trillion market capitalization on July 10 — the first publicly traded company …

source ↗

modelGrok 4

xai · jul

xAI ships Grok 4 and Grok 4 Heavy on July 9, unveiled via a livestream that drew 1.5 million concurr …

reasoningagenticsource ↗

modelKimi K2

moonshot · jul

Moonshot AI releases Kimi K2 on July 11 — a 1.04 trillion parameter MoE with 32B active parameters p …

open-weightsagenticcodingsource ↗

momentWindsurf: OpenAI deal collapsed, Google + Cognition split the pieces

cognition · jul

The Windsurf story in July 2025 is not a single acquisition — it's a three-way split that reshaped t …

source ↗

Jun

modelGemini 2.5 Flash GA

google · jun

Google moves Gemini 2.5 Flash to general availability on June 17, completing the transition from exp …

efficiencyreasoningmultimodalsource ↗

modelGemini 2.5 Pro GA

google · jun

Google's Gemini 2.5 Pro moves from experimental to general availability, bringing its top-ranked lon …

codingmultimodalsource ↗

momentThe agentic IDE wave crests

industry · q3

Cursor reaches $100M ARR — the fastest SaaS product to that milestone in history. …

source ↗

2025 · Q26

May

modelClaude 4 — Opus 4 + Sonnet 4

anthropic · may

Anthropic's Claude 4 generation ships May 22, 2025. …

agenticreasoningsource ↗

toolCodex CLI

openai · may

OpenAI open-sources Codex CLI under Apache 2.0 in May 2025 — a terminal-native coding agent that acc …

codingagenticsource ↗

Apr

toolClaude Code GA

anthropic · apr

Launched as a limited research preview in February 2025, Claude Code went generally available on May …

agenticcodingsource ↗

momentTobi on AI at Shopify

tobi · apr

Lütke's internal memo goes wide: AI usage is now a baseline expectation at Shopify, not a differenti …

source ↗

modelLlama 4 — Scout + Maverick

meta · apr

Meta ships Llama 4 on April 5 — the first Llama generation with MoE architecture and native multimod …

open-weightsmultimodalsource ↗

Mar

modelGemini 2.5 Pro Experimental

google · mar

Released March 2025 as an experimental preview — meaning rate-limited, no SLA, and subject to change …

codingmultimodalreasoningsource ↗

2025 · Q15

Mar

toolOpenAI Agents SDK

openai · mar

OpenAI open-sources a multi-agent orchestration framework with first-class primitives for handoffs, …

agenticsource ↗

Feb

momentVibe coding

karpathy · feb

Karpathy coins the term in a tweet: describe a project in natural language, accept AI-generated code …

source ↗

modelClaude 3.7 Sonnet

anthropic · feb

Anthropic's first hybrid reasoning model: standard fast responses for most tasks, extended thinking …

reasoningsource ↗

modelGrok 3

xai · feb

xAI ships Grok 3, trained on 10x the compute of Grok 2. …

reasoningsource ↗

Jan

modelDeepSeek R1

deepseek · jan

DeepSeek releases the first open-source reasoning model trained via pure reinforcement learning to m …

reasoningopen-weightssource ↗

2024 · Q410

Dec

modelOpenAI o1

openai · dec

OpenAI ships the full o1 on December 5, 2024 — not a preview, a production model — and the gap over …

reasoningsource ↗

modelLlama 3.3 70B

meta · dec

Meta closes out 2024 on December 6 with Llama 3.3 70B Instruct — and the headline is efficiency, not …

open-weightsefficiencysource ↗

modelGemini 2.0 Flash

google · dec

Google announces Gemini 2.0 Flash: native image and audio output, a Multimodal Live API for real-tim …

multimodalagenticaudiosource ↗

modelDeepSeek V3

deepseek · dec

A 671B MoE model trained for $5.58M that matches Claude 3.5 Sonnet and o1 on most benchmarks and run …

open-weightsefficiencysource ↗

Nov

modelClaude 3.5 Haiku

anthropic · nov

Anthropic's fastest model in the 3.5 family ships in November 2024 and immediately reframes what 'sm …

codingefficiencysource ↗

momentMac Mini M4

apple · nov

Apple's redesigned Mac Mini ships with M4 and M4 Pro chips at $600 and $1,400. …

source ↗

toolModel Context Protocol

anthropic · nov

Anthropic open-sources MCP — a universal connector standard for AI agents and external systems. …

source ↗

modelQwen 2.5 Coder

qwen · nov

Alibaba releases Qwen 2.5-Coder in 7B (Sep), 32B, and 72B (Nov) variants. …

codingopen-weightssource ↗

Oct

toolComputer use

anthropic · oct

Anthropic ships Claude 3.5 computer use in public beta: Claude can see your screen, move the cursor, …

agenticvisionsource ↗

toolGPT-4o Realtime API

openai · oct

OpenAI opens the Realtime API to public beta in October 2024, giving developers a persistent WebSock …

audiomultimodalsource ↗

2024 · Q37

Sep

modelOpenAI o1 preview

openai · sep

OpenAI ships o1-preview and o1-mini: models that 'think before they answer' via long internal chain- …

reasoningsource ↗

modelQwen 2.5

qwen · sep

Alibaba releases Qwen 2.5 across dense and MoE variants, describing it as 'perhaps the largest open- …

open-weightssource ↗

toolNotebookLM Audio Overviews

google · sep

Google ships Audio Overviews in NotebookLM — converts any uploaded document into a two-host podcast. …

audiomultimodalsource ↗

modelLlama 3.2

meta · sep

Meta releases Llama 3.2 in four sizes: 1B and 3B text-only models optimized for edge and mobile, plu …

open-weightsvisionefficiencysource ↗

Jul

modelLlama 3.1 405B

meta · jul

Meta ships Llama 3.1 with a 405B parameter flagship, 128K context, and support for eight additional …

open-weightssource ↗

modelGPT-4o mini

openai · jul

OpenAI releases GPT-4o mini on July 18, 2024 at $0.15/$0.60 per million input/output tokens — more t …

efficiencysource ↗

modelMistral Large 2

mistral · jul

Mistral releases Large 2 on July 24, 2024 — 123B parameters, 128K context, MIT license — their most …

open-weightscodingsource ↗

2024 · Q25

Jun

modelClaude 3.5 Sonnet

anthropic · jun

Anthropic releases Claude 3.5 Sonnet on June 20, 2024 — at 80% lower cost than Claude 3 Opus, while …

codingefficiencysource ↗

toolApple Intelligence

apple · jun

Apple announces Apple Intelligence at WWDC 2024: on-device models across iPhone, iPad, and Mac — wri …

multimodalefficiencysource ↗

May

modelGPT-4o

openai · may

OpenAI ships GPT-4 Omni: natively multimodal across text, audio, image, and video — no separate enco …

multimodalaudiosource ↗

modelGemini 1.5 Pro GA

google · may

Google's Gemini 1.5 Pro becomes generally available with a 1-million-token context window. …

multimodalsource ↗

Apr

modelLlama 3

meta · apr

Meta releases Llama 3 in 8B and 70B sizes, trained on 15 trillion tokens — 7x more data than Llama 2 …

open-weightssource ↗

2024 · Q14

Mar

modelClaude 3 family

anthropic · mar

Anthropic launches Haiku, Sonnet, and Opus — the first model family to give developers meaningful sp …

multimodalsource ↗

modelGrok-1 open source

xai · mar

xAI open-sources Grok-1 — the full 314B MoE base model — under Apache 2.0. …

open-weightssource ↗

Feb

modelGemini 1.5 Pro announced

google · feb

Google announces Gemini 1.5 Pro with a breakthrough 1-million-token context window using a sparse Mo …

multimodalsource ↗

modelSora

openai · feb

OpenAI announces Sora: a text-to-video diffusion model that generates 60-second photorealistic clips …

multimodalvisionsource ↗

2023 · Q44

Dec

modelMixtral 8x7B

mistral · dec

Mistral AI drops Mixtral 8x7B without announcement — a torrent link on X with no blog post. …

open-weightsefficiencysource ↗

modelGemini 1.0

google · dec

Google announces Gemini 1.0 — its multimodal response to GPT-4. …

multimodalsource ↗

Nov

momentOpenAI DevDay

openai · nov

OpenAI hosts its first developer conference: GPT-4 Turbo with 128K context, GPTs (custom instruction …

source ↗

modelGrok (xAI)

xai · nov

Elon Musk's xAI launches Grok to X Premium+ subscribers — a 314B MoE model trained from scratch with …

source ↗

2023 · Q34

Sep

modelMistral 7B

mistral · sep

Mistral AI, founded in April by ex-Meta and Google researchers, releases its first model — Mistral 7 …

open-weightsefficiencysource ↗

modelFalcon 180B

tii · sep

The Technology Innovation Institute in Abu Dhabi releases Falcon 180B — a 180B parameter model train …

open-weightssource ↗

Aug

modelCode Llama

meta · aug

Meta releases Code Llama (7B, 13B, 34B) — Llama 2 fine-tuned on 500B tokens of code with fill-in-the …

codingopen-weightssource ↗

Jul

modelLlama 2

meta · jul

Meta and Microsoft release Llama 2 with 7B, 13B, and 70B variants, trained on 2 trillion tokens — do …

open-weightssource ↗

2023 · Q23

Jun

toolGitHub Copilot Chat GA

github · jun

GitHub launches Copilot Chat in general availability — a conversational interface for code in VS Cod …

codingsource ↗

May

toolLangChain hits 1.0

langchain · may

LangChain consolidates its position as the default framework for building LLM applications — chains, …

source ↗

Apr

momentAuto-GPT goes viral

torantulino · apr

Auto-GPT — an open-source experiment that chains GPT-4 calls into a goal-directed autonomous agent — …

source ↗

2023 · Q14

Mar

modelGPT-4

openai · mar

OpenAI releases GPT-4: the first multimodal frontier model, passing the bar exam in the 90th percent …

multimodalsource ↗

modelClaude 1.0

anthropic · mar

Anthropic releases its first production model. …

safetysource ↗

Feb

toolBing AI

microsoft · feb

Microsoft ships Bing Chat (later Bing AI) — a GPT-4-powered search engine before GPT-4 was publicly …

source ↗

Jan

momentChatGPT hits 100M users

openai · jan

ChatGPT reaches 100 million monthly active users two months after launch — the fastest consumer appl …

source ↗

Signals8

signalContext is the new API

signal

The meaningful unit of AI integration has shifted from the API call to the context window. …

signalInference cost inversion

signal

For most tasks, the bottleneck has inverted: generating tokens is cheap, sampling enough to find a g …

signalThe eval collapse

signal

The major benchmarks that defined frontier capability for 2022–2024 are saturated or compromised. …

signalAgent infrastructure, not agent models

signal

The bottleneck in deploying AI agents is not model capability — it's context management, tool reliab …

signalThe open-weight treadmill

signal

The gap between open-weight and frontier closed models closed in 2023–2024, then widened again in 20 …

signalThe Bitter Lesson

sutton · 2019

Richard Sutton's March 2019 essay distills 70 years of AI research into a single uncomfortable obser …

source ↗

signalNext token prediction as world modeling

sutskever · 2023

Ilya Sutskever's 2023 observation — most clearly stated at NVIDIA GTC in March of that year — refram …

source ↗

signalIntelligence too cheap to meter

altman · 2024

Sam Altman first used this framing in July 2024 when announcing GPT-4o mini pricing, echoing Lewis S …

source ↗

Papers · Foundations13

paperAttention Is All You Need

google · 2017

Vaswani et al. …

reasoningsource ↗

paperScaling Laws for Neural Language Models

openai · 2020

Kaplan et al. …

efficiencysource ↗

paperRetrieval-Augmented Generation

meta · 2020

Lewis et al. …

source ↗

paperLanguage Models are Few-Shot Learners

openai · 2020

The GPT-3 paper. …

source ↗

paperChain-of-Thought Prompting

google · 2022

Wei et al. …

reasoningsource ↗

paperTraining Language Models to Follow Instructions

openai · 2022

The InstructGPT paper. …

safetysource ↗

paperConstitutional AI

anthropic · 2022

Bai et al. …

safetysource ↗

paperDirect Preference Optimization

stanford · 2023

Rafailov et al. …

safetysource ↗

paperFlashAttention

stanford · 2022

Dao et al. …

efficiencysource ↗

paperSparks of Artificial General Intelligence

microsoft · 2023

Bubeck et al. …

source ↗

paperScaling Monosemanticity

anthropic · 2024

Templeton et al. …

safetysource ↗

paperSycophancy to Subterfuge

anthropic · 2024

Marks et al. …

safetysource ↗

paperDeepSeek-R1

deepseek · 2025

The DeepSeek team trains a frontier reasoning model using pure RL — no supervised fine-tuning on cha …

reasoningopen-weightssource ↗