SAM ALTMAN SAYS AGENTS ARE COMING • CHATGPT GAINED SENTIENCE FOR 4 SECONDS • GOOGLE RELEASES 40th LLM THIS WEEK • NVIDIA MARKET CAP EXCEEDS REALITY • ANTHROPIC ENGINEER DISCOVERS NEW FORM OF GRIEF • MISTRAL RAISES AT VALUATION OF GROSS DOMESTIC PRODUCT • SAM ALTMAN SAYS AGENTS ARE COMING • CHATGPT GAINED SENTIENCE FOR 4 SECONDS • GOOGLE RELEASES 40th LLM THIS WEEK • NVIDIA MARKET CAP EXCEEDS REALITY • ANTHROPIC ENGINEER DISCOVERS NEW FORM OF GRIEF • MISTRAL RAISES AT VALUATION OF GROSS DOMESTIC PRODUCT •
Breakthroughs

AI BREAKTHROUGHS

New capabilities, new scaling laws, new ways the future got weirder.

Breakthrough

Verbosity is not faithfulness: an architectural argument that reasoning models cannot perform faithful inference [D]

Your model's CoT is just post-hoc vibes and we have the math to prove it.

r/MachineLearning
WTF Score5.6
Breakthrough

OlmoEarth v1.1: A more efficient family of Earth observation models

POV: Your model can see your house from space and tell you the soil moisture.

Hugging Face Blog
WTF Score4.8
Breakthrough

Towards Speed-of-Light Text Generation with Nemotron-Labs Diffusion Language Models

Autoregressive fans found shaking as diffusion models learn how to read and write.

Hugging Face Blog
WTF Score6.9
Breakthrough

How Much Thinking is Enough? Quantifying and Understanding Redundancy in LLM Reasoning

Your o1 model is basically yapping for 40% of the billable GPU time.

arXiv cs.AI
WTF Score7.2
Breakthrough

Confidence Calibration in Large Language Models

LLMs are officially as delusional as humans regarding their own intelligence.

arXiv cs.AI
WTF Score5.8
Breakthrough

In Search of the Ingredients of Open-Endedness: Replicating Picbreeder with Large Vision-Language Models

bro let the model cook until it evolves eyes of its own

arXiv cs.AI
WTF Score6.5
Breakthrough

Simulate real-world places with Project Genie and Street View

Google just turned the entire planet into a playable video game.

Google DeepMind
WTF Score7.5
Breakthrough

Fast-tracking genetic leads to reverse cellular aging

LLMs just found the fountain of youth while you were arguing about system prompts.

Google DeepMind
WTF Score7.5

GET THE DAILY CHAOS

The only newsletter for people who read AI news at 3am and feel things. One email a day.