Reasoning Overthinking Solved: SAGE Cuts Tokens 44% While Improving Accuracy | February 2026 | Infrastructure
175% Faster Prefill with Better Accuracy: ConceptMoE's Adaptive Token Compression for MoE | January 2026 | Infrastructure
15-Hour Agent Runtimes Solved: Idea2Story Precomputes Research Knowledge Offline | January 2026 | AI Agents
Tsinghua Researchers Show Diffusion LLMs Reason Better When You Take Away Their Flexibility | January 2026 | Infrastructure
Agent Memory Fragmentation Solved: EverMemOS Achieves 93% on LoCoMo via Engram-Inspired Lifecycle | January 2026 | AI Agents
4 Percentage Points Better Accuracy With Rude Prompts: How Tone Affects GPT-4o Performance | January 2026 | LLMs
6% Better Math Reasoning in Fewer Tokens: Multiplex Thinking Merges Multiple Paths into One | January 2026 | Infrastructure