Mirage of Synthesis: DREAM's Agentic Framework Catches What Static Benchmarks Miss | February 2026 | Safety
When Should AI Agents Ask for Help? CMU's CowCorpus Maps Four Human Collaboration Styles | February 2026 | AI Agents
Even GPT-5 Fails at Discovery: OdysseyArena Exposes the Inductive Bottleneck in LLM Agents | February 2026 | AI Agents
Prompt Fatigue Solved: Vibe AIGC Turns Users Into 'Commanders' of Multi-Agent Creative Workflows | February 2026 | AI Agents
Google Introduces Agentic Vision: Gemini 3 Flash Now Zooms, Annotates, and Investigates Images | February 2026 | AI Agents
15-Hour Agent Runtimes Solved: Idea2Story Precomputes Research Knowledge Offline | January 2026 | AI Agents
56.7% on OSWorld: EvoCUA's Evolutionary Training Beats Closed-Source Computer Use Agents | January 2026 | AI Agents
Microsoft's Agent Lightning Decouples RL Training from Agent Logic, Enabling Fine-Tuning of Any AI Agent with Zero Code Changes | January 2026 | Infrastructure
Agent Memory Fragmentation Solved: EverMemOS Achieves 93% on LoCoMo via Engram-Inspired Lifecycle | January 2026 | AI Agents
Agent Memory Loss Solved: InfiAgent's File-Centric Architecture Enables Unlimited Runtime | January 2026 | AI Agents
DeepResearchEval: Benchmark Shows Gemini Leads Quality, Manus Wins Factual Accuracy | January 2026 | Safety
Zero Training Data, Full Performance: Dr. Zero Matches Supervised Search Agents | January 2026 | Infrastructure