Research

Discover latest breakthroughs in Tech

Meta's LeCun Introduces SAI: A Measurable Alternative to AGI

March 2026Safety

Meta's LeCun Introduces SAI: A Measurable Alternative to AGI

8x Terminal Performance Gains: NVIDIA's Data Recipe Lets 32B Beat 480B

February 2026Infrastructure

8x Terminal Performance Gains: NVIDIA's Data Recipe Lets 32B Beat 480B

Mirage of Synthesis: DREAM's Agentic Framework Catches What Static Benchmarks Miss

February 2026Safety

Mirage of Synthesis: DREAM's Agentic Framework Catches What Static Benchmarks Miss

When Should AI Agents Ask for Help? CMU's CowCorpus Maps Four Human Collaboration Styles

February 2026AI Agents

When Should AI Agents Ask for Help? CMU's CowCorpus Maps Four Human Collaboration Styles

80.3 on ScreenSpotPro: GUI-Owl-1.5 Sets New Bar for Open-Source GUI Agents

February 2026AI Agents

80.3 on ScreenSpotPro: GUI-Owl-1.5 Sets New Bar for Open-Source GUI Agents

Unified Latents Hits 1.4 FID by Replacing Stable Diffusion's Ad Hoc VAE with a Diffusion Prior

February 2026Vision

Unified Latents Hits 1.4 FID by Replacing Stable Diffusion's Ad Hoc VAE with a Diffusion Prior

3.5x Faster Image Generation: DDiT Dynamically Resizes Patches in Diffusion Transformers

February 2026Vision

3.5x Faster Image Generation: DDiT Dynamically Resizes Patches in Diffusion Transformers

Reasoning Overthinking Solved: SAGE Cuts Tokens 44% While Improving Accuracy

February 2026Infrastructure

Reasoning Overthinking Solved: SAGE Cuts Tokens 44% While Improving Accuracy

Voice Search Breaks in Noise: SQuTR Benchmark Reveals the Real Bottleneck

February 2026Voice AI

Voice Search Breaks in Noise: SQuTR Benchmark Reveals the Real Bottleneck

Even GPT-5 Fails at Discovery: OdysseyArena Exposes the Inductive Bottleneck in LLM Agents

February 2026AI Agents

Even GPT-5 Fails at Discovery: OdysseyArena Exposes the Inductive Bottleneck in LLM Agents

Prompt Fatigue Solved: Vibe AIGC Turns Users Into 'Commanders' of Multi-Agent Creative Workflows

February 2026AI Agents

Prompt Fatigue Solved: Vibe AIGC Turns Users Into 'Commanders' of Multi-Agent Creative Workflows

Baidu Introduces ERNIE 5.0: Trillion-Parameter Unified Multimodal MoE Rivals GPT-5

February 2026Vision

Baidu Introduces ERNIE 5.0: Trillion-Parameter Unified Multimodal MoE Rivals GPT-5

Google Introduces Agentic Vision: Gemini 3 Flash Now Zooms, Annotates, and Investigates Images

February 2026AI Agents

Google Introduces Agentic Vision: Gemini 3 Flash Now Zooms, Annotates, and Investigates Images

First Holistic OCR Model: OCRVerse Unifies Document Parsing and Code Generation

January 2026Vision

First Holistic OCR Model: OCRVerse Unifies Document Parsing and Code Generation

175% Faster Prefill with Better Accuracy: ConceptMoE's Adaptive Token Compression for MoE

January 2026Infrastructure

175% Faster Prefill with Better Accuracy: ConceptMoE's Adaptive Token Compression for MoE

260% Better at Catching Moving Objects: DynamicVLA Solves Robot Latency Problem

January 2026AI Agents

260% Better at Catching Moving Objects: DynamicVLA Solves Robot Latency Problem

15-Hour Agent Runtimes Solved: Idea2Story Precomputes Research Knowledge Offline

January 2026AI Agents

15-Hour Agent Runtimes Solved: Idea2Story Precomputes Research Knowledge Offline

UPLiFT vs Cross-Attention Upsamplers: Linear Scaling Meets SOTA Quality

January 2026Vision

UPLiFT vs Cross-Attention Upsamplers: Linear Scaling Meets SOTA Quality

2x Faster VLA Inference with 70% Fewer Layers: Shallow-π Distillation for Edge Robotics

January 2026Infrastructure

2x Faster VLA Inference with 70% Fewer Layers: Shallow-π Distillation for Edge Robotics

6x Fewer Tokens, Better OCR: DeepSeek's Visual Causal Flow Beats GPT-4o and Gemini

January 2026Vision

6x Fewer Tokens, Better OCR: DeepSeek's Visual Causal Flow Beats GPT-4o and Gemini

11% Better Than Human: Chroma 1.0's Real-Time Voice Cloning for Spoken Dialogue

January 2026Voice AI

11% Better Than Human: Chroma 1.0's Real-Time Voice Cloning for Spoken Dialogue

90% Attention Sparsity with Zero Quality Loss: SALAD Speeds Up Video Diffusion 1.7x

January 2026Infrastructure

90% Attention Sparsity with Zero Quality Loss: SALAD Speeds Up Video Diffusion 1.7x

73% on BrowseComp: Meituan's 560B Open-Source Model Leads Agentic Benchmarks

January 2026AI Agents

73% on BrowseComp: Meituan's 560B Open-Source Model Leads Agentic Benchmarks

FP8 Rollout Instability Solved: Jet-RL Unifies Precision for Stable RL Training

January 2026Infrastructure

FP8 Rollout Instability Solved: Jet-RL Unifies Precision for Stable RL Training

56.7% on OSWorld: EvoCUA's Evolutionary Training Beats Closed-Source Computer Use Agents

January 2026AI Agents

56.7% on OSWorld: EvoCUA's Evolutionary Training Beats Closed-Source Computer Use Agents

97ms First-Packet Latency: Qwen3-TTS Beats ElevenLabs in Voice Cloning Across 10 Languages

January 2026Voice AI

97ms First-Packet Latency: Qwen3-TTS Beats ElevenLabs in Voice Cloning Across 10 Languages