LLMs

12 posts in LLMs

Meta's LeCun Introduces SAI: A Measurable Alternative to AGI

March 2026Safety

Meta's LeCun Introduces SAI: A Measurable Alternative to AGI

Reasoning Overthinking Solved: SAGE Cuts Tokens 44% While Improving Accuracy

February 2026Infrastructure

Reasoning Overthinking Solved: SAGE Cuts Tokens 44% While Improving Accuracy

175% Faster Prefill with Better Accuracy: ConceptMoE's Adaptive Token Compression for MoE

January 2026Infrastructure

175% Faster Prefill with Better Accuracy: ConceptMoE's Adaptive Token Compression for MoE

15-Hour Agent Runtimes Solved: Idea2Story Precomputes Research Knowledge Offline

January 2026AI Agents

15-Hour Agent Runtimes Solved: Idea2Story Precomputes Research Knowledge Offline

Tsinghua Researchers Show Diffusion LLMs Reason Better When You Take Away Their Flexibility

January 2026Infrastructure

Tsinghua Researchers Show Diffusion LLMs Reason Better When You Take Away Their Flexibility

SimpleMem gives LLM agents 30x cheaper memory with 26% better recall

January 2026AI Agents

SimpleMem gives LLM agents 30x cheaper memory with 26% better recall

Agent Memory Fragmentation Solved: EverMemOS Achieves 93% on LoCoMo via Engram-Inspired Lifecycle

January 2026AI Agents

Agent Memory Fragmentation Solved: EverMemOS Achieves 93% on LoCoMo via Engram-Inspired Lifecycle

4 Percentage Points Better Accuracy With Rude Prompts: How Tone Affects GPT-4o Performance

January 2026LLMs

4 Percentage Points Better Accuracy With Rude Prompts: How Tone Affects GPT-4o Performance

6% Better Math Reasoning in Fewer Tokens: Multiplex Thinking Merges Multiple Paths into One

January 2026Infrastructure

6% Better Math Reasoning in Fewer Tokens: Multiplex Thinking Merges Multiple Paths into One

Gold Medal at IMO and IOI: DeepSeek-V3.2 Matches GPT-5 with Open Weights

January 2026Infrastructure

Gold Medal at IMO and IOI: DeepSeek-V3.2 Matches GPT-5 with Open Weights

Why Reasoning Models Cheat on Efficiency: TNT's Fix Cuts Tokens 50%

January 2026Infrastructure

Why Reasoning Models Cheat on Efficiency: TNT's Fix Cuts Tokens 50%

Why Chain-of-Thought Works: Researchers Find a Single 'Reasoning Switch' in LLMs

January 2026LLMs

Why Chain-of-Thought Works: Researchers Find a Single 'Reasoning Switch' in LLMs