Yingru Li
Yingru Li
Home
Posts
Research
Contact
Resume
RL-Seminar
Light
Dark
Automatic
Research
Language as a Universal Interface for Reinforcement Learning Agents
This post establishes a formal mathematical framework for language agents, deriving fundamental challenges from first principles and providing concrete design guidelines with real-world examples from SWE-Bench.
Yingru LI
Nov 7, 2025
22 min read
Research
,
Theory
,
Engineering
Information Bandwidth in Reinforcement Learning
An information-theoretic analysis showing that scalar advantage formulations learn ≤ log₂(B) bits per episode, while per-timestep advantages preserve full reward entropy.
Yingru LI
Last updated on Nov 4, 2025
16 min read
Research
,
Theory