Yingru Li
Uncertainty-guided Search for Multi-step Reasoning in LLMs
Fei Yu*, Yingru Li* (equal), Benyou Wang, Zhi-Quan Luo
Scalable Exploration via Ensemble++
Yingru Li, Jiawei Xu, Baoxiang Wang, Zhi-Quan Luo
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Addresses data and computation efficiency challenges in real-world deployments of RL agents. HyperAgent achieves significant efficiency gains on deep RL benchmarks and establishes theoretical milestones.
Yingru Li, Jiawei Xu, Lei Han, Zhiquan Luo
Multi-turn Actor-critic Language Agents for Hospital Outpatient Referral
Yingru Li, Xiaoxiao Liu, Benyou Wang, Zhi-Quan Luo
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games
Game-theoretic decision-making in multi-agent systems. I developed optimistic Thompson sampling (TS) type algorithms that significantly reduce experimental costs in applications such as traffic management and radar communications.
Yingru Li, Liangqi Liu, Wenqiang Pu, Hao Liang, Zhi-Quan Luo
Probability Tools for Sequential Random Projection
The first probabilistic framework for sequential random projection, an approach rooted in the challenges of sequential decision-making under uncertainty. It provides a non-trivial martingale extension of the Johnson-Lindenstrauss (JL) lemma to sequentially adaptive data processes.
Yingru Li
Divergence-Augmented Policy Optimization
Stabilizes policy optimization when off-policy data are reused, addressing the data efficiency issue in RL for real-world problems.
Qing Wang*, Yingru Li* (equal), Jiechao Xiong, Tong Zhang