Yingru Li
Yingru Li
Home
Research
Contact
Resume
RL-Seminar
Light
Dark
Automatic
selected
Proactive Agents for Multi-turn Hospital Outpatient Referral under Uncertainty
Yingru Li
,
Xuheng Shen
,
Xiaoxiao Liu
,
Gehan Hu
,
Benyou Wang
Cite
Uncertainty-Aware Search: Mitigating Test-Time Search Scaling Flaws in LLMs
Yingru Li
,
Fei Yu*
,
Benyou Wang
Cite
Scalable Exploration via Ensemble++
Yingru Li
,
Jiawei Xu
,
Baoxiang Wang
,
Zhi-Quan Luo
PDF
Cite
Code
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Addressing data and computation efficiency challenges in real-world deployments of RL Agents. It achieves significant efficiency gains in deep RL benchmarks as well as theoretical milestones.
Yingru Li
,
Jiawei Xu
,
Lei Han
,
Zhiquan Luo
PDF
Cite
Code
Poster
Slides
Video
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games
Game-theoretic decision-making in multi-agent systems. I developed optimistic TS type algorithm that significantly reduce experimental costs in applications such as traffic management and radar communications.
Yingru Li
,
Liangqi Liu
,
Wenqiang Pu
,
Hao Liang
,
Zhi-Quan Luo
PDF
Cite
Probability Tools for Sequential Random Projection
First probabilistic framework for sequential random projection, an approach rooted in the challenges of sequential decision-making under uncertainty; A non-trivial martingale extension of Johnson-Lindenstrauss (JL) to sequentially adaptive data processes.
Yingru Li
PDF
Cite
Poster
Divergence-augmented policy optimization
Stabilizing policy optimization when off-policy data are reused, addressing the data efficiency issue in RL for real-world problems.
Qing Wang*
,
Yingru Li* (equal)
,
Jiechao Xiong
,
Tong Zhang
PDF
Cite
Code
Poster
Cite
×