selected

Proactive Agents for Multi-turn Hospital Outpatient Referral under Uncertainty

Yingru Li, Xuheng Shen, Xiaoxiao Liu, Gehan Hu, Benyou Wang

Uncertainty-Aware Search: Mitigating Test-Time Search Scaling Flaws in LLMs

Yingru Li, Fei Yu*, Benyou Wang

Scalable Exploration via Ensemble++

Yingru Li, Jiawei Xu, Baoxiang Wang, Zhi-Quan Luo

Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent

Addressing data and computation efficiency challenges in real-world deployments of RL Agents. It achieves significant efficiency gains in deep RL benchmarks as well as theoretical milestones.

Yingru Li, Jiawei Xu, Lei Han, Zhiquan Luo

Optimistic Thompson Sampling for No-Regret Learning in Unknown Games

Game-theoretic decision-making in multi-agent systems. I developed optimistic TS type algorithm that significantly reduce experimental costs in applications such as traffic management and radar communications.

Yingru Li, Liangqi Liu, Wenqiang Pu, Hao Liang, Zhi-Quan Luo

Probability Tools for Sequential Random Projection

First probabilistic framework for sequential random projection, an approach rooted in the challenges of sequential decision-making under uncertainty; A non-trivial martingale extension of Johnson-Lindenstrauss (JL) to sequentially adaptive data processes.

Yingru Li

Divergence-augmented policy optimization

Stabilizing policy optimization when off-policy data are reused, addressing the data efficiency issue in RL for real-world problems.

Qing Wang*, Yingru Li* (equal), Jiechao Xiong, Tong Zhang