Yingru Li
Bandit
Scalable Exploration via Ensemble++
Yingru Li, Jiawei Xu, Baoxiang Wang, Zhi-Quan Luo
Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation
We prove HyperAgent closes a theoretical gap in scalable exploration. Further, GPT-HyperAgent addresses risk and efficiency challenges in human-AI interplay for automated content moderation with human feedback.
Yingru Li, Jiawei Xu, Zhi-Quan Luo
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games
Game-theoretic decision-making in multi-agent systems. I developed an optimistic Thompson sampling (TS) type algorithm that significantly reduces experimental costs in applications such as traffic management and radar communications.
Yingru Li, Liangqi Liu, Wenqiang Pu, Hao Liang, Zhi-Quan Luo
No-Regret Learning in Unknown Game with Applications
Aug 23, 2022 2:00 PM
Yingru LI