Yingru Li
Yingru Li
Home
Research
Contact
Resume
RL-Seminar
Light
Dark
Automatic
Reinforcement Learning
Uncertainty-Aware Search: Mitigating Test-Time Search Scaling Flaws in LLMs
Yingru Li
,
Fei Yu*
,
Benyou Wang
Cite
Scalable Exploration via Ensemble++
Yingru Li
,
Jiawei Xu
,
Baoxiang Wang
,
Zhi-Quan Luo
PDF
Cite
Code
Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation
We prove HyperAgent closes a theoretical gap in scalable exploration. Further, GPT-HyperAgent addresses risk and efficiency challenges in human-Al interplay for automated content moderation with human feedback.
Yingru Li
,
Jiawei Xu
,
Zhi-Quan Luo
PDF
Cite
Code
Poster
Slides
Video
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Addressing data and computation efficiency challenges in real-world deployments of RL Agents. It achieves significant efficiency gains in deep RL benchmarks as well as theoretical milestones.
Yingru Li
,
Jiawei Xu
,
Lei Han
,
Zhiquan Luo
PDF
Cite
Code
Poster
Slides
Video
Prior-dependent analysis of posterior sampling reinforcement learning with function approximation
Has implications on how the integration of prior knowledge enhances the efficiency of RL agents without extensive online exploration.
Yingru Li
,
Zhi-Quan Luo
PDF
Cite
HyperAgent - A Simple, Efficient, Scalable and Provable RL Framework
Practically and provably efficient RL under resource constraints!
Mar 23, 2024 1:30 PM
Rice University
Yingru LI
Slides
Video
Follow
Optimistic Thompson Sampling for No-Regret Learning in Unknown Games
Game-theoretic decision-making in multi-agent systems. I developed optimistic TS type algorithm that significantly reduce experimental costs in applications such as traffic management and radar communications.
Yingru Li
,
Liangqi Liu
,
Wenqiang Pu
,
Hao Liang
,
Zhi-Quan Luo
PDF
Cite
HyperAgent - A Simple, Efficient and Scalable RL Framework for Complex Environments
Practically and provably efficient RL under resource constraints!
Jan 13, 2024 1:20 PM
Daoyuan Building
Yingru LI
Slides
Follow
News
Towards AGI for Humanity through Efficient Reinforcement Learning
Addressing efficiency chanllenge in RL by HyperFQI algorithm
Oct 21, 2023 2:30 PM
Teaching B Building
Yingru LI
Slides
Follow
HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning
TL;DR
: We design a practical randomized exploration method to address the sample efficiency issue in online reinforcement learning.
Ziniu Li
,
Yingru Li* (corresponding)
,
Yushun Zhang
,
Tong Zhang
,
Zhi-Quan Luo
PDF
Cite
Code
Video
»
Cite
×