Yingru Li
Yingru Li
Home
Research
Contact
Resume
RL-Seminar
Light
Dark
Automatic
probability
Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation
We prove HyperAgent closes a theoretical gap in scalable exploration. Further, GPT-HyperAgent addresses risk and efficiency challenges in human-Al interplay for automated content moderation with human feedback.
Yingru Li
,
Jiawei Xu
,
Zhi-Quan Luo
PDF
Cite
Code
Poster
Slides
Video
Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent
Addressing data and computation efficiency challenges in real-world deployments of RL Agents. It achieves significant efficiency gains in deep RL benchmarks as well as theoretical milestones.
Yingru Li
,
Jiawei Xu
,
Lei Han
,
Zhiquan Luo
PDF
Cite
Code
Poster
Slides
Video
Prior-dependent analysis of posterior sampling reinforcement learning with function approximation
Has implications on how the integration of prior knowledge enhances the efficiency of RL agents without extensive online exploration.
Yingru Li
,
Zhi-Quan Luo
PDF
Cite
Probability Tools for Sequential Random Projection
First probabilistic framework for sequential random projection, an approach rooted in the challenges of sequential decision-making under uncertainty; A non-trivial martingale extension of Johnson-Lindenstrauss (JL) to sequentially adaptive data processes.
Yingru Li
PDF
Cite
Poster
Simple, unified analysis of Johnson-Lindenstrauss with applications
TL;DR: We simplify and unify various constructions of the Johnson-Lindenstrauss (JL) lemma, including spherical and sub-Gaussian models, and provide the first rigorous proof for spherical construction’s effectiveness. Our work extends the Hanson-Wright inequality and solidifies the JL lemma’s theoretical foundation, enhancing its practical applications in computational algorithms.
Yingru Li
PDF
Cite
Poster
Cite
×