Yingru Li
Yingru Li
Home
Research
Contact
Resume
RL-Seminar
Light
Dark
Automatic
llm
Uncertainty-guided Search for Multi-step Reasoning in LLMs
Fei Yu*
,
Yingru Li* (equal)
,
Benyou Wang
,
Zhi-Quan Luo
Cite
Scalable Exploration via Ensemble++
Yingru Li
,
Jiawei Xu
,
Baoxiang Wang
,
Zhi-Quan Luo
PDF
Cite
Code
Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation
We prove HyperAgent closes a theoretical gap in scalable exploration. Further, GPT-HyperAgent addresses risk and efficiency challenges in human-Al interplay for automated content moderation with human feedback.
Yingru Li
,
Jiawei Xu
,
Zhi-Quan Luo
PDF
Cite
Code
Poster
Slides
Video
Multi-turn Actor-critic Language Agents for Hospital Outpatient Referral
Yingru Li
,
Xiaoxiao Liu
,
Benyou Wang
,
Zhi-Quan Luo
Cite
Cite
×