Publications

(2024). Proactive Agents for Multi-turn Hospital Outpatient Referral under Uncertainty.

Cite

(2024). Scalable Exploration via Ensemble++.

PDF Cite Code

(2024). Q-Star Meets Scalable Posterior Sampling: Bridging Theory and Practice via HyperAgent. International Conference on Machine Learning (ICML).

PDF Cite Code Poster Slides Video

(2024). Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation. Preprint. Presentation at ICML 2024 Workshops: (1) “Aligning Reinforcement Learning Experimentalists and Theorists”; (2) “Automated Reinforcement Learning: Exploring Meta-Learning, AutoML, and LLMs”.

PDF Cite Code Poster Slides Video

(2024). Prior-dependent analysis of posterior sampling reinforcement learning with function approximation. International Conference on Artificial Intelligence and Statistics (AISTATS).

PDF Cite

(2024). Simple, unified analysis of Johnson-Lindenstrauss with applications. Preprint. Presentation at ICML 2024 Workshop “High-dimensional Learning Dynamics 2024: The Emergence of Structure and Reasoning”.

PDF Cite Poster

(2024). Probability Tools for Sequential Random Projection. Preprint. Presentation at ICML 2024 Workshop “High-dimensional Learning Dynamics 2024: The Emergence of Structure and Reasoning”.

PDF Cite Poster

(2024). Optimistic Thompson Sampling for No-Regret Learning in Unknown Games. Preprint. Presentation at ICML 2023 Workshop “The Many Facets of Preference-Based Learning”.

PDF Cite

(2022). HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning. International Conference on Learning Representations (ICLR).

PDF Cite Code Video

(2019). Divergence-augmented policy optimization. Advances in Neural Information Processing Systems (NeurIPS).

PDF Cite Code Poster

(2018). Hidden community detection in social networks. Information Sciences.

PDF Cite Code