Application

Learning an Opponent-aware Anti-jamming Strategy via Online Convex Optimization
Controlled Decoding via Q-Star on Outcome Feedback for Language Models
Uncertainty-aware Multi-turn Language Agents for Medical Decision-making
Hidden community detection in social networks