Peer-Reviewed Publications
- The Closeness of In-Context Learning and Weight Shifting for Softmax Regression
  (Alphabetical Order) Shuai Li, Zhao Song, Yu Xia, Tong Yu, Tianyi Zhou.
  NeurIPS 2024 [Paper]
- Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback
  Yu Xia, Tong Yu, Zhankui He, Handong Zhao, Julian McAuley, Shuai Li.
  NAACL 2024 [Paper]
- Hallucination Diversity-Aware Active Learning for Text Summarization
  Yu Xia, Xu Liu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Anup Rao, Tung Mai, Shuai Li.
  NAACL 2024 [Paper]
- Which LLM to Play? Convergence-Aware Online Model Selection with Time-Increasing Bandits
  Yu Xia*, Fang Kong*, Tong Yu, Liya Guo, Ryan A. Rossi, Sungchul Kim, Shuai Li.
  WWW 2024 Oral [Paper]
- Towards Joint Utilization of Absolute and Relative Bandit Feedback for Conversational Recommendation
  Yu Xia*, Zhihui Xie*, Tong Yu, Canzhe Zhao, Shuai Li.
  UMUAI 2024 Special Issue on CRS [Paper]
- User-Regulation Deconfounded Conversational Recommender System with Bandit Feedback
  Yu Xia, Junda Wu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Shuai Li.
  KDD 2023 [Paper]