Peer-Reviewed Publications

  • The Closeness of In-Context Learning and Weight Shifting for Softmax Regression
    (Alphabetical Order) Shuai Li, Zhao Song, Yu Xia, Tong Yu, Tianyi Zhou.
    NeurIPS 2024 [Paper]

  • Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback
    Yu Xia, Tong Yu, Zhankui He, Handong Zhao, Julian McAuley, Shuai Li.
    NAACL 2024 [Paper]

  • Hallucination Diversity-Aware Active Learning for Text Summarization
    Yu Xia, Xu Liu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Anup Rao, Tung Mai, Shuai Li.
    NAACL 2024 [Paper]

  • Which LLM to Play? Convergence-Aware Online Model Selection with Time-Increasing Bandits
    Yu Xia*, Fang Kong*, Tong Yu, Liya Guo, Ryan A. Rossi, Sungchul Kim, Shuai Li.
    WWW 2024 Oral [Paper]

  • Towards Joint Utilization of Absolute and Relative Bandit Feedback for Conversational Recommendation
    Yu Xia*, Zhihui Xie*, Tong Yu, Canzhe Zhao, Shuai Li.
    UMUAI 2024 Special Issue on CRS [Paper]

  • User-Regulation Deconfounded Conversational Recommender System with Bandit Feedback
    Yu Xia, Junda Wu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Shuai Li.
    KDD 2023 [Paper]