Publications
-
The Closeness of In-Context Learning and Weight Shifting for Softmax Regression
(Alphabetical Order) Shuai Li, Zhao Song, Yu Xia, Tong Yu, Tianyi Zhou.
NeurIPS 2024 [arXiv] -
Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback
Yu Xia, Tong Yu, Zhankui He, Handong Zhao, Julian McAuley, Shuai Li.
NAACL 2024 [Paper] -
Hallucination Diversity-Aware Active Learning for Text Summarization
Yu Xia, Xu Liu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Anup Rao, Tung Mai, Shuai Li.
NAACL 2024 [Paper] -
Which LLM to Play? Convergence-Aware Online Model Selection with Time-Increasing Bandits
Yu Xia*, Fang Kong*, Tong Yu, Liya Guo, Ryan A. Rossi, Sungchul Kim, Shuai Li.
WWW 2024 Oral [Paper] -
Towards Joint Utilization of Absolute and Relative Bandit Feedback for Conversational Recommendation
Yu Xia*, Zhihui Xie*, Tong Yu, Canzhe Zhao, Shuai Li.
UMUAI 2024 Special Issue on CRS [Paper] -
User-Regulation Deconfounded Conversational Recommender System with Bandit Feedback
Yu Xia, Junda Wu, Tong Yu, Sungchul Kim, Ryan A. Rossi, Shuai Li.
KDD 2023 [Paper]
arXiv Preprints
-
Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs
Yu Xia, Rui Wang, Xu Liu, Mingyan Li, Tong Yu, Xiang Chen, Julian McAuley, Shuai Li.
[arXiv] -
Visual Prompting in Multimodal Large Language Models: A Survey
Junda Wu, Zhehao Zhang, Yu Xia, Xintong Li, Zhaoyang Xia, Aaron Chang, Tong Yu, Sungchul Kim, Ryan A. Rossi, Ruiyi Zhang, Subrata Mitra, Dimitris N. Metaxas, Lina Yao, Jingbo Shang, Julian McAuley.
[arXiv] -
Federated Large Language Models: Current Progress and Future Directions
Yuhang Yao, Jianyi Zhang, Junda Wu, Chengkai Huang, Yu Xia, Tong Yu, Ruiyi Zhang, Sungchul Kim, Ryan Rossi, Ang Li, Lina Yao, Julian McAuley, Yiran Chen, Carlee Joe-Wong.
[arXiv]