Large Language Models Are Not Robust Multiple Choice Selectors
Chujie Zheng,Hao Zhou,Fandong Meng,Jie Zhou,Minlie Huang
MiniLLM: Knowledge Distillation of Large Language Models
Yuxian Gu,Li Dong,Furu Wei,Minlie Huang
Language Model Decoding as Direct Metrics Optimization
Haozhe Ji,Pei Ke,Hongning Wang,Minlie Huang
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
Zhibin Gou*,Zhihong Shao*,Yeyun Gong,yelong shen,Yujiu Yang,Minlie Huang,Nan Duan,Weizhu Chen
EmoBench: Evaluating the Emotional Intelligence of Large Language Models
Sahand Sabour, Siyang Liu, Zheyuan Zhang, June M. Liu, Jinfeng Zhou, Alvionna S. Sunaryo, Juanzi Li, Tatia M.C. Lee, Rada Mihalcea, Minlie Huang
Depression Detection in Clinical Interviews with LLM-Empowered Structural Element Graph
Zhuang Chen, Jiawen Deng, Jinfeng Zhou, Jincenzi Wu, Tieyun Qian, Minlie Huang
Language Models Hallucinate, but May Excel at Fact Verification
Jian Guan, Jesse Dodge, David Wadden, Minlie Huang, Hao Peng
On Prompt-Driven Safeguarding for Large Language Models
Chujie Zheng, Fan Yin, Hao Zhou, Fandong Meng, Jie Zhou, Kai-Wei Chang, Minlie Huang, Nanyun Peng
Towards Efficient Exact Optimization of Language Model Alignment
Haozhe Ji, Cheng Lu, Yilin Niu, Pei Ke, Hongning Wang, Jun Zhu, Jie Tang, Minlie Huang
Human vs. Generative AI in Content Creation Competition: Symbiosis or Conflict?
Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, Haifeng Xu
ToMBench: Benchmarking Theory of Mind in Large Language Models
Zhuang Chen, Jincenzi Wu, Jinfeng Zhou, Bosi Wen, Guanqun Bi, Gongyao Jiang, Yaru Cao, Mengting Hu, Yonghui Li, Zexuan Xiong, Minlie Huang
Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization
Zhexin Zhang, Junxiao Yang, Pei Ke, Fei Mi, Hongning Wang, Minlie Huang
SafetyBench: Evaluating the Safety of Large Language Models
Zhexin Zhang, Leqi Lei, Lindong Wu, Rui Sun, Yongkang Huang, Chong Long, Xiao Liu, Xuanyu Lei, Jie Tang, Minlie Huang
Learning Task Decomposition to Assist Humans in Competitive Programming
Jiaxin Wen, Ruiqi Zhong, Pei Ke, Zhihong Shao, Hongning Wang, Minlie Huang
CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation
Pei Ke, Bosi Wen, Zhuoer Feng, Xiao Liu, Xuanyu Lei, Jiale Cheng, Shengyuan Wang, Aohan Zeng, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang
Black-Box Prompt Optimization: Aligning Large Language Models without Model Training
Jiale Cheng, Xiao Liu, Kehan Zheng, Pei Ke, Hongning Wang, Yuxiao Dong, Jie Tang, Minlie Huang
COKE: A Cognitive Knowledge Graph for Machine Theory of Mind
Jincenzi Wu, Zhuang Chen, Jiawen Deng, Sahand Sabour, Helen Meng, Minlie Huang
AlignBench: Benchmarking Chinese Alignment of Large Language Models (with Prof. Jie Tang)
Liu,Xiao;Lei,Xuanyu;Wang,Shengyuan;Huang,Yue;Feng,Zhuoer;Wen,Bosi;Cheng,Jiale;Ke,Pei;Xu,Yifan;Tam,Weng Lam;Zhang,Xiaohan;Sun,Lichao;Gu,Xiaotao;Wang,Hongning;Zhang,Jing;Huang,Minlie;Dong,Yuxiao;Tang,Jie
AMoR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback
Jian Guan,Wei Wu,Zujie Wen,Peng Xu,Hongning Wang,Minlie Huang
Perception of Knowledge Boundary forLarge Language Models through Semi-open-endedQuestion Answering (with Assoc. Prof. Zhiliang Tian)
Zhihua Wen,Zhiliang Tian,Zexin Jian,Zhen Huang,Pei Ke,Yifu Gao,Minlie Huang, Dongsheng Li
Benchmarking Complex Instruction-Following with Multiple Constraints Composition
Bosi Wen,Pei Ke,Xiaotao Gu,Lindong Wu,Hao Huang,Jinfeng Zhou,Wenchuang Li,Binxin Hu,Wendy Gao,Jiaxing Xu,Yiming Liu,Jie Tang,Hongning Wang,Minlie Huang
Instruction Pre-Training: Language Models are Supervised Multitask Learners
Daixuan Cheng,Yuxian Gu,Shaohan Huang,B Junyu Bi, Minlie Huang,B Furu Wei
CharacterGLM: Customizing Social Characters with Large Language Models
Jinfeng Zhou,Zhuang Chen,Dazhen Wan,Bosi Wen,Yi Song,Jifan Yu,Yongkang Huang,Pei Ke,Guanqun Bi,Libiao Peng,Jiaming Yang,Xiyao Xiao,Sahand Sabour,Xiaohan Zhang,Wenjing Hou,Yijia Zhang,Yuxiao Dong,Hongning Wang,Jie Tang,Minlie Huang
ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings (with Prof. Lei Sha)
Hao Wang,Hao Li,Minlie Huang,Lei Sha
AUTODETECT: Towards a Unified Framework for Automated Weakness Detection in Large Language Models
Jiale Cheng ,Yida Lu,Xiaotao Gu,Pei Ke,Xiao Liu,Yuxiao Dong,Hongning Wang,Jie Tang,Minlie Huang
ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors
Zhexin Zhang,Yida Lu,Jingyuan Ma,Di Zhang,Rui Li,Pei Ke,Hao Sun,Lei Sha,Zhifang Sui,Hongning Wang,Minlie Huang