CoAI

Large Language Models Are Not Robust Multiple Choice Selectors

  • ICLR 2024(Spotlight)

Chujie Zheng,Hao Zhou,Fandong Meng,Jie Zhou,Minlie Huang

MiniLLM: Knowledge Distillation of Large Language Models

  • ICLR 2024

Yuxian Gu,Li Dong,Furu Wei,Minlie Huang

Language Model Decoding as Direct Metrics Optimization

  • ICLR 2024

Haozhe Ji,Pei Ke,Hongning Wang,Minlie Huang

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

  • ICLR 2024

Zhibin Gou*,Zhihong Shao*,Yeyun Gong,yelong shen,Yujiu Yang,Minlie Huang,Nan Duan,Weizhu Chen

Depression Detection in Clinical Interviews with LLM-Empowered Structural Element Graph

  • NAACL 2024

Zhuang Chen, Jiawen Deng, Jinfeng Zhou, Jincenzi Wu, Tieyun Qian, Minlie Huang

Language Models Hallucinate, but May Excel at Fact Verification

  • NAACL 2024

Jian Guan, Jesse Dodge, David Wadden, Minlie Huang, Hao Peng

On Prompt-Driven Safeguarding for Large Language Models

  • ICML 2024

Chujie Zheng, Fan Yin, Hao Zhou, Fandong Meng, Jie Zhou, Kai-Wei Chang, Minlie Huang, Nanyun Peng

Towards Efficient and Exact Optimization of Language Model Alignment

  • ICML 2024

Haozhe Ji, Cheng Lu, Yilin Niu, Pei Ke, Hongning Wang, Jun Zhu, Jie Tang, Minlie Huang

Human vs. Generative AI in Content Creation Competition: Symbiosis or Conflict?

  • ICML 2024

Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, Haifeng Xu

ToMBench: Benchmarking Theory of Mind in Large Language Models

  • ACL 2024

Zhuang Chen, Jincenzi Wu, Jinfeng Zhou, Bosi Wen, Guanqun Bi, Gongyao Jiang, Yaru Cao, Mengting Hu, Yonghui Li, Zexuan Xiong, Minlie Huang

EmoBench: Evaluation the Emotional Intelligence of Large Language Models

  • ACL 2024

Sahand Sabour, Siyang Liu, Zheyuan Zhang, June M. Liu, Jinfeng Zhou, Alvionna S. Sunaryo, Juanzi Li, Tatia M.C. Lee, Rada Mihalcea, Minlie Huang

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

  • ACL 2024

Zhexin Zhang, Junxiao Yang, Pei Ke, Fei Mi, Hongning Wang, Minlie Huang

SafetyBench: Evaluating the Safety of Large Language Models

  • ACL 2024

Zhexin Zhang, Leqi Lei, Lindong Wu, Rui Sun, Yongkang Huang, Chong Long, Xiao Liu, Xuanyu Lei, Jie Tang, Minlie Huang

Scalable Oversight by Learning Decomposition From Human Feedback: A Case Study in Competitive Programming

  • ACL 2024

Jiaxin Wen, Ruiqi Zhong, Pei Ke, Zhihong Shao, Hongning Wang, Minlie Huang

CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation

  • ACL 2024

Pei Ke, Bosi Wen, Zhuoer Feng, Xiao Liu, Xuanyu Lei, Jiale Cheng, Shengyuan Wang, Aohan Zeng, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

  • ACL 2024

Jiale Cheng, Xiao Liu, Kehan Zheng, Pei Ke, Hongning Wang, Yuxiao Dong, Jie Tang, Minlie Huang

COKE: A Cognitive Knowledge Graph for Machine Theory of Mind

  • ACL 2024

Jincenzi Wu, Zhuang Chen, Jiawen Deng, Sahand Sabour, Helen Meng, Minlie Huang