CoAI

Large Language Models Are Not Robust Multiple Choice Selectors

  • ICLR 2024(Spotlight)

Chujie Zheng,Hao Zhou,Fandong Meng,Jie Zhou,Minlie Huang

MiniLLM: Knowledge Distillation of Large Language Models

  • ICLR 2024

Yuxian Gu,Li Dong,Furu Wei,Minlie Huang

Language Model Decoding as Direct Metrics Optimization

  • ICLR 2024

Haozhe Ji,Pei Ke,Hongning Wang,Minlie Huang

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

  • ICLR 2024

Zhibin Gou*,Zhihong Shao*,Yeyun Gong,yelong shen,Yujiu Yang,Minlie Huang,Nan Duan,Weizhu Chen

EmoBench: Evaluating the Emotional Intelligence of Large Language Models

  • ACL 2024

Sahand Sabour, Siyang Liu, Zheyuan Zhang, June M. Liu, Jinfeng Zhou, Alvionna S. Sunaryo, Juanzi Li, Tatia M.C. Lee, Rada Mihalcea, Minlie Huang

Depression Detection in Clinical Interviews with LLM-Empowered Structural Element Graph

  • NAACL 2024

Zhuang Chen, Jiawen Deng, Jinfeng Zhou, Jincenzi Wu, Tieyun Qian, Minlie Huang

Language Models Hallucinate, but May Excel at Fact Verification

  • NAACL 2024

Jian Guan, Jesse Dodge, David Wadden, Minlie Huang, Hao Peng

On Prompt-Driven Safeguarding for Large Language Models

  • ICML 2024

Chujie Zheng, Fan Yin, Hao Zhou, Fandong Meng, Jie Zhou, Kai-Wei Chang, Minlie Huang, Nanyun Peng

Towards Efficient Exact Optimization of Language Model Alignment

  • ICML 2024

Haozhe Ji, Cheng Lu, Yilin Niu, Pei Ke, Hongning Wang, Jun Zhu, Jie Tang, Minlie Huang

Human vs. Generative AI in Content Creation Competition: Symbiosis or Conflict?

  • ICML 2024

Fan Yao, Chuanhao Li, Denis Nekipelov, Hongning Wang, Haifeng Xu

ToMBench: Benchmarking Theory of Mind in Large Language Models

  • ACL 2024

Zhuang Chen, Jincenzi Wu, Jinfeng Zhou, Bosi Wen, Guanqun Bi, Gongyao Jiang, Yaru Cao, Mengting Hu, Yonghui Li, Zexuan Xiong, Minlie Huang

Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization

  • ACL 2024

Zhexin Zhang, Junxiao Yang, Pei Ke, Fei Mi, Hongning Wang, Minlie Huang

SafetyBench: Evaluating the Safety of Large Language Models

  • ACL 2024

Zhexin Zhang, Leqi Lei, Lindong Wu, Rui Sun, Yongkang Huang, Chong Long, Xiao Liu, Xuanyu Lei, Jie Tang, Minlie Huang

Learning Task Decomposition to Assist Humans in Competitive Programming

  • ACL 2024

Jiaxin Wen, Ruiqi Zhong, Pei Ke, Zhihong Shao, Hongning Wang, Minlie Huang

CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation

  • ACL 2024

Pei Ke, Bosi Wen, Zhuoer Feng, Xiao Liu, Xuanyu Lei, Jiale Cheng, Shengyuan Wang, Aohan Zeng, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang

Black-Box Prompt Optimization: Aligning Large Language Models without Model Training

  • ACL 2024

Jiale Cheng, Xiao Liu, Kehan Zheng, Pei Ke, Hongning Wang, Yuxiao Dong, Jie Tang, Minlie Huang

COKE: A Cognitive Knowledge Graph for Machine Theory of Mind

  • ACL 2024

Jincenzi Wu, Zhuang Chen, Jiawen Deng, Sahand Sabour, Helen Meng, Minlie Huang

AlignBench: Benchmarking Chinese Alignment of Large Language Models (with Prof. Jie Tang)

  • ACL 2024

Liu,Xiao;Lei,Xuanyu;Wang,Shengyuan;Huang,Yue;Feng,Zhuoer;Wen,Bosi;Cheng,Jiale;Ke,Pei;Xu,Yifan;Tam,Weng Lam;Zhang,Xiaohan;Sun,Lichao;Gu,Xiaotao;Wang,Hongning;Zhang,Jing;Huang,Minlie;Dong,Yuxiao;Tang,Jie

AMoR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback

  • NeurIPS 2024

Jian Guan,Wei Wu,Zujie Wen,Peng Xu,Hongning Wang,Minlie Huang

Perception of Knowledge Boundary forLarge Language Models through Semi-open-endedQuestion Answering (with Assoc. Prof. Zhiliang Tian)

  • NeurIPS 2024

Zhihua Wen,Zhiliang Tian,Zexin Jian,Zhen Huang,Pei Ke,Yifu Gao,Minlie Huang, Dongsheng Li

Benchmarking Complex Instruction-Following with Multiple Constraints Composition

  • NeurIPS D&B Track 2024

Bosi Wen,Pei Ke,Xiaotao Gu,Lindong Wu,Hao Huang,Jinfeng Zhou,Wenchuang Li,Binxin Hu,Wendy Gao,Jiaxing Xu,Yiming Liu,Jie Tang,Hongning Wang,Minlie Huang

Instruction Pre-Training: Language Models are Supervised Multitask Learners

  • EMNLP 2024

Daixuan Cheng,Yuxian Gu,Shaohan Huang,B Junyu Bi, Minlie Huang,B Furu Wei

CharacterGLM: Customizing Social Characters with Large Language Models

  • EMNLP 2024 industry track

Jinfeng Zhou,Zhuang Chen,Dazhen Wan,Bosi Wen,Yi Song,Jifan Yu,Yongkang Huang,Pei Ke,Guanqun Bi,Libiao Peng,Jiaming Yang,Xiyao Xiao,Sahand Sabour,Xiaohan Zhang,Wenjing Hou,Yijia Zhang,Yuxiao Dong,Hongning Wang,Jie Tang,Minlie Huang

AUTODETECT: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

  • Findings of EMNLP 2024

Jiale Cheng ,Yida Lu,Xiaotao Gu,Pei Ke,Xiao Liu,Yuxiao Dong,Hongning Wang,Jie Tang,Minlie Huang

ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors

  • Findings of EMNLP 2024

Zhexin Zhang,Yida Lu,Jingyuan Ma,Di Zhang,Rui Li,Pei Ke,Hao Sun,Lei Sha,Zhifang Sui,Hongning Wang,Minlie Huang