Publications
Preprint
Renze Lou, Hanzi Xu, Sijia Wang, Jiangshu Du, Ryo Kamoi, Xiaoxin Lu, Jian Xie, Yuxuan Sun, Yusen Zhang, Jihyun Janice Ahn, Hongchao Fang, Zhuoyang Zou, Wenchao Ma, Xi Li, Kai Zhang, Congying Xia, Lifu Huang, Wenpeng Yin. AAAR-1.0: Assessing AI's Potential to Assist Research. Arxiv; Project Webpage
Zhiwei Zhang, Fali Wang, Xiaomin Li, Zongyu Wu, Xianfeng Tang, Hui Liu, Qi He, Wenpeng Yin, Suhang Wang. Does Your LLM Truly Unlearn? An Embarrassingly Simple Approach to Recover Unlearned Knowledge. Arxiv
A M Muntasir Rahman, Junyi Ye, Wei Yao, Wenpeng Yin, Guiling Wang. From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems. Arxiv
Junyi Ye, Jingyi Gu, Xinyun Zhao, Wenpeng Yin, Guiling Wang. Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems. Arxiv
Jihyun Janice Ahn, Ryo Kamoi, Lu Cheng, Rui Zhang, Wenpeng Yin. Direct-Inverse Prompting: Analyzing LLMs' Discriminative Capacity in Self-Improving Generation. Arxiv
Hanzi Xu, Renze Lou, Jiangshu Du, Vahid Mahzoon, Elmira Talebianaraki, Zhuoan Zhou, Elizabeth Garrison, Slobodan Vucetic, Wenpeng Yin. LLMs’ Classification Performance is Overclaimed. Arxiv
Jiaxu Zhao, Meng Fang, Shirui Pan, Wenpeng Yin, Mykola Pechenizkiy. GPTBIAS: A Comprehensive Framework for Evaluating Bias in Large Language Models. Arxiv
2025:
Saptarshi Sengupta, Connor Heaton, Shreya Ghosh, Wenpeng Yin, Suhang Wang and Preslav Nakov. TOP-Training: Target-Oriented Pretraining for Medical Extractive Question Answering. COLING'25
Saptarshi Sengupta, Suhang Wang, Wenpeng Yin, Shreya Ghosh and Preslav Nakov. Exploring Language Model Generalization in Low-Resource Extractive QA. COLING'25
2024:
Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Ranran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Jiayang Cheng, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, Jing Gu, Haoran Li, Kangda Wei, Zihao Wang, Lu Cheng, Surangika Ranathunga, Meng Fang, Jie Fu, Fei Liu, Ruihong Huang, Eduardo Blanco, Yixin Cao, Rui Zhang, Philip S. Yu, Wenpeng Yin. LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing. EMNLP'24
Chang Tian, Matthew B. Blaschko, Wenpeng Yin, Mingzhe Xing, Yinliang Yue, Marie-Francine Moens. A Generic Method for Fine-grained Category Discovery in Natural Language Texts. EMNLP'24
Wenpeng Yin, Muhao Chen, Rui Zhang, Ben Zhou, Fei Wang, Dan Roth. Enhancing LLM Capabilities Beyond Scaling Up. EMNLP'24 Tutorial
Simone Luchini, Ibraheem Moosa, John D. Patterson, Dan Johnson, Matthijs Baas, Baptiste Barbot, Iana Bashmakova, Mathias Benedek, Qunlin Chen, Giovanni E. Corazza, Boris Forthmann, Benjamin Goecke, Sameh Ibrahim, Maciej Karwowski, Yoed N. Kenett, Todd Lubart, Kirill G. Miroshnik, Felix-Kingsley Obialo, Marcela Ovando-Tellez, Ricardo Primi, Rogelio Puente, Claire Stevenson, Emmanuelle Volle, Janet G. van Hell, Wenpeng Yin, Roger E. Beaty. Automated Assessment of Creativity in Multilingual Narratives. Psychology of Aesthetics, Creativity, and the Arts (Journal)
Ryo Kamoi, Sarkar Snigdha Sarathi Das, Renze Lou, Jihyun Janice Ahn, Yilun Zhao, Xiaoxin Lu, Nan Zhang, Yusen Zhang, Ranran Haoran Zhang, Sujeeth Reddy Vummanthala, Salika Dave, Shaobo Qin, Arman Cohan, Wenpeng Yin, Rui Zhang. Evaluating LLMs at Detecting Errors in LLM Responses. COLM 2024
Ying Shen, Zhiyang Xu, Qifan Wang, Yu Cheng, Wenpeng Yin, Lifu Huang. Multimodal Instruction Tuning with Conditional Mixture of LoRA. ACL'24
Congying Xia, Chen Xing, Jiangshu Du, Xinyi Yang, Yihao Feng, Ran Xu, Wenpeng Yin, Caiming Xiong. FOFO: A Benchmark to Evaluate LLMs' Format-Following Capability. ACL'24
Zihao Lin, Mohammad Beigi, Hongxuan Li, Yufan Zhou, Yuxiang Zhang, Qifan Wang, Wenpeng Yin, Lifu Huang. Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models. ACL'24
Hanzi Xu, Muhao Chen, Lifu Huang, Slobodan Vucetic, Wenpeng Yin. X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification. ACL'24 Findings
Tianyi Yan, Fei Wang, James Y. Huang, Wenxuan Zhou, Fan Yin, Aram Galstyan, Wenpeng Yin, Muhao Chen. Contrastive Instruction Tuning. ACL'24 Findings
Renze Lou, Kai Zhang, Wenpeng Yin. Large Language Model Instruction Following: A Survey of Progresses and Challenges. Computational Linguistics
Ibraheem Muhammad Moosa, Rui Zhang, Wenpeng Yin. MT-Ranker: Reference-free machine translation evaluation by inter-system ranking. ICLR'24 Spotlight
Renze Lou, Kai Zhang, Jian Xie, Yuxuan Sun, Janice Ahn, Hanzi Xu, Yu Su, Wenpeng Yin. MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following. ICLR'24
Jingyi Gu, Junyi Ye, Guiling Wang, Wenpeng Yin. Adaptive and Explainable Margin Trading via Large Language Models on Portfolio Management. ICAIF 2024 (5th ACM International Conference on AI in Finance)
Philip Wootaek Shin, Jihyun Janice Ahn, Wenpeng Yin, Jack Sampson, Vijaykrishnan Narayanan. Can Prompt Modifiers Control Bias? A Comparative Analysis of Text-to-Image Generative Models. IJCAI 2024 AI Governance Workshop.
Chang Tian, Wenpeng Yin, Dan Li, Marie-Francine Moens. Fighting Against the Repetitive Training and Sample Dependency Problem in Few-shot Named Entity Recognition. IEEE Access
Janice Ahn, Rishu Verma, Renze Lou, Di Liu, Rui Zhang, Wenpeng Yin. Large Language Models for Mathematical Reasoning: Progresses and Challenges. EACL'24 Student Research Workshop
Renze Lou, Wenpeng Yin. Toward Zero-Shot Instruction Following. EACL'24 Student Research Workshop
2023:
Yair Neuman, Yochai Cohen, Wenpeng Yin. Identifying social norm violation in movie plots: from Borat to American Pie. Digital Scholarship in the Humanities (Journal).
Sarkar Snigdha Sarathi Das, Haoran Ranran Zhang, Peng Shi, Wenpeng Yin, Rui Zhang. Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning. EMNLP'23.
Jiangshu Du, Congying Xia, Wenpeng Yin, Tingting Liang and Philip Yu. All Labels Together: Low-shot Intent Detection with an Efficient Label Semantic Encoding Paradigm. AACL'23 (short)
Jiasheng Gu, Hongyu Zhao, Hanzi Xu, Liangyu Nie, Hongyuan Mei and Wenpeng Yin. Robustness of Learning from Task Instructions. Findings of ACL 2023
Wenpeng Yin, Qinyuan Ye, Pengfei Liu, Xiang Ren, Hinrich Schütze. LLM-driven Instruction Following: Progresses and Concerns. Tutorial at EMNLP'23 & KONVENS'23 (Slides at KONVENS'23; Materials at EMNLP'23)
Wenpeng Yin, Muhao Chen, Ben Zhou, Qiang Ning, Kai-Wei Chang, Dan Roth. Indirectly Supervised Natural Language Processing. Tutorial at ACL'23
Xiaodong Yu, Wenpeng Yin, Nitish Gupta, Dan Roth. Event Linking: Grounding Event Mentions to Wikipedia. The 17th Conference of the European Chapter of the Association for Computational Linguistics (EACL'23)
Jiangshu Du, Wenpeng Yin, Congying Xia, Philip S. Yu. Learning to Select from Multiple Options. Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI'23)
2022:
Hanzi Xu, Slobodan Vucetic and Wenpeng Yin. OpenStance: Real-world Zero-shot Stance Detection. CoNLL 2022
Xiaodong Yu, Wenpeng Yin, Dan Roth. Paired Representation Learning for Event and Entity Coreference. The 11th Joint Conference on Lexical and Computational Semantics (*SEM 2022)
Chang Tian, Wenpeng Yin, Marie-Francine Moens. Anti-overestimation Dialogue Policy Learning for Task-completion Dialogue System. Findings of NAACL.
Tian Xie, Xinyi Yang, Angela S Lin, Feihong Wu, Kazuma Hashimoto, Jin Qu, Young Mo Kang, Wenpeng Yin, Huan Wang, Semih Yavuz, Gang Wu, Michael Jones, Richard Socher, Yingbo Zhou, Wenhao Liu, Caiming Xiong. Converse: A Tree-Based Modular Task-Oriented Dialogue System. Arxiv.
Wenpeng Yin, Jia Li, Caiming Xiong. ConTinTin: Continual Learning from Task Instructions. The 60th Annual Meeting of the Association for Computational Linguistics (ACL).
Bangzheng Li, Wenpeng Yin, Muhao Chen. Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference. Transactions of ACL.
2021:
Wenpeng Yin, Shelby Heinecke, Jia Li, et al. Combining Data-driven Supervision with Human-in-the-loop Feedback for Entity Resolution. Data-Centric AI Workshop at NeurIPS.
Wenpeng Yin, Dragomir Radev, Caiming Xiong. Doc-NLI: A Large-scale Dataset for Document-level Natural Language Inference. The 59th Annual Meeting of the Association for Computational Linguistics (ACL Findings)
Wenpeng Yin, Huan Wang, Jin Qu, Caiming Xiong. BatchMixup: Improving Training by Interpolating Hidden States of the Entire Mini-batch. The 59th Annual Meeting of the Association for Computational Linguistics (ACL Findings)
Congying Xia*, Wenpeng Yin*, Yihao Feng, Philip S. Yu. Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System. 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). (*: equal contribution)
Bailin Wang, Wenpeng Yin, Xi Victoria Lin, Caiming Xiong. Learning to Synthesize Data for Semantic Parsing. 2021 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL).
Andre Esteva, Anuprit Kale, Romain Paulus, Kazuma Hashimoto, Wenpeng Yin, Dragomir Radev, Richard Socher. CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization. Nature Digital Medicine. Featured by Forbes and TechRepublic
2020:
Wenpeng Yin, Nazneen Fatema Rajani, Dragomir Radev, Richard Socher, Caiming Xiong. Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start (EMNLP, PDF)
Lichao Sun, Congying Xia, Wenpeng Yin, Tingting Liang, Philip Yu and Lifang He. Mixup-Transformer: Dynamic Data Augmentation for NLP Tasks. The 28th International Conference on Computational Linguistics (COLING)
Jianquan Li, Xiaokang Liu, Wenpeng Yin, Min Yang, Liqun Ma. Empirical evaluation of multi-task learning in deep neural networks for natural language processing. Neural Computing and Applications
Wenpeng Yin. Meta-learning for Few-shot Natural Language Processing: A Survey. arXiv
Nazneen Fatema Rajani, Ben Krause, Wenpeng Yin, Tong Niu, Richard Socher, Caiming Xiong. Explaining and Improving Model Behavior with k Nearest Neighbor Representations. arXiv
Lichao Sun, Kazuma Hashimoto, Wenpeng Yin, Akari Asai, Jia Li, Philip Yu, Caiming Xiong. Adv-BERT: BERT is not robust on misspellings! Generating nature adversarial samples on BERT. arXiv
2019:
Wenpeng Yin, Jamaal Hay, and Dan Roth. Benchmarking Zero-shot Text Classification: Datasets, Evaluation and Entailment Approach. (EMNLP'2019, Dataset&Code, Demo at UPenn, Demo at Huggingface)
Min Yang, Wenpeng Yin, Qiang Qu, Wenting Tu, Ying Shen, Xiaojun Chen. Neural Attentive Network for Cross-Domain Aspect-level Sentiment Classification. IEEE Transactions on Affective Computing.
Sihao Chen, Daniel Khashabi, Wenpeng Yin, Chris Callison-Burch, and Dan Roth. Seeing Things from a Different Angle: Discovering Diverse Perspectives about Claims. (NAACL'2019, PDF, Dataset)
Yibo Sun, Duyu Tang, Nan Duan, Tao Qin, Shujie Liu, Zhao Yan, Ming Zhou, Yuanhua Lv, Wenpeng Yin, Xiaochen Feng, Bing Qin, Ting Liu. Joint Learning of Question Answering and Question Generation. IEEE Transactions on Knowledge and Data Engineering (TKDE)
2018:
Wenpeng Yin, Dan Roth. TwoWingOS: A Two-Wing Optimization Strategy for Evidential Claim Verification. (EMNLP'2018, PDF, code)
Wenpeng Yin, Hinrich Schütze. Attentive Convolution: Equipping CNNs with RNN-style Attention Mechanisms. Transactions of the ACL.
Wenpeng Yin, Hinrich Schütze, Dan Roth. End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions. (ACL'2018, PDF, code)
Wenpeng Yin, Dan Roth. Term Definitions Help Hypernymy Detection. 7th Joint Conference on Lexical and Computational Semantics (*SEM'2018, PDF)
Wenpeng Yin, Yadollah Yaghoobzadeh and Hinrich Schütze. Recurrent One-Hop Predictions for Reasoning over Knowledge Graphs. (COLING'2018, PDF, Awarded "Area Chair Favorites")
2017:
Mo Yu, Wenpeng Yin, Kazi Saidul Hasan, Cicero dos Santos, Bing Xiang and Bowen Zhou. Improved Neural Relation Detection for Knowledge Base Question Answering. (ACL'2017, PDF)
Wenpeng Yin, Hinrich Schütze. Task-Specific Attentive Pooling of Phrase Alignments Contributes to Sentence Matching. (EACL'2017, PDF)
Wenpeng Yin, Katharina Kann, Mo Yu, Hinrich Schütze. Comparative Study of CNN and RNN for Natural Language Processing. arXiv.
2016:
Wenpeng Yin, Hinrich Schütze, Bing Xiang, Bowen Zhou. ABCNN: Attention-Based Convolutional Neural Network for Modeling Sentence Pairs. Transactions of the Association for Computational Linguistics (TACL).
Wenpeng Yin, Mo Yu, Bing Xiang, Bowen Zhou, Hinrich Schütze. Simple Question Answering by Attentive Convolutional Neural Network. COLING'2016, PDF
Wenpeng Yin, Hinrich Schütze. Learning Word Meta-Embeddings. ACL'2016, PDF
Wenpeng Yin, Sebastian Ebert, Hinrich Schütze. Attention-Based Convolutional Neural Network for Machine Comprehension. NAACL'2016 HCQA Workshop, PDF, extended version at Arxiv
2015:
Wenpeng Yin, Tobias Schnabel, Hinrich Schütze. Online Updating of Word Representations for Part-of-Speech Tagging. EMNLP'2015, short paper, 24%. PDF, Software
Wenpeng Yin, Hinrich Schütze. Multichannel Variable-Size Convolution for Sentence Classification. CoNLL'2015. PDF.
Wenpeng Yin, Hinrich Schütze. MultiGranCNN: An Architecture for General Matching of Text Chunks on Multiple Levels of Granularity. ACL'2015. PDF.
Min Yang, Wenting Tu, Wenpeng Yin, Ziyu Lu. Deep Markov Neural Network for Sequential Data Classification. ACL'2015. PDF
Wenpeng Yin, Yulong Pei. Optimizing Sentence Modelling and Selection for Document Summarization. IJCAI'2015. PDF.
Wenpeng Yin, Hinrich Schütze. Convolutional Neural Network for Paraphrase Identification. NAACL-HLT'2015. PDF.
Wenpeng Yin, Hinrich Schütze. Discriminative Phrase Embedding for Paraphrase Identification. NAACL-HLT'2015. PDF.
Min Yang, Wenting Tu, Ziyu Lu, Wenpeng Yin, Kam-Pui Chow. LCCT: A Semi-supervised Model for Sentiment Classification. NAACL-HLT'2015. PDF.
Zhiang Wu, Jie Cao, Guixiang Zhu, Wenpeng Yin, Alfredo Cuzzocrea, Jin Shi. Detecting overlapping communities in poly-relational networks. World Wide Web.
2014:
Wenpeng Yin, Hinrich Schütze. An Exploration of Embeddings for Generalized Phrases. ACL'2014 Student Research workshop. PDF Phrase Embeddings Download
Wenpeng Yin, Hinrich Schütze. Deep Learning Embeddings for Discontinuous Linguistic Units. International Conference on Learning Representations (ICLR'2014 workshop). PDF
Before Ph.D. Study:
Zhi'ang Wu, Wenpeng Yin, Jie Cao, Guandong Xu, Alfredo Cuzzocrea. Community Detection in Multi-relational Social Networks. The 14th International Conference on Web Information Systems Engineering (WISE'2013). (Full Paper, 24.7%; One of the Best Papers) PDF
Wenpeng Yin, Yulong Pei, Fan Zhang, Lian’en Huang. SentTopic-MultiRank: A Novel Ranking Model for Multi-document Summarization. COLING'2012. PDF
Wenpeng Yin, Lifu Huang, Yulong Pei, Lian’en Huang. RelationListwise for query-focused multi-document summarization. COLING'2012. PDF
Yulong Pei, Wenpeng Yin, Qifeng Fan, Lian’en Huang. A supervised aggregation framework for multi-document summarization. COLING'2012. PDF
Wenpeng Yin, Yulong Pei, Fan Zhang, Lian’en Huang. Query-Focused Multi-document Summarization Based on Query-Sensitive Feature Space. CIKM'2012. PDF
Yulong Pei, Wenpeng Yin, Lian’en Huang. Generic Multi-Document Summarization Using Topic-Oriented Information. The 12th Pacific Rim International Conference on Artificial Intelligence (PRICAI'2012). 435-446. (Full Paper, 25%) PDF
Wenpeng Yin, Yulong Pei, Lian’en Huang. Automatic Multi-document Summarization Based on New Sentence Similarity Measures. The 12th Pacific Rim International Conference on Artificial Intelligence (PRICAI'2012). 832-837. (Short Paper, 25%) PDF