Publications

Note: * indicates the papers that I closely supervised for the first student author(s).

PhD Thesis

Unrestricted Bridging Resolution
Yufang Hou. PhD thesis, Heidelberg University, 2016
[pdf] [bib]

Journal Articles

*Holmes: Benchmark the Linguistic Competence of Language Models
Andreas Waldis, Yotam Perlitz, Leshem Choshen, Yufang Hou, Iryna Gurevych.
In TACL, 2024
[pdf][bib]

Privacy-aware Supervised Classification: An Informative Subspace Based Multi-objective Approach
Chandan Biswas, Debasis Ganguly, Partha Sarathi Mukherjee, Ujjwal Bhattacharya, Yufang Hou.
In Pattern Recognition, 2022
[pdf] [bib]

An Autonomous Debating System
Noam Slonim, Yonatan Bilu, Carlos Alzate, Roy Bar-Haim, Ben Bogin, Francesca Bonin, Leshem Choshen, Edo Cohen-Karlik, Lena Dankin, Lilach Edelstein, Liat Ein-Dor, Roni Friedman-Melamed, Assaf Gavron, Ariel Gera, Martin Gleize, Shai Gretz, Dan Gutfreund, Alon Halfon, Daniel Hershcovich, Ron Hoory, Yufang Hou, Shay Hummel, Michal Jacovi, Charles Jochim, Yoav Kantor, Yoav Katz, David Konopnicki, Zvi Kons, Lili Kotlerman, Dalia Krieger, Dan Lahav, Tamar Lavee, Ran Levy, Naftali Liberman, Yosi Mass, Amir Menczel, Shachar Mirkin, Guy Moshkowich, Shila Ofek-Koifman, Matan Orbach, Ella Rabinovich, Ruty Rinott, Slava Shechtman, Dafna Sheinwald, Eyal Shnarch, Ilya Shnayderman, Aya Soffer, Artem Spector, Benjamin Sznajder, Assaf Toledo, Orith Toledo-Ronen, Elad Venezian, Ranit Aharonov.
In Nature, 2021
[pdf] [bib]

Unrestricted Bridging Resolution
Yufang Hou, Katja Markert, Michael Strube.
In Computational Linguistics, 2018
[pdf] [bib]

Preprints

*Grounding Fallacies Misrepresenting Scientific Publications in Evidence
Max Glockner, Yufang Hou, Preslav Nakov, Iryna Gurevych.
In ArXiv, 2024
[pdf]

Conference/Workshop Papers

2024

WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia
Yufang Hou, Alessandra Pascale, Javier Carnerero-Cano, Tigran Tchrakian, Radu Marinescu, Elizabeth Daly, Inkit Padhi, Prasanna Sattigeri.
In NeurIPS D&B Track, 2024
[pdf][bib]

*Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards
Furkan Sahinuç, Thy Thy Tran, Yulia Grishina, Yufang Hou, Bei Chen, Iryna Gurevych.
In EMNLP, 2024
[pdf][bib]

*SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement
Ishani Mondal, Zongxia Li, Yufang Hou, Anandhavelu Natarajan, Aparna Garimella, Jordan Lee Boyd-Graber.
In EMNLP Findings, 2024
[pdf][bib]

*MISSCI: Reconstructing Fallacies in Misrepresented Science
Max Glockner, Yufang Hou, Preslav Nakov, Iryna Gurevych.
In ACL, 2024
[pdf][bib]

*Systematic Task Exploration with LLMs: A Study in Citation Text Generation
Furkan Şahinuç, Ilia Kuznetsov, Yufang Hou, Iryna Gurevych.
In ACL, 2024
[pdf][bib]

*How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field Study
Andreas Waldis, Yufang Hou, Iryna Gurevych.
In ACL, 2024
[pdf][bib]

A Course Shared Task on Evaluating LLM Output for Clinical Questions
Yufang Hou, Thy Thy Tran, Doan Nam Long Vu, Yiwen Cao, Kai Li, Lukas Rohde, Iryna Gurevych.
In Proceedings of the 6th Workshop on Teaching NLP at ACL 2024
[pdf][bib]

*Beyond Abstracts: A New Dataset, Prompt Design Strategy and Method for Biomedical Synthesis Generation
James O’Doherty, Cian Nolan, Yufang Hou, Anya Belz.
In ACL Student Research Workshop, 2024
[pdf][bib]

*On the Role of Summary Content Units in Text Summarization Evaluation
Marcel Nawrath, Agnieszka Wiktoria Nowak, Tristan Ratz, Danilo Constantin Walenta, Juri Opitz, Leonardo F. R. Ribeiro, João Sedoc, Daniel Deutsch, Simon Mille, Yixin Liu, Sebastian Gehrmann, Lining Zhang, Saad Mahamood, Miruna Clinciu, Khyathi Chandu, Yufang Hou.
In NAACL, 2024
[pdf][bib]

*Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization
Andreas Waldis, Yufang Hou, Iryna Gurevych.
In EACL Findings, 2024
[pdf][bib]

2023

*A Diachronic Analysis of Paradigm Shifts in NLP Research: When, How, and Why?
Aniket Pramanick, Yufang Hou, Saif M. Mohammad, Iryna Gurevych.
In EMNLP, 2023
[pdf][bib]

*`Don’t Get Too Technical with Me’: A Discourse Structure-Based Framework for Automatic Science Journalism
Ronald Cardenas, Bingsheng Yao, Dakuo Wang, Yufang Hou.
In EMNLP, 2023
[pdf][bib]

*CiteBench: A Benchmark for Scientific Citation Text Generation
Martin Funkquist, Ilia Kuznetsov, Yufang Hou, Iryna Gurevych.
In EMNLP, 2023
[pdf][bib]

*PairSpanBERT: An Enhanced Language Model for Bridging Resolution
Hideo Kobayashi, Yufang Hou, Vincent Ng.
In ACL, 2023
[pdf][bib]

*Are Fairy Tales Fair? Analyzing Gender Bias in Temporal Narrative Event Chains of Children’s Fairy Tales
Paulina Toro Isaza, Guangxuan Xu, Toye Oloko, Yufang Hou, Nanyun Peng, Dakuo Wang.
In ACL, 2023
[pdf][bib]

*Matching Pairs: Attributing Fine-Tuned Models to their Pre-Trained Large Language Models
Myles Foley, Ambrish Rawat, Taesung Lee, Yufang Hou, Gabriele Picco and Giulio Zizzo.
In ACL, 2023
[pdf][bib]

A Needle in a Haystack: An Analysis of Finding Qualified Workers on MTurk for Summarization
Lining Zhang, Simon Mille, Yufang Hou, Sebastian Gehrmann, Daniel Deutsch, Elizabeth Clark, Yixin Liu, Miruna Clinciu, Saad Mahamood, Khyathi Raghavi Chandu, João Sedoc.
In ACL, 2023
[pdf][bib]

LOWRECORP: the Low-Resource NLG Corpus Building Challenge
Khyathi Raghavi Chandu, David M. Howcroft, Dimitra Gkatzia, Yi-Ling Chung, Yufang Hou, Chris Chinenye Emezue, Pawan Rajpoot, Tosin Adewumi.
In INLG, 2023
[pdf][bib]

2022

*Missing Counter–Evidence Render NLP Fact-Checking Unrealistic for Misinformation
Max Glockner, Yufang Hou, Iryna Gurevych.
In EMNLP, 2022
[pdf][bib]

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code
Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter, Genta Indra Winata, Hendrik Strobelt, Hiroaki Hayashi, Jekaterina Novikova, Jenna Kanerva, Jenny Chim, Jiawei Zhou, Jordan Clive, Joshua Maynez, João Sedoc, Juraj Juraska, Kaustubh Dhole, Khyathi Raghavi Chandu, Leonardo FR Ribeiro, Lewis Tunstall, Li Zhang, Mahima Pushkarna, Mathias Creutz, Michael White, Mihir Sanjay Kale, Moussa Kamal Eddine, Nico Daheim, Nishant Subramani, Ondrej Dusek, Paul Pu Liang, Pawan Sasanka Ammanamanchi, Qi Zhu, Ratish Puduppully, Reno Kriz, Rifat Shahriyar, Ronald Cardenas, Saad Mahamood, Salomey Osei, Samuel Cahyawijaya, Sanja Štajner, Sebastien Montella, Shailza Jolly, Simon Mille, Tahmid Hasan, Tianhao Shen, Tosin Adewumi, Vikas Raunak, Vipul Raheja, Vitaly Nikolaev, Vivian Tsai, Yacine Jernite, Ying Xu, Yisi Sang, Yixin Liu, Yufang Hou.
In EMNLP Demo Track, 2022
[pdf][bib]

*End-to-End Neural Bridging Resolution
Hideo Kobayashi, Yufang Hou, Vincent Ng.
In COLING, 2022
[pdf][bib]

*Constrained Multi-Task Learning for Bridging Resolution
Hideo Kobayashi, Yufang Hou, Vincent Ng.
In ACL, 2022
[pdf] [bib]

*Educational Question Generation of Children Storybooks via QuestionType Distribution Learning and Event-centric Summarization
Zhenjie Zhao, Yufang Hou, Dakuo Wang, Mo Yu, Chengzhong Liu, Xiaojuan Ma.
In ACL, 2022
[pdf] [bib]

Fantastic Questions and Where to Find Them: FairytaleQA–An Authentic Dataset for Narrative Comprehension
Ying Xu, Dakuo Wang, Mo Yu, Daniel Ritchie, Bingsheng Yao, Tongshuang Wu, Zheng Zhang, Toby Jia-Jun Li, Nora Bradford, Branda Sun, Tran Bao Hoang, Yisi Sang, Yufang Hou, Xiaojuan Ma, Diyi Yang, Nanyun Peng, Zhou Yu, Mark Warschauer.
In ACL, 2022
[pdf] [bib]

*Finding Sub-task Structure with Natural Language Instruction
Ryokan Ri, Yufang Hou, Radu Marinescu, Akihiro Kishimoto.
In the First Workshop on Learning with Natural Language Supervision at ACL 2022
[pdf] [bib]

2021

Ensemble Graph Prediction for AMR Parsing
Thanh Lam Hoang, Gabriele Picco, Yufang Hou, Young-Suk Lee, Lam M. Nguyen, Dzung T. Phan, Vanessa Lopez, Ramon Fernandez Astudillo.
In NeurIPS, 2021
[pdf] [bib]

End-to-end Neural Information Status Classification
Yufang Hou.
In EMNLP Findings, 2021
[pdf] [bib]

HAConvGNN: Hierarchical Attention Based Convolutional Graph Neural Network for Code Documentation Generation in Jupyter Notebooks
Xuye Liu, Dakuo Wang, April Wang, Yufang Hou, Lingfei Wu.
In EMNLP Findings, 2021
[pdf] [bib]

*Employing Argumentation Knowledge Graphs for Neural Argument Generation
Khalid Al Khatib, Lukas Trautner, Henning Wachsmuth, Yufang Hou, Benno Stein.
In ACL, 2021
[pdf] [bib]

*End-to-End Construction of NLP Knowledge Graph
Ishani Mondal, Yufang Hou, Charles Jochim.
In ACL Findings, 2021
[pdf] [bib]

*D2S: Automated Slide Generation With Query-based Text Summarization From Documents
Edward Sun, Yufang Hou, Dakuo Wang, Yunfeng Zhang and Nancy Xin Ru Wang.
In NAACL, 2021
[pdf] [bib]

*Probing for Bridging Inference in Transformer Language Models
Onkar Pandit, Yufang Hou.
In NAACL, 2021
[pdf] [bib]

TDMSci: A Specialized Corpus for Scientific Literature Entity Tagging of Tasks Datasets and Metrics
Yufang Hou, Charles Jochim, Martin Gleize, Francesca Bonin, Debasis Ganguly.
In EACL, 2021
[pdf] [bib]

Outcome Prediction from Behaviour Change Intervention Evaluations using a Combination of Node and Word Embedding
Debasis Ganguly, Martin Gleize, Yufang Hou, Charles Jochim, Francesca Bonin, Alessandra Pascale, Pierpaolo Tommasi, Pol Mac Aonghusa, Susan Michie, Robert West, Mike Kelly.
In AMIA Symposium, 2021
[pdf] [bib]

Overview of the 2021 Key Point Analysis Shared Task
Roni Friedman, Lena Dankin, Yufang Hou, Ranit Aharonov, Yoav Katz, Noam Slonim.
In the 8th Workshop on Argument Mining at EMNLP 2021
[pdf] [bib]

Argument Mining for Scholarly Document Processing: Taking Stock and Looking Ahead
Khalid Al Khatib, Tirthankar Ghosal, Yufang Hou, Anita de Waard, Dayne Freitag.
In the Second Workshop on Scholarly Document Processing at NAACL 2021
[pdf] [bib]

The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics
Sebastian Gehrmann, Tosin Adewumi, Karmanya Aggarwal, Pawan Sasanka Ammanamanchi, Aremu Anuoluwapo, Antoine Bosselut, Khyathi Raghavi Chandu, Miruna Clinciu, Dipanjan Das, Kaustubh D Dhole, Wanyu Du, Esin Durmus, Ondřej Dušek, Chris Emezue, Varun Gangal, Cristina Garbacea, Tatsunori Hashimoto, Yufang Hou, Yacine Jernite, Harsh Jhamtani, Yangfeng Ji, Shailza Jolly, Dhruv Kumar, Faisal Ladhak, Aman Madaan, Mounica Maddela, Khyati Mahajan, Saad Mahamood, Bodhisattwa Prasad Majumder, Pedro Henrique Martins, Angelina McMillan-Major, Simon Mille, Emiel van Miltenburg, Moin Nadeem, Shashi Narayan, Vitaly Nikolaev, Rubungo Andre Niyongabo, Salomey Osei, Ankur Parikh, Laura Perez-Beltrachini, Niranjan Ramesh Rao, Vikas Raunak, Juan Diego Rodriguez, Sashank Santhanam, João Sedoc, Thibault Sellam, Samira Shaikh, Anastasia Shimorina, Marco Antonio Sobrevilla Cabezudo, Hendrik Strobelt, Nishant Subramani, Wei Xu, Diyi Yang, Akhila Yerukola, Jiawei Zhou.
In the 1st Workshop on Natural Language Generation, Evaluation, and Metrics at ACL 2021
[pdf] [bib]

2020

Bridging Anaphora Resolution as Question Answering
Yufang Hou.
In ACL, 2020
[pdf] [bib]

Fine-grained Information Status Classification Using Discourse Context-Aware BERT
Yufang Hou.
In COLING, 2020
[pdf] [bib]

*End-to-End Argumentation Knowledge Graph Construction
Khalid Al-Khatib, Yufang Hou, Henning Wachsmuth, Charles Jochim, Francesca Bonin, Benno Stein.
In AAAI, 2020
[pdf] [bib]

Corpus Wide Argument Mining–A Working Solution
Liat Ein-Dor, Eyal Shnarch, Lena Dankin, Alon Halfon, Benjamin Sznajder, Ariel Gera, Carlos Alzate, Martin Gleize, Leshem Choshen, Yufang Hou, Yonatan Bilu, Ranit Aharonov, Noam Slonim.
In AAAI, 2020
[pdf] [bib]

HBCP Corpus: A New Resource for the Analysis of Behaviour ChangeIntervention Reports
Francesca Bonin, Ailbhe N. Finnerty, Candice Moore, Charles Jochim, Emma Norris, Yufang Hou, Martin Gleize, Debasis Ganguly, Alison J. Wright, Emily Hayes, Silje Zink, Alessandra Pascale, Pol Mac Aonghusa, Susan Michie.
In LREC, 2020
[pdf] [bib]

Knowledge Extraction and Prediction from Behavior Science Randomized Controlled Trials: A Case-Study in Smoking Cessation
Francesca Bonin, Martin Gleize, Yufang Hou, Debasis Ganguly, Ailbhe Finnerty, Charles Jochim, Alessandra Pascale, Pierpaolo Tommasi, Pol Mac Aonghusa, Susan Michie.
In AMIA Symposium, 2020
[pdf] [bib]

2019

Identification of Tasks, Datasets, Evaluation Metrics, and Numeric Scores for Scientific Leaderboards Construction
Yufang Hou, Charles Jochim, Martin Gleize, Francesca Bonin, Debasis Ganguly.
In ACL, 2019
[pdf] [bib]

A Summarization System for Scientific Documents
Shai Erera, Michal Shmueli-Scheuer, Guy Feigenblat, Ora Peled Nakash, Odellia Boni, Haggai Roitman, Doron Cohen, Bar Weiner, Yosi Mass, Or Rivlin, Guy Lev, Achiya Jerbi, Jonathan Herzig, Yufang Hou, Charles Jochim, Martin Gleize, Francesca Bonin, David Konopnicki.
In EMNLP Demo Track, 2019
[pdf] [bib]

Extracting Factual Min/Max Age Information from Clinical Trial Studies
Yufang Hou, Debasis Ganguly, Lea A Deleris, Francesca Bonin.
In the 2nd Clinical Natural Language Processing Workshop at NAACL 2019
[pdf] [bib]

Information Extraction of Behavior Change Intervention Descriptions
Debasis Ganguly, Yufang Hou, Lea A. Deleris, Francesca Bonin.
In AMIA Informatics Summit, 2019
[pdf] [bib]

2018

Will it Blend? Blending Weak and Strong Labeled Data in a Neural Network for Argumentation Mining
Eyal Shnarch, Carlos Alzate, Lena Dankin, Martin Gleize, Yufang Hou, Leshem Choshen, Ranit Aharonov, Noam Slonim.
In ACL, 2018
[pdf] [bib]

A Deterministic Algorithm for Bridging Anaphora Resolution
Yufang Hou.
In EMNLP, 2018
[pdf] [bib]

Enhanced Word Representations for Bridging Anaphora Resolution
Yufang Hou.
In NAACL, 2018
[pdf] [bib]

Know Who Your Friends Are: Understanding Social Connections from Unstructured Text
Lea A. Deleris, Francesca Bonin, Elizabeth Daly, Stephane Deparis, Yufang Hou, Charles Jochim, Yassine Lassoued and Killian Levacher.
In NAACL Demo Track, 2018
[pdf] [bib]

2017

Computational Argumentation Quality Assessment in Natural Language
Henning Wachsmuth, Nona Naderi, Yufang Hou, Yonatan Bilu, Vinodkumar Prabhakaran, Graeme Hirst, Benno Stein.
In EACL, 2017
[pdf] [bib]

Argumentation Quality Assessment: Theory vs. Practice
Henning Wachsmuth, Nona Naderi, Ivan Habernal, Yufang Hou, Graeme Hirst, Iryna Gurevych, Benno Stein.
In ACL, 2017
[pdf] [bib]

Argument Relation Classification Using a Joint Inference Model
Yufang Hou, Charles Jochim.
In the 4th Workshop on Argument Mining at EMNLP 2017
[pdf] [bib]

*The Cool Cucumber System at the 2017 TAC KBP BeSt Evaluation
Thanh-Son Nguyen, Yufang Hou, Charles Jochim, Elizabeth M. Daly, Lea A Deleris.
In TAC KBP, 2017
[pdf] [bib]

2016

Incremental Fine-grained Information Status Classification Using Attention-based LSTMs
Yufang Hou.
In COLING, 2016
[pdf] [bib]

2015

Analyzing Sentiment in Classical Chinese Poetry
Yufang Hou, Anette Frank.
In the 9th SIGHUM Workshop on Language Technology for Cultural Heritage at ACL 2015
[pdf] [bib]

2014

A Rule-Based System for Unrestricted Bridging Resolution: Recognizing Bridging Anaphora and Finding Links to Antecedents
Yufang Hou, Katja Markert, Michael Strube.
In EMNLP, 2014
[pdf] [bib]

2013

Global Inference for Bridging Anaphora Resolution
Yufang Hou, Katja Markert, Michael Strube.
In NAACL, 2013
[pdf] [bib]

Cascading Collective Classification for Bridging Anaphora Recognition using a Rich Linguistic Feature Set
Yufang Hou, Katja Markert, Michael Strube.
In EMNLP, 2013
[pdf] [bib]

2012

Collective Classification for Fine-grained Information Status
Katja Markert, Yufang Hou, Michael Strube.
In ACL, 2012
[pdf] [bib]