default search action
Daniel Deutsch
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c21]Marcel Nawrath, Agnieszka Nowak, Tristan Ratz, Danilo C. Walenta, Juri Opitz, Leonardo F. R. Ribeiro, João Sedoc, Daniel Deutsch, Simon Mille, Yixin Liu, Sebastian Gehrmann, Lining Zhang, Saad Mahamood, Miruna Clinciu, Khyathi Raghavi Chandu, Yufang Hou:
On the Role of Summary Content Units in Text Summarization Evaluation. NAACL (Short Papers) 2024: 272-281 - [c20]Wenda Xu, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Biao Zhang, Zhongtao Liu, William Yang Wang, Lei Li, Markus Freitag:
LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback. NAACL-HLT (Findings) 2024: 1429-1445 - [c19]Parker Riley, Daniel Deutsch, George F. Foster, Viresh Ratnakar, Ali Dabirmoghaddam, Markus Freitag:
Finding Replicable Human Evaluations via Stable Ranking Probability. NAACL-HLT 2024: 4908-4919 - [i20]Parker Riley, Daniel Deutsch, George F. Foster, Viresh Ratnakar, Ali Dabirmoghaddam, Markus Freitag:
Finding Replicable Human Evaluations via Stable Ranking Probability. CoRR abs/2404.01474 (2024) - [i19]Marcel Nawrath, Agnieszka Nowak, Tristan Ratz, Danilo C. Walenta, Juri Opitz, Leonardo F. R. Ribeiro, João Sedoc, Daniel Deutsch, Simon Mille, Yixin Liu, Lining Zhang, Sebastian Gehrmann, Saad Mahamood, Miruna Clinciu, Khyathi Raghavi Chandu, Yufang Hou:
On the Role of Summary Content Units in Text Summarization Evaluation. CoRR abs/2404.01701 (2024) - [i18]Brian Thompson, Nitika Mathur, Daniel Deutsch, Huda Khayrallah:
Improving Statistical Significance in Human Evaluation of Automatic Metrics via Soft Pairwise Accuracy. CoRR abs/2409.09598 (2024) - 2023
- [c18]Lining Zhang, Simon Mille, Yufang Hou, Daniel Deutsch, Elizabeth Clark, Yixin Liu, Saad Mahamood, Sebastian Gehrmann, Miruna Clinciu, Khyathi Raghavi Chandu, João Sedoc:
A Needle in a Haystack: An Analysis of High-Agreement Workers on MTurk for Summarization. ACL (1) 2023: 14944-14982 - [c17]Daniel Deutsch, Dan Roth:
Incorporating Question Answering-Based Signals into Abstractive Summarization via Salient Span Selection. EACL 2023: 575-588 - [c16]Daniel Deutsch, George F. Foster, Markus Freitag:
Ties Matter: Meta-Evaluating Modern Metrics with Pairwise Accuracy and Tie Calibration. EMNLP 2023: 12914-12929 - [c15]Christoph Leiter, Juri Opitz, Daniel Deutsch, Yang Gao, Rotem Dror, Steffen Eger:
The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics. Eval4NLP 2023: 117-138 - [c14]Jan-Thorsten Peter, David Vilar, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Markus Freitag:
There's No Data like Better Data: Using QE Metrics for MT Data Filtering. WMT 2023: 561-577 - [c13]Markus Freitag, Nitika Mathur, Chi-kiu Lo, Eleftherios Avramidis, Ricardo Rei, Brian Thompson, Tom Kocmi, Frédéric Blain, Daniel Deutsch, Craig Stewart, Chrysoula Zerva, Sheila Castilho, Alon Lavie, George F. Foster:
Results of WMT23 Metrics Shared Task: Metrics Might Be Guilty but References Are Not Innocent. WMT 2023: 578-628 - [c12]Juraj Juraska, Mara Finkelstein, Daniel Deutsch, Aditya Siddhant, Mehdi Mirzazadeh, Markus Freitag:
MetricX-23: The Google Submission to the WMT 2023 Metrics Shared Task. WMT 2023: 756-767 - [c11]Subhajit Naskar, Daniel Deutsch, Markus Freitag:
Quality Estimation Using Minimum Bayes Risk. WMT 2023: 806-811 - [c10]Daniel Deutsch, Juraj Juraska, Mara Finkelstein, Markus Freitag:
Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph Level. WMT 2023: 996-1013 - [c9]Patrick Fernandes, Daniel Deutsch, Mara Finkelstein, Parker Riley, André Martins, Graham Neubig, Ankush Garg, Jonathan H. Clark, Markus Freitag, Orhan Firat:
The Devil Is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation. WMT 2023: 1066-1083 - [e2]Daniel Deutsch, Rotem Dror, Steffen Eger, Yang Gao, Christoph Leiter, Juri Opitz, Andreas Rücklé:
Proceedings of the 4th Workshop on Evaluation and Comparison of NLP Systems, Eval4NLP 2023, Bali, Indonesia, November 1, 2023. Association for Computational Linguistics 2023, ISBN 979-8-89176-021-9 [contents] - [i17]Daniel Deutsch, George F. Foster, Markus Freitag:
Ties Matter: Modifying Kendall's Tau for Modern Metric Meta-Evaluation. CoRR abs/2305.14324 (2023) - [i16]Patrick Fernandes, Daniel Deutsch, Mara Finkelstein, Parker Riley, André F. T. Martins, Graham Neubig, Ankush Garg, Jonathan H. Clark, Markus Freitag, Orhan Firat:
The Devil is in the Errors: Leveraging Large Language Models for Fine-grained Machine Translation Evaluation. CoRR abs/2308.07286 (2023) - [i15]Daniel Deutsch, Juraj Juraska, Mara Finkelstein, Markus Freitag:
Training and Meta-Evaluating Machine Translation Evaluation Metrics at the Paragraph Level. CoRR abs/2308.13506 (2023) - [i14]Christoph Leiter, Juri Opitz, Daniel Deutsch, Yang Gao, Rotem Dror, Steffen Eger:
The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics. CoRR abs/2310.19792 (2023) - [i13]Jan-Thorsten Peter, David Vilar, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Markus Freitag:
There's no Data Like Better Data: Using QE Metrics for MT Data Filtering. CoRR abs/2311.05350 (2023) - [i12]Wenda Xu, Daniel Deutsch, Mara Finkelstein, Juraj Juraska, Biao Zhang, Zhongtao Liu, William Yang Wang, Lei Li, Markus Freitag:
Pinpoint, Not Criticize: Refining Large Language Models via Fine-Grained Actionable Feedback. CoRR abs/2311.09336 (2023) - 2022
- [c8]Daniel Deutsch, Dan Roth:
Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics. ACL (Findings) 2022: 3759-3765 - [c7]Daniel Deutsch, Rotem Dror, Dan Roth:
On the Limitations of Reference-Free Evaluations of Generated Text. EMNLP 2022: 10960-10977 - [c6]Daniel Deutsch, Rotem Dror, Dan Roth:
Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics. NAACL-HLT 2022: 6038-6052 - [e1]Daniel Deutsch, Can Udomcharoenchaikit, Juri Opitz, Yang Gao, Marina Fomicheva, Steffen Eger:
Proceedings of the 3rd Workshop on Evaluation and Comparison of NLP Systems, Eval4NLP 2022, Online, November 20, 2022. Association for Computational Linguistics 2022, ISBN 978-1-959429-00-5 [contents] - [i11]Daniel Deutsch, Dan Roth:
Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics. CoRR abs/2204.10206 (2022) - [i10]Daniel Deutsch, Rotem Dror, Dan Roth:
Re-Examining System-Level Correlations of Automatic Summarization Evaluation Metrics. CoRR abs/2204.10216 (2022) - [i9]Daniel Deutsch, Dan Roth:
Repro: An Open-Source Library for Improving the Reproducibility and Usability of Publicly Available Research Code. CoRR abs/2204.13848 (2022) - [i8]Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir R. Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter, Genta Indra Winata, Hendrik Strobelt, Hiroaki Hayashi, Jekaterina Novikova, Jenna Kanerva, Jenny Chim, Jiawei Zhou, Jordan Clive, Joshua Maynez, João Sedoc, Juraj Juraska, Kaustubh D. Dhole, Khyathi Raghavi Chandu, Laura Perez-Beltrachini, Leonardo F. R. Ribeiro, Lewis Tunstall, Li Zhang, Mahima Pushkarna, Mathias Creutz, Michael White, Mihir Sanjay Kale, Moussa Kamal Eddine, Nico Daheim, Nishant Subramani, Ondrej Dusek, Paul Pu Liang, Pawan Sasanka Ammanamanchi, Qi Zhu, Ratish Puduppully, Reno Kriz, Rifat Shahriyar, Ronald Cardenas, Saad Mahamood, Salomey Osei, Samuel Cahyawijaya, Sanja Stajner, Sébastien Montella, Shailza Jolly, Simon Mille, Tahmid Hasan, Tianhao Shen, Tosin P. AMahidewumi, Vikas Raunak, Vipul Raheja, Vitaly Nikolaev, Vivian Tsai, Yacine Jernite, Ying Xu, Yisi Sang, Yixin Liu, Yufang Hou:
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code. CoRR abs/2206.11249 (2022) - [i7]Daniel Deutsch, Rotem Dror, Dan Roth:
On the Limitations of Reference-Free Evaluations of Generated Text. CoRR abs/2210.12563 (2022) - [i6]Lining Zhang, João Sedoc, Simon Mille, Yufang Hou, Sebastian Gehrmann, Daniel Deutsch, Elizabeth Clark, Yixin Liu, Miruna Clinciu, Saad Mahamood, Khyathi Raghavi Chandu:
Needle in a Haystack: An Analysis of Finding Qualified Workers on MTurk for Summarization. CoRR abs/2212.10397 (2022) - 2021
- [j2]Daniel Deutsch, Tania Bedrax-Weiss, Dan Roth:
Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary. Trans. Assoc. Comput. Linguistics 9: 774-789 (2021) - [j1]Daniel Deutsch, Rotem Dror, Dan Roth:
A Statistical Analysis of Summarization Evaluation Metrics Using Resampling Methods. Trans. Assoc. Comput. Linguistics 9: 1132-1146 (2021) - [c5]Daniel Deutsch, Dan Roth:
Understanding the Extent to which Content Quality Metrics Measure the Information Quality of Summaries. CoNLL 2021: 300-309 - [i5]Daniel Deutsch, Rotem Dror, Dan Roth:
A Statistical Analysis of Summarization Evaluation Metrics using Resampling Methods. CoRR abs/2104.00054 (2021) - [i4]Daniel Deutsch, Dan Roth:
Question-Based Salient Span Selection for More Controllable Text Summarization. CoRR abs/2111.07935 (2021) - 2020
- [c4]Disha Jindal, Daniel Deutsch, Dan Roth:
Is Killed More Significant than Fled? A Contextual Model for Salient Event Detection. COLING 2020: 114-124 - [i3]Daniel Deutsch, Dan Roth:
SacreROUGE: An Open-Source Library for Using and Developing Summarization Evaluation Metrics. CoRR abs/2007.05374 (2020) - [i2]Daniel Deutsch, Tania Bedrax-Weiss, Dan Roth:
Towards Question-Answering as an Automatic Metric for Evaluating the Content Quality of a Summary. CoRR abs/2010.00490 (2020) - [i1]Daniel Deutsch, Dan Roth:
Understanding the Extent to which Summarization Evaluation Metrics Measure the Information Quality of Summaries. CoRR abs/2010.12495 (2020)
2010 – 2019
- 2019
- [c3]Daniel Deutsch, Shyam Upadhyay, Dan Roth:
A General-Purpose Algorithm for Constrained Sequential Inference. CoNLL 2019: 482-492 - [c2]Daniel Deutsch, Dan Roth:
Summary Cloze: A New Task for Content Selection in Topic-Focused Summarization. EMNLP/IJCNLP (1) 2019: 3718-3727 - 2018
- [c1]Daniel Deutsch, John Hewitt, Dan Roth:
A Distributional and Orthographic Aggregation Model for English Derivational Morphology. ACL (1) 2018: 1938-1947
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-15 20:46 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint