publications

Selected publications of NALA members (since 2020).

2024

  1. Zero-Shot vs. Translation-Based Cross-Lingual Transfer: The Case of Lexical Gaps
    Abteen Ebrahimi, and Katharina von der Wense
    In Proceedings of the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (to appear), 2024
  2. Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget
    Minh Duc Bui, Fabian David Schmidt, Goran Glavaš, and Katharina von der Wense
    In Proceedings of the Workshop on Insights from Negative Results in NLP (to appear), 2024
  3. The Trade-off between Performance, Efficiency, and Fairness in Adapter Modules for Text Classification
    Minh Duc Bui, and Katharina von der Wense
    In Proceedings of the Fourth Workshop on Trustworthy Natural Language Processing (to appear), 2024
  4. NLP for Language Documentation: Two Reasons for the Gap between Theory and Practice
    Luke Gessler, and Katharina von der Wense
    In Proceedings of the 4th Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP) (to appear), 2024
  5. JGU Mainz’s Submission to the AmericasNLP 2024 Shared Task on the Creation of Educational Materials for Indigenous Languages
    Minh Duc Bui, and Katharina von der Wense
    In Proceedings of the 4th Workshop on Natural Language Processing for Indigenous Languages of the Americas (AmericasNLP) (to appear), 2024
  6. Quantifying the Hyperparameter Sensitivity of Neural Networks for Character-level Sequence-to-Sequence Tasks
    Adam Wiemerslage, Kyle Gorman, and Katharina von der Wense
    In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
  7. Comparing Template-based and Template-free Language Model Probing
    Sagi Shaier, Kevin Bennett, Lawrence Hunter, and Katharina von der Wense
    In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024
  8. Desiderata For The Context Use Of Question Answering Systems
    Sagi Shaier, Lawrence Hunter, and Katharina von der Wense
    In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, 2024

2023

  1. On the Automatic Generation and Simplification of Children’s Stories
    Maria Valentini, Jennifer Weber, Jesus Salcido, Téa Wright, Eliana Colunga, and Katharina von der Wense
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
  2. Emerging Challenges in Personalized Medicine: Assessing Demographic Effects on Biomedical Question Answering Systems
    Sagi Shaier, Kevin Bennett, Lawrence Hunter, and Katharina von der Wense
    In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023
  3. Who Are All The Stochastic Parrots Imitating? They Should Tell Us!
    Sagi Shaier, Lawrence Hunter, and Katharina von der Wense
    In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, 2023
  4. Findings of the CoCo4MT 2023 Shared Task on Corpus Construction for Machine Translation
    Ananya Ganesh, Marine Carpuat, William Chen, Katharina Kann, Constantine Lignos, John E. Ortega, Jonne Saleva, Shabnam Tafreshi, and Rodolfo Zevallos
    In Proceedings of the Second Workshop on Corpus Generation and Corpus Augmentation for Machine Translation, 2023
  5. Neural Machine Translation for the Indigenous Languages of the Americas: An Introduction
    Manuel Mager, Rajat Bhatnagar, Graham Neubig, Ngoc Thang Vu, and Katharina Kann
    In Proceedings of the Third Workshop on NLP for Indigenous Languages of the Americas, 2023
  6. Findings of the AmericasNLP 2023 Shared Task on Machine Translation into Indigenous Languages
    Abteen Ebrahimi, Manuel Mager, Shruti Rijhwani, Enora Rice, Arturo Oncevay, Claudia Baltazar, María Cortés, Cynthia Montaño, John E Ortega, Rolando Coto-Solano, Hilaria Cruz, Alexis Palmer, and Katharina Kann
    In Proceedings of the Third Workshop on NLP for Indigenous Languages of the Americas, 2023
  7. A Survey of Challenges and Methods in the Computational Modeling of Multi-Party Dialog
    Ananya Ganesh, Martha Palmer, and Katharina Kann
    In Proceedings of the 5th Workshop on NLP for Conversational AI, 2023
  8. Mind the Gap between the Application Track and the Real World
    Ananya Ganesh, Jie Cao, Margaret Perkoff, Rosy Southwell, Martha Palmer, and Katharina Kann
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, 2023
  9. Ethical Considerations for Machine Translation of Indigenous Languages: Giving a Voice to the Speakers
    Manuel Mager, Elisabeth Albine Mager, Katharina Kann, and Ngoc Thang Vu
    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics, 2023
  10. An Investigation of Noise in Morphological Inflection
    Adam Wiemerslage, Changbing Yang, Garrett Nicolai, Miikka Silfverberg, and Katharina Kann
    In Findings of the 61st Annual Meeting of the Association for Computational Linguistics, 2023
  11. A Comparative Analysis of Automatic Speech Recognition Errors in Small Group Classroom Discourse
    Jie Cao, Ananya Ganesh, Jon Cai, Rosy Southwell, Margaret Perkoff, Michael Reagan, Katharina Kann, James Martin, Martha Palmer, and Sidney D’Mello
    In Proceedings of the 31st ACM Conference on User Modeling, Adaptation and Personalization, 2023
  12. Navigating Wanderland: Highlighting Off-Task Discussions in Classrooms
    Ananya Ganesh, Michael Chang, Rachel Dickler, Michael Regan, Jon Cai, Kristin Wright-Bettner, James Pustejovsky, James Martin, Jeff Flanigan, Martha Palmer, and Katharina Kann
    In Proceedings of the 24th International Conference on Artificial Intelligence in Education, 2023
  13. Meeting the Needs of Low-Resource Languages: Exploring Automatic Alignments via Pretrained Models
    Abteen Ebrahimi, Arya D. McCarthy, Arturo Oncevay, John E. Ortega, Luis Chiruzzo, Rolando Coto-Solano, Gustavo A. Giménez-Lugo, and Katharina Kann
    In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022

  1. Findings of the Second AmericasNLP Competition on Speech-to-Text Translation
    Abteen Ebrahimi, Manuel Mager, Adam Wiemerslage, Pavel Denisov, Arturo Oncevay, Danni Liu, Sai Koneru, Enes Yavuz Ugan, Zhaolin Li, Jan Niehues, Monica Romero, Ivan G Torre, Tanel Alumäe, Jiaming Kong, Sergey Polezhaev, Yury Belousov, Wei-Rui Chen, Peter Sullivan, Ife Adebara, Bashar Talafha, Alcides Alcoba Inciarte, Muhammad Abdul-Mageed, Luis Chiruzzo, Rolando Coto-Solano, Hilaria Cruz, Sofía Flores-Solórzano, Aldo Andrés Alvarez López, Ivan Meza-Ruiz, John E. Ortega, Alexis Palmer, Rodolfo Joel Zevallos Salazar, Kristine Stenzel, Thang Vu, and Katharina Kann
    In Proceedings of the NeurIPS 2022 Competitions Track, 2022
  2. AmericasNLI: Machine translation and natural language inference systems for Indigenous languages of the Americas
    Katharina Kann, Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, John E. Ortega, Annette Rios, Angela Fan, Ximena Gutierrez-Vasques, Luis Chiruzzo, Gustavo A. Giménez-Lugo, Ricardo Ramos, Ivan Vladimir Meza Ruiz, Elisabeth Mager, Vishrav Chaudhary, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, and Ngoc Thang Vu
    Frontiers in Artificial Intelligence 2022
  3. A Major Obstacle for NLP Research: Let’s Talk about Time Allocation!
    Katharina Kann, Shiran Dudy, and Arya D. McCarthy
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
  4. A Comprehensive Comparison of Neural Networks as Cognitive Models of Inflection
    Adam Wiemerslage, Shiran Dudy, and Katharina Kann
    In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
  5. CHIA: CHoosing Instances to Annotate for Machine Translation
    Rajat Bhatnagar, Ananya Ganesh, and Katharina Kann
    In Findings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022
  6. Generate Me a Bedtime Story: Leveraging Natural Language Processing for Early Vocabulary Enhancement
    Trevor A. Hall, Maria Valentini, Eliana Colunga, and Katharina Kann
    In Proceedings of the Workshop on NLP for Positive Impact, 2022
  7. Machine Translation Between High-resource Languages in a Language Documentation Setting
    Katharina Kann, Abteen Ebrahimi, Kristine Stenzel, and Alexis Palmer
    In Proceedings of the First Workshop on Applying NLP to Field Linguistics, 2022
  8. Response Construct Tagging: NLP-Aided Assessment for Engineering Education
    Ananya Ganesh, Hugh Scribner, Jasdeep Singh, Katherine Goodman, Jean Hertzberg, and Katharina Kann
    In Proceedings of the 17th Workshop on Innovative Use of NLP for Building Educational Applications, 2022
  9. Open-domain Dialogue Generation: What We Can Do, Cannot Do, And Should Do Next
    Katharina Kann, Abteen Ebrahimi, Joewie J. Koh, Shiran Dudy, and Alessandro Roncone
    In Proceedings of the 4th Workshop on NLP for Conversational AI, 2022
  10. AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource Languages
    Abteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Ivan Vladimir Meza Ruiz, Gustavo A. Giménez-Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando Coto-Solano, Thang Vu, and Katharina Kann
    In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022
  11. How Does Multilingual Pretraining Affect Cross-Lingual Transferability?
    Yoshinari Fujinuma, Jordan Lee Boyd-Graber, and Katharina Kann
    In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022
  12. Morphological Processing of Low-Resource Languages: Where We Are and What’s Next
    Adam Wiemerslage, Miikka Silfverberg, Changbing Yang, Arya D. McCarthy, Garrett Nicolai, Eliana Colunga, and Katharina Kann
    In Findings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022
  13. BPE vs. Morphological Segmentation: A Case Study on Machine Translation of Four Polysynthetic Languages
    Manuel Mager, Arturo Oncevay, Elisabeth Mager, Katharina Kann, and Thang Vu
    In Findings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

2021

  1. The World of an Octopus: How Reporting Bias Influences a Language Model’s Perception of Color
    Cory Paik, Stéphane Aroca-Ouellette, Alessandro Roncone, and Katharina Kann
    In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 2021
  2. What Would a Teacher Do? Predicting Future Talk Moves
    Ananya Ganesh, Martha Palmer, and Katharina Kann
    In Findings of the 59th Annual Meeting of the Association for Computational Linguistics, 2021
  3. PROST: Physical Reasoning of Objects through Space and Time
    Stephane Aroca-Ouellette, Cory Paik, Alessandro Roncone, and Katharina Kann
    In Findings of the 59th Annual Meeting of the Association for Computational Linguistics, 2021
  4. How to Adapt Your Pretrained Multilingual Model to 1600 Languages
    Abteen Ebrahimi, and Katharina Kann
    In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics, 2021
  5. Don’t Rule Out Monolingual Speakers: A Method For Crowdsourcing Machine Translation Data
    Rajat Bhatnagar, Ananya Ganesh, and Katharina Kann
    In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics, 2021
  6. Findings of the LoResMT 2021 Shared Task on COVID and Sign Language for Low-resource Languages
    Atul Kr. Ojha, Chao-Hong Liu, Katharina Kann, John Ortega, Sheetal Shatam, and Theodorus Fransen
    In Proceedings of the 4th Workshop on Technologies for MT of Low Resource Languages (LoResMT2021), 2021
  7. Paradigm Clustering with Weighted Edit Distance
    Andrew Gerlach, Adam Wiemerslage, and Katharina Kann
    In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, 2021
  8. Findings of the SIGMORPHON 2021 Shared Task on Unsupervised Morphological Paradigm Clustering
    Adam Wiemerslage, Arya D. McCarthy, Alexander Erdmann, Garrett Nicolai, Manex Agirrezabal, Miikka Silfverberg, Mans Hulden, and Katharina Kann
    In Proceedings of the 18th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, 2021
  9. Findings of the AmericasNLP 2021 Shared Task on Open Machine Translation for Indigenous Languages of the Americas
    Manuel Mager, Arturo Oncevay, Abteen Ebrahimi, John Ortega, Annette Rios, Angela Fan, Ximena Gutierrez-Vasques, Luis Chiruzzo, Gustavo Giménez-Lugo, Ricardo Ramos, Ivan Vladimir Meza Ruiz, Rolando Coto-Solano, Alexis Palmer, Elisabeth Mager-Hois, Vishrav Chaudhary, Graham Neubig, Ngoc Thang Vu, and Katharina Kann
    In Proceedings of the First Workshop on Natural Language Processing for Indigenous Languages of the Americas, 2021
  10. Coloring the Black Box: What Synesthesia Tells Us about Character Embeddings
    Katharina Kann, and Mauro M. Monsalve-Mercado
    In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, 2021
  11. CLiMP: A Benchmark for Chinese Language Model Evaluation
    Beilei Xiang, Changbing Yang, Yu Li, Alex Warstadt, and Katharina Kann
    In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics, 2021

2020

  1. Making a Point: Pointer-Generator Transformers for Disjoint Vocabularies
    Nikhil Prabhu, and Katharina Kann
    In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 9th International Joint Conference on Natural Language Processing Student Research Workshop, 2020
    Best Paper Award
  2. English Intermediate-Task Training Improves Zero-Shot Cross-Lingual Transfer Too
    Jason Phang, Phu Mon Htut, Yada Pruksachatkun, Haokun Liu, Clara Vania, Iacer Calixto, Katharina Kann, and Samuel R. Bowman
    In Proceedings of the 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 9th International Joint Conference on Natural Language Processing, 2020
  3. Tackling the Low-resource Challenge for Canonical Segmentation
    Manuel Mager, Özlem Çetinoğlu, and Katharina Kann
    In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
  4. Acrostic Poem Generation
    Rajat Agarwal, and Katharina Kann
    In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
  5. IGT2P: From Interlinear Glossed Texts to Paradigms
    Sarah Moeller, Ling Liu, Changbing Yang, Katharina Kann, and Mans Hulden
    In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020
  6. Why Overfitting Isn’t Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries
    Mozhi Zhang, Yoshinari Fujinuma, Michael J. Paul, and Jordan Boyd-Graber
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
  7. The SIGMORPHON 2020 Shared Task on Unsupervised Morphological Paradigm Completion
    Katharina Kann, Arya D. McCarthy, Garrett Nicolai, and Mans Hulden
    In Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, 2020
  8. Frustratingly Easy Multilingual Grapheme-to-Phoneme Conversion
    Nikhil Prabhu, and Katharina Kann
    In Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, 2020
  9. The NYU-CUBoulder Systems for SIGMORPHON 2020 Task 0 and Task 2
    Assaf Singer, and Katharina Kann
    In Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, 2020
  10. The IMS–CUBoulder System for the SIGMORPHON 2020 Shared Task on Unsupervised Morphological Paradigm Completion
    Manuel Mager, and Katharina Kann
    In Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, 2020
  11. Self-Training for Unsupervised Parsing with PRPN
    Anhad Mohananey, Katharina Kann, and Samuel R. Bowman
    In Proceedings of the 16th International Conference on Parsing Technologies and the IWPT 2020 Shared Task on Parsing into Enhanced Universal Dependencies, 2020
  12. Intermediate-Task Transfer Learning with Pretrained Language Models: When and Why Does It Work?
    Yada Pruksachatkun, Jason Phang, Haokun Liu, Phu Mon Htut, Xiaoyi Zhang, Richard Yuanzhe Pang, Clara Vania, Katharina Kann, and Samuel R. Bowman
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
  13. Unsupervised Morphological Paradigm Completion
    Huiming Jin, Liwei Cai, Yihui Peng, Chen Xia, Arya McCarthy, and Katharina Kann
    In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020
  14. Learning to Learn Morphological Inflection for Resource-Poor Languages
    Katharina Kann, Samuel R. Bowman, and Kyunghyun Cho
    In Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
  15. Weakly Supervised POS Taggers Perform Poorly on Truly Low-Resource Languages
    Katharina Kann, Ophélie Lacroix, and Anders Søgaard
    In Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020
  16. Acquisition of Inflectional Morphology in Artificial Neural Networks With Prior Knowledge
    Katharina Kann
    In Proceedings of the Society for Computation in Linguistics, 2020