Nitin Madnani

2025
Span Labeling with Large Language Models: Shell vs. Meat. To appear in Proc. Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Phoebe Mulcaire and Nitin Madnani. [pdf] [bib]
2023
Local Similarity and Global Variability Characterize the Semantic Space of Human Languages. In Proceedings of the National Academy of Sciences, 120(51):e2300986120. Molly Lewis, Aoife Cahill, Nitin Madnani, and James Evans. [link]
Beyond the Repo: A Case Study on Open Source Integration with GECToR. In Proc. Workshop for Natural Language Processing Open Source Software (NLP-OSS). Sanjna Kashyap, Zhaoyang Xie, Kenneth Steimel, and Nitin Madnani. [pdf] [bib]
The Role of Robust Software in Automated Scoring. In Yaneva, V. and von Davier, M. (Eds) Advancing Natural Language Processing in Educational Assessment. NCME Educational Measurement and Assessment Book Series. Taylor & Francis. Nitin Madnani, Aoife Cahill, and Anastassia Loukina. [Volume] [Chapter]
2021
Automated Essay Scoring. In Synthesis Lectures on Human Language Technologies (Vol. 14, Issue 5, pp. 1–314). Springer Nature. Beata Beigman Klebanov and Nitin Madnani. [link]
2020
User-centered & Robust Open-source Software: Lessons Learned from Developing & Maintaining RSMTool. In Proc. Workshop for Natural Language Processing Open Source Software (NLP-OSS). Nitin Madnani and Anastassia Loukina. [pdf] [bib]
Using PRMSE to Evaluate Automated Scoring Systems in the Presence of Label Noise. In Proc. Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Anastassia Loukina, Nitin Madnani, Aoife Cahill, Lili Yao, Matthew S. Johnson, Brian Riordan, and Daniel F. McCaffrey. [pdf] [bib]
Automated Evaluation of Writing - 50 Years and Counting. In Proc. ACL. Beata Beigman Klebanov and Nitin Madnani. [pdf] [bib]
Detecting Learning in Noisy Data: The Case of Oral Reading Fluency. In Proc. 10th International Learning Analytics & Knowledge Conference (LAK). Beata Beigman Klebanov, Anastassia Loukina, John Lockwood, Van Liceralde, John Sabatini, Nitin Madnani, Binod Gyawali, Zuowei Wang and Jennifer Lentini. [pdf] [bib]
2019
My Turn To Read: An Interleaved E-book Reading Tool for Developing and Struggling Readers. In Proc. ACL (demos). Nitin Madnani, Beata Beigman Klebanov, Anastassia Loukina, Binod Gyawali, Patrick Lange, John Sabatini and Michael Flor. [pdf] [bib]
The Many Dimensions of Algorithmic Fairness in Educational Applications. In Proc. 14th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Anastassia Loukina, Nitin Madnani, and Klaus Zechner. [pdf] [bib]
Would you? Could you? On a tablet? Analytics of Children’s eBook Reading. In Proc. 9th International Learning Analytics & Knowledge Conference (LAK). Beata Beigman Klebanov, Anastassia Loukina, Nitin Madnani, John Sabatini, and Jennifer Lentini. [pdf] [bib]
2018
Evaluating On-device ASR on Field Recordings from an Interactive Reading Companion. In Proc. IEEE Workshop on Spoken Language Technology (SLT). Anastassia Loukina, Nitin Madnani, Beata Beigman Klebanov, Abhinav Misra, Georgi Angelov, and Ognjen Todic. [pdf] [bib]
Writing Mentor: Writing Progress Using Self-Regulated Writing Support. In Journal of Writing Analytics, 2:280-284. Jill Burstein, Norbert Elliot, Beata Beigman Klebanov, Nitin Madnani, Diane Napolitano, Maxwell Schwartz, Patrick Houghton, and Hillary Molloy. [pdf] [bib]
Writing Mentor: Self-Regulated Writing Feedback for Struggling Writers. In Proc. COLING (demos). Nitin Madnani, Jill Burstein, Norbert Elliot, Beata Beigman Klebanov, Diane Napolitano, Slava Andreyev, and Maxwell Schwartz. [pdf] [bib]
Automated Scoring: Beyond Natural Language Processing. In Proc. COLING. Nitin Madnani and Aoife Cahill. [pdf] [bib]
Atypical Inputs in Educational Applications. In Proc. NAACL (Industry Track). Su‐Youn Yoon, Aoife Cahill, Anastassia Loukina, Klaus Zechener, Brian Riordan, and Nitin Madnani. [pdf] [bib]
Second Language Acquisition Modeling. In Proc. 13th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Burr Settles, Chris Brust, Erin Gustafson, Masato Hagiwara and Nitin Madnani. [pdf] [bib]
The ACL Anthology: Current State and Future Directions. In Proc. ACL Workshop on NLP Open Source Software (NLPOSS). Dan Gildea, Min-Yen Kan, Nitin Madnani, Christoph Teichmann, and Martin Villalba. [pdf] [bib]
A Robust Microservice Architecture for Scaling Automated Scoring Applications. ETS Research Report Series, doi: 10.1002/ets2.12202. Nitin Madnani, Aoife Cahill, Daniel Blanchard, Slava Andreyev, Diane Napolitano, Binod Gyawali, Michael Heilman, Chong Min Lee, Chee Wee Leong, Matthew Mulholland, and Brian Riordan. [pdf] [bib]
Analyzing Item Generation with Natural Language Processing Tools for the TOEIC® Listening Test. ETS Research Report Series, doi:10.1002/ets2.12183. Su‐Youn Yoon, Chong Min Lee, Patrick Houghton, Melissa Lopez, Jennifer Sakano, Anastassia Loukina, Bob Krovetz, Chi Lu, and Nitin Madnani. [pdf] [bib]
2017
Building Better Open-source Tools to Support Fairness in Automated Scoring.In Proc. EACL Workshop on Ethics in Natural Language Processing. Nitin Madnani, Anastassia Loukina, Alina von Davier, Jill Burstein, and Aoife Cahill. [pdf] [bib]
Generating Language Activities in Real-Time for English Learners using Language Muse. In Proc. Fourth Annual ACM Conference on Learning at Scale (Short Papers). Jill Burstein, Nitin Madnani, John Sabatini, Dan McCaffrey, Kietha Biggers, and Kelsey Dreier. [pdf] [bib]
Examination of Paraphrasing Behavior in Source-Based Writing. Presented at the 27th Annual Meeting of the Society for Text and Discourse. Beata Beigman Klebanov and Nitin Madnani.
A Large Scale Quantitative Exploration of Modeling Strategies for Content Scoring. In Proc. 12th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Nitin Madnani, Anastassia Loukina, and Aoife Cahill. [pdf] [bib]
Speech- and Text-driven Features for Automated Scoring of English Speaking Tasks. In Proc. EMNLP Workshop on Speech-centric Natural Language Processing. Anastassia Loukina, Nitin Madnani, and Aoife Cahill. [pdf] [bib]
2016
Technology-Assisted Generation of Linguistically Relevant Instructional Activities to Support English Learners in Content and Language Learning. Presented at the Annual Meeting for the American Educational Research Association. John Sabatini, Jill Burstein, Nitin Madnani, and Kietha Biggers.
Prediction of Passage Acceptance/Rejection Using Linguistic Information. Presented at the the 78th Annual Meeting for the National Council on Measurement in Education. Swapna Somasundaran, Yoko Futagi, Nitin Madnani, Nancy Glazer, Matt Chametsky and Cathy Wendler.
Automatically Scoring Tests of Proficiency in Music Instruction. In Proc. 11th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Nitin Madnani, Aoife Cahill, and Brian Riordan. [pdf] [bib]
Model Combination for Correcting Preposition Selection Errors. In Proc. 11th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Nitin Madnani, Michael Heilman, and Aoife Cahill. [pdf] [bib]
The Effect of Multiple Grammatical Errors on Processing Non-native Writing. In Proc. 11th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Courtney Napoles, Aoife Cahill, and Nitin Madnani. [pdf] [bib]
Language Muse: Automated Linguistic Activity Generation for English Language Learners. In Proc. ACL (demos). Nitin Madnani, Jill Burstein, John Sabatini, Kietha Biggers, and Slava Andreyev. [pdf] [bib]
RSMTool: A Collection of Tools for Building and Evaluating Automated Scoring Models. In Journal of Open Source Software (JOSS), 1(3). Nitin Madnani and Anastassia Loukina. [html] [bib]
2015
Effective Feature Integration for Automated Short Answer Scoring. In Proc. NAACL (short papers). Keisuke Sakaguchi, Michael Heilman and Nitin Madnani. [pdf] [bib]
The Impact of Training Data on Automated Short Answer Scoring Performance. In Proc. 10th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Michael Heilman and Nitin Madnani. [pdf] [bib]
Preliminary Experiments on Crowdsourced Evaluation of Feedback Granularity. In Proc. 10th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Nitin Madnani, Martin Chodorow, Aoife Cahill, Melissa Lopez, Yoko Futagi and Yigal Attali. [pdf] [bib]
Using Automated Methods to Identify Overly Similar Discrete Items. Presented at the 77th Annual Meeting for the National Council on Measurement in Education. Nitin Madnani and Aoife Cahill.
2014
An Explicit Feedback System for Preposition Errors based on Wikipedia Revisions. In Proc. 9th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Nitin Madnani and Aoife Cahill. [pdf] [bib]
Content Importance Models for Scoring Writing From Sources. In Proc. ACL (short papers). Beata Beigman Klebanov, Nitin Madnani, Jill Burstein and Swapna Somasundaran. [pdf] [bib]
Predicting Grammaticality on an Ordinal Scale. In Proc. ACL (short papers). Michael Heilman, Aoife Cahill, Nitin Madnani, Melissa Lopez, Matthew Mulholland and Joel Tetreault. [pdf] [bib]
Bucking the Trend: Improved Evaluation and Anotation Practices for ESL Error Detection Systems. In Language Resources & Evaluation: Special Issue on Resources and Tools for Language Learners, 48(1). Joel Tetreault, Martin Chodorow and Nitin Madnani. [html]
2013
ParaQuery: Making Sense of Paraphrase Collections. In Proc. ACL (demos). Lili Kotlerman, Nitin Madnani and Aoife Cahill. [pdf] [bib]
Detecting Missing Hyphens in Learner Text. In Proc. 8th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Aoife Cahill, Martin Chodorow, Susanne Wolff and Nitin Madnani. [pdf] [bib]
Automated Scoring of a Summary-Writing Task Designed to Measure Reading Comprehension. In Proc. 8th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Nitin Madnani, Jill Burstein, John Sabatini and Tenaha O'Reilly. [pdf] [bib]
Robust Systems for Preposition Error Correction Using Wikipedia Revisions. In Proc. NAACL. Aoife Cahill, Nitin Madnani, Joel Tetreault and Diane Napolitano. [pdf] [bib]
HENRY-CORE: Domain Adaptation and Stacking for Text Similarity. In Proc. of the Second Joint Conference on Lexical and Computational Semantics (SEM), Volume 1: Proceedings of the Main Conference and the Shared Task: Semantic Textual Similarity.* Michael Heilman and Nitin Madnani. [pdf] [bib]
ETS: Domain Adaptation and Stacking for Short Answer Scoring. In Second Joint Conference on Lexical and Computational Semantics (SEM), Volume 2: Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval).* [Note: This version fixes some errors in the official ACL Anthology version.] Michael Heilman and Nitin Madnani. [pdf] [bib]
Using Pivot-based Paraphrasing and Sentiment Profiles to Improve a Subjectivity Lexicon for Essay Data. Transactions of the Association for Computational Linguistics, 1(2013):99−110. Beata Beigman Klebanov, Nitin Madnani and Jill Burstein. [pdf] [bib]
Sentiment Profiles of Multi-Word Expressions in Test-Taker Essays: The Case of Noun-Noun Compounds. ACM Transactions on Speech and Language Processing, 10(3):12. Beata Beigman Klebanov, Jill Burstein and Nitin Madnani. [html]
Sentiment Analysis and Detection for Essay Evaluation. Jill Burstein, Beata Beigman Klebanov, Nitin Madnani and Adam Faulkner. Handbook for Automated Essay Evaluation, Mark D. Shermis and Jill Burstein (eds.), Taylor and Francis. [link]
The E-rater Automated Essay Scoring System. Jill Burstein, Joel Tetreault and Nitin Madnani. Handbook for Automated Essay Scoring, Mark D. Shermis and Jill Burstein (eds.), Taylor and Francis. [link]
Generating Targeted Paraphrases for Improved Translation. ACM Transactions on Intelligent Systems and Technology, 4(3). Nitin Madnani and Bonnie Dorr. [pdf] [bib]
2012
Topical Trends in a Corpus of Persuasive Writing. ETS Research Report Series, RR-12-19. Michael Heilman and Nitin Madnani. [pdf] [bib]
Discriminative Edit Models for Paraphrase Scoring. In Proc. of the 6th International Workshop on Semantic Evaluation (SemEval). Michael Heilman and Nitin Madnani. [pdf] [bib]
Exploring Grammatical Error Correction with Not-So-Crummy Machine Translation. In Proc. 7th Workshop on Innovative Use of Natural Language Processing for Building Educational Applications (BEA). Nitin Madnani, Joel Tetreault and Martin Chodorow. [pdf] [bib]
Re-examining Machine Translation Metrics for Paraphrase Identification. In Proc. NAACL. Nitin Madnani, Joel Tetreault and Martin Chodorow. [pdf] [bib] [zip]
Identifying High Level Organizational Elements in Argumentative Discourse. In Proc. NAACL. Nitin Madnani, Michael Heilman, Joel Tetreault and Martin Chodorow. [pdf] [bib]
Building Subjectivity Lexicon(s) from Scratch for Essay Data. In Proc. CICLing. Beata Beigman Klebanov, Jill Burstein, Nitin Madnani and Adam Faulkner. [pdf] [bib]
2011
iBLEU: Interactively Scoring and Debugging Statistical Machine Translation Systems. In Proc. Fifth IEEE International Conference on Semantic Computing (Demos). Nitin Madnani. [pdf] [bib] [website]
E-rating Machine Translation. In Proc. WMT. Kristen Parton, Joel Tetreault, Nitin Madnani, Martin Chodorow. [pdf] [bib]
They Can Help: Using Crowdsourcing to Improve the Evaluation of Grammatical Error Detection Systems. In Proc. ACL (Short Papers). Nitin Madnani, Joel Tetreault, Martin Chodorow and Alla Rozovskaya. [pdf] [bib] [tgz]
The Web is not a PERSON, Berners-Lee is not an ORGANIZATION, and African-Americans are not LOCATIONS: An Analysis of the Performance of Named-Entity Recognition. In Proc. Workshop on Multiword Expressions. Bob Krovetz, Paul Deane and Nitin Madnani. [pdf] [bib]
2010
Machine Translation Evaluation and Optimization. Handbook of Natural Language Processing and Machine Translation. Joseph Olive, John McCary, and Caitlin Christianson (eds.) Yaser Al-Onaisan, Bonnie Dorr, Doug Jones, Jeremy Kahn, Seth Kulick, Alon Lavie, Gregor Leusch, Nitin Madnani, Chris Manning, Arne Mauser, Alok Parlikar, Mark Przybocki, Rich Schwartz, Matthew Snover, Stephan Vogel and Clare Voss. [html]
Putting the User in the Loop: Interactive Maximal Marginal Relevance for Query-Focused Summarization. In Proc. NAACL (Short Papers). Jimmy Lin, Nitin Madnani and Bonnie J. Dorr. [pdf] [bib]
Measuring Transitivity using Untrained Annotators. In Proc. Workshop on Creating Speech and Language Data With Amazon’s Mechanical Turk. Nitin Madnani, Jordan Boyd-Graber and Philip Resnik. [pdf] [bib]
Generating Phrasal & Sentential Paraphrases: A Survey of Data-Driven Methods. Computational Linguistics, 36(3):341-387. Nitin Madnani and Bonnie Dorr. [pdf] [bib]
The Circle of Meaning: From Translation to Paraphrasing and Back. Doctoral Dissertation. Department of Computer Science. University of Maryland College Park. [pdf] [bib]
The Python and The Elephant: Large Scale Natural Language Processing with NLTK and Dumbo. In Proc. of the Eighth Annual Python Conference. Nitin Madnani and Jimmy Lin. [video] [bib] [zip]
TER-Plus: Paraphrase, Semantic, and Alignment Enhancements to Translation Edit Rate. Machine Translation, 23(2-3):117-127. Matthew Snover, Nitin Madnani, Bonnie Dorr and Richard Schwartz. [html] [bib]
2009
Fluency, Adequacy, or HTER? Exploring Different Human Judgments with a Tunable MT Metric. In Proc. WMT. Matthew Snover, Nitin Madnani, Bonnie Dorr and Richard Schwartz. [pdf] [bib]
Querying and Serving N-gram Language Models with Python. The Python Papers. Volume 4, Issue 2. Nitin Madnani. [pdf] [bib]
Source Code: Querying and Serving N-gram Language Models with Python. The Python Papers Source Codes. Volume 1. Nitin Madnani. [pdf] [bib]
2008
Applying Automatically Generated Semantic Knowledge: A Case Study in Machine Translation. In Proc. of the Symposium on Semantic Knowledge Discovery, Organization and Use. Nitin Madnani, Philip Resnik, Bonnie Dorr and Richard Schwartz. [pdf] [bib] [poster]
Are Multiple Reference Translations Necessary? Investigating the Value of Paraphrased Reference Translations in Parameter Optimization. In Proc. AMTA. Nitin Madnani, Philip Resnik, Bonnie Dorr and Richard Schwartz. [pdf] [bib]
Combining Open-Source with Research to Re-engineer a Hands-on Introductory NLP Course. In Proc. of the Third ACL Workshop on Issues in Teaching Computational Linguistics (TeachCL-08). Nitin Madnani and Bonnie Dorr. [pdf] [bib]
Multiple Alternative Sentence Compressions and Word-Pair Antonymy for Automatic Text Summarization and Recognizing Textual Entailment. In Proc. of the Text Analysis Conference (TAC). Saif Mohammad, Bonnie J. Dorr, Melissa Egan, Nitin Madnani, David Zajic, and Jimmy Lin. [pdf] [bib]
TERp: A System Description. In Proc. of the First NIST Metrics for Machine Translation Challenge (MetricsMATR). Matthew Snover, Nitin Madnani, Bonnie Dorr and Richard Schwartz. [pdf] [bib]
2007
Using Paraphrases for Parameter Tuning in Statistical Machine Translation. In Proc. WMT. Nitin Madnani, Necip Fazil Ayan, Philip Resnik, Bonnie Dorr. [pdf] [bib]
Measuring Variability in Sentence Ordering for News Summarization. In Proc. ENLG. Nitin Madnani, Rebecca Passonneau, John Conroy, Necip Fazil Ayan, Bonnie Dorr, Judith Klavans, Dianne O'Leary and Judith Schlesinger. [pdf] [bib]
Getting Started on Natural Language Processing with Python. ACM Crossroads, 13(4). Nitin Madnani. [Note: This PDF version is now out of date and is here only for historical reasons. The GitHub repository contains a nice interactive iPython notebook version of the article and is kept up to date.] [github repo] [pdf] [html] [bib]
TREC 2007 ciQA Task: University of Maryland. In Proc. TREC. Nitin Madnani, Jimmy Lin, and Bonnie Dorr. [pdf] [bib]
Multiple Alternative Sentence Compressions for Automatic Text Summarization. In Proc. of the Document Understanding Conference (DUC) at HLT/NAACL. Nitin Madnani, David Zajic, Bonnie Dorr, Necip Fazil Ayan and Jimmy Lin. [pdf] [bib]
2005 and earlier
The Hiero Machine Translation System: Extensions, Evaluation, and Analysis. 2005. In Proc. HLT/EMNLP. David Chiang, Adam Lopez, Nitin Madnani, Christof Monz, Philip Resnik and Michael Subotin. [pdf] [bib]
Rapid Porting of DUSTer to Hindi. 2003. ACM Transactions on Asian Language Information Processing, 2(2). Bonnie J. Dorr, Necip Fazil Ayan, Nizar Habash, Nitin Madnani, and Rebecca Hwa. [pdf] [bib]