Publications

2024

  1. INMT-Lite: Accelerating Low-Resource Language Data Collection via Offline Interactive Neural Machine Translation
    Harshita Diddee, Anurag Shukla, Tanuja Ganu, and 4 more authors
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) May 2024
  2. Akal Badi ya Bias: An Exploratory Study of Gender Bias in Hindi Language Technology (Best Paper Award)
    Rishav Hada, Safiya Husain, Varun Gumma, and 8 more authors
    In FAccT ’24: Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency May 2024

2023

  1. MEGA: Multilingual Evaluation of Generative AI
    Kabir Ahuja, Harshita Diddee, Rishav Hada, and 9 more authors
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Dec 2023
  2. “Fifty Shades of Bias”: Normative Ratings of Gender Bias in GPT Generated English Text
    Rishav Hada*, Agrima Seth*, Harshita Diddee, and 1 more author
    In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Dec 2023
  3. Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?
    Rishav Hada, Varun Gumma, Adrian de Wynter, and 5 more authors
    Association for Computational Linguistics Dec 2023

2022

  1. CodeFed: Federated Speech Recognition for Low-Resource Code-Switching Detection
    Chetan Madan, Harshita Diddee, Deepika Kumar, and 1 more author
    ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP) Aug 2022
  2. Too Brittle To Touch: Comparing the Stability of Quantization and Distillation Towards Developing Lightweight Low-Resource MT Models
    Harshita Diddee, Sandipan Dandapat, Monojit Choudhury, and 2 more authors
    Conference on Machine Translation 2022 Oct 2022
  3. Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages
    Gowtham Ramesh, Sumanth Doddapaneni, Aravinth Bheemaraj, and 15 more authors
    Transactions of the Association for Computational Linguistics Feb 2022
  4. The Six Conundrums of Building and Deploying Language Technologies for Social Good
    Harshita Diddee, Kalika Bali, Monojit Choudhury, and 1 more author
    In ACM SIGCAS/SIGCHI Conference on Computing and Sustainable Societies (COMPASS) Jun 2022

2021

  1. Towards Quantifying the Carbon Emissions of Differentially Private Machine Learning
    Rakshit Naidu, Harshita Diddee, Ajinkya Mulay, and 3 more authors
    Socially Responsible Machine Learning Workshop at ICML 2021 Jun 2021

2020

  1. PsuedoProp at SemEval-2020 Task 11: Propaganda Span Detection Using BERT-CRF and Ensemble Sentence Level Classifier
    Aniruddha Chauhan and Harshita Diddee
    In Proceedings of the Fourteenth Workshop on Semantic Evaluation Dec 2020
  2. CrossPriv: User Privacy Preservation Model for Cross-Silo Federated Software
    Harshita Diddee and Bhrigu Kansra
    In Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering Dec 2020