publications

* denotes equal contribution

An up-to-date list is available on Google Scholar and Semantic Scholar.

2026

  1. BRIDGE: Predicting Human Task Completion Time From Model Performance

    Fengyuan Liu*, Jay Gala*, Nilaksh, Dzmitry Bahdanau, Siva Reddy, and Hugo Larochelle

    arXiv preprint, 2026

2025

  1. LLMs Can Compensate for Deficiencies in Visual Representations

    Sho Takishita*, Jay Gala*, Abdelrahman Mohamed, Kentaro Inui, and Yova Kementchedjhieva

    In Proceedings of the 2025 Annual Conference of the Empirical Methods in Natural Language Processing, 2025

  2. MMTEB: Massive Multilingual Text Embedding Benchmark

    Kenneth Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, David Stap, and 76 more authors

    In Thirteenth International Conference on Learning Representations, 2025

    ICLR Abstract PDF
  3. SHADES: Towards a Multilingual Assessment of Stereotypes in Large Language Models

    Margaret Mitchell, Giuseppe Attanasio, Ioana Baldini, Miruna Clinciu, Jordan Clive, Pieter Delobelle, and 48 more authors

    In Proceedings of the 2025 Annual Conference of the Nations of the Americas Chapter of the ACL, 2025

    NAACL Abstract PDF

2024

  1. Leverage Class-Specific Accuracy to Guide Data Generation for Improving Image Classification

    Jay Gala, and Pengtao Xie

    In Forty-first International Conference on Machine Learning, 2024

  2. Critical Learning Periods: Leveraging Early Training Dynamics for Efficient Data Pruning in Machine Translation

    Everlyn Chimoto, Jay Gala, Orevaoghene Ahia, Julia Kreutzer, Bruce Bassett, and Sara Hooker

    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

    ACL Findings Abstract PDF
  3. An Empirical Study of In-context Learning in LLMs for Machine Translation

    Pranjal A. Chitale*, Jay Gala*, and Raj Dabre

    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

    ACL Findings Abstract PDF Code Slides
  4. RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization

    Jaavid Aktar Husain, Raj Dabre, Aswanth Kumar, Jay Gala, Thanmay Jayakumar, Ratish Puduppully, and 1 more author

    In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, 2024

    🏆 Senior Area Chair Award
  5. CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

    David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, and 69 more authors

    In Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2024

    NeurIPS Abstract PDF Website
  6. Airavata: Introducing Hindi Instruction-tuned LLM

    Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, Aswanth Kumar M, Mohammed Safi Ur Rahman Khan, Diptesh Kanojia, and 5 more authors

    arXiv preprint, 2024

  7. On the low-shot transferability of [V]-Mamba

    Diganta Misra*, Jay Gala*, and Antonio Orvieto

    arXiv preprint, 2024

    arXiv Abstract PDF

2023

  1. NICT-AI4B’s Submission to the Indic MT Shared Task in WMT 2023

    Raj Dabre, Jay Gala, and Pranjal Chitale

    In Proceedings of the Eighth Conference on Machine Translation, 2023

  2. IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages

    Jay Gala*, Pranjal A. Chitale*, Raghavan AK, Varun Gumma, Sumanth Doddapaneni, Aswanth Kumar, and 8 more authors

    Transactions on Machine Learning Research, 2023

  3. A Federated Approach for Hate Speech Detection

    Jay Gala*, Deep Gandhi*, Jash Mehta*, and Zeerak Talat

    In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, 2023

2022

  1. Expanding Access to ML Research through Student-led Collaboratives

    Deep Gandhi, Raghav Jain, Jay Gala, Jhagrut Lalwani, and Swapneel S Mehta

    In Workshop on Broadening Research Collaborations (NeurIPS), 2022

    NeurIPS Abstract PDF
  2. Combating COVID-19 using object detection techniques for next-generation autonomous systems

    Hrishikesh Shenai*, Jay Gala*, Kaustubh Kekre*, Pranjal Chitale*, and Ruhina Karani

    In Cyber-Physical Systems: AI and COVID-19 (Chapter 4), 2022

    Elsevier Abstract Paper

2021

  1. Improving Image-Based Dialog by Reducing Modality Biases

    Jay Gala, Hrishikesh Shenai, Pranjal Chitale, Kaustubh Kekre, and Pratik Kanani

    In 5th International Conference on Advances in Computing and Data Sciences, 2021

    Springer Abstract Paper Code

2020

  1. Pothole Detection and Dimension Estimation System using Deep Learning (YOLO) and Image Processing

    Pranjal Chitale, Kaustubh Kekre, Hrishikesh Shenai, Ruhina Karani, and Jay Gala

    In 35th International Conference on Image and Vision Computing New Zealand (IVCNZ), 2020

  2. IoT and ML based Smart System for Efficient Garbage Monitoring: Real Time AQI monitoring and Fire Detection for dump yards and Garbage Management System

    Dev Savla, Amogh Parab, Kaustubh Kekre, Jay Gala, and Meera Narvekar

    In 3rd International Conference on Smart Systems and Inventive Technology (ICSSIT), 2020

  3. Virtual Farmer: Real Time Crop Prediction and Automatic Irrigation System

    Dev Savla, Amogh Parab, Kaustubh Kekre, Jay Gala, S Ramchandra, and Pankaj Sonawane

    In 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT), 2020