Jay Gala

Hey, thanks for stopping by! 👋

I am a Research Associate at MBZUAI supervised by Yova Kementchedjhieva and Alham Fikri Aji working on multimodal learning. My broad interests span the areas of multimodal and multilingual learning, specifically in the context of data-efficient learning, training dynamics, reasoning and generalization.

I also collaborate with Zeerak Talat on hate speech detection using federated learning. Previously, I was an AI Resident at AI4Bharat (IIT Madras) under the supervision of Mitesh Khapra, Anoop Kunchukuttan and Raj Dabre, where I worked on building open-source datasets and models for Indian languages. Before that, I was a research intern at the University of California, San Diego under the supervision of Pengtao Xie, where I worked on neural architecture search and generative models.

I completed my Bachelor’s degree in Computer Engineering at the University of Mumbai, India. In the past, I was a machine learning intern at Tata Consultancy Services, where I worked on understanding customer behavior using natural language processing. Before that, I collaborated with Prof. Pratik Kanani on an industry project focusing on anomaly detection in heart rate (pulse) using IoT and machine learning.

I also served as a mentor at DJ Unicode, a student organization that aims to inspire sophomores and juniors to contribute to open-source projects. Additionally, I led a team that developed a platform for conducting C programming examinations in college for over 500 students (demo).

I co-founded the research division of Unicode (a.k.a. Unicode Research) with Swapneel Mehta from the NYU CSMaP group. We were fortunate to be joined by Akash Srivastava from MIT-IBM AI Lab for foundational lectures on deep generative models and probabilistic machine learning. I also worked as a teaching assistant for the Unicode Machine Learning Summer Course 2021, supported by Google Research India. Additionally, I was a founding research engineer at SimPPL, where I collaborated with The Sunday Times and Ippen Digital to develop tools (parrot.report) that help policymakers and journalists audit online disinformation on social media.

News and Timeline

June 2025: Our preprint on LLMs Can Compensate for Deficiencies in Visual Representations is now available on arXiv.
January 2025: Our work on MMTEB: Massive Multilingual Text Embedding Benchmark has been accepted to ICLR 2025.
August 2024: Gave a talk on the in-context learning capabilities of LLMs for MT (slides) at the SNLP Reading Group, Microsoft Research India.
August 2024: Our work RomanSETU received the 🏆 Senior Area Chair Award at ACL 2024! Congratulations to all the authors!
May 2024: Our works (RomanSETU, ICL study for MT, and Data Pruning for MT) were accepted at ACL 2024.
May 2024: Our work, Leverage Class-Specific Accuracy to Guide Data Generation for Improving Image Classification, has been accepted at ICML 2024. Stay tuned for the camera-ready version!
March 2024: Our new preprint, On the low-shot transferability of [V]-Mamba, is now out on arXiv.
January 2024: Our preprint on ICL abilities in LLMs for MT is available on arXiv.
January 2024: Excited to announce the release of Airavata, an instruction-tuned Hindi LLM. Check out the Technical Report and Code.
November 2023: IndicTrans2 submission has been accepted at TMLR. Check out the Camera Ready Version.
November 2023: Presenting a tutorial on Developing SOTA MNMT Systems for Related Languages at AACL-IJCNLP 2023.
May 2023: Excited to share the release of IndicTrans2, the first open-source model to support all 22 Scheduled Indian languages. Check out the Preprint and Code.
January 2023: A Federated Approach for Hate Speech Detection has been accepted to EACL 2023. Check out the Preprint and Code.