news

Mar 19, 2024 One short paper accepted to EMNLP main titled “M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation “. The preprint will become available soon!
Jan 12, 2024 I visited Tohoku University and gave a talk titled “Recent Trends in Multilingual NLP focusing on Visually-Rich Documents and a Retrospective on my US PhD”.
Dec 21, 2023 I visited Nara Institute of Science and Technology and gave a talk titled “Recent Trends in Multilingual NLP focusing on Visually-Rich Documents and a Retrospective on my US PhD”.
Dec 15, 2023 I visited National Institute of Informatics and gave a talk titled “Recent Trends in Multilingual NLP focusing on Visually-Rich Documents and a Retrospective on my US PhD”.
Dec 12, 2023 I visited Nagoya University and gave a talk titled “Recent Trends in Multilingual NLP focusing on Visually-Rich Documents and a Retrospective on my US PhD”.
Oct 7, 2023 Two papers accepted to EMNLP (1 findings, 1 main):
  1. A dataset paper on visually-rich documents titled “A Multi-Modal Multilingual Benchmark for Document Image Classification”. Check it out here here.
  2. An evaluation paper on biases in multilingual models titled “Comparing Biases and the Impact of Multilingual Training across Multiple Languages”. Check out the arXiv version here.
May 26, 2023 Our work on efficient dialogue state tracking titled “Diable: Efficient Dialogue State Tracking as Operations on Tables” is accepted to ACL Findings and now on arXiv! Check it out here.
May 23, 2023 Our work on bias evaluation in multilingual models titled “Comparing Biases and the Impact of Multilingual Training across Multiple Languages” is now on arXiv. Check it out here.
Feb 17, 2023 My first pull request to Huggingface Transformers library got merged :)
Jul 27, 2022 We presented our work on “Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability” at the Japanese NLP colloquium series. The recording is available at here.
Mar 21, 2022 Our paper titled “Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability” has been accepted at the main conference of ACL 2022.
Dec 16, 2021 I got my PhD degree.
Sep 20, 2021 I started working at AWS AI Labs.
May 13, 2021 Our paper titled “Semi-Supervised Joint Estimation of Word and Document Readability” has been published at TextGraphs-15.
Nov 24, 2020 I’ll be an instructor for CSCI 4622 Machine Learning class @ CU Boulder.
Jun 3, 2020 Our paper titled “Why Overfitting Isn’t Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries” has been accepted at ACL 2020.
Dec 13, 2019 Our paper titled “Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification” has been accepted at AAAI 2020.
Aug 16, 2019 Visiting & Giving a talk at Tokyo Metropolitan Univeristy.
Aug 1, 2019 Recorded presentation of our ACL talk is now available here.
Jul 10, 2019 Visiting University of Maryland CLIP lab.
May 14, 2019 Our paper titled “A Resource-Free Evaluation Metric for Cross-Lingual Word Embeddings based on Graph Modularity” got accepted to ACL 2019 as a long paper.
May 13, 2019 Our paper titled “Zika discourse in the Americas: a multilingual topic analysis of Twitter” has been published at PLOS ONE.