| Oct 23, 2025 | Uploaded the slides I used for my talk tour in Japan during 2023-2024 titled “Visually-Rich Documentを軸とした多言語処理の動向” |
| Oct 21, 2025 | New preprint on LLM judge bias titled “Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge” is now on arXiv! |
| Dec 9, 2024 | I started working as a member of technical staff at Cantina Labs. |
| Dec 1, 2024 | I got promoted to Senior Applied Scientist :) |
| Mar 19, 2024 | One short paper titled “M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation” got accepted to the NAACL main conference. |
| Jan 12, 2024 | I visited Tohoku University and gave a talk titled “Recent Trends in Multilingual NLP focusing on Visually-Rich Documents and a Retrospective on my US PhD”. |
| Dec 21, 2023 | I visited Nara Institute of Science and Technology and gave a talk titled “Recent Trends in Multilingual NLP focusing on Visually-Rich Documents and a Retrospective on my US PhD”. |
| Dec 15, 2023 | I visited National Institute of Informatics and gave a talk titled “Recent Trends in Multilingual NLP focusing on Visually-Rich Documents and a Retrospective on my US PhD”. |
| Dec 12, 2023 | I visited Nagoya University and gave a talk titled “Recent Trends in Multilingual NLP focusing on Visually-Rich Documents and a Retrospective on my US PhD”. |
| Oct 7, 2023 | Two papers accepted to EMNLP (1 Findings, 1 main): (1) a dataset paper on visually-rich documents titled “A Multi-Modal Multilingual Benchmark for Document Image Classification”. Check it out here. (2) An evaluation paper on biases in multilingual models titled “Comparing Biases and the Impact of Multilingual Training across Multiple Languages”. Check out the arXiv version here. |
| May 26, 2023 | Our work on efficient dialogue state tracking titled “Diable: Efficient Dialogue State Tracking as Operations on Tables” has been accepted to ACL Findings and is now on arXiv! Check it out here. |
| May 23, 2023 | Our work on bias evaluation in multilingual models titled “Comparing Biases and the Impact of Multilingual Training across Multiple Languages” is now on arXiv. Check it out here. |
| Feb 17, 2023 | My first pull request to the Hugging Face Transformers library got merged :) |
| Jul 27, 2022 | We presented our work on “Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability” at the Japanese NLP colloquium series. The recording is available here. |
| Mar 21, 2022 | Our paper titled “Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability” has been accepted at the main conference of ACL 2022. |
| Dec 16, 2021 | I got my PhD degree. |
| Sep 20, 2021 | I started working at AWS AI Labs. |
| May 13, 2021 | Our paper titled “Semi-Supervised Joint Estimation of Word and Document Readability” has been published at TextGraphs-15. |
| Nov 24, 2020 | I’ll be an instructor for the CSCI 4622 Machine Learning class at CU Boulder. |
| Jun 3, 2020 | Our paper titled “Why Overfitting Isn’t Always Bad: Retrofitting Cross-Lingual Word Embeddings to Dictionaries” has been accepted at ACL 2020. |
| Dec 13, 2019 | Our paper titled “Exploiting Cross-Lingual Subword Similarities in Low-Resource Document Classification” has been accepted at AAAI 2020. |
| Aug 16, 2019 | Visiting and giving a talk at Tokyo Metropolitan University. |
| Aug 1, 2019 | Recorded presentation of our ACL talk is now available here. |
| Jul 10, 2019 | Visiting University of Maryland CLIP lab. |
| May 14, 2019 | Our paper titled “A Resource-Free Evaluation Metric for Cross-Lingual Word Embeddings based on Graph Modularity” got accepted to ACL 2019 as a long paper. |
| May 13, 2019 | Our paper titled “Zika discourse in the Americas: a multilingual topic analysis of Twitter” has been published at PLOS ONE. |