Posts by Collection

engineeringProjects

PlayMaths

A playground where you can interact with widgets to solidify your mathematical understanding

Third Eye

A free online tool that aims to offer various computer vision services. Currently though, there is only one functionality available, object detection. Using our tool, you can detect objects from videos and photos in real time or asynchronously. 

Burmese G2P

Myanmar (Burmese) 🇲🇲 Language Grapheme to Phoneme Converter for automatic speech recognition (ASR) and text-to-speech (TTS)

Space Invader

A classic arcade style 2D shooting game wherein the player spaceship shoots at the enemies and bosses while avoiding their attacks.

essays

publications

WHEN TOM EATS KIMCHI: Evaluating Cultural Awareness of Multimodal Large Language Models in Cultural Mixture Contexts

Published in Workshop on Cross-Cultural Considerations in NLP (C3NLP) @ NAACL 2025, 2025

In a highly globalized world, it is important for multi-modal language models to correctly recognize visuals in mixed-cultural settings. This paper examines the robustness of MLLMs to mixed cultures by constructing MixCuBe, a cross-cultural awareness benchmark of images and evaluating SOTA MLLMs on it.

Download Paper

Physics-aware Multi-Object 3D Scene Reconstruction (in Progress)

Published in TBD, 2026

Recently, research in 3D reconstruction shifts from achieving consistency in mere appearance and geometry to attaining physically plausible models of the scene or the object. For this problem, while test-time optimization approaches takes hours to optimize reasonble physical parameters of even a single object, the generalizability of feed-forward approaches is too limited. We are currently striving to overcome the shortcomings of both approaches and provide a simulation pipeline that is easily generalizable.

researchProjects

Advanced Passage Retrieval

A NLP research into passage retrieval, the task of extracting top-k pertinent passages from a dataset as the output given a query as the input. We use the BM25 model as a benchmark to explore retrieval models that can achieve better accuracy specifically on the SQuAD1.1 dataset.

Download Slides

Dynamic Brain Connectome Learning

A set of graph machine learning architectures for learning temporal and spatial patterns of brain activation from fMRI images. Two downstream tasks are implemented: (1) brain activation prediction duration language tasks (link prediction) and (2) performance prediction from neural patterns (graph regression)

Download Slides

RenderFormer with Linear Attention

Renderformer is an end-to-end fully data-driven transformer-based rendering pipeline developed by Microsoft. Due to its usage of vinalla transformer architecture, it has time complexity O(N^2). We tried to achieve linear time complexity by using a linear attention mechanism, namely Performer (FAVOR++).

Download Slides

talks

SAP Analytics Cloud Training Permalink

Published:

Data analytics training using SAP analytics cloud to familiarize people around Myanmar with the theory of data science and hands-on experience with the tool. It also aims to promote ASEAN Data Science Explorer competition.

Download Slides

Overview of Programming Permalink

Published:

Introduction to the landscape of programming to the general public with emphasis on its relation with computer science and learning advice.

teaching

AI for Teenagers Permalink

Published:

An introductory AI course for teenagers and any novice of all ages alike. It starts off by exploring some subfields of AI before deep diving into machine learning, where you will learn three major types of learning and some of their algorithms. After that, it will give students a sense of the working of artificial neural networks and exposure to some phenomena of AI in society.