projects

You can find the full list of my projects on my GitHub account.

research

project thumbnail

Multi-turn RL

August 2025

Extended the VeRL framework to support training multimodal models with multi-turn reinforcement learning with external tools.

project thumbnail

Mixed-modal Reasoning

June 2025

Trained 3 paradigms of visual reasoning using GRPO

university

project thumbnail

Visual Reasoning

May 2025

Explored GRPO to enhance visual question answering in vision-language models

project thumbnail

Recommender Systems

December 2024

Compares collaborative filtering, matrix factorization, and neural networks

project thumbnail

Document Retrieval

November 2024

Built an efficient IR system across 7 languages with computational limits

project thumbnail

Segmentation and Classification

May 2024

Using classic computer vision techniques to segment and extract, and deep learning for the classification

project thumbnail

Mountain Car

May 2024

Handling sparse reward challenges in reinforcement learning using DQN and Dyna-Q algorithms

project thumbnail

GalactiTA

May 2024

1.3B LLM trained through a 3-stage pipeline of SFT, DPO, and RAG-tuning on scientific datasets.

project thumbnail

YouTube Analysis

December 2023

Analysis of Tech channels on YouTube using the videos published between May 2005 and October 2019

project thumbnail

Stance Detection

December 2023

Fine-tuning Large Language Models for argument stance detection in unseen domains

project thumbnail

Predicting Cardiovascular Diseases

October 2023

Using machine learning on behavioral risk factor data to predict heart disease

miscellaneous

project thumbnail

LLM training

May 2025

🥈2nd Place🥈 － Hackathon on LLM training & architecture

project thumbnail

Satellite Imagery

November 2024

🥇1st Place🥇 － Hackathon on analyzing satellite imagery based on LLMs and CV (Lauzhack 2024, AXA challenge)