You can find the full list of my projects on my GitHub account.

research

Multi-turn RL (August 2025)
Extended the VeRL framework to support training multimodal models with multi-turn reinforcement learning and external tools.

Mixed-modal Reasoning (June 2025)
Trained 3 paradigms of visual reasoning using GRPO.

university

Visual Reasoning (May 2025)
Explored GRPO to enhance visual question answering in vision-language models.

Recommender Systems (December 2024)
Compared collaborative filtering, matrix factorization, and neural networks.

Document Retrieval (November 2024)
Built an efficient IR system across 7 languages under computational constraints.

Segmentation and Classification (May 2024)
Used classic computer vision techniques for segmentation and extraction, and deep learning for classification.

Mountain Car (May 2024)
Handled sparse-reward challenges in reinforcement learning using the DQN and Dyna-Q algorithms.

GalactiTA (May 2024)
1.3B LLM trained through a 3-stage pipeline of SFT, DPO, and RAG-tuning on scientific datasets.

YouTube Analysis (December 2023)
Analyzed tech channels on YouTube using videos published between May 2005 and October 2019.

Stance Detection (December 2023)
Fine-tuned large language models for argument stance detection in unseen domains.

Predicting Cardiovascular Diseases (October 2023)
Used machine learning on behavioral risk factor data to predict heart disease.

miscellaneous

LLM training (May 2025)
🥈2nd Place🥈 - Hackathon on LLM training & architecture

Satellite Imagery (November 2024)
🥇1st Place🥇 - Hackathon on analyzing satellite imagery based on LLMs and CV (Lauzhack 2024, AXA challenge)