Skip to content
View ash3327's full-sized avatar

Block or report ash3327

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ash3327/README.md

Hi there👋, I'm Sam

A Year 4 AI Enthusiast & App Developer at The Chinese University of Hong Kong 💻

Nice to meet you! I love creating little gadgets and dive into the possibilities of AI.


💬 About Me

  • Energetic and enthusiastic towards learning new things.
  • Have experiences with PyTorch and Tensorflow projects
  • Graduating with a major in AI in Summer 2025
  • 🏆 Dean's List, 2021-2022, 2022-2023 and 2023-2024 in CUHK
  • 🚀 Skills:
    • AI:
      • Frameworks: Python, Tensorflow, PyTorch, HuggingFace, Keras
      • Models:
        • Reinforcement Learning: Deep Q-Learning
        • Vision: ResNet, U-Net, GANs, Diffusion
        • NLP: RNN, LSTM, BERT, LLaMa
    • Backend: Python, Java, MySQL, Docker, Flask
    • Frontend: JavaScript, CSS, HTML
    • Planning to Learn: Node.js, React

💼 Work Experiences

Glassbox AI (ML Intern)

Jun 2024 - Aug 2024 | Summer Internship
Sep 2024 - Nov 2024 | Part-time

  • Implemented and trained RNN-based models for sign language translation tasks
  • Developed a backend pipeline for data fetching and LLM inference, integrating it with existing fine-tuning workflows using Python, Flask, and MySQL.
  • Researched methods for temporal alignment on gesture sequences

🚀 Highlighted Projects

Oasis: The Calendar App (Jun 2024) SnowFight: Deep Q-Learning Agent for Third-Person Shooter Game (Dec 2022) RegSubjer:
Course Registration with Autoclicker (Jan 2022)
Java Android RoomDB

Calendar App Project “Oasis”

Mar 2023 - Present

  • Independently developed a mobile event planning and notification application for Android using Java, aiming to help users track their deadlines and events
  • Worked across the development lifecycle to build and maintain code
  • Migrated the data storage and retrieval to the more robust RoomDatabase with extensive use of SQL queries
  • Refactored codebase into distinct UI layers and business logic components

Others:

AI Music Project

Sep 2024 - Dec 2024

  • Gesture detection and mapping into the 3d scene with three.js
  • Beatmap generation via librosa

https://github.com/ash3327/ai_music_project

Peer-to-Peer Communication App

Jan 2024 - Apr 2024

https://github.com/ash3327/Peer-to-Peer-Communication-App

  • Created a peer-to-peer communication app supporting audio recording, waveform display and editing, and also screen share function
  • Implemented GUI for audio recording, waveform display and other functions.
  • Implemented synchronization mechanism for audio and video packets sent through socket.
  • Skills: Python

Archaic RPG game

  • A customizable RPG game with map creation tools.
  • Developed a GamePlayer feature allowing users to play their maps or load other people's maps.
  • Created a MapCreator tool for users to change landscape tiles, place items, monsters, and NPCs, and add multiple levels to their maps.
  • Implemented a Custom Code Editor enabling users to customize map interactions, including NPC interactions through custom coding.
  • Link: https://github.com/ash3327/ArchaicBitmapGame
  • Skills: Java

Deep Q-Learning Agent for Third-Person Shooter Game ImageSegmentation-UNet Report Group Project Project

Python Gymnasium Reinforcement Learning

  • Created a Gym environment of a simple third-person shooter game in Python
  • Implemented a Deep-Q Network with PyTorch to train agents to master the game with a variable quantity of moving objects
  • Fine-tuned the model to achieve average kill streak of 7 and lengthen survival duration by 4 times, significantly better than the random baseline.

With limited information provided to the agent, the agents developed the following strategies without prior prompting:

Precise Shooting Retreats to Corner Constant Spinning

🔭 More on Past Projects

Project Vision Transformer ImageSegmentation-UNet Report Project

Python PyTorch UNet ResNet ViT DeiT T2T Dataset | CIFAR10 Dataset | STL10 Dataset | Cityscapes

Exploring the generalizability of Vision Transformers (ViTs) on small datasets compared to Convolutional Neural Networks (CNNs). The project highlights:

  • Scalability: ViTs underperform on datasets with very small sizes, while CNNs are more robust.
  • Efficiency: ViTs are computationally less efficient than CNNs for models with the same accuracy.
  • Models used: ResNet, ViT, DeiT, and T2T-ViT.

Also touched training U-Net over the Cityscape dataset.

Key Results

Accuracy Comparison Computational Efficiency
Classification
Training IoU Curve Segmented Results
Segmentation

Protein Sequence Classification Kaggle Competition ProtTrans Report Learning

Python PyTorch ProtTrans Kaggle

  • Applied pretrained ProtTrans transformer model and trained a fully-connected classifier head.
  • Achieved 98.438% accuracy on public leaderboard and 94.161% accuracy on private leaderboard. CNN baseline is 77.106% and 78.241%.

Image Segmentation using U-Net ImageSegmentation-UNet Learning

Python PyTorch AI Segmentation Kaggle Cityscapes Reference Implementation ArXiv

Trained U-Net models for semantic segmentation on the Cityscapes and Carvana datasets to build basic concepts on the U-Net architecture, with the following results:

  • Carvana: 99.55% pixel accuracy, dice score of 0.9911 over the dataset downscaled to 320 x 480
  • Cityscapes: 84.0% pixel accuracy, did not track mIoU

ObjectDetection-v1 ObjectDetection-v1 Learning

Python Ultralytics YOLO AI Detection Reference Sort

Performed object detection and tracking on videos using YOLOv8. Key features include:

  • Static image and video object detection
  • Instance tracking with third party library and custom algorithm respectively
  • Car counting
Static Image Object Detection Instance Tracking Car Counting

GAN Self-Learning Project GAN-MNIST Learning

Python PyTorch Generative Adversarial Networks MNIST Dataset Reference ArXiv ArXiv ArXiv

A deep learning project exploring Generative Adversarial Networks (GANs) using the MNIST dataset:

  • Experimented with different learning rates
  • Observed the phoenomenon of mode collapse and the sensitivity of the GAN architecture to the learning rate
  • Understanding the architecture of GAN, improvements made by WGAN, and also the principles of providing class conditions to GANs
Vanilla GAN WGAN

Pinned Loading

  1. OasisPlanner OasisPlanner Public

    Java

  2. ArchaicBitmapGame ArchaicBitmapGame Public

    Abandoned Project

    Java

  3. GAN-self-learn-v1 GAN-self-learn-v1 Public

    A deep learning project exploring Generative Adversarial Networks (GANs) using the MNIST dataset

    Jupyter Notebook

  4. ImageSegmentation-UNet ImageSegmentation-UNet Public

    Trained U-Net models for semantic segmentation on the Cityscapes and Carvana datasets to build basic concepts on the U-Net architecture

    Python

  5. RegSubjer RegSubjer Public

    Very simple autoclicker for registering courses.

    Java

  6. SnowFight SnowFight Public

    Trained agents to play a custom shooter game with deep Q learning

    Python