
Foundation 3D ViT model for volumetric head CT

Jupyter Notebook 31 2 Updated Feb 17, 2025

Memory-optimized training scripts for video models based on Diffusers

Python 881 94 Updated Feb 24, 2025

Data collection and evaluation framework for on-device agents

Swift 23 Updated Oct 28, 2024

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,797 1,025 Updated Feb 22, 2025

A unified 3D Transformer Pipeline for visual synthesis

2,807 163 Updated May 29, 2023

[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Jupyter Notebook 780 50 Updated Jul 30, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,242 281 Updated May 4, 2024

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 977 44 Updated Jan 17, 2024

Fréchet Clip Distance Implementation for PyTorch

Python 8 Updated Feb 28, 2023

Python 95 11 Updated Nov 6, 2024

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,249 170 Updated Feb 12, 2025

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

1,690 154 Updated Jan 4, 2025

A collection of resources and papers on Diffusion Models

HTML 11,451 961 Updated Aug 1, 2024

Meditron is a suite of open-source medical Large Language Models (LLMs).

Python 1,969 186 Updated Apr 10, 2024

PyTorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)

Python 492 82 Updated Jul 17, 2024

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

2,279 198 Updated Feb 13, 2025

The code repository for examples in the O'Reilly book 'Generative Deep Learning' using PyTorch

Jupyter Notebook 182 24 Updated Dec 15, 2019

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,266 599 Updated Feb 21, 2025

Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models

Jupyter Notebook 539 35 Updated Apr 6, 2024

A comprehensive paper list on Vision Transformer/Attention, including papers, code, and related websites

4,768 494 Updated Jul 30, 2024

My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…

Jupyter Notebook 1,017 176 Updated Dec 27, 2020

Python Data Science Handbook: full text in Jupyter Notebooks

Jupyter Notebook 43,952 18,107 Updated Jun 26, 2024

Explanations of key concepts in ML

7,467 593 Updated Feb 25, 2025

GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided Gastrointestinal Disease Detection https://drive.google.com/drive/folders/1T35gqO7jIKNxC-gVA2YVOMdsL7PSqeAa?usp=sharing

Python 42 3 Updated Sep 20, 2024

Jupyter Notebook 76 7 Updated Jun 26, 2023

An open source implementation of CLIP.

Python 11,064 1,043 Updated Feb 24, 2025