Stars
[Nature Machine Intelligence 2024] Code and evaluation repository for the paper
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
[ECCV 2024] Video Foundation Models & Data for Multimodal Understanding
2025 AI/ML internship & new graduate job list updated daily
An AI framework for clinical diagnosis of 3D biomedical scans
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
A Topic Modeling System Toolkit (ACL 2024 Demo)
[CIKM'24] Self-Supervision Improves Diffusion Models for Tabular Data Imputation
User-friendly LLaMA: Train or Run the model using PyTorch. Nothing else.
Code repository to analyse the performance and results of selected LLMs using the MIMIC CDM dataset.
Med-BERT, contextualized embedding model for structured EHR data
PyTorch implementation of Poincaré embeddings for the ICD coding hierarchy (ICD-9-CM3)
ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor
A comprehensive paper list of Reasoning over Tables.
Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"
Summary statistics-based Randomized Haseman-Elston regression
PyTorch implementation of VQ-VAE-2 from "Generating Diverse High-Fidelity Images with VQ-VAE-2"
Processing pipelines for the HCP
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
A minimal PyTorch implementation of the VQ-VAE model described in "Neural Discrete Representation Learning".
CAPTURE-24: Human activity recognition with activity trackers
Extracting meaningful health information from large accelerometer datasets
Trying out the Mamba architecture on small examples (CIFAR-10, character-level Shakespeare, etc.)
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.