498 results for source starred repositories

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Python 417 39 Updated Jan 13, 2025

Differentiable ODE solvers with full GPU support and O(1)-memory backpropagation.

Python 5,714 945 Updated Nov 21, 2024

A collection of resources regarding the interplay between differential equations, deep learning, dynamical systems, control and numerical methods.

1,379 153 Updated Sep 13, 2024

An LM forked from my transformer-train-script repo that replaces attention with a novel idea called "matrix recurrent units."

Python 4 Updated Dec 20, 2024
Python 120 14 Updated Oct 31, 2019

Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"

Python 28 1 Updated Oct 25, 2024
Python 60 Updated Dec 22, 2024

[ICML 2024 Best Paper] Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution (https://arxiv.org/abs/2310.16834)

Python 453 47 Updated Feb 29, 2024

Emergent world representations: Exploring a sequence model trained on a synthetic task

Jupyter Notebook 173 42 Updated Jul 12, 2023

Bootstrapping ARC

Python 90 9 Updated Nov 20, 2024

Latent Program Network (from the "Searching Latent Program Spaces" paper)

Jupyter Notebook 42 1 Updated Nov 28, 2024

Our solution for the arc challenge 2024

Jupyter Notebook 84 6 Updated Dec 7, 2024
Jupyter Notebook 46 3 Updated Nov 22, 2024

Implementation of the proposed minGRU in Pytorch

Python 270 21 Updated Dec 18, 2024

Material for lectures on Diffusion models at IE university

Jupyter Notebook 71 1 Updated Jan 9, 2025

【PyTorch】Easy-to-use, Modular and Extendible package of deep-learning based CTR models.

Python 3,085 713 Updated Jul 2, 2024

🧬 Nucleotide Transformer: Building and Evaluating Robust Foundation Models for Human Genomics

Python 547 63 Updated Oct 1, 2024

Evaluating genomic sequence models for explaining personalized expression variation

Jupyter Notebook 19 2 Updated Dec 6, 2023

Implementation of Enformer, Deepmind's attention network for predicting gene expression, in Pytorch

Python 450 86 Updated Oct 9, 2024

GTEx & TOPMed data production and analysis pipelines

Python 352 176 Updated Dec 16, 2024

For fine-tuning Enformer using paired WGS & gene expression data

Python 11 3 Updated Jan 10, 2025

Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation

Python 251 49 Updated Nov 3, 2024

treemind interprets ensemble tree models by analyzing individual trees and their predictions, providing insights into the decision-making process.

Python 19 1 Updated Jan 7, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 28,799 3,417 Updated Jan 9, 2025
Python 36 4 Updated Oct 16, 2024

Mamba for Multivariate Time Series Forecasting

Python 35 3 Updated Aug 30, 2024

LLM-powered multiagent persona simulation for imagination enhancement and business insights.

Python 5,230 418 Updated Jan 3, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,038 42 Updated Nov 16, 2024

Biological foundation modeling from molecular to genome scale

Jupyter Notebook 1,241 152 Updated Dec 18, 2024