Skip to content
View mrsndmn's full-sized avatar

Block or report mrsndmn

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Modeling, training, eval, and inference code for OLMo

Python 5,494 591 Updated Apr 10, 2025

Scaling Vision Pre-Training to 4K Resolution

113 6 Updated Mar 26, 2025
Python 38 1 Updated Mar 5, 2025

A Conversational Speech Generation Model

Python 12,512 1,127 Updated Mar 27, 2025

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,792 522 Updated Apr 7, 2025

Animation engine for explanatory math videos

Python 76,774 6,655 Updated Mar 20, 2025

Lets make documentation on YFM

TypeScript 117 39 Updated Apr 14, 2025

SciGraphQA: Large-Scale Synthetic Multi-Turn Question-Answering Dataset for Scientific Graphs

Jupyter Notebook 41 2 Updated Oct 25, 2024
Python 8 4 Updated Apr 10, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 143,005 28,646 Updated Apr 15, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 59,995 6,067 Updated Aug 24, 2024

A community dedicated to supporting tools for technical and scientific communication and interactive computing

141 172 Updated Apr 1, 2025

Embedded driver for the SCD4x sensor family.

C 34 21 Updated Feb 10, 2025

Sensirion SCD4x sensor library for the ESP32 microcontroller family. It enables developers to communicate with the SCD4x sensor on the ESP32 platform using the I2C communication channel.

C 8 3 Updated Aug 7, 2022

Text and image to video generation: Kandinsky 4.0 (2024)

Python 144 11 Updated Dec 17, 2024

A Survey of Spoken Dialogue Models (60 pages)

288 16 Updated Nov 28, 2024

A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers.

HTML 240 1 Updated Jan 31, 2025

LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances human-computer interaction through real-time spoken dialogue…

Python 64 7 Updated Dec 22, 2024

Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?

Python 112 3 Updated Feb 11, 2025

MOS score prediction by fine-tuned wav2vec2.0 model

Python 156 21 Updated Oct 20, 2022

Inference and training library for high-quality TTS models.

Python 5,189 550 Updated Dec 10, 2024

Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs

Python 56 7 Updated Apr 11, 2025

Audio Captioning datasets for PyTorch.

Python 115 6 Updated Mar 18, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 9,765 685 Updated Apr 10, 2025

A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

Jupyter Notebook 728 64 Updated May 7, 2024

The python library and service for automatic speech recognition and transcribing in Russian and English

Python 51 7 Updated Nov 30, 2024

Deep Learning Audio Course, 2024

Jupyter Notebook 81 3 Updated Nov 14, 2024

Algorithms and Data Structures course at ITMO University

C++ 8 Updated Jul 29, 2022

Digital Signal Processing course

Python 27 1 Updated Apr 14, 2025

PyTorch implementation for DDPM & DDIM

Python 27 Updated Nov 29, 2023
Next