Skip to content
View SarthakYadav's full-sized avatar

Highlights

  • Pro

Block or report SarthakYadav

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the official repo for Gradient Agreement Filtering (GAF).

Python 23 4 Updated Jan 27, 2025

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

Python 333 21 Updated Jan 14, 2025

Minimalist ML framework for Rust

Rust 16,720 1,045 Updated Mar 3, 2025

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,600 2,261 Updated Jan 15, 2025

Repository containing experimentation platform on how to train, infer on wav2vec2 models.

Python 86 37 Updated Sep 22, 2022

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 1,065 117 Updated Feb 26, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 7,671 616 Updated Mar 6, 2025

[ICLR 2020] A repository for extremely fast adversarial training using FGSM

Python 443 89 Updated Jul 25, 2024
Python 10 4 Updated Oct 1, 2024

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,631 821 Updated Sep 1, 2024

This repository implements SummaryMixing, a simpler, faster and much cheaper replacement to self-attention for automatic speech recognition (see: https://arxiv.org/abs/2307.07421). The code is read…

Python 117 11 Updated Sep 17, 2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 888 105 Updated Aug 7, 2024
Python 8 2 Updated Aug 31, 2024

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications

682 38 Updated Feb 23, 2025

Implementation of gradient-based adversarial attack(FGSM,MI-FGSM,PGD)

Python 84 11 Updated Jul 8, 2021

A library for experimenting with, training and evaluating neural networks, with a focus on adversarial robustness.

Jupyter Notebook 926 182 Updated Jan 11, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,264 221 Updated Feb 13, 2025

Denoising Masked Autoencoders Help Robust Classification.

Python 61 4 Updated Jun 4, 2023

PyTorch implementation of adversarial attacks [torchattacks]

Python 1,966 357 Updated Jun 29, 2024

A challenge to explore adversarial robustness of neural networks on MNIST.

Python 741 180 Updated May 3, 2022

TRADES (TRadeoff-inspired Adversarial DEfense via Surrogate-loss minimization)

Python 530 124 Updated Mar 30, 2023
Shell 63 6 Updated Jun 28, 2022
Python 284 55 Updated Jan 8, 2025

Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]

Python 97 11 Updated Dec 5, 2023

Uses WiFi signals 📶 and machine learning to predict where you are

Python 5,126 247 Updated Nov 30, 2023

Neural Networks: Zero to Hero

Jupyter Notebook 13,358 1,849 Updated Aug 18, 2024

Gemini is a modern LaTex beamerposter theme 🖼

TeX 1,038 239 Updated Feb 1, 2025

An implementation of soft-DTW divergences.

Python 134 16 Updated Oct 14, 2021

Dataset and baseline code for the VocalSound dataset (ICASSP2022).

Jupyter Notebook 130 10 Updated Nov 12, 2022
Next