Skip to content
View Robinysh's full-sized avatar

Highlights

  • Pro

Block or report Robinysh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,623 820 Updated Sep 1, 2024

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 209 21 Updated Feb 24, 2025

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 420 31 Updated Feb 14, 2025

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Python 191 11 Updated Sep 10, 2024

A python script that allows your terminal to snow.

Python 592 34 Updated Dec 23, 2024

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,040 62 Updated Mar 4, 2025

Chronos: Pretrained Models for Probabilistic Time Series Forecasting

Python 3,005 331 Updated Feb 19, 2025

Free, no-nonsense, super fast blogging.

CSS 3,143 97 Updated Mar 3, 2025

A material you color generation tool

Rust 449 19 Updated Jan 3, 2025
Shell 2 Updated May 20, 2024

PII Masker is an open-source tool for protecting sensitive data by automatically detecting and masking PII using advanced AI, powered by DeBERTa-v3. It provides high-precision detection, scalable p…

Jupyter Notebook 80 3 Updated Dec 3, 2024

Windows' "Active Windows" watermark for Linux

Rust 30 1 Updated Nov 28, 2024

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 5,635 984 Updated Feb 24, 2025

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 802 32 Updated Dec 27, 2024

Multilingual Voice Understanding Model

Python 4,734 427 Updated Jan 8, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,184 275 Updated Nov 5, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,395 1,131 Updated Mar 1, 2025

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,038 75 Updated Mar 2, 2025

Efficient Triton Kernels for LLM Training

Python 4,551 274 Updated Mar 3, 2025

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 877 55 Updated Jan 7, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,353 1,477 Updated Dec 25, 2024

PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.

Python 228 15 Updated Oct 2, 2024

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

Python 1,297 123 Updated Apr 24, 2024

Awesome speech/audio LLMs, representation learning, and codec models

911 58 Updated Feb 28, 2025

UI Library for Design Engineers. Animated components and effects you can copy and paste into your apps. Free. Open Source.

MDX 14,932 597 Updated Feb 27, 2025

Minimalist developer portfolio using Next.js 14, React, TailwindCSS, Shadcn UI and Magic UI

TypeScript 799 166 Updated Feb 25, 2025

Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']

Python 454 40 Updated Jan 3, 2025
155 16 Updated Aug 27, 2024

aider is AI pair programming in your terminal

Python 28,553 2,591 Updated Mar 3, 2025

A fast multimodal LLM for real-time voice

Python 3,672 261 Updated Feb 14, 2025
Next