Robinysh

Robin Yuen Shing Hei Robinysh

7 followers · 1 following

Vancouver, Canada

Achievements

Highlights

Stars

srush / GPU-Puzzles

Solve puzzles. Learn CUDA.

Jupyter Notebook 10,623 820 Updated Sep 1, 2024

zhenye234 / X-Codec-2.0

Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 209 21 Updated Feb 24, 2025

zhenye234 / LLaSA_training

LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis

Python 420 31 Updated Feb 14, 2025

k2-fsa / libriheavy

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Python 191 11 Updated Sep 10, 2024

sontek / snowmachine

A python script that allows your terminal to snow.

Python 592 34 Updated Dec 23, 2024

showlab / ShowUI

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,040 62 Updated Mar 4, 2025

amazon-science / chronos-forecasting

Chronos: Pretrained Models for Probabilistic Time Series Forecasting

Python 3,005 331 Updated Feb 19, 2025

HermanMartinus / bearblog

Free, no-nonsense, super fast blogging.

CSS 3,143 97 Updated Mar 3, 2025

InioX / matugen

A material you color generation tool

Rust 449 19 Updated Jan 3, 2025

Robinysh / python-template

Shell 2 Updated May 20, 2024

HydroXai / pii-masker

PII Masker is an open-source tool for protecting sensitive data by automatically detecting and masking PII using advanced AI, powered by DeBERTa-v3. It provides high-precision detection, scalable p…

Jupyter Notebook 80 3 Updated Dec 3, 2024

Kaisia-Estrel / activate-linux

Windows' "Active Windows" watermark for Linux

Rust 30 1 Updated Nov 28, 2024

alirezadir / Machine-Learning-Interviews

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 5,635 984 Updated Feb 24, 2025

lifeiteng / OmniSenseVoice

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 802 32 Updated Dec 27, 2024

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 4,734 427 Updated Jan 8, 2025

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,184 275 Updated Nov 5, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,395 1,131 Updated Mar 1, 2025

jishengpeng / WavTokenizer

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,038 75 Updated Mar 2, 2025

linkedin / Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 4,551 274 Updated Mar 3, 2025

prometheus-eval / prometheus-eval

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 877 55 Updated Jan 7, 2025

facebookresearch / sam2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,353 1,477 Updated Dec 25, 2024