yunbin

yunbin

4 followers · 4 following

Lists (14)

Sort

Starred repositories

simplescaling / s1

s1: Simple test-time scaling

Python 5,175 583 Updated Feb 13, 2025

deepseek-ai / DeepSeek-R1

73,854 9,525 Updated Feb 8, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 19,495 1,651 Updated Feb 13, 2025

NVIDIA / nv-ingest

NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other enterprise documents into metadata and text to embed into retri…

Python 2,490 214 Updated Feb 12, 2025

pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python 6,446 566 Updated Feb 12, 2025

piotrkawa / audio-deepfake-source-tracing

Baselines for IS25 Source Tracing Special Session

Python 22 1 Updated Jan 3, 2025

deepseek-ai / DeepSeek-V3

Python 83,826 13,413 Updated Feb 8, 2025

qinnzou / Gait-Recognition-Using-Smartphones

Deep Learning-Based Gait Recognition Using Smartphones in the Wild

Jupyter Notebook 110 45 Updated Jan 31, 2024

pkmandke / Human-Posture-Dataset

Accelerometer sensor data for Human Posture Recognition

4 Updated Dec 17, 2022

qiqitao77 / Awesome-Comprehensive-Deepfake-Detection

96 9 Updated Jan 28, 2025

DAMO-DI-ML / NeurIPS2023-One-Fits-All

The official code for "One Fits All: Power General Time Series Analysis by Pretrained LM (NeurIPS 2023 Spotlight)"

Python 507 71 Updated Jan 8, 2024

thuml / TimesNet

About Code release for "TimesNet: Temporal 2D-Variation Modeling for General Time Series Analysis" (ICLR 2023), https://openreview.net/pdf?id=ju_Uqw384Oq

780 70 Updated Apr 2, 2024

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,841 825 Updated Feb 12, 2025

ZZUFaceBookDL / GTN

Python 144 25 Updated May 9, 2023

huckiyang / Voice2Series-Reprogramming

ICML 21 - Voice2Series: Adversarial Reprogramming Acoustic Models for Time Series Classification

TypeScript 70 12 Updated Jun 12, 2024

gzerveas / mvts_transformer

Multivariate Time Series Transformer, public version

Python 803 177 Updated Aug 27, 2023

qingsongedu / Awesome-TimeSeries-SpatioTemporal-LM-LLM

A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.

981 74 Updated Dec 22, 2024

qingsongedu / time-series-transformers-review

A professionally curated list of awesome resources (paper, code, data, etc.) on transformers in time series.

2,630 252 Updated Aug 8, 2024

emadeldeen24 / TS-TCC

[IJCAI-21] "Time-Series Representation Learning via Temporal and Contextual Contrasting"

Python 398 102 Updated Mar 31, 2024

mims-harvard / TFC-pretraining

Self-supervised contrastive learning for time series via time-frequency consistency

Python 458 84 Updated May 7, 2024

FL33TW00D / whisper-turbo

Cross-Platform, GPU Accelerated Whisper 🏎️

TypeScript 1,773 81 Updated Feb 27, 2024

HKUDS / LightRAG

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 11,759 1,638 Updated Feb 12, 2025

LinkSoul-AI / LLaSM

第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验，同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。

Python 542 55 Updated Sep 11, 2023

0nutation / SpeechGPT

SpeechGPT Series: Speech Large Language Models

Python 1,342 89 Updated Jul 22, 2024

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

881 57 Updated Feb 11, 2025

Voice-Privacy-Challenge / Voice-Privacy-Challenge-2024

Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software

Python 48 6 Updated Jan 30, 2025

NeoVertex1 / SuperPrompt

SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.

5,799 537 Updated Dec 1, 2024

brentspell / torch-yin

Yin pitch estimator in PyTorch

Python 114 7 Updated Nov 7, 2022

hrnoh24 / stream-vc

An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)

Python 117 7 Updated Jul 30, 2024

microsoft / Phi-3CookBook

This is a Phi Family of SLMs book for getting started with Phi Models. Phi a family of open sourced AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 2,727 327 Updated Feb 11, 2025

yunbin

Lists (14)

Time Series

speaker recognition

fiance

watermarking

LLM

AI

ML

forced alignment

w2c

cs

cv

speech

image-video forensic

audio forensic

Starred repositories

anomaly-localization