Starred repositories
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Faster Whisper transcription with CTranslate2
Fast inference engine for Transformer models
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
A self-hosted API that takes a URL and returns a file with browser screenshots.
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
DEIM: DETR with Improved Matching for Fast Convergence
Model for recasing and repunctuating ASR transcripts
Tools to download and cleanup Common Crawl data
Code for the paper "Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation", published at NeurIPS 2022
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python
Perforator is a cluster-wide continuous profiling tool designed for large data centers
Deezer source separation library including pretrained models.
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
Algorithm designed to match strings by similarity
A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features 🔥
Janus-Series: Unified Multimodal Understanding and Generation Models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
A fast and lightweight Python-based CTC beam search decoder for speech recognition.
Code for CLIB-FIQA: Face Image Quality Assessment with Confidence Calibration