Highlights
Stars
🦜🔗 Build context-aware reasoning applications
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
🔊 Text-Prompted Generative Audio Model
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A simple screen parsing tool towards pure vision based GUI agent
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A guidance language for controlling large language models.
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
PyTorch code and models for the DINOv2 self-supervised learning method.
Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
QLoRA: Efficient Finetuning of Quantized LLMs
Multi-Joint dynamics with Contact. A general purpose physics simulator.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
A Bulletproof Way to Generate Structured JSON from Language Models
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Machine Learning University: Accelerated Computer Vision Class
Cool experiments at the intersection of Computer Vision and Sports ⚽🏃
Improving transcription performance of OpenAI Whisper for CPU based deployment