Stars
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
TypeChat is a library that makes it easy to build natural language interfaces using types.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
Promptflow experimentation framework
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Supercharge Your LLM Application Evaluations 🚀
LlamaIndex is the leading framework for building LLM-powered agents over your data.
A guidance language for controlling large language models.
GenAIOps with Prompt Flow is a "GenAIOps template and guidance" to help you build LLM-infused apps using Prompt Flow. It offers a range of features including Centralized Code Hosting, Lifecycle Man…
ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors (TPAMI2023)
You like pytorch? You like micrograd? You love tinygrad! ❤️
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
[CVPR 2022] Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding
MAT: Mask-Aware Transformer for Large Hole Image Inpainting
A curated list of image inpainting and video inpainting papers and resources
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
📈 Implementation of eight evaluation metrics to access the similarity between two images. The eight metrics are as follows: RMSE, PSNR, SSIM, ISSM, FSIM, SRE, SAM, and UIQ.