Stars
Anthropic's educational courses
A Python CLI script to create Entity Relationship Diagrams from JSON/YAML code.
#1 Locally hosted web application that allows you to perform various operations on PDF files
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
The official implementation of 3DDFA_V3 in CVPR2024 (Highlight).
Collection of audio-focused loss functions in PyTorch
A better PyTorch-based implementation of the mean structural similarity. Differentiable, simpler SSIM and MS-SSIM.
Transform datasets at scale. Optimize datasets for fast AI model training.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
A Python framework for high performance GPU simulation and graphics
Code to easily try 30 (and growing) different image matching methods
Companion source code and slides (PPT) for the video tutorial course "Deep Learning and PyTorch: A Hands-On Introduction"
Supporting PyTorch models with the Google AI Edge TFLite runtime.
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
tsl: a PyTorch library for processing spatiotemporal data.
A PyTorch-based End-to-End Predict-then-Optimize Library for Linear and Integer Programming
Convert PDF to markdown + JSON quickly with high accuracy
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
A clean version (wash list) of MS-Celeb-1M face dataset, containing 6,464,018 face images of 94,682 celebrities