Stars
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
A generative world for general-purpose robotics & embodied AI learning.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Humpback Whale Identification
这是一个arcface-pytorch的源码,可以用于训练自己的模型。
State-of-the-art 2D and 3D Face Analysis Project
Pytorch0.4.1 codes for InsightFace
[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。
Open-sourced codes, IAD vision-language datasets and pre-trained checkpoints for Myriad.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Prompt, run, edit, and deploy full-stack web applications
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…
Tools for merging pretrained large language models.
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
The model, data and code for the visual GUI Agent SeeClick
official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
100+ Chinese Word Vectors 上百种预训练中文词向量
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.