-
faster-whisper Public
Forked from SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Python MIT License Updated Dec 12, 2024 -
FunASR Public
Forked from modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Python Other Updated Nov 21, 2024 -
wenet Public
Forked from wenet-e2e/wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Python Apache License 2.0 Updated Nov 8, 2024 -
HivisionIDPhotos Public
Forked from Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photo tool. A lightweight algorithm for creating AI ID photos.
Python Updated Sep 4, 2024 -
unilm Public
Forked from microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License Updated Aug 28, 2024 -
VILA Public
Forked from NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Python Apache License 2.0 Updated Aug 22, 2024 -
segment-anything-2 Public
Forked from facebookresearch/sam2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Jupyter Notebook Apache License 2.0 Updated Jul 30, 2024 -
UCB-web-video-recorder-app-main Public
Forked from sbarrington/UCB-web-video-recorder-app-main
Repository for the UCB Audio_Visual Recording Web Application
TypeScript MIT License Updated Jul 23, 2024 -
HOISDF Public
Forked from amathislab/HOISDF
[CVPR 2024] HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields
Python Updated Jul 22, 2024 -
logit-standardization-KD Public
Forked from sunshangquan/logit-standardization-KD
[CVPR 2024 Highlight] Logit Standardization in Knowledge Distillation
Jupyter Notebook Updated Jun 24, 2024 -
HoT Public
Forked from NationalGAILab/HoT
[CVPR 2024 🔥] Official implementation of the paper "⏳ Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation"
Python MIT License Updated Jun 20, 2024 -
nanoGPT Public
Forked from karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License Updated Jun 8, 2024 -
yolov10 Public
Forked from THU-MIG/yolov10
Python GNU Affero General Public License v3.0 Updated May 24, 2024 -
auto_avsr Public
Forked from mpc001/auto_avsr
Auto-AVSR: Lip-Reading Sentences Project
Python Apache License 2.0 Updated Apr 16, 2024 -
GitHubDaily Public
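Forked from GitHubDaily/GitHubDaily
Regularly shares high-quality, interesting, and practical open-source technical tutorials, developer tools, programming websites, and tech news from GitHub. A list of cool, interesting projects on GitHub.
Updated Apr 15, 2024 -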
MobileVLM Public
Forked from Meituan-AutoML/MobileVLM
Strong and Open Vision Language Assistant for Mobile Devices
Python Apache License 2.0 Updated Apr 15, 2024 -
DART Public
Forked from DART2022/DART
DART: Articulated Hand Model with Diverse Accessories and Rich Textures (NeurIPS 2022 - Datasets and Benchmarks Track)
Python Updated Apr 1, 2024 -
MoneyPrinterTurbo Public
Forked from harry0703/MoneyPrinterTurbo
Generate short videos with one click using large models
Python MIT License Updated Mar 24, 2024 -
Semi-Supervised Action Recognition with Temporal Contrastive Learning
Python Updated Mar 22, 2024 -
panoramic-localization Public
Forked from 82magnolia/panoramic-localization
Panoramic localization library containing PyTorch implementations of various panoramic localization algorithms including PICCOLO (ICCV 2021), CPO (ECCV 2022), LDL (ICCV 2023) and FGPL (CVPR 2024).
Python Apache License 2.0 Updated Mar 21, 2024 -
FaceX-Zoo Public
Forked from JDAI-CV/FaceX-Zoo
A PyTorch Toolbox for Face Recognition
Python Other Updated Feb 16, 2024 -
mmagic Public
Forked from open-mmlab/mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image genera…
Jupyter Notebook Apache License 2.0 Updated Dec 18, 2023 -
TinyNeuralNetwork Public
Forked from alibaba/TinyNeuralNetwork
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
Python MIT License Updated Jul 11, 2023 -
lite.ai.toolkit Public
Forked from DefTruth/lite.ai.toolkit
🛠 A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN. YOLOv5, YOLOX, YOLOP, YOLOv6, YOLOR, MODNet, YOLOX, YOLOv7, YOLOv8. MNN, NCNN, TNN, ONNXRuntime.
C++ GNU General Public License v3.0 Updated Jun 4, 2023 -
SadTalker-Video-Lip-Sync Public
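Forked from Zz-ww/SadTalker-Video-Lip-Sync
This project implements Wav2lip video lip synthesis based on SadTalkers. It generates speech-driven lip shapes from a video file and applies a configurable enhancement to the synthesized lip (face) region to improve the clarity of the generated lips. It uses the DAIN deep-learning frame-interpolation algorithm to add frames to the generated video, filling in the motion transitions between frames so that the synthesized lips look smoother, more realistic, and more natural.
Python Updated Jun 4, 2023 -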
super-gradients Public
Forked from Deci-AI/super-gradients
Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
Jupyter Notebook Apache License 2.0 Updated May 19, 2023 -
ultralytics Public template
Forked from ultralytics/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite
Python GNU Affero General Public License v3.0 Updated May 9, 2023 -
YOLOv6 Public
Forked from meituan/YOLOv6
YOLOv6: a single-stage object detection framework dedicated to industrial applications.
-
Semantic-Segment-Anything Public
Forked from fudan-zvg/Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Python Apache License 2.0 Updated Apr 24, 2023