Skip to content
View drewZZzz6's full-sized avatar

Block or report drewZZzz6

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • Faster Whisper transcription with CTranslate2

    Python MIT License Updated Dec 12, 2024
  • FunASR Public

    Forked from modelscope/FunASR

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

    Python Other Updated Nov 21, 2024
  • wenet Public

    Forked from wenet-e2e/wenet

    Production First and Production Ready End-to-End Speech Recognition Toolkit

    Python Apache License 2.0 Updated Nov 8, 2024
  • ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

    Python Updated Sep 4, 2024
  • unilm Public

    Forked from microsoft/unilm

    Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

    Python MIT License Updated Aug 28, 2024
  • VILA Public

    Forked from NVlabs/VILA

    VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

    Python Apache License 2.0 Updated Aug 22, 2024
  • The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

    Jupyter Notebook Apache License 2.0 Updated Jul 30, 2024
  • Repository for the UCB Audio_Visual Recording Web Application

    TypeScript MIT License Updated Jul 23, 2024
  • HOISDF Public

    Forked from amathislab/HOISDF

    [CVPR 2024] HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields

    Python Updated Jul 22, 2024
  • [CVPR 2024 Highlight] Logit Standardization in Knowledge Distillation

    Jupyter Notebook Updated Jun 24, 2024
  • HoT Public

    Forked from NationalGAILab/HoT

    [CVPR 2024 🔥] Official implementation of the paper "⏳ Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation"

    Python MIT License Updated Jun 20, 2024
  • nanoGPT Public

    Forked from karpathy/nanoGPT

    The simplest, fastest repository for training/finetuning medium-sized GPTs.

    Python MIT License Updated Jun 8, 2024
  • yolov10 Public

    Forked from THU-MIG/yolov10
    Python GNU Affero General Public License v3.0 Updated May 24, 2024
  • auto_avsr Public

    Forked from mpc001/auto_avsr

    Auto-AVSR: Lip-Reading Sentences Project

    Python Apache License 2.0 Updated Apr 16, 2024
  • 坚持分享 GitHub 上高质量、有趣实用的开源技术教程、开发者工具、编程网站、技术资讯。A list cool, interesting projects of GitHub.

    Updated Apr 15, 2024
  • Strong and Open Vision Language Assistant for Mobile Devices

    Python Apache License 2.0 Updated Apr 15, 2024
  • Python Updated Apr 10, 2024
  • DART Public

    Forked from DART2022/DART

    DART: Articulated Hand Model with Diverse Accessories and Rich Textures (NeurIPS 2022 - Datasets and Benchmarks Track)

    Python Updated Apr 1, 2024
  • 利用大模型,一键生成短视频

    Python MIT License Updated Mar 24, 2024
  • TCL Public

    Forked from CVIR/TCL

    Semi-Supervised Action Recognition with Temporal Contrastive Learning

    Python Updated Mar 22, 2024
  • Panoramic localization library containing PyTorch implementations of various panoramic localization algorithms including PICCOLO (ICCV 2021), CPO (ECCV 2022), LDL (ICCV 2023) and FGPL (CVPR 2024).

    Python Apache License 2.0 Updated Mar 21, 2024
  • FaceX-Zoo Public

    Forked from JDAI-CV/FaceX-Zoo

    A PyTorch Toolbox for Face Recognition

    Python Other Updated Feb 16, 2024
  • mmagic Public

    Forked from open-mmlab/mmagic

    OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…

    Jupyter Notebook Apache License 2.0 Updated Dec 18, 2023
  • TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.

    Python MIT License Updated Jul 11, 2023
  • 🛠 A lite C++ toolkit of awesome AI models with ONNXRuntime, NCNN, MNN and TNN. YOLOv5, YOLOX, YOLOP, YOLOv6, YOLOR, MODNet, YOLOX, YOLOv7, YOLOv8. MNN, NCNN, TNN, ONNXRuntime.

    C++ GNU General Public License v3.0 Updated Jun 4, 2023
  • 本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。

    Python Updated Jun 4, 2023
  • Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

    Jupyter Notebook Apache License 2.0 Updated May 19, 2023
  • ultralytics Public template

    Forked from ultralytics/ultralytics

    NEW - YOLOv8 🚀 in PyTorch > ONNX > CoreML > TFLite

    Python GNU Affero General Public License v3.0 Updated May 9, 2023
  • YOLOv6 Public

    Forked from meituan/YOLOv6

    YOLOv6: a single-stage object detection framework dedicated to industrial applications.

    Jupyter Notebook 1 GNU General Public License v3.0 Updated Apr 29, 2023
  • Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

    Python Apache License 2.0 Updated Apr 24, 2023