Stars
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
A Python package for fast and robust Image Stitching
binlin1209 / stitching
Forked from OpenStitching/stitchingA Python package for fast and robust Image Stitching
FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
这是一个faster-rcnn的keras实现的库,可以利用voc数据集格式的数据进行训练。