paddy0914

fengwei paddy0914

11 followers · 39 following

Stars

karolpiczak / ESC-50

ESC-50: Dataset for Environmental Sound Classification

Python 1,454 291 Updated Mar 20, 2024

shunk031 / simple-aesthetics-predictor

CLIP-based aesthetics predictor inspired by the interface of 🤗 huggingface transformers.

Python 33 Updated Jun 14, 2024

tangyipeng100 / Modelscope_sora_solution5

Modelscope-Sora挑战赛第五名参赛方案

Python 10 1 Updated Sep 12, 2024

YuanGongND / whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 348 28 Updated Feb 21, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,398 93 Updated Aug 13, 2024

VikParuchuri / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 19,280 1,150 Updated Jan 15, 2025

bytedance / LatentSync

Taming Stable Diffusion for Lip Sync!

Python 1,807 204 Updated Jan 15, 2025

fireicewolf / wd-llm-caption-cli

A Python base cli tool for caption images with WD series, Joy-caption-pre-alpha,meta Llama 3.2 Vision Instruct and Qwen2 VL Instruct models.

Python 30 6 Updated Nov 10, 2024

OpenGVLab / InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Python 1,557 97 Updated Jan 14, 2025

ArrowLuo / CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Python 904 125 Updated Apr 12, 2024

Kedreamix / Linly-Dubbing

智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能，语言无界”

Jupyter Notebook 2,011 190 Updated Aug 23, 2024

UKPLab / sentence-transformers

State-of-the-Art Text Embeddings

Python 15,765 2,525 Updated Jan 10, 2025

YuanGongND / cav-mae

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 244 23 Updated Mar 20, 2024

OpenGVLab / InternImage

[CVPR 2023 Highlight] InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions

Python 2,575 241 Updated Jan 16, 2025

LAION-AI / CLAP

Contrastive Language-Audio Pretraining

Python 1,500 149 Updated Nov 21, 2024

tiny-craft / tiny-rdm

Tiny RDM (Tiny Redis Desktop Manager) - A modern, colorful, super lightweight Redis GUI client for Mac, Windows, and Linux.

Vue 9,600 481 Updated Jan 8, 2025

MaartenGr / KeyBERT

Minimal keyword extraction with BERT

Python 3,656 357 Updated Jul 16, 2024

towhee-io / towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.

Python 3,286 255 Updated Oct 18, 2024

fawazsammani / clip-interpret-mutual-knowledge

Interpreting and Analyzing CLIP's Zero-Shot Image Classification via Mutual Knowledge, NeurIPS 2024

Jupyter Notebook 8 1 Updated Dec 5, 2024

milvus-io / pymilvus

Python SDK for Milvus.

Python 1,068 339 Updated Jan 16, 2025

milvus-io / milvus

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 31,843 3,000 Updated Jan 16, 2025

zilliztech / attu

The GUI for Milvus

TypeScript 1,459 133 Updated Jan 14, 2025

oobabooga / text-generation-webui

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 41,620 5,421 Updated Jan 15, 2025

QwenLM / Qwen2-VL

Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Python 4,211 257 Updated Jan 11, 2025

pawelsalawa / sqlitestudio

A free, open source, multi-platform SQLite database manager.

C 5,612 601 Updated Jan 16, 2025

qiye45 / wechatDownload

微信公众号文章批量下载工具，支持图片、评论下载，支持保存html/mhtml/md/pdf/docx文件

HTML 3,653 394 Updated Jan 15, 2025

CSAILVision / places365

The Places365-CNNs for Scene Classification

Python 1,945 537 Updated Jul 16, 2020

wentaozhu / AutoShot

AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023

Python 122 14 Updated Apr 18, 2023

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 1,504 126 Updated Dec 24, 2024

opencv / opencv

Open Source Computer Vision Library

C++ 80,098 55,938 Updated Jan 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly