wyc2015fq

Yangchun W wyc2015fq

https://blog.csdn.net/wyc2015fq

1 follower · 4 following

Stars

yh-hust / PDF-Wukong

【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

115 4 Updated Oct 18, 2024

2U1 / Qwen2-VL-Finetune

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

Python 494 55 Updated Mar 21, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,960 631 Updated Mar 7, 2025

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,030 1,374 Updated Mar 3, 2025

flyingby / Awesome-Deepfake-Generation-and-Detection

A Survey on Deepfake Generation and Detection

433 20 Updated Feb 21, 2025

Kwai-Kolors / Kolors

Kolors Team

Python 4,286 323 Updated Nov 13, 2024

SpursGoZmy / Table-LLaVA

Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tab…

Python 190 7 Updated Sep 27, 2024

Alpha-VLLM / Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,165 91 Updated Feb 16, 2025

Yuliang-Liu / Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,731 123 Updated Mar 20, 2025

Tencent / HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 4,003 334 Updated Jan 13, 2025

PixArt-alpha / PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,779 87 Updated Oct 31, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 25,734 2,475 Updated Mar 20, 2025

ZiqiaoPeng / SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Python 1,435 168 Updated Aug 28, 2024

NormXU / nougat-latex-ocr

Codebase for fine-tuning / evaluating nougat-based image2latex generation models

Python 144 19 Updated Sep 25, 2024

Duxiaoman-DI / public-achievements-on-CV

成员在ICCV、CVPR等CV顶会发表的论文，在ICDAR等比赛中的成果

1 Updated Jun 21, 2023

zigchang / HumanBench

Forked from OpenGVLab/HumanBench

This repo is official implementation of HumanBench (CVPR2023)

Python 2 Updated Apr 12, 2023

Python 1 Updated Nov 10, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yangchun W wyc2015fq

Block or report wyc2015fq

Stars

yh-hust / PDF-Wukong

2U1 / Qwen2-VL-Finetune

QwenLM / Qwen2.5-VL

OpenBMB / MiniCPM-o

flyingby / Awesome-Deepfake-Generation-and-Detection

Kwai-Kolors / Kolors

SpursGoZmy / Table-LLaVA

Alpha-VLLM / Lumina-T2X

Yuliang-Liu / Monkey

Tencent / HunyuanDiT

PixArt-alpha / PixArt-sigma

hpcaitech / Open-Sora

ZiqiaoPeng / SyncTalk

NormXU / nougat-latex-ocr

Duxiaoman-DI / public-achievements-on-CV

zigchang / HumanBench

facebookresearch / segment-anything

WenmuZhou / PytorchOCR

WenmuZhou / DBNet.pytorch

chenjun2hao / Bert_OCR.pytorch

detectRecog / CCPD

bleakie / MaskInsightface

Mingtzge / 2019-CCF-BDCI-OCR-MCZJ-OCR-IdentificationIDElement

linglanfeng / CCF2019-OCR

MhLiao / DB

wyc2015fq / DewarpNet

Linzaer / Ultra-Light-Fast-Generic-Face-Detector-1MB

deepcam-cn / Face-Anti-spoofing.pytorch

xavysp / Tensorflow-HED-RCF

AlexanderParkin / ChaLearn_liveness_challenge