OCR, layout analysis, reading order, table recognition in 90+ languages

Python 16,790 1,094 Updated Mar 14, 2025

Visual Studio Code

TypeScript 168,809 31,047 Updated Mar 15, 2025

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

Python 1,704 137 Updated Mar 14, 2025

The reinforcement learning training code for AgiBot X1.

Python 1,372 433 Updated Oct 23, 2024

The inference module for AgiBot X1.

C++ 1,533 469 Updated Nov 22, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,171 629 Updated Feb 10, 2025

"LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 12,788 1,815 Updated Mar 14, 2025

Implementation of AlphaFold 3 from Google DeepMind in PyTorch

Python 1,383 173 Updated Jan 22, 2025

Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive, semi-automatic image annotation tool.

Python 1,470 152 Updated Mar 12, 2025

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,493 1,052 Updated Mar 14, 2025

[CVPR 2024] Code for SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes

Python 547 29 Updated Mar 17, 2024

open Multiple View Geometry library. Basis for 3D computer vision and Structure from Motion.

C++ 5,899 1,686 Updated Mar 8, 2025

Algorithm to texture 3D reconstructions from multi-view stereo images

C++ 1,011 346 Updated Jul 25, 2023

Atlas: End-to-End 3D Scene Reconstruction from Posed Images

Python 1,834 220 Updated Apr 6, 2022

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

Python 2,127 304 Updated Oct 4, 2023

Official implementation of "Splatter Image: Ultra-Fast Single-View 3D Reconstruction" (CVPR 2024)

Python 924 72 Updated Aug 17, 2024

This repository introduces an approach to improve the efficiency of unsupervised MVS networks. We achieve this by eliminating the need for a separate cost volume regularization step for neural rend…

Python 2 Updated Aug 23, 2024

Official Pytorch Implementation of SPECTRE: Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos

Python 270 23 Updated Aug 16, 2024

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,877 418 Updated Mar 5, 2025

open Multi-View Stereo reconstruction library

C++ 337 86 Updated Aug 14, 2023

C++ Library Manager for Windows, Linux, and MacOS

CMake 24,256 6,715 Updated Mar 15, 2025

open Multi-View Stereo reconstruction library

C++ 3,520 924 Updated Mar 13, 2025

A self-supervised learning framework for audio-visual speech

Python 883 138 Updated Dec 7, 2023

🔥 2D and 3D face alignment library built using PyTorch

Python 7,233 1,357 Updated Aug 30, 2024

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,527 239 Updated Mar 14, 2025

Ultralytics YOLO11 🚀

Python 37,971 7,361 Updated Mar 15, 2025

A generative speech model for daily dialogue.

Python 35,098 3,793 Updated Mar 14, 2025

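An LLM-based intelligent customer-service chat tool. It supports integration with platforms such as WeChat, Pinduoduo, Qianniu, Bilibili, Douyin enterprise accounts, Douyin, Doudian, Weibo chat, Xiaohongshu professional account operations, Xiaohongshu, and Zhihu. It can use GPT3.5/GPT4.0/懒人百宝箱 (more platforms will be supported later), handles text, voice, and images, accesses external resources such as the operating system and the internet through plugins, and supports building custom enterprise AI applications on your own knowledge base.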

TypeScript 2,883 327 Updated Dec 5, 2024