Skip to content
View MIracleyin's full-sized avatar
  • Tencent
  • China
  • 10:39 (UTC +08:00)

Block or report MIracleyin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

ATP Tennis Rankings, Results, and Stats

1,069 625 Updated Dec 30, 2024

[CVPR 2025] "Towards Universal Soccer Video Understanding".

Python 99 4 Updated Mar 1, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,558 652 Updated Mar 6, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,031 600 Updated Mar 6, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,342 532 Updated Mar 6, 2025

Everything about the SmolLM2 and SmolVLM family of models

Python 1,981 111 Updated Feb 20, 2025

Pretraining code for a large-scale depth-recurrent language model

Python 662 54 Updated Mar 5, 2025

A minimal, easy-to-read PyTorch reimplementation of the Qwen2 series—without the complexity of larger frameworks.

Python 11 Updated Jan 18, 2025

Editor with LLM generation tree exploration

C++ 64 4 Updated Feb 12, 2025

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 550 39 Updated Feb 14, 2025

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Python 2,861 236 Updated Mar 4, 2025

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

Python 289 22 Updated Aug 2, 2022

Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs

Python 616 39 Updated Jan 28, 2025

小红书 (xiaohongshu, rednote) ai运营助手,包括小红书风格内容(包含图片)的生成和自动发布两部分,其中自动发布利用selenium实现RPA模拟点击,将生成内容和封面图和内容图自动发布

Python 445 62 Updated Feb 7, 2025

An implementation of the TrueSkill rating system for Python

Python 759 118 Updated Aug 30, 2023

A Comprehensive Benchmark for Document Parsing and Evaluation

Python 270 23 Updated Feb 25, 2025

Align Anything: Training All-modality Model with Feedback

Python 2,643 354 Updated Mar 2, 2025

The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 144 16 Updated Jan 18, 2025

This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training data, instruction fine-tuning data, and In-Context learning …

30 1 Updated Oct 7, 2024

Deep learning software for colorizing black and white images with a few clicks.

Python 2,703 447 Updated Jul 29, 2022

Style-Text data synthesis tool

Python 42 Updated Dec 9, 2024

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

380 9 Updated Jan 17, 2025

Synthetic data generation pipelines for text-rich images.

Python 41 9 Updated Mar 1, 2025

💥 Blazing fast terminal file manager written in Rust, based on async I/O.

Rust 22,829 500 Updated Mar 6, 2025

Code for the Molmo Vision-Language Model

Python 315 20 Updated Dec 12, 2024

Python tool for converting files and office documents to Markdown.

Python 39,590 1,840 Updated Mar 6, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,417 591 Updated Mar 4, 2025

ETL, Analytics, Versioning for Unstructured Data

Python 2,402 106 Updated Mar 6, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,989 241 Updated Mar 7, 2025
Next