Yin Zhang MIracleyin

Scientific NLP, Science of Science, Recommendation Systems, OS, Rust

43 followers · 158 following

Tencent
China
10:39 (UTC +08:00)

Achievements

Lists (17)

Course

Data analysis

Dataset

explainability

AI explainability code

interesting

language

latex

LLM-eval

eval tools for LLM

LLM-tool

LLM-tuning

tuning repo for LLM

MLLM

Paper Idea

Paper implementation

Powerful library

12 repositories

rice

template

GitHub code template

Useful tools

Useful tools

12 repositories

Starred repositories

JeffSackmann / tennis_atp

ATP Tennis Rankings, Results, and Stats

1,069 625 Updated Dec 30, 2024

jyrao / UniSoccer

[CVPR 2025] "Towards Universal Soccer Video Understanding".

Python 99 4 Updated Mar 1, 2025

deepseek-ai / 3FS

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 7,558 652 Updated Mar 6, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 7,031 600 Updated Mar 6, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 5,342 532 Updated Mar 6, 2025

eddycmu / demystify-long-cot

Python 233 16 Updated Feb 6, 2025

huggingface / smollm

Everything about the SmolLM2 and SmolVLM family of models

Python 1,981 111 Updated Feb 20, 2025

seal-rg / recurrent-pretraining

Pretraining code for a large-scale depth-recurrent language model

Python 662 54 Updated Mar 5, 2025

Emericen / tiny-qwen

A minimal, easy-to-read PyTorch reimplementation of the Qwen2 series—without the complexity of larger frameworks.

Python 11 Updated Jan 18, 2025

blackhole89 / autopen

Editor with LLM generation tree exploration

C++ 64 4 Updated Feb 12, 2025

Yuliang-Liu / MultimodalOCR

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 550 39 Updated Feb 14, 2025

PriorLabs / TabPFN

⚡ TabPFN: Foundation Model for Tabular Data ⚡

Python 2,861 236 Updated Mar 4, 2025

facebookresearch / sscd-copy-detection

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

Python 289 22 Updated Aug 2, 2022

NVlabs / EAGLE

Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs

Python 616 39 Updated Jan 28, 2025

BetaStreetOmnis / xhs_ai_publisher

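Xiaohongshu (小红书, RedNote) AI operations assistant with two parts: generation of Xiaohongshu-style content (including images) and automatic publishing. Automatic publishing uses Selenium for RPA-style simulated clicks to post the generated content, cover image, and content images.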

Python 445 62 Updated Feb 7, 2025

sublee / trueskill

An implementation of the TrueSkill rating system for Python

Python 759 118 Updated Aug 30, 2023

opendatalab / OmniDocBench

A Comprehensive Benchmark for Document Parsing and Evaluation

Python 270 23 Updated Feb 25, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 2,643 354 Updated Mar 2, 2025

DAMO-NLP-SG / multimodal_textbook

The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

Python 144 16 Updated Jan 18, 2025

jinbo0906 / Awesome-MLLM-Datasets

This project aims to collect and collate various datasets for multimodal large model training, including but not limited to pre-training data, instruction fine-tuning data, and In-Context learning …

30 1 Updated Oct 7, 2024

junyanz / interactive-deep-colorization

Deep learning software for colorizing black and white images with a few clicks.

Python 2,703 447 Updated Jul 29, 2022

PFCCLab / StyleText

Style-Text data synthesis tool

Python 42 Updated Dec 9, 2024

LMM101 / Awesome-Multimodal-Next-Token-Prediction

[Survey] Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

380 9 Updated Jan 17, 2025

allenai / pixmo-docs

Synthetic data generation pipelines for text-rich images.

Python 41 9 Updated Mar 1, 2025

sxyazi / yazi

💥 Blazing fast terminal file manager written in Rust, based on async I/O.

Rust 22,829 500 Updated Mar 6, 2025

allenai / molmo

Code for the Molmo Vision-Language Model

Python 315 20 Updated Dec 12, 2024

microsoft / markitdown

Python tool for converting files and office documents to Markdown.

Python 39,590 1,840 Updated Mar 6, 2025

QwenLM / Qwen2.5-VL

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,417 591 Updated Mar 4, 2025

iterative / datachain

ETL, Analytics, Versioning for Unstructured Data

Python 2,402 106 Updated Mar 6, 2025

NVlabs / VILA

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 2,989 241 Updated Mar 7, 2025

Starred topics

citation-recommendation

causal-inference

LaTeX