RLHF implementation details of OAI's 2019 codebase

Python 183 9 Updated Jan 14, 2024

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 10,878 1,389 Updated Feb 1, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 4,123 379 Updated Mar 3, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 969 49 Updated Feb 28, 2025

A very simple GRPO implementation for reproducing R1-like LLM thinking.

Python 627 45 Updated Feb 28, 2025

Fully open reproduction of DeepSeek-R1

Python 21,949 1,959 Updated Mar 3, 2025

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Python 460 28 Updated Feb 21, 2025
Jupyter Notebook 1,142 237 Updated Aug 1, 2024

A framework for few-shot evaluation of language models.

Python 8,072 2,161 Updated Mar 3, 2025

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,434 120 Updated Apr 17, 2024

Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train Dataset for table understanding and develop a generalist tab…

Python 188 7 Updated Sep 27, 2024

A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.

163 9 Updated Sep 9, 2024

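An inference library for sequence-based table recognition algorithms, integrating table recognition algorithms such as PP-Structure and modelscope.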

Python 220 18 Updated Jan 10, 2025

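Table structure recognition algorithm for document images, from the Tonghuashun (同花顺) Algorithm Challenge, Spring 2022 season (February to April).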

Python 25 6 Updated Mar 21, 2022

Collects the best currently open-source table recognition models, improves their pre-processing and post-processing, and converts the models to ONNX.

Python 543 47 Updated Feb 23, 2025

Detect and extract table regions from images in various scenarios, and correct perspective and rotation issues.

Python 60 1 Updated Dec 10, 2024

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 13,644 1,088 Updated Jan 18, 2025

A Comprehensive Benchmark for Document Parsing and Evaluation

Python 265 22 Updated Feb 25, 2025

Grounded Language-Image Pre-training

Python 2,337 200 Updated Jan 24, 2024

Ultralytics YOLO11 🚀

Python 37,341 7,255 Updated Mar 3, 2025

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 888 66 Updated Jan 16, 2025

Math OCR model that outputs LaTeX and markdown

Python 1,030 82 Updated Jan 29, 2025

Office Automation by Using Python (For Excel, Word, PPT and PDF .....)

Jupyter Notebook 312 146 Updated May 9, 2020

The Math23k dataset for downloading

18 Updated Apr 16, 2022

A LaTeX Template for Dissertation Writing at the University of Electronic Science and Technology of China Since 2024

TeX 181 18 Updated Feb 24, 2025

Large language model and dataset for natural language to first-order logic translation

Jupyter Notebook 50 5 Updated Oct 25, 2023

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,444 897 Updated Jul 1, 2024

[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".

Python 70 2 Updated Jan 14, 2025

[Generative LLM as Verifiers] Inference acceleration: early stopping + KV cache reuse + parallel inference, improving inference efficiency by dozens of times.

Jupyter Notebook 5 Updated Nov 8, 2024