Skip to content
View ryh95's full-sized avatar
🎯
Focusing
🎯
Focusing
  • UESTC
  • Chengdu

Organizations

@JulyEdu-PaperTranslation

Block or report ryh95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 106 6 Updated Dec 7, 2024

Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"

Python 611 56 Updated Feb 24, 2025

Scalable data pre processing and curation toolkit for LLMs

Jupyter Notebook 877 123 Updated Apr 18, 2025

Fully open reproduction of DeepSeek-R1

Python 24,016 2,196 Updated Apr 18, 2025

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

2,412 152 Updated Apr 18, 2025

Machine Learning Engineering Open Book

Python 13,444 816 Updated Apr 7, 2025

A lightweight script for processing HTML page to markdown format with support for code blocks

HTML 79 3 Updated Apr 14, 2024

Heuristic filtering framework for RefineCode

Python 60 8 Updated Mar 13, 2025

The Open Cookbook for Top-Tier Code Large Language Model

Python 1,676 103 Updated Dec 8, 2024

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 8,230 837 Updated Apr 2, 2025

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 55,161 5,423 Updated Apr 5, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,990 4,339 Updated Apr 18, 2025

🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型

Python 5,963 551 Updated Jun 11, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,160 1,047 Updated Apr 17, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,346 659 Updated Apr 17, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,943 4,049 Updated Jul 17, 2024
Python 28 3 Updated Feb 17, 2024

Ongoing research training transformer models at scale

Python 12,117 2,717 Updated Apr 19, 2025

Code and Models for the paper "End-to-End Training of Multi-Document Reader and Retriever for Open-Domain Question Answering" (NeurIPS 2021)

Python 109 10 Updated Apr 18, 2022

Distributional Generalization in NLP. A roadmap.

Jupyter Notebook 88 3 Updated Dec 12, 2022

Awesome papers on Language-Model-as-a-Service (LMaaS)

556 32 Updated May 14, 2024

Code for the book "High Performance Python 2e" by Micha Gorelick and Ian Ozsvald with OReilly

Python 429 143 Updated Jan 18, 2023
Python 490 74 Updated Apr 26, 2021

Authors' implementation of EMNLP-IJCNLP 2019 paper "Answering Complex Open-domain Questions Through Iterative Query Generation"

Python 195 26 Updated Oct 29, 2019

This is an implementation of Hearst patterns, for finding hyponyms, written in Python.

Python 87 29 Updated Aug 8, 2022

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

Python 1,781 310 Updated Apr 6, 2023

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 36,627 6,229 Updated Apr 19, 2025

Python implementation of algorithms from Russell And Norvig's "Artificial Intelligence - A Modern Approach"

Jupyter Notebook 8,315 3,908 Updated Aug 4, 2024

An Open-Source Package for Textual Adversarial Attack.

Python 723 127 Updated Jul 20, 2023
Next