Skip to content
View ZigeW's full-sized avatar

Block or report ZigeW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Survey on Data Selection for Language Models

225 12 Updated Oct 13, 2024

Summarize existing representative LLMs text datasets.

1,248 127 Updated Mar 25, 2025

A quick guide (especially) for trending instruction finetuning datasets

3,011 196 Updated Nov 28, 2023

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

756 54 Updated May 8, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 11,401 882 Updated Mar 11, 2025

Collection of training data management explorations for large language models

322 30 Updated Aug 2, 2024

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

1,114 58 Updated Jan 4, 2024

✨✨Latest Advances on Multimodal Large Language Models

14,834 950 Updated Apr 22, 2025

translation of VHL repo in paddle

Python 25 Updated Jun 28, 2023
Python 24 Updated Jun 28, 2023

The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]

C 19 3 Updated Aug 4, 2022