  • Shan


Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"

Python 106 8 Updated Feb 11, 2025

Code for the paper "Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering"

Python 6 1 Updated Aug 13, 2024

Code for Robust Fine-tuning (RbFT)

Python 6 2 Updated Jan 31, 2025

Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models

1,106 73 Updated Feb 24, 2025

Fine-tuning and deployment of DeepSeek-R1-Distill-Qwen-32B on 2 million medical data records

Jupyter Notebook 105 24 Updated Feb 25, 2025

This is the official repository for Auto-RAG.

Python 200 18 Updated Jan 10, 2025
Python 62 Updated Feb 28, 2025

Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This repository contains the code for the experiments in the paper.

Cuda 54 2 Updated Oct 31, 2024

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

Python 204 18 Updated Feb 19, 2025

SOTA RL fine-tuning solution for advanced math reasoning of LLM

Python 83 3 Updated Mar 5, 2025

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,665 355 Updated Dec 7, 2024

Robust recipes to align language models with human and AI preferences

Python 5,034 432 Updated Nov 21, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 47,536 5,046 Updated Jan 22, 2025

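Free ChatGPT API key: a free ChatGPT API with GPT-4 API support (free), and a free forwarding API for ChatGPT usable within China, with direct connection and no proxy required. It can be used with software and plugins such as ChatBox, greatly reducing API usage costs, and allows unrestricted chat from within China.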

Python 28,008 2,056 Updated Feb 14, 2025

Rethinking Chain-of-Thought from the Perspective of Self-Training

Python 7 Updated Feb 15, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,090 5,276 Updated Mar 5, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,539 365 Updated Feb 26, 2025

A library for advanced large language model reasoning

Python 2,007 177 Updated Feb 21, 2025

This repository contains resources on reinforcement learning from human feedback for math reasoning.

6 Updated Aug 11, 2023

[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".

Python 70 2 Updated Jan 14, 2025

Open-source code for the paper "Retrieval Head Mechanistically Explains Long-Context Factuality"

Python 175 17 Updated Aug 2, 2024
Python 25 3 Updated Mar 5, 2025

Litex: A Minimalist Proof Assistant

Go 68 Updated Mar 5, 2025
Python 22 2 Updated Mar 15, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,935 469 Updated Jan 3, 2025

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

Python 27,378 2,107 Updated Mar 4, 2025

Foundations of Large Models: learn the fundamentals of large models in one read

4,146 370 Updated Feb 24, 2025

"Dive into LLMs" (动手学大模型): a series of hands-on programming tutorials

4,516 407 Updated Sep 20, 2024

A curated list of LLM interpretability-related material: tutorials, libraries, surveys, papers, blogs, etc.

204 8 Updated Oct 17, 2024