Skip to content
View MrGGLS's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report MrGGLS

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 280 21 Updated Apr 13, 2025

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Python 820 83 Updated Apr 1, 2025

adds Sequence Parallelism into LLaMA-Factory

Python 464 32 Updated Apr 14, 2025

大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"

Jupyter Notebook 847 64 Updated Oct 7, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 7,075 789 Updated Oct 22, 2024

总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力

Python 7,272 1,204 Updated Aug 24, 2022

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,056 144 Updated Apr 11, 2025

A block pruning framework for LLMs.

Python 22 3 Updated Jun 20, 2024

Awesome RL-based LLM Reasoning

452 21 Updated Apr 13, 2025

Fully open reproduction of DeepSeek-R1

Python 24,123 2,215 Updated Apr 23, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 7,090 781 Updated Apr 25, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,648 1,471 Updated Apr 24, 2025

System for AI Education Resource.

Python 3,969 503 Updated Oct 25, 2024

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Python 686 56 Updated Apr 22, 2025

AllenAI's post-training codebase

Python 2,925 377 Updated Apr 25, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)

Python 6,412 630 Updated Apr 25, 2025

Arena-Hard-Auto: An automatic LLM benchmark.

Python 785 98 Updated Apr 24, 2025

A simple unified framework for evaluating LLMs

HTML 209 23 Updated Apr 14, 2025

The official evaluation suite and dynamic data release for MixEval.

Python 237 41 Updated Nov 10, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 47,568 5,804 Updated Apr 24, 2025

Robust recipes to align language models with human and AI preferences

Python 5,144 442 Updated Nov 21, 2024

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 6,840 555 Updated Apr 19, 2025

Converts text to speech in realtime

Python 2,912 288 Updated Apr 21, 2025

A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

Python 2,130 130 Updated Oct 10, 2024

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Python 1,003 116 Updated Oct 7, 2024

Awesome LLM compression research papers and tools.

1,479 93 Updated Apr 22, 2025

Simple frontend for LLMs built in react-native.

TypeScript 1,272 84 Updated Apr 17, 2025

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

14,845 1,445 Updated Feb 13, 2023

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 138,483 11,541 Updated Apr 25, 2025
Next