Skip to content
View pau-mensa's full-sized avatar

Block or report pau-mensa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Exploring Applications of GRPO

Python 227 28 Updated May 16, 2025

Coding assistant MCP for Claude Desktop

Python 1,329 106 Updated May 7, 2025

Pytorch script hot swap: Change code without unloading your LLM from VRAM

Python 126 6 Updated Apr 21, 2025

Python tool for converting files and office documents to Markdown.

Python 57,784 2,960 Updated May 21, 2025

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 1,549 254 Updated May 22, 2025

FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.

50,935 15,650 Updated May 21, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 95,769 12,363 Updated May 22, 2025

Lightweight coding agent that runs in your terminal

TypeScript 26,756 2,694 Updated May 23, 2025

Official repo for Learning to Reason for Long-Form Story Generation

Python 58 6 Updated Apr 19, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,283 306 Updated May 13, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 458 39 Updated May 23, 2025

ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 214 7 Updated Apr 2, 2025

VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework

Python 330 14 Updated May 12, 2025

NanoGPT-speedrunning for the poor T4 enjoyers

Python 65 9 Updated Apr 22, 2025

Learning FPGA, yosys, nextpnr, and RISC-V

C++ 2,799 263 Updated Feb 25, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,950 213 Updated May 15, 2025

PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf

Python 2,773 504 Updated Oct 23, 2024

INVASE: Instance-wise Variable Selection . For more details, read the paper "INVASE: Instance-wise Variable Selection using Neural Networks," International Conference on Learning Representations (I…

Python 5 1 Updated Aug 30, 2022

A python library for self-supervised learning on images.

Python 3,386 294 Updated May 21, 2025

Build your own visual reasoning model

Jupyter Notebook 367 20 Updated May 20, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,325 165 Updated May 19, 2025

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,602 1,338 Updated May 23, 2025

This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or look…

344 28 Updated Feb 22, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,911 886 Updated May 21, 2025

Custom PTX Instruction Benchmark

Cuda 125 8 Updated Feb 27, 2025

ETL, Analytics, Versioning for Unstructured Data

Python 2,561 113 Updated May 23, 2025

🦉 Data Versioning and ML Experiments

Python 14,484 1,213 Updated May 20, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,781 295 Updated Mar 10, 2025
Python 40 7 Updated Dec 12, 2024

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,350 1,024 Updated May 23, 2025
Next