pau-mensa

7 followers · 7 following

@hugemensa

Stars

brendanhogan / DeepSeekRL-Extended

Exploring Applications of GRPO

Python 227 28 Updated May 16, 2025

ezyang / codemcp

Coding assistant MCP for Claude Desktop

Python 1,329 106 Updated May 7, 2025

valine / training-hot-swap

Pytorch script hot swap: Change code without unloading your LLM from VRAM

Python 126 6 Updated Apr 21, 2025

microsoft / markitdown

Python tool for converting files and office documents to Markdown.

Python 57,784 2,960 Updated May 21, 2025

huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 1,549 254 Updated May 22, 2025

x1xhlol / system-prompts-and-models-of-ai-tools

FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.

50,935 15,650 Updated May 21, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 95,769 12,363 Updated May 22, 2025

openai / codex

Lightweight coding agent that runs in your terminal

TypeScript 26,756 2,694 Updated May 23, 2025

Alex-Gurung / ReasoningNCP

Official repo for Learning to Reason for Long-Form Story Generation

Python 58 6 Updated Apr 19, 2025

agentica-project / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,283 306 Updated May 13, 2025

McGill-NLP / nano-aha-moment

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 458 39 Updated May 23, 2025

ByteDance-Seed / ByteCheckpoint

ByteCheckpoint: An Unified Checkpointing Library for LFMs

Python 214 7 Updated Apr 2, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework

Python 330 14 Updated May 12, 2025

VatsaDev / NanoPoor

NanoGPT-speedrunning for the poor T4 enjoyers

Python 65 9 Updated Apr 22, 2025

BrunoLevy / learn-fpga

Learning FPGA, yosys, nextpnr, and RISC-V

C++ 2,799 263 Updated Feb 25, 2025

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,950 213 Updated May 15, 2025

dreamquark-ai / tabnet

PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf

Python 2,773 504 Updated Oct 23, 2024

vanderschaarlab / INVASE

INVASE: Instance-wise Variable Selection. For more details, read the paper "INVASE: Instance-wise Variable Selection using Neural Networks," International Conference on Learning Representations (I…

Python 5 1 Updated Aug 30, 2022

lightly-ai / lightly

A python library for self-supervised learning on images.

Python 3,386 294 Updated May 21, 2025

groundlight / r1_vlm

Build your own visual reasoning model

Jupyter Notebook 367 20 Updated May 20, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 2,325 165 Updated May 19, 2025

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 12,602 1,338 Updated May 23, 2025

rkinas / cuda-learning

This repository is a curated collection of resources, tutorials, and practical examples designed to guide you through the journey of mastering CUDA programming. Whether you're just starting or look…

344 28 Updated Feb 22, 2025

deepseek-ai / 3FS

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,911 886 Updated May 21, 2025

LaurieWired / BenchmarkCustomPTX

Custom PTX Instruction Benchmark

Cuda 125 8 Updated Feb 27, 2025

iterative / datachain

ETL, Analytics, Versioning for Unstructured Data

Python 2,561 113 Updated May 23, 2025

iterative / dvc

🦉 Data Versioning and ML Experiments

Python 14,484 1,213 Updated May 20, 2025

deepseek-ai / DualPipe

A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

Python 2,781 295 Updated Mar 10, 2025

MCEVAL / McEval

Python 40 7 Updated Dec 12, 2024

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 8,350 1,024 Updated May 23, 2025
