Stars
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Large World Model -- Modeling Text and Video with Millions Context
Implementations of basic RL algorithms with minimal lines of code! (PyTorch-based)
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
Reinforcement Learning with Model-Agnostic Meta-Learning in PyTorch
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
Deep RL algorithm implementations that are easy to read and understand, in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
Open MatSci ML Toolkit is a framework for prototyping and scaling out deep learning models for materials discovery supporting widely used materials science datasets, and built on top of PyTorch Lig…
A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)
Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning
Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'