Lists (2)
Sort Name ascending (A-Z)
Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filte…
Solutions of Reinforcement Learning, An Introduction
Anthropic's educational courses
A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
📺 Discover the latest machine learning / AI courses on YouTube.
DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Repo for the Deep Reinforcement Learning Nanodegree program
Examples and guides for using the Gemini API
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Example causal datasets with consistent formatting and ground truth
Additional exercises and data for EE364a. No solutions; for public consumption.
A course in reinforcement learning in the wild
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Transform plain text into a visually stunning Knowledge Graph with GPT-4 (latest preview)! It converts text into RDF tuples, and highlights the most frequent connections with a vibrant color-coding…
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
Large World Model -- Modeling Text and Video with Millions Context
[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Open MatSci ML Toolkit is a framework for prototyping and scaling out deep learning models for materials discovery supporting widely used materials science datasets, and built on top of PyTorch Lig…
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI