Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters,extended Kalman filters, unscented Kalman filters, particle filte…

Jupyter Notebook 17,407 4,280 Updated Aug 7, 2024

LyWangPX / Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

Solutions of Reinforcement Learning, An Introduction

Jupyter Notebook 2,154 479 Updated May 20, 2024

anthropics / courses

Anthropic's educational courses

Jupyter Notebook 9,537 819 Updated Nov 26, 2024

openlifescience-ai / Open-Medical-Reasoning-Tasks

A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)

Python 112 10 Updated Sep 16, 2024

ridgerchu / matmulfreellm

Implementation for MatMul-free LM.

Python 2,966 188 Updated Nov 5, 2024

aishwaryanr / awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

11,212 2,349 Updated Mar 4, 2025

noahshinn / reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,603 253 Updated Jan 14, 2025

dair-ai / ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

16,347 1,963 Updated Jan 22, 2024

RITCHIEHuang / DeepRL_Algorithms

DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Python 329 39 Updated Mar 25, 2023

udacity / deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

Jupyter Notebook 5,000 2,360 Updated Nov 16, 2023

google-gemini / cookbook

Examples and guides for using the Gemini API

Jupyter Notebook 10,855 1,326 Updated Mar 6, 2025

seungeunrho / minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Python 2,967 461 Updated Apr 22, 2023

allenai / FineGrainedRLHF

Python 269 22 Updated Jan 6, 2025

cmu-phil / example-causal-datasets

Example causal datasets with consistent formatting and ground truth

79 11 Updated Sep 20, 2023

cvxgrp / cvxbook_additional_exercises

Additional exercises and data for EE364a. No solutions; for public consumption.

Julia 660 177 Updated Feb 13, 2025

yandexdataschool / Practical_RL

A course in reinforcement learning in the wild

Jupyter Notebook 6,045 1,714 Updated Mar 2, 2025

higgsfield-ai / higgsfield

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Jupyter Notebook 3,320 557 Updated May 25, 2024

google-deepmind / graphcast

Python 5,899 723 Updated Jan 31, 2025

iAmmarTahir / KnowledgeGraphGPT

Transform plain text into a visually stunning Knowledge Graph with GPT-4 (latest preview)! It converts text into RDF tuples, and highlights the most frequent connections with a vibrant color-coding…

JavaScript 162 24 Updated Jan 10, 2024

spcl / graph-of-thoughts

Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"

Python 2,298 167 Updated Dec 11, 2024

RManLuo / Awesome-LLM-KG

Awesome papers about unifying LLMs and KGs

2,242 159 Updated Feb 6, 2025

LargeWorldModel / LWM

Large World Model -- Modeling Text and Video with Millions Context

Python 7,247 557 Updated Oct 19, 2024

efeslab / fiddler

[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration

Python 196 18 Updated Nov 18, 2024

sail-sg / lorahub

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 616 37 Updated Jul 22, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 17,648 1,770 Updated Mar 6, 2025

IntelLabs / matsciml

Open MatSci ML Toolkit is a framework for prototyping and scaling out deep learning models for materials discovery supporting widely used materials science datasets, and built on top of PyTorch Lig…

Python 169 26 Updated Mar 4, 2025

lucidrains / self-rewarding-lm-pytorch

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Python 1,367 71 Updated Apr 11, 2024

bstadie / Stat-320

Stat 320 materials

5 Updated Mar 3, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Young Ko YoungKo

Highlights

Block or report YoungKo

Lists (2)

LLM

RL

Stars

rasbt / LLMs-from-scratch

stanford-cs149 / asst1

rlabbe / Kalman-and-Bayesian-Filters-in-Python