- Anyscale Inc.
- San Francisco
- @CyrusHakha
- ray Public
  Forked from ray-project/ray
  An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyp…
  Python Apache License 2.0 Updated Feb 22, 2025
- SkyThought Public
  Forked from NovaSky-AI/SkyThought
  Sky-T1: Train your own O1 preview model within $450
  Python Apache License 2.0 Updated Feb 13, 2025
- verl Public
  Forked from volcengine/verl
  veRL: Volcano Engine Reinforcement Learning for LLM
  Python Apache License 2.0 Updated Feb 10, 2025
- OpenRLHF Public
  Forked from OpenRLHF/OpenRLHF
  An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
  Python Apache License 2.0 Updated Feb 4, 2025
- simpleRL-reason Public
  Forked from hkust-nlp/simpleRL-reason
  This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
  Python MIT License Updated Jan 26, 2025
- TinyZero Public
  Forked from Jiayi-Pan/TinyZero
  Clean, accessible reproduction of DeepSeek R1-Zero
  Python Apache License 2.0 Updated Jan 26, 2025
- LLaMA-Factory Public
  Forked from hiyouga/LLaMA-Factory
  Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
  Python Apache License 2.0 Updated Dec 31, 2024
- langchain Public
  Forked from langchain-ai/langchain
  ⚡ Building applications with LLMs through composability ⚡
  Python MIT License Updated Jun 22, 2023
- d3rlpy Public
  Forked from takuseno/d3rlpy
  An offline deep reinforcement learning library
  Python MIT License Updated May 20, 2022
- bag_deep_ckt Public
  genetic and neural net optimization for circuit design
- pytorch_sac Public
  Forked from denisyarats/pytorch_sac
  PyTorch implementation of Soft Actor-Critic (SAC)
- neural-processes Public
  Forked from EmilienDupont/neural-processes
  Pytorch implementation of Neural Processes for functions and images 🎆
  Jupyter Notebook MIT License Updated Mar 9, 2022
- rl-experiments Public
  Forked from ray-project/rl-experiments
  Keeping track of RL experiments
  Apache License 2.0 Updated Sep 7, 2021
- d4rl Public
  Forked from kpertsch/d4rl
  A benchmark for offline reinforcement learning.
  Python Apache License 2.0 Updated Apr 14, 2021
- spirl Public
  Forked from clvrai/spirl
  Official implementation of "Accelerating Reinforcement Learning with Learned Skill Priors", Pertsch et al., CoRL 2020
- RoBO Public
  Forked from automl/RoBO
  RoBO: a Robust Bayesian Optimization framework
  Python BSD 3-Clause "New" or "Revised" License Updated Dec 30, 2020
- nasbench Public
  Forked from google-research/nasbench
  NASBench: A Neural Architecture Search Dataset and Benchmark
  Python Apache License 2.0 Updated Nov 9, 2020