Skip to content

Navigation Menu

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

kouroshHakha Follow

Overview Repositories 53 Projects 0 Packages 0 Stars 31

More

Overview
Repositories
Projects
Packages
Stars

kouroshHakha

Follow

kourosh hakhamaneshi kouroshHakha

Follow

35 followers · 8 following

Anyscale Inc.
San Fransisco
@CyrusHakha

Achievements

Achievements

Block or report kouroshHakha

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Add an optional note:

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Overview Repositories 53 Projects 0 Packages 0 Stars 31

More

Overview
Repositories
Projects
Packages
Stars

Type All

Select type

All Sources Forks Archived Can be sponsored Mirrors Templates

Language All

Select language

All Python SCSS Shell Jupyter Notebook TeX

Sort Last updated

Select order

Last updated Name Stars

ray Public
Forked from ray-project/ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyp…

Python Apache License 2.0 Updated Feb 22, 2025
SkyThought Public
Forked from NovaSky-AI/SkyThought

Sky-T1: Train your own O1 preview model within $450

Python Apache License 2.0 Updated Feb 13, 2025
verl Public
Forked from volcengine/verl

veRL: Volcano Engine Reinforcement Learning for LLM

Python Apache License 2.0 Updated Feb 10, 2025
benchmark_inter_gpu_comm Public

Python Updated Feb 10, 2025
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python Apache License 2.0 Updated Feb 4, 2025
simpleRL-reason Public
Forked from hkust-nlp/simpleRL-reason

This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data

Python MIT License Updated Jan 26, 2025
TinyZero Public
Forked from Jiayi-Pan/TinyZero

Clean, accessible reproduction of DeepSeek R1-Zero

Python Apache License 2.0 Updated Jan 26, 2025
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Python Apache License 2.0 Updated Dec 31, 2024
langchain Public
Forked from langchain-ai/langchain

⚡ Building applications with LLMs through composability ⚡

Python MIT License Updated Jun 22, 2023
jumbo Public

Python 2 2 Other Updated May 5, 2023
kouroshHakha.github.io Public

SCSS Updated Apr 12, 2023
circuit-fewshot-code Public

Python 12 3 BSD 3-Clause "New" or "Revised" License Updated Jan 7, 2023
fist Public

Python 10 4 BSD 3-Clause "New" or "Revised" License Updated Aug 2, 2022
d3rlpy Public
Forked from takuseno/d3rlpy

An offline deep reinforcement learning library

Python MIT License Updated May 20, 2022
bagnet_ngspice Public

Shell 16 7 BSD 3-Clause "New" or "Revised" License Updated Mar 29, 2022
bag_deep_ckt Public

genetic and neural net optimization for circuit design

Python 18 9 Apache License 2.0 Updated Mar 29, 2022
efficient_ga Public

Python 1 Updated Mar 29, 2022
blackbox_eval_engine Public

Python Updated Mar 21, 2022
bb_envs Public

Python 1 1 Updated Mar 17, 2022
circuit-fewshot-data Public

Python 8 4 BSD 3-Clause "New" or "Revised" License Updated Mar 11, 2022
pytorch_sac Public
Forked from denisyarats/pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

Jupyter Notebook 1 MIT License Updated Mar 9, 2022
neural-processes Public
Forked from EmilienDupont/neural-processes

Pytorch implementation of Neural Processes for functions and images 🎆

Jupyter Notebook MIT License Updated Mar 9, 2022
MLutils Public

Python 1 Updated Jan 26, 2022
rl-experiments Public
Forked from ray-project/rl-experiments

Keeping track of RL experiments

Apache License 2.0 Updated Sep 7, 2021
d4rl Public
Forked from kpertsch/d4rl

A benchmark for offline reinforcement learning.

Python Apache License 2.0 Updated Apr 14, 2021
spirl Public
Forked from clvrai/spirl

Official implementation of "Accelerating Reinforcement Learning with Learned Skill Priors", Pertsch et al., CoRL 2020

Python 1 Updated Apr 1, 2021
editsql Public
Forked from ryanzhumich/editsql

Python MIT License Updated Feb 15, 2021
RoBO Public
Forked from automl/RoBO

RoBO: a Robust Bayesian Optimization framework

Python BSD 3-Clause "New" or "Revised" License Updated Dec 30, 2020
nasbench Public
Forked from google-research/nasbench

NASBench: A Neural Architecture Search Dataset and Benchmark

Python Apache License 2.0 Updated Nov 9, 2020
hiro Public

Python Updated Nov 4, 2020

Previous Next

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.