Skip to content
View ma-zihan's full-sized avatar
  • Daejeon, Republic of Korea
  • 02:39 (UTC +09:00)

Highlights

  • Pro

Block or report ma-zihan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement

    Python MIT License Updated Feb 7, 2025
  • ScoreFlow Public

    Forked from Gen-Verse/ScoreFlow

    Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"

    Python Updated Feb 7, 2025
  • HARL Public

    Forked from PKU-MARL/HARL

    Official implementation of HARL algorithms based on PyTorch.

    Python Updated Jan 23, 2025
  • ADAS Public

    Forked from ShengranHu/ADAS

    Automated Design of Agentic Systems

    Python Apache License 2.0 Updated Jan 10, 2025
  • ZSC-Eval Public

    Forked from sjtu-marl/ZSC-Eval

    This repository is the official implementation of ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-agent Zero-shot Coordination. Pre-trained Agent Zoo: https://huggingface.co/Leoxxxxh/ZSC-Ev…

    JavaScript MIT License Updated Nov 25, 2024
  • CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

    Jupyter Notebook Apache License 2.0 Updated Nov 21, 2024
  • GPTSwarm Public

    Forked from metauto-ai/GPTSwarm

    🐝 GPTSwarm: LLM agents as (Optimizable) Graphs

    Python MIT License Updated Oct 14, 2024
  • MAPPO Public

    Forked from marlbenchmark/on-policy

    This is the official implementation of Multi-Agent PPO (MAPPO).

    Python MIT License Updated Oct 1, 2024
  • pymarl2 Public

    Forked from hijkzzz/pymarl2

    Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)

    Python Apache License 2.0 Updated Sep 8, 2024
  • d3rlpy Public

    Forked from takuseno/d3rlpy

    An offline deep reinforcement learning library

    Python MIT License Updated Aug 25, 2024
  • og-marl Public

    Forked from instadeepai/og-marl

    Datasets with baselines for offline multi-agent reinforcement learning.

    Python Apache License 2.0 Updated Jul 24, 2024
  • CFCQL Public

    Forked from thu-rllab/CFCQL

    Code for NeurIPS2023 accepted paper: Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning.

    Python Updated Jun 18, 2024
  • makemore Public

    Forked from karpathy/makemore

    An autoregressive character-level language model for making more things

    Python MIT License Updated Jun 4, 2024
  • MARLlib Public

    Forked from Replicable-MARL/MARLlib

    One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)

    Python MIT License Updated May 29, 2024
  • DyLAN Public

    Forked from SALT-NLP/DyLAN

    Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization

    Python MIT License Updated May 16, 2024
  • mazihan Public

    Config files for my GitHub profile.

    Updated Jun 29, 2021