```mermaid
classDiagram
    class MultiAgentEnv {
        +teams
        +agents
        +pairings
    }
    class Game {
        +game_player_1
        +game_player_2
    }
    class Team {
        +team_players
    }
    class Player {
        +model
        +replay_buffer
    }
    class DQN_dynamic
    class DoubleDQN_dynamic {
        +dqn
        +target_dqn
    }
    class ActorCritic {
        +actor
        +critic
    }
    class PPO {
        +policy
        +policy_old
        +buffer
    }
    class RolloutBuffer
    nn_Module <|-- DQN_dynamic
    nn_Module <|-- ActorCritic
    MultiAgentEnv --> "*" Team : contains
    MultiAgentEnv --> "*" Player : contains
    MultiAgentEnv --> "*" Game : creates
    Team --> "*" Player : contains
    Game --> "2" Player : references
    Player --> "0..1" DoubleDQN_dynamic : may have
    Player --> "0..1" PPO : may have
    DoubleDQN_dynamic --> "2" DQN_dynamic : contains
    PPO --> ActorCritic : contains
    PPO --> RolloutBuffer : contains
```
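The class diagram above maps onto a small set of plain Python classes. Below is a minimal sketch of that ownership structure using only the attribute names from the diagram; everything else (the dataclass style, the `agents` property body) is an illustrative assumption, not the actual implementation:

```python
from dataclasses import dataclass, field

@dataclass
class Player:
    model: object = None  # a DoubleDQN_dynamic or PPO instance, or None for NPCs
    replay_buffer: dict = field(default_factory=dict)  # per-opponent transition buffers

@dataclass
class Team:
    team_players: list = field(default_factory=list)  # list of Player

@dataclass
class Game:
    game_player_1: Player
    game_player_2: Player

@dataclass
class MultiAgentEnv:
    teams: list = field(default_factory=list)     # list of Team
    pairings: list = field(default_factory=list)  # (Player, Player) tuples per round

    @property
    def agents(self) -> list:
        # All players, flattened across teams.
        return [p for t in self.teams for p in t.team_players]
```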
```mermaid
graph TD
    A[Start Simulation] --> B[Initialize MultiAgentEnv]
    B --> C[Create Teams and Players]
    C --> D[Start Epoch Loop]
    D --> E[Reset Environment]
    E --> F[Start Round Loop]
    F --> G[Create Player Pairings]
    G --> H[Play Games]
    H --> I[Update Player States and Rewards]
    I --> J[Check for Revolutions]
    J --> K[Update Team Scores]
    K --> L{More Rounds?}
    L -->|Yes| F
    L -->|No| M[Update Agent Models]
    M --> N[Log Epoch Data]
    N --> O{More Epochs?}
    O -->|Yes| D
    O -->|No| P[End Simulation]
    P --> Q[Generate Visualizations]
    Q --> R[Analyze Results]
    R --> S[End]
    subgraph Epoch Process
        D
        E
        F
        G
        H
        I
        J
        K
        L
        M
        N
    end
    subgraph Post-Simulation
        Q
        R
    end
```
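The flowchart corresponds to a nested epoch/round loop. The sketch below mirrors the boxes above; all method names on `env` and `agent` are illustrative stand-ins, not the actual API:

```python
def run_simulation(env, n_epochs: int, n_rounds: int):
    for epoch in range(n_epochs):
        env.reset()                        # Reset Environment
        for _ in range(n_rounds):
            env.create_pairings()          # Create Player Pairings
            for game in env.pairings:
                env.play_game(game)        # Play Games; update states and rewards
            env.check_revolutions()        # Check for Revolutions
            env.update_team_scores()       # Update Team Scores
        for agent in env.agents:
            agent.update_model()           # Update Agent Models
        env.log_epoch(epoch)               # Log Epoch Data
    env.generate_visualizations()          # Post-simulation
    env.analyze_results()
```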
## Changelog
- Episodic Memory Empowers Agents: Agents now retain memories of past interactions, specific to each opponent, enabling them to develop sophisticated, adaptive strategies.
- Replay Buffer Improvement: Agents now maintain a separate replay buffer of past interactions for each opponent, letting an agent learn the state transitions of a specific opponent and making opponent behavior prediction more explicit (see the per-opponent buffer sketch after this list).
- Richer State Information: Agents make more informed decisions by considering additional factors such as relative team performance and opponent identities.
- NPC Opponents: Test your agents against a variety of non-player characters (NPCs) with predefined behavioral patterns.
- Battle of the Sexes Integration: The reward function now internally supports both the Battle of the Sexes and the Prisoner's Dilemma (payoff-table sketch after this list).
- Reactive Training: Agents now adapt their strategies dynamically in response to opponents' predefined actions. We have verified that agents using PPO with episodic memory are able to learn such reactive strategies.
- Episodic Memory: Agents can now associate each opponent with that opponent's previous action within an episode, which significantly improves agent performance (memory sketch after this list).
- Infinite Horizon: To better mimic the baseline game, a random episode length has replaced the fixed length of the previous version (continuation-probability sketch after this list).
- Improved Visualization: Gain deeper insights into agent behavior over time with refined visualization tools.
- Enhanced Codebase: Formatted the entire codebase with black.
- Focus on Baseline Paradigms: Transitioned the core game logic to the iterated Prisoner's Dilemma and the Battle of the Sexes to establish a clear baseline for future exploration.
- Added PPO Model: Integrated the Proximal Policy Optimization algorithm for greater model flexibility (clipped-loss sketch after this list).
- Deprecated Revolution Logic: Removed the original Revolution game and related functionality.
- MongoDB Backend: Introduced MongoDB for logging, enabling efficient data storage and analysis (logging sketch after this list).
- Initial Implementation: Implemented the foundation of the Revolution game using OOP and a Double DQN model (target-computation sketch after this list).
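As referenced from the Replay Buffer Improvement entry, here is a minimal sketch of a per-opponent replay buffer. The class and method names are illustrative, and the `(state, action, reward, next_state)` tuple layout is an assumption:

```python
import random
from collections import defaultdict, deque

class PerOpponentReplayBuffer:
    """One bounded FIFO buffer of transitions per opponent id."""

    def __init__(self, capacity: int = 10_000):
        self.buffers = defaultdict(lambda: deque(maxlen=capacity))

    def push(self, opponent_id, state, action, reward, next_state):
        # Transitions are keyed by the opponent they were observed against,
        # so the agent can model each opponent's dynamics separately.
        self.buffers[opponent_id].append((state, action, reward, next_state))

    def sample(self, opponent_id, batch_size: int):
        buf = self.buffers[opponent_id]
        return random.sample(list(buf), min(batch_size, len(buf)))
```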
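For the Battle of the Sexes integration, the combined reward function amounts to two payoff tables. The sketch below uses the canonical textbook payoffs; the values actually used in the simulation may differ:

```python
# Payoffs as (player 1, player 2), indexed by (action 1, action 2).
# Prisoner's Dilemma: 0 = cooperate, 1 = defect.
PRISONERS_DILEMMA = {
    (0, 0): (3, 3),  # mutual cooperation
    (0, 1): (0, 5),  # player 1 is exploited
    (1, 0): (5, 0),  # player 1 exploits
    (1, 1): (1, 1),  # mutual defection
}

# Battle of the Sexes: 0 and 1 are the two events; coordinating beats
# miscoordinating, but each player prefers a different joint outcome.
BATTLE_OF_THE_SEXES = {
    (0, 0): (2, 1),
    (0, 1): (0, 0),
    (1, 0): (0, 0),
    (1, 1): (1, 2),
}

def reward(payoffs: dict, a1: int, a2: int) -> tuple:
    return payoffs[(a1, a2)]
```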
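The episodic memory entry can be realized as a per-episode map from opponent id to that opponent's last observed action, appended to the agent's state. A minimal sketch with illustrative names:

```python
class EpisodicMemory:
    """Tracks each opponent's most recent action within the current episode."""

    def __init__(self):
        self.last_action = {}

    def observe(self, opponent_id, action):
        self.last_action[opponent_id] = action

    def augment_state(self, state: list, opponent_id) -> list:
        # -1 encodes "no prior interaction with this opponent this episode".
        return state + [self.last_action.get(opponent_id, -1)]

    def reset(self):
        # Called at episode boundaries so memories do not leak across episodes.
        self.last_action.clear()
```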
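A standard way to implement the Infinite Horizon change is to end each episode with a fixed continuation probability, which yields geometrically distributed lengths and removes the exploitable known final round. A sketch (the probability is an example value, not the one used in the codebase):

```python
import random

CONTINUE_PROB = 0.95  # example; expected length is 1 / (1 - p) = 20 rounds

def play_episode(env) -> int:
    env.reset()
    rounds = 0
    while True:
        env.step_round()  # illustrative: play one round for all pairings
        rounds += 1
        # Stop with probability (1 - CONTINUE_PROB) after every round, so
        # agents cannot backward-induct from a known horizon.
        if random.random() > CONTINUE_PROB:
            break
    return rounds
```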
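For the PPO entry: the `policy`/`policy_old` split in the class diagram exists to compute the probability ratio in PPO's clipped surrogate objective. A minimal PyTorch sketch of that loss (the clip coefficient is the common default, assumed here):

```python
import torch

def ppo_clip_loss(new_logprobs, old_logprobs, advantages, clip_eps: float = 0.2):
    # Probability ratio pi_new(a|s) / pi_old(a|s), computed in log space.
    ratio = torch.exp(new_logprobs - old_logprobs)
    # Taking the minimum of the unclipped and clipped surrogates makes the
    # update pessimistic, discouraging destructively large policy steps.
    surr1 = ratio * advantages
    surr2 = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps) * advantages
    return -torch.min(surr1, surr2).mean()
```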
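For the MongoDB backend, epoch logging reduces to one document insert per epoch. A minimal `pymongo` sketch; the connection string, database, collection, and field names are illustrative assumptions:

```python
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017/")
epoch_logs = client["revolution_sim"]["epoch_logs"]  # illustrative names

def log_epoch(epoch: int, team_scores: dict, agent_rewards: dict):
    # One document per epoch keeps queries for plots and analysis simple.
    epoch_logs.insert_one({
        "epoch": epoch,
        "team_scores": team_scores,
        "agent_rewards": agent_rewards,
    })
```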
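Finally, the `dqn`/`target_dqn` pair inside `DoubleDQN_dynamic` corresponds to the standard Double DQN target: the online network selects the next action and the target network evaluates it. A minimal PyTorch sketch (the discount factor is an assumed default):

```python
import torch

@torch.no_grad()
def double_dqn_targets(dqn, target_dqn, rewards, next_states, dones, gamma=0.99):
    # Online network picks the greedy next action...
    next_actions = dqn(next_states).argmax(dim=1, keepdim=True)
    # ...while the target network evaluates it, reducing overestimation bias.
    next_q = target_dqn(next_states).gather(1, next_actions).squeeze(1)
    return rewards + gamma * (1.0 - dones) * next_q
```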