GitHub - anirudhs001/Pikachu-RL

Pikachu-RL

The initial thought dump: The Pikachu Project

The high-level premise of this project is simple (though probably a moonshot for me): build a robotic Pikachu. What exactly I mean by that, and the path to get there, has changed multiple times.
Here's what the super high level goal currently is:

    Build a quadruped that looks like the Pokémon, can receive external stimuli (audio, video), and can respond to them through motion (walk, turn, play around) and facial expressions (a screen showing its face).

The components

This repo contains the part of the brain responsible for the motor control.
The policy trained here receives a medium-level stimulus (the direction and speed to walk in) and generates the motor angles required to achieve that motion. A Raspberry Pi (raspi) probably has enough compute to run a small policy network that does this in real time.
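To make the input/output contract concrete, here is a minimal sketch of such a policy as a tiny two-layer network in plain Python. The observation size, layer width, and random weights are assumptions for illustration, not the trained network from this repo; the 8 outputs assume the modded ant keeps the 8 actuated joints of MuJoCo's stock ant.

```python
import math
import random

NUM_JOINTS = 8  # MuJoCo's stock ant has 8 actuated joints; assumed unchanged for the modded ant
OBS_SIZE = 27   # hypothetical sensor vector length, not taken from this repo


def make_layer(n_in, n_out, rng):
    """Random weights and zero biases for one dense layer (stand-in for trained parameters)."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense_tanh(layer, x):
    """Apply one dense layer followed by tanh, so every output lies in [-1, 1]."""
    weights, biases = layer
    return [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
            for row, b in zip(weights, biases)]


def policy(layers, observation, direction, speed):
    """Map sensor readings plus a (direction, speed) command to normalized joint targets."""
    # The heading is encoded as (cos, sin) so that angles near 0 and 2*pi look alike.
    x = list(observation) + [math.cos(direction), math.sin(direction), speed]
    for layer in layers:
        x = dense_tanh(layer, x)
    return x


rng = random.Random(0)
layers = [make_layer(OBS_SIZE + 3, 32, rng), make_layer(32, NUM_JOINTS, rng)]
action = policy(layers, [0.0] * OBS_SIZE, direction=0.0, speed=0.5)
```

A network of this size needs only a few thousand multiply-adds per step, which is why a raspi is a plausible target.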

The thinking part of the brain is in Pikachu-Brain. It takes in the high-level stimulus (the audio and video signals) and generates the facial expressions shown on the screen and the medium-level control sent to the Pikachu-RL policy. Since this will probably be done via beefy LLMs, this code would run on a more powerful system and communicate with the other modules via TCP sockets (see PikaClient.py and PikaServer.py in Pikachu-Brain).
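As a rough illustration of what a medium-level command on the wire could look like, the sketch below sends one newline-delimited JSON message between two connected sockets. The message fields and framing are assumptions made for this example; the actual protocol is whatever PikaClient.py and PikaServer.py implement.

```python
import json
import socket


def encode_command(direction, speed):
    """Serialize one command as a single JSON line (hypothetical wire format)."""
    return (json.dumps({"direction": direction, "speed": speed}) + "\n").encode("utf-8")


def read_command(sock_file):
    """Read one JSON line and return (direction, speed), or None if the peer closed."""
    line = sock_file.readline()
    if not line:
        return None
    msg = json.loads(line)
    return msg["direction"], msg["speed"]


# socketpair() stands in for a real TCP connection so the sketch runs on one machine.
# On the robot, the sender would use socket.create_connection((host, port)) instead.
brain_end, policy_end = socket.socketpair()
with brain_end, policy_end:
    brain_end.sendall(encode_command(direction=1.57, speed=0.3))
    with policy_end.makefile("rb") as reader:
        received = read_command(reader)
```

One message per line keeps the framing simple: the receiver never has to guess where one command ends and the next begins.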

Work on the peripheral nervous system is yet to begin. It would convert the actions received from the RL policy into signals sent to the servos, and would run on the raspi as well.
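Since this part does not exist yet, the following is only a sketch of the kind of conversion it might perform: mapping a normalized action in [-1, 1] to a servo pulse width. The 1000 to 2000 microsecond range is a common hobby-servo convention assumed here, and the function name is hypothetical; real servos would need per-joint calibration.

```python
def action_to_pulse_us(action, min_us=1000.0, max_us=2000.0):
    """Linearly map a normalized action in [-1, 1] to a pulse width in microseconds."""
    clipped = max(-1.0, min(1.0, action))  # guard against out-of-range policy outputs
    return min_us + (clipped + 1.0) / 2.0 * (max_us - min_us)


# One value per joint; the last entry shows an out-of-range action being clipped.
pulses = [action_to_pulse_us(a) for a in (-1.0, 0.0, 0.5, 3.0)]
```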

How this works

I have modded the ant env that already exists in MuJoCo: I changed the orientation of the legs, moved the leg joint to where real quadrupeds (dogs, cats, etc.) have knees, and added some more sensors.

Mujoco's original ant

The modded ant that I use

So to make this work:

Replace ant.xml in mujoco/assets (setting up a symlink works too).
Install mujoco, gymnasium, etc.
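The symlink option can be scripted. The helper below is a sketch with a hypothetical name: it keeps a backup of the stock file the first time it runs, then points ant.xml in the assets directory at the copy in this repo.

```python
import shutil
from pathlib import Path


def install_ant_xml(repo_ant_xml, assets_dir):
    """Symlink assets_dir/ant.xml to the modded ant.xml, backing up the stock file once."""
    repo_ant_xml = Path(repo_ant_xml).resolve()
    target = Path(assets_dir) / "ant.xml"
    backup = target.with_name("ant.xml.orig")
    if target.is_symlink():
        target.unlink()  # replace a link left by an earlier run
    elif target.exists():
        if backup.exists():
            target.unlink()  # a backup already exists, so do not overwrite it
        else:
            shutil.move(str(target), str(backup))
    target.symlink_to(repo_ant_xml)
    return target
```

The assets directory depends on the install. For a pip-installed gymnasium it is assumed to be `Path(gymnasium.__file__).parent / "envs" / "mujoco" / "assets"`, so check the path before running `install_ant_xml("ant.xml", assets_dir)`.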

Notion

I have tried to record some of my observations here. Check it out; there are some cool videos and graphs there.

What are these folders in this repo:

These are some of the different algorithms I have tried so far. I have tried both implementing some of them myself and using existing implementations from Stable Baselines.
Apart from these, the Envs folder contains the environment classes that define the MuJoCo env in which our agent lives. There are a couple of variations in the Envs folder, and some of the algorithm folders might have their own envs. The Env class provides the observations and the step function used to move in the MuJoCo env. The custom reward logic is defined in this class too.
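To give a feel for what custom reward logic in such an Env class can look like, here is a hypothetical velocity-tracking reward. It is not the reward used in this repo: the function name, the exponential shaping, and the control-cost weight are all assumptions for illustration.

```python
import math


def tracking_reward(velocity_xy, direction, speed, action, ctrl_cost_weight=0.05):
    """Reward matching the commanded planar velocity, minus a penalty on large actions."""
    target_vx = speed * math.cos(direction)
    target_vy = speed * math.sin(direction)
    error = math.hypot(velocity_xy[0] - target_vx, velocity_xy[1] - target_vy)
    tracking = math.exp(-error)  # 1.0 when the command is tracked exactly, decays with error
    ctrl_cost = ctrl_cost_weight * sum(a * a for a in action)
    return tracking - ctrl_cost


# Walking exactly as commanded with zero control effort gives the maximum reward.
perfect = tracking_reward((0.5, 0.0), direction=0.0, speed=0.5, action=[0.0] * 8)
# Standing still while commanded to walk scores lower.
idle = tracking_reward((0.0, 0.0), direction=0.0, speed=0.5, action=[0.0] * 8)
```

In an env's step function, a value like this would be computed from the simulator state after applying the action and returned alongside the next observation.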

Todos

Probably add a README in each folder.

Folders and files

APPO
Envs
PPO V2
PPO
REDQ_spot
SAC
SAC_spot
SB3
TD3
minitaur_models
rex_models
.gitignore
README.md
__init__.py
ant.xml
