Skip to content

jonathanrandall/mobile_aloha_world

Repository files navigation

aloha_world

Equipment used for this project:

Xarm robot arm and synchronization controller https://www.hiwonder.com/products/xarm-1s?variant=32436121894999&_pos=1&_sid=57c2ee4b5&_ss=r

The xarm library for python is here

webcam I used a logitech C922, but any decent webcam should work.

I will now describe the directories:

Sync

The directory contains the Arduino sketch for the synchronisation controller. This is the master that I used to control the robot. They synchronisation controller has an Arduion Nano that is used to read the joint angles and output them to the serial monitor.

I have updated the Arduino sketch from the hiwonder documentation, so that it doesn't output when there is an error and it prints to the serial monitor every 50ms. I have used the Arduino IDE to upload to the Nano. Note, to get this to work with Linux, you need to remove the brail terminal via:

sudo apt remov brltty

contol_ws/src/xarm_pubsub

This contains the ros2 package that I use to record the robot actions for imitation learning. Need to have ros2 installed. I'm using Iron Irwini. You can build with colcon by typing from the contol_ws directory:

colcon build --symlink-install

contol_ws/src/xarm_pubsub/xarm_pubsub/

This directory contains the main code for the ros2 package, which is just written with publishers and subsribers. files are as follows:

master_publisher.py This publishers the data from the synchronization controller to the ros2 topic /joint_states. It first launches a thread to read the data from the serial input. This is because the data on the serial input will buffer and we need to get the latest reading. So, it puts the data into a queue and when we want to read it for publishing, we throw away everything in the queue and just take the most recent output.

puppet_subscriber.py This subscribes to the /joint_states topic and sends the data to the robot.

camera_publisher.py This publishers data from the webcam onto the /Image topic. Again, this runs a get camera on thread and then when it wants to publish the image, it throws away everything in the queue except the most recent image. This is because the images can buffer on the input, which leads to a lag. This file also includes a find_webcam_index(device_name) function for getting the index of the webcam, which can change from run to run.

camera_subscriber.py This subscribes to the /Image topic.

record_gui.py This has the gui for starting and stopping the recordings. This uses the python_qt_binding library (which I think is equivalent to PyQt5?). Note, I needed to put in the following line, otherwise there was a clash between opencv and the python_qt_binding libarary.

#need this or else opencv doesn't work with PyQt5
import os
os.environ['QT_QPA_PLATFORM_PLUGIN_PATH'] = '/usr/lib/x86_64-linux-gnu/qt5/plugins'

To get the QT_QPA_PLATFORM_PLUGIN_PATH, I asked chatgpt to write me some code. I used the PyQt5 library to display the image (because I couldn't get it to work with the cv2.imshow() function to work with the gui.

contol_ws/src/xarm_pubsub/launch/

rsp.launch.py This is the launch file for the tele operation imititation learning.

tony_zhao_forked/act

This is the code forked from Tony Zhao's ACT. I am using this for training and inference. I've made a few changes. I am using one arm with 6 joint angles. Tony Zhao was using two arms with 7 variables each. Parts of the input and the output were 14 dimensional, and this was hard coded in the models. I ahd to change this to 6 to match my model. I've also added a:

  • --continue training argument, which loads the last weights and continues the training, so we don't have to start from the beginning if we want more iterations. This is false if not present and true if present.
  • --historical_length argument, which takes an integer for the number of previous robot states to use in the input. The algorithm only takes the current robot state. It didn't really work as well as I wanted it to. I had to add some of the directories to my python path to get the imports to work.

About

mobile version of aloha world

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published