Skip to content

sizhe-li/neural-jacobian-field

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Neural Jacobian Field

This is the code for Unifying 3D Representation and Control of Diverse Robots with a Single Camera by Sizhe Lester Li, Annan Zhang, Boyuan Chen, Hanna Matusik, Chao Liu, Daniela Rus, and Vincent Sitzmann

Check out the project website here.

plot

News

  • [12/27/2024] We have released a clean implementation of the pipeline! Please check out branch "clean". The implementation is completely modularized and type-annotated. This will be reflected in the main branch soon. We will make the demo notebooks compatible with the clean implementations and release corresponding pre-trained checkpoints!
  • [12/27/2024] We have exciting new results on the emerging properties of neural jacobian fields! Please stay tuned!

Installation

conda create --name neural-jacobian-field python=3.9.18
conda activate neural-jacobian-field

cd project
bash install.sh
python3 -m pip install -e .

Acquiring Datasets

Neural Jacobian Fields was trained with our own multi-view robot dataset. Our dataset includes a pneumatic robot hand (mounted on a robot arm), the Allegro robot hand, the Handed Shearing Auxetics platform, and the Poppy robot arm.

We plan to release all our robot data to encourage future research endeavors.

(09/20/2024) We are actively working on uploading our dataset to the web. Updates on this coming very soon! Feel free to contact [email protected] if you need the dataset urgently.

Acquiring Pre-trained Checkpoints

You can find pre-trained checkpoints here. Directory model_checkpoints needs to be placed in the root directory for inference examples (.ipynb files) to run correctly.

Running the Code

Training

The main entry point is project/neural_jacobian_field/train.py. Call it via:

python3 -m neural_jacobian_field.train \
training.data.batch_size=6 training.data.num_workers=16 dataset=toy_arm \
model=toy_arm model.train_flow=false model.train_encoder=true

To reduce memory usage, you can change the batch size as follows:

python3 -m neural_jacobian_field.train training.data.batch_size=1 "...all other flags"

Our code supports multi-GPU training. The above batch size is the per-GPU batch size.

Evaluation

  • [Visualize Jacobian Fields] notebooks/1_visualize_jacobian_fields.ipynb
  • [Inverse Dynamics Optimization] notebooks/2_inverse_dynamics_optimization.ipynb
  • [Robot Experiment] Coming soon!

Camera Conventions

Our extrinsics are OpenCV-style camera-to-world matrices. This means that +Z is the camera look vector, +X is the camera right vector, and -Y is the camera up vector. Our intrinsics are normalized, meaning that the first row is divided by image width, and the second row is divided by image height.

BibTeX

@misc{li2024unifying3drepresentationcontrol,
      title={Unifying 3D Representation and Control of Diverse Robots with a Single Camera}, 
      author={Sizhe Lester Li and Annan Zhang and Boyuan Chen and Hanna Matusik and Chao Liu and Daniela Rus and Vincent Sitzmann},
      year={2024},
      eprint={2407.08722},
      archivePrefix={arXiv},
      primaryClass={cs.RO},
      url={https://arxiv.org/abs/2407.08722}, 
}

Acknowledgements

The authors thank Hyung Ju Terry Suh for his writing suggestions (system dynamics) and Tao Chen and Pulkit Agrawal for their hardware support on the Allegro hand. V.S. acknowledges support from the Solomon Buchsbaum Research Fund through MIT’s Research Suppport Committee. S.L.L. was supported through an MIT Presidential Fellowship. A.Z., H.M., C.L., and D.R. acknowledge support from the National Science Foundation EFRI grant 1830901 and the Gwangju Institute of Science and Technology.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published