Stars
A curated list of 3D vision papers relating to the robotics domain in the era of large models (i.e., LLMs/VLMs), inspired by awesome-computer-vision, including papers, code, and related websites
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
OnnxTR: an ONNX pipeline wrapper for the docTR (Document Text Recognition) library, for seamless, high-performing & accessible OCR
Learning with 3D rotations, a hitchhiker’s guide to SO(3) - ICML 2024
FastAPI Best Practices and Conventions we used at our startup
The Incredible PyTorch: a curated list of tutorials, papers, projects, communities and more relating to PyTorch.
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Best Practices, code samples, and documentation for Computer Vision.
An Invitation to 3D Vision: A Tutorial for Everyone
A command line toolkit to generate maps, point clouds, 3D models and DEMs from drone, balloon or kite images. 📷
A resource repository for 3D machine learning
A collection of learning resources for curious software engineers
Unified storage framework for the entire machine learning lifecycle
Mobile manipulation research tools for roboticists
Official code and checkpoint release for "GNM: A General Navigation Model to Drive Any Robot".
A guide for technical professionals looking to start consulting
Conversion between different conventions of camera matrices and transform matrices.
Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation. CVPR 2022
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
📚 A list of open-source projects, blogs, and papers on vision-based SLAM / Visual Odometry