Stars
[CVPR 2022] HiVT: Hierarchical Vector Transformer for Multi-Agent Motion Prediction
[AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Convert CAJ (China Academic Journals) files to PDF. 转换中国知网 CAJ 格式文献为 PDF。佛系转换,成功与否,皆是玄学。
[CVPR 2023] We propose a framework for the challenging 3D-aware ObjectNav based on two straightforward sub-policies. The two sub-polices, namely corner-guided exploration policy and category-aware …
GoogleTest - Google Testing and Mocking Framework
Official implementation of OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models
Leveraging Large Language Models for Visual Target Navigation
We proposed to explore and search for the target in unknown environment based on Large Language Model for multi-robot system.
Zeroshot Active VIsual Search
Reading list for research topics in embodied vision
基于Clash Core 制作的Clash For Linux备份仓库 A Clash For Linux Backup Warehouse Based on Clash Core
Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.
End-to-End Object Detection with Transformers
convert dataset to coco/voc format
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
deep learning for image processing including classification and object-detection etc.
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.