Skip to content
View langxinspieder's full-sized avatar

Block or report langxinspieder

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official implementation of Think Global, Act Local: Dual-scale GraphTransformer for Vision-and-Language Navigation (CVPR'22 Oral).

Python 138 10 Updated Jun 27, 2023

An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching

Python 67 2 Updated Oct 17, 2024

Cornell Touchdown natural language navigation and spatial reasoning dataset.

Python 99 13 Updated Sep 5, 2020

Dynamic Robot Instruction Following

Python 34 7 Updated Dec 28, 2021

Code for RSS2018 paper on the Grounded Semantic Mapping Network

3 Updated Nov 24, 2018

[RSS 2024] NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

Python 49 3 Updated Jan 3, 2025

AI Research Platform for Reinforcement Learning from Real Panoramic Images.

C++ 534 132 Updated Jul 12, 2024

The repository provides code associated with the paper VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation (ICRA 2024)

Python 299 33 Updated Jan 7, 2025

[TPAMI 2024] Official repo of "ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments"

Python 246 17 Updated Jul 23, 2024

Code for the habitat challenge

Python 315 57 Updated Apr 24, 2023

EventGPT: Event Stream Understanding with Multimodal Large Language Models

10 Updated Dec 5, 2024

Vision-and-Language Navigation in Continuous Environments using Habitat

Python 336 56 Updated Jan 7, 2025
Python 2 Updated Oct 11, 2024

Ideas and thoughts about the fascinating Vision-and-Language Navigation

176 13 Updated Jun 28, 2023

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 1,765 227 Updated Dec 11, 2024
C++ 1 Updated Oct 2, 2024
Python 11 Updated Jun 7, 2024

Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.

Python 13,024 2,911 Updated Jan 16, 2025

Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.

Go 109,460 8,745 Updated Jan 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,649 4,750 Updated Jan 21, 2025

Model Compression for Big Models

Python 155 21 Updated Jun 30, 2023

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 137,929 27,651 Updated Jan 23, 2025

The official Meta Llama 3 GitHub site

Python 28,026 3,215 Updated Aug 12, 2024

[CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI

Python 529 39 Updated Jan 11, 2025

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Python 38,165 5,563 Updated Jan 23, 2025
Python 2,724 306 Updated Jan 22, 2025

A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or laws in the future

276 19 Updated Aug 13, 2023

a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model

Jupyter Notebook 132 17 Updated Jun 25, 2024

MobiLlama : Small Language Model tailored for edge devices

Python 616 49 Updated Mar 3, 2024
Next