-
DFKI GmbH
- Bremen, Germany
- dfki.de/robotics
Stars
A generative world for general-purpose robotics & embodied AI learning.
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…
Famous Vision Language Models and Their Architectures
Open-source and strong foundation image recognition models.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.
Semantic Segmentation of Images and Point Clouds for Traversability Estimation
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…
IFSeg: Image-free Semantic Segmentation via Vision-Language Model (CVPR 2023)
Collection of AWESOME vision-language models for vision tasks
[IROS 2024] [ICML 2024 Workshop Differentiable Almost Everything] MonoForce: Learnable Image-conditioned Physics Engine
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Awesome Papers about Autonomous Ground Robot System in Unstructured Outdoor Environments
Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
Train transformer language models with reinforcement learning.
Official inference repo for FLUX.1 models
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
A command line toolkit to generate maps, point clouds, 3D models and DEMs from drone, balloon or kite images. 📷
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
StyleGAN2-ADA - Official PyTorch implementation
Dockerfile for Velodyne VLP-16 and VLP-32 in ROS 2
Official PyTorch implementation of StyleGAN3
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Export blender camera animations to Deforum Diffusion notebook format.
Terminator keyboard shortcuts
A program that displays videos without a graphical session