Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting…

Jupyter Notebook 15,874 2,311 Updated Jan 10, 2025

Shubhamsaboo / awesome-llm-apps

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 12,081 1,291 Updated Jan 12, 2025

Picovoice / porcupine

On-device wake word detection powered by deep learning

Python 3,853 511 Updated Jan 11, 2025

lllyasviel / Fooocus

Focus on prompting and generating

Python 42,549 6,187 Updated Aug 21, 2024

huangwl18 / ReKep

ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation

Python 613 63 Updated Aug 30, 2024

RoboFlamingo / RoboFlamingo

Code for RoboFlamingo

Python 334 28 Updated May 8, 2024

MIT-SPARK / llm_scene_understanding

HTML 82 6 Updated Jun 13, 2023

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,061 2,827 Updated Jan 13, 2025

AndrejOrsula / pymoveit2

Basic Python interface for MoveIt 2 built on top of ROS 2 actions and services

Python 164 60 Updated Nov 28, 2024

mst272 / LLM-Dojo

欢迎来到 LLM-Dojo，这里是一个开源大模型学习场所，使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩‍🎓👨‍🎓

Python 442 39 Updated Jan 3, 2025

Meituan-AutoML / MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Python 1,099 71 Updated Apr 15, 2024

YanjieZe / 3D-Diffusion-Policy

[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations

Python 629 61 Updated Nov 27, 2024

jgornet / predictive-coding-recovers-maps

73 4 Updated Jun 6, 2024

uml-robotics / fetchit2019

Code ran for the fetchit 2019 competition

C++ 4 2 Updated Mar 15, 2021

graspnet / graspnet-baseline

Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020)

Python 551 162 Updated Jun 13, 2024

langchain-ai / langchain

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 98,085 15,929 Updated Jan 11, 2025

longzw1997 / Open-GroundingDino

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python 500 79 Updated Jun 25, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,167 723 Updated Aug 12, 2024

mbzuai-oryx / Video-LLaVA

PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

Python 248 12 Updated Jan 2, 2024

octo-models / octo

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,004 182 Updated Jul 31, 2024

thuml / iVideoGPT

Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223

Python 96 4 Updated Jan 4, 2025

MrNeRF / awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

HTML 6,597 394 Updated Jan 12, 2025

Jumpat / SegAnyGAussians

The official implementation of SAGA (Segment Any 3D GAussians)

Jupyter Notebook 643 45 Updated Jun 21, 2024

hkchengrex / XMem

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Python 1,798 195 Updated Nov 15, 2024

OpenGVLab / VisionLLM

VisionLLM Series

Python 976 32 Updated Jan 4, 2025

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,567 1,433 Updated Sep 5, 2024

mees / calvin

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Python 450 64 Updated Jan 8, 2025

ar-mine

Highlights

Starred repositories

Ubuntu

Tensorflow

Raspberry Pi

Python

Minecraft

Deep learning

C

C++