Skip to content
View AaddX-ai's full-sized avatar

Block or report AaddX-ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Demonstration of running a native LLM on Android device.

Python 127 19 Updated Apr 1, 2025
Python 11 1 Updated Jun 19, 2024

Explore the Multimodal “Aha Moment” on 2B Model

Python 545 17 Updated Mar 18, 2025

NVIDIA Isaac GR00T N1 is the world's first open foundation model for generalized humanoid robot reasoning and skills.

Jupyter Notebook 3,029 363 Updated Mar 30, 2025

🙌 OpenHands: Code Less, Make More

Python 11 4 Updated Jan 8, 2025

Manus code from container

Python 285 128 Updated Mar 31, 2025

Minimal re-implementation of pi0 vision-language-action (VLA) model

Python 3 Updated Mar 9, 2025

Manus AI alternative that run locally. Powered with Deepseek R1. No APIs, No $456 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and code for the sole cost of electricity.

Python 726 94 Updated Apr 1, 2025

Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence

Python 797 48 Updated Jan 31, 2025

YOLOE: Real-Time Seeing Anything

Python 981 76 Updated Apr 1, 2025

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Python 835 80 Updated Nov 8, 2024

DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping

Python 177 14 Updated Apr 1, 2025

A high-performance runtime framework for modern robotics.

C++ 1,054 147 Updated Mar 31, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 14,969 1,755 Updated Mar 31, 2025

R1-onevision, a visual language model capable of deep CoT reasoning.

478 15 Updated Mar 26, 2025

[CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"

Python 121 9 Updated Mar 24, 2025

(CVPR 2025) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 122 6 Updated Mar 26, 2025

[KDD2025] Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective

Python 52 7 Updated Feb 23, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,028 53 Updated Feb 25, 2025

Vision agent

Python 4,451 498 Updated Apr 1, 2025

(TPAMI 2025) Invertible Diffusion Models for Compressed Sensing [PyTorch]

Python 110 12 Updated Mar 9, 2025

中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。

Python 1,495 170 Updated Apr 20, 2024

[CVPR 2025] OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

96 Updated Feb 27, 2025

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 951 32 Updated Jul 31, 2024

[IJCV 2024] Hard-normal Example-aware Template Mutual Matching for Industrial Anomaly Detection

Python 16 1 Updated Jan 1, 2025

Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".

Python 124 3 Updated Dec 13, 2024

Code for the Molmo Vision-Language Model

Python 348 28 Updated Dec 12, 2024

[NeurIPS 2024] SlimSAM: 0.1% Data Makes Segment Anything Slim

Python 326 18 Updated Feb 22, 2025

GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)

Jupyter Notebook 64 4 Updated Jan 2, 2024
Next