Stars
Demonstration of running a native LLM on an Android device.
Explore the Multimodal “Aha Moment” on a 2B Model
NVIDIA Isaac GR00T N1 is the world's first open foundation model for generalized humanoid robot reasoning and skills.
joncv / OpenHands
Forked from All-Hands-AI/OpenHands. 🙌 OpenHands: Code Less, Make More
Minimal re-implementation of pi0 vision-language-action (VLA) model
Manus AI alternative that runs locally. Powered by Deepseek R1. No APIs, no $456 monthly bills. Enjoy an autonomous agent that thinks, browses the web, and codes for the sole cost of electricity.
Re-implementation of pi0 vision-language-action (VLA) model from Physical Intelligence
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
DexGraspVLA: A Vision-Language-Action Framework Towards General Dexterous Grasping
A high-performance runtime framework for modern robotics.
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
R1-onevision, a visual language model capable of deep CoT reasoning.
[CVPR 2025] The official implementation of "Universal Actions for Enhanced Embodied Foundation Models"
(CVPR 2025) Official repository of the paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"
[KDD2025] Improving Synthetic Image Detection Towards Generalization: An Image Transformation Perspective
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437
(TPAMI 2025) Invertible Diffusion Models for Compressed Sensing [PyTorch]
A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B), with fully open-sourced code for the entire pipeline: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, RLHF optimization, and more. Supports downstream-task SFT fine-tuning, with a fine-tuning example for triplet information extraction.
[CVPR 2025] OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints
[ECCV 2024] The official code of the paper "Open-Vocabulary SAM".
[IJCV 2024] Hard-normal Example-aware Template Mutual Matching for Industrial Anomaly Detection
Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".
[NeurIPS 2024] SlimSAM: 0.1% Data Makes Segment Anything Slim
GroundVLP: Harnessing Zero-shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection (AAAI 2024)