Skip to content

wangpf1129/segment-anything-webui

 
 

Repository files navigation

Segment Anything web UI

demo

This is a web interface for the Segment Anything.

Usage

Environment Require: Python >= 3.8.13, Node >= 18.15.0 (LTS), CUDA(optional)

  1. Fowllow the instructions in the Segment Anything and CLIP to install SAM and CLIP. And prepare webui environment:
# e.g. for Segment Anything
pip install git+https://github.com/facebookresearch/segment-anything.git
pip install opencv-python pycocotools matplotlib onnxruntime onnx

mkdir model
# download the model to `model/`
wget https://dl.fbaipublicfiles.com/segment_anything/sam_vit_b_01ec64.pth -O model/sam_vit_b_01ec64.pth
# https://dl.fbaipublicfiles.com/segment_anything/sam_vit_l_0b3195.pth
# https://dl.fbaipublicfiles.com/segment_anything/sam_vit_h_4b8939.pth

# e.g. for CLIP
pip install pytorch torchvision
pip install ftfy regex tqdm
pip install git+https://github.com/openai/CLIP.git

# python server as backend
pip3 install torch numpy 'uvicorn[standard]' fastapi pydantic python-multipart Pillow click
# or 
cd script && pip3 install -r requirements.txt
# webui frontend
npm i
  1. run the server and webui on different terminals:
python3 scripts/server.py   # webui backend
npm run dev                 # interactive webui frontend

Advanced

Change the .env.local file to change the server address.

The model server can be run on a remote GUI server, and the webui can be run on a local machine.

The API in server.py is Pure Function. Though it is slow (Encoding Image Each Request), it is easy to deploy and maintain.

TODO

  • Pre extract image features
  • Frontend onnx inference
  • Add CLIP for text Prompt
  • Better compress for mask matrix

Reference

License

MIT

Releases

No releases published

Packages

No packages published

Languages

  • TypeScript 81.7%
  • Python 16.0%
  • JavaScript 1.6%
  • CSS 0.7%