Skip to content

Docker image to get data from S3 bucket, train a yolo network (ideally on EC2) and save trained weights on S3

Notifications You must be signed in to change notification settings

deividasskiparis/aws-darknet-docker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

aws-darknet-docker

Nvidia based docker image with Darknet to train a neural net, such as yolov3-tiny on your custom images

Works seemlesly with AWS EC2 and S3

Steps:

  1. Create a S3 bucket on AWS
  2. Upload pretrained weights (optional), images, train.txt and test.txt lists to the S3 bucket. Expected structure of the bucket
your-s3-bucket/
└── pretrained/
    └── your-pretrained-weights-filename.conv.15
└── data/
    ├── train.txt
    ├── test.txt
    ├── images/
    |   ├── image-name-1.jpg
    |   ├── image-name-2.jpg
    |   └── ...
    └── labels/
        ├── image-name-1.txt
        ├── image-name-2.txt
        └── ...

train.txt and test.txt should contain image relative paths to the data/ folder in S3, ex.

data/images/image-name-1.jpg
data/images/image-name-2.jpg
  1. Modify class.names, config.data and network.cfg as described by https://github.com/AlexeyAB/darknet#how-to-train-to-detect-your-custom-objects

  2. Set up your EC2 instance

    1. Create a custom role in IAM settings for EC2, which allows access to your S3 bucket.
    2. Create a GPU instance on AWS which already has NVIDIA and CUDA pre-installed. Search for "Deep Learning" when choosing your instance AMI
    3. Ensure you set your created role as an IAM role of the instance
    4. SSH into the instance. More info
    5. clone this repo
    git clone https://github.com/deividasskiparis/aws-darknet-docker.git aws-darknet-docker && cd aws-darknet-docker
    
  3. Build docker image

docker build -t aws-darknet-docker .
  1. Run image
docker run -d --runtime=nvidia --name=aws-darknet-docker \
    -e PRETRAINED_WEIGHTS_FILE="your-pretrained-weights-filename.conv.15" \
    -e S3_BUCKET_NAME="your-s3-bucket-name" \
    aws-darknet-docker

NOTE: If running locally instead on EC2, you will have to provide AWS credentials to your docker container for a role, that has access to your S3 bucket

docker run -d --name=aws-darknet-docker \
    -e PRETRAINED_WEIGHTS_FILE="your-pretrained-weights-filename.conv.15" \
    -e S3_BUCKET_NAME="your-s3-bucket-name" \
    -e "AWS_ACCESS_KEY_ID=$AWS_ACCESS_KEY_ID" \
    -e "AWS_SECRET_ACCESS_KEY=$AWS_SECRET_ACCESS_KEY" \
    -e "AWS_SESSION_TOKEN=$AWS_SESSION_TOKEN" \
    aws-darknet-docker

Check anytime how your container is doing by getting logs from it

docker logs aws-darknet-docker
  1. After the training is finished, the trained weights will be uploaded to you S3 bucket /models directory

  2. Terminate your EC2 after the training is finished to avoid charges

About

Docker image to get data from S3 bucket, train a yolo network (ideally on EC2) and save trained weights on S3

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published