added code for OpenCV GPU Support

Koriukina, Valeriia · Koriukina, Valeriia · commit e258d38608b3 · 2020-09-09T17:40:37.000+03:00
diff --git a/OpenCV-GPU-Support/README.md b/OpenCV-GPU-Support/README.md
@@ -0,0 +1,167 @@
+# OpenCV GPU Support 
+
+This repository contains the code for [OpenCV GPU Support](https://www.learnopencv.com/opencv-gpu-support/).
+
+## Getting Started
+
+The code was tested with Python 3.7.5, but it should also work with other Python 3.x versions. To check your
+version, run:
+
+```bash
+python3 -V
+```
+
+### Virtual Environment
+
+Let's create a new virtual environment. You'll need to install the [virtualenv](https://pypi.org/project/virtualenv/)
+package if you don't have it:
+
+```bash
+pip install virtualenv
+```
+
+Now we can create a new virtual environment (here with Python's built-in `venv` module) and call it `env`:
+
+```bash
+python3 -m venv env
+```
+
+The last thing we have to do is to activate it:
+
+```bash
+source env/bin/activate
+```
+
+### Numpy
+
+Install the NumPy package by running:
+
+```bash
+pip install numpy
+```
+
+### Installing CUDA
+
+The code was tested using CUDA Toolkit 10.2. Please follow the official instructions to download and install
+[CUDA Toolkit 10.2](https://developer.nvidia.com/cuda-10.2-download-archive) or higher.
+
+### OpenCV with CUDA Support
+
+In this blog post, we use OpenCV with CUDA support to accelerate OpenCV algorithms. That is why we need to
+configure a custom OpenCV build and compile it from source. To do so:
+
+1. Install dependencies:
+
+```bash
+sudo apt-get update
+sudo apt-get install build-essential cmake unzip pkg-config
+sudo apt-get install libjpeg-dev libpng-dev libtiff-dev
+sudo apt-get install libavcodec-dev libavformat-dev libswscale-dev libv4l-dev
+sudo apt-get install libxvidcore-dev libx264-dev
+sudo apt-get install libgtk-3-dev
+sudo apt-get install libatlas-base-dev gfortran
+sudo apt-get install python3-dev
+
+```
+
+2. Download OpenCV 4.3.0 and the matching `opencv_contrib` modules from the official repositories:
+
+```bash
+wget -O opencv.zip https://github.com/opencv/opencv/archive/4.3.0.zip
+wget -O opencv_contrib.zip https://github.com/opencv/opencv_contrib/archive/4.3.0.zip
+```
+
+3. Unzip the downloaded archives:
+
+```bash
+unzip opencv.zip
+unzip opencv_contrib.zip
+```
+
+4. Rename the directories to match CMake paths:
+
+```bash
+mv opencv-4.3.0 opencv
+mv opencv_contrib-4.3.0 opencv_contrib
+```
+
+5. Compile OpenCV:
+
+Create and enter a build directory:
+
+```bash
+cd opencv
+mkdir build
+cd build
+```
+
+Run CMake to configure the OpenCV build. Don't forget to set the right path to the `PYTHON_EXECUTABLE`. If you are using
+a CUDA version other than `10.2`, please change the last three arguments (the `cuda-10.2` paths) accordingly.
+
+```bash
+cmake -D CMAKE_BUILD_TYPE=RELEASE \
+      -D CMAKE_INSTALL_PREFIX=/usr/local \
+      -D INSTALL_PYTHON_EXAMPLES=OFF \
+      -D INSTALL_C_EXAMPLES=OFF \
+      -D OPENCV_ENABLE_NONFREE=ON \
+      -D OPENCV_EXTRA_MODULES_PATH=../../opencv_contrib/modules \
+      -D PYTHON_EXECUTABLE=env/bin/python3 \
+      -D BUILD_EXAMPLES=ON \
+      -D WITH_CUDA=ON \
+      -D CUDA_FAST_MATH=ON \
+      -D WITH_CUBLAS=ON \
+      -D CUDA_TOOLKIT_ROOT_DIR=/usr/local/cuda-10.2 \
+      -D OpenCL_LIBRARY=/usr/local/cuda-10.2/lib64/libOpenCL.so \
+      -D OpenCL_INCLUDE_DIR=/usr/local/cuda-10.2/include/ \
+      ..
+```
+
+Check the output and make sure that everything is set correctly. After that we're ready to build it with:
+
+```bash
+make -j4
+```
+
+Make sure you didn't get any errors. Then run the following command:
+
+```bash
+sudo ldconfig
+```
+
+which creates the necessary links and cache to our freshly built shared library.
+
+Rename the created Python 3 bindings for OpenCV to `cv2.so` (the file name below is for Python 3.7; it differs for other versions):
+
+```bash
+mv lib/python3/cv2.cpython-37m-x86_64-linux-gnu.so cv2.so
+```
+
+The last step is to symlink our `cv2.so` into the virtual environment's `site-packages` directory:
+
+```bash
+cd env/lib/python3.7/site-packages/
+ln -s ~/opencv/build/cv2.so cv2.so
+```
+
+## Running the Demo
+
+To run the demo, you will need to pass:
+
+- `--video` argument to set the path to the video file,
+- `--device` to choose between CPU and GPU inference. By default, the device is set to "cpu".
+
+For example:
+
+```bash
+python3 demo.py --video video/boat.mp4 --device "cpu"
+```
+
+# AI Courses by OpenCV
+
+Want to become an expert in AI? [AI Courses by OpenCV](https://opencv.org/courses/) is a great place to start. 
+
+<a href="https://opencv.org/courses/">
+<p align="center"> 
+<img src="https://www.learnopencv.com/wp-content/uploads/2020/04/AI-Courses-By-OpenCV-Github.png">
+</p>
+</a>
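A note on the README's final build steps: the binding file name (`cv2.cpython-37m-x86_64-linux-gnu.so`) and the `python3.7/site-packages` path are specific to Python 3.7. The sketch below, meant to be run with the virtual environment's interpreter, shows one way to look up both values and to check whether the resulting `cv2` module sees a CUDA device. The helper names (`expected_binding_name`, `site_packages_dir`, `cuda_device_count`) are hypothetical and not part of this commit.

```python
import sysconfig


def expected_binding_name(module="cv2"):
    """File name CPython expects for a compiled extension module."""
    # EXT_SUFFIX looks like ".cpython-37m-x86_64-linux-gnu.so" on Linux;
    # the ".so" fallback is an assumption of this sketch.
    suffix = sysconfig.get_config_var("EXT_SUFFIX") or ".so"
    return module + suffix


def site_packages_dir():
    """site-packages directory of the interpreter running this script."""
    return sysconfig.get_paths()["purelib"]


def cuda_device_count():
    """CUDA devices visible to OpenCV, or 0 if cv2 or its CUDA module is missing."""
    try:
        import cv2
    except ImportError:
        return 0
    cuda = getattr(cv2, "cuda", None)
    if cuda is None:
        return 0
    # Returns 0 for builds without CUDA and -1 for driver problems;
    # both are reported here as "no usable device".
    return max(int(cuda.getCudaEnabledDeviceCount()), 0)


if __name__ == "__main__":
    print("binding file name:", expected_binding_name())
    print("site-packages:", site_packages_dir())
    print("CUDA devices seen by OpenCV:", cuda_device_count())
```

If the last line prints `0` after the build, `--device "gpu"` is unlikely to work, and the CMake output is the first place to look.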
diff --git a/OpenCV-GPU-Support/demo.py b/OpenCV-GPU-Support/demo.py
@@ -0,0 +1,192 @@
+import argparse
+import time
+
+import cv2
+import numpy as np
+
+
+def main(video, device):
+
+    # init dict to track time for every stage at each iteration
+    timers = {
+        "full pipeline": [],
+        "reading": [],
+        "pre-process": [],
+        "optical flow": [],
+        "post-process": [],
+    }
+
+    # init video capture with video
+    cap = cv2.VideoCapture(video)
+    # get default video FPS
+    fps = cap.get(cv2.CAP_PROP_FPS)
+    # get total number of video frames
+    num_frames = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
+
+    # read the first frame
+    ret, previous_frame = cap.read()
+
+    # proceed if frame reading was successful
+    if ret:
+        # resize frame
+        frame = cv2.resize(previous_frame, (960, 540))
+        # convert to gray
+        previous_frame = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
+
+        # create hsv output for optical flow
+        hsv = np.zeros_like(frame)
+        # set saturation to a maximum value
+        hsv[..., 1] = 255
+
+        while True:
+            # start full pipeline timer
+            start_full_time = time.time()
+
+            # start reading timer
+            start_read_time = time.time()
+
+            # capture frame-by-frame
+            ret, current_frame = cap.read()
+
+            # end reading timer
+            end_read_time = time.time()
+            # add elapsed iteration time
+            timers["reading"].append(end_read_time - start_read_time)
+
+            # if frame reading was not successful, break
+            if not ret:
+                break
+
+            # start pre-process timer
+            start_pre_time = time.time()
+            # resize frame
+            frame = cv2.resize(current_frame, (960, 540))
+
+            # convert to gray
+            current_frame = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
+
+            if device == "cpu":
+                # end pre-process timer
+                end_pre_time = time.time()
+                # add elapsed iteration time
+                timers["pre-process"].append(end_pre_time - start_pre_time)
+
+                # start optical flow timer
+                start_of = time.time()
+                # calculate optical flow
+                flow = cv2.calcOpticalFlowFarneback(
+                    previous_frame, current_frame, None, 0.5, 3, 15, 3, 5, 1.2, 0,
+                )
+                # end of timer
+                end_of = time.time()
+                # add elapsed iteration time
+                timers["optical flow"].append(end_of - start_of)
+
+            else:
+                # move both frames to GPU
+                cu_previous = cv2.cuda_GpuMat()
+                cu_current = cv2.cuda_GpuMat()
+
+                cu_previous.upload(previous_frame)
+                cu_current.upload(current_frame)
+
+                # end pre-process timer
+                end_pre_time = time.time()
+                # add elapsed iteration time
+                timers["pre-process"].append(end_pre_time - start_pre_time)
+
+                # start optical flow timer
+                start_of = time.time()
+                # create optical flow instance; arguments are numLevels, pyrScale,
+                # fastPyramids, winSize, numIters, polyN, polySigma, flags
+                gpu_flow = cv2.cuda_FarnebackOpticalFlow.create(
+                    3, 0.5, False, 15, 3, 5, 1.2, 0,
+                )
+                # calculate optical flow on the GPU (the result stays in GPU
+                # memory as a GpuMat)
+                flow = gpu_flow.calc(cu_previous, cu_current, None)
+                # send result from GPU back to CPU
+                flow = flow.download()
+
+                # end of timer
+                end_of = time.time()
+                # add elapsed iteration time
+                timers["optical flow"].append(end_of - start_of)
+
+            # start post-process timer
+            start_post_time = time.time()
+
+            # convert from cartesian to polar to get magnitude and angle
+            mag, ang = cv2.cartToPolar(flow[..., 0], flow[..., 1])
+            # set hue according to the angle of optical flow
+            hsv[..., 0] = ang * 180 / np.pi / 2
+            # set value according to the normalized magnitude of optical flow
+            hsv[..., 2] = cv2.normalize(mag, None, 0, 255, cv2.NORM_MINMAX)
+            # convert hsv to bgr for display
+            rgb = cv2.cvtColor(hsv, cv2.COLOR_HSV2BGR)
+            # update previous_frame value
+            previous_frame = current_frame
+
+            # end post-process timer
+            end_post_time = time.time()
+            # add elapsed iteration time
+            timers["post-process"].append(end_post_time - start_post_time)
+
+            # end full pipeline timer
+            end_full_time = time.time()
+            # add elapsed iteration time
+            timers["full pipeline"].append(end_full_time - start_full_time)
+
+            # visualization
+            cv2.imshow("original", frame)
+            cv2.imshow("result", rgb)
+            k = cv2.waitKey(1)
+            if k == 27:
+                break
+
+    # release the capture
+    cap.release()
+    # destroy all windows
+    cv2.destroyAllWindows()
+
+    # print results
+    print("Number of frames: ", num_frames)
+
+    # elapsed time at each stage
+    print("Elapsed time")
+    for stage, seconds in timers.items():
+        print("-", stage, ": {:0.2f} seconds".format(sum(seconds)))
+
+    # calculate frames per second from the frames actually processed
+    print("Default video FPS: {:0.2f}".format(fps))
+
+    of_fps = len(timers["optical flow"]) / max(sum(timers["optical flow"]), 1e-9)
+    print("Optical flow FPS: {:0.2f}".format(of_fps))
+
+    full_fps = len(timers["full pipeline"]) / max(sum(timers["full pipeline"]), 1e-9)
+    print("Full pipeline FPS: {:0.2f}".format(full_fps))
+
+
+if __name__ == "__main__":
+
+    # init argument parser
+    parser = argparse.ArgumentParser(description="OpenCV CPU/GPU Comparison")
+
+    parser.add_argument(
+        "--video", help="path to .mp4 video file", required=True, type=str,
+    )
+
+    parser.add_argument(
+        "--device",
+        default="cpu",
+        choices=["cpu", "gpu"],
+        help="device to run inference on",
+    )
+
+    # parsing script arguments
+    args = parser.parse_args()
+    video = args.video
+    device = args.device
+
+    # run pipeline
+    main(video, device)
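The post-processing step in `demo.py` maps each flow vector to a colour: the angle becomes the hue (`ang * 180 / np.pi / 2`) and the min-max normalised magnitude becomes the value. The pure-Python sketch below walks through that arithmetic for single vectors. `flow_to_hue` and `normalize_minmax` are hypothetical helpers written for illustration, not functions from OpenCV or from this commit. Rounding the hue and returning the lower bound for constant input are choices of this sketch.

```python
import math


def flow_to_hue(dx, dy):
    """Map a flow vector to an 8-bit OpenCV-style hue in 0..179."""
    # Angle in radians in [0, 2*pi), the range cartToPolar uses by default.
    angle = math.atan2(dy, dx) % (2 * math.pi)
    # Radians -> degrees, then halve so 0..360 degrees fits the 0..180 hue scale.
    return round(angle * 180 / math.pi / 2) % 180


def normalize_minmax(values, low=0, high=255):
    """Linearly rescale values so the minimum maps to low and the maximum to high."""
    vmin, vmax = min(values), max(values)
    if vmax == vmin:
        # Constant input has no range to stretch; this sketch returns `low`.
        return [low for _ in values]
    scale = (high - low) / (vmax - vmin)
    return [low + (v - vmin) * scale for v in values]


if __name__ == "__main__":
    for vec in [(1, 0), (0, 1), (-1, 0), (0, -1)]:
        print(vec, "-> hue", flow_to_hue(*vec))
    print(normalize_minmax([0.0, 1.0, 2.0]))
```

Motion to the right gives hue 0, motion along +y gives 45, to the left 90, and along -y 135, so opposite directions end up with clearly different colours.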
diff --git a/OpenCV-GPU-Support/video/boat.mp4 b/OpenCV-GPU-Support/video/boat.mp4
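When `demo.py` is run over a clip such as `video/boat.mp4`, each stage is timed with paired `time.time()` calls and the durations are appended to a dict. One possible way to reduce that boilerplate is a small context-manager timer, sketched below. `StageTimers` is a hypothetical class, not part of this commit, and it uses `time.perf_counter()` rather than `time.time()` as a design choice. FPS is computed from the number of timed iterations, so it stays meaningful if playback stops early.

```python
import time
from collections import defaultdict
from contextlib import contextmanager


class StageTimers:
    """Collects per-iteration durations for named pipeline stages."""

    def __init__(self):
        self.samples = defaultdict(list)

    @contextmanager
    def measure(self, stage):
        # perf_counter is a monotonic clock intended for measuring intervals.
        start = time.perf_counter()
        try:
            yield
        finally:
            # Recorded even if the timed block exits early.
            self.samples[stage].append(time.perf_counter() - start)

    def total(self, stage):
        """Total seconds spent in a stage."""
        return sum(self.samples[stage])

    def fps(self, stage):
        """Iterations per second over the iterations actually timed."""
        elapsed = self.total(stage)
        return len(self.samples[stage]) / elapsed if elapsed > 0 else 0.0


if __name__ == "__main__":
    timers = StageTimers()
    for _ in range(3):
        with timers.measure("full pipeline"):
            with timers.measure("optical flow"):
                sum(range(10000))  # stand-in for the real computation
    print("optical flow: {:0.6f} seconds".format(timers.total("optical flow")))
    print("optical flow FPS: {:0.2f}".format(timers.fps("optical flow")))
```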
diff --git a/README.md b/README.md
@@ -13,6 +13,7 @@ Want to become an expert in AI? [AI Courses by OpenCV](https://opencv.org/course
 
 | Blog Post | |
 | ------------- |:-------------|
+|[OpenCV GPU Support](https://www.learnopencv.com/opencv-gpu-support/)|[Code](https://github.com/spmallick/learnopencv/tree/master/OpenCV-GPU-Support)|
 |[Training a Custom Object Detector with DLIB & Making Gesture Controlled Applications](https://www.learnopencv.com/training-a-custom-object-detector-with-dlib-making-gesture-controlled-applications/)|[Code](https://github.com/spmallick/learnopencv/tree/master/Training_a_custom_hand_detector_with_dlib) |
 |[How To Run Inference Using TensorRT C++ API](https://www.learnopencv.com/how-to-run-inference-using-tensorrt-c-api/) | [Code](https://github.com/spmallick/learnopencv/tree/master/PyTorch-ONNX-TensorRT-CPP) |
 |[Using Facial Landmarks for Overlaying Faces with Medical Masks](https://www.learnopencv.com/using-facial-landmarks-for-overlaying-faces-with-masks/)|[Code](https://github.com/spmallick/learnopencv/tree/master/FaceMaskOverlay) |