Bonn Activity Maps (BAM) API

This project contains the API to load and interact with Bonn Activity Maps datasets. (WIP)

General information about the datasets

The framerate of the videos is 25 Hz.

Installation

Clone or download the repository.
Install the required Python3 packages with: pip3 install -r requirements.txt
Place the BAM dataset folders at the same level as PythonAPI folder.

BAM class functions

BAM(dataset_dir, dataset_name, image_format)

Constructor for the BAM class used to perform all actions over the dataset.

Parameters:
- datset_dir: (String) path to the folder containing the datasets.
- dataset_name: (String) name of the dataset folder.
- image_format: (String, Optional) format wanted for the image extraction. "png"/"jpeg". default="png"
Returns:
- (BAM Object) object to perform operations over the dataset

unroll_videos(force, video)

Unrolls (converts each frame to an individial image) all the videos from the dataset and stores them in the videos folder. This folder will be created if it not exists and will contain another 12 folders (cameraXX), each one containing the unrolled frames for the camera noted with XX.

Parameters:
- force: (Bool, Optional) if True, the already unrolled frames will be deleted (if they exist) and the unroll will be performed. default=False
- video: (Integer, Optional) if None all videos will be unrolled. If it has a value between 1 and 12, only that camera will be unrolled. default=None

Note: if video is None and force is True all the already unrolled videos will be deleted and unrolled again.

get_person_ids()

Returns the existing person ids in the dataset

Returns:
- (numpy) numpy array with the existing IDs of the persons in the dataset

get_static_object_ids()

Returns the existing static object ids in the dataset

Returns:
- (numpy) numpy array with the existing IDs of the static objects in the dataset

get_total_frames()

Returns the total number of frames in the dataset

Returns:
- (int) total number of frames in the dataset

get_total_cameras()

Returns the total number of cameras in the dataset

Returns:
- (int) total number of cameras in the dataset

get_activity_names()

Returns the existing activity names in the dataset

Returns:
- (numpy) numpy array with the existing activity names in the dataset

get_images_in_frame(frame)

Obtains the images for the specified frame in all the cameras.

Parameters:
- frame: (Integer) frame number to get the images from.
Returns:
- (numpy) array with the 12 images for the specified frame.

Note: If the videos are not unrolled before getting the individual frames, then they will be automatically unrolled when you try to get the frames.

get_camera(video, frame)

Obtains the camera object for the specified video and frame.

Parameters:
- video: (Integer) camera/video number to get the object from.
- frame: (Integer) frame number to get the object from.
Returns:
- (Camera) camera object (will be explained in the next section)

get_persons_in_frame(frame)

Get all persons annotated in the given frame.

Parameters:
- frame: (Integer) frame number to get the persons from.
Returns:
- (numpy[personJSON]) persons annotated in the given frame in JSON format (for the structure of the JSON see section JSON structures)

get_person_in_frame(frame, person_id)

Gets the annotation for the specified person in the specified frame

Parameters:
- frame: (Integer) frame number to get the person from.
- person_id: (Integer) person identifier.
Returns:
- (numpy) numpy array with the 3d coordinates for the specified person

get_annotations_for_person(person_id)

Gets all the annotations for the specified person in the dataset

Parameters:
- person_id: (Integer) person identifier.
Returns:
- (numpy[personJSON]) all annotations for the specified person in JSON format (for the structure of the JSON see section JSON structures)

get_poses_in_frame(frame)

Get all poses annotated in the given frame.

Parameters:
- frame: (Integer) frame number to get the poses from.
Returns:
- (numpy[poseJSON]) poses annotated in the given frame in JSON format (for the structure of the JSON see section JSON structures)

get_pose_in_frame(frame, person_id)

Gets the pose annotation for the specified person in the specified frame

Parameters:
- frame: (Integer) frame number to get the person from.
- person_id: (Integer) person identifier.
Returns:
- (numpy) numpy array with the 3D coordinates for the specified person

get_annotations_for_pose(person_id)

Gets all the pose annotations for the specified person in the dataset

Parameters:
- person_id: (Integer) person identifier.
Returns:
- (numpy[poseJSON]) all annotated poses for the specified person in JSON format (for the structure of the JSON see section JSON structures)

get_static_objects_in_frame(frame)

Get all static objects annotated in the given frame.

Parameters:
- frame: (Integer) frame number to get the static objects from.
Returns:
- (numpy[staticobjectJSON]) static objects annotated in the given frame in JSON format (for the structure of the JSON see section JSON structures)

get_static_object_in_frame(frame, static_object_id)

Gets the annotation for the specified static object in the specified frame

Parameters:
- frame: (Integer) frame number to get the static object from.
- static_object_id: (Integer) static object identifier.
Returns:
- (numpy) numpy array with the 3D coordinates for the specified static object

get_annotations_for_static_object(static_object_id)

Gets all the annotations for the specified person in the dataset

Parameters:
- static_object_id: (Integer) static object identifier.
Returns:
- (numpy[staticobjectJSON]) all annotations for the specified static object in JSON format (for the structure of the JSON see section JSON structures)

get_activities_for_person(person_id)

Parameters:
- person_id: (Integer) person identifier.
Returns:
- (numpy[activityJSON]) numpy array with activities for the specified person in JSON format (for the structure of the JSON see section JSON structures)

distance_to_static_object(static_object_type, static_object_points, point)

Calculates the distance vector between a point and a static object's closest point to that point.

Parameters:
- static_object_type: (String) static object type string (boxAIK or cylinderAIK).
- static_object_points: (numpy) static object's points as stored in the dataset.
- point: (numpy) point for which we want to calculate the distance to.
Returns:
- (numpy) numpy array representing the minimum distance between the point and the object (the vector is the direction and its module the distance magnitude). If the point lies inside of the object it returns [0,0,0].

Camera class functions

get_C()

Returns:
- (numpy) (x,y,z) of the camera center in world coordinates.

undistort(image)

Undistorts the given image.

Parameters:
- image: (numpy) image to be undistorted.
Returns:
- (numpy) the image undistorted.

undistort_points(points2d)

Undistorts the given points.

Parameters:
- point2d: (numpy) points to be undistorted [(x,y,w), ...].
Returns:
- (numpy) the points undistorted.

projectPoints_undist(points3d)

Projects 3D points into 2D with no distortion.

Parameters:
- points3d: (numpy) 3D points to be projected.
Returns:
- (numpy) the points in 2D undistorted.

project_points(points3d, withmask, binary_mask)

Projects 3D points into 2D with distortion being considered.

Parameters:
- points3d: (numpy) 3D points to be projected.
- withmask: (Bool, Optional) if True returns mask that tell if a point is in the view or not. default=False
Returns:
- (numpy) the projected points in 2D.
- (numpy) if withmask is True only. Array representing the mask.

JSON structures

Person json

Contains:

person identifier
array with 3D coordinates corresponding to the nose of the person
object type, always personAIK for persons

{
    'pid': {int}, 
    'location': [
                  [x, y, z]
                ],
    'type': 'personAIK'
}

Pose json

Contains:

person identifier
array with 3D coordinates corresponding to the 24 joints of the person
object type, always poseAIK for poses

{
    'pid': {int}, 
    'location': [
                  [x, y, z],
                  [x, y, z],
                   ...
                ],
    'type': 'poseAIK'
}

Static object json

Contains:

static object identifier
array with 3D coordinates corresponding to the main points of the objects:
- boxAIK: [top front left, top front right, bottom back left]
- cylinderAIK: [top center, top border] (the bottom part is always in contact with the floor).
object type, boxAIK or cylinderAIK
array with labels describing the object

{
    'oid': {int}, 
    'location': [
                  [x, y, z],
                  [x, y, z],
                   ...
                ],
    'type': 'boxAIK' or 'cylinderAIK',
    'labels': ['label1', ...]
}

Activity json

Contains:

activity name
frame where the activity begins
frame where the activity ends

{
    'label': {string},
    'start_frame': {int},
    'end_frame': {int}
}

Data structure

Cameras folder

It contains 12 json files, cameraXX.json, with the camera parameters for each camera.

[
  {
    'w': {int},
    'h': {int},
    'start_frame': {int},
    'end_frame': {int},
    'tvec': [x, y, z],
    'rvec': [x, y, z],
    'distCoef': [a, b, c, d, e],
    'K': [3x3 matrix]
  }
]

dataset_unroll.json

It contains 3 jsons with the annotations for persons, objects and poses for all the videos. The format of each concrete type has been previously explained.

{
    'persons': [...]
}
{
    'objects': [...]
}
{
    'actions': [...]
}

dataset.json

It contains relevant information about the dataset.

{
    'n_cameras': {int},
    'scale_to_mm': {int},
    'image_extension': {string},
    'valid_frames': [...]
}

Videos

12 videos, dataset_XX.mp4, which are unrolled if it is necessary.

Videos folder

It is created when the videos are unrolled and contains 12 folders, cameraXX, with the corresponding frames for each camera.

Examples

For examples of how to use the API you can check the Jupyter Notebook test_api.ipynb provided inside the repository.

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
PythonAPI		PythonAPI
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
test_api.ipynb		test_api.ipynb
test_api.py		test_api.py
test_distances.ipynb		test_distances.ipynb

License

cagcoach/aikapi

Folders and files

Latest commit

History

Repository files navigation

Bonn Activity Maps (BAM) API

General information about the datasets

Installation

BAM class functions

BAM(dataset_dir, dataset_name, image_format)

unroll_videos(force, video)

get_person_ids()

get_static_object_ids()

get_total_frames()

get_total_cameras()

get_activity_names()

get_images_in_frame(frame)

get_camera(video, frame)

get_persons_in_frame(frame)

get_person_in_frame(frame, person_id)

get_annotations_for_person(person_id)

get_poses_in_frame(frame)

get_pose_in_frame(frame, person_id)

get_annotations_for_pose(person_id)

get_static_objects_in_frame(frame)

get_static_object_in_frame(frame, static_object_id)

get_annotations_for_static_object(static_object_id)

get_activities_for_person(person_id)

distance_to_static_object(static_object_type, static_object_points, point)

Camera class functions

get_C()

undistort(image)

undistort_points(points2d)

projectPoints_undist(points3d)

project_points(points3d, withmask, binary_mask)

JSON structures

Person json

Pose json

Static object json

Activity json

Data structure

Cameras folder

dataset_unroll.json

dataset.json

Videos

Videos folder

Examples

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages