A simple HTML visualization tool for computer vision research https://github.com/mtli/HTML4Vision
- Easy table description and generation for algorithm comparison and pipeline visualization
- Handy formatting controls to make pretty figures
- Web publishing for remote browsering
- Interactive tables, able to sort and toggle display states
- Tile images with optional captions and hyperlinks
- 3D models with camera controls
pip install html4vision
You can use glob patterns (e.g. results/*_method_2.png
) to specify the content and use imagetable()
to generate the table automatically.
from html4vision import Col, imagetable
# table description
cols = [
Col('id1', 'ID'), # make a column of 1-based indices
Col('img', 'Label Map', 'images/road_*_label.png'), # specify image content for column 2
Col('img', 'Road Object Map', 'images/road_*_roadobj.png'), # specify image content for column 3
Col('img', 'Amodel Road Mask', 'images/road_*_amodelroad.png'), # specify image content for column 4
]
# html table generation
imagetable(cols)
Example: examples/basic.py
The table is described by a list of Col
objects.
Col(type, name, content, subset, style, href)
type
: 'text' for text, 'img' for images, 'overlay' for image overlay, 'id0' for zero-based indices, 'id1' for one-based indices, and 'model' for 3D models.name
: the name of the column.content
: for images (both 'img' and 'overlay') and 3d models, it is a glob pattern or a list of the file paths; for text, it is a list of strings; it isNone
for all other types (indexing is automatic).subset
: subset selection of thecontent
provided. Ifsubset
is a single integer, it is interpreted as lengthn
and the firstn
items are selected; ifsubset
is atuple
, it is interpreted in the form of(start, stop)
or(start, step, stop)
; ifsubset
is alist
, it is interpreted as a list of indices to be selected.style
: a string of CSS rules for the entire column. See styling through CSS for more.href
: either a glob pattern or a list of URLs, works in the same way ascontent
forimg
columns.
Basic syntax:
imagetable(
# contents
cols, out_file='index.html', title='', thumbs_dir=None,
# additional contents
summary_row=None, copyright=True,
# modifiers
pathrep=None, sortcol=None, precompute_thumbs=False, thumb_quality=95,
# style
imsize=None, imscale=1, preserve_aspect=False,
summary_color=None, sticky_header=False, sort_style=None, zebra=False, style=None,
# interaction
overlay_toggle=False, sortable=False,
# 3d model viewer
auto_rotate=False, camera_controls=True, mesh_opt=False,
)
The only required argument is cols
, which is a sequence of Col
objects, specifying the content of the table. out_file
optionally names the output file while title
sets the title of the generated HTML page.
The meaning and format for other arguments can be found in respective sections
- generating the HTML file to another folder:
pathrep
. - size control:
imsize
,imscale
,preserve_aspect
. - precomputed thumbnails:
thumbs_dir
,precompute_thumbs
,thumb_quality
. - sorting:
sortcol
,sortable
,sticky_header
,sort_style
,zebra
,summary_row
andsummary_color
. - display toggle:
overlay_toggle
. - styling through CSS:
style
. - 3d models:
auto_rotate
,camera_controls
andmesh_opt
.
Web publishing is designed for sharing the visualized results. Typically, it can be used for sharing the results with others or viewing the results on a remote compute node without downloading them in advance. Of course, it can also be used as a general purpose HTTP server.
The script provided here is functionally similar to the SimpleHTTPServer
in Python 2 and serves the files in the current directory. It supports both Python 2 & 3 and uses multiple threads for a much better web experience. For security reasons, directory browsing is disabled and accessing files outside of the current directory is not allowed. So you need to type in the exact HTML file name (e.g. http://127.0.0.1:6096/index.html
) to access.
Change the current directory to the directory you want to serve and run
python -m html4vision.server
The default port is 6096. To specify a port (e.g. 23333), run
python -m html4vision.server 23333
The search path, the path used to look for the images in the python script, might be different from the publishing path, the path that is encoded in the generated HTML. It is not uncommon to generate the HTML file to a path other than the current folder. The pathrep
argument is designed for this case. It can be a string, or a tuple of two strings. The former specifies the part of path to be removed while the latter specifies the one replacing it.
Example: examples/pathrep.py
The image size is controlled by imsize
, imscale
, and preserve_aspect
arguments. Note that all scaling is done through JavaScript which takes place after the content of the webpage is loaded.
imsize
can be either a single index or a tuple
or a list
of width and height. If it is a single index, it means scaling the images in all other columns to match the corresponding image in the column specified by the index. The index is zero-based and refers to the items in the list of Col
objects. Understanding the indexing is important when you also use image overlay where two objects describe a single column. For example, you can scale the intermediate feature maps of a convolutional neural network (CNN) to match the size of the input image. If imsize
is a tuple
or a list
of width and height, then all images will be scaled to that size.
imscale
is a factor to scale the image. When used in combination with imsize
, the imscale
is applied after the effects of imsize
.
If preserve_aspect
is True
then the image aspect ratio will be preserved even if the size for imsize
differs from the original image aspect ratio. In that case, the image will be resized so that the maximum image size matches imsize
.
Image alignment is controlled by hori_center_img
. If True
, all images will be horizontally centered.
The default thumbnail previews are based on using JavaScript to resize down the image inside the browser. However, if the amount of image data is too much then this mode can become prohibitively slow or even crash in the browser, due to the system memory resources being exhausted.
If precompute_thumbs
is set to True
then thumbnail images are precomputed and stored alongside the HTML file. This allows the memory requirements in the browser to be greatly reduced. By default, a thumbnail directory name is automatically generated by appending thumbs
to the name provided for out_file
, however, thumbs_dir
can be used to specify the thumbnail directory explicitly. Thumbnail generation is also cached using the modification times returned by the system's os.stat()
function, so the slow thumbnail generation need not be re-run if thumbnails have already been computed in a previous run of the script.
Precomputed thumbnails will be generated with the same extension as images already have. In case the image extension is JPG, thumb_quality
will control the JPEG compression quality.
Example: examples/auto_thumbs.py
The benefit of an HTML table is the freedom to customize styles with CSS. Below is an example of setting the image border and box shadows, and highlighting a particular column.
Example: examples/formatting.py
Two consecutive Col
objects form a single image column. The first Col
object describes the bottom image while the second describes the top image. Their types need to be 'img' and 'overlay' respectively. If the top image by itself is not transparent, you can specify its opacity by adding opacity: 0.5
(value range from 0 to 1) to the style
field.
Example: examples/overlay.py
The sorting feature comes in handy when you have statistics for each data point in the dataset. A one-time sorting can be done at HTML generation. You can specify sortcol
to an index of the column based which you want to sort. Furthermore, you can enable post-generation interactive sorting by setting sortable
to True
.
Once interactive sorting is enabled, a style template can be specified with sort_style
. For a list of templates, check files contains "theme" here. Zebra stripes can be added to the table if zebra
is set to True
. When you have too many columns, it might be useful to enable (set sticky_header
to True
) to keep track of the columns as you scroll down the page. Note that some features may seem irrelevant to sorting, yet they only work when sortable
is True
.
In addition, you will find how to specify a summary row with a particular color in the following example.
Example: examples/sort.py
When image overlay is used and overlay_toggle
is set to True
, you can click the overlaid images to toggle image overlay.
Example: examples/overlay.py
The href
field is designed to create a clickable link for table items. All column types support href
as long as the column content is non-empty. It works in the same way as the content
field of img
columns — it will synergize with subset
selection and the pathrep
argument if specified (see here). For overlay columns, href
should be attached to the preceding img
column and it cannot be used together with overlay_toggle=True
since they both bind to the mouse click.
Example: examples/href.py
In addition to the main function imagetable
, we provide another function to generate a grid display of a list of images: imagetile
. The layout is specified through n_col
, which means each row has items no more than n_col
. The number of rows is calculated automatically. Optionally, you can add captions and URLs to the images.
Basic syntax:
imagetile(
# contents
content, n_col=3, out_file='index.html', title='',
# additional contents
caption=None, href=None, subset=None, copyright=True,
# modifiers
pathrep=None,
# style
imsize=None, imscale=1, caption_bottom=True, style=None,
)
Most arguments bear the same meaning as in imagetable
.
Example: examples/tile.py
For an integrated example of how HTML4Vision is used in practice, you can check out the sAP repo.
Not only does this repo tiles images, but also 3D models! The rendering and control are supported through Google's model-viewer. The 3d models used in the example below are also from model-viewer.
For glb meshes optimized with the EXT_meshopt_compression
extension, enable mesh_opt=True
. For more details about meshopt, see the model-viewer meshoptSupport documentation.
NOTE: you need to serve the generated HTML with a server to view the content (see web publishing). Directly opening the HTML file locally will yield cross-origin error (CROS).
Example: examples/model.py
Any contribution is welcome! Many thanks to these noteworthy community contributions:
- connellybarnes: thumbnail generation
- bertjiazheng: 3d model support