IIIF + Machine Learning Experiments

tl;dr

This repository contains code for a work in progress project to explore whether computer vision models can be used in conjunction with IIIF to transfer metadata across GLAM (Galleries, Libraries, Archives and Museums) collections.

Overview

This repository is where we organise some in progress work exploring whether we can use computer vision to help enrich metadata of image collections held by GLAM institutions. At the moment we're focusing on photographs. In particular, we're interested in generating metadata that is relevant for GLAM institutions. We could, for example, use a model trained on the COCO dataset on an image collection. However, this might not produce metadata that is particularly useful for a GLAM institution. We are therefore starting our exploration of this topic by focusing on the Thesaurus for Graphic Materials (TGM) and photographs.

As part of this, we are also particularly interested in using IIIF to enable this type of work. We are currently gathering data (trying to leverage existing metadata as training labels) and creating some basic baseline computer vision models.

This is all very much work in progress.

Repository contents

loc_harvester contains code for getting data from the Library of congress
data/europeana/example_data/data/edm/ cotains example European data
Various .ipynb notebook files contain WIP experiments/baselines for training models

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
data/europeana/example_data		data/europeana/example_data
loc_harvester		loc_harvester
notes		notes
papers		papers
presentation		presentation
.gitignore		.gitignore
00_wikidata_dl.ipynb		00_wikidata_dl.ipynb
01_wikidata_dl_label.ipynb		01_wikidata_dl_label.ipynb
02_wikidata_train_baseline.ipynb		02_wikidata_train_baseline.ipynb
02_wikidata_train_baseline_less_labels.ipynb		02_wikidata_train_baseline_less_labels.ipynb
03_download_loc.ipynb		03_download_loc.ipynb
05_presentation.ipynb		05_presentation.ipynb
README.md		README.md
weighted_loss_wikidata_train_baseline_less_labels.ipynb		weighted_loss_wikidata_train_baseline_less_labels.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

IIIF + Machine Learning Experiments

tl;dr

Overview

Repository contents

About

Releases

Packages

Languages

glenrobson/IIIF-ML-TGM

Folders and files

Latest commit

History

Repository files navigation

IIIF + Machine Learning Experiments

tl;dr

Overview

Repository contents

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages