This repository provides a tool for importing HDF5 raw data into the SciCat data catalog used at the European Spallation Source (ESS). It also supports metadata harvesting to ensure comprehensive data cataloging.
- HDF5 Import: Automatically imports raw data files into SciCat.
- Metadata Harvesting: Extracts and registers metadata for datasets.
- Integration Ready: Seamlessly connects with the SciCat API.
- Python 3.8+
- Required libraries:
h5py
requests
Install dependencies:
pip install h5py requests
-
Clone the repository:
git clone https://github.com/your-username/scicat-hdf5-import.git
cd scicat-hdf5-import -
Install dependencies:
pip install -r requirements.txt
-
Update the
config.json
file with:- SciCat API endpoint.
- Authentication token.
-
Import HDF5 data:
python import_hdf5.py --file -
Monitor logs for import status and errors.
import_hdf5.py
: Main script for importing HDF5 data.config.json
: Configuration for SciCat API and metadata settings.requirements.txt
: Python dependencies.README.md
: Documentation for the repository.
-
Import an HDF5 file:
python import_hdf5.py --file example.h5 -
View metadata extraction:
python import_hdf5.py --file example.h5 --verbose
-
Fork the repository.
-
Create a new branch:
git checkout -b feature/your-feature -
Commit your changes:
git commit -m "Add your feature" -
Push the branch:
git push origin feature/your-feature -
Open a pull request.
This project is licensed under the MIT License. See the LICENSE file for details.
Effortlessly import HDF5 data and harvest metadata with this SciCat tool! ππ