NeXusCreator
Python CLI & API for converting heterogeneous experimental data into validated NeXus (HDF5) files using reusable mapping templates and extensible workflows.
Description
NeXusCreator
NeXusCreator is a Python-based command-line tool for converting heterogeneous experimental data into standards-compliant NeXus (HDF5) files.
Experimental instruments produce data in many incompatible formats, making long-term storage, sharing, and reproducibility challenging. NeXusCreator addresses this by automating the transformation of raw data into structured, metadata-rich NeXus files, following community standards used in photon, neutron, and muon science.
Key Capabilities
-
Automated data conversion
Convert raw files (e.g. SPEC, DTA/DAT, HDF5, TIFF) into NeXus (.nxs) format — for single files or entire directories. -
Template-based workflows
Generate reusable.nxddefinition templates directly from input data. Templates support scan expansion, prompt literals for missing metadata (placeholders flagging required manual input), and optional NXDL schema-guided field placement — enabling consistent, repeatable, and FAIR data transformations. -
Batch and large-scale processing
Process individual files or large datasets, including combined outputs for multi-file experiments. Per-scan outputs with HDF5 external links are supported for SPEC workflows. -
Domain-specific workflows
Built-in support for XAS (IKFT/Diamond B18), electrochemistry and operando EIS (batteries/DTA), photoemission (PEAXIS), and MPES experiments, with dedicated parsers and generators. -
Extensible plugin architecture
A priority-based plugin system auto-discovers parsers and generators at runtime. Supporting new formats requires only a single plugin file — no core modifications. -
Standards-compliant structuring and optional validation
Supports schema-guided placement via NXDL application definitions (e.g.NXxas), correct use ofNX_class, signals, axes, and@defaultchains, with optional validation viapunx. -
Python API and pipeline integration
A public API (create_nexus,NeXusCreator) enables seamless integration into automated workflows and data processing pipelines.
Why NeXusCreator?
NeXusCreator bridges the gap between instrument-specific raw data and FAIR, reusable scientific datasets. It enables:
- Reproducible data pipelines
- Consistent and standards-compliant metadata integration
- Interoperable data for cross-facility use and analysis
By separating data extraction (parsers) from data structure definition (.nxd templates), NeXusCreator provides a scalable and flexible foundation for managing complex experimental data workflows.
Typical Use Cases
- Converting beamline data (SPEC, XAS, MPES, EIS) into NeXus for archiving and analysis
- Standardising electrochemical and operando experiment datasets
- Preparing data for facility data management systems and repositories (e.g. ICAT)
- Automating data pipelines in large experimental campaigns
One-Line Summary
Convert heterogeneous experimental data into standards-compliant NeXus (HDF5) files using reusable mapping templates — reproducibly and at scale.
Participating organisations
Reference papers
- 1.Author(s): Hector Perez Ponce, Rolf Krahl, Daniel Tomecki, William Smith, Peter Wegmann, Heike GörzigPublished by Zenodo in 202510.5281/zenodo.17413948
- 2.Author(s): Katherine Rial, Heike Görzig, Rolf Krahl, Hector Perez Ponce, Marcus LewerenzPublished by Zenodo in 202510.5281/zenodo.17803992
- 3.Author(s): Hector Perez Ponce, Rolf Krahl, Daniel Tomecki, Will Smith, Peter Braun, Svetlana Grinman, Oliver Löhmann, Heike GörzigPublished by Zenodo in 202510.5281/zenodo.17815053
- 4.Author(s): Katherine Rial, Heike Görzig, Rolf Krahl, Hector Perez Ponce, Marcus LewerenzPublished by Zenodo in 202510.5281/zenodo.15095878
- 5.Author(s): Hector Perez Ponce, Heike Görzig, Rolf KrahlPublished by Zenodo in 202410.5281/zenodo.19817292
Contributors
Contact person
Hector Perez Ponce
Author/Developer/Maintainer
Helmholtz-Zentrum Berlin für Materialien und Energie
0009-0002-6192-9609
Mail HectorHelmholtz Program-oriented Funding IV
Related projects
DAPHNE4NFDI
Software developed or co-developed in the scope of the DAPHNE4NFDI consortium
HMC
Helmholtz Metadata Collaboration