All software
DataSAIL
DataSAIL is an open-source software tool that splits machine learning datasets while minimizing Information Leakage. It formulates the splitting of a dataset as a constrained minimization problem and optimizes the data split towards an objective function that accounts for information leakage.
- Convex Optimization
- data management
- Data Splitting
- Python
- Smarty
Forgejo-aneksajo
A data management and collaboration platform based on Forgejo and extended with git-annex support. Essentially a git forge for data projects. It interoperates well with DataLad and provides collaborative web hosting for datasets, with issue trackers, pull requests, CI, and more.
- collaboration
- data management
- Earth & Environment
DataLad NEXT extension
This DataLad extension is a staging area for add-ons, performance upgrades, and user experience improvements. Unlike other topical extensions, the focus is on functionality with broad applicability.
- data distribution
- DataLad
- data management
- Python
- Shell
- Batchfile
mobile Drilling Information System (mDIS)
mDIS is a database management application for capturing and curating metadata on geological samples, drilling progress, and related datasets such as images or geophysical well logs. It is easily adaptable and ensures that all data is compiled and quality-controlled in one place.
- data management
- drilling
- Earth & Environment
- PHP
- Vue
- JavaScript