GSoC Project Idea 8.2: Enhancing the scientific data format validation service in GIN

The G-Node Data Infrastructure (GIN) services[1] provide a platform for managing and sharing neuroscience data. Inspired by GitHub, the platform uses a git/git-annex backend for versioning and sharing scientific data, combining the power of a web-based repository management service with distributed file storage. It covers the range of research data workflows, from data analysis on a local workstation to remote collaboration and data publication. GIN also provides indexing services for convenient searching of data and metadata, including information stored in well-defined formats such as the odML[2] metadata format and the NIX[3] format for scientific data.

To improve the quality of published and shared scientific data, a microservice for validating scientific data formats is being established. The gin-valid[4] service currently supports validation of BIDS[5] files within a GIN repository.

The goal of the project is to extend this service to validate further scientific data formats, e.g. NIX, odML, PyNN[6] or NeuroML[7], and to improve the usability of the existing prototype.

Skills: A successful candidate will have experience with the Go programming language and be familiar with git and GitHub-related topics.

Mentors: Achilleas Koutsou, Michael Sonntag, G-Node

[1] https://gin.g-node.org
[2] https://github.com/G-Node/python-odml
[3] https://github.com/G-Node/nix
[4] https://github.com/G-Node/gin-valid
[5] https://bids.neuroimaging.io/
[6] https://neuralensemble.org/PyNN/
[7] https://www.neuroml.org/