DataJoint is a Houston-based company whose mission is to build and operate a science orchestration platform that eliminates barriers for researchers who rely on automation, AI, and high-performance computing. Our projects include:
- DataJoint Core: open-source Python and MATLAB APIs for defining and operating computational data pipelines—sequences of steps for data acquisition, processing, and transformation—via a MySQL backend.
- DataJoint Elements: a growing compilation of community-curated, open-source software modules for building automated data pipelines and analysis workflows for neuroscience experiments.
- DataJoint Works: a cloud-based platform that hosts pipelines built via Core and Elements, facilitating collaboration and reanalysis.
DataJoint shares the Open Ephys mission of helping labs migrate to open standards. A primary goal of our Element Array Ephys is to provide an open standard for data management and preprocessing. To help teams ingest data directly into the Element, we envision an Open Ephys plugin that streams data directly into a database or an AWS S3 bucket. This project aims to:
- Develop an Open Ephys plugin for streaming data to a server hosting a DataJoint database.
- Develop unit tests for the above plugin.
- Make this mechanism available to DataJoint Works users for direct data upload and automated preprocessing.
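One building block of such a streaming plugin is batching acquired samples into framed chunks that a server can receive and verify in order. The sketch below is a minimal illustration under stated assumptions and is not part of the Open Ephys plugin API or the DataJoint API: the function names (`chunk_samples`, `encode_chunk`, `decode_chunk`), the header layout, and the choice of 16-bit samples are all hypothetical.

```python
import struct
from typing import Iterable, Iterator, List, Tuple

# Hypothetical frame header: 32-bit chunk sequence number, 16-bit sample count.
HEADER = struct.Struct("<IH")


def chunk_samples(samples: Iterable[int], chunk_size: int) -> Iterator[List[int]]:
    """Group an incoming sample stream into lists of at most chunk_size samples."""
    chunk: List[int] = []
    for sample in samples:
        chunk.append(sample)
        if len(chunk) == chunk_size:
            yield chunk
            chunk = []
    if chunk:  # flush the final partial chunk
        yield chunk


def encode_chunk(seq: int, chunk: List[int]) -> bytes:
    """Pack a chunk as header followed by little-endian int16 samples."""
    return HEADER.pack(seq, len(chunk)) + struct.pack(f"<{len(chunk)}h", *chunk)


def decode_chunk(payload: bytes) -> Tuple[int, List[int]]:
    """Recover the sequence number and samples from an encoded chunk."""
    seq, count = HEADER.unpack_from(payload)
    samples = list(struct.unpack_from(f"<{count}h", payload, HEADER.size))
    return seq, samples
```

On the receiving side, the sequence number would let the server detect dropped or reordered chunks before inserting them into the database or writing them to object storage.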
Skill level: Advanced
Required skills: Python, C++
Time commitment: Half-time or full-time (350 h)
Lead mentors: Dimitri Yatsenko, Kabilar Gunalan
Project website: datajoint.com
Backup mentors: Chris Brozdowski
Tech keywords: Python, Open Ephys, Streaming, MySQL, AWS