Mentors: Visakh Muraleedharan <visakh@incf.org> and Tom Gillespie <tom.h.gillespie@gmail.com>
Skill level: Intermediate
Required skills: Python, AI/ML, NLP, Neo4j, ElasticSearch,React, Node.js, GCP, GitLab CI/CD.
Time commitment: Full time (350 hours)
About: KnowledgeSpace is a community-driven online resource for the neuroscience community, facilitating open and accessible sharing of data, knowledge, and tools. Neuroscience research generates vast amounts of complex data and literature, making it challenging for users to locate specific, relevant information. By implementing an AI agent powered by RAG, the project aims to address this challenge by creating a tool that facilitates quick and reliable access to the right data and knowledge. This will empower neuroscientists, educators, and the broader community to leverage KnowledgeSpace more effectively.
Aims: To further enhance the platform’s usability, this project aims to develop an AI-powered agent that uses Retrieval-Augmented Generation (RAG) to provide precise, contextually relevant, and human-like answers to user queries that will improve user experience by providing concise, context-aware, and scientifically accurate information about neuroscience concepts and datasets.
Scope:
- Integration with existing KnowledgeSpace metadata
- Indexing and data retrieval based on text and vector search
- Neuroscience Domain context adaptation using standards NIFSTD
- Model deployment and integration in Vertex AI
- User Interface development
Websites: https://knowledge-space.org/ and GitHub - INCF/knowledge-space: KnowledgeSpace (KS) is a data-driven encyclopedia and search engine for the neuroscience community.
Tech keywords: Python, AI/ML, NLP, Neo4j, ElasticSearch