GSoC 2023 Project Idea 19.1 The Turing Way: A how-to guide to data science (175/350 h)

The Turing Way is an open-source, community-led and collaboratively developed “book project” on making data research accessible for a wider research community (https://the-turing-way.netlify.com). We bring together individuals from diverse fields and expertise to develop practices and learning resources that can make data research accessible and easy to understand. Our community members are researchers, engineers, data librarians, industry professionals, and experts in various domains, at all levels of seniority, from all around the world. They collaborate in the project to develop chapters by compiling best practices, tools and recommendations used by the researchers and data science communities worldwide.

Technical details: All questions, comments, recommendations and discussions are facilitated through an online GitHub repository (GitHub - alan-turing-institute/the-turing-way: Host repository for The Turing Way: a how to guide for reproducible data science). The online book with multiple guides is hosted as a Jupyter Book (GitHub - executablebooks/jupyter-book: Create beautiful, publication-quality books and documents from computational content.) at https://the-turing-way.netlify.com. Jupyter Book formats markdown files and Jupyter notebooks as static HTML making them easy to read. When a notebook is included in the book, the static page includes a link to an interactive version of the notebook via Binder (https://mybinder.readthedocs.io). Additional styling of the front end is possible by providing a CSS file that handles it across the entire book.

Background: Since the project’s launch in 2019, more than 300 contributors have so far co-authored more than 200 subchapters and community documents on reproducible research, communication, collaboration, project design and ethics. As the number of chapters continues to increase, it becomes important for us to offer appropriate ways for our readers to discover relevant and desired content in the book based on their topics of interest and skill levels. Over the last 3 months, software engineers at the Turing have enhanced the user interface (UI) of the book (developed a modular Python package) that made it possible for us to create multiple entry points for different user groups, who can start reading the book by exploring a curated set of chapters, rather than browsing the entire book (See details: bio-Turing-Way/README.md at malvikasharan-readme · alan-turing-institute/bio-Turing-Way · GitHub).

GSoC project plan and expected outcome: A GSoC contributor will help us integrate this newly developed package to The Turing Way book and further enhance this feature through user experience (UX) design. They will be supported in setting up community/user feedback processes and conducting interviews/focus groups to understand how our readers and contributors use the book and how this UX/UI enhancement adds to their experience. Based on their interest and availability, they will have the possibility to contribute to the development of Python scripts and GitHub actions to improve the project workflow, chapter development, community engagement and the overall interactivity in the book. They will be provided with appropriate guidance and the opportunity to work in a positive working environment. They will be fairly acknowledged for their contributions to the project.

Skill level: Beginner/Intermediate/Advanced

Required skills: Python programming, basic web-development skill required to work with Jupyter Book, and experience working in distributed communities, using git and GitHub. Optional skills: Experience collaborating on data science or quantitative research projects at any level, JavaScript skills (front end development), interactive visualisation of small datasets.

Time commitment: Flexible (17/350 h)

Lead mentor: Malvika Sharan (msharan@turing.ac.uk)

Project website: the-turing-way.netlify.com

Backup mentors: Turing Way community

Tech keywords: Python, Jupyter, git, JavaScript

2 Likes

Hello! @arnab1896 @malin @malvikasharan
*I’m Vrushali, Data science and programming student at IIT madras India . I’m passionate about Data science, AI , neuroinformatics , AR VR XR.

*I’ve some background in python , Javascript , html and
css .

*I would like to contribute to this project. I believe my skills and experience align perfectly with this project.

  • I like the idea of The turing way : how to guide to data science , to make research accessible to experts of various domain. The turing way is a huge community I think working with you will enhance my knowledge and creativity. I checked git repo , your youtube channel videos and feel extremely motivated to work with you.

It would be my pleasure if i can hear back from you and guide me how to start contributing.
Thank you :blush:

Hi @vrun , nice to hear from you :slight_smile:
As mentioned in the project idea itself, there are quite a few links and resources shared (like the project website, readme docs and github repository). Please go through them and try to come up with an idea of how “you” will implement the project. In the meantime, please give Malvika some time to reply.
Happy to help in case of more queries.

Also, please remember, that the more pointed and specific queries you come up with, the better mentors will be able to give you feedback

Thanks

1 Like

Hi @arnab1896,
Hi @malvikasharan, :slight_smile:

I’d be super interested to contribute to the Turing Way project as part of google summer of code!
I have already contributed to the Turing Way, and to the INCF Matlab Summer project last year. I am fluent in Python, and Jupyter book, git and have basic front end web dev skills (Javascript, html, css). I am a Post-Doc in computational psychiatry and enrolled in a Bachelor’s of Computer Science, so I work with a lot of data in general.

Just thought I’d give my official shoutout of interest here - also happy to talk on slack :slight_smile:

Best,
Johanna

1 Like

hello @arnab1896 ,
I am Rugved Shinde , 3rd year IT undergraduate studying in Vidyalankar Institute of Technology .

I am writing this letter to express my interest in joining the INCF community as a contributor . I have gone through the opensource projects and I feel they are of quality and worthy of having considerable impact . I am interested in 19.1 The Turing Way:A how-to guide to data science as it is a huge community and causing considerable impact
I hope I’ll get the chance to contribute to the open source projects at any scale possible whether it’s fixing bugs , improving documentation , or adding new features , I am ready to put my knowledge and skills to good use. I am eager to learn from the experienced contributors and mentors and work together to make the project even better . Kindly provide me further steps that I can take to contribute .

sincerely,

Rugved Shinde

1 Like

Hi @arnab1896!

I am a second year student of computer engineering at Universidade Tecnológica do Paraná.
I like the idea of understanding how the brain works, and being able to help an organization that
wants to make this understanding easier would be very cool.

Data science appears to be a interesting subject. Knowing about The Turing Way book
was good.

I currently have some experience with Python, but, by helping this project idea, I am willing to learn
more about web development, UX design, and how to interact in the open source environment.

Thanks for reading,
Fábio.

Hello @arnab1896

I am currently doing my masters in Topological Data Analysis in Data Science. My main field is Mathematics.

I am currently interested in contributing to 19.1 The Turing Way: A how -to guide to data science. I hope I get the chance to contribute to this project. It fits my current skill set and I am currently a beginner Python coder.

Thank you for reading
Jessica Randall

Hi All, Apology for coming back to this so late. I was not keeping track of the posts here, and was managing some query directly on GitHub. I am sharing the link to Slack for you to join and ask questions directly: Slack.

Furthermore, I have created an issue with some initial steps for you test what the defined project looks like: 2023 GSoC discussion issue at the Exploration Phase · Issue #2978 · alan-turing-institute/the-turing-way · GitHub. We might have a few more ideas for project, so please do join conversation or suggest related ideas via Slack.

1 Like

Hi @Johanna_Bayer, you are very senior and a well established open source contributor for the GSoC. As discussed with you, I have mentioned you as a mentor with me on this project: 2023 GSoC discussion issue at the Exploration Phase · Issue #2978 · alan-turing-institute/the-turing-way · GitHub. Thank you again for expressing interest.

hi @fabio-bays, The Turing Wa is not directly related for brain imaging or neuroscience, but broadly applicable practices in data science. The project described here aims to improve a Python package developed for The Turing Way. Please see this issue for more details: 2023 GSoC discussion issue at the Exploration Phase · Issue #2978 · alan-turing-institute/the-turing-way · GitHub.

1 Like

Hi @vrun , @fabio-bays @Johanna_Bayer @Rks-7 @JESSICA_HANNAH_RANDA

Please request you to kindly join Slack.

This is where you can directly reach out to advisor/mentor for the project - @malvikasharan for queries

2 Likes

Hello everyone, I am Arya, an undergraduate student of Computer Science Engineering with a specialisation in AI from Amrita University, Amritapuri, Kerala. I have a good foundation in python and I am skilled in using React, Flask and Django (beginner level). I have worked with a startup organisation as well (https://traboda.com/), where I had the opportunity to work on the front end of various projects which helped me in gaining a good experience in front end web development. I found this project interesting and I believe that I possess the essential skills to work on this project.

@malvikasharan @arnab1896
Hello Everyone,

My name is SUKRUTA JENA. I am from Bhubaneswar, Odisha. I am a 3rd-year undergraduate pursuing Btech in Computer science and technology from Parala Maharaja Engineering College, India.

I have good knowledge in AI, Machine Learning, Python, Tensorflow ,keras, openCV,NLP, Jupyter Notebook ,Java, C++, python.I have a good practical knowledge regarding blockchain, ethereum, solidity and Dapps.Currently I am learning Data science From IIT - Madras(Distance Course).

I have also a great Experience in HTML , CSS.

I have done a few machine learning projects like house rent prediction ,mask detection and smart attendance system etc.

I am a newbie in the world of open source and have a keen interest in exploring it. It would be very kind if I could get some help in ways I could contribute to this organization. I am interested in the project “The Turing Way”.

Thank You.

Hey @arnab1896.
You can see my skill in this area with my project at https://bouk.vercel.app

It uses MUI as the UI and UX system. for the frontend development I have used React JS and Backend is written in Node JS. Efficient use of API ensure its connectivity with the Recommendation model working on its own hosted server. All necessary details are stored on Mongo DB at its Atlas Cloud platform thus ensuring ease of scaling.

The ML model uses NLP with content based recommendation implemented using the K-NN algorithm. Biggest challenge was fitting huge dataset into the limited RAM available. To accomplish this task I used the sparse matrix.

My skills would definitely be assert to your rapidly growing organisation.
Please give me consideration for this opportunity.

Hello! @arnab1896 @malvikasharan
I am Mohamed Nasser, an undergraduate student of Biomedical engineering in Cairo university.
I have good knowledge in Python, Machine learing, Deep learning, Pytorch.
Also I’m experienced with different programming languages as C++, Java and JavaScript.
I’m skilled in using Flask, Spring boot, ExpressJs, Html, Css, JavaScript, React.js and AngularJs.
I have experience in fullstack development through my internships.
I found this project interesting and i think have skills that needed for this project.
I want to increase my opportunities to contribute to this project in gsoc 2024.
Also Iam not familiar with open source contribution so i think it’s a good way to start.