GSoC 2022 Project Idea 3.1: The Turing Way: A how-to guide to data science (175/350 h)

The Turing Way is an open-source, community-led and collaboratively developed “book project” on making data research accessible for a wider research community (https://the-turing-way.netlify.com). We bring together individuals from diverse fields and expertise to develop practices and learning resources that can make data research accessible and easy to understand. Our community members are researchers, engineers, data librarians, industry professionals, and experts in various domains, at all levels of seniority, from all around the world. They collaborate in the project to develop chapters by compiling best practices, tools and recommendations used by the researchers and data science communities worldwide.

Technical details: All questions, comments, recommendations and discussions are facilitated through an online GitHub repository (https://github.com/alan-turing-institute/the-turing-way). The online book with multiple guides is hosted as a Jupyter Book (https://github.com/jupyter/jupyter-book/) at https://the-turing-way.netlify.com. Jupyter Book formats markdown files and Jupyter notebooks as static HTML making them easy to read. When a notebook is included in the book, the static page includes a link to an interactive version of the notebook via Binder (https://mybinder.readthedocs.io). Additional styling of the front end is possible by providing a CSS file that handles it across the entire book.

Background: Since the project’s launch in 2019, more than 300 contributors have so far co-authored more than 200 subchapters and community documents on reproducible research, communication, collaboration, project design and ethics. As the number of chapters continues to increase, it becomes important for us to offer appropriate ways for our readers to discover relevant and desired content in the book based on their topics of interest and skill levels. Over the last 3 months, software engineers at the Turing have enhanced the user interface (UI) of the book (developed a modular Python package) that made it possible for us to create multiple entry points for different user groups, who can start reading the book by exploring a curated set of chapters, rather than browsing the entire book (See details: https://github.com/alan-turing-institute/bio-Turing-Way/blob/malvikasharan-readme/pathways/documentation/README.md).

GSoC project plan and expected outcome: A GSoC contributor will help us integrate this newly developed package to The Turing Way book and further enhance this feature through user experience (UX) design. They will be supported in setting up community/user feedback processes and conducting interviews/focus groups to understand how our readers and contributors use the book and how this UX/UI enhancement adds to their experience. Based on their interest and availability, they will have the possibility to contribute to the development of Python scripts and GitHub actions to improve the project workflow, chapter development, community engagement and the overall interactivity in the book. They will be provided with appropriate guidance and the opportunity to work in a positive working environment. They will be fairly acknowledged for their contributions to the project.

Required skills: 1) Python programming, 2) basic web-development skill required to work with Jupyter Book, 3) experience working in distributed communities, using git and GitHub.

Optional skills: Experience collaborating on data science or quantitative research projects at any level, JavaScript skills (front end development), interactive visualisation of small datasets.

Possible mentors:

Tech keywords: Python, Jupyter, git, JavaScript

@malvikasharan @malin Thank you for introducing me towards 3.1 the turing way : A how-to guid to data science project.
I am using windows 7, I can install linux. Kindly help as I can’t seem to find a guide on which version to use.

Hi Sumit, you can see the instructions here: https://github.com/alan-turing-institute/the-turing-way/issues/2266.

Can you explain what do you mean by versions here?

Hi I am Pranav Joshi, a 3rd year Undergrad of Computer Science. I am profound with Python, Jupyter and Javascript. I also have a good deal of experience with UI as well. I am proficient in Web Development as well. I would really like to contribute to this project for GSoC 2022.

Hi Pranav, really glad to see that you are testing the code. I have uodated the GitHub issue with more details. Please read that for details. Below I am providing a part of that description to help you get started:

Specific task for assessment

GSoC candidate upon selection will help improve a Python Package currently located here: https://github.com/alan-turing-institute/bio-Turing-Way.
This package has been developed to enhance the usability of The Turing Way by allowing users to browse curated set of chapters (rather than the entire book) by clicking on different cards as shown for ‘Data Study Group’, ‘PhD Students’, ‘Group Leaders’ and ‘Life Scientists’: Welcome — The Turing Way - Personas

We want to integrate this feature in The Turing Way, however, there are some improvements that needs to be done to allow better user experience (UX/UI). Some notes on what can be improved by the GSoC candidates upon seelection:

  • Adding description in the landing page of all profile (making it attractive)
  • All tags appear on the top for profiles that have been curated, so it will be helpful to have only the original profile (where the user clicked) on the top, and rest could be hidden to avoid confusions
  • Something else that you can come up with

Part 1: Tasks after you have tested the Python package locally as described above

Hello @malvikasharan ! I want to confirm the description you mentioned in the message above, is the “cards” you mentioned above can be interpreted as “tags” in website development? So the writer can assign tags to the article/chapter they created, and reader can search books based on what he needs with the tags, regardless of chapter or sequence? Thank you --Hisham

Hi There, you can see the description for card/panel and tags/badges (they are separate features in the Jupyter Book) in the issue posted in my earlier comment.

You can also directly see the card-like feature (panel in the Jupyterbook and as used in test repo for The Turing Way) and page tag or badge (see book badge in Jupyterbook and as used in test repo for The Turing Way on the top of curated pages).

Hey @malvikasharan I have asked some doubts regarding the project on github in my issue related to the turing way project. Can you please answer my queries, so that I can start working on my proposal ?