GSoC 2020 project idea 21: Automated comparison of scientific methods for time-series analysis

Thanks you @Salmankhancodes, @ram this is correct. You can imagine a new function comes in a generates a new ~column of hctsa_datamatrix.csv, and we want to see how the patterns in this column (characterizing the new feature) relate to existing patterns contained in columns of hctsa_datamatrix.csv. Descriptions of each file are in the figshare repo.

3 Likes

@ben.fulcher Thanks now I’m getting clear image what we actually need to do…and now what about comparison ?..are we just going to to compare on the bases of values that we got in our new column and the most similar columns will be the best match or we need to find some pattern for comparison?

@Salmankhancodes maybe a small addition. I guess one has to use OutputToCSV to reduce .mat to .csv files

1 Like

We already have the output in .csv format so not required.

3 Likes

A simple Spearman correlation-based similarity metric is a good place to start.

2 Likes

@ben.fulcher Sir when the user will contribute it’s analysis method then:

  1. the addition of their analysis method in the data-matrix will be any random position or it has to be placed at a specific position by applying clustering algorithm like k-medoids.

  2. addition of analysis method will be an optional feature or it will be added automatically as soon as the user will upload it for comparison purpose ?

Thanks @Salmankhancodes.
1—We will not need to place the new method in the data matrix—the ordering of columns does not need to be special.
2—I think we will allow the user to upload at their discretion (to avoid noise). For this they will need to provide some information about their function (check out how this is done currently on the CompEngine website for uploading time-series data). If possible, it would be great to have a backend where a website manager could approve the upload.

1 Like

Ok Thanks @ben.fulcher Sir so like CompEngine we will make it an optional choice to add their analysis method

1 Like

@ben.fulcher Sir so is this our task to get rank for all input features using Spearman Rank Correlation?

1 Like

Each input algorithm/feature will need to be ranked for its similarity to the existing library of algorithms/features, and the result visualized.

2 Likes

Hi, @harsht24
For finding correlation between data we need not to do it manually instead we can simply use the spearman’s correlation function from scipy library and compare our output with the each of the data-matrix columns.
Hope this helps you .
Thanks

1 Like

@ben.fulcher Since our output file contains only values , how we will tell which column is about which feature?

@ram Please try your best to answer your question before asking it. In this case @Salmankhancodes has already described this to you above, and pointed you to the figshare documentation which describes it. The info is in hctsa_features.csv.

@ben.fulcher Sir we have decent time interval between final proposal submission and announcement for selected students and as i had already submitted my final proposal so i thought i should discuss with you what will you suggest me to do during this period , any resources that i should go through or tools and frameworks that i should brush up or any other contribution to any related repositories or any other thing that you would like to suggest me so that it will us in the longer run during our coding period?

Thanks for the question @Salmankhancodes

2 Likes

@ben.fulcher Thanks For your advice Sir. I’ll surely go through it.

@ben.fulcher Is it necessary for this project to use CompEngine ? Can’t we build whole new web portal from different technologies ?

Open to hearing any plan, but my assumption is that building on an open-source platform (CompEngine) that achieves the same thing (but for data instead of features) would be time-efficient.

why i am saying this is because marionette and backbone is quite outdated …

also I had followed the steps given in compengine frontend respo. but getting error how to set up successfully

@ram I think you haven’t setup the configuration settings.