Using sklearn PLSRegression on fmri correlation matrices

MeenVen · March 6, 2018, 1:48pm

Given X = n correlation matrices (between ROIs of the brain) and y = multivariate response variable of dimension n*t, we’d like to implement PLSRegression (using sklearn). t=6 in our case.

pls = PLSRegression(n_components = 6)
pls.fit(correlationmatrices, y)
y_pred=pls.predict(correlationmatrices)

The correlationmatrices are obtained using nilearn as below:

correlationmeasure = ConnectivityMeasure(kind=‘correlation’, vectorize=True)
correlationmatrices = correlationmeasure.fit_transform(abide_subjects)

abide_subjects is a list of timeseries of the images obtained as follows:
timeseries_0=regionExtractor.transform(funcfiles[0])

However, the prediction for y seems very off and the mean classification accuracy is always 0.

What is wrong/missing with the above approach?

KamalakerD · March 7, 2018, 6:04pm

Could you please try to give us a reproducible script to understand what’s wrong. For instance, you can use from nilearn import datasets and use datasets.fetch_adhd or fetch_abide_pcp with n_subject=3 or 4.

Thanking you a lot for the script.

bthirion · March 8, 2018, 9:17pm

You may want to try with a more classic regression mdoel, e.g. RidgeRegression. If the prediction is really off, this means that y cannot be well predicted by X.
Best,

Bertrand