Hi,
My question is regarding multiple comparison, especially the FDR method. It is said that we should apply the correction whenever we perform non-independant tests. However, this criteria seems vague to me in some cases.
Consider this:
I use FDR correction on a set of p-values obtained from a parcel-wise analysis, where I perform a statistical test independently for 200 parcels of the brain (regression, ISC, etc.). I then get 200 p-values that I correct with the FDR algorithm (implementation in nlTools.stats.fdr
, Tal Yarkoni: API Reference — nltools 0.5.0 documentation) to know the p threshold that yields q < 0.05.
However, if I have 4 conditions and perform 4 x one-sample t-test across the 200 parcels, then the number of independent tests is 4 x 200. Similarly, if I perform 4 contrasts between these conditions, should I apply the FDR on the whole set of p values (4 x 200 parcels), or independently on the p values of each contrast?
Depending on what we consider to be independent, the results will change, since the corrected threshold could be lower for a condition that has a lot of small p-values (e.g., p <0.003), while it would be more stringent for a condition that has very high p-values (e.g., p < 0.000001).
In nilearn, for example, the correction is applied to one statistical image at a time, passing a Z_map into the multiple comparison function. My intuition would be that if correcting on a set of contrasts at the same time, such function would be built to take a list of images from the contrasts performed during GLM.
Curious to know the best practices regarding this,
Bests,
Dylan S.