I’m using fmriprep for fmri preprocessing and then using the confounds_regressors.tsv file to regress out CSF + WM signals. A lot of papers have mentioned using 6 acompcor vectors. However, it seems like some people use percentages (at least 75% of variance explained in the acompcor). To get 75% variance, I would be including 120 acompcor vectors for regression.
A quick follow up question. So, if fmriPrep says 122 components explain 50% of my variance, then it would make sense to include all 122 components from the _confounds.tsv file in my physiological regression?
Why is it that some people only have 5 or 6 components vs. my 122 components?
If you want to be consistent in regressing out 50% variance, then yes.
Others choose to prioritize keeping temporal degrees of freedom (tDOF) in the GLM the same across subjects, which (assuming the scan lengths are constant) would mean including the same amount of regressors in each model. Especially in shorter scans, removing too many tDOF could lead to inflexible model fitting.
This makes a lot of sense. My runs are currently around 10 mins each. What would be considered as a “shorter scan”? As for the tDOF, does this mean that they’re extracting 5 or 6 components from the tCompcor or the aCompcor?
Depends on the TR, but that seems okay. You can always test different schemes on held out subjects if they are not going to be included in your analysis.