In short, perform these procedures separately to get the confounding regressors of interest from them (i.e. the top 5 components from aCompCor and the timeseries of the ICA components identified as noise from ICA-AROMA). Then regress these confounders out using something like nilearn’s clean_img function.
EDIT: The top consideration that comes to mind is that ICA-AROMA is, in my experience, not good at identifying multiband artifacts as noise. FYI, these typically manifest as parallel lines seen in sagittal or coronal views:
Therefore, if you dataset is small enough, you might want to just manually go over the ICA components in a viewer like fsleyes and check which ones ICA-AROMA missed. If your dataset is huge, consider the return-on-investment that’s offered by ICA-FIX instead.