I’m having some confusing issues getting data with datalad. I’ve used datalad on our server before to grab openneuro datasets (albeit a dataset with a much smaller number of files), so I’m not sure if the problem is related to this specific repo, with many files, or if something changed on our servers.
I can clone the repo, though I get a message about config file download failure:
$ datalad clone https://github.com/ReproBrainChart/HBN_CPAC.git
[INFO ] Remote origin not usable by git-annex; setting annex-ignore
[INFO ] https://github.com/ReproBrainChart/HBN_CPAC.git/config download failed: Not Found
[INFO ] RIA store unavailable. -caused by- file:///cbica/comp_space/RBC/tmp_dir/output_ria/ria-layout-version not found, self.ria_store_url: ria+file:///cbica/comp_space/RBC/tmp_dir/output_ria, self.store_base_pass: /cbica/comp_space/RBC/tmp_dir/output_ria, self.store_base_pass_push: None, path: <class 'pathlib.PosixPath'> /cbica/comp_space/RBC/tmp_dir/output_ria/ria-layout-version -caused by- [Errno 2] No such file or directory: '/cbica/comp_space/RBC/tmp_dir/output_ria/ria-layout-version'
[INFO ] RIA store unavailable. -caused by- file:///cbica/comp_space/RBC/tmp_dir/input_ria/ria-layout-version not found, self.ria_store_url: ria+file:///cbica/comp_space/RBC/tmp_dir/input_ria, self.store_base_pass: /cbica/comp_space/RBC/tmp_dir/input_ria, self.store_base_pass_push: None, path: <class 'pathlib.PosixPath'> /cbica/comp_space/RBC/tmp_dir/input_ria/ria-layout-version -caused by- [Errno 2] No such file or directory: '/cbica/comp_space/RBC/tmp_dir/input_ria/ria-layout-version'
When I use “datalad get”, I get a “notneeded” message, and nothing is downloaded:
$ datalad get cpac_RBCv0/sub-NDARTB661TVR/ses-HBNsiteCBIC/func/sub-NDARTB661TVR_ses-HBNsiteCBIC_task-rest_run-?_atlas-Schaefer2018p200n17_space-MNI152NLin6ASym_reg-aCompCor_desc-PearsonNilearn_correlations.tsv
action summary:
get (notneeded: 2)
I’m running this on an NFS mount, so I’m not sure if that’s causing the problem (though again it worked on other datasets). If I datalad clone
to a local scratch directory, I get the same config download failure message, but when I datalad get
it does download eventually after a long initial “hang”. I get why NFS might slow things down, but I’m confused why I would get a “notneeded” message, as though the file already exists.
I’m running all of this on a ubuntu 22.04 server, with datalad 1.0.1.
Any ideas what might be going on?