Verify data uploaded and what should be included
Hi @cldw780f-at-tu-dresden.de : Here's a larger issue that I cannot solve because I have no clue about the origin of the data.
We are uploading a relase-version to our dataverse: https://data.fdz.ioer.de/dataset.xhtml?persistentId=doi:10.71830/6ILS40&version=DRAFT
This also includes the data that was downloaded within Jupyter notebooks and stored undwer 00_data
folder. Note that this folder is added to .gitignore
, so it is not part of the git repo.
Also see the file-tree here: https://stag.training.fdz.ioer.info/notebooks/205_publish.html#list-the-directory-file-tree
The question is: Should this data be part of the data archive, too? If not, we must remove it prior to creating the release, and then make sure that it can by dynamically retrieved in Jupyter notebooks at all times, ideally from other dataverse uploads. If the data is not yet in the dataverse, we can (e.g.) temporarily upload it to the TUD Datastore and download it from there, but this is a bit hacky.
This also, from my point, affects how this data publication should be cited: If Ralf-Uwe contributed to data used in here, but uploaded to a different dataverse publication, he should be the author of this (other) data publication, and not part of the training materials authors itself.
Anyway, I am not going to publish the data publication as of now and wait until you had a chance to look at this. There's also data from Fatemeh missing, and I don't know where to get it from (see my other issue).