Verify data uploaded and what should be included

Hi @cldw780f-at-tu-dresden.de: Here's a larger issue that I cannot solve because I don't know where the data comes from.

We are uploading a release version to our dataverse: https://data.fdz.ioer.de/dataset.xhtml?persistentId=doi:10.71830/6ILS40&version=DRAFT

This also includes the data that was downloaded within the Jupyter notebooks and stored under the 00_data folder. Note that this folder is listed in .gitignore, so it is not part of the git repo.

Also see the file-tree here: https://stag.training.fdz.ioer.info/notebooks/205_publish.html#list-the-directory-file-tree

The question is: Should this data be part of the data archive, too? If not, we must remove it before creating the release, and then make sure that it can be retrieved dynamically in the Jupyter notebooks at all times, ideally from other dataverse uploads. If the data is not yet in the dataverse, we can (e.g.) temporarily upload it to the TUD Datastore and download it from there, but this is a bit hacky.
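
To make the "retrieve dynamically" option more concrete, here is a minimal sketch of what a notebook cell could look like. It is only an illustration under assumptions, not how the notebooks currently work: the helper names `dataverse_file_url` and `fetch_if_missing` are made up, and the URL pattern assumes the target dataverse exposes the standard Dataverse data access endpoint with file-level persistent IDs enabled (if it does not, the numeric file ID variant would be needed instead).

```python
from pathlib import Path
from urllib.request import urlretrieve

# Local, git-ignored data folder used by the notebooks.
DATA_DIR = Path("00_data")


def dataverse_file_url(base_url, file_pid):
    """Build a download URL for a single file (hypothetical helper).

    Assumes the standard Dataverse data access endpoint and that the
    installation assigns persistent IDs to individual files.
    """
    return f"{base_url}/api/access/datafile/:persistentId?persistentId={file_pid}"


def fetch_if_missing(url, filename, data_dir=DATA_DIR, download=urlretrieve):
    """Download `url` to `data_dir/filename` unless the file already exists.

    `download` is injectable so the logic can be tested without network access.
    Returns the local path in both cases.
    """
    data_dir = Path(data_dir)
    data_dir.mkdir(parents=True, exist_ok=True)
    target = data_dir / filename
    if not target.exists():
        download(url, target)
    return target
```

In a notebook this would be called once per input file, with the real base URL and file DOI filled in by whoever knows where the data is published. With something like this in place, 00_data could be left out of the release archive.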

From my point of view, this also affects how this data publication should be cited: If Ralf-Uwe contributed data that is used here but uploaded to a different dataverse publication, he should be an author of that (other) data publication, and not one of the authors of the training materials themselves.

Anyway, I am not going to publish the data publication for now and will wait until you have had a chance to look at this. There's also data from Fatemeh missing, and I don't know where to get it from (see my other issue).