You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently relative_path is mandatory, and a name is not (it's automatically generated from the relative_path if not provided.
It is potentially more user friendly the other way round, where many users may not care about the data structure within the registry, i.e., they don't care about the relative_path. I think often people will just choose any relative path as they have to enter something.
Propose changing the name to be mandatory, and the relative_path is automatically generated (if not provided). The automated relative_path may then be more meaningful in the long run for finding data (could be something along the lines of <name>_<version>_<date>).
The text was updated successfully, but these errors were encountered:
I'm more or less convinced, except in case the data are not to be copied (old_location not used), relative_path must still be mandatory (and in that case name could still be derived as it currently is, but maybe it's better to require name in any case to keep the interface from being too confusing).
No, never mind that last part: there doesn't need to be any special treatment for name when old_location is None. But relative_path should be required.
Concerning overwritable - I can image two ways someone might want to use it:
while debugging keep creating a dataset with identical name, version, version suffix (if any) and relative path until you're satisfied with it. But we've made this impossible. It conflicts with uniqueness of (owner_type, owner, name, version, version_suffix) combined with our practice of always making a new db entry, even when the old dataset (file(s)) is overwritten.
while debugging keep creating a dataset with identical name and relative path, but incrementing version.
If we recommend specifying name, and not necessarily relative_path, that's what people will be inclined to do, especially since it's usually less effort to think up just a name. This is even more true for people doing development, who are the ones most likely to want to overwrite and also the most likely to want to specify a minimum of parameters.
Regardless of what other changes we make or don't make, I think we should not allow overwriting a dataset with a different name.
We could handle 2. as follows: If relative_path is not specified, we could look for registered datasets whose version string matches except for patch number, or maybe patch number and version suffix. Find the one with greatest patch number which is overwritable and use its relative_path for the new dataset. (Or maybe only use it if the dataset with the largest patch number is overwritable.) Otherwise generate relative_path as usual from name, version and version suffix.
Currently
relative_path
is mandatory, and aname
is not (it's automatically generated from therelative_path
if not provided.It is potentially more user friendly the other way round, where many users may not care about the data structure within the registry, i.e., they don't care about the
relative_path
. I think often people will just choose any relative path as they have to enter something.Propose changing the
name
to be mandatory, and therelative_path
is automatically generated (if not provided). The automatedrelative_path
may then be more meaningful in the long run for finding data (could be something along the lines of<name>_<version>_<date>
).The text was updated successfully, but these errors were encountered: