Make the data import process more explicit #373

dalonsoa · 2024-10-03T13:28:10Z

At the moment, when importing data the user selects a Station and a Format. From the user's perspective, this only selects some file-related settings (like extension and delimiter) and how to deal with the date and time columns. Implicitly, however, the user is also picking all the variables that are related to the chosen format via a Classification object, but there's no way for them to know what these variables are or what columns they are using except by walking through all of the Classifications.

It would make much more sense to have the Classification and the Format related via a ManyToMany field in the Format object, rather than a ForeignKey to Format in the Classification object. This way, when a user opens a format, they will see exactly what variables will be pulled from the data file and from where.
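As a rough sketch of the proposed relationship, here it is with plain Python dataclasses rather than the real Django models. The field names (`variable`, `column`, `extension`, `delimiter`) and the `describe` helper are assumptions for illustration, not the actual model definitions. In Django, the `classifications` attribute would be a `ManyToManyField` on `Format`.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Classification:
    """Maps one variable to the column it is read from (hypothetical fields)."""
    variable: str
    column: int


@dataclass
class Format:
    """File settings plus the explicit list of Classifications it uses."""
    name: str
    extension: str
    delimiter: str
    # Stand-in for a ManyToMany relation: the Format holds the list, and the
    # same Classification can be shared by several Formats.
    classifications: list[Classification] = field(default_factory=list)

    def describe(self) -> list[str]:
        """Return one line per variable that an import would pull."""
        return [f"{c.variable}: column {c.column}" for c in self.classifications]


rain = Classification(variable="precipitation", column=2)
temp = Classification(variable="air_temperature", column=3)
fmt = Format(name="logger_csv", extension=".csv", delimiter=",",
             classifications=[rain, temp])
summary = fmt.describe()
```

With the relation owned by the Format, opening a format is enough to list every variable and its source column, which is the visibility this issue asks for.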

This requires some changes to the models, obviously, and to the code that parses the data file. The main complication, though, comes with the views and templates used to display the Format objects, which will need to be more complicated as they will need to show a list of all the Classifications related to a Format. Nothing that we have not done in the past - see for example the DeviceSpecification model in Liionsden - but nevertheless, not a straightforward process. I think it will be worth the effort from the point of view of the user experience.

dalonsoa · 2024-10-04T06:00:14Z

@ICHydro, @tsmbland, I've opened this issue to discuss/tackle the data ingestion process, which is not really such a good user experience. Any thoughts are most welcome.

ICHydro · 2024-10-13T16:15:55Z

Yes, it took me a bit of time to get familiar with the role of the format object in the import process. From a user perspective, it would probably be most straightforward if the user can set options such as the delimiter symbol, and the meaning of the columns (variable, dimensions etc) directly during the import process, rather than creating a format object first, and then using this object during import.

The value of the latter approach is probably that it is quicker if the user frequently imports the same data format, and perhaps that is also the reason it was implemented that way in the original FONAG system.

It would seem ideal if we can combine both: for example, the interface lets a user set all the settings manually when importing a data file, and also tick a box like "save these import settings for future use". If ticked, a format object would be created and stored for future use. I guess that this would require the same under-the-hood changes to the models that @dalonsoa suggests above?
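A minimal sketch of that combined flow, in plain Python. Everything here is hypothetical: `ImportSettings`, `run_import`, the `save_as` argument and the in-memory `saved_formats` store are illustrative names, not part of the existing code base.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ImportSettings:
    """Settings typed in by the user on the import form (hypothetical)."""
    delimiter: str
    columns: dict[str, int]  # variable name -> column index


# Hypothetical in-memory store standing in for saved format objects.
saved_formats: dict[str, ImportSettings] = {}


def run_import(settings: ImportSettings,
               save_as: Optional[str] = None) -> ImportSettings:
    """Use the given settings once; keep them only if the user asked to."""
    if save_as is not None:
        saved_formats[save_as] = settings
    # Parsing the data file is out of scope for this sketch.
    return settings


one_off = ImportSettings(delimiter=";", columns={"precipitation": 2})
run_import(one_off)                        # box left unticked: nothing stored
run_import(one_off, save_as="logger_csv")  # box ticked: stored for reuse
reused = saved_formats["logger_csv"]
```

The point of the sketch is only that the same settings object can serve both a one-off import and a saved format, so the two workflows do not need separate data structures.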

In any case, I agree that it is useful for the user to see exactly what variables will be pulled from the data file and from where.

dalonsoa · 2024-10-14T07:49:16Z

There are a lot of parameters to set when importing data, especially to declare the columns to import (see the Classification), so presenting all of these to the user in the import form - even if it is just once and then they can save it for future use - can be daunting. Most users, I think, will always be importing the same type of data files, so having to define the right format upfront makes sense.

Let me check how we did things in Liionsden, the other project where we had to face this problem, and that was very neat for the end user (if not under the hood, I don't recall).

dalonsoa · 2024-10-25T07:34:56Z

I think we can move to a more lightweight approach like the one described in #407. Not only is it way simpler to implement, but it is also enough for what it is meant to achieve - make it clear to the user what they are really importing.
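A minimal sketch of what that lightweight approach could look like, assuming the existing ForeignKey from Classification to Format is kept and the detail view simply looks up the related rows. The field names and the `classifications_for` helper are hypothetical. In Django the same lookup would normally go through the reverse accessor of the ForeignKey (`classification_set` by default, assuming no `related_name` is set).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Classification:
    format_name: str  # stand-in for the existing ForeignKey to Format
    variable: str
    column: int


def classifications_for(format_name, all_classifications):
    """Collect the rows a Format detail view would list for one format."""
    return [c for c in all_classifications if c.format_name == format_name]


rows = [
    Classification("logger_csv", "precipitation", 2),
    Classification("logger_csv", "air_temperature", 3),
    Classification("other_format", "wind_speed", 1),
]
shown = classifications_for("logger_csv", rows)
```

No model changes are needed for this: the detail view only gains a read-only list of the variables and columns tied to the format.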

dalonsoa mentioned this issue Oct 3, 2024

Adds edit, create and delete views #364

Merged

dalonsoa mentioned this issue Oct 4, 2024

Bring Association back and use it in new Add and Edit data import forms to limit format choices #90

Closed

dalonsoa added this to the Version 1.1.0 milestone Oct 7, 2024

dalonsoa self-assigned this Oct 7, 2024

dalonsoa added the enhancement New feature or request label Oct 7, 2024

dalonsoa mentioned this issue Oct 21, 2024

Show Classification associated to Format in the Format detail view #407

Closed

dalonsoa closed this as completed Oct 25, 2024
