Add option to list glacierefied folders #30

chuwy · 2017-06-08T12:27:29Z

In list_runids function we explicitly skip run ids that are archived on AWS Glacier, which means they will never appear in run manifest. I believe this is wrong solution as customer can restore particular folder without intention to reprocess it (let's say with PySpark job).

I propose to:

Add option list_runids(include_archived=False) to list folders archived to AWS Glacier
Make list_runids()return objects of RunId class (rather than plain string) that can hold information whether folder is archived
Add third possible state to Add state to run manifests #29 to mark RunId processing was explicitly cancelled. Something like CancelledAt.

This should make run manifests feature able to take full control over data processing.

TODO: Think what to do with another storage classes. Currently we list folders that have only STANDARD class.

The text was updated successfully, but these errors were encountered:

alexanderdean · 2017-06-08T16:55:10Z

I think this makes sense. I would probably call the state IgnoredAt, rather than CancelledAt?

chuwy · 2017-06-08T16:57:44Z

Agree about IgnoredAt.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add option to list glacierefied folders #30

Add option to list glacierefied folders #30

chuwy commented Jun 8, 2017 •

edited

Loading

alexanderdean commented Jun 8, 2017

chuwy commented Jun 8, 2017 •

edited

Loading

Add option to list glacierefied folders #30

Add option to list glacierefied folders #30

Comments

chuwy commented Jun 8, 2017 • edited Loading

alexanderdean commented Jun 8, 2017

chuwy commented Jun 8, 2017 • edited Loading

chuwy commented Jun 8, 2017 •

edited

Loading

chuwy commented Jun 8, 2017 •

edited

Loading