Replies: 2 comments 6 replies
-
I need to migrate some legacy stats in the next month or so, so I'm quite interested in the answers here. I had been assuming that I would need to create raw events in the invenio indices. But if it's viable to simply migrate the aggregated stats, and if there aren't negative side effects, I might prefer that approach. Has anyone actually tried each strategy? Are there potential pitfalls involved in just migrating the aggregations? |
Beta Was this translation helpful? Give feedback.
-
On the question of backups: we're using AWS opensearch service, so we're having it generate daily backup snapshots of the indices. That solves our backup needs for the time being. But we also want to have more permanent off-site (i.e., not on AWS data centres) backups made periodically. It's not a big deal to generate those with something like a cron job. But there'd be added convenience if invenio-files-rest were to generate file-system/s3 dumps on a configurable schedule. This is one of those cases where a custom solution per-deployment isn't hard. But everything takes developer hours, and we (like many implementations) are always working with too few people and too much work. So "convenience" isn't necessarily a trivial issue for implementers. (On the other hand, a built-in solution costs Invenio developer-hours that might be better spent elsewhere.) |
Beta Was this translation helpful? Give feedback.
-
Stats backup
The view/download events are stored in monthly indices in the search cluster, and there is no backup out-of-the-box.
Possible solutions:
elasticdump
and export all events to a local storageMore complex alternatives:
invenio-files-rest
) to regularly export events and add it to a (yet another) bucketMigration strategies
It would be useful to provide documentation, guidance or best practices on how to migrate legacy statistics.
Beta Was this translation helpful? Give feedback.
All reactions