CUMULUS-3757: Update granule upsert logic to allow updating collection info #3872

Open
wants to merge 52 commits into
base: master

Conversation

@Nnaga1 Nnaga1 (Contributor) commented Dec 2, 2024

Summary: Summary of changes

Addresses CUMULUS-XX: Develop amazing new feature

Changes

  • Detailed list or prose of changes
  • ...

PR Checklist

  • Update CHANGELOG
  • Unit tests
  • Ad-hoc testing - Deploy changes and test manually
  • Integration tests

@etcart etcart (Contributor) left a comment:

Appreciate the update. I think it's good to go, but one nit (remove a commented-out line of code).
Good to have partial state tests and parallelize as much as possible.

@Jkovarik Jkovarik (Member) commented:

Taking a quick look at this this morning.

```
    files: filesTable,
  } = TableNames;
  await Promise.all(granules.map(async (granule) => {
    const pgGranule = await translateApiGranuleToPostgresGranule({
```
@Jkovarik Jkovarik (Member) commented Dec 12, 2024:

Theory/Nit: Should the DB package be taking API granules as input for anything other than translation generally? I see we're doing it for fields in other methods, so I'm not resting on package dogma, but seeing 'take a set of API granules and translate' in the method makes me wonder if we should be offloading that concurrency to the calling method and just take a set of updates to incoming Knex objects.

@Nnaga1 Nnaga1 (Contributor, Author) replied:

Not really sure; that does sound like it would make it better. Does that entail passing the translation method into the function, changes-wise?

@Nnaga1 Nnaga1 (Contributor, Author) replied:

Ohhhhhhhhhhhh, you're saying instead of sending API granules to the function, send them already translated to PG. OK, that doesn't seem like any more work, except that Ethan's task would need to do that 🤔
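A minimal sketch of that shape, with the caller doing the translation and a hypothetical PG-record variant of the helper (import paths and names are assumptions, not the actual Cumulus API):

```ts
import { Knex } from 'knex';
// Assumed import paths; the real Cumulus packages may differ slightly.
import { translateApiGranuleToPostgresGranule, PostgresGranule } from '@cumulus/db';
import { ApiGranuleRecord } from '@cumulus/types';

// Hypothetical DB-package helper that takes already-translated PG records.
declare const updateGranulesAndFiles: (
  knex: Knex,
  granules: PostgresGranule[],
) => Promise<void>;

// Caller side (e.g. the move-collection task): translate up front,
// then hand the DB package only Postgres-shaped records.
const moveGranulesToCollection = async (
  knex: Knex,
  apiGranules: ApiGranuleRecord[],
): Promise<void> => {
  const pgGranules = await Promise.all(
    apiGranules.map((granule) => translateApiGranuleToPostgresGranule({
      dynamoRecord: granule,
      knexOrTransaction: knex,
    })),
  );
  await updateGranulesAndFiles(knex, pgGranules);
};
```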

```
 * @param {Array<ApiGranule>} [granules] - updated ApiGranule records
 * @returns {Promise<void>}
 */
export const updateGranuleAndFiles = async (
```
@Jkovarik Jkovarik (Member) commented:

This takes an input array of multiple granules, so it should probably be updateGranulesAndFiles.

```
 */
export const updateGranuleAndFiles = async (
  knexOrTransaction: Knex | Knex.Transaction,
  granules: Array<ApiGranule>
```
@Jkovarik Jkovarik (Member) commented:

Consider ApiGranuleRecord (these shouldn't be new records, right?)

@Nnaga1 Nnaga1 (Contributor, Author) replied Dec 12, 2024:

What's the difference? They're not new records (they're the same granules with the updated files/collectionId based on the move), so I assumed ApiGranule was fine. I'll change it, but just wondering.

@Jkovarik Jkovarik (Member) left a comment:

Hey @Nnaga1 - adding a review flag based on a couple of initial comments, working on a full review.

```
@@ -354,3 +357,39 @@ export const getGranulesByGranuleId = async (
    .where({ granule_id: granuleId });
  return records;
};

/**
 * Change a granule's PG record and its files' PG records based on a collection move
```
@Jkovarik Jkovarik (Member) commented Dec 12, 2024:

This method looks like its purpose is to create a bulk granule update method that circumvents the existing granule write logic. That probably isn't specific to collections.

If we are intending that it be specific to moving a collection... I'm assuming the need to re-write the entire object is due to the intent to move files as well, but not try to parallelize writeGranuleFromApi, because the business logic should be irrelevant to this use case?

@Nnaga1 Nnaga1 (Contributor, Author) replied:

"This method looks like its purpose is to create a bulk granule update method that circumvents the existing granule write logic": that is the intention of this method, to avoid changing core API functionality and do this instead.

"If we are intending that it be specific to moving a collection": this part I'm not sure about. I intended it to be used for Ethan's task, but if there are potential applications elsewhere then I'd assume it can be re-used.

```
      dynamoRecord: granule,
      knexOrTransaction,
    });
    await knexOrTransaction(granulesTable).where('granule_id', '=', pgGranule.granule_id).update(
```
@Jkovarik Jkovarik (Member) commented:

I think we need to enforce a transaction here, not just have it be possible it's a transaction object. We don't want partial granule/file record updates.

@Nnaga1 Nnaga1 (Contributor, Author) replied:

"I think we need to enforce a transaction here": can you point me to an example of this in the code I can refer to? Not too familiar with it beyond just doing a straight-up `await knex(....)....`

@Jkovarik Jkovarik (Member) replied:

To be clear, I'm suggesting that we commit all granule/file updates together (and roll back on failure) in a transaction, instead of making updates serially in a way that doesn't roll back if any of them fail.
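A rough sketch of what that could look like with a plain `knex.transaction` wrapping the whole batch (table handling and translation mirror the diff above; treat the details as illustrative):

```ts
export const updateGranulesAndFiles = async (
  knex: Knex,
  granules: Array<ApiGranuleRecord>
): Promise<void> => {
  const { granules: granulesTable, files: filesTable } = TableNames;
  // One transaction for the whole batch: if any granule or file update throws,
  // everything rolls back and no partial state is left behind.
  await knex.transaction(async (trx) => {
    await Promise.all(granules.map(async (granule) => {
      const pgGranule = await translateApiGranuleToPostgresGranule({
        dynamoRecord: granule,
        knexOrTransaction: trx,
      });
      await trx(granulesTable)
        .where({ granule_id: pgGranule.granule_id })
        .update(pgGranule);
      // ...corresponding file updates go against trx(filesTable) here...
    }));
  });
};
```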

```
    granules: granulesTable,
    files: filesTable,
  } = TableNames;
  await Promise.all(granules.map(async (granule) => {
```
@Jkovarik Jkovarik (Member) commented Dec 12, 2024:

Question: If the map of granules is 10k, is it an acceptable usage scenario that, when it fails on granule 1,001, 1k granules were moved and 9k failed? Or do we want it to move 9,999 of them and fail only the one with a metadata/connection/whatever issue? Apologies if that should be obvious, I may be lacking context.

Edit: reviewed the tests. The re-run/idempotent intent is probably fine here, but a doc annotation for @error is probably warranted in the header.

@Nnaga1 Nnaga1 (Contributor, Author) replied Dec 12, 2024:

I'm not exactly sure on this; I'd assume this one: "or do we want it to move 9,999 of them and fail the one with a metadata/connection/whatever issue?" I think the failure of writes from this function is, or should be, dealt with task-side, but if there's something I can change here then I can do it.

```
  }
});

test.serial('updateGranuleAndFiles successfully updates a complete list of granules, 1/2 of which have already been moved', async (t) => {
```
@Jkovarik Jkovarik (Member) commented:

I think we need a more complete set of units testing all granule updates, given that the method isn't written to update only files and the collection ID, but all granule fields.

If that's not intentional and this method is intended to be limited to updating file locations and the collectionID, we should probably make the method more defensive somehow.

@Nnaga1 Nnaga1 (Contributor, Author) replied:

So Ethan's task would send the target_granules to me, which are the granule records post collection move, i.e. just complete granule records, so I can do this: "a more complete set of units testing all granule updates". Would it just enforce that the other fields are... the same as they were before?

@Jkovarik Jkovarik (Member) replied:

@Nnaga1 to some degree that depends on what the intent is. As written, I don't believe this method enforces the ticket context (only update collection and file paths).

If the intent is to provide a method that takes an array of granule objects and updates it using only translation conventions, ignoring API write business logic, then we should test that broader case.

If the intent is that this helper only updates those specific fields, we should update it to do only that and test accordingly.

This is important, as this method is an exposed package method and creates a user contract.
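If the second intent wins, one defensive shape is to let only the collection and file columns ever reach the UPDATE, e.g. (a sketch; the specific column names are assumptions about the schema):

```ts
// Restrict the granule UPDATE to the fields the collection move actually changes.
const pgGranule = await translateApiGranuleToPostgresGranule({
  dynamoRecord: granule,
  knexOrTransaction: trx,
});
await trx(granulesTable)
  .where({ granule_id: pgGranule.granule_id })
  .update({
    collection_cumulus_id: pgGranule.collection_cumulus_id,
    updated_at: pgGranule.updated_at,
  });
// File rows get the same treatment: update only the bucket/key/path columns, nothing else.
```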
