Fix missing column separator in the file #5

maxpatiiuk · 2023-03-23T17:19:14Z

No description provided.

grantfitzsimmons

This could have introduced an extra tab character:

Fixed the columns being off bug by refactoring the code (original implementation used string concatenation - bad idea - I belive I used it because I had to save on ram - needed to fit into a 2GB ram limit on that VM - funny thing is, ChatGPT says this is actually eating more ram - well, we didn't have ChatGPT back then) > When you store the data as a nested array, each element in the outer array is a reference to a separate inner array, and each inner array contains the strings. This means that when you append a new string to an inner array, you are only modifying that specific inner array, and not creating a new string object or copying the entire array. > > On the other hand, if you store the data as a single string, every time you append a new string, you would need to create a new string object with the concatenated values of the existing string and the new string. This can consume a lot of memory, especially if the strings are large. That's why programmers should avoid premature optimization, should test the impact of their changes (or stop worrying about changes if there is no noticeable impact) and read how the code is executed on the low-level to make correct tradeoffs

Fixed duplicate "Skipping node with wrong rank order" messages Made "Skipping node with wrong rank order" include more information

maxpatiiuk · 2023-03-25T03:31:16Z

Fixed duplicate "Skipping node with wrong rank order" messages
Made "Skipping node with wrong rank order" include more information
Fixed the columns being off bug by refactoring the code (original implementation used string concatenation - bad idea - I belive I used it because I had to save on ram - needed to fit into a 2GB ram limit on that VM - funny thing is, ChatGPT says this is actually eating more ram - well, we didn't have ChatGPT back then)

When you store the data as a nested array, each element in the outer array is a reference to a separate inner array, and each inner array contains the strings. This means that when you append a new string to an inner array, you are only modifying that specific inner array, and not creating a new string object or copying the entire array.

On the other hand, if you store the data as a single string, every time you append a new string, you would need to create a new string object with the concatenated values of the existing string and the new string. This can consume a lot of memory, especially if the strings are large.

That's why programmers should avoid premature optimization, should test the impact of their changes (or stop worrying about changes if there is no noticeable impact) and read how the code is executed on the low-level to make correct tradeoffs

--

To test:
Make sure this is not present in the file when selected Mollusca -> Bivalvia (notice the glued URL and incertae sedis):

https://www.marinespecies.org/aphia.php?p=taxdetails&id=105incertae sedis

Make sure columns are not off by one:

maxpatiiuk · 2023-04-24T14:56:29Z

If this code works for worms, I can do the same modification for catalog of line

Same as #5, but for CoL instead of WoRMS

Fix missing column separator in the file

1b1bcfa

maxpatiiuk requested a review from grantfitzsimmons March 23, 2023 17:19

grantfitzsimmons requested changes Mar 23, 2023

View reviewed changes

maxpatiiuk added 2 commits March 24, 2023 22:30

Improve wrong order node detection

ea4df04

Fixed duplicate "Skipping node with wrong rank order" messages Made "Skipping node with wrong rank order" include more information

maxpatiiuk force-pushed the worms-separator-fix branch from 621a816 to ea4df04 Compare March 25, 2023 03:30

maxpatiiuk requested a review from grantfitzsimmons March 25, 2023 03:31

maxpatiiuk added a commit that referenced this pull request May 2, 2023

Fix column's being off by one

736cb10

Same as #5, but for CoL instead of WoRMS

maxpatiiuk mentioned this pull request May 2, 2023

Fix front-end skipping "Incertae sedis" leaf nodes #11

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix missing column separator in the file #5

Fix missing column separator in the file #5

maxpatiiuk commented Mar 23, 2023

grantfitzsimmons left a comment

maxpatiiuk commented Mar 25, 2023

maxpatiiuk commented Apr 24, 2023

Fix missing column separator in the file #5

Are you sure you want to change the base?

Fix missing column separator in the file #5

Conversation

maxpatiiuk commented Mar 23, 2023

grantfitzsimmons left a comment

Choose a reason for hiding this comment

maxpatiiuk commented Mar 25, 2023

maxpatiiuk commented Apr 24, 2023