
fix_: limit the maximum number of message hashes by query hash #5688

Closed · wants to merge 2 commits

Conversation

@richard-ramos (Member)

Team, do you think this could be added to the release or is it too risky at this point?

Without this change, when missing-message verification is enabled there is no limit on the number of missing message hashes requested per query, which can increase the load on the store nodes. With this change, the number of message hashes per query is capped at 50.

The equivalent PR for the develop branch will be created once waku-org/go-waku#1190 is merged.

@status-im-auto (Member) commented Aug 9, 2024

Jenkins Builds

|    | Commit  | #  | Finished (UTC)      | Duration | Platform  | Result |
|----|---------|----|---------------------|----------|-----------|--------|
| ✔️ | 1915ab9 | #1 | 2024-08-09 19:51:20 | ~2 min   | tests-rpc | 📄log  |
| ✔️ | 1915ab9 | #1 | 2024-08-09 19:52:48 | ~4 min   | linux     | 📦zip  |
| ✔️ | 1915ab9 | #1 | 2024-08-09 19:53:09 | ~4 min   | ios       | 📦zip  |
| ✔️ | 1915ab9 | #1 | 2024-08-09 19:54:04 | ~5 min   | android   | 📦aar  |
| ✔️ | 1915ab9 | #1 | 2024-08-09 20:34:47 | ~46 min  | tests     | 📄log  |
|    | 34112a9 | #2 | 2024-08-12 17:33:13 | ~26 sec  | ios       | 📄log  |
|    | 34112a9 | #2 | 2024-08-12 17:33:26 | ~36 sec  | linux     | 📄log  |
| ✖️ | 34112a9 | #2 | 2024-08-12 17:33:56 | ~1 min   | tests-rpc | 📄log  |
| ✖️ | 34112a9 | #2 | 2024-08-12 17:34:06 | ~1 min   | tests     | 📄log  |
|    | 34112a9 | #2 | 2024-08-12 17:34:11 | ~1 min   | android   | 📄log  |

@chaitanyaprem (Contributor) left a comment:


LGTM

@chaitanyaprem (Contributor)

> Team, do you think this could be added to the release or is it too risky at this point?

I think we can add it to the release if we dogfood the changes and see no issues, especially if it helps improve store node performance.

@richard-ramos (Member, Author)

Sounds good, I'll create a dogfooding PR and hopefully we can do a quick dogfooding session early next week to get this merged ASAP! 🚀

@kaichaosun (Contributor) left a comment:


LGTM

@ilmotta (Contributor) left a comment:


> Team, do you think this could be added to the release or is it too risky at this point?

@richard-ramos, code looks good, but before merging directly into the release branch we can ask the mobile QA team @status-im/mobile-qa to have a look (we just need to quickly create a mobile PR pointing to this branch).

It would be less risky if FetchHistory were covered by tests.

@richard-ramos (Member, Author)

I have created the following test PRs:

@igor-sirotin (Collaborator) left a comment:


But isn't pagination already setting this limit? 🤔
And why is it better to do a few parallel requests (to the same store node) rather than sequential ones?

Review thread on wakuv2/missing_messages.go (outdated, resolved)
@richard-ramos (Member, Author)

> But isn't pagination already setting this limit? 🤔

The query we do kinda looks like this:

```sql
SELECT *
FROM messages
WHERE messageHash IN (msgHash1, msgHash2, ..., msgHashN)
LIMIT 100
```

The pageLimit will limit the results only after the filtering per messageHash is done. I.e., if you pass 300 message hashes in the IN condition, the database will find all of those messages first and only then apply the limit, and it has to repeat that work for each page of results if we pass a cursor. Since messageHash is unique, it's more efficient to pass a smaller hash list and return the whole resultset at once: the database doesn't have to find all the records only to return a subset of them, which also means less load on the DB.
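
As an illustration, here is a minimal Go sketch of the batching idea; the constant `maxHashesPerQuery` and the `chunk` helper are hypothetical names for this example, not the actual status-go API:

```go
package main

// maxHashesPerQuery mirrors the 50-hash cap this PR introduces.
const maxHashesPerQuery = 50

// chunk splits the full list of missing message hashes into slices of
// at most maxHashesPerQuery elements, so every store query carries a
// bounded IN (...) list instead of one arbitrarily large one.
func chunk(hashes []string) [][]string {
	var batches [][]string
	for i := 0; i < len(hashes); i += maxHashesPerQuery {
		end := i + maxHashesPerQuery
		if end > len(hashes) {
			end = len(hashes)
		}
		batches = append(batches, hashes[i:end])
	}
	return batches
}
```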


Regarding sequential vs. parallel: I believe sending smaller requests concurrently is the better approach, as it can significantly reduce the overall time needed to retrieve all the data (a sketch follows the list below). I'm not 100% sure nwaku will really benefit from this change, but some of the benefits of this approach are:

  • Reduced total latency: by sending requests in parallel, you reduce the overall time to receive all responses. This matters particularly when each request has significant latency (e.g., due to network delays or query processing time).
  • Better utilization of resources: nwaku uses chronos, so in theory it should handle multiple concurrent operations efficiently, and there is a connection pool for PostgreSQL. By sending requests in parallel, we should be able to take advantage of the available resources (CPU, network bandwidth, available connections) to process multiple requests simultaneously.
  • Improved user experience: users see faster response times as data is retrieved and processed more quickly.
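
A minimal sketch of dispatching those batches concurrently, reusing the hypothetical `chunk` helper from the sketch above; `queryStore` is a stand-in for whichever store-query function the client actually uses, not a real status-go signature:

```go
package main

import (
	"context"

	"golang.org/x/sync/errgroup"
)

// queryStore stands in for a single store-node query over one batch
// of at most maxHashesPerQuery hashes (hypothetical signature).
func queryStore(ctx context.Context, batch []string) error {
	// ... send the store query and collect results ...
	return nil
}

// fetchMissing fires one query per batch concurrently; the first
// error cancels the remaining in-flight queries via the group context.
func fetchMissing(ctx context.Context, hashes []string) error {
	g, ctx := errgroup.WithContext(ctx)
	for _, batch := range chunk(hashes) {
		batch := batch // per-iteration copy (needed before Go 1.22)
		g.Go(func() error { return queryStore(ctx, batch) })
	}
	return g.Wait()
}
```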

@igor-sirotin (Collaborator)

> The pageLimit will limit the results only after the filtering per messageHash is done.

But... shouldn't we then change the query on the server (nwaku) side, rather than modifying clients so they don't overload servers? 🤔
I mean, just limit the msgHash list before putting it into the query.

@igor-sirotin (Collaborator)

> some of the benefits of this approach are: ...

Ok, I guess it makes sense in this case 👍

Though, again, some of these points sound like we're trying to make clients use servers more efficiently, while we could make servers smarter so they work efficiently even with dumb clients 😄 (e.g., to utilize the PostgreSQL connection pool, the SQL request could be split into multiple requests on the server side).

@richard-ramos (Member, Author)

Closing: we decided not to go with this PR for 2.30, as described in status-im/status-mobile#21021 (comment).
