contrib: change default value of epee::serialization to fix limitation #9433

Open: wants to merge 1 commit into master from dev/0xfffc/fix-serialization-limit

Conversation

0xFFFC0000
Collaborator

Based on different benchmarks and measurements we concluded that 65536 would be the ideal value for this limit.

Co-authored-by: Boog900 <[email protected]>

0xFFFC0000 force-pushed the dev/0xfffc/fix-serialization-limit branch from 7d1911a to 13c5aa5 on August 14, 2024 06:52
@0xFFFC0000
Collaborator Author

Here are a few benchmarks we ran to measure the timings used to pick these limits.

Keep in mind there is a 100 MB network packet size limit, so we don't need to support anything beyond that.

[benchmark screenshots]

@0xFFFC0000
Collaborator Author

The code for the benchmark is available here [1], in case anyone wants to take a look at it.

  1. Adding benchmark for (de)serialization 0xFFFC0000/monero#30

@selsta
Collaborator

selsta commented Aug 14, 2024

What is the RAM usage? We added these limits to avoid DoS packets using too much RAM.

@0xFFFC0000
Collaborator Author

@selsta RAM usage was negligible. On my 64 GB PC I didn't see usage above 5% while running these benchmarks, which is why I didn't measure RAM usage. But I will report the numbers.

@Boog900
Contributor

Boog900 commented Aug 14, 2024

Just to add some more details on how these limits were chosen: the string limit was chosen to allow us to roughly reach the 100 MB packet size limit with average-size txs.

The object limit was then chosen to allow a GetObjects response to hold the same number of pruned txs as normal txs, and the fields limit was chosen the same way.

However, looking at the pruned GetObjects response, both fields of the pruned tx objects are strings, which means that to truly reach the same limit we would need to increase the string limit to 130k.

struct tx_blob_entry
{
  blobdata blob;
  crypto::hash prunable_hash;

  BEGIN_KV_SERIALIZE_MAP()
    KV_SERIALIZE(blob)
    KV_SERIALIZE_VAL_POD_AS_BLOB(prunable_hash)
  END_KV_SERIALIZE_MAP()
};

A 130k string limit may be too high, in which case it might be best to accept that when syncing pruned blocks we are not able to pack as many txs as when syncing full blocks. If we were to accept this, we could decrease the object limit to 32k and the field limit to 64k.
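
As a back-of-envelope check of those numbers (a sketch only; the ~1.5 kB average transaction blob is an assumption, not a figure from this thread):

// Back-of-envelope check of the limits discussed above (illustrative only, not monero code).
// The ~1.5 kB average transaction size is an assumption, not a number from this PR.
#include <cstdint>
#include <iostream>

int main()
{
    constexpr std::uint64_t packet_limit = 100ull * 1024 * 1024; // ~100 MB levin packet cap
    constexpr std::uint64_t avg_tx_size  = 1536;                 // assumed average tx blob size (bytes)
    constexpr std::uint64_t string_limit = 65536;                // proposed string limit

    // 65536 average-size tx blobs roughly fill the packet cap (~96 MB):
    std::cout << "65536 avg txs ~= " << (string_limit * avg_tx_size) / (1024 * 1024) << " MB\n";

    // A pruned tx_blob_entry serializes two strings (blob + prunable_hash), so packing
    // the same number of pruned txs needs roughly twice the string budget (~130k):
    std::cout << "strings for 65536 pruned txs: " << string_limit * 2 << "\n";
    return 0;
}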

@iamamyth

The reported time units in the benchmark don't make sense: For example, in the final tiny_4_string_test, the benchmark reports 13461399ms for "Time" (which I assume means wall clock time) and 3396ms for CPU. Does pausing and resuming the timer break wall clock timing, or are the units here wrong, or something else?

@0xFFFC0000
Collaborator Author

0xFFFC0000 commented Aug 14, 2024

The reported time units in the benchmark don't make sense: For example, in the final tiny_4_string_test, the benchmark reports 13461399ms for "Time" (which I assume means wall clock time) and 3396ms for CPU. Does pausing and resuming the timer break wall clock timing, or are the units here wrong, or something else?

I am pausing and resuming the timer while creating the objects, so only the epee::(de)serialization operation is measured, if that's what you mean.

You can build the benchmark_serialization test from here [1].

  1. Adding benchmark for (de)serialization 0xFFFC0000/monero#30

@iamamyth

Setting aside my prior comment, assuming the CPU time values are accurate, the required time in many of these scenarios seems too high: in the faster of the two runs (the experimental setup lacks details, such as thread count, affinity, etc., to determine why there's such a huge gap between them), the maximum time is ~23 s. The test doesn't split serialization and deserialization, but if one assumes an even split for each operation, that's ~11 s spent just to deserialize the message, i.e. ~10% of the expected time between blocks. Aside from implying a lot of net splits, if an attacker can jam up a CPU for 11 s at will, that's a big problem.

@iamamyth

I am pausing and resuming the timer when I am creating the objects. Only measuring the epee::(de)serialization operation, if that's what you mean.

What I mean is the numbers plainly make no sense: Wall time (13461399ms) shouldn't exceed CPU time (3396ms) by a factor of ~4000 for a CPU-bound operation.

@0xFFFC0000
Collaborator Author

0xFFFC0000 commented Aug 14, 2024

I am pausing and resuming the timer when I am creating the objects. Only measuring the epee::(de)serialization operation, if that's what you mean.

What I mean is the numbers plainly make no sense: Wall time (13461399ms) shouldn't exceed CPU time (3396ms) by a factor of ~4000 for a CPU-bound operation.

A few points about this:

  1. That was an unrelated bug: I wasn't using the Google Benchmark for-loop construct, and pausing and resuming the timer does not work correctly outside of that loop (see the sketch after this list).
  2. I only wanted the CPU runtime.
  3. Just to make sure there is no dispute here, I updated the benchmark and re-ran it; you can find new screenshots below. In most cases the numbers are even better, and as you can see they are sometimes roughly the same (CPU time is what matters for us). So there is no problem with increasing the default_levin_limits.
  4. You can compile and run it on your machine. Configure it with CMake the same way as the default monero project; instead of building monerod, just run ninja benchmark_serialization, or make benchmark_serialization if you are using make. (Adding benchmark for (de)serialization 0xFFFC0000/monero#30)
  5. It is quite easy to produce separate numbers for serialization and deserialization, but we don't care about each operation's individual performance here. We only care that the change does not open up any opportunities for a DoS attack.
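
For context on point 1, here is a minimal sketch of the Google Benchmark pause/resume idiom being referred to (this is not the actual benchmark_serialization code; the payload below is a stand-in):

// Minimal sketch of the Google Benchmark pause/resume idiom (illustrative only).
#include <benchmark/benchmark.h>
#include <cstddef>
#include <string>
#include <vector>

static void BM_roundtrip(benchmark::State& state)
{
    for (auto _ : state)                 // pause/resume only behaves correctly inside this loop
    {
        state.PauseTiming();             // object construction is excluded from the measurement
        std::vector<std::string> objects(65536, std::string(32, 'x'));
        state.ResumeTiming();

        std::size_t total = 0;           // stand-in for the epee (de)serialization round-trip
        for (const auto& s : objects)
            total += s.size();
        benchmark::DoNotOptimize(total);
    }
}
BENCHMARK(BM_roundtrip);
BENCHMARK_MAIN();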

-Ofast default compilation:

[benchmark screenshots]

Hope this helps.

@0xFFFC0000
Collaborator Author

Added memory usage to the benchmarks too, in case there is any concern about it:

[benchmark screenshots with memory usage]

@iamamyth

The latest numbers look much more reasonable, thanks. I agree with the overall methodology. The high watermark (~17.8 s) is still a bit concerning, but not bad enough that I consider it within the scope of this PR (ultimately, it should be lowered a lot by other work to improve serialization).

@rbrunner7
Contributor

@selsta asked a while back here:

What is the RAM usage? We added these limits to avoid DoS packets using too much RAM.

Not sure whether this has already been mentioned and explained clearly enough, but the limits that this PR wants to raise were introduced as a reaction to an attack on the Monero network. As far as I understand, daemons were crashed using specially crafted packets that, while themselves reasonably small, blew up to an enormous size in RAM when deserialized. You can read a bit about the attack here on Reddit.

The PR that introduced these limits at the start of 2021 is [#7286].

It seems that the knowledge of exactly how that attack worked is lost, unfortunately. Which I, and probably @selsta as well, find mildly worrying: under these circumstances, how can we be reasonably sure we do not re-enable such an attack with larger limits?

We are probably safe, I would say. But on the other hand, I don't think like an attacker ...

@vtnerd
Contributor

vtnerd commented Aug 21, 2024

It seems that knowledge exactly how that attack worked is lost, unfortunately

The details are not lost. #7999 and #8867 both contain tests that simulate the attack.

The primary mechanism for the attack was serializing 0-length strings or 0-length objects inside of an array. Each element was ~1 byte on the wire compared to ~128 bytes in the DOM. That 1:128 ratio meant a maximum-size packet could have an unpacked size of ~12.5 GiB. IIRC, the first attempt to stop it failed because it enforced limits per array instead of globally.
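
A quick sketch of the arithmetic behind that ratio (illustrative only; the ~1 byte and ~128 byte figures are the ones quoted above):

// Rough arithmetic behind the 1:128 expansion described above (illustrative only).
#include <cstdint>
#include <iostream>

int main()
{
    constexpr std::uint64_t packet_bytes  = 100ull * 1024 * 1024; // ~100 MB packet cap
    constexpr std::uint64_t wire_per_elem = 1;   // ~1 byte per empty string/object on the wire
    constexpr std::uint64_t dom_per_elem  = 128; // ~128 bytes per element once in the DOM

    const std::uint64_t elements = packet_bytes / wire_per_elem;
    const double unpacked_gib = double(elements) * dom_per_elem / (1024.0 * 1024.0 * 1024.0);
    std::cout << "unpacked size ~= " << unpacked_gib << " GiB\n"; // ~12.5 GiB
    return 0;
}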

@vtnerd
Contributor

vtnerd commented Aug 21, 2024

I also forgot fields. There, the unpacked size is at least 160 bytes per entry (not including std::map overhead), and the on-wire length varies with the string length. But the ratio is definitely pretty large there too.

@rbrunner7
Contributor

Thanks for the interesting details, @vtnerd! They make sense, and basically make my worries go away.
