Save 75% of platform message size 🤦🏻‍♂️ #80

zubairov · 2019-02-08T19:41:54Z

The code here returns a string containing base64-encoded data. Base64 output is roughly 33% larger than the binary data it encodes, so the raw bytes are only about 75% of the encoded size. The publish method that sends data to the queue accepts a binary Buffer, so it makes little sense to send a base64 string over it.
BTW, the publish method copies the data, so the data is copied 3 times: here, here when a new Buffer is created, and here when the data is published to the queue.
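
For a sense of scale, base64 emits 4 ASCII characters for every 3 input bytes. The following is a minimal sketch in Node.js that measures this; the 30 000-byte random payload is an arbitrary stand-in for an encrypted message, not a value taken from sailor.

```javascript
const crypto = require('crypto');

// Arbitrary 30 000-byte payload standing in for an encrypted message (illustrative only).
const binary = crypto.randomBytes(30000);
// The same payload as it would travel when sent as a base64 string.
const asBase64 = Buffer.from(binary.toString('base64'), 'utf8');

// Growth of the base64 form relative to the raw bytes.
const overhead = asBase64.length / binary.length - 1;
// Share of the message size saved by sending raw bytes instead of base64.
const saving = 1 - binary.length / asBase64.length;

console.log(binary.length, asBase64.length); // 30000 40000
console.log((overhead * 100).toFixed(1) + '% larger', (saving * 100).toFixed(1) + '% saved');
```

For this payload the base64 form is about a third larger, so switching to raw bytes trims about a quarter off the message size.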

The main problem when fixing this is maintaining backwards compatibility.

jhorbulyk · 2019-02-11T07:25:32Z

> The main problem when fixing this is maintaining backwards compatibility.

All sailor versions need to be updated at the same time. :(

ghaiklor · 2019-02-13T12:01:21Z

@zubairov @jhorbulyk either I did not understand the problem, or we can make it backward compatible. It is a little bit stinky, but it should work.

So, if you want to be able to exchange both base64 and Buffer messages, you can detect the format of the message you received.

Pseudo-code

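const crypto = require('crypto');

// options.algorithm, options.key and options.iv are placeholders for sailor's real cipher settings.
function decryptIV(encData, options) {
    const decipher = crypto.createDecipheriv(options.algorithm, options.key, options.iv);
    // A string that survives a base64 decode/encode round trip is treated as base64;
    // anything else (for example a Buffer) is passed to the decipher as raw bytes.
    const isBase64 = typeof encData === 'string'
        && Buffer.from(encData, 'base64').toString('base64') === encData;
    const input = isBase64 ? Buffer.from(encData, 'base64') : encData;

    return decipher.update(input, undefined, 'utf-8') + decipher.final('utf-8');
}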

The idea behind this trick is simple:

You get a message from RabbitMQ (it can be either base64 or a Buffer; you don't know for sure).
You decode the message as base64 and encode it back to base64. If the re-encoded data equals the original message, we treat it as base64.
Otherwise, the message was not base64: it cannot survive the decode/encode round trip, so it will not be equal to the source message. In that case, we assume that the message was sent in Buffer format.
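
The round-trip check can be tried in isolation. This is a minimal sketch; `looksLikeBase64` is a hypothetical helper name, not something that exists in sailor. The last call shows a limitation: raw bytes that happen to form valid base64 text are misclassified, so the check is a heuristic rather than a guarantee.

```javascript
// Hypothetical helper: guesses whether a received payload is base64 text
// by decoding it as base64 and re-encoding it.
function looksLikeBase64(payload) {
    // latin1 maps every byte to exactly one character, so no bytes are lost here.
    const text = Buffer.isBuffer(payload) ? payload.toString('latin1') : String(payload);
    return text.length > 0 && Buffer.from(text, 'base64').toString('base64') === text;
}

console.log(looksLikeBase64('aGVsbG8='));                       // true: canonical base64 of "hello"
console.log(looksLikeBase64(Buffer.from([0xff, 0x00, 0x10]))); // false: bytes outside the base64 alphabet
console.log(looksLikeBase64(Buffer.from('abcd')));             // true: raw bytes that happen to look like base64
```

For large encrypted payloads such a collision is unlikely, but it is not impossible.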

Now, regarding compatibility:

New sailor versions would use Buffer format in encrypt, so all their messages will be sent in Buffer format.
In addition, the decrypt method will have a mechanism to determine the format of the message and handle it appropriately (as described above).

Old sailor versions would use base64 format in encrypt, and decrypt would use base64 as well.

Another case is when a message is sent from an old sailor to a new one:

The old encrypt method will send base64 format, but since the new sailor is capable of determining that it is base64, it will just decrypt it as before. Otherwise, if the message was sent from a new sailor, it will treat it as a Buffer.

I hope I understood the problem you are describing.

zubairov · 2019-02-13T12:37:32Z

@ghaiklor OK, what happens when a "new" version sends a message in Buffer format to an "old" sailor?

ghaiklor · 2019-02-13T12:49:19Z

@zubairov dope, I didn't think about this one, shame on me 😆

We could ignore this issue for now and just increase the limits for pods where sailor is running, but... it is a short-term solution.

zubairov · 2019-02-13T13:11:09Z

It's not really a pod-related issue but a RabbitMQ-related one :) But yes, we need to build message versioning support into sailor ASAP to make sure we are protected from such issues in the future.
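
One possible shape for such versioning, sketched purely as an illustration: the publisher tags each message with a format marker in the message headers, and the consumer dispatches on it, treating a missing marker as the legacy base64 format. The header name `x-payload-format` and the helpers `pack` and `unpack` are hypothetical and are not part of sailor or of AMQP.

```javascript
// Hypothetical header name; not part of sailor or the AMQP specification.
const FORMAT_HEADER = 'x-payload-format';

// Wrap encrypted bytes for publishing in either the binary or the legacy format.
function pack(encryptedBytes, format) {
    if (format === 'binary') {
        return { headers: { [FORMAT_HEADER]: 'binary' }, content: encryptedBytes };
    }
    // Legacy format: base64 text and no marker header, as older consumers expect.
    return { headers: {}, content: Buffer.from(encryptedBytes.toString('base64')) };
}

// Recover the encrypted bytes, treating a missing marker as legacy base64.
function unpack(message) {
    const format = message.headers[FORMAT_HEADER] || 'base64';
    if (format === 'binary') return message.content;
    if (format === 'base64') return Buffer.from(message.content.toString(), 'base64');
    throw new Error('Unsupported payload format: ' + format);
}

const bytes = Buffer.from([0xde, 0xad, 0xbe, 0xef]);
console.log(unpack(pack(bytes, 'binary')).equals(bytes)); // true
console.log(unpack(pack(bytes, 'base64')).equals(bytes)); // true
```

A marker like this only helps once every consumer understands it; until then a publisher would still have to send the legacy format to older sailors.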

ghaiklor · 2019-02-13T13:15:40Z

RabbitMQ has no issues with handling big messages. What do you mean by that?

zubairov · 2019-02-13T13:23:37Z

Yes, correct. My statement should have been: we produce unnecessary load on RabbitMQ due to sailor implementation details.

ghaiklor · 2019-02-13T13:36:41Z

I'm going to think about other possible solutions for the problem with huge messages, but for now we have the following facts:

Most of the steps have been killed because of Out Of Memory (OOM).
It seems the OOM is produced by sailor due to its dirty implementation (do we have the same problem with the Java sailor?).
It has nothing to do with RabbitMQ, since AMQP runs over TCP and all the "split-send" work happens under the hood.

For now, I do not see any way to solve it without touching sailor (except increasing memory limits).

zubairov · 2019-02-13T14:05:02Z

@ghaiklor I don't think this change is the solution for huge messages; it is only a minor efficiency gain, nothing more.

jhorbulyk mentioned this issue Apr 17, 2019

Add the ability to transform and split the results of the HTTP body elasticio/rest-api-component#68

Closed

jhorbulyk mentioned this issue Dec 3, 2019

Add the ability to transform & split results once recieved openintegrationhub/soap-component#8

Open
