Long Polling only happening every 25 seconds? #157
What web server are you running?
I was using thin, but I'm open to switching. This is an app that's just for personal consumption.
Try puma, unicorn, or passenger; they're way less hacky.
@SamSaffron you'll not like it, but well, I have to try ;) So, as you know, I'm running my own Docker image for Discourse, not the official tool. As a user in the admin interface, I see this 25-second long-polling delay. I think it is the same bug, but I might be wrong in my analysis. Thanks a lot for your help!
Quick question: what browser are you using? Does the same issue happen in other browsers?
@SamSaffron taking over from @pierreozoux: this happens independently of the browser (tested on Firefox and Chromium).

Each time I make a change in the admin interface I need to wait at least 25 seconds, and even then it can be a bit random. For instance, if I reload without cache, I go back to the initial parameter; reload again and I see my change; reload again and I'm back to the initial parameter.

We are also experiencing strange behavior when we change the logo: sometimes it's the right logo, sometimes it's the Discourse logo. I don't know if that is related.

As Pierre mentioned, we are using our own Docker image. Thanks for your help!
My guess is that you have some nginx buffering, or some other type of buffering, going on.
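If nginx is proxying the long-poll requests, buffering there would produce exactly this kind of delay: chunks are held until a buffer fills or the response completes. A sketch of the directives worth checking (the location path and upstream name are illustrative, not taken from any particular config):

```nginx
# Illustrative location block for proxied message_bus long-poll traffic.
location /message-bus/ {
    proxy_pass http://discourse;  # upstream name is illustrative
    proxy_http_version 1.1;       # required for chunked responses over keep-alive
    proxy_buffering off;          # pass chunks through as they arrive
    proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
}
```

An upstream can also disable buffering per-response by sending the `X-Accel-Buffering: no` header, which is worth checking if the config looks correct.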
@SamSaffron nginx buffering is off. I thought it could come from the haproxy in front of our nginx, but actually the issue is fixed if I run Discourse with unicorn instead of puma.
Wow, that is somewhat odd; it would indicate a bug of sorts in puma's hijack implementation. Perhaps raise it on puma?
For us the issue is closed; we'll not investigate further. I think @unteem spent 2 full days on that :) I guess we should open the issue on puma, but we don't have the resources :/ I'd be curious to know why Discourse is using unicorn and not puma :) (maybe for this kind of reason). And thanks a lot for your kind support!
@pierreozoux I know for sure that both @schneems and @evanphx care dearly about making puma as robust as possible, and they definitely want the hijack implementation not to stall; this is critical for web sockets and other use cases. If @unteem has any kind of repro here it would be very handy and would save others multiple days of debugging.

As to why Discourse uses unicorn and not puma in clustered mode: I guess we very much like the fact that rogue requests take out just a single worker and a single request, rather than potentially a large number of unrelated requests. Plus, the automatic shielding against rogue gems that don't release the GIL in C extensions is nice. Unicorn has treated us nicely, but yeah, memory is a challenge. However, rack hijack having issues was never a factor in our decision to use unicorn vs puma.
The puma hijacking implementation explicitly performs no buffering, so I'm unsure why you'd see a 25s delay, but I don't know much of anything about the message_bus code. I'm happy to look at any specific usage of the hijacking to try and see if there could be an issue, though.
We just write directly to the socket as things happen, using chunked encoding: https://github.com/SamSaffron/message_bus/blob/master/lib/message_bus/client.rb#L226-L257 Not sure this is a specific bug in puma, though, until we build some sort of test harness that works in unicorn/passenger and fails in puma, which I was hoping @unteem could help with.
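The wire format behind that linked code is standard HTTP/1.1 chunked transfer encoding, which can be sketched in plain Ruby: each chunk is the payload's byte size in hex, CRLF, the payload, CRLF, and a zero-length chunk terminates the stream. The helper names below are illustrative, not message_bus internals:

```ruby
# Minimal sketch of HTTP/1.1 chunked transfer-encoding framing, the
# format message_bus streams over a hijacked socket. Helper names are
# illustrative only.

# Frame a single payload as one chunk: hex byte size, CRLF, data, CRLF.
def chunk(data)
  "#{data.bytesize.to_s(16)}\r\n#{data}\r\n"
end

# The zero-length chunk that terminates a chunked response body.
def final_chunk
  "0\r\n\r\n"
end

body = chunk("hello") + chunk("world!") + final_chunk
# => "5\r\nhello\r\n6\r\nworld!\r\n0\r\n\r\n"
```

If any proxy in the chain collects these chunks instead of forwarding them as written, the client only sees data when the buffer flushes, which is consistent with the fixed delays reported in this thread.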
That all looks just fine.
@evanphx send us your public ssh key by mail or here, and I'll give you access to a VM with a reproducible test based on Discourse.
I had similar issues with puma. When I disabled clustered mode, it suddenly started working.
Sorry, forgot to comment back; we solved it by switching back to unicorn.
So using unicorn instead of puma?
I'm hitting this issue as well on puma; any more info on this? For me, I was running puma's defaults (0:16 threads), and over time notifications would go from instant to 25 seconds. I've since switched to unicorn as well.
An update: I'm hitting this on unicorn too, again with 25-second delays on some workers. If I had to guess, it's due to some kind of leak, perhaps threading-related. To counteract this, I'm going to use unicorn-worker-killer.
I'm wondering if people with this issue are not calling the forking-server setup code from the instructions. I'm running Puma in clustered mode, and I was having this issue until I added that code to my puma configuration. Removing it also reliably causes delays.

Upon re-reading the instructions, they do mention that this can solve non-immediate delivery for forking/threading app servers, but it seems like it should be recommended as standard configuration rather than as optional. I probably overlooked it initially because of the way it was written.
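The forking-server setup the message_bus README describes is a call to `MessageBus.after_fork` after each worker process is spawned. Assuming that is the code this comment refers to, a Puma config would look roughly like this (worker and thread counts are illustrative):

```ruby
# config/puma.rb -- illustrative sketch, not a verbatim config from this thread
workers 2
threads 1, 16

on_worker_boot do
  # Re-establish message_bus connections in each forked worker.
  # Without this, forked workers can miss or delay message delivery.
  MessageBus.after_fork
end
```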
This was not the case for me.
I'm seeing long-polling events getting chunked, and the js client receiving all of the messages on the channel, but only every 25 seconds.

If, on the other hand, I run the equivalent call in a console, I see the results immediately.

I couldn't find any config variables that were set to 25 seconds. Any idea why this would be the case?

I tried another setting, with which I start to get messages "immediately" (every second and a half).

Also, I tried setting MessageBus.alwaysLongPoll = true; and MessageBus.enableChunkedEncoding = false; without any change in behavior. I can confirm that this happens in both Safari and Chrome on OS X (haven't tried FF yet).

I'm integrating into Sinatra, calling use MessageBus::Rack::Middleware in my config.ru, and using thin v1.7.2 with message_bus gem version 2.1.1.
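A minimal Sinatra integration along the lines described would look roughly like this (the app class and route are illustrative; only the middleware line is taken from the report):

```ruby
# config.ru -- sketch of mounting message_bus in a Rack/Sinatra app
require 'sinatra/base'
require 'message_bus'

class App < Sinatra::Base
  get '/' do
    'hello'
  end
end

use MessageBus::Rack::Middleware
run App
```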