-
-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hubot doesn't reconnect to the stream on API restart #77
Comments
Yes, I reported that the other day. Let me know if you need any info from my end if you can't repro it. |
@emptywee thanks! I can, although not always. This is probably an eventstream module issue: by default a client should attempt reconnects indefinitely, and it looks like the module we use doesn't always do that. Even if it's the case, I think we'll be able to fix it or implement an additional layer of checks on our level. |
Hi there. I'm running a new installation to trial StackStorm and I think this issue biting us fairly hard. Anytime the our Hubot loses connection to the StackStorm API it doesn't attempt reconnect and is then running lame. No more StackStorm goodness. I have to manually restart Hubot :( Is this on the priority to fix? The functionality I've been able to implement quickly is great! But the reliability here is a big deal. We're running Hubot independently. I installed this script into our previously existing bot. Thanks for the help! |
Thanks for the report! we will be looking into it, your +1 increments the priority but no committed fix yet. |
@ticean: did you install with packages or AIO installer? We still have this issue on packages, but AIO should be good. In short, this error is caused by the stream consumer module not recognizing error 5xx as a reason for reconnect. In AIO we apply a special fix to give stream errors special treatment: https://github.com/StackStorm/st2workroom/pull/303/files If you chose packages as your install method, right now you can apply it manually, and in the future we'll hopefully have a better fix. |
I installed with AIO installer, but I configured this As an underlying problem, I'm finding that the stackstorm nginx instance is stopping (or going zombie) every night. I haven't been able to find out what's scheduled to cause that, but it definitely seems like periodic task. The host is dedicated to stackstorm with AIO. If I could find and fix this, then it would definitely lower the urgency of the issue. For now, though, I find our bot disconnected each morning and have to restart nginx and then the bot. |
I should also note that I've customized the HTTPS certs using letsencrypt. I modified the paths to the certs in I mention this because I haven't used puppet. Maybe there's a convergence scheduled nightly? Any recommendations? |
I ran into this failure to reconnect too just changed over to the new packages and running st2chatops on the same server as the rest of stackstorm. In the short term could be may be make |
As discussed on slack yesterday. I ran into this (again), but this time seems to have been caused by the st2stream process having a traceback during log rotation. 👍 |
@armab I just stumbled across reconnecting-eventsource. It looks like it has most of the logic we would need to implement to have hubot consistently reconnect to st2stream. What do you think of using that instead of the built-in |
@blag Seems like For example, https://github.com/fanout/reconnecting-eventsource#when-does-the-normal-eventsource-not-reconnect advertises to reconnect on But if you'll catch the root cause, understand what happens at a deeper level (is it missing closed connections in original eventsource or was it specific HTTP code or anything else), - that would be great. I think it's all doable, just a matter of dedication and time spent on troubleshooting. If finding that |
@armab @blag Installed system
Or stop all above services at once then start, |
@jinpingh Take a look at one edge case example of this: #157 (comment) |
A community report. I've run into that issue a couple times, too. Not sure if we should solve in st2client or here.
The text was updated successfully, but these errors were encountered: