Something causes intermittent, daily, non-scheduled 502s, visible in webservice

Bug #702134 reported by Gary Poster
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Launchpad itself
Fix Released
Critical
Unassigned

Bug Description

As described in bug 380504, webservice users report that requests return 502s intermittently. They said that this happened on production, daily, at different times throughout the day.

This seems to rule out deployment as the only cause (see bug 640065); it also seems to rule out at least some of our own scheduled application state changes, such as sending signals for log rotation.

The original bug was closed when lazr.restfulclient got the ability to retry. However, the underlying problem in Launchpad presumably still exists.

In https://bugs.launchpad.net/lazr.restfulclient/+bug/380504/comments/31 , I described a hypothesis from Francis Lacoste that HAProxy might be the cause. At the time, we were blocked from investigating because the LOSAs did not have access to the logs. Since then, LOSAs have gained access at our request, though they still cannot give us direct access because of operational concerns. We should take advantage of this and pursue that avenue.

Note also bug 636713; Francis' hypothesis is essentially that a sustained overload will also result in the symptoms of this bug.

Revision history for this message
Robert Collins (lifeless) wrote :

I agree with Francis hypothesis, we are deeply overloaded at the moment, new capacity should be coming online in the next couple of weeks.

Revision history for this message
Robert Collins (lifeless) wrote :

We've reconfigured and significantly reduced queue times - we served 5M api requests on monday alone. Closing this (somewhat optimistically, but 'users are not complaining' is the only metric we have - and they aren't now).

Changed in launchpad:
status: Triaged → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.