+project-cloud page timing out

Bug #325445 reported by Diogo Matsubara
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Launchpad itself
Fix Released
High
Jonathan Lange

Bug Description

As seen in OOPS-1127ED259, the +project-cloud page times out with a high non-sql value

Other occurances: OOPS-1131A271, OOPS-1131D296

Jonathan Lange (jml)
Changed in launchpad-bazaar:
assignee: nobody → jml
importance: Undecided → High
milestone: none → 2.2.2
status: New → In Progress
Revision history for this message
Jonathan Lange (jml) wrote :

https://pastebin.canonical.com/13489/ shows the output of EXPLAIN ANALYZE using roughly the same query on staging. The query takes only 0.4s -- much less than the 25s we are seeing on production.

https://pastebin.canonical.com/13490/ shows the same EXPLAIN call on production: less than 0.2s.

If I had to guess, I'd say we are being bitten by some sort of lock. I'd rather not guess though, and ask stub instead.

Revision history for this message
Stuart Bishop (stub) wrote :

The queries are all fast.

The OOPS reports state a massive amount of non-SQL time.

Looking at how the timeouts are currently implemented, we have no way of knowing if this was genuinely non-SQL time, or time spent waiting for a lock and attempting that last SQL query.

I think canonical.launchpad.webapp.adapter.LaunchpadTimeoutTracer needs to be fixed, checking if get_remaining_time() is positive before even attempting to issue a query - this will tell us if time has been spent in Python for some reason rather than time spend waiting on the db.

Nothing should be blocking that query - the only operations that block a select we only do during rollouts.

Revision history for this message
Jonathan Lange (jml) wrote :

Surely the timeout tracer wouldn't raise an error with a query that has already finished executing? Or are you suggesting that the query starts at roughly ~24s into the page request? (Or am I misunderstanding something)

Jonathan Lange (jml)
Changed in launchpad-bazaar:
status: In Progress → Fix Committed
Jonathan Lange (jml)
Changed in launchpad-bazaar:
status: Fix Committed → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.