Fun fact that I didn't know before:

We have an email list called "(this specific test suite) tiger team". I do wish one of them had bothered to look at the test results sometime in the last month or so; they might have started doing something about the failures at that point!!

So far all we have is a bunch of "well, there are a lot of different things that could cause this to fail", "we should increase the timeout so the test might pass", "let's keep the system alive for 6 hours so someone can troubleshoot" (uh, most of these are run overnight, so the systems are gone by the time we get back in!!), and so on. Well, it's possible that someone will actually track down the source of the error if they keep the system around...
