Sheriff Log: Chromium OS (ARCHIVE!)

          THIS IS AN ARCHIVE.  See Sheriff Log: Chromium OS for newer entries.

          2014-04-24 - 2014-04-25
          Sheriffs: jwerner, pstew Gardener: bshe, Build Deputy: yjhong, Lab Sheriff: beeps
          • 366158 video_VideoEncodeAccelerator failure now breaking BVT as well
          • 366537 login_MultipleSessions fails in BVT
          • 365973365976366283366292366460366465366552366588 video_ChromeHWDecodeUsed causing a spate of failures
          • 346868 login_LoginSuccess makes a re-appearance in the PFQ uprev failures, but this is likely due to BrowserConnectionGoneException
          • 366577366581 graphics_SanAngeles failures popping back up
          • 366592 login_LogoutProcessCleanup also failing too, so there's clearly something systematic (in telemetry?) causing BrowserConnectionGoneException failures.
          • 366593 (login_OwnershipTaken), 
          • 356020 - Reset failing in BVT sporadically since Mar 24.  Odd that it hasn't been assigned to anyone yet for triage.
          • 366823 - daisy_spring DUTs are all non-functioning.
          • 366988 parallel_emerge failed with IOError: [Errno 9] Bad file descriptor
          • CL:196992 to fix sdk bot failure in strongswan and ipsec user/group
          • 367086: [bvt] login_OwnershipNotRetaken Failure
          • 367174: Tests aborting on HW tests
          Sheriffs: vapier Gardener: jamescook
          • 365507 Login screen failures due to multiple blink regressions
          • 365982 security_Firewall failed
          • 366067 graphics_GpuReset hung the kernel
          • 366142 Assertion about window opacity/visibility
          • 366141 Suite aborts from a timeout without any other failures (causes PFQ failures, due to lab overload)
          • 366158 video_VideoEncodeAccelerator failure breaking Chrome PFQ
          • CL:196473 and CL:196481 asan bots failed in unittest due to bug in bootstat unittest
                  Sheriffs: vapier Gardener: jamescook


                          Sheriffs: wuchengli

                          • [365195] stumpy_moblab canary unittest failed
                          • [332665] [bvt] autoupdate_CatchBadSignatures Failure


                          Sheriffs: wuchengli, semenzato, benchan

                          • [359223] [monroe] graphics_SanAngeles suspected to reboot/hang machine
                          • [365035] ChromeSDK failed on all full builders (because of LD=gold by default and probable gcc incompatibility)
                          • [364875beaglebone canary build fails running out of space (FIXED)
                          • [364887] moblab unit tests failures
                          • [364818] x86-generic ASAN unittest failing due to compile warnings in


                          Sheriffs: milleral, thieule

                          • [364617] Beaglebone servo image is too big
                          • [356020] [bvt] reset Failure on falco-chrome-pfq/R35-5684.0.0-rc4
                          • [355843] TreeCloser: build failure in DebugSymbols 600sec timeout
                          • [364669] Daisy skate build is failing on serial-tty
                          • [359223] [monroe] graphics_SanAngeles suspected to reboot/hang machine
                          • [358737] [bvt] graphics_GpuReset Failure on stout-release/R36-5718.0.0


                          Sheriffs: milleral, thieule


                          Gardener: derat

                          • [363884] LKGMSync step failing repeatedly due to bad SHA1 when syncing coreboot
                          • started new x86-generic nightly chromium PFQ build since last night's run died on some slaves


                          Sheriffs: jchuang, reinauer, Gardener: derat

                          • [363339] stumpy moblab failure 
                          • [363294] sandybridge-canary failed.
                          • [362999] Failed cbuildbot failed hwtest [bvt] [parrot_ivb]. Transient.
                          • [358737356020] Transient HWtest fail on falco and wolf (both issues have been auto filed many times)
                          • [363015] Failed cbuildbot failed debugsymbols [x86-mario]
                          • [363167] HWTest step timed out on daisy_spring and falco PFQ


                          Sheriffs: olofj, adlr, josephshi, ihf

                          • [362621] Removed pyauto dependencies that broke PFQ.


                          Sheriffs: keybuk, bfreed

                          • [339291] Reverted a set of CLs that caused platform_Powerwash failure.
                          • [360797] The chumps from 339291 broke incremental builders and required deputy assistance.
                          • [360898] video_DecodeAccelerator is increasingly unreliable.  Maybe we should make it experimental?


                          Sheriffs: keybuk, bfreed

                          • [360084] rambi-b canary build failed in the Archive stage on loopback mount failure.  Believed transient.
                          • [360082] Chrome PFQ fails with unknown linker flag (--reduce-memory-overheads), likely because of last night.

                          2014-04-02 & 2014-04-03

                          Sheriffs: pprabhu, dgreid

                          • [359143] generate_payload failed to unmount a file system, and we tried to rm it later. pprabhu@ forced canaries to restart, since ongoing runs were all going to hit this issue. But it took a while to take this decision.
                            This hit us again later in the day, keeping canaries red almost all day. The reason was that a script had to be manually upreved to pull in the revert. See CL.
                          • [359227] VMTest hung. Root cause unknown.
                          • [359422] VMTest failed because VM ran out of space when running the tests. We reverted the Cl in the morning. Unfortunately, although all the canaries are back online, we can't get the CQ to pass yet, due to flakes + lots of CLs trying to get in. So, we need to uprev the manifest by allowing a noop CL through a throttled tree.
                          • [348199] and [353590] daisy_spring-pfq failed because of known GOLO and update engine flakes.
                          • [359760] beaglebone_servo canary is currently broken. [TODO: If the canary is still red at EOD, revert the CL mentioned in the bug].


                            Sheriffs: katierh, dparker

                            • [358180] Daisy canary builder failure caused by error in a git repro. Existing error turned into a warning due to
                            • [358075] daisy_skate and daisy_spring canary failing due to clustered Chrome builds on one builder. Variants not using pre-built of Chrome from daisy.


                            Sheriffs: dbasehore, armansito, sheckylin

                            • [357364] Tree doesn't close anymore when builds fail. Looks like it's fixed now.
                            • repeated failures in daisy canary during build packages.


                            Sheriffs: dbasehore, armansito, sheckylin

                            • [353906] Builder out of space
                            • [357093] x86 generic ASAN fail due to Chrome
                            • [357202] Pre-CQ timeout.


                            Sheriffs: dianders, vbendeb

                            • [356187] widespread provision failures; waiting for lab sheriff for the most part
                            • [356198] and [356199] video_VideoDecodeAccelerator - probably a duplicate of [353898].  There's  a possible fix for that but it hasn't gone back to R34 yet.
                            • ... lab issue is hopefully fixed now ...
                            • ... David James and crew restarted CQ ...
                            • ... various things handled by David James ...
                            • [355843] beltino canary - DebugSymbols hung for 600 seconds
                            • [348188] slippy canary and daisy canary - Flood of "Too many open files"
                            • Chrome uprev has failed a few times; Chrome sheriff handling?

                            2014-03-21 and 2014-03-24

                            Sheriffs: jrbarnette, tbroch

                            • [355843] beltino canary: DebugSymbols failed during upload with timeout
                            • daisy incremental: CQ bug missed bump of chromeos-init for CL:190619 initially (race?) then got it fixed making manual override here unnecessary.
                            • [353018] sandybridge canary: OSError(16, 'Device or resource busy') ... believed to be not enough loopback devices.
                            • [354573] A bug in Chrome caused the x86-generic paladin to fail multiple times in VM testing.

                            2014-03-19 and 2014-03-20

                            Sheriffs: snanda

                            • [344506]: peppy canary failed to reboot due to ASIX USB issue.
                            • [352276]: falco canary platform2 build failure. is the fix but still waiting to be blessed by CQ.
                            • [354496]: monroe paladin misbehaved for a while.
                            • [354262]: sandybridge build failed.  DUT was down?
                            • [311350]: platform_Powerwash Failure on daisy_spring-release. USB dongle flakiness?

                            2014-03-17 and 2014-03-18

                            Sheriffs: yjlou, wfrichar, victoryang, hungte (TPE)

                            • [352994] cros_generate_test_payloads failed to find image folder (race condition)
                            • [353429] chrome/chromium pfq bots died in build_image due to missing
                            • [353461] failuer in uploading DebugSymbols

                            2014-03-13 and 2014-03-14

                            Sheriffs: cywang(TPE), dgarrett, bleung, gwendal

                            • [348855] amd64-generic-asan: logging_UserCrash timed out (flaky)
                            • [352093] daisy_spring: HWtest job timeout, but tests are still running
                            • [350677] x86-generic-full : cryptohome fails to link.
                            • [352276] platform2-0.0.1-r366 fails on arm-generic full
                            • [352297] Pre-CQ Failure- Gerrit Code Review requires Java 7
                            • [348855] amd64-generic-asan: logging_UserCrash timed out
                            • [352428] x86-generic asan : logging_AsanCrashTelemetry : Unhandled TabCrashException: Handshake Status 500
                            • [72633] x86-generic incremental: login_OwnershipNotRetaken
                            • [352520] atom canary: x86-mario: build_image failed (can't read superblock)

                            2014-03-11 and 2014-03-12

                            Sheriffs: dlaurie, grundler

                            2014-03-05 and 2014-03-06

                            Sheriffs: miletus, shawnn

                            • [337490] daisy incremental unmount completed, but returned an error
                            • [348855] amd64-generic-asain, logging_UserCrash timed out
                            • [348758] x86-generic-asan failure. not sure how to interpret the failure message.
                            • [349559] Signer failure on all canaries
                            • [343442] Wolf Paladin builder wedged
                            • [349597] chrome-internal-fetch netrc credentials revoked

                            2014-03-03 and 2014-03-04

                            Sheriffs: quiche, vpalatin

                            • [348607] Chrome PFQ failure. later Chrome builds cycled green.
                            • [345501] platform_FilePerms: jrbarnette@ has CL checked in, but it wasn't picked up by the lab server. lab team will update its server.
                            • [347932] security_AccountsBaseline (multiple times). cmasone@ investigating.
                            • [348059] chromiumos sdk builder failing. pinged bug.
                            • [348758] x86-generic-asan failure. not sure how to interpret the failure message.
                            • [348799] stumpdgarrett, bleung, gwendaly-paladin, reboot failure in autoupdate_CatchBadSignatures
                            • [348805] x86-generic, e2fsprogs failed to emerge
                            • [330670] breakpad unittest failure on amd64-generic
                            • [348855] amd64-generic-asain, logging_UserCrash timed out (2x)
                            • [348889] duck canary failure
                            • [345491] x86-mario canary: GSResponseError 403
                            • [349073] parrot canary: platform_PowerWash failed
                            • [337490] daisy incremental unmount completed, but returned an error
                            • [349187] x86-mario canary failed in ChromeSDK: out of disk?
                            • [349292] duck canary, GS_ERROR: Attempt to get key

                            2014-02-19 and 2014-02-20

                            Sheriffs: olofj, pstew

                            • mario-canary fails in UReadAheadServer.  No logs.
                            • Chrome uprev failed on thermal, dianders@ to revbump package, but failed again
                            • [344914] CQ failing due to failure to build hostapd, deemed to be a corrupted tarball in the buildier's cache.
                            • [345098] New factory install script broke the archive process.
                            • [345210] Rash of signer test failures (alexslippyparrotleon)
                            • [345479] VMTests fail with 'NoneType' object has no attribute 'Cleanup'
                            • [345491] GS AccessDenied error while uploading prebuilts for slippy_canary.  Invoking troopers.
                            • [345476] login_CryptohomeIncognitoTelemetry and ScreenLockerTelemtry [344849] causing chrome uprev issues

                            2014-02-17 and 2014-02-18

                            waihong (TPE), bhthompson, marcheu:

                            • peach_pit canary hwtest flake ->
                            • amd64-generic paladin machine went offline for a while. Contacted the Trooper to fix.
                            • stumpy canary hwtest flake - happened again,

                            2014-02-13 and 2014-02-14
                            reinauer, garnold

                            • beaglebone canary failed on DebugSymbols stage; appears to be a flake (

                            If you are seeing double-free/heap corruption errors when running gn during ChromeSDK runs (e.g.:


                            it's probably Please see my explanation in the bug.

                            - posciak


                              benchan, sosa, owenlin:


                              dkrahn, adlr, kcwu (TPE):

                              • Due to DiRT, MTV was offline and internal waterfall was affected ~14:00 - 17:00. Some buildbots were affected as well.
                              • Another chrome pfq vmtest failure on falco. - these don't actually block uprev, see
                              • Google Storage issues (fiber cut) cause canary failures -
                              • Filed for link canary build error.
                              • linux_chromeos dbg 2 bot has very long cycle time (~4 hours) so failures may show up late, filed


                              dkrahn, adlr, kcwu

                              • mario canary hwtest bvt failure - emailed troppers to escalate. This has been going on for a while now, it seems
                              • chrome pfq vmtest failure - next build cycled green, so letting this one go
                              • falco chrome pfq vmtest failure -
                              • mario canary hwtest bvt failure ended up being a combination of and no logs reported -
                              • investigated repeated failures on x86-generic asan builder - filed
                              • daisy_spring starvation in the lab due to
                              • wolf_canary lab flake due to


                              skuhne, jwerner, thieule:

                              • Tryserver unavailable ~13:00 - 16:00
                              • HW lab down 14:39 - 15:46
                              • dev_install failed due to connection timeout
                              • amd64-generic-incremental build failure during build image due to unmountable partition. Device busy.
                              • system sometimes does not come up after reboot in VMTest
                              • SimpleTestAndVerify fails on x86generic ASAN. Saw that yesterday already, but there were many more problems.. (->
                              • PFQ for daisy_spring is still failing in HWTests since there are apparently no machines since 6 days (->


                              mtennant, skuhne, jwerner, thieule (MTV):

                              • PFQ for falco, lumpy, .. had failures. Might be fluke upon "Failed to run /home/chrome-bot/depot-tools/gclient runhooks". Retriggered / clobbered / but no success (-> PFQ seems still to be broken @5:45pm, but it will cycle tomorrow morning green since Chrome was red as well.
                              • PFQ x86: lab failure or restarting VM's (->
                              • Several x86-generic and amd64-generic failures in SimpleTestUpdateAndVerify (->, VM hung on reboot, rolling back 3.10 kernel switch on generic to fix)
                              • daisy canary - random tests failed due to crashdumps from (non-fatal) Xorg crashes that took too long to symbolize (client test is marked GOOD but server job times out)
                              • daisy_spring canary - all tests after a certain point in the suite failed with ABORT (suite hit 2h timeout since not enough lab devices available soon enough to finish on time)
                              • dev_install test is failing on canaries - (offending change was rolled back with some difficulties problem was identified and will be fixed on reupload)


                              posciak (TOK)

                              dianders, rspangler (MTV):


                              dianders, rspangler


                              tbroch, jrbarnette

                              • race/flake for tar during DebugSymbols ... tar: debug/bin: file changed as we read it
                              • double-free/corruption errors when running gn during ChromeSDK stage
                              • [bvt] network_VPNConnect.l2tpipsec_cert Failure
                              • mount failure during build_image (error status 32)

                              posciak (TOK)

                              • See for a probable reason behind occasional double-free/corruption errors when running gn during ChromeSDK stage
                              • build failures as CQ missed one of the CQ-DEPEND CLs, because it was uploaded as a draft
                              • login timeouts on link canary in a few bvt tests; suspecting to have made login last longer/stop working... may need to followed up on if persists;


                              tbroch, jrbarnette

                              • failed to uprev chrome 34.0.1813 due to proto change for LocalExtensionCache::CacheItemInfo::CacheItemInfo
                                • contacted chrome gardner (harrym) to resolve
                              • stumpy canary. 
                              • leon/samus/link/panther canary failures for vm_test fix here.
                              • radvd ebuild failure ... fixed with clobber build.  Email triage by (jamescook, avi, xiyuan, achuith)

                              posciak (TOK)




                              katierh, armansito

                              • Jan 27 was full of clobbering...
                                • Needed a few reverts due to bad eclass CLs landing - and then lots of clobbering, removing prebuilts, etc to get the tree in a sane shape -
                              • Jan 24 had a number of failures due to Gaia corp errors...


                              derat, wiley, dparker

                              • failure in build image: mount(8) failed: Device or resource busy
                              • Timeouts in login_CryptohomeTelemetry. Fixed by this revert.
                              • PFQ failures with "TEST_NA: Unsatisfiable DEPENDENCIES" caused by a server dying in the lab (per scottz@).
                              • CQ fails in unittests due to timeout in chromite:


                              derat, wiley, dparker

                              • all PFQs failed due to factory-test-init and chromeos-test-init conflict (see 2014-01-20). forced rebuilds
                              • two CertificateManagerBrowserTest tests failing on "Linux ChromiumOS Tests (dbg)(2)" builder
                              • BVT failure on pit caused by kernel crashes:
                              • BVT failure on ZGB caused by flake in network_DhcpStaticIP:


                              reveman, pprabhu, dgreid

                              • security_ptraceRestrictions failed due to test_image update. Fixed by this revert.
                              • factorytest-init and chromeos-test-init package conflict. Fixed by this revert.
                              • Arm canaries and pfq were broken. It was mostly a chrome issue, fix had already made its way to ToT pfq builders. TODO(sheriff): Make sure that the nightly-pfq picks up this change. Essentially, make sure that nightly-pfq has a green run.
                              • We didn't have enough daisy_spring DUTs in the lab, so HWTest timed on ChromePFQ a couple times.

                              2014-01-16 and 2014-01-17

                              bfreed, snanda, ellyjones

                              2014-01-10 and 2014-01-13

                              bleung, dbasehore, spang

                              • Node.js issue with downloading Chrome. Caused all canary builders and full builders to fail.
                              • Delay between ebuild commit and uprev commit
                              • beltino canary failed in archive, might be low memory issue.

                              2014-01-06 and 2014-01-07

                              jsalz (TPE), dlaurie, grundler

                              • Recurring issue in ManifestVersionedSync step on several builders (zgb, falco, peppy, stout) and experimental builders.  This is the top item for tree closure on 1/7.
                              • Chrome PFQ nightly failing to compile.  Fixed in chrome already, should be good for tomorrow's build.
                              • autopdate_EndToEndTest failure
                              • "Update failed" in VMTest
                              • daisy_incremental out of space
                              • experimental_platform_RebootAfterUpdate suspected of putting machines in Repair Failed state.
                              • login_CryptohomeIncognitoTelemetry suspected of putting machines in Repair Failed state.
                              • audiovideo_VDA failed
                              • "Suite prep" failure - probable bvt timeout [update: fixed on parrot_ivb]
                              • platform_FilePerms: "/dev/pts" is missing options "set(['mode=620', 'gid=5'])"
                              • GerritHelperTest unit test failure
                              • UploadPrebuilts fails with CommandException: Invalid canned ACL

                              2014-01-02 and 2014-01-03

                              djkurtz (TPE)

                              • - autoupdate_CatchBadSignatures hash failure
                              • - "daisy canary" - autoupdate_Rollback failed failed to find a job_repo_url for the given host
                              • - "parrot canary" - time out during bvt 

                              *** 2014! Happy New Year!! 2014! ***

                              2013-12-25 and 2013-12-26

                              hungte (TPE)

                              2013-12-23 and 2013-12-24

                              jcliang (TPE)

                              2013-12-19 and 2013-12-20

                                cywang (TPE), gabeblack, zork

                                2013-12-17 and 2013-12-18

                                waihong (TPE), shawnn, charliemooney

                                2013-12-09 and 2013-12-10

                                sabercrombie, milleral, miletus, mtennant (Chrome OS build deputy), rginda (Chrome gardner)

                                • - chromeos-base/telemetry failed on chrome_pfq, multiple platforms
                                • - Parrot canary failed because of chromeos-chrome build failed

                                2013-12-5 and 2013-12-6

                                dbasehore, pstew, seanpaul, tengs (chrome)

                                2013-12-3 and 2013-12-4

                                reinauer, benchan, josephsih, jamescook (chrome)
                                • - repeated hwtest timeout on daisy_spring
                                • TestFailure on HWTest [bvt]: network_DefaultProfileCreation: Missing setting CheckPortalList=ethernet,wifi,cellular
                                • - lumpy chrome pfq failing (SimpleTestUpdateAndVerify fails, system doesn't come up after reboot)
                                • - devinstall_test failed with KeyboardInterrupt: SIGINT received in VMTest x86-alex canary 
                                • - chromite unitest failure on samus canary
                                • - chromite unitest gerrit_unittest failure with KeyError: 'http' on amd64-generic full and x86-mario. szager had submitted a patch to fix this problem.
                                • - ALL bvt tests failed. I believed they were caused by the same reason. Merged all the other auto-filed issues to this one to track this bug.

                                2013-12-2 (and 11-29 - holiday)

                                skuhne, dkrahn, rspangler, kinaba
                                • See several timeout problems (auto update, VMTest) and investigating. re-run builder with clobber
                                • - lumpy chrome pfq failing
                                • - daisy_spring canary hwtests failing b/c dut not coming back after reboot
                                • - filed this bug to track repeated hwtest timeout instances on daisy_spring
                                • - filed this bug to track repeated chromite unittest failure on x86-mario canary
                                • - filed this bug to track repeated perf (benchmark tests) failures on lumpy, parrot, daisy, ..
                                • - closed the tree because lumpy paladin will not pass until this is fixed -- update: temporary fix here and tree reopened


                                thieule, garnold, kcwu

                                • x86-zgb canary failure during BuildImage; cgpt error, likely due to a corrupt image remnant from an previously interrupted run; re-ran builder w/ clobber.
                                • Tree/builders closed for maintenance between 9am-12:35pm PST.

                                2013-11-25 and 2013-11-26

                                sosa, tbroch, yjlou

                                • 11/26: : stumpy canary: "update-engine failed"
                                • 11/26: butterfly canary: hwtest failed ... out of machines in lab
                                • 11/26 x86-mario canary: cryptohome bug ( )
                                • 11/26: : chrome PFQ failing on amd64-generic due to disk full
                                • 11/26: : previous build interrupted during setup_board left unclean state.  davidjames cleaned up manually
                                • 11/26: : cbuildbot issue w/ versioning
                                • 11/25: mtv no closures 9-5pm PST

                                2013-11-21 and 2013-11-22

                                fjhenigman, dianders, jwerner

                                • daisy_sping canary: new auto-filed bug 321997 - "update-engine failed" but can't tell why
                                • x86-generic ASAN#14728 - Failed in VMTest in cryptohome stuff (apparently a TPM failure?).
                                • All canaries are failing - closing tree as per lab folks while they investigate.
                                • Friday morning: vapier opened the tree with the message "network_3GSmokeTest.pseudomodem.3GPP failure ->"
                                • Friday morning: vapier opened the tree with the message "daisy_spring canary" looks like flake; peppy hwtest -> Note that 322263 has since been marked as fixed.
                                • Butterfly canary died again in network_3GSmokeTest.pseudomodem.3GPP, but with a different message. This time: "Unhandled gaierror". Opening up to track.

                                2013-11-19 and 2013-11-20

                                dgreid, wiley, owenlin

                                2013-11-15 and 2013-11-18

                                puneetster, adlr

                                • chrome crash on boot as of 4966.0.0/33.0.1710.0 . Chrome pinned. CL to unpin:
                                  • chrome is now unpinned
                                • Network issues w/ archiving ; still happening. on-call groups have been paged. Throttling that was impacting us is resolved.

                                2013-11-13 and 2013-11-14

                                rminnich, katierh

                                • - updated chrome build had a telemetry bug - fix landed in chrome and then sent to the canaries - the offending test was marked as experimental until it landed
                                • - flake on ASAN vmtest falls into an error path looking for pyautolib which doesn't exist
                                • - stout32 flake (autoupdate_Rollback_SERVER_JOB,Provisioning) -> duped to 317052


                                bfreed, vbendeb

                                • ARM systems broken: filesystem corruption causing chrome/autoupdate failure:
                                  • Toolchain reverted from 4.8 to 4.7!
                                • lumpy paladin rootfs is low ->
                                • VMTest SimpleTestVerify failed in login_OwnershipApi -> duplicated to


                                jrbarnette, wuchengli

                                • build failure during TPE shift, fixed with revert in
                                • stumpy canary went red: AU test failed because of
                                • bayleybay canary isn't important, filed to have it removed.
                                • mario canary failed unit tests - akeshet investigated.
                                • amd64-generic ASAN builder failed, timed out uploading to GS.
                                • bayleybay canary was red at the opening bell, filed


                                2013-10-28 and 2013-10-29

                                sbasi, dparker

                                2013-10-24 and 2013-10-25

                                jchuang, quiche, dlaurie

                                • bayleybay-canary UnitTest still fails. (not tree closer)
                                • Tree closed 2 time on Thursday for Archive time out in canary (1st time: 22 slaves fail. 2nd time: 2 slaves fail):
                                • Tree closed 1 time for Mario incremental compile error in platform2 and p2p:
                                • Tree closed for VMTest fail on x86/amd64 ASAN:

                                2013-10-18 and 2013-10-21

                                snanda, spang, dhendrix

                                • bayleybay-canary disk filled up, lab team fixed it. However it's still having other issues and hasn't built successfully for several days.
                                • Had issues with autotest_lib.client.common_lib.barrier_unittest:

                                2013-10-16 and 2013-10-17

                                bhthompson, jeremyt

                                • Tree closed 10 times due to (buildbot failure in ChromiumOS on {amd64,x86}-generic ASAN)
                                • Tree closed 1 time, at the end of Thursday on slippy canary.  Root cause is not known.
                                • Preflight queue failed 3 times,

                                2013-10-13 and 2013-10-14

                                bleung, vpalatin

                                • Tree closed much of Monday due to Lots of BVT failures due to hosts not returning from an update to R32-4820.0 Upon further investigation, it looks like there is a lab network issue (
                                • Build failed due to GoB flake. Failed in Clear and Clone phase :
                                • connection timed out due to banner exchange flake in VMTest happened on parrot64 canary : comment #100
                                • Segfault in AddresSanitizer on amd64-generic-ASAN and x86-generic-ASAN :

                                2013-10-10 and 2013-10-11
                                reinauer, garnold

                                • tree closure due to login_CryptohomeIncognitoUnmounted  during hwtests on peach_pit canary, 
                                • chromeos-chrome failed,
                                • x86 generic ASAN,
                                • security_OpenSSLBlacklist failed,
                                • tree manually closed and reopened for parrot 32->64 transition
                                • another closure (amd64 generic ASAN) due to chromeos-chrome errors during parallel stripping,
                                • amd64 generic full failed during SyncChrome with what seems like a bad DEPS file or stale mirror,

                                2013-10-08 and 2013-10-09
                                rharrison, sheu

                                2013-10-04 and 2013-10-07

                                • mario incremental timeout in SimpleTestVerify
                                • "x86 generic incremental" & "amd64 generic full"  "Connection timed out due to banner exchange"
                                • parrot32 vmtest failed due to KVM dying. Trooper rebooted server (build100-m2).
                                • "amd64 generic ASAN" & "x86 generic ASAN" - chromeos-chrome build fails
                                • "link canary" - power_Resume, "Sanity check failed: missed RTC wakeup" -
                                • "daisy_spring canary" - power_Resume, "Spurious wake from s5m-rtc" -
                                • "falco canary" - Archive  BackgroundFailure: "code 600" - filed new

                                2013-10-02 and 2013-10-03
                                kamrik, dkrahn, zork

                                • Archive stage sometimes flakes out while uploading symbols
                                • Chaps unittest failure:
                                • power_Resume hardware test failure:
                                • VMTest failure "Connection timed out due to banner exchange":
                                • October 3 ...
                                • Login tests failed in BVT on all platforms - matrix. One is a bug the other two seem to be real. Downloaded image - can't log in, get stuck on the "Updating screen". Guest login works ok.
                                • Login was actually broken but passed the chrome pfq.  Tracking bug is  Bug has been fixed but chrome uprev pending.  In the meantime, chrome has been pinned to 32.0.1658.2.  A CL to unpin is at
                                • Chrome unpinned - 10/04 10:35am

                                2013-09-24 and 2013-09-25

                                sabercrombie, dbasehore

                                • x86-mario canary autoupdate_EndToEndTest failed ( -- intermittent platform_Shutdown failure).
                                • daisy canary platform_CryptohomeTestAuth failed (
                                • some machines failing BVT due to
                                • Recurrence of of "Connection timed out due to banner exchange" on Mario incremental (security_Minijail_seccomp) and x86 generic ASAN (platform_CrosDisksDBus).
                                • Filed for intermittent Link power_Resume "Could not find start_resume_time entry" BVT failure.
                                • Filed which is causing the "Clear and Clone chromite" stage to fail.
                                • again on mario_incremental, this time with security_SymlinkRestrictions.


                                benchan, gabeblack, jcliang


                                benchan, gabeblack, jcliang


                                bfreed, pstew


                                bfreed, pstew

                                • All release builds fail in VMTests.  This seems to be an issue with the python eclass which has a broken issue with EROOT vs EPREFIX when looking for python wrappers during the gmerge tests.  This issue only affects EAPI=4 due to logic in that eclass, so only when hdctools upgraded to use it this problem was triggered.
                                • Chrome fails to uprev due to includes moving around, and PDF failing to compile against it.  This issue was fixed in Chrome but the version we were trying to uprev to did not have it.  Spoke with sky@ to have a new version cherry-picked to today's Chrome release branch, and have a build kicked off for it.
                                • 4:23: daisy canary fails with


                                  dianders, jrbarnette

                                  • start of day: tree closed due to daisy_spring canary (login_CryptohomeUnmounted failed on daisy_spring-release).  Filed (the bug, which looks like infrastructure) and (why did autofiler use wrong bug).
                                  • start of day: butterfly canary.  Autofiler chose and that seems reasonable.
                                  • start of day: crosbot wasn't updating IRC.  ellyjones booted it to fix it.
                                  • 9:30: Random PFQ failure filed as  "WARNING: Cannot rev sys-boot/chromeos-coreboot-fox"
                                  • 9:30: Ben points out that mario canary has been dead for days.  
                                  • 10:10: parrot canary: "double free or corruption" in VMTest login_CryptohomeMounted.  Reusing old
                                  • 10:20: daisy_spring canary failed on "login_CryptohomeUnmounted_SERVER_JOB".  Tracking with autofiled bug, although really all we can track is why proper debug info wasn't gathered.
                                  • various: Chrome PFQ Failing to uprev Chrome commit - jrbarnette thinks that this will get better with a commit to stop including Chrome Driver; hopefully Chrome is handling?
                                  • after 5:00: an x86-alex canary failure will happen soon.  Looks like is hitting again.


                                  dianders, jrbarnette

                                  • start of day: tree is green and things look reasonable.

                                  2013-09-12 and 2013-09-13

                                  jwerner, hungte (TPE)
                                  • mario-incremental: VMTest network failue ("Could not initiate first contact with remote host", "Connection timed out during banner exchange"),
                                    • Another one on x86-generic-incremental, same underlying cause but this time with a huge spew of  ssh debug output due to the connection problems
                                  • daisy_spring-canary: weird autoupdate_EndToEndTest.npo_test_delta timeout problem, no idea, autofiled
                                  • mario-incremental:



                                  • Tree started green in the morning.  cmp@ reverted a change which caused prebuild uploads to fail with a message about gerrit rejecting a push:
                                  • At around 4:30 the tree suffered two flakes nearly simultaneously: and

                                  2013-09-04 and 2013-09-05

                                  waihong (TPE), adlr, mtennant



                                  2013-08-29 and 2013-08-30

                                  puneetster, vbendeb, cywang

                                  2013-08-27 and 2013-08-28

                                  dgreid, tbroch, yoshiki (davidjames, cmp, vapier, others probably)

                                  2013-08-23 and 2013-08-26

                                    thieule, shawnn, dgozman

                                    • SSH connectivity drops temporarily in VMTests (This is holding up the CQ, bumped to P0)
                                    • CQ VMTest fails with "Devserver did not start"
                                    • [bvt] login_CryptohomeIncognitoUnmounted failed on daisy-release/R31-4583.0.0
                                    • devserver hang (vmtest failure: WARNING: Killing tasks...) (sosa@ has fix pending)

                                    2013-08-21 and 2013-08-22

                                      chromeos-chromedgarrett and rminnich and josephsih

                                      • Canary flake (4-5 times?) from known:
                                      • VM flake from known:
                                      • Canary failed because of HW Lab issues being worked. Reason obscured by
                                      • x86-zgb canary failed:
                                      • has continued to be popular.

                                      2013-08-19 and 2013-08-20

                                      quiche and sbasi

                                      2013-08-13 and 2013-08-14

                                      charliemooney and dhendrix and hychao

                                      • Failures with google storage are causing CQ and canary failures:
                                        • The CQ had managed to complete one run as of 18:15 PDT, but there's no confirmed resolution.
                                      • A bit of a commit queue dependency problem, that got sorted out neatly:
                                      • There was a mysql server that apparently died and caused some failures: (This was an issue both days)
                                      • login_CryptohomeIncognitoUnmounted failed twice in a row on x86-generic incremental builder. No root cause yet...
                                      • PFQ nightly builders were failing due to aforementioned dependency problem.
                                      • daisy canary failed hwtest platform_CryptohomeTestAuth - 262546 autofiled again.
                                      • Error execute cmd 'tar xjf /usr/local/autotest/packages/dep-pyauto_dep.tar.bz2 ...'
                                      • Filed
                                      • Another occurance of CryptoHomeTelemetry failing:
                                      • One FPQ failed suuper early and never even tried to build:

                                      2013-08-09 and 2013-08-12
                                      dkrahn, chinyue

                                      • VMTest login_CryptohomeIncognitoUnmounted failed
                                      • platform_RebootAfterUpdate failures
                                      • flaky chromeos-ec unittest
                                      • daisy_spring canary failure: All hosts with HostSpec ['board:daisy_spring', 'pool:bvt'] are dead!
                                      • peppy canary failure: chromite unittest
                                      • multiple failures: VMTest timeout error
                                      • Widespread canary failures over the weekend due to dependency problem: reverted

                                        2013-08-07 and 2013-08-08

                                        2013-08-05 and 2013-08-06
                                        cychiang, dlaurie

                                         2013-08-01 and 2013-08-02
                                        wfrichar, vpalatin, vapier

                                        reinauer, fjhenigman


                                            2013-07-24 and 2013-07-25
                                            olofj, garnold
                                            • --- daisy canary build fails to emerge kernel
                                            • --- AUTest failure on multiple builds, failing to spawn a local devserver via ssh; hopefully a transient (albeit scary) lab hickup
                                            • --- VMTest failure (login_CryptohomeMounted); appears to be a timeout sshing to the kvm

                                            2013-07-24 and 2013-07-25
                                            reinauer, bhthompson, bleung, sheckylin
                                            2013-07-22 and 2013-07-23
                                            petermayo, rspangler, zork

                                            2013-07-16 and 2013-07-17
                                            benchan, rcui, spang

                                            Continuing issues from previous shift:
                                            New issues:

                                            2013-07-12 and 2013-07-15
                                            djkurtz, mtennant, jrbarnette

                                            2013-07-10 and 2013-07-11
                                            sabercrombie, olofj
                                            • Stout canary security_HciconfigDefaultSettings BVT failure:
                                            • Multiple failures of video_VideoSanity:
                                            • Multiple failures of hardware_VideoDecodeCapable.  Apparently this isn't something we care about:  Update: Seems this is a longstanding issue with an unknown resolution date:
                                            • Daisy canary failures: and
                                            • Filed for peach_pit_canary modules_install failure.
                                            • desktopui_ScreenLocker failure on butterfly.  Appears to be another case of
                                            • autoupdate_EndToEndTest timeout failures:
                                            • Failure of Mario to come out of suspend:
                                            • Another instance of daisy_spring power_Resume: Autotest client terminated unexpectedly: DUT rebooted during the test run.

                                            2013-07-08 and 2013-07-09
                                            reveman, wdg
                                            • hang in chromite tests because gerrit was hanging -
                                            • video_VideoSanity keeps failing -
                                            • hardware_VideoDecodeCapable and other flaky hwtest failures reported by previous sheriffs are still present
                                            • failed to connect to virtual machine failure on stout32 canary (cycled green):

                                            2013-07-04 and 2013-07-05
                                            • autest failures seem to be time-related -- is someone rebooting something? Glancing through the bots, all the autest failures seem to be on the 10:30pm runs.
                                            • some dbus-related stack traces being reported from Chrome crashes in vmtests (mario-incremental and amd64-full). Waiting to see if it recurrs...
                                            • chronic hwtest flake on hardware_VideoDecodeCapable
                                            • chronic hwtest flake on experimental_video_VideoSanity (ignore this one)
                                            • chronic hwtest flake scattered across power_Resume, login_CryptohomeMounted, etc
                                            • fox fails to compile adhd occasionally; filed and uploaded a quick fix
                                            • amd64-generic full failing VmTest on an assertion in CrosDBusServiceImpl::OnOwnership; probably
                                            • wow, does the autofiled bugs "feature" get spammy... the tool you want on is under Actions in the upper left of the bug list, Bulk Edit.
                                            2013-07-02 Tue
                                            dparker, bfreed, ellyjones

                                            2013-07-01 Mon
                                            fjhenigman, dgreid, rminnich
                                            • tree was closed by lumpy canary but cycled green - maybe infrastructure glitch
                                            • stumpy canary red - 254678, dgreid
                                            • alex canary looks like it will cycle green - and it did
                                            • stout canary AUtest failure may be bug 237122 or 235608 though it was closed a couple days ago...
                                            • stout canary HWtest looks same as lumpy above
                                            • stout canary cycled green
                                            • slippy canary restarted, cycled green
                                            • x86 generic full was red, cycled green
                                            • stumpy canary cycled green
                                            • stout32 canary timeout error in VMTest, suspect 209719

                                            2013-06-27 Fri

                                            Tree closed around 0800 PDT, issue 233864

                                            2013-06-26 Thur
                                            dianders, clchiou, olege

                                              2013-06-26 Wed
                                              dianders, clchiou, olege

                                              2013-06-23 Mon & 2013-06-24 Tues
                                              rharrison, wiley, katierh
                                              • security_HciconfigDefaultSettings failure
                                              • symbol upload failures on alex and zgb
                                              • Same Chrome crash as yesterday on amd64 generic full
                                              • Failed to get a good response line from lab servers during reimaging
                                              • daisy_spring failed HWTest
                                              • Tree nicely busted first thing in the morning, investigating ...
                                                • Holding the tree closed until bots start cycling green
                                                • Networking sometimes does not come up after reboot in VMTest
                                                •[bvt] network_Ping failed on daisy-release/R29-4318.0.0
                                                • [bvt] security_ProfilePermissions.login failed on daisy-release/R29-4318.0.0
                                                • [bvt] security_ProfilePermissions.BWSI failed on daisy-release/R29-4318.0.0
                                                • canary: AUTest failure21: [au] autoupdate_EndToEndTest.npo_test_delta failed on butterfly-release/R29-4318.0.0
                                                • Looks like there was a network issue, since a lot of the bots failed due to RPCs failing, etc.
                                              • AUtest failure
                                              • strikes again.
                                                • Looks like a DUT not coming out of reboot
                                                • Is caused by shill not being able to get a DHCP lease on an ethernet port
                                                • cause mostly unknown
                                                • rharrison orinally filed: about this
                                              • was acting up, talked to kamrik about addressing this
                                              • Failed to connect to gerrit to download patches? (Required reverting update to gerrit)
                                              • falco VMTest failing - looks like bad reimage
                                              • Fatal chrome error "Failed to own: org.chromium.LibCrosService" during test automation
                                              • stout and stumpy canary went down at the same time:
                                                • hardware_VideoDecodeCapable control.v4l2 running on stumpy_canary (non-closer)
                                                • hardware_VideoDecodeCapable control.v4l2 running on stout_canary (non-closer)
                                                • security_HciconfigDefaultSettings failed HWTest (stout_canary, butterfly)
                                                • experimental_logging_UdevCrash failing (non-closer)
                                                • Video Sanity test flaking often on Chrome OS HWTest (non-closer)
                                                • [bvt] power_Resume failed on stumpy-release/R29-4315.0.0 (stumpy_closer)
                                                • Seeing some failures on try_new_image, hosts not returning from reboot (non-closer)
                                              • amd64 generic full VMTest failing occasionally due to Chrome crash (dup of 234383)
                                              • Came in to a broken tree, most redness look like flaky network

                                              2013-06-20 Thurs & 2013-06-21 Fri
                                              davidjames, zork

                                              2013-06-18 Tues & 2013-06-19 Wed
                                              vbendeb, shawnn, serya

                                              2013-06-14 Fri
                                              pstew, ihf, hungte

                                              ongoing issues: daisy boot issues
                                              • Mali changes apparently have left the system in an un-bootable state which was first detected by canary HWTest

                                                2013-06-14 Fri
                                                quiche, ihf, hungte

                                                ongoing issues: chrome automation timeouts, devserver problems, power_Resume flake
                                                2013-06-13 Thurs
                                                thieule, tbroch, sjg

                                                2013-06-12 Wed
                                                tbroch, sjg
                                                • Build failure 244055 - fix is in, has not happened since
                                                • (hwtest) power_Resume EarlyWakeupError on daisy_spring canary: 19227247458.

                                                2013-06-11 Tue
                                                • glmark2 failing on egl dependency; cwolfe landed 58195 to fix in glmark-0.0.1-r2329
                                                • R28 release builders failing in ManifestVersionedSync 248559
                                                2013-06-06 Thu & 2013-06-07 Fri
                                                rspangler, ferringb
                                                • (vmtest) Slippy-canary background task hung (logs).
                                                • (hwtest) power_Resume EarlyWakeupError on daisy_spring canary: 247458.
                                                • (hwtest) power_Resume SuspendFailure on link canary: 247460.
                                                • (vmtest) login_LoginSuccess CommandAutomationTimeout on alex canary: 240031.
                                                • (vmtest) network_3GSmokeTest timed out waiting for shill device disable: 247540.
                                                • peach_pit paladin couldn't run build_packages starting at build 882.  vbendeb investigating.

                                                2013-06-04 Tue & 2013-06-05 Wed
                                                wdg, dhendrix
                                                • (hwtest) login_BadAuthentication: 246754
                                                • U-boot was rebased and entailed a manifest change. There was a fair bit of fallout that caused chromeos-bootimage to fail to build and was fixed over the course of a few hours. In a nutshell, the issues were:
                                                  • peach_pit device tree files that were being installed on daisy platforms (snow, spring) and causing problems
                                                  • Missing device tree files for x86 platforms.
                                                  • A bug in the firmware bundling logic that was causing an invalid dependency on CrOS EC for platforms which do not use CrOS EC (parrot, butterfly, stout).
                                                • Network flakiness causing many failures when cloning.
                                                2013-05-31 Fri & 2013-06-03 Mon
                                                rspangler, dkrahn
                                                  • Ongoing - (vmtest) Cryptohome failures: 241789.  Tree's been red all week.  One fix went in over the weekend but it may still be flaky.  Got davidjames to increase retries for login_CryptohomeIncognitoMounted to see if that helps some changes get through the CQ.
                                                  • Ongoing -- (hwtest) Frequent failures in security_ProfilePermissions, platform_Pkcs11ChangeAuthData, video_VideoSanity (experimental tests)
                                                  • Link to tree closer issues is incorrect after the merge with the Chromium issue tracker.  Should be this; updated wiki.
                                                  • Monday AM -- poked troopers about 500 errors in HwTest and AuTest stages on canaries. [cwolfe drive-by]
                                                  2013-05-27 Mon (Holiday) & 2013-05-28 Wed
                                                  dlaurie, sbasi
                                                    • Ongoing -- (hwtest) Frequent failures in security_ProfilePermissions, platform_Pkcs11ChangeAuthData, video_VideoSanity
                                                    • Ongoing -- (hwtest/autotest) Flaky test automation causing frequenty failures
                                                    2013-05-21 Tue & 2013-05-22 Wed
                                                    semenzato, dgarrett, cywang
                                                      • Ongoing -- (hwtest) Flaky power_Resume test on canary builders: 242788220014
                                                      • Ongoing -- (buildbot) autotest-telemetry build failed on PFQ, ASAN builders: 242770
                                                      • Ongoing -- (autest) Flaky autoupdate_EndToEndTest: 235608
                                                      • Ongoing -- (vmtest) Hung then killed on Falco, Peppy canaries: 242470
                                                      2013-05-13 Mon & 2013-05-13 Tues
                                                      charliemooney, sheu
                                                      • Ongoing -- Lots of problems with the AU rebooting canary builders: 235608
                                                      • Fixed -- The PFQ's are mad about thier dependencies when building expected_deps: 240601
                                                      • Fixed -- Some PFQ's were crashing due to a typo: 239754
                                                      2013-05-09 Thu
                                                      • daisy_spring canary closed tree with media-libs/secomx build failure: Possibly due to new clang syntax checking for cros_workon-able packages.

                                                      2013-05-08 Wed
                                                      • stout canary closed tree with AUtest failure:
                                                      • mario incremental failed: reopened tree since it feels like flake

                                                      2013-05-07 Tue
                                                      • dennisjeffrey CL killed the Commit Queue. Since it moved an autotest from one package to another, it affected successive tests as well. Needed to add a "!" (remove) dependency to remove/update the origin of the files before installing the new package. kudos to davidjames for clobbering everything and explaining how to fix.
                                                      • dgreid changes  CL 49812 and CL 49921 enabled functionality that is broken in chrome version from two days ago that ChromeOS is currently using. ToT chrome is fixed but chromeOS didn't pick up the ToT last night due to other Chrome nightly build failures. dgreid will resubmit once ChromeOS has newer Chrome.
                                                      • CL adding apiclient to test image broke on canaries with a dev_install failure on VMTest.  See, and CLs 49815 and 50308.
                                                      2013-05-06 Mon
                                                      josephsih, piman
                                                      • mario incremental: BuildPackages failed due to a platform2 ebuild ( Revert the patch, and the builder cycled green.
                                                      • link canary: autest [au] failed report.
                                                      • network_LTEActivate flakiness
                                                      2013-05-02 Thu
                                                      • link canary, parrot canary failed at vmtest: "Unhandled JSONInterfaceError : Unable to get browser_pid over automation channel on first attempt."
                                                        • Root cause: "crossystem hwid" failed.  cat: /sys/devices/platform/chromeos_acpi/HWID: No such file or directory. 
                                                        • Filed a bug which was merged to
                                                      • x86-alex canary: vmtest failed "Unhandled AutomationCommandTimeout: Chrome automation timed out after 45 seconds for {"skip_image_selection": true, "command": "SkipToLogin"}"

                                                        2013-05-02 Thu
                                                        posciak, garnold, seanpaul
                                                        • x86-alex canary failed hwtest step with "Unhandled PackageInstallError: Installation of pyauto_dep(type:dep) failed"
                                                          • Couldn't root cause it, so filed a bug at and reopened 
                                                        • security_HciconfigDefaultSettings autotest failures due to
                                                        • Session manager did not restart after logout error on CryptohomeIncognitoUnmounted, filed
                                                        • Filed for address sanitizer segfault on amd64-generic during vmtest

                                                        2013-05-01 Wed
                                                        posciak, garnold, seanpaul
                                                        • x86-mario canary failed au step with “FAIL: Unhandled timeout: timed out”
                                                          • stumpy, stout & daisy also failed on autest step
                                                          • suspect there was an AU outage/problem last night which caused this
                                                          • Filed to track
                                                        • 03:04 lumpy nightly chrome pfq failed in VMTest
                                                          • this crash ( is being tracked in
                                                        • 05:34 stout32 hwtest failed with “ERROR: All hosts with HostSpec ['board:stout32', 'pool:bvt'] are dead!”
                                                          • All stout32 hosts in cautotest are marked “Repair Failed”
                                                          • Filed
                                                        • 05:34 parrot canary failed in unittest
                                                          • seanpaul not sure what the problem is, so filed
                                                          • I think it's caused by, reverted with and reopened

                                                        2013-04-29 Mon
                                                        • Parrot canary failure reported 2-day ago. The recent 2 parrot builds goes green and other builds also look good. Reopen the tree.
                                                        • Daisy canary failed, autoupdate_EndToEndTest could not verify that update was successful,

                                                        2013-04-26 Fri
                                                        rcui, taysom, spang
                                                        • Link failed again in power_Resume
                                                        • Stout BVT: power_Resume: Sanity check failed: did not try to suspend -
                                                        • Lumpy canary failed on repeat of
                                                        • Lumpy paladin failure in desktopui_ScreenLocker  test
                                                        • Parrot flaky test login_CryptohomeUnmounted
                                                        • Lumpy chrome crash - this may be a new problem but we have only seen it on lumpy
                                                        • Asan builder failing BuildPackages on the chromium.memory waterfall -
                                                          2013-04-25 Thur
                                                          rcui, taysom, spang
                                                          2013-04-18 Thur
                                                          mtennant, jrbarnette, dshi (hwlab), mukai (Chrome on ChromeOS)
                                                          • devinstall_test failure on all canaries -
                                                          • chrome crashes in CrosLanguageOptionsHandler::GetLanguageListInternal on a few builders -
                                                          2013-04-17 Wed
                                                          mtennant, jrbarnette, dshi (hwlab), mukai (Chrome on ChromeOS)
                                                          2013-04-15 Mon
                                                          vapier, jwerner
                                                          • (power_Resume failure "Could not find start_resume_time entry" due to SSD hardware flake)
                                                          • (unexpected reboot during login_LoginSuccess... can probably happen during all UITests)
                                                          • VMTest testUpdateKeepStateful error (cannot connect to KVM instance)... suspected flake
                                                          • python 2.7 upgrade breaking hwtests
                                                          • coreboot repo shuffling; any coreboot related errors -> reinauer
                                                          2013-04-12 Fri
                                                          fjhenigman, yusukes, dbasehore, rbyers (Chrome on ChromeOS)
                                                            • Couple cased of lab flake
                                                            2013-04-11 Thu
                                                            fjhenigman, yusukes, dbasehore sjg now, rbyers (Chrome on ChromeOS)
                                                            • link, parrot, stout canaries all had a string of hwtest fails ("power_Resume FAIL: EarlyWakeupError(1): Woke up at ...") but then started passing
                                                              • doesn't look like or
                                                              • that they came and went together suggests it was code changes, but I haven't dug into those
                                                              • open a bug if it happens again
                                                            • fox kernel > 8MB - opened
                                                            • sonic canary is wip - it shouldn't have closed the tree - opened
                                                            • libftdi uprev reverted - had to clobber builders -
                                                            • gerrit-int went down briefly - caused tests to timeout and paladins to hang
                                                            • missing dependency in chromeos-ec - thanks ellyjones for
                                                            • Chrome LKGM builder failing for a few days in cros_best_revision -
                                                              2013-04-10 Wed
                                                              katierh, clchiou, haruki, gedis(shadow)
                                                              • Unhandled AutomationCommandTimeout for {"skip_image_selection": true, "command": "SkipToLogin"} - already noted at

                                                              2013-04-09 Tue
                                                              katierh, clchiou, gedis(shadow)
                                                              • Daisy power_resume failure - already noted at
                                                              • ConnectionHealthChecker failures across the board - reverted -
                                                              • butterfly autoupdate_EndToEndTest.npo_test_delta flake - bug filed
                                                              2013-04-08 Mon
                                                              petkov, quiche, pstew
                                                              2013-04-05 Fri
                                                              petkov, quiche, pstew
                                                              2013-04-03 Wed
                                                              gabeblack, dgreid, sheckylin
                                                              • 01:19 autoupdate_EndToEndTest.parrot_nmo_test_delta flakiness.
                                                              • 8:30 everything broken, EndToEndTest, Autoupdate, desktop_VideoSanity, all failing on different boards.
                                                              • 10:45 try to re-open after disabling VideoSanity, AUTest and power_Resume flakes.
                                                              2013-04-02 Tue
                                                              rminnich, sonnyrao: west coast

                                                              2013-04-01 Mon
                                                              rminnich, sonnyrao: west coast
                                                              • 10am Link Canary Failed due to Archive step Time Out
                                                              • 10am Daisy Canary has been red all weekend -- found out about
                                                              • 1pm stout canary failed with Archive time out - opened
                                                              • 2pm x86-zgb canary failed with Archive time out -
                                                              • 8pm build packages started failing due to a gtest uprev and an associated python bug -
                                                              • Chrome ebuild also failed to uprev due to above issue
                                                              2013-03-29 Fri
                                                              bfreed, vbendeb: west coast
                                                              2013-03-28 Thu
                                                              bfreed, vbendeb: west coast
                                                              • 17:55pm  - again connectivity issue, on x86 generic ASAN
                                                              • 17:37pm tree reopened
                                                              • 17:26 pm - another connectivity failure, davidjames took "amd64 generic ASAN" builder down as it seems more prone to experiencing this problem
                                                              • 16:29 pm Tree reopened, filed
                                                              • 16:16pm "Could not resolve host:"
                                                              • 16:15pm - tree reopened
                                                              • 15:39pm - "Unable to look up (port 9418) (Name or service not known)" filed to deal with external dependency
                                                              • 2:45pm: "no space left on device" on incremental builder, fixed by davidjames.
                                                              • 2pm: Same pool:bvt issue as below, this time with x86-zgb.
                                                              • 1pm: As with now-closed, "All hosts with HostSpec ['board:parrot', 'pool:bvt'] are dead".  Suspect lab issue.
                                                                • Can view the list by going to http://cautotest/afe/#tab_id=hosts, then selecting Platform "parrot", then selecting Label "pool:bvt".
                                                              • 3am: vmtest failure closed tree on "amd64 generic ASAN". Subsequent builds worked, so maybe denniskempin fixed it.
                                                              2013-03-27 Wed
                                                              wdg,dparker: west coast
                                                              • 4pm: Closed tree on "butterfly canary" Command "crossystem hwid" failed
                                                              • 2pm: Closed tree on "mario incremental" Command "crossystem hwid" failed
                                                              • 1pm: Shill build failure closed tree on "x86 generic ASAN" and "amd64 generic ASAN". Reverted shill change

                                                              2013-03-26 Tue
                                                              wdg,dparker: west coast
                                                              • 3pm: Closed tree on "x86-zgb canary" -- writing off as test flake but starting to think we blame brand new chrome version...
                                                              • 3pm: Closed tree on "daisy canary" Device rebooted during power_Resume.
                                                              • 3pm: Closed tree on "x86-mario canary" Unhandled AutomationCommandTimeout
                                                              • 2pm: Closed tree on "x86 generic full". login_CryptohomeUnmounted failed but may be an underlying test framework issue.
                                                              • 8am: (not a tree-closer, but...) Build 1187, Parrot Canary: Failed cbuildbot failed vmtest failed report

                                                              2013-03-25 Mon
                                                              adlr,dhendrix: west coast
                                                                • (python free()'ing invalid pointers) strikes multiple times.

                                                                2013-03-22 Fri
                                                                adlr,dhendrix: west coast
                                                                • 1pm: timeout during archive

                                                                2013-03-20 Thu
                                                                quiche,wiley: west coast
                                                                • 8am: XXX chromium.chromiumos VMTest failure
                                                                • 2am: update engine failure on parrot-canary
                                                                • 1am: desktopui_VideoDecodeAcceleration failure on x86-zgb
                                                                • 12am: desktopui_VideoDecodeAcceleration failure on x86-mario

                                                                2013-03-20 Wed
                                                                quiche,wiley: west coast
                                                                • 11pm: desktopui_VideoDecodeAcceleration failure on x86-alex
                                                                • 7pm: desktopui_VideoDecodeAcceleration failure on x86-alex, x86-mario,  x86-zgb
                                                                • 7pm: AUTest failure on x86-mario
                                                                • 5pm: desktopui_VideoDecodeAcceleration failures on x86-alex, x86-mario, x86-zgb
                                                                • 5pm: buildbot failures on amd64-generic-incremental, due to disk filling up
                                                                • 1pm: desktopui_VideoDecodeAcceleration failures on x86-alex, x86-mario, x86-zgb
                                                                • 8am: build_RootFilesystemSize failure on link
                                                                • 8am: desktopui_VideoDecodeAcceleration failures on x86-mario, x86-alex 
                                                                • 8am: daisy incremental failure: kernel gerrit mirror out-of-sync
                                                                • 4am: chrome PFQ failure on amd64-generic: kernel gerrit mirror out-of-sync
                                                                • 1am: build_RootFilesystemSize failures on link, stout 
                                                                • 1am: desktopui_VideoDecodeAcceleration failures on x86-alex, x86-mario, daisy, x86-zgb, stumpy
                                                                2013-03-19 Tue
                                                                tbroch,thieule: west coast
                                                                • 1pm: kernel warning in power_Resume on daisy
                                                                • 8am: build_RootFilesystemSize fails as rootfs <100MB across most x86 systems
                                                                • 8am: experimental_desktopui_VideoSanity
                                                                • 8am: network problem leading to vmtest fail
                                                                2013-03-18 Mon
                                                                tbroch,thieule: west coast
                                                                • 3pm: Transient network problem while emerging chrome
                                                                • 9am: UploadArtifact task timeout (1800secs)
                                                                • 8am: intermittent (hopefully) 'Exception: Missing uploads.'  
                                                                • 8am: experimental_desktopui_VideoSanity

                                                                2013-03-14 Thursday
                                                                dlaurie, sbasi
                                                                • 8am: ARM build broken overnight due to build flags change, reverted here:
                                                                • 8am: GDB issues causing problems for Chrome PFQ, this "fixed itself" on retry
                                                                • 1pm: Commit queue stuck, mario-paladin waiting for alex-paladin

                                                                2013-03-07 2013-03-08 Tues-Wed
                                                                ferringb, charliemooney

                                                                2013-03-08, Fri
                                                                sjg, sabercrombie

                                                                2013-03-08, Fri
                                                                rspangler, sabercrombie
                                                                • Flake on amd64 generic ASAN uploading results to google storage
                                                                • Canaries failing due to; can't download libva-1.1.0.tar.bz2.  Uploaded what we hope is the right file.  It turns out that was not the right thing to do.  The problem stemmed from two versions of libva carrying the 1.1.0 designation, which led to an old cached version messing up the download process on the canary buildbots.  Mike Frysinger removed these old files.
                                                                • Stout canary failed with "NoHostsException: All hosts with HostSpec ['board:stout', 'pool:bvt'] are dead!" -  johndhong and jrbarnette investigated; lots of systems are in Repair Failed state, probably due to a DHCP problem this morning.  They kicked off a verify on all hosts, and the hosts started coming back on their own.
                                                                • Paladins failed with "ERROR: Project name mismatch for /mnt/host/source/src/platform/depthcharge (found chromiumos/platform/depthcharge, expected chromeos/platform/depthcharge)".  Probably caused by rev 1 of
                                                                • mario paladin was stuck waiting for stout paladin, but stout was idle.  Aborted mario paladin build; all paladins seem to be building normally now,

                                                                2013-03-04 - 2013-03-05, Mon-Tue
                                                                dianders, dkrahn
                                                                • dianders: ASAN failures (use after free).  Appears to be intermittent, but a real bug.
                                                                • dianders: ASAN failure "No such file or directory: '/home/.shadow'".  Digging into logs showed cryptohome not starting.  Digging more showed "cryptohome: symbol lookup error: /usr/lib64/ undefined symbol: __asan_handle_no_return".  Liam identified as  Reverted and chumped.  Re-opened to track.  Re-opened tree.
                                                                • Parrot canary failed with
                                                                • dianders: Some strange transitory failures across many builders with "update_scripts Sync buildbot slave files failed ( 9 secs )".  Didn't seem serious and went away on its own, but David James tracked it down as
                                                                • dianders: Failure with SDK builder on vboot_reference (it couldn't find <tss/tcs.h>).  Filed  Chumped in a CL that ought to fix this.
                                                                • dianders: Tree was closed overnight with x86 generic full failure.  A timeout building chromite?  Didn't reproduce...
                                                                • dianders: Hit the x86 generic full failure again.  Filed
                                                                • More 'update_scripts' failures: tracking in
                                                                • dianders: Got a BVT failure in experimental_desktopui_VideoSanity on x86-alex canary.  Filed

                                                                2013-02-28 - 2013-03-01, Thur-Fri
                                                                sheu, dgarrett
                                                                • Im Westen, (fast) nichts Neues
                                                                • git infrastructure issue takes down a bunch of builders:
                                                                • rename of gerrit-int repos without updating manifest takes down more builders:

                                                                2013-02-26 - 2013-02-27, Tue-Wed
                                                                grundler, benchan
                                                                • daisy powerResume failing on chromeos1-host5-rack4 crosbug/39260
                                                                • documented daisy repro case. (
                                                                • link canary failure (crosbug/p/17893) (found dups of this bug too)
                                                                • stout canary failure (
                                                                • alex/stumpy failed power_Resume due to new warning in Kernel - was reverted (
                                                                • parrt-canary failed due to "Session manager did not restart" (after following a chain of "merged into" -->
                                                                2013-02-22 - 2013-02-25, Fri-Mon
                                                                  2013-02-20 - 2013-02-21, Wed-Thu
                                                                  taysom, garnold, zork
                                                                  2013-02-19 - 2013-02-20 Mon, Tues
                                                                  ellyjones, reinauer, sque

                                                                  Feb 14, 15 Thu, Fri
                                                                  snanda, posciak

                                                                  Feb 12, 13 Tue, Wed
                                                                  dparker, semenzato
                                                                    Feb 8, 11 Fri, Mon
                                                                    jaysri, milleral, olege
                                                                    • Filed for power_Resume gen6_gt_check_fifodbg issue
                                                                    • Someone put a test that belongs in autotest-chrome into autotest-tests again, so BuildTarget is having to repeat emerging of autotest-tests again.
                                                                    Feb 4, 5 Mon, Tue
                                                                    • Failure on security_RestartJob for x86-mario canary, looks like flake, filed Reopened tree
                                                                    • Failure on power_Resume for link canary. looks like Reopened the tree
                                                                    • Came into a red tree on Monday(failure on stumpy), all the builders were green. Assuming it was flake, possibly from the fun with the HW lab over the weekend
                                                                    Jan 31, Feb1 Thu, Fri
                                                                    vpalatin, bleung, hungte, benrg
                                                                    Jan 29, 30 Tue, Wed
                                                                    bfreed, bhthompson, katierh, petermayo
                                                                    • tree still red Tuesday morning due to - revert of nss/nspr upgrade is resulting in segfault in local shlibsign.  These are security packages that might also cause the sandbox failure of
                                                                    • tree throttled Wednesday morning due to - timing out on VMTest update steps.  Failed to get through normal channels to find a flaw, rebooting the mario paladin build slave was sufficient.

                                                                    Jan 25, 28 Fri, Mon
                                                                    mtennant, gabeblack
                                                                    • - revert of nss/nspr upgrade is resulting in segfault in local shlibsign.  Found after hours by vapier.  P0 TreeCloser unresolved.
                                                                    • - Chrome crash on startup in renderer thread.  Causing major problems and possible overnight red tree.  P0 TreeCloser unresolved.
                                                                    • - vmtest testInterruptedUpdate failure in canary builds.
                                                                    • - git clone command in Chrome/Chromium PFQ builders suddenly asking for password.  Resolved.
                                                                    • - shill unittests segfault, intermittent, fix at:  Tree throttled as fix worked its way through commit queue then all canaries.  Bug got through commit queue originally because it is intermittent.  Resolved.
                                                                    • - stout canary - vmtest - testInterruptedUpdate - cannot allocate memory
                                                                    Jan 23,24 Wed,Thur
                                                                    rcui, sjg, dbashore

                                                                    Jan 22, Tu
                                                                    jamescook (chrome-on-cros)
                                                                    • - PyAutoFunctionalTests.FULL flakily reporting sig 6 from an intentional Chrome crash

                                                                    Jan 17 Thurs, Fri
                                                                    mkrebs, rminnich, sheckylin

                                                                    • Xorg signal 6 
                                                                    • on stout, seems not to be fixed, missing pxe rom for virtio.
                                                                    • again: "amd64 generic full" closed the tree this time.
                                                                    • If you're going to be helpful and post error messages, the best way to be sure you don't say anything you should not is to mention the error, the software, but not the file name.
                                                                    • We ought to just fix this vm error due to a missing pxe_virtio.bin. I will see what I can do.
                                                                    • Repeat failure in HWTest: "x86-mario-release/R26-3571.0.0/bvt/platform_CryptohomeMount     ABORT:".
                                                                    • (created): login_CryptohomeMounted failed with "Login timed out".  Couldn't find a similar bug that was open ( seemed to be the closest closed issue).
                                                                    • [mkrebs] Saw a bunch of "Chrome PFQ Failing to uprev Chrome" emails on the 17th.  Was allegedly a failure to build a certain package, but the fix was taking a while to land.  They seem to be fine now, but my best guess is that ellyjones@ actually got them working early on the 18th (on IRC he mentioned something about restarting the mario paladin around that time).

                                                                    Jan 15 Tue, Wed
                                                                    chinyue, sonnyrao, yusukes
                                                                    • x86-alex-release/R26-3560.0.0/bvt/experimental_kernel_fs_Inplace_SERVER_JOB FAIL: HTTP Error 500: Internal Server Error
                                                                    • "desktopui_ScreenLocker failure in bvt on parrot-r26" hits on stumpy canary, stout canary, x86-alex, parrot bvt, possibly x86-generic as well
                                                                    • chromium for chromium-os builder started failing VMTests due to automation timeouts around 1pm on Wednesday, might affect Chrome on ChromeOS starting Thursday

                                                                    Jan 11, 14 Fri, Mon
                                                                    dgreid, pstew
                                                                    • [bvt] failed at login_CryptohomeMounted: Cryptohome created a vault but did not mount (and Host did not return from reboot) - parrot canary
                                                                    • power_Resume test failing with "gen6_gt_check_fifodbg.isra.6+0x36/0x48()"
                                                                    • power_Resume test failing with "EarlyWakeupError(1)"

                                                                    Jan 9, 10 Wed, Thu
                                                                    djkurtz, jrbarnette, olofj

                                                                    Jan 7, 8, Mon, Tue
                                                                    clchiou, jwerner, josephsih

                                                                    All canaries have been failing randomly in login_Cryptohome* tests due to Chrome team has pushed a fix that should get synched during the night between Jan 8th/9th. If the same issue still shows up after that, please let them know!
                                                                    • parrot-canary: login_CryptohomeMounted
                                                                    • failed and many *_SERVER_JOB failed on daisy canary
                                                                    • [bvt] failed on platform_CryptohomeMount on x86-mario canary
                                                                    • try_new_image: Host did not return from reboot. Connection timed out.
                                                                    • stumpy-canary and lumpy-canary died from an experimental test because the crash server timed out on symbolizing the crash dumps
                                                                    • parrot-canary and kiev-canary: login_CryptohomeUnmounted. This can probably happen on all the login_Cryptohome* tests.
                                                                    • power_Resume fails on stout... flaky NIC sometimes fails to resume
                                                                    • power_Resume abort bvt
                                                                    • canary: login_CryptohomeMounted : Session manager did not restart after logout

                                                                    Jan 3, Jan 4, Thu, Fri
                                                                    dbasehore, wfrichar, hychao

                                                                    Dec 26, Dec 27, Wed, Thu
                                                                    dkrahn, sque, miletus
                                                                    • vmtest login_CryptohomeMounted: browser hang during shutdown (multiple occurrences)
                                                                    • vmtest Unable to connect to X server causing 2400 second timeout (multiple occurrences)
                                                                    • desktopui_VideoSanity fails to load video (not a tree closer)
                                                                    • stout BVT. platform_Pkcs11Events (not a tree closer, multiple occurrences)
                                                                    • python2 sig 6 during login_BadAuthentication test
                                                                    • login_CryptohomeIncognitoUnmounted of VMTest has failed in login timed out for >5 times
                                                                    • Login_BadAuthentication failed during HWTest (BVT) on Alex

                                                                    Dec 20, Dec 21, Thu, Fri
                                                                    rspangler, dhendrix, dgozman
                                                                    • vmtest login_CryptohomeMounted due to chrome crash (multiple failures)
                                                                    • vmtest login_CryptohomeUnmounted due to chrome or X crash
                                                                    • vmtest login_CryptohomeUnmounted times out waitng for UI to restart at the end of the test
                                                                    • vmtest unable to connect to remote host (ssh: connect to host port 9222: Connection refused)
                                                                    • vmtest desktopui_ScreenLocker failing
                                                                    • stumpy-canary is failing power_Resume test with warning in i915_drv.c.
                                                                    • daisy incremental build failure, believe git mirror was out-of-sync ("git-2_branch: changing the branch failed")
                                                                    • kiev, daisy, stout paladins failed a build, and mario paladin was stuck waiting for them.  Killed mario and forced a rebuild.  (In retrospect, just killing mario paladin was probably sufficient)
                                                                    • vmtest Unable to connect to X server causing 2400 second timeout
                                                                    • trying to get some Chrome devs to look into Chrome shutdown crash (which in turn caused session manager timeouts and VMTest failures)
                                                                    • vmtest login_CryptohomeMounted timeout waiting for login prompt

                                                                    Dec 18, Dec 19, Tue, Wed
                                                                    kochi (non-PST), dlaurie, puneetster
                                                                    • started open with status "hwtest failure = dependencies_info not being generated properly ->".
                                                                    • login_CryptohomeMounted timed out happend 3 times on x86 generic incremental.
                                                                    • desktopui_ScreenLocker fail with timeout on mario incremental. happened only once.
                                                                    • empty dependency_info causing hw_tests failure: LOTS
                                                                    • autotest-tests failing the first build and succeeding on retry, suspect desktopui_VideoSanity, email sent to developer
                                                                    • butterfly-canary failed with "Could not parse devserver log" possibly, was successful on next build
                                                                    • amd64-generic-full failed vmtest login_CryptohomeMounted due to chrome crash, filed
                                                                    • tlsdate issue determining its release number and causing failures in uprev step, fixed with
                                                                    • login_CryptohomeUnmounted causing Chrome/X to crash, filed
                                                                    • 12/19 11AM: Still seeing lots of vmtest failures due to issue 37337
                                                                    • x86-zgb canary failed BuildTarget step for zgb_he phase because build_packages was killed, filed

                                                                    Dec 14, Dec 17, Fri, Mon
                                                                    sabercrombie, thieule, zoro, rongchang
                                                                      Dec 10, Dec 11, Mon, Tues
                                                                      charliemooney, tbroch, milleral(10th), yjlou(11th)
                                                                      • went down, taking out chromiumos sdk during buildtarget
                                                                      • A BuildTarget reported back with a warning from a python crash while building chrome. 
                                                                      • buildbot threw an exception during VMTest due to a failed assertion.
                                                                      • Daisy TPM related activities need >= 2min to complete not current 45sec.  Fix in and propagating.

                                                                      Dec 4, Dec 5, Tue, Wed
                                                                      quiche, anush, spang
                                                                      • hit this on an x86-generic-full build and a daisy-canary build
                                                                      • daisy incremental build failure, believe git mirror was out-of-sync
                                                                      • link canary BVT failure, tree cycled green
                                                                      • x86-mario canary failure, google storage flake
                                                                      • - stout BVT. platform_Pkcs11Events (not a tree closer, multiple occurrences)
                                                                      • - stout BVT. platform_Pkcs11ChangeAuthData (not a tree closer)
                                                                      • - caused a snow BVT failure (not a tree closer)
                                                                      • false alarm email for buildbot failure stout-canary. sbasi checked the BVT results, and says the tests passed.
                                                                        suspects network flake causing buildbot to believe the BVT failed.

                                                                      Nov 30, Dec 3, Fri, Mon
                                                                      dparker, piman, fjhenigman (Mon. only)

                                                                      • google storage flake during archive step on x86-alex
                                                                      • - kiev BVT. Power_resume fail on reading RTC fail after 10 retries. 
                                                                      • - daisy BVT. platform_CryptohomeChangePassword fails to migrate password
                                                                      • - mario-r23 BVT. login_CryptohomeUnmounted times out waitng for the UI to restart at the end of the test.
                                                                      • stout-canary. HWtest failure due to infrastructure problems in the hwtest lab.
                                                                      • x86-mario canary. ABORT on security_ptraceRestrictions. Believed to be a test flake or lingering fallout from test lab going down (?)
                                                                      • stumpy-canary x 2. Power_resume error with warning in i915_drv.c.
                                                                      • Power_resume failure reading RTC on kiev & lumpy canaries.

                                                                          Analysis of the BuildTarget warnings

                                                                      Build http://chromegw/i/chromeos/builders/x86-mario%20canary/builds/2875/steps/BuildTarget/logs/stdio started Wed Nov 28 16:36:55 2012 looks ok while all subsequent builds (the next started at Wed Nov 28 22:36:22 2012) have this:
                                                                      WARNING: The following packages failed the first time,
                                                                      but succeeded upon retry. This might indicate incorrect
                                                                      and this:
                                                                      autotest-tests-0.0.1-r3342: ERROR:root:Dependency pyauto_dep does not exist
                                                                      so the problem could be a change introduced between those two times.  milleral on irc suggested a
                                                                       "test was likely added to autotest-tests.___.ebuild that needs to be in autotest-chrome.____.ebuild" but I don't see a change there at the right time.

                                                                      Nov 20-21, Tue, Wed
                                                                      reinauer, sleffler, fjhenigman
                                                                      • crbug/36566 CQ build failures in update_engine with "unrecognized command line option "-Wno-c++11-extensions""; fixed by kliegs
                                                                      • crbug/29895 filed by Prashanth for x86-alex-r23 bvt failure in power_Resume
                                                                      • crbug/35908 desktopui_UrlFetch.not-live FAIL hit three times overnight in the chrome pfq
                                                                      • All quiet on Tue the 20th

                                                                        Nov 16, Nov 19, Fri, Mon
                                                                        waihong (tpe), keybuk, garnold

                                                                        Nov 14 - Nov 15, Wed, Thu
                                                                        jamescook (cros gardener)
                                                                        • BVT chrome sig 11 on shutdown, crash in ash GetDisplayManager() due to metrics logging, official builds only
                                                                        • lumpy (perf) failing HWTest, "All hosts are dead" in [try_new_image] results status.log, infrastructure problem, fixed
                                                                        • ChromeOS Crash in WindowOpenPanelTest.ClosePanelsOnExtensionCrash
                                                                        • Snow: BVT login_LoginSuccess failure due to cryptohome / TPM issue (only affects chromeos1-rack5-host3, maybe preMP hardware issue?)
                                                                        Nov 9, Fri
                                                                        puneetster, sheu, kinaba
                                                                        Nov 6, Tue
                                                                        gpike, grundler, reveman
                                                                        • parrot canary, Crash in HWTest - enterprise_DevicePolicy
                                                                        • Chrome PFQ, Chrome/Init getting a lot of SIGBUS errors preventing Chrome from revving during VMTests
                                                                        • parrot canary, desktopui_NaClSanity: Failed to installed SecureShell extension
                                                                        • x86-alex canary, experimental desktopui_DocViewing failure closed tree
                                                                        Nov 5, Mon
                                                                        • x86-alex canary, Archive bug, command timed out: 9000 seconds without output (davidjames fixed it.)
                                                                        • chromiumos sdk failed SDKTest. make: *** [build/shims/] Error 1
                                                                        • amd64 generic full: Timeout in UrlFetch.not-live

                                                                        Oct 31 - Nov 1, Wed, Thu
                                                                        taysom, petermayo, wdg
                                                                        • Kiev paladin hwclock bug, same on link, timeout in URLFetch
                                                                        • Daisy flake; said there were no changes but widevine was failing to link properly
                                                                        • daisy, parrot problems
                                                                        • reverted change I02955c8e
                                                                        • Google died but it got better.
                                                                        • daisy incremental ran out of space, clobbered chroot
                                                                        • CQ got stuck
                                                                        Oct 29 - Oct 30, Mon, Tue
                                                                        wfrichar, pstew, cwolfe
                                                                          Oct 23 - Oct 24, Tue, Wed
                                                                          katierh, olege, mkrebs
                                                                          Oct 19 - Oct 22, Fri, Mon
                                                                          jrbarnette, mtennant, hungte
                                                                          Oct 17 - Oct 18, Wed, Thu
                                                                          dgreid, dbasehore
                                                                          • Day starts with tree closed due to Link now being over-size.
                                                                          • lumpy canary: HWTest failed likely due to lab networking issue
                                                                          • link canary: warning on build due to missing coreboot dependency
                                                                          • Single VMTest failures on all canaries (passed afterwards)
                                                                          Oct 15 - Oct 16, Mon, Tue
                                                                          bfreed, vbendeb
                                                                          • hit the mario and zgb canaries.
                                                                          • A few hours later, canaries now fail HWTest with "TimeoutError: Timeout occurred- waited 8400 seconds."  cmasone is investigating network outage.
                                                                          • link canary: desktopui_DocViewing fails in doc_viewing.DocViewingTest.testOpenOfficeFiles with "Extension could not be installed".
                                                                          • link canary: desktopui_NaClSanity fails in secure_shell.SecureShellTest.testLaunch with "Extension could not be installed".
                                                                          • link canary: desktopui_DocViewing fails in doc_viewing.DocViewingTest.testOpenOfficeFiles with "Chrome automation timed out after 45 seconds"
                                                                            • Throttling the tree.  I see consistent failures on various tests and on "try-new-image-*".
                                                                            • Not sure if this is server overload or chrome causing the failures.  Nothing points to chrome-os, best I can tell.
                                                                          • A set of 3 CLs broke shill in a lumpy PFQ. fixed it.
                                                                          • x86 alex canary: HWTest during SuitePrep: Connection timed out
                                                                          Oct 11 - Oct 12, Thu, Fri
                                                                          rcui, sjg
                                                                          • Link failed on BVT HWTest again
                                                                          • HWTest fails power_Resume with 'Autotest client terminated unexpectedly'
                                                                          • Noticed that failing test has a status log which shows success. According to sosa this is a network flake. Ignoring.
                                                                          • login_CryptohomeIncognitoUnmounted timeout.

                                                                          Oct 9 - Oct 10, Tue, Wed
                                                                          rharrison, bleung, sonnyrao
                                                                          • Daisy full failing due to issue with binutils (Appears to be a repeat of
                                                                          • amd64 generic incremental timed out after 8 hours on BuildTarget (Pinged troopers@, since this bot appears to be sick)
                                                                          • Link failed on BVT HWTest
                                                                          •,, x86-zgb failed on BVT HWTest
                                                                          • qemu-kvm failed to link with glib-2.32.4-r1
                                                                          • Came into very red tree due to bad WebKit roll and failure of the PFQs to prevent Chrome on ChromeOS from updating. This issue was created from the fact that we were patching WebKit in ChromeOS, there is a thread discussing that we shouldn't do this again. Many late arriving bots failed after the fix was in and the tree had to be reopened.
                                                                          • some canary builders (parrot, stumpy, kiev) failed in svn update. Connection reset by peer
                                                                            Oct 3 - Oct 4, Mon, Tue
                                                                            gpike, sjg, kamrik
                                                                            • failed trying to treat IP address as a float
                                                                            • butterfly R24 Chrome crash in ExtensionAppProvider (same bug has hit R23 recently) (twice)
                                                                            • svn flakiness downloading / unpacking chromeos_chrome (again)

                                                                            Oct 1 - Oct 2, Mon, Tue
                                                                            piman, rspangler, ellyjones

                                                                            Sept 27 - Sept 28, Thu, Fri
                                                                            rspangler, keybuk, rongchang
                                                                            • svn flakiness downloading / unpacking chromeos_chrome (twice).
                                                                            • ManifestVersionedSync failed on all canaries.  rcui, ferringb determined gerrit replication was failing and fixed it.
                                                                            • chromium:150568: canaries failed with "FAIL: Unhandled JSONInterfaceError: Chrome automation failed" (multiple times)
                                                                            • VMTest timeout: x86_generic_incremental.

                                                                            Sept 25 - Sept 26, 2012, Tue, Wed
                                                                            dianders, davidjames, yoshiki
                                                                            • test failures in BVT and VMtest with Unhandled JSONInterfaceError: Chrome automation failed prior to timing out ...
                                                                            • Chrome PFQ vmtest failure - alex and lumpy - Failed to installed SecureShell extension - Fixed, but see 34796 below
                                                                            • Daisy chrome PFQ: Not a valid NaCL executable - Fixed
                                                                            • desktopui_DocViewing failed on lumpy canary - Any repeats?
                                                                            • hitting canaries (like butterfly build 367); originally this was thought to be but that's because I didn't dig deep enough (I just saw the "Chrome automation failed...").  You need to dig into the artifacts and look for the "dmp.txt" file to see the real chrome crash. - Hitting all the time
                                                                            • Secure Shell did not get correct exit message
                                                                            • Saw some strange try_new_image failures in  milleral thought they were just warnings so no bug filed, but he's going to look at them. Failures are due to
                                                                            • 'desktopui_LoadBigFile: ERROR: The big file did not load' during x86-mario hw 

                                                                            Sept 21 - Sept 24, 2012, Fri, Mon
                                                                            olofj, dparker, chinyue

                                                                            Sept 19 - Sept 20, 2012, Wed - Thu
                                                                            marcheu, thieule, falken, sbasi, armansito
                                                                            Sept 13 - Sept 14, 2012, Thu - Fri
                                                                            jaysri, gabeblack, sheckylin
                                                                            Sept 12, 2012, Wed
                                                                            semenzato ,pstew
                                                                            Sept 11, 2012, Tues
                                                                            wdg, semenzato ,pstew
                                                                            Sept 7 - Sept 10, 2012, Fri - Mon
                                                                            rcui, tbroch , josephsih

                                                                              Sept 6, 2012, Thu
                                                                              tlambert, vbendeb, kochi

                                                                              Sept 5, 2012, Wed
                                                                              tlambert, vbendeb, kochi (9/5-6 JST)

                                                                              Sept 4, 2012, Tues
                                                                              mtennant, sonnyrao, vapier, kochi (9/5-6 JST)
                                                                              • Tree started the day closed, due to, a vmtest flake due to chrome timeout.  See run for mario incremental.
                                                                              • Two internal Chrome PFQ builders are also failing, since at least last Thursday, which has effectively caused the version of Chrome to be pinned.
                                                                                • http://chromegw/i/chromeos/builders/lumpy%20nightly%20chrome%20PFQ (
                                                                                • http://chromegw/i/chromeos/builders/alex%20nightly%20chrome%20PFQ/ ( created and assigned to UI).  Efforts to enlist Chrome sheriffs and ChromeOS chrome gardener did not get anywhere.
                                                                              • Another instance of  The current owner is out of office today, krisr re-assigned to craigdh.
                                                                              • This time hit the "x86 generic full" builder.  The bug is getting attention from test team now.
                                                                              • Another instance of on Mario Incremental -- added logs to the bug
                                                                              • x86-alex failed HWTest, sosa commented on IRC "looks like a false negative as i was rebooting/restarting the devservers when this happend so the update payloads weren't avialable on the devserver" -- re-opened and watching other canaries still running HWTest
                                                                              • meanwhile, hit another instance of 34102 on Mario Incremental
                                                                              • then another instance of 34102 on x86-mario Canary -- HWTest didn't seem to run (was orange)
                                                                              Sept 3, 2012, Mon
                                                                              mtennant, sonnyrao, vapier
                                                                              • Labor Day holiday in United States

                                                                              Aug 31, 2012, Fri
                                                                              adlr, ferringb
                                                                              • Sameer checked in a kernel change that caused all(?) machines to oops, reboot after ~10 seconds. Reverted the change.
                                                                              Aug 29, 2012, Wed
                                                                              miletus, garnold, mkrebs
                                                                              • Tree closed due to "Kernel image is larger than 8 MB" (  Reverted changes that added parted to initramfs.
                                                                                • Note: Reverts finally got merged in at about 8pm, so builds started before that could still fail (depending on their kernel size).
                                                                              • tree closure following x86 generic full VM test failure due to python crash; filed, tree re-opened.
                                                                              • Autotest failure: "Not logged in" error in platform_Pkcs11Persistence (possibly
                                                                              • Autotest failures: several more "supplied_Compositor sig 11" failures (  Also a "supplied_nacl_helper_boo sig 11" failure, which I added to that issue since it's also Chrome.
                                                                              Aug 28, 2012, Tue
                                                                              miletus, garnold, mkrebs
                                                                              • x86-alex and x86-mario canaries failed in hwtest (login_CryptohomeIncognitoUnmounted and login_CryptohomeUnmounted, respectively); investigation reveals network issues related to http / mysql server, tree re-opened.
                                                                              • lumpy, x86-mario and x86-zgb canaries failed in hwtest; latter two due to login issues, former on desktopui_{KillRestart,AccurateTime}. variety of failing bots suggests a transient flakiness. lab sheriff (jrbarnette) informed, tree re-opened.
                                                                              • Autotest failures: Bunch of failures with "Login timed out" and "chrome_200_percent.pak".  Turns out the chrome_200_percent errors are a red herring (they don't cause failures:  These are really login issues (
                                                                              • Autotest failures: "supplied_Compositor sig 11" in desktopui_DocViewing (
                                                                              Aug 24 Fri
                                                                              djkurtz (TPE), dgreid, katierh
                                                                              • lumpy canary failed enterprise_DevicePolicy
                                                                              • alex canary failed, enterprise_DevicePolicy, power_Resume (one login failure and an instance of
                                                                              • zgb canary failed imaging chromeos-rack6-host7 - multiple network failures on this board

                                                                              Aug 22 - Aug 23 Wed/Thu
                                                                              taysom, dhendrix, dgozma
                                                                              • x86-alex canary and x86-zgb canaray failed in HwTest during login
                                                                              • x86 generic incremental failed in flaky FMTtest
                                                                              • For login problems (
                                                                              Aug 20 - Aug 21 Mon/Tue
                                                                              cywang (TPE)
                                                                                Aug 16 - Aug 17 Thu/Fri
                                                                                waihong (TPE), posciak (MTV), bfreed (MTV)
                                                                                • x86-mario canary failed with a Chrome crash:
                                                                                • chromium.chromiumos amd64 failing most of the day,
                                                                                • flaky chromiumos-sdk: gtk-doc failing in configure, but intermittently
                                                                                • Flaky tegra2 full archive step's been failing intermittently on archive stage due to, will be getting rid of tegra2 bots Fri or Mon
                                                                                • Several packages failed with "select error: (4, 'Interrupted system call')", suspect something killed a build:
                                                                                • mario and alex canary failed due to HWTest losing connections, will be resolved itself.
                                                                                • Failed to connect to virtual machine:
                                                                                • security_ptraceRestrictions failing:
                                                                                • security_ASLR failing:
                                                                                • filed issue for recent >5 builds failed in login timed out.

                                                                                Aug 10 - Aug 13 Fri/Mon
                                                                                sleffler (SFO), quiche (MTV)
                                                                                Aug 8 - Aug 9 Wed/Thu
                                                                                sheu (MTV), bhthompson (MTV)for chromeos-factory
                                                                                • Intermittent flakes from security_SeccompSyscallFilters tracked in crosbug/33403.  Revert of promotion to bvt chumped in.
                                                                                • parrot canary failure due to 27c54ab in third_party/coreboot; fix chumped in.
                                                                                Aug 4 - Aug 5 Sat/Sun
                                                                                • I'm not actually sheriff today, but this is a note to sheriffs over the weekend and early Monday: there's a possible unit test failure in shill that made its way into the tree which could fail in build and cause a failure.  If this happens, feel free to submit in order to fix it.  It's waiting for normal review, but if it does end up causing trouble, chumping it is the right thing to do. (pstew)
                                                                                Aug 2 - Aug 3 Thu/Fri
                                                                                fjhenigman (WAT), benrg, snanda

                                                                                Jul 31 - Aug 1 Tue/Wed

                                                                                Jul 27 - Jul 30 Fri/Mon

                                                                                Jul 25 - Jul 26 Wed/Thu
                                                                                dennisjeffrey (MTV), sosa (MTV), hungte (TPE)
                                                                                • bot hung after successfully completing archive stage but before the report stage; forcefully killed by buildbot after 9000 seconds.  Seems to be a rare flake.  Filed
                                                                                • lots of errors connecting to Google Storage (curl failures).  Google Storage team was contacted and they fixed the problem on their end.  Followed-up by filing to track the task of updating the version of gsutil used on the chromeOS builders (a recommendation by the Google Storage team).
                                                                                • another "python2 sig 6" error.  Updated existing bug, which is currently under investigation.

                                                                                Jul 23 - Jul 24 Mon-Tue
                                                                                dkrahn(MTV), dtu(MTV)
                                                                                Jul 19 - Jul 20 Thu-Fri
                                                                                puneet(MTV), rminnich(MTV), seanpaul(MTV)

                                                                                Lots of failures to network issues, the biggest symptom being curl fails.

                                                                                July 17 - July 18 Tue/Wed
                                                                                msb(MTV), kamrik(WAT)

                                                                                • pyautolib sig6 crash - test passes but leaves a crash file behind. Saw this thrice.
                                                                                • Bunch of tegra flakiness issues. Told to ignore.

                                                                                Jul 13 - Jul 16 Fri/Mon
                                                                                grundler(MTV), sabercrombie(MTV)

                                                                                canaries were mostly fine on Friday. More failures on Monday:
                                                                                • "zgb failed on update-engine". Saw similar AU timeouts on lumpy, x86-mario, and zgb.
                                                                                  UPDATE: "Issue was devserver overloading and deploying apache and fixing crashes that happened every test run has resolved this issue."
                                                                                • "mod_image_for_recovery failed on arm-daisy canary". Saw this once.
                                                                                • pyautolib sig6 crash - test passes but leaves a crash file behind. Saw this once.

                                                                                Jul 11 - Jul 12 Wed-Thu
                                                                                nirnimesh(MTV), piman(MTV)

                                                                                Canaries repeatedly kept breaking due to update_engine problems.
                                                                                • butterfly canary failed VMTest with 'No space left on device' on image (not host). Updated on existing bug
                                                                                • x86-zgb canary failed HWTest with "Host did not return from reboot." Updated on existing bug
                                                                                • tegra2_kaen canary failed HWTest with "update-engine failed". Updated on existing bug

                                                                                Jul 5 - Jul 6 Thu-Fri
                                                                                chinyue(TPE), dhendrix (MTV), ferringb (MTV)
                                                                                • Thu Jul 05, 06:30 UTC: amd64 generic full failed: update_engine unittest takes too long to finish. (
                                                                                • Fri Jul 06 - ?: update_engine unittest fails on multiple internal builders during the FilesystemCopierAction test (
                                                                                • Thu Jul 05, 07:33 UTC: stout canary failed: ManifestVersionedSync took too long (6+ hours) and thus BuildTarget didn't have enough time to finish. Seems a glitch, re-opened tree.

                                                                                  Jul 3 - Jul 4 Tue-Wed
                                                                                  nirnimesh(MTV), rharrison(WAT)
                                                                                  • chromium.chromiumos bots were dying in the VMTest, Chrome sheriffs fixed that.
                                                                                    • Potentially saw this filter through to x86 alex canary. File
                                                                                  • mario canary failed a couple of times due to HWTest losing connections over night, resolved itself.
                                                                                  • amd64 generic full failed due to unit tests taking too long. Filed This occured again on x86 alex canary.
                                                                                  • FilesystemCopierActionTest.RunAsRootSimpleTest in update_engine failed for no apparent reason. File
                                                                                  • stumpy canary failed in HWTest with "StageBuildFailure" and "500 Internal Server Error". Filed
                                                                                  • Saw instance of prebuilts getting a 500 on upload
                                                                                  29 Jun-2 Jul Fri-Mon
                                                                                  rspangler(MTV), mtennant(MTV), waihong (TPE)

                                                                                  27-28 Jun 2012 Wed-Thu
                                                                                  benchan (MTV), dparker (MTV), josephsih (TPE)
                                                                                  • x86-alex canary and x86-zgb canary failed =>
                                                                                    • Failded at HWTest [bvt]: try_new_image FAIL: Host did not return from reboot.
                                                                                    • This might be related with system failed to respond on the network to cause reboot timeout. Alex and zgb seem particular hard hit.
                                                                                  • amd64 generic full failed =>
                                                                                    • Failed at cros_run_vm_update in VMTest. Networking sometimes failed to come up maybe due to a bug in VM network driver.
                                                                                  • lumpy canary failed => . Unhandled AssertionError: Could not create /home/chronos/Consent To Send Stats. during VMTest. No obvious cause. Reopened the tree and kicked the builder to see if problem reoccurs. Other canaries are passing.
                                                                                  • lumpy/stumpy/tegra2_kaen canary failed =>
                                                                                    • Failed at HWTest [bvt]. Seemed to be network problem.

                                                                                  25-26 Jun 2012 Mon-Tues

                                                                                  dianders (MTV), bfreed (MTV), clchiou (TPE)
                                                                                  • ~8am MTV: amd64-generic-inc is failing, but looks like a builder issue (as found by kliegs / ellyjones).  Tree still open.  Looking for a trooper; fixed by pschmidt.  resolv.conf was empty on the builder
                                                                                  • Kaen canary has been failing since last Friday.  2086 - 2090 were various HWTest failures.  Now it doesn't even do the update. for the update problem.  Not a closer, so assuming bug filed is enough.
                                                                                  • Autotest failure in bvt on x86-mario-r22 R22-2490.0.0.  Flake?  Don't see info about the failure.
                                                                                  • chromium.chromiumos failure:
                                                                                  • All canaries died.  Theory by davidjames is <>.  Revert is here: <,26077>
                                                                                  • x86-mario canary died.  Reported
                                                                                  • tegra2_kaen canary died the same way it was dying Friday night.  That is an improvement over the weekend failures.
                                                                                  • tegra2_kaen and x86-mario canaries died.  tegra2_kaen canary =>; x86-mario =>
                                                                                    • Think x86-mario may be a flake and just a longer timeout needed?  Need owner
                                                                                    • Not sure about tegra2_kaen
                                                                                  • parrot canary failure
                                                                                    • Retry didn't help.  Trying a clobber retry.

                                                                                  21-22 Jun 2012 Thu-Fri
                                                                                  tammo, sleffler, gpike
                                                                                  • kliegs reverted lumpy hwtest connection to the bots: http://chromegw/i/chromeos/changes/2521
                                                                                  • Uprev failing; kliegs manually modified .repo/manifests on mario paladin and kicked bots.  This looks to have fixed uprev failures and vmtests also passing.  Still hobbled by lumpy hwtest failures (timeouts take 30mins).
                                                                                  • All canaries failing with HWTest [bvt] Suite prep 502 Proxy Error (crosbug/31921).  tammo: Tree throttled, as I have no idea what to do about this.
                                                                                  • Tree throttled for vmtest failures; MTV sheriffs left for the day w/o resolution (PSA posted to chromium-os-dev@)
                                                                                  • Paladin's stuck so force stopped alex+stumpy paladin's and clobber+force build mario.
                                                                                  • Lumpy paladin hw tests are timing out backing up the CQ by ~15mins.  Attached to existing crosbug/31916.
                                                                                  • Autotest failure in bvt on stumpy-r22 (R22-2471.0.0): after the test passed, Chrome crashed, and there was no stacktrace due to ; ddrew created
                                                                                  • Looks like a network issue caused gsutil to hang (link canary); created crosbug/32028.

                                                                                  19-20 Jun 2012 Tue-Wed
                                                                                  taysom, wfrichar, kliegs, vapier

                                                                                  • Tree closure due to RPC failure by build server
                                                                                  • Tree closure due to failure to upload prebuilts to Google Storage (gsutil flake; at
                                                                                  • Tree closure due to race condition in cleaning up. Appeared to be the same as
                                                                                  • Chromiumos-tegra2 failed due to disk full - the build people with access to that server were in Las Vegas

                                                                                  15 & 18 Jun 2012 Mon & Fri
                                                                                  sosa, quiche, djkurtz

                                                                                  13-14 Jun 2012 Wed-Thu
                                                                                  jrbarnette, rcui, kinaba

                                                                                  11-12 Jun 2012 Mon-Tue
                                                                                  bleung, petkov, thieule
                                                                                  7-8 Jun 2012 Thu-Fri

                                                                                  4 Jun Mon
                                                                                  fjhenigman, dtu, tlambert
                                                                                  • 8:37am PDT - amd64 generic incremental closed tree when vm16-m2 disk filled up - could not find a trooper but Peter Mayo helped - thanks Peter
                                                                                  • 2:05pm PDT - x86 zgb canary failure first thought to be upload_symbols flake, but investigation indicates those errors are not fatal - looking for real cause
                                                                                  • 2:40pm PDT - paladins blew up real good, vapier identified and fixed it as a permissions issue - thanks vapier
                                                                                  • 3:46pm PDT - mario incremental
                                                                                  • 7:14pm PDT - lumpy canary

                                                                                  1 Jun Fri
                                                                                  fjhenigman, dtu, tlambert

                                                                                    31 May Thu
                                                                                    sque, pstew, josephsih
                                                                                    30 May Wed
                                                                                    sque, pstew, josephsih

                                                                                    29 May Tue
                                                                                    miletus, semenzato, sonnyrao
                                                                                    • amd64-generic full failed on Archive, opened new bug
                                                                                    • VMTest flak on mario-incremental -
                                                                                    • unpinned Chrome from 21.0.1150.3

                                                                                    23-24 May 2012 Thu-Fri
                                                                                    sabercrombie(MTV), vbendeb(MTV), kochi(TOK)
                                                                                    • closure by VMTest flake (
                                                                                    • tree broken by libssl update.  ellyjones fixed it.  Ongoing problems caused by failure to rebuild binpkgs dependent on openssl.
                                                                                    • Chrome build broke various UI tests:  Pinned Chrome to 21.0.1150.3.  
                                                                                    • mario-incremental VMTest failure with two apparent variants of
                                                                                    • link paladin u-boot build failure -- change reverted
                                                                                    • cros_mark_as_stable broken.  fix chumped in.
                                                                                    • gcc change broke builds.  reverted.

                                                                                    22 May 2012 Tue
                                                                                    cwolfe, marcheu, micahc
                                                                                    • the experimental "unified lumpy paladin" is down on disk full; it is being moved to another machine so does not need a cleanup
                                                                                    • mario-incremental failed with "login_CryptohomeIncognitoMounted ... Chrome did not reopen the testing channel after login as guest"
                                                                                    • unclutter was causing retries in various builds; fixed by cwolfe
                                                                                    • link canary failure on chromeos-u-boot; fixed by sjg 
                                                                                    • everything else failed on svn server problems; fixed by maruel and nsylvain
                                                                                    18-21 May 2012 Fri-Mon
                                                                                    dkrahn, puneetster, hashimoto
                                                                                    • VMTest failure:
                                                                                    • Tree broken by manifest change: olofj fixed.
                                                                                    • Archive failure on lumpy canary:
                                                                                    • Update engine failure on tegra2_kaen canary continues:
                                                                                    • Multiple occurances of storage error: 'transfer failed with bytes remaining': davidjames filed
                                                                                    • gpsd timeout on x86-generic full:
                                                                                    • Another google storage failure on tegra2-full: 'No valid URLs found' exception: davidjames in contact with storage team.
                                                                                    • Autotest timeout on amd64-generic for test: SimpleTestUpdateAndVerify. Subsequent VMTest stage passed.
                                                                                      17 May 2012 Thu
                                                                                      dlaurie, grundler, yusukes(TOK)

                                                                                      16 May 2012 Wed
                                                                                      dlaurie, grundler, yusukes(TOK)
                                                                                      • Came in to red tree, all canaries failed in vmtest.   Eventually we pinned Chrome to 21.0.1137.5, but that was not usable for ARM so it is being moved forward again.
                                                                                      • Chromium OS also has vmtest failure attributed to
                                                                                      • 2pm: amd64-generic-incremental failure: collect2: ld terminated with signal 7 [Bus error].   This was my fault (dlaurie) for not applying the binhost change to other targets when I pinned chrome.
                                                                                      • 2:45pm: amd64-generic-incremental is out of disk space, escalated to troopers
                                                                                      • arm-daisy canary build failed (closed tree) due to missing dependency in chromeos-bootimage (was built in parallel and "usually" built)
                                                                                      • 5pm: network timeout trying to retrieve cros_sdk

                                                                                      14 May 2012 Mon
                                                                                      mkrebs, bhthompson, falken(TOK)

                                                                                      • mkrebs: filed for "tegra2_kaen canary": JSONInterfaceError => GetNextEvent => "received empty response"
                                                                                      • x86-mario and stumpy canaries failed, maybe same as
                                                                                      • ferringb@ on IRC: some duplicated output issues, "if you see anything screwed up, for example, if vmtest has parts of unittest logs in it, please open bugs for it w/ links to the specific failures"
                                                                                        • also: builders that hang without output for a long time are probably out of disk space. CLs are coming.
                                                                                      4 May 2012 Fri
                                                                                      snanda, ellyjones, katierh
                                                                                      • - arm generic full builder had a timeout on gpsd though it builds locally and did not break other builders. Will watch the next build (already in progress)

                                                                                      3 May 2012 Thu
                                                                                      pstew, gpike
                                                                                      • BVT test run last night still has failures in graphics_WindowManagerGraphicsCapture, but now they are just failures and not segmentation faults. Test disabled, so it should not feature in the BVT on 4 May.
                                                                                      • Monitoring login_CryptohomeIncognitoUnmounted failure which seems to have failed VMTest on a couple of platforms last night, but is cycling green.
                                                                                      • Issued for crashing on Link paladin in chrome!BaseTab::AdvanceLoadingAnimation.  Chrome gardener is flackr@, not bshe@ as the waterfall shows, due to swap.  This issue is claimed is and verified in early builds on May 4.
                                                                                      • Persistent "Timed out waiting to revert DNS." messages on Link paladin builds.  This appears to be a side effect of the bug above causing tests to end prematurely.  Submitted a CL (making its way through the queue) which will landed land before the Chrome change so we were able to confiurm that this fixes this secondary issue.
                                                                                      • Transient failure in VMTest on x86-generic.  Filed bug
                                                                                      • A couple of glitchy builds due to some dependency swaps for the parted package.  Should have cycled through all builds, but contact benchan@ if parted features in any build failures tomorrow.
                                                                                      2 May 2012 Wed
                                                                                      pstew, gpike
                                                                                      • BVT failure in graphics_WindowManagerGraphicsCapture (segmentation fault). Assigned to ihf@ who wants to "wait and see" how it does in BVT toight.
                                                                                      • Canaries are red: Fixed by scottz@ who reverted the offending change.
                                                                                      • Creation of "swap.conf" in chromeos-init conflicts with platform-specific swap.conf:  Reverted this, and michahc@ will land a more comprehensive change.
                                                                                      • UploadPrebuilts phase for multiple architectures failing with "GSResponseError:: status=502, code=None, reason=Bad Gateway."  Appears to have been a temporary server failure -- monitoring.
                                                                                      1 May 2012 Tue
                                                                                      jrbarnette, thutt, inter alia
                                                                                      • Tree started the west coast day green
                                                                                      • Occasional update_engine unit test failures due to
                                                                                        • There are ongoing changes underway trying to get to the root cause.
                                                                                      • One canary failure due to an ill-timed change to the dev server.

                                                                                      30 Apr 2012 Mon
                                                                                      jrbarnette, inter alia
                                                                                      • Apparently, nothing has happened for the past week and a half.
                                                                                      • The tree started the west coast day (and week) green.
                                                                                      • Minor failures during the day; known bugs (to be documented later).
                                                                                      • At the time of West coast sign-off, there is an ongoing outage due to multiple canary failures
                                                                                        • HWTest updates got 404 errors downloading stateful.tgz; root cause unknown
                                                                                        • jrbarnette is declaring it "transient" - time will tell whether this is right.

                                                                                      19 Apr 2012 Thu
                                                                                      gmorain, piman, kamrik
                                                                                      • Found the tree red with HW test stage failed for alex, zgb, stumpy and lumpy canaries. In all cases the HW test stage failed at an early stage before even running any test. with an error message "FAIL: Update failed. Timed out waiting for system to mark new kernel as successful."
                                                                                      • While trying to figure what it was, most of the builders cycled green. The two HW test failures in the new builds seem to be browser crashes, opening the tree.
                                                                                      • 19:43 UTC - Tree went red again on HW test failure on ZGB and Alex with error message that looked like a browser crash. After some investigation it appears to be which was reported several hours earlier. Also reported in

                                                                                      13 Apr 2012 Fri
                                                                                      dgreid, tlambert
                                                                                      • Had an instance of 28631, re opened.
                                                                                      • platform_CryptohomeAuthTest was failing for most of the day and not closing the tree, Found the offending commit a reverted.
                                                                                      12 Apr 2012 Thu
                                                                                      dgreid, tlambert
                                                                                      • Error with HwTest reimaging systems cleared up.
                                                                                      • One instance of 26646

                                                                                      11 Apr 2012 Wed
                                                                                      dianders, thieule, chinyue (TPE)
                                                                                      • scottz: Unfortunate user error failure on HWTest:
                                                                                      • again on stumpy canary
                                                                                      • chinyue 06:32 UTC: VMTest failed, crostestutils.lib.dev_server_wrapper.DevServerException: Timeout waiting for the devserver to startup. (reopen
                                                                                      • chinyue 07:01 UTC: again on mario incremental
                                                                                      • chinyue 07:16 UTC: again on x86-mario canary
                                                                                      • chinyue 09:26 UTC:, still investigating...
                                                                                      • dianders 12:27 MTV: x86-mario canary in HWTest.  
                                                                                        • Not much was logged in the link pointed to by the waterfall.  It sounds like that's because this wasn't a failure of the test but perhaps a failure in running the test (?).
                                                                                        • scottz says he knows the problem and working on it.  Filing a bug for himself.  TBD: bug #?
                                                                                      • thieule 12:40 MTV: alex-he failure is 26646
                                                                                      • thieule 3:04p MTV: Temporary bots failure due to chromeos-chrome needing libjpeg, vapier says they should cycle green once they pull in chromeos-chrome 20.0.1098.1_rc-r2.
                                                                                      • thieule/dianders 5:30 MTV: Lots of canaries died due to failure to build private version ixchariot.  dianders reproduced locally.  Found that ixchariot used the cros-binary eclass, which had changed today.  Revert of the eclass fixed ixchariot build, so chumped it in.

                                                                                      10 Apr 2012 Tues
                                                                                      dianders, thieule, chinyue (TPE)
                                                                                      • x86-zgb canary builder failed (
                                                                                      • gclient sync failed when building chromeos-chrome (
                                                                                      • dianders 10:30a MTV: tree was left closed at start of shift with message:  Note that at around 10am that bug had been marked as fixed.  David James said that several syncs had passed, so not keeping tree closed for this.
                                                                                      • dianders 10:30a MTV: David James pointed that was still causing VM test bots to fail (old temp files still left over).  He is fixing.
                                                                                      • dianders 11:03a MTV: Noticed that Lumpy paladin builder failed with something similar to yesterday's  Ben confirmed that this was the same as
                                                                                      • dianders 12:30p MTV: chromium-os-sdk is broken (and has been for a little bit--didn't notice with all of the other redness).  Proposed fix is here:
                                                                                      • dianders 12:45p MTV: Checking to see if latest mario incremental failure is another  Asking Ben (who is AFK) and digging myself.
                                                                                        • Ben says it's a dupe.  Updated the bug.  Note that according to Ben there doesn't appear to be any good way to tell between this bug and any other hang of chrome at bootup.  ...but if we got sig 6 or sig 11, we'd know it was a crash and different.
                                                                                      • dianders 1:30p MTV: Kaen paladin is dead, which blocks all internal paladins.  Escalating to troopers (both via email and IRC).
                                                                                        • Latest update on the machine: It found errors during a disk check and is now trying to fix the errors.  Continuing to escalate.
                                                                                        • Going to move to another machine. is the bug to track that.
                                                                                        • Fixed now.
                                                                                      • dianders 2:45p MTV: Since CQ was so flaky for internal stuff, I ended up chumping people's changes in if they passed enough stuff (as suggested by davidjames).  Ignored instances of 29224 and the fact that they hadn't gone through Kaen.
                                                                                      • thieule 4:47p MTV: arm generic full builder runs out of disk space.  davidjames mention that the builder only has 60GB of disk so it can only hold about 3 builds.  Opened
                                                                                      • thieule 5:04p MTV: arm-ironhide canary fails to emerge kernel,

                                                                                      6-9 Apr 2012 Fri-Mon
                                                                                      tbroch(fri), sjg(mon), mtennant, tammo

                                                                                      4-5 Apr 2012 Wed-Thur
                                                                                      tbroch, dparker, kinaba

                                                                                      2-3 Apr 2012 Mon-Tue
                                                                                      sheckylin, olofj, dkrahn
                                                                                      • autotest repo failed to replicate automatically, davidjames replicated manually and logged
                                                                                      • Noticed alex-canary has failed the last two builds due to lab environment issues. Talked to johndhong, opened
                                                                                      • Reverted a CL that broke the commit queues:
                                                                                      • Persistent bug (‘Timed out waiting for login’ in VMTest)
                                                                                      • New bug ('Timeout waiting for the devserver to startup' in VMTest)

                                                                                      29-30 Mar 2012 Thu-Fri
                                                                                      grundler, dlaurie, seanpaul
                                                                                      • on "lumpy canary"
                                                                                      • Lots of VMTest failures "Timed out waiting for login"
                                                                                      • stumpy-canary has been broken for a week, was isolated and hopefully fixed Friday
                                                                                      • upstream merge of modemmanager-next caused breakage due to interface change.  shill was updated to compensate.
                                                                                      • A few builders ran out of space.  Commit was landed to auto-clean ccache directory.
                                                                                      • R19 x86-alex pre-flight stuck after vmtest failure, discovered and escalated to troopers late Friday...
                                                                                      • Detailed notes at

                                                                                      27-28 Mar 2012 Tue-Wed
                                                                                      reinauer, taysom
                                                                                      • New problem: crosbug.com28631 Failed cbuildbot failed archive failed report. Assigned to ferringb
                                                                                      • flake: VMTest fails in various login tests. This happened several times
                                                                                      • flake: Timed out waiting for system to mark new kernel as successful. Multiple times but not as many as 26646
                                                                                      • Problem with chrome - reinauer will need to describe.

                                                                                      23-26 Mar 2012 Fri-Mon
                                                                                      puneetster, snanda (PST), waihong (non-US)

                                                                                        21 Mar 2012 Wed
                                                                                        adlr, bfreed (PST), kochi (non-US)
                                                                                        • ARM build failure (-nopie error): internal builds are only affected; toolchain is rebuilt in chroot and this is fixed.
                                                                                        • cgroup unhandled crash
                                                                                        • flake: VMTest fails with Timed out waiting for login prompt
                                                                                        • Transient gclient sync failure on Chrome.
                                                                                        • VMTest failed with Error parsing data because invalid syntax, but the Report log says Exception __main__._ShutDownException: _ShutDownException('Received signal 15; shutting down',)
                                                                                        • BuildTarget failed with Unavailable repository 'gentoo' referenced by masters entry due to,18809.

                                                                                        20 Mar 2012 Tue
                                                                                        sonnyrao/dennisjeffrey (PST), yjlou (non-US)
                                                                                        • Mon 22:51 UTC Tree closed.  "amd64 generic full" bot running out of memory.  Filed and re-opened tree.
                                                                                          • Build team switched amd64-generic over to a Builder with more memory and we closed 119009
                                                                                        • Tue 06:30 UTC Tree closed.  Transient download error.  Tree re-opened.
                                                                                        • Tue 20:17 UTC Tree closed.  tegra2_seaboard failed BuildBoard due to Arm hardening options being enabled in GCC. 
                                                                                        • Saw another flake.

                                                                                        15 Mar 2012 Thu
                                                                                        keybuk (PST), katierh (PST)

                                                                                        13 Mar 2012 Tue
                                                                                        waihong (TPE)

                                                                                        12 Mar 2012 Mon
                                                                                        vbendeb (PST), bhthompson (PST)

                                                                                        The tree has been closed over the weekend. The remaining problem is kernel size in factory image exceeds 8MB, which causes the EFI partition overflow.

                                                                                        09 Mar 2012 Fri
                                                                                        vbendeb (PST), bhthompson (PST)
                                                                                        • Had to revert sshfs-fuse update to 2.4 as it was breaking on ARM due to instability marker in the new ebuild,17773
                                                                                        • Kernel 3.2 update was pushed on Friday but the impacts were not felt until the evening, leaving the tree red for the weekend. 

                                                                                        08 Mar 2012 Thu
                                                                                        quiche (PST), micahc (PST), hungte (non-US)
                                                                                        • 17:48 PST - tegra2_kaen canary failure, sjg pushed,17630 to fix.
                                                                                        • 16:11 PST - x86-zgb canary,
                                                                                        • 15:36 PST - lumpy canary failure, during PublishUprev
                                                                                        • 15:00 PST - assist with resolving chrome-PFQ failure (in chrometest)
                                                                                        • 12:43 PST - tegra2 seaboard full failure, reverted
                                                                                          ** update: not reverted. quiche prepared the revert CL, but didn't push it. (confused by UI)
                                                                                        • 04:36 PST - alex_he canary ManifestVersionedSync failure (
                                                                                        07 Mar 2012 Wed
                                                                                        quiche (PST), micahc (PST), hungte (non-US)
                                                                                        • 16:36 PST - assist with chromium.chromiumos failure (
                                                                                        • 12:41 PST - mario incremental failure due to TreeCloser
                                                                                        • 09:59 PST - CleanUp failed on x86 generic full., CL in review.
                                                                                        • 07:20 PST - CleanUp failed on amd64 generic full. petermayo rebooted the bot.
                                                                                        • 02:24 PST - x86-mario canary failure due to HWTest.

                                                                                        06 Mar 2012 Tue
                                                                                          • 11 PST - amd64-generic full hit
                                                                                          • 6 PST - tegra2-full complaining of disk full
                                                                                          • Periodic

                                                                                          05 Mar 2012 Mon
                                                                                          • 9:43 PST - chromiumos-sdk hitting link errors in SDKTest; marcheu and zbehan got it fixed.
                                                                                          • Periodic all weekend.

                                                                                          01 Mar 2012 Thurs
                                                                                          • 04:00; llvm change landed tightening const strictness, breaking stumpy/lumpy canaries.  Revert chumped in, canaries restarted manually.
                                                                                          • 10:14 PST - ScreenLocker smoke test failing. keybuk says the test was relying on a bug. Reopened for now.
                                                                                          • More chrome flakiness on internal builders -
                                                                                          • 4:30 PST- x86 chrome PFQ failing to emerge chromeos-base/chromeos-0.0.1-r153, trying clobber build.

                                                                                          28 Feb 2012 Tue
                                                                                          kliegs (EST), davidjames (PST), nsanders (PST)
                                                                                          • Paladin bots aren't closing the tree -

                                                                                          28 Feb 2012 Tue
                                                                                          kliegs (EST), davidjames (PST), nsanders (PST)
                                                                                          • 8:00 PST - chrome buildspec fixed (was missing new chromite dependency whitelist).  chrome PFQ's running now to roll ebuild.  Finished @9:45 AM and canaries were kicked off manually.  Change was picked up by canaries and they went green.
                                                                                          • 7:39 PST - Canaries still red, Chrome 1055 buildspec was unsuccessful so no new chromeos-chrome ebuild.  This means vapiers revert was not picked up so unused variables still creating errors on the canaries.  Leaving tree throttled and have pinged Chrome sheriffs and PMs to help resolve

                                                                                          27 Feb 2012 Mon
                                                                                          dtu (MTV), mkrebs (MTV)
                                                                                          26 Feb 2012 Sun
                                                                                          • 6:40pm, ferringb reopens for flake on mario-incremental.
                                                                                          • 5:43pm, vapier reopens, files
                                                                                          25 Feb 2012 Sat
                                                                                          • 11:08: All internal canaries are taken out by Tree remains closed till sunday.
                                                                                          24 Feb 2012 Fri
                                                                                          dtu (MTV), mkrebs (MTV)
                                                                                          • 10:48 PST - All full and incremental builders, along with chromiumos on the Chrome waterfall, fail due to path conflict in DEPS. rcui reverts.
                                                                                          • 12:26 PST - x86-alex_he canary closes on chromium-os:26646. mkrebs reopens.
                                                                                          23 Feb 2012 Thu
                                                                                          • 14:06 PST - Builder could not find "configure" executable on x86 generic full, arm generic full, tegra2 full, and tegra2 seaboard full.  Fixed by zbehan.
                                                                                          • 18:44 PST - Bug 26646 hit x86-mario incremental.
                                                                                          22 Feb 2012 Wed
                                                                                          • 12:22 PST - ExtensionTerminalPrivateApiTest.TerminalTest failed in Linux ChromeOS Tester. Possibly fixed, so no bug filed.
                                                                                          • 14:38 PST - Chromium-OS:26646
                                                                                          18 Feb 2012 Sat
                                                                                          17 Feb 2012 Fri

                                                                                          16 Feb 2012 Thur

                                                                                          15 Feb 2012 Wed

                                                                                          14 Feb 2012 Tue
                                                                                          dianders (MTV), ferringb (MTV), clchiou (TPE), tammo (TPE)
                                                                                          • overnight - bvt failure in zgb (couldn't access  Interim failure?
                                                                                          • overnight - x86-mario canary failure => according to tree status history; clchiou has throttled because of this
                                                                                          • 8:35am - tree was open/green when dianders got in (ellyjones opened)
                                                                                          • 11:00am - x86 pineview full: ferringb IDed as an instance of
                                                                                            • dianders: nope.  Actually; I put a pling in that bug.  Still agree that it shouldn't close tree, since it's not a new issue.
                                                                                          • 12:47p - lumpy canary: Another instance of  Inserted pling and bumped to P1.  dianders: kicked lumpy canary build (1:15p) when I noticed that it wouldn't retry for a while.
                                                                                          • 2:00p - dianders noticed that chromium.chromiumos build was broken (though no email).
                                                                                            • Filed:
                                                                                            • Didn't close tree, since this doesn't appear to be a treecloser.
                                                                                          • 2:16p - gcc 4.6.2 (CL 15461) landed at 1:58p without a required dependency keyworded breaking amd64 i7 full, reverted via CL 15845.
                                                                                          • 3:49p - x86 generic incremental: vmtest, Actually; time to escalate?
                                                                                          • 4:31p - x86 generic incremental: vmtest, (supplied_chrome crash).  Added note to the bug, including a little bit of debugging.  Not sure there's much we can do here.
                                                                                          • 6:22p - x86 zgb-he canary, update delta again:
                                                                                          • 6:31p - lumpy, timeouts:

                                                                                          13 Feb 2012 Mon
                                                                                          msb, mkrebs (MTV), kamrik (EST)
                                                                                          10 Feb 2012 Fri
                                                                                          msb, mkrebs (MTV), kamrik (EST)
                                                                                          • x86 canary bots are all red as of 2230 PDT last night - filed as
                                                                                          • full buildbots have been running erratically for a week - e.g., the last run of x86-generic-full was on 2012-02-08 and the last run of arm-generic-full was on 2012-02-08 (with the previous run on 2012-02-01!)
                                                                                          • Tree closed at 2012-02-09 2244 PDT, reopened by ellyjones at 0628 PDT, reclosed by ellyjones at 0730 PDT. Still closed as of 0741 PDT.
                                                                                          • After much wailing and gnashing of teeth, ellyjones, kliegs and zbehan track the problem down to the binutils-2.21-r3 image produced by the chromiumos-sdk bot; if the same package is compiled locally (from the same git commit-id), the failure disappears. Tree still closed as of 0828 PDT.
                                                                                          • vapier points out that the act of rebuilding switches you back to bfd instead of gold, thus hiding the problem from earlier tests; back to square one
                                                                                          • CLs 15340 and 15176 were reverted and the SDK builder fired to rebuild the SDK. The new SDK works (verified around 12:40 PST)
                                                                                          • The problem was due to the -frecord-gcc-switches flag and the and the way how gold gets linked with glibc, which is linked with GNU ld.
                                                                                          • supplied_chrome sig 11 in security_ProfilePermissions.login again
                                                                                            • http://chromegw/i/chromeos/builders/lumpy64%20PFQ/builds/534/steps/VMTest%20%5Blumpy64%5D/logs/stdio

                                                                                          9 Feb 2012 Thu
                                                                                          sque, dkrahn (MTV)

                                                                                            9 Feb 2012 Thu
                                                                                            djkurtz (TPE)

                                                                                            8 Feb 2012 Wed
                                                                                            sque, dkrahn (MTV)
                                                                                            6 Feb - 2 Feb 2012 Mon/Tue
                                                                                            dparker, wfrichar (MTV)

                                                                                            31 Jan - 1 Feb 2012 Tue/Wed
                                                                                            jrbarnette, dgarrett (MTV), falken (Tokyo)

                                                                                            27-30 Jan 2012 Fri/Sat/Sun/Mon
                                                                                            ellyjones, dennisjeffrey, vbendeb

                                                                                            25-26 Jan 2012 Wed/Thu
                                                                                            rginda, sjg, mazda
                                                                                              • Lumpy64 and Pineview archive failures found to be a problem with dumpsym choking on files of the wrong architecture, filed as  Mkrebs working on a fix as a p0 item, so tree reopened.
                                                                                              • Lumpy PFQ failure due to chrome aura crash: (Build log: http://chromegw/i/chromeos/builders/lumpy%20PFQ/builds/595).  ChromeOS tree reopened since this is a chrome aura issue.
                                                                                              • Same aura crash also believed to be behind stumpy PFQ failure: http://chromegw/i/chromeos/builders/stumpy%20PFQ/builds/603
                                                                                              • Filed bug 25467 for Lumpy canary failure with and
                                                                                              • Filed bug 25468 for dump_syms problem (ERROR : Unable to dump symbols for /build/x86-pineview/usr/lib/debug/boot/vmlinux: dump_syms: src/common/linux/ const Elf32_Shdr*{anonymous}: :FindSectionByName(const char*, const Elf32_Shdr*, const Elf32_Shdr*, int): Assertion `nsection > 0' failed.
                                                                                                23-24 Jan 2012 Mon/Tue
                                                                                                19-20 Jan 2012, Thu/Fri
                                                                                                anush, puneetster, miletus

                                                                                                17 Jan 2012, Tuesday
                                                                                                nirnimesh, sosa, josephsih
                                                                                                • Out of disk space on a bot - issue 110480 filed for long term fix.
                                                                                                13 Jan 2012, Friday
                                                                                                jennyz, achuith, zvorygin
                                                                                                12 Jan 2012, A Rainy Thursday
                                                                                                jamescook, keybuk, vapier (east coast)
                                                                                                11 Jan 2012, A Wednesday
                                                                                                jamescook, keybuk, vapier (east coast)
                                                                                                • Hit kvm ssh timeout again
                                                                                                • build_image failure with "mount: you must specify the filesystem type" 
                                                                                                • x86-alex & tegra2_seaboard toolchain master bots dead due to sync error -> trooper reset them
                                                                                                • x86-alex 0.11.241.B factory & pre-flight bots dead for a while
                                                                                                • x86-zgb release factory-980.B bot has been down for a while
                                                                                                • google-breakpad failed its unit tests
                                                                                                • new dev-libs/glib pkg failed in toolchain fortify smoketest; dev was informed of CQ usage and multiple CL's landed to resolve

                                                                                                10 Jan 2012, Tuesday
                                                                                                davidjames, tbarzic
                                                                                                • Saw a couple shutdown crashes in Chrome:
                                                                                                1/9/2012, Monday
                                                                                                davidjames, tbarzic, jglasgow (east coast)
                                                                                                • Tree still throttled; Looking for jhorwich to organize a sheriff summit.
                                                                                                • Found ZGB PFQ reporting errors (http://chromegw/i/chromeos/builders/zgb%20PFQ/builds/193).
                                                                                                  "Found nothing new to build, trying again later.
                                                                                                  If this is a PFQ, then you should have forced the master, which runs cbuildbot_master
                                                                                                  Found no work to do."
                                                                                                  Tried to force a build via the web page, but not sure if that is what the error message means.
                                                                                                • Found TOT PFQ has not run since Friday, despite a long (28) queue of changes.  http://chromegw/i/chromeos/builders/TOT%20Pre-Flight%20Queue.  Filing bug.  Also observed that the ToT CQ has 749 pending requests.  No troopers on IRC or responding to email.

                                                                                                1/7/2012, Saturday
                                                                                                jhorwich, tlambert, jglasgow (east coast)
                                                                                                • Tree was throttled; it's possible to push things past the PFQ with "Publish and Submit" after verify.
                                                                                                • You may need to remove gerrit as a reviewer to do this, since it will -2 you on the PFQ.
                                                                                                • If future sheriffs use this as a workaround for the VMTest problem, keep the following in mind:
                                                                                                  • You need to watch the tree for non-VMTest failures.
                                                                                                  • Consider "Publish and Submit" for CLs that were rejected by PFQ failures involving VMTest IFF shutdown related; this allows people to make forward progress towards deadlines despite the VMTest issue.
                                                                                                  • Watching is no more onerous than reopening the tree every 27-35 minutes because of VMTest barfing (this was my Thr night and Fri day).
                                                                                                • jhorwich has the ability to get core dumps now; I'm talking to Randall Monday about making this a crossystem option. I honestly believe that he is a victim of VMTest in this, like the rest of us, and that we need to examine the (non)role of unit tests in diagnosing the test framework vis a vis tree closures:
                                                                                                  • In case that's not clear, let me bluntly say that a chrome failure from a passed test should not be a tree closer.
                                                                                                  • If we want to test Chrome fragility on shutdown, that should be a separate test; to my mind it would be of dubious value:
                                                                                                    • Chrome crashes -> restart Chrome -> gaia login
                                                                                                    • Chrome doesn't crash -> restart Chrome -> gaia login
                                                                                                • We need a sheriff's summit; if no one else calls one next week, I will.
                                                                                                  1/6/2012, Friday
                                                                                                  jhorwich, tlambert, jglasgow (east coast)
                                                                                                  • jglasgow: Found stumpy PFQ failing, filed, decided to fix rather than revert since VMTest was holding tree closed anyway
                                                                                                  • VM tests failures dues to Chrome crashes are a huge problem, but better debugged by jhorwich and those with Chrome experience.
                                                                                                    • Sorry to disagree here, but the VMTest failures are a meta-failure in the test infrastructure, and do not effect the validity of the test results.  They are bugs, but they are bugs that should not result in tree closure.
                                                                                                  • jglasgow: Filed for PFQ uprev failures

                                                                                                  1/5/2012, Thursday
                                                                                                  jhorwich, tlambert, jglasgow (east coast)
                                                                                                  • Filed a tree closer because of a test platform_ToolchainOptions was failing because of bluez.  Thanks zbehan for helping to look at this. It is not clear what change caused this to start failing.
                                                                                                  • Filed a tree closer because uboot was failing to build on tegra2.  Thanks David James who pointed this out, and vpalatin who quickly grabbed the bug.  Lots of red on the tree due to chrome sig11 certainly affected the sheriff's ability to notice this -- but we should have been more vigilant in making sure we understood all the red builders.
                                                                                                  • Chrome sig11 bug quite prevalent today. jhorwich noted 9 instances during MTV shift. Got a good stack trace on x86-alex canary build 1478, added to
                                                                                                  • Only other closure during shift was a straightforward build breakage (gerrit 13738) which was reverted
                                                                                                  • jhorwich reproduced a chrome sig11 on  local VM, is going to attempt to debug root cause Friday
                                                                                                  • tlambert reopened over the sig11; mostly jhorwich was faster
                                                                                                  • Added entry to the Sheriffs FAQ
                                                                                                    • we need to update the builder/closer list
                                                                                                    • temporary link to "all" for when you can't find the builder

                                                                                                  12/28/2011, Wednesday
                                                                                                  sonnyrao, mtennant
                                                                                                  • Noticed that sheriffs cannot push through gerrit with red tree.  Filed
                                                                                                  • Starting around 2pm started to see Chrome sig 11 crashes on internal zgb PFQ and TOT PFQ. Filed
                                                                                                  • Most Chrome sig 11 core files were unusable, but identified one as Re-opened bug.
                                                                                                  12/28/2011, Wednesday
                                                                                                  sonnyrao, mtennant
                                                                                                  12/27/2011, Tuesday
                                                                                                  derat, dlaurie, nkostylev
                                                                                                  12/26/2011, Monday
                                                                                                  derat, dlaurie, nkostylev
                                                                                                  • transient x86-alex canary vmtest "Timed out waiting for login prompt" (
                                                                                                  12/21/2011, Thursday
                                                                                                  • transient alex PFQ vmtest "Timed out waiting for login prompt" (
                                                                                                  • transient link PFQ vmtest sig 11 on login_BadAuthentication (didn't find a bug; should be one already)
                                                                                                  • transient x86-pineview-pull svn error with webrtc (didn't find a bug; should be one already)
                                                                                                  • Test failures on Chromium.ChromiumOS Linux ChromeOS Aura ( and
                                                                                                  12/21/2011, Wednesday
                                                                                                  marcheu, quiche
                                                                                                  • Another occurrence of (on x86 pineview full)
                                                                                                  • Build failure in chromiumos sdk. Due to race condition in groff ebuild. (
                                                                                                    • workaround by marcheu
                                                                                                    • fixed by vapier
                                                                                                  • Build failure in x86 generic commit queue. Due to missing sandbox exception for fontconfig. (, fixed by vapier)
                                                                                                  • Build failure in Chromium.ChromiumOS (aura). Fixed in ToT chrome.
                                                                                                  • Build failure on x86 generic PFQ (due to ferringb reverted.
                                                                                                  • Build failure on tegra2_kaen-aura canary (due to marcheu reverted.
                                                                                                  • Another occurrence of (on zgb PFQ)
                                                                                                  12/20/2011, Tuesday
                                                                                                  rharrison, kamrik shadowing
                                                                                                  • Came onto a red tree, due to Stumpy PFQ being forced directly instead of TOT PFQ
                                                                                                    • Filing a bug about the error message not being descriptive enough,
                                                                                                    • Created a CL to make error message more descriptive
                                                                                                    • Kicked the TOT PFQ and reopened the tree
                                                                                                  • VMTest Failure on amd full generic, created
                                                                                                  • Looked into
                                                                                                    • Pinged nkostlyev, altimofeev, and flackr to make sure it was being looked at
                                                                                                    • Approved CL for altimofeev to change test run order to try to get more information
                                                                                                  • Another occurrence of (on link PFQ)
                                                                                                  12/15/2011, Thursday
                                                                                                  ihf, gpike
                                                                                                  • Hit: chrome crash in suite_Smoke/desktopui_ScreenLocker.
                                                                                                  • One case of: chrome crash in suite_Smoke/security_ProfilePermissions.login.
                                                                                                  • The 3AM build of x86-zgb_he full release-R17-1412.B Build #21 failed: FAIL Archive (1:35:59) with BackgroundException.
                                                                                                  • The previous one (#20) failed in VMTest stage while unzipping the image.
                                                                                                  • ... and before that, #19 failed in VMTest stage during ../platform/crostestutils/generate_test_payloads/
                                                                                                  • The 3AM build of x86-mario full release-R17-1412.B Build #21 also failed during ../platform/crostestutils/generate_test_payloads/  The first problem may have been: "mount: you must specify the filesystem type"
                                                                                                  12/14/2011, Wednesday
                                                                                                  ihf, gpike
                                                                                                  • Hit issues of: chrome crash in suite_Smoke/desktopui_ScreenLocker.
                                                                                                  • Hit another issue of: VMTest ERROR: Test that updates to itself.
                                                                                                  • Problems on alex_he canary with recovery and vmlinuz images. Filed
                                                                                                  • chromiumos sdk broken: xmlrpc-c-1.18.02: curlmulti.c: curl/types.h missing. Filed
                                                                                                  12/12/2011, Monday
                                                                                                  • stumpy-canary link error in power_manager, can not reproduce locally. Clobber does not help as well. Filed
                                                                                                  • Lumpy-binary fails on building chromeos-u-boot-0.0.1-r336: boot_kernel.c:206:26: error: 'CHROMEOS_BOOTARGS' undeclared. Filed
                                                                                                  12/10/2011, Saturday
                                                                                                  cwolfe (drive-by, times unknown)
                                                                                                  • ARM release bots attempting to run vm_tests. Same as ( Probably from gerrit/12702, e-mailed rcui
                                                                                                  • Widespread build errors on pepper-flash "HTTP Error 403: User Rate Limit Exceeded" (
                                                                                                  • stumpy canary link error in power_manager; can not reproduce, probably just needs a clobber after the 403 clears up
                                                                                                  • Still some VMTest problems

                                                                                                  12/9/2011, Friday
                                                                                                  rspangler, chocobo, jglasgow, ellyjones
                                                                                                  • 1130 PST: VMTest chrome-static crashes timed out (
                                                                                                  • 1255 PST: VMTest timed out (
                                                                                                  • 1300 PST: VMTest login timeout (
                                                                                                  • 1405 PST: VMTest failure (
                                                                                                  • 1530 PST: VMTest flakiness (
                                                                                                  • 1600 PST: And more VMTest problems (

                                                                                                  12/8/2011, Thursday
                                                                                                  rspangler, chocobo, jglasgow, ellyjones

                                                                                                  12/7/2011, Wednesday
                                                                                                  thutt, thieule, yusukes
                                                                                                  • 1033 PST: Canary builders broke (, this was fixed and canary builder subsequently passed.  Some builders used dash instead of bash.
                                                                                                  • 1041 PST: Aura Chrome PFQ incorrectly configured, petermayo is working on a fix
                                                                                                  • 1127 PST: Chrome crashed during VMTest (
                                                                                                  • 1404 PST: Autotest client terminated unexpectedly (, this could be related to
                                                                                                  • 1717 PST: Chrome crashed during VMTest (
                                                                                                  • 1729 PST: Chrome crashed during VMTest (

                                                                                                  12/6/2011, Tuesday
                                                                                                  thutt, thieule, yusukes
                                                                                                  • 1059 PST: AU VM Test failure (
                                                                                                  • 1140 PST: Timed out waiting for login prompt (
                                                                                                  • 1415 PST: Chrome SEGV (
                                                                                                  • 1722 PST: Timed out waiting for login prompt (
                                                                                                  • 1740 PST: Commit queue hung and was restarted (
                                                                                                  11/28/2011, Monday
                                                                                                  ers, sleffler, stevenjb
                                                                                                  • BVT failures for zgb are chrome sig 11's that appear unrelated but the dump logs are zero length
                                                                                                  • BVT failures for mario sig 11 in synTPenh
                                                                                                  • 1146 EST: looks like occurrences are still closing the tree frequently
                                                                                                  • All three chrome PFQ builds had failed with "Clear and Clone chromite" errors (couldn't find branch named 'release'). A forced build on the arm generic chrome PFQ resulted in success, so I reopened the tree and forced builds on the other chrome PFQ bots.
                                                                                                  11/10/2011, Thursday
                                                                                                  • 0800 PST: 500 internal server error uploading prebuilt. Bug filed:
                                                                                                  • 0900 PST: x86 PFQ failure in autotest due to pygtk rev. pygobject was updated and PFQ clobbered.
                                                                                                  • 1100 PST: TOT PFQ faliure in autotest due to pygtk/pygobject rev. Next build was successful.
                                                                                                  • 1500 PST: Adobe pulls all Linux Flash 10 binaries.  Bastards.  I updated the adobe-flash ebuild to use flash11.
                                                                                                  • 1700 PST: VMTest failures due to broken flash, my ebuild did not install into correct directories...
                                                                                                  NOTE: autotest/pygtk/pygobject failures were related to python ebuild from a couple days ago

                                                                                                  11/3/2011, Thursday
                                                                                                  • 0700 PST: Chrome build was broke early in the morning.  We kept the ChromeOS tree open.  Resolved around 11:30.
                                                                                                  • Red canaries were expected to run overnight, but they are still red Friday morning.
                                                                                                  11/2/2011, Wednesday
                                                                                                  • 1356 PST: tegra2_seaboard-tangent-binary failed with a too large u-boot image.  reinauer fixed this.
                                                                                                  • 1415 PST: transient VMTest failures from 2:15 to around 3:00.
                                                                                                  Still open:
                                                                                                  • Want to understand how to make sure we get chrome stack crawls.  WIP at: <> and <>, which I think are different bugs.
                                                                                                  • amd64-generic-full still failing (less important).
                                                                                                  • BVT tests getting Synaptics sig 11s ( and chrome sig 11s (not too surprising given the ones we see below).

                                                                                                  10/28/2011, Friday
                                                                                                  • 1625 PST: chromium.chromiumos failure. VMTest stage timed after 9000 seconds. ericroman reverted a webkit roll.
                                                                                                  • 1620 PST: reopened tree
                                                                                                  • 1540 PST: restarted most internal builders. (restarted any builder that had a build fail due to the network issue; did not restart builders that were idle at the time of network failure)
                                                                                                  • 1530 PST: network issues resolved
                                                                                                  10/27/2011, Thursday
                                                                                                  • 2150 PST: network issues causing failure on internal builders (
                                                                                                  • 1718 PST: chromium.chromiumos closes due to gclient sync failure on chromeos-chrome. ericroman reopens.
                                                                                                  10/26/2011, Wednesday
                                                                                                  • 0922 EST: VMTest Failed due to not being able to access update server
                                                                                                    • Bug filed by petermayo as
                                                                                                    • Reopened, since it only occurred for one bot
                                                                                                  • 1443 EST: Tree closes because of failure to fetch webkit from Not supposed to happen. Is
                                                                                                  • 1529 EST: Tree closes because of a build failure in chromium's chromium. Not supposed to happen; sosa and petermayo are fixing this.
                                                                                                  • 1643 EST Another instance of from

                                                                                                  10/25/2011, Tuesday
                                                                                                  • 1022 EST: Failures due to issues with cros_run_vm_test from,10599 . Reverted as,10647
                                                                                                    • Multiple re-closures due to slow builds hitting this issue after the revert
                                                                                                  10/24/2011, Monday
                                                                                                  • 5:34p: Mosys ebuild failure.  Reverted here:
                                                                                                  • 5:12p: Another flaky sig11 in ChromiumOS (x86) (chromium.chromiumos).  Haven't investigated, but it went away.
                                                                                                  • 4:51p: Another flaky sig11 in alex-binary.
                                                                                                  • 3:15p (dianders): ChromiumOS (x86) (chromium.chromiumos) build failed.  2 issues:
                                                                                                    • First sig11 didn't give a stack crawl.  Seems to be a different problem than (??)
                                                                                                  • 2:31p (dianders): Stumpy canary 493 fails.  Different than 492, but probably also flaky. filed.
                                                                                                  • Early afternoon (dianders): Digging into overnight BVT failures.  2 of them thought to be another instance of
                                                                                                  • Morning (dianders): Digging into stumpy 492.  Filed w/ info.  Going to see what happens w/ 493.
                                                                                                  • 9:45a PT: Started (West coast) day with:
                                                                                                    • Tree opened with caveat (though already fixed).  Kicked binary builders to try bugfix.
                                                                                                    • Linux ChromeOS build failing (  Looks like a flaky build. seemed to be talking about this test, so added a comment.
                                                                                                    • Chromium OS SDK looks like it's still probably broken.
                                                                                                    • Several emails about BVT failures
                                                                                                    • Last stumpy canary (492) was a failed one.
                                                                                                      • security_ProfilePermissions.login        ERROR: Unhandled JSONInterfaceError: Automation call {'username': 'performancetestaccount@gmail. ...

                                                                                                  10/21/2011, Friday
                                                                                                  See also this doc:

                                                                                                  • 4:00pm PT: vpalatin points out that the (less important) amd64-generic-full is failing.
                                                                                                  • 3:16pm PT: Failure w/ shflags and testUpdateKeepStateful (  
                                                                                                  • 3:16pm PT: fix to chromium.chromeos waterfall <>
                                                                                                  • 2:14pm PT: fix to is pushed.
                                                                                                  • 2:14pm PT: another isntance of 21945
                                                                                                  • 2:14pm PT: another kernel build failure (same problem--revert hasn't made it everywhere).
                                                                                                  • 1:30pm PT: ...kernel build failure again (another case fixed by revert below)
                                                                                                  • 1:00pm PT: kernel build failure; fix by reverting ->
                                                                                                  • 10:38am PT: stumpy canary failure attributed to Bigstore; reportedly a power event in the data center.
                                                                                                  • 10:23am PT:  ellyjones reports kernel failure; fix: ->
                                                                                                  • 9:00am PT: Started the (West coast) day with
                                                                                                    • Chrome SEVG failures lumped under
                                                                                                    • Chrome PFQs all down
                                                                                                    • chromium.chromeos broken (and has been for several days).
                                                                                                  10/20/2011, Thursday; 10/19/2011, Wednesday

                                                                                                  10/17/2011, Monday
                                                                                                  • 9:40am. Tegra build fails.

                                                                                                  10/12/2011, Wednesday
                                                                                                  olege, semenzato, gpike
                                                                                                  • 9pm: getting lots of chrome sig 11 during vmtests.  Cause unknown.
                                                                                                  • 5:32pm: webkit.2011101101.patch needs update. Updated, got petermayo to work on a fix; restarted 3 bots after fix was done
                                                                                                  • 5:15pm: cmasone kindly fixed a crash reporter bug introduced this morning.
                                                                                                  • 4:43pm: hit 19204 again
                                                                                                  • 4:16pm: webkit.2011101101.patch needs update. Opened
                                                                                                  • 2:21pm: autoupdate vmtest failed.  Under psychological pressure, Chris Sosa admitted seeing this before.  Opened
                                                                                                  • 9am: disabled desktopui_UrlFetch.not-live, thereby sweeping under the rug.
                                                                                                  • sheriffs could not submit a change bypassing the commit queue.  Chris Sosa fixed this.
                                                                                                  • afternoon: svn checkout for chromeos-chrome failed again.  Opened
                                                                                                  • 8am. "arm generic full" failed on BuildTarget. svn checkout failed during building chromeos-chrome. Built fine on the next try.
                                                                                                  • 0am - 8am. Multiple occurrences of
                                                                                                  10/11/2011, Tuesday
                                                                                                  olege, semenzato, gpike
                                                                                                  times in PST unless otherwise marked
                                                                                                  • 2:30pm: 21517 is fixed (xorg.conf missing in arm builds).  This was making the arm canaries red.
                                                                                                  • 12:20pm.  Arm build broken by change 55311 at 11:45, fixed by change 55319 at 12:40.
                                                                                                  • 11am. Another occurrence of, assertion failure in google breakpad.
                                                                                                  • 10:30am. happened twice.  Raised priority and reopened.
                                                                                                  • 8am. Oleg reverted a change apparently responsible for vmtest failure on stumpy. (,9841)

                                                                                                  10/6/2011, Thursday
                                                                                                  Sheriffs: derat, stevenjb
                                                                                                  * Additional occurrences of
                                                                                                  * Multiple update failures:

                                                                                                  10/5/2011, Wednesday
                                                                                                  Sheriffs: derat, stevenjb
                                                                                                  Tree started closed with two issues:
                                                                                                  * (Unhandled JSONInterfaceError: Automation call) continues to close the tree.