Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[devicelab] catalina bot fails to pick up tasks #70244

Closed
keyonghan opened this issue Nov 10, 2020 · 12 comments
Closed

[devicelab] catalina bot fails to pick up tasks #70244

keyonghan opened this issue Nov 10, 2020 · 12 comments
Assignees
Labels
P0 Critical issues such as a build break or regression team-infra Owned by Infrastructure team tool Affects the "flutter" command-line tool. See also t: labels.

Comments

@keyonghan
Copy link
Contributor

Pre-health check fails with error

2020-11-10T15:42:51.652195: WARN close-ios-dialog: failed
2020-11-10T15:42:51.652225: WARN ERROR: Command "xcrun xcodebuild -project infra-dialog..." failed with exit code 70.
2020-11-10T15:42:51.652257: WARN #0      fail (package:cocoon_agent/src/utils.dart:139:3)
2020-11-10T15:42:51.652290: WARN #1      closeIosDialog.<anonymous closure> (package:cocoon_agent/src/health.dart:189:9)
2020-11-10T15:42:51.652322: WARN <asynchronous suspension>
2020-11-10T15:42:51.652352: WARN #2      inDirectory (package:cocoon_agent/src/utils.dart:360:24)
2020-11-10T15:42:51.652384: WARN <asynchronous suspension>
2020-11-10T15:42:51.652418: WARN #3      closeIosDialog (package:cocoon_agent/src/health.dart:170:11)
2020-11-10T15:42:51.652451: WARN <asynchronous suspension>
2020-11-10T15:42:51.652486: WARN #4      _captureErrors (package:cocoon_agent/src/health.dart:68:47)
2020-11-10T15:42:51.652518: WARN <asynchronous suspension>
2020-11-10T15:42:51.652551: WARN #5      performHealthChecks.<anonymous closure> (package:cocoon_agent/src/health.dart:49:41)
2020-11-10T15:42:51.652604: WARN <asynchronous suspension>
2020-11-10T15:42:51.652672: WARN #6      _captureErrors (package:cocoon_agent/src/health.dart:68:47)
@keyonghan keyonghan added this to New in Infra Ticket Queue via automation Nov 10, 2020
@keyonghan keyonghan self-assigned this Nov 10, 2020
@keyonghan keyonghan added the P1 label Nov 10, 2020
@keyonghan keyonghan moved this from New to Triaged in Infra Ticket Queue Nov 10, 2020
@keyonghan
Copy link
Contributor Author

Tried restart device but hit:

flutter@flutter-devicelab-mac-12 agent % idevicediagnostics restart          
ERROR: Could not connect to lockdownd, error code -8

@keyonghan
Copy link
Contributor Author

keyonghan commented Nov 11, 2020

Find some similar errors and workaround: libimobiledevice/libimobiledevice#587 (comment), libimobiledevice/libimobiledevice#717 (comment)

But doesn't help here.

@jmagman Do you have any idea how to fix this?

@keyonghan
Copy link
Contributor Author

Removing close-ios-dialog step in pre-health check for now to by pass the issue, so that the bot continues to pick up tasks.

But task failed with

Task failed with the following reason:
Task failed: DeviceException: No connected iOS devices found.

However

idevice_id -l             
e7184ae5fe05c2ea2ad4654ac70b4ba19c92f2c6

@keyonghan keyonghan added P0 Critical issues such as a build break or regression and removed P1 labels Nov 11, 2020
@keyonghan
Copy link
Contributor Author

keyonghan commented Nov 11, 2020

From flutter test runner side, it is using /Users/flutter/.cocoon/flutter/bin/flutter devices --machine --suppress-analytics --device-timeout 5. However it returns no ios device:

/Users/flutter/.cocoon/flutter/bin/flutter devices --machine --suppress-analytics --device-timeout 5
[
  {
    "name": "macOS",
    "id": "macos",
    "isSupported": true,
    "targetPlatform": "darwin-x64",
    "emulator": false,
    "sdk": "Mac OS X 10.15 19A582a darwin-x64",
    "capabilities": {
      "hotReload": true,
      "hotRestart": true,
      "screenshot": false,
      "fastStart": false,
      "flutterExit": true,
      "hardwareRendering": false,
      "startPaused": true
    }
  },
  {
    "name": "Web Server",
    "id": "web-server",
    "isSupported": true,
    "targetPlatform": "web-javascript",
    "emulator": false,
    "sdk": "Flutter Tools",
    "capabilities": {
      "hotReload": true,
      "hotRestart": true,
      "screenshot": false,
      "fastStart": false,
      "flutterExit": false,
      "hardwareRendering": false,
      "startPaused": true
    }
  },
  {
    "name": "Chrome",
    "id": "chrome",
    "isSupported": true,
    "targetPlatform": "web-javascript",
    "emulator": false,
    "sdk": "Google Chrome 86.0.4240.193",
    "capabilities": {
      "hotReload": true,
      "hotRestart": true,
      "screenshot": false,
      "fastStart": false,
      "flutterExit": false,
      "hardwareRendering": false,
      "startPaused": true
    }
  }
]

@keyonghan keyonghan removed their assignment Nov 11, 2020
@keyonghan keyonghan added team-infra Owned by Infrastructure team tool Affects the "flutter" command-line tool. See also t: labels. labels Nov 11, 2020
@jonahwilliams
Copy link
Member

I think the tool does not use idevice_id , it uses xcdevice

@jmagman
Copy link
Member

jmagman commented Nov 11, 2020

lockdown error -8 is LOCKDOWN_E_MUX_ERROR, it probably needs a restart. But you can't restart it because you are getting a lockdown error...

xcrun xcdevice list --timeout 10 might give us more info.

@keyonghan
Copy link
Contributor Author

Here is the info:
xcrun_xcdevice.txt

@jmagman
Copy link
Member

jmagman commented Nov 11, 2020

e7184ae5fe05c2ea2ad4654ac70b4ba19c92f2c6 isn't even in that list, it seems good and borked, likely needs a restart.

@keyonghan
Copy link
Contributor Author

keyonghan commented Nov 11, 2020

filed fxb/64108 to ask for reboot.
In the meantime, marked catalina tasks as flaky to open the tree: #70255

@keyonghan keyonghan moved this from Triaged to In progress in Infra Ticket Queue Nov 11, 2020
@keyonghan keyonghan added P1 and removed P0 Critical issues such as a build break or regression labels Nov 11, 2020
@keyonghan keyonghan self-assigned this Nov 11, 2020
@keyonghan
Copy link
Contributor Author

Lower the priority to p1 as this is not blocking the tree.

@keyonghan
Copy link
Contributor Author

Device comes back with fxb/64108 being fixed.
#70328 to remove flakiness.

Infra Ticket Queue automation moved this from In progress to Done Nov 12, 2020
@github-actions
Copy link

github-actions bot commented Aug 9, 2021

This thread has been automatically locked since there has not been any recent activity after it was closed. If you are still experiencing a similar issue, please open a new bug, including the output of flutter doctor -v and a minimal reproduction of the issue.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Aug 9, 2021
@flutter-triage-bot flutter-triage-bot bot added P0 Critical issues such as a build break or regression and removed P1 labels Jun 28, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
P0 Critical issues such as a build break or regression team-infra Owned by Infrastructure team tool Affects the "flutter" command-line tool. See also t: labels.
Projects
No open projects
Development

No branches or pull requests

4 participants