Automate the creation of an offline runner #519

npalm · 2021-02-06T16:46:42Z

The current approach requires we alwsys have an offline runner regirstered. Those offline runners are removed by github eacht 60 days. Therefore it would be convient to automate the process to keep 1 offline runner so we can scale back to 0.

Potential solution direction

Register via ED2 instance

Use the same mechanism we use to spin up runners with an extra lambda that only execute the user data till the config step. And also ensure the ec2 instance is take down

Reverse egineer github config process

Create lamda that is using github http calls based on reverse engineering, see actions/runner#558

Run the config in a lambda

Create a lambda that can exectue the config via a lambda layer

miked63017 · 2021-02-08T23:44:12Z

@npalm I've figured out most of the logic, urls, etc involved with registering a runner. I plan on making a python module to handle this using python 3x and requests. This will probably be this weekend when I'm off work.

If you would prefer to implement it yourself, I can share my notes I've gathered about the process before then.

I honestly think I want to spend some more time tracing the entire process, so it could actually pull and perform workflows as well, but the new runner registration was a must have for me.

Basically the process I have found to work:
Get runner token via api using pat or app creds
Post token to secretish endpoint, receive back json data with new secretish endpoint and a jwt
Use jwt to auth via auth bearer header, using newly revealed endpoint, can query existing "agents"(as they are called in the api), add a new one, or update an existing one.

There's obviously some more details involved, including creating an RSA key, and a bunch of headers. I haven't looked further than registration yet...

npalm · 2021-02-09T08:41:33Z

@gertjanmaas I think you will like the comment above

mcaulifn · 2021-02-10T12:41:28Z

@miked63017 let me know if you don't get to it. We are looking for this as well so I would have some time to work on it.

Edit: Not sure if you saw, but this is newly released for Python: https://github.blog/2020-12-18-learn-about-ghapi-a-new-third-party-python-client-for-the-github-api/

miked63017 · 2021-02-14T17:51:24Z

@npalm @mcaulifn here is a link, it's still pretty beta'ish and not well documented, but I guess we can say the same thing about the runners/actions API in general :-)

https://github.com/miked63017/pyghrunner

miked63017 · 2021-02-14T18:30:17Z

@mcaulifn in RE to the ghapi module it looks cool, but most of these calls are undocumented pieces of the api, and probably subject to change.

mcaulifn · 2021-03-02T21:02:04Z

Overall looks like it should work. Are you planning on adding it to this repo?

npalm · 2021-03-02T21:09:48Z

@gertjanmaas any opinion?

miked63017 · 2021-03-03T00:08:08Z

@npalm @mcaulifn not sure if I have the context to add it here to this repo, I am personally working in a GKE operator to do similar but figured it could help others to share some simple code to integrate with other projects. Seems to be a fairly common request for this functionality. If you'd like me to take a crack at adding here via a PR I can maybe spend some time this weekend.

gertjanmaas · 2021-03-03T07:11:42Z

I quickly skimmed through the python code and it seems to confirm what I saw when I tried to reverse engineer it a while ago. Would be great if this could be implemented here. Getting tired of adding offline runners by hand :P

miked63017 · 2021-03-03T23:34:12Z

@gertjanmaas where I am running (the equivalent) of this code(in a private library), we just run the few methods periodically, or in response to an event, and overwrite the previous "virtual runner". We are basically just using it as a placeholder so jobs queue rather than fail because no runners with labels exist. Then we look at the jobs details, and spin up the appropriate runner, with appropriate labels as needed, and with the --once flag.

I still have plans of further investigation into creating a full custom runner, most likely written in python, that can then be embedded in other places. This just hasn't been high priority for me yet.

dimisjim · 2021-03-10T00:42:14Z

How about runner deregistration?

The offline runner basically needs to be recreated every 30 days, in order to never have 0 runners in the org.

This should be automated as well.

boldandbusted · 2021-09-07T17:37:40Z

FYI, I'm running into this issue, still. At first I thought it might be because the team who has the runners isn't using them often, but thinking through the workflow a bit more, I realized GitHub is deleting the offline runner purely because the offline Runner Agent isn't 'phoning home'. :( I posted this in the GitHub Actions Discussions: https://github.community/t/github-actions-offline-runner-removed/191044/5?u=boldandbusted - so I hope that perhaps GitHub Engineering may offer some alternative approach.

If there is an automation that doesn't involve a PAT I'd love to hear it! :) We can limp along by manually bringing up the offline runner (or re-adding it every 29 days, which is kind of easier), but this isn't really a scalable path, sadly. I don't think this is the fault of this repo's code, BTW, to be clear, but an architecture deficit in the GitHub Runners approach.

npalm · 2021-11-09T21:35:25Z

Any help on automating this is still very welcome

miked63017 · 2021-11-09T22:46:18Z

If it helps, I have additional code that parses and decrypts all the communications, only thing missing is the ability to send logs back and forking off subprocceses. I haven't had time to do anything with it, but essentially I wanted to rewrite the runner itself in pure python which could make it a little more flexible.

axel3rd · 2022-01-04T17:28:41Z

I confirm Offline runner(s) is useless too since GHES v3.3(.1) series.

npalm added enhancement New feature or request help wanted Extra attention is needed labels Feb 6, 2021

npalm mentioned this issue Dec 29, 2021

docs: Offline runners are not required anymore. #1573

Merged

npalm linked a pull request Dec 29, 2021 that will close this issue

docs: Offline runners are not required anymore. #1573

Merged

npalm closed this as completed in #1573 Jan 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automate the creation of an offline runner #519

Automate the creation of an offline runner #519

npalm commented Feb 6, 2021

miked63017 commented Feb 8, 2021

npalm commented Feb 9, 2021

mcaulifn commented Feb 10, 2021 •

edited

miked63017 commented Feb 14, 2021

miked63017 commented Feb 14, 2021

mcaulifn commented Mar 2, 2021

npalm commented Mar 2, 2021

miked63017 commented Mar 3, 2021

gertjanmaas commented Mar 3, 2021

miked63017 commented Mar 3, 2021

dimisjim commented Mar 10, 2021

boldandbusted commented Sep 7, 2021

npalm commented Nov 9, 2021

miked63017 commented Nov 9, 2021

axel3rd commented Jan 4, 2022

Automate the creation of an offline runner #519

Automate the creation of an offline runner #519

Comments

npalm commented Feb 6, 2021

Potential solution direction

Register via ED2 instance

Reverse egineer github config process

Run the config in a lambda

miked63017 commented Feb 8, 2021

npalm commented Feb 9, 2021

mcaulifn commented Feb 10, 2021 • edited

miked63017 commented Feb 14, 2021

miked63017 commented Feb 14, 2021

mcaulifn commented Mar 2, 2021

npalm commented Mar 2, 2021

miked63017 commented Mar 3, 2021

gertjanmaas commented Mar 3, 2021

miked63017 commented Mar 3, 2021

dimisjim commented Mar 10, 2021

boldandbusted commented Sep 7, 2021

npalm commented Nov 9, 2021

miked63017 commented Nov 9, 2021

axel3rd commented Jan 4, 2022

mcaulifn commented Feb 10, 2021 •

edited