Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EPIC: New Machine requirement: Replacement for Equinix x64 servers #3292

Open
5 of 6 tasks
sxa opened this issue Dec 7, 2023 · 6 comments
Open
5 of 6 tasks

EPIC: New Machine requirement: Replacement for Equinix x64 servers #3292

sxa opened this issue Dec 7, 2023 · 6 comments

Comments

@sxa
Copy link
Member

sxa commented Dec 7, 2023

I need to request a new machine:

  • New machine operating system (e.g. linux/windows/macos/solaris/aix): Linux and Solaris
  • New machine architecture (e.g. x64/aarch32/arm32/ppc64/ppc64le/sparc): x64
  • Provider (leave blank if it does not matter): TBC
  • Desired usage: Replacement for our x64 capacity at Equinix
  • Any unusual specification/setup required: Potentially a hypervisor that can be used to run the Solaris systems
  • How many of them are required: 3 (If equivalent size - around 48-64 core and capable of running multiple docker containers in parallel)

Please explain what this machine is needed for: Equinix have been sponsoring our infrastructure by providing a generous amount of capacity for the Adoptium infrastructure. This is now coming to and end and we need to make a plan for migrating our systems away from Equinix (Note: This does not affect the aarch64 Altras which are supplied as part of the Works On Arm project, but are hosted by Equinix)

This will involve migration of the following systems:

145.40.115.43 - VMware ESXi server (n3.large.x86 - London DC) and 147.28.133.218 - VMware ESXi server (m3.large.x86 - Paris DC)

These host our Solaris/x64 systems (including for Temurin Compliance) as well as other Linux VMs for performance test work on x64

London (145.40.x115.xx):

Paris (147.28.133.2xx):

  • jck-equinix_esxi-containerized-alpine317-x64-1
  • jck-equinix_esxi-solaris10-x64-1
  • jck-equinix_esxi-ubuntu2204-x64-1

dockerhost-equinix-ubuntu2004-x64-1 (AMD EPYC 7401P 24 core/48 thread - c2.medium.x86 - London DC)

Used for builds and hosting many containers:

dockerhost-equinix-ubuntu2204-x64-1 (Intel Xeon Gold 40 core - n3.xlarge.x86 - London DC)

Used for builds and hosting many containers

C3AWX - (c3.small.x86 - Amsterdam DC)

Hosts our AWX instance and also the c3jenkins agent used for intermediate work on jenkins pipelines (Replacement for the "Built in Node")

Issues for individual systems:

@sxa
Copy link
Member Author

sxa commented Jan 8, 2024

Noting for C3awx:

  • /var/lib/docker (AWX) is using 200Gb, although there seems to be only about 50Gb in use, mostly /Vendor_Files which has 43Gb - 33Gb is Visual Studio archives, 8Gb is Xcode 11.7, and about 3Gb for AIX. Going forward we sould probably try and use a mapped volume for this directory instead of hosting it within the container to make it easier to update. But overall about 100Gb and 1-2 cores would be fine for this system RAM uncertain
  • Jenkins workspace is using 48Gb, but RAM and CPU requirements likely low. Maybe 2 core, 4Gb would cover it with a decent amount of disk (100-200G). Probably doesn't need to be on x64 ...

@sxa
Copy link
Member Author

sxa commented Jan 9, 2024

i'm going to drop the number of executors on the two dockerhost x64 machiens to 1 for now to see if that causes any additional delays. Some of the jobs do use multiple executors as per the screnshot below but those arne't time critical jobs (unless thye hold up others)
image

@sxa
Copy link
Member Author

sxa commented Mar 7, 2024

@sxa
Copy link
Member Author

sxa commented Mar 28, 2024

Both equinix dockerhost x64 systems have now been shut down and the containers on them removed from jenkins.

@sxa
Copy link
Member Author

sxa commented Apr 24, 2024

c3-awx and vmware-esxi7 have been shut down and removed from the inventory (left for a few days just to ensure nothing is using them.

This only leaves vmware-esxi7-2 which hosts some of the Temurin Compliance systems and can go once @fredg02 confirms that we have the replacement Solaris boxes live and working on there

@sxa sxa added this to the 2024-04 (April) milestone Apr 24, 2024
@sxa sxa modified the milestones: 2024-04 (April), 2024-05 (May) May 13, 2024
@steelhead31
Copy link
Contributor

jck-equinix-solaris-x64-1 has now been shutdown and replaced.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: In Progress
Development

No branches or pull requests

2 participants