
dl.hhvm.com's Shared Storage

What is it?

It is an AWS Elastic Block Store (EBS) volume (a network block device) in us-west-2a; it contains an XFS filesystem and can only be mounted by one host at a time.

In particular, it is usually mounted by our 'PublishBinaryPackages' jobs.
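
The volume's size, state, and current attachment can be checked with the AWS CLI; this uses the volume ID from the attach snippet at the end of this document, and assumes configured credentials with EC2 read access:

```sh
# Inspect the dl.hhvm.com XFS volume: size, availability zone, state,
# and which instance (if any) currently has it attached.
aws ec2 describe-volumes \
  --region us-west-2 \
  --volume-ids vol-096d2c45d13d3a865
```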

Why does it exist?

Some tools need to operate on a local filesystem, such as reprepro, which manages apt repository metadata for Debian and Ubuntu; reprepro in particular needs a full copy of the filesystem to work on.

This leads to two problems:

  • As of 2022-03-30, the data is roughly 2TB
  • It is really slow to download this from S3; I once cancelled a download after 6 hours (when mounting had failed and the failure wasn't handled properly), compared to under 30 minutes for a usual run with the shared filesystem

In principle, our solution is:

  1. We mount the shared filesystem
  2. We use aws s3 sync to get the shared filesystem in sync with S3, using S3 as the source of truth
  3. We run the commands we need to do on this shared filesystem, e.g. adding .deb packages to the repositories and updating the metadata
  4. We use aws s3 sync again to copy from the shared filesystem to S3 (a sketch of this round trip follows this list)

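A minimal sketch of that round trip, assuming a hypothetical bucket name; the mount point and device name match those used in the instructions below:

```sh
# Hypothetical bucket name, for illustration only.
BUCKET="s3://example-dl-hhvm-bucket"
MOUNT="/mnt/dl.hhvm.com"

# Steps 1-2: mount the shared volume, then bring it in sync with S3,
# treating S3 as the source of truth.
mount /dev/xvdf "$MOUNT"
aws s3 sync "$BUCKET" "$MOUNT"

# Step 3: operate on the shared filesystem, e.g. add .deb packages to
# the repositories with reprepro (details elided).

# Step 4: copy the results back up to S3.
aws s3 sync "$MOUNT" "$BUCKET"
```
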
We use a network block device and XFS instead of AWS's Elastic File System because, once mounted, the EBS+XFS approach behaves like a local POSIX-compatible filesystem.

What's the new problem?

Hack/HHVM grow over time, and eventually the storage fills up. While EBS and XFS are resizable, resizing is not automatic. It could be automated by our tooling; however, it is only needed every few years, so automation seems more likely to bitrot or cause unintended side effects than to help.

Making the shared storage larger

Overview

  1. Make the EBS volume bigger - expand the existing volume, do not delete and recreate it
  2. Spin up an EC2 instance manually in the same availability zone as the volume, i.e. us-west-2a, not just us-west-2
  3. Attach the volume to the instance; this cannot be done while any other EC2 instance - such as a PublishBinaryPackages job - has it attached
  4. SSH to the EC2 instance
  5. Find the device name for the block device - this may be shown in the EC2 web UI; otherwise, check dmesg and /var/log/cloud-init-output.log
  6. Mount the filesystem
  7. Use the standard XFS tools to non-destructively enlarge the filesystem (XFS filesystems are grown while mounted; see the sketch after this list)
  8. Check that df -h shows the expected results. For example, if you resized from 2TB to 4TB, you would expect a 4TB filesystem at 50% usage
  9. shutdown -h now
  10. Ensure that the EC2 instance is 'terminated' via the web UI. This probably happened automatically when the instance was shut down
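
In shell terms, steps 6-8 boil down to roughly the following; the device name is assumed to be /dev/xvdf as in the details below, and recent xfs_growfs takes the mount point as its argument (older versions also accepted the device node - check man xfs_growfs first):

```sh
# Mount first: XFS filesystems are grown while mounted.
mkdir -p /mnt/dl.hhvm.com
mount /dev/xvdf /mnt/dl.hhvm.com

# Dry run (-n): report what would be done without changing anything.
xfs_growfs -d -n /mnt/dl.hhvm.com

# Grow the data section (-d) to fill the enlarged volume.
xfs_growfs -d /mnt/dl.hhvm.com

# Confirm the new size and usage.
df -h /mnt/dl.hhvm.com
```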

Details

These instructions are accurate as of 2022-03-30, but they depend on third-party tools, APIs, and web interfaces; it is likely that they cannot be followed verbatim the next time this needs doing. Keep the context above in mind when following or adapting these instructions in the future.

  1. Connect to the AWS management console
  2. Set region to Oregon/us-west-2 in the top right
  3. Go to EC2 from 'Services' in the top left
  4. Under 'Elastic Block Store' on the left, select 'Volumes'
  5. Select the 'dl.hhvm.com XFS' volume
  6. In the top right, actions -> modify volume
  7. Enter the new desired size. Historically, I've been doubling this each time.
  8. Click modify
  9. You should now be back to the EBS Volumes page; go to Instances -> Instances in the left navigation
  10. Click 'launch instance' in the top right
  11. Choose an image; as long as it's 64-bit x86 Linux, it doesn't really matter, but I'm using "Ubuntu Server 20.04 LTS". Click 'Select'.
  12. Choose an instance type (machine class). The default is usually fine for this, but pick something bigger if that doesn't work. Click "Next: Configure Instance Details"
  13. Change the Subnet to the us-west-2a subnet
  14. Click "Next: Add Storage"
  15. Increase the root volume from the tiny default value to something reasonable. I go for 1024GiB, which is unnecessarily large, but we're not going to keep this instance running for long.
  16. You may get a warning about operating system support for large root volumes; this can be ignored on any recent Linux.
  17. Click "Next: Add Tags"
  18. Add a Name tag, e.g. fredemmott-dev; this shows up in the EC2 instance list
  19. Click "Review and Launch"
  20. You will get warnings about ssh being open to 0.0.0.0/0, and that it will not be free. Ignore these.
  21. Click 'Launch'; you will be prompted to select an SSH key pair that you have access to, or create (and download) a new one. Either works.
  22. Acknowledge the warning that you need the private key, then click 'Launch Instances'
  23. Wait for it to start (instance state == "Running")
  24. Go back to Elastic Block Store -> Volumes in the AWS Web UI
  25. Select dl.hhvm.com XFS
  26. Actions -> attach volume
  27. Select your new instance
  28. You may see an information box about device naming differing between kernel versions: as of 2022-03-30, the value given to AWS should be of the form /dev/sdf, but it will appear as /dev/xvdf inside the instance. This applies to our use, but is informational - it does not require action.
  29. Click 'attach volume'
  30. ssh to your instance, e.g. ssh ubuntu@PUBLIC_IP, or ssh ec2-user@PUBLIC_IP for other images
  31. Check the device exists, e.g. ls /dev/xvdf - there should usually be two devices, /dev/xvda and your device, /dev/xvdf. There will also likely be /dev/xvda1 which is a partition of /dev/xvda. If it's unclear:
  • the "Attach" page in the web UI hopefully gave you the needed information
  • search current EBS documentation for the device names + AWS + EBS
  • check the output of mount to see which devices are already in use
  32. As of 2022-03-22, the required XFS utilities are installed by default; if not, the package is usually called xfsprogs and can be installed with yum/apt/...
  33. mkdir /mnt/dl.hhvm.com; mount /dev/xvdf /mnt/dl.hhvm.com (changing the device as needed)
  34. Check that df -h shows the expected old size
  35. Open man xfs_growfs inside the VM to check that the version you are using supports the arguments used in all of the following commands
  36. Run xfs_growfs -d /dev/xvdf -n; -d grows the data section to the maximum size, and -n means 'no change to the filesystem is to be made', i.e. a dry run
  37. If the values look good, run again without -n: xfs_growfs -d /dev/xvdf
  38. It should print out the new data size
  39. Check that df -h shows the expected new size and usage
  40. umount /mnt/dl.hhvm.com
  41. Double-check: mount /dev/xvdf /mnt/dl.hhvm.com; df -h
  42. Unmount again
  43. Shut the instance down: shutdown -h now
  44. If shutting down the instance does not change the instance state to 'terminated' automatically, manually terminate it through the EC2 web UI (or the CLI; see the sketch after this list)
  45. Once the instance state is 'terminated' (not "shut down" or similar), the EBS volume will be automatically detached and can be used by other instances, e.g. the usual build jobs
  46. Re-run any publish jobs that failed
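
If the web UI is inconvenient, the termination steps can also be done with the AWS CLI; a sketch, assuming a hypothetical instance ID:

```sh
# Hypothetical instance ID; substitute the one from the launch step.
INSTANCE_ID="i-0123456789abcdef0"

# Terminate the instance in case shutting it down did not already do so...
aws ec2 terminate-instances --instance-ids "$INSTANCE_ID"

# ...then confirm it has reached the 'terminated' state, at which point
# the EBS volume is detached and available to other instances again.
aws ec2 describe-instances \
  --instance-ids "$INSTANCE_ID" \
  --query 'Reservations[].Instances[].State.Name'
```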

The volume can alternatively be attached from the AWS CLI, but you will need to configure AWS CLI credentials:

```sh
INSTANCE_ID=$(curl --retry 5 http://169.254.169.254/latest/meta-data/instance-id)
# We keep a shared XFS volume as the source of truth to avoid having to
# redownload from S3 every day, which can take a *really* long time.
VOLUME_ID="vol-096d2c45d13d3a865"
aws ec2 attach-volume \
  --device /dev/sdf \
  --instance-id "$INSTANCE_ID" \
  --volume-id "$VOLUME_ID"
```
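
If you attached the volume this way for manual maintenance, the corresponding detach is sketched below; it is safe once the filesystem is unmounted, and unnecessary if you terminate the instance, since termination detaches the volume automatically:

```sh
# Detach the shared volume so that other instances, e.g. the publish
# jobs, can attach it; run this only after unmounting the filesystem.
aws ec2 detach-volume --volume-id "$VOLUME_ID"
```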