Scaling aggregation service #1255

Open
adhir111 opened this issue Apr 19, 2024 · 1 comment
Labels
aggregate developer-input Question/feedback raised by a developer and posted here on their behalf for public discussion

Comments

@adhir111

We need some details on the load-handling capacity of Aggregation Service.
The documentation has a guide for vertical scaling that helps choose the right instance type based on load.
We need help with scaling Aggregation Service horizontally in these scenarios:

  • a huge single batch

  • processing multiple batches in parallel (historical data processing use cases)

@apasel422 apasel422 added aggregate developer-input Question/feedback raised by a developer and posted here on their behalf for public discussion labels Apr 19, 2024
@maybellineboon

Hi @adhir111,

Thanks for reaching out. For future questions about Aggregation Service, we recommend reaching out via the Aggregation Service GitHub page so that the appropriate team can address them.

  1. For batches, especially large ones, we recommend checking out the sizing guide. It will help you estimate the instance size you need based on your number of reports and domain. Additionally, you might want to check out the batching strategies, which explain how to batch by shared ID and by frequency. We also advise batching by advertiser for the Attribution Reporting API to keep batch sizes manageable.

  2. For processing multiple batches in parallel: each aggregation batch/job is processed by a single instance, and a given batch/job cannot be split across multiple instances. If you have multiple batches running, Aggregation Service auto-scales and increases the number of instances up to max_capacity_ec2_instances. This setting is available in the <env>.auto.tfvars Terraform file and lets you customize the maximum number of instances created in your AWS account.
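As a rough planning aid, the two points above can be sketched in Python. This is only an illustration, not part of Aggregation Service: the function names and the report shape (a dict with an `advertiser` key) are hypothetical, and the wave estimate assumes that an instance works on one job at a time and that batches are of similar size.

```python
import math
from collections import defaultdict


def batch_by_advertiser(reports):
    """Group reports into one batch per advertiser.

    Hypothetical report shape: a dict with an "advertiser" key.
    """
    batches = defaultdict(list)
    for report in reports:
        batches[report["advertiser"]].append(report)
    return dict(batches)


def plan_parallel_run(num_batches, max_capacity_ec2_instances):
    """Estimate how many instances are used and how many waves are needed.

    Each batch/job runs on a single instance and cannot be split, so the
    number of busy instances is capped by both the number of batches and
    max_capacity_ec2_instances. Assumes one job per instance at a time.
    """
    if num_batches < 0 or max_capacity_ec2_instances < 1:
        raise ValueError("need num_batches >= 0 and max capacity >= 1")
    instances = min(num_batches, max_capacity_ec2_instances)
    waves = math.ceil(num_batches / max_capacity_ec2_instances)
    return instances, waves


if __name__ == "__main__":
    reports = [
        {"advertiser": "a.example", "id": 1},
        {"advertiser": "b.example", "id": 2},
        {"advertiser": "a.example", "id": 3},
    ]
    batches = batch_by_advertiser(reports)
    # The capacity value 20 is an example, not a recommendation.
    print(plan_parallel_run(len(batches), max_capacity_ec2_instances=20))
```

Under these assumptions, 50 historical batches with a cap of 20 would occupy 20 instances and run in 3 waves, so raising max_capacity_ec2_instances shortens a backlog of many batches, while a single huge batch still needs a larger instance type.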
