
"Dictionary Size changed during iteration" around application deployment time #446

Closed
baydoun0 opened this issue May 18, 2024 · 7 comments

baydoun0 commented May 18, 2024

Expected Behaviour

Flask Limiter should behave gracefully during application initialization.

Current Behaviour

Since we upgraded Flask-Limiter from 1.x to the latest version (3.5.1) a couple of weeks ago, we have consistently observed the following error on Sentry, roughly 50 times so far:

[Screenshot: Sentry report of the "dictionary changed size during iteration" error]

It happens around the time our application initializes after deployment.

[Screenshot: Sentry error occurrences clustered around deployment time]

Steps to Reproduce

I don't have concrete steps to reproduce for everyone; our application is in production under normal load and services quite a few requests. For us, the steps to reproduce are simply:

  1. Construct Blueprints with routes registered, some of which have rate limiting (this doesn't actually matter, since we observe failures even on endpoints that DO NOT have the limiter decorator).
  2. Register all those blueprints to a Flask App instance.
  3. Initialize Flask limiter as usual.

Concurrently, the application needs to be bombarded with requests.
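For context, here is a minimal sketch of that setup, assuming the standard Flask-Limiter 3.x deferred-initialization pattern; the route names, limit string, and Redis URI are illustrative, not taken from the report:

```python
from flask import Blueprint, Flask
from flask_limiter import Limiter
from flask_limiter.util import get_remote_address

# Deferred initialization: the Limiter is created first, attached to the app later.
limiter = Limiter(get_remote_address, storage_uri="redis://localhost:6379")

bp = Blueprint("api", __name__)

@bp.route("/limited")
@limiter.limit("10/minute")  # step 1: some routes are rate-limited...
def limited():
    return "ok"

@bp.route("/unlimited")  # ...but failures were observed on undecorated routes too
def unlimited():
    return "ok"

app = Flask(__name__)
app.register_blueprint(bp)  # step 2: register all blueprints
limiter.init_app(app)       # step 3: initialize Flask-Limiter last
```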

Your Environment

  • Flask-limiter version: 3.5.1
  • Flask version: 2.0.0
  • Operating system: Amazon Linux 2
  • Python version: 3.8

We're also using Redis as a backing store.

Note: as an attempted mitigation, we construct all blueprints, attach all routes to them, and configure ALL before_request and after_request hooks before initializing Flask-Limiter, but this hasn't helped. Flask-Limiter seems to kick in immediately (which is fine, but it should handle the race condition better).

Suggested Fix

The self._blueprint_exemptions field inside the LimitManager class is only modified in one place: the add_blueprint_exemption method. All that needs to be done is to add a lock and acquire it both when writing to and when iterating over self._blueprint_exemptions.
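A minimal sketch of that suggestion; the class and field names come from the issue, but the method bodies (including the blueprint_exemptions_snapshot helper) are simplified placeholders rather than Flask-Limiter's actual implementation:

```python
import threading
from collections import defaultdict

class LimitManager:
    def __init__(self):
        self._blueprint_exemptions = defaultdict(set)
        self._exemptions_lock = threading.Lock()  # hypothetical addition

    def add_blueprint_exemption(self, blueprint_name, scope):
        with self._exemptions_lock:  # serialize the only write path
            self._blueprint_exemptions[blueprint_name].add(scope)

    def blueprint_exemptions_snapshot(self):
        # Copy under the lock so callers can iterate without racing writers.
        with self._exemptions_lock:
            return dict(self._blueprint_exemptions)
```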

baydoun0 added the bug label May 18, 2024
alisaifee (Owner) commented May 19, 2024

If the blueprints have all been constructed (and decorated with any @exempt decorators) before you call limiter.init_app(app), it's a bit unclear to me how the extension's before_request hook could be registered, and thus executed, while the add_blueprint_exemption method is still being called.

baydoun0 (Author) commented May 19, 2024

We are not using the exempt decorator anywhere in our application; I double-checked. We use only the limit and request_filter decorators.

I can't actually figure out WHERE the calls to add_blueprint_exemption are happening; it doesn't seem to be called anywhere other than from an exempt decorator call, which we are not making.

YET, we see these errors in production.

Edit

I see now that _blueprint_exemptions is a defaultdict, so a simple access may perform a write. This makes it much harder for me to reason about the program. I made sure we only call init_app after all Blueprints are constructed and all calls to Flask-Limiter's decorators are complete. I don't think I'll have the time and resources to debug this further.
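A self-contained demonstration of that invisible write (not Flask-Limiter code): merely reading a missing key from a defaultdict inserts it, which is enough to break an iteration happening at the same time.

```python
from collections import defaultdict

d = defaultdict(set)
d["a"].add(1)

for key in d:   # iterate over the dict's keys...
    _ = d["b"]  # ...a "read" of a missing key silently inserts it,
                # so the next loop step raises
                # RuntimeError: dictionary changed size during iteration
```

In production the read and the iteration happen on different threads, but the mechanism is the same.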

By the way, it's worth noting that we observe these errors on endpoints both with and without Flask-Limiter decorators. I suspect the problem is that the _blueprint_exemptions map is being probed at request time for blueprints it has never seen before, and those implicit insertions cause the race condition. Any concurrent calls to _blueprint_exemption_scope and blueprint_limits could trigger it. Nothing in the init_app method of the Limiter class suggests that all Blueprints the app knows of will be accounted for up front.

I'm now more convinced that we need a lock to protect access to self._blueprint_exemptions in the LimitManager.

Edit 2

An alternative solution would be to drop the defaultdict here in favor of a dict.get(key, default_value) pattern, which does not perform an invisible write.
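A sketch of that alternative, again with names mirroring the issue rather than the library's real code: reads use dict.get and never mutate, and the only insertion happens on the explicit write path.

```python
class LimitManager:
    def __init__(self):
        self._blueprint_exemptions = {}  # plain dict, no default factory

    def add_blueprint_exemption(self, blueprint_name, scope):
        # Explicit write: the only place a new key is ever inserted.
        self._blueprint_exemptions.setdefault(blueprint_name, set()).add(scope)

    def exemptions_for(self, blueprint_name):
        # Pure read: an unknown blueprint yields an empty set, with no mutation.
        return self._blueprint_exemptions.get(blueprint_name, set())
```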

alisaifee (Owner) commented
Oh wow, I completely missed the defaultdict and this all makes sense now. Also, interesting that this hasn't been raised by anyone before! I'll work on a fix asap.

alisaifee (Owner) commented
Fix will be available in 3.7.0

baydoun0 (Author) commented May 19, 2024

Awesome! Already bumped on our end and deploying to production as we speak -- fingers crossed 🤞

alisaifee (Owner) commented
@baydoun0 please close the issue if/when you've verified that it no longer exists in your environment! Thank you.

baydoun0 (Author) commented
I can confirm that this is no longer happening in production, so the bug is resolved! Thank you so much for fixing it this fast!
