Fix how to calculate `error_count` in `flags['update_logs']` for courses #5580

gabina · 2024-01-11T20:43:33Z

What is happening?

Some context

Courses have a flags field with some metadadata. In particular, the update_logs flag has information about course updates. The error_count flag inside the update_logs counts how many errors occurs during the update process. See UpdateLogger.update_course method to understand where that flag is updated.

The UpdateLogger.update_course method gets called during the UpdateCourseStats.initialize with the following hash as the new log parameter:

'start_time' => @start_time.to_datetime,
'end_time' => @end_time.to_datetime,
'sentry_tag_uuid' => sentry_tag_uuid,
'error_count' => error_count

That error_count value comes from UpdateServiceErrorHelper.error_count definition, which basically keeps a counter for the error count. That counter is incremented each time update_error_stats gets called, and currently that method is only invoked in the ApiErrorHandling.log_error method.

The problem

The LiftWingApi class implements the log_error_batch method to log errors in Sentry at the batch level. That means, when you call get_revision_data for a batch of revision ids, the errors for the individual calls are accumulated and, once we're done with the batch, we log all the errors (if any) in a single Sentry log. This is to prevent a systematic problem where every request is failing to saturate our Sentry event quota.

The problem here is that the error_count counter gets incremented at the batch level too. Notice that if there are 30 errors from a batch of 50 revisions, log_error_batch method will get invoked only once, meaning the same thing for ApiErrorHandling.log_error and thus for update_error_stats. Because of that, the error_counter will be incremented by one, when there were 30 new errors.

TLDR; one option could be to make UpdateServiceErrorHelper.update_error_stats method take in an optional argument new_errors_count to increment the @error_count by that number (and not always by 1).
Notice that the ApiErrorHandling.log_error already has the actual "new errors count" value in the sentry_extra parameter.

To Reproduce

The "tracks update errors properly in LiftWing" spec in spec/services/update_course_stats_spec.rb shows that the 'error_count' flag correspond to the number of batches instead of to the number of individual errors.

Expected behavior

flags['update_logs'][n]['error_count'] should be the real number of errors that occurred during the update process.

The text was updated successfully, but these errors were encountered:

gabina · 2024-01-12T17:02:40Z

rails and newcomer friendly are good labels for this issue (I don't think I have access to add them)

mit-27 · 2024-01-22T05:01:02Z

I would like to work on this issue. @ragesoss

Joice-crypto · 2024-02-20T05:18:30Z

Hello @gabina, I'm working on this and I have a question. Did you mean in the sentry_extra: { rev_ids:, project_code: wiki_key, project_model: model_key, error_count: @errors.count }) the error_count: @errors.count refers to the new errors count?

gabina · 2024-02-20T13:29:28Z

Yes, for LiftWingApi and ReferenceCounterApi classes the errors count is sent to the ApiErrorHandling.log_error through the sentry_extra[:error_count] parameter.

gabina added the bug label Jan 11, 2024

gabina mentioned this issue Jan 11, 2024

[Outreachy Round 27] Refactor LiftWingApi and RevisionScoreImporter #5578

Merged

ragesoss added newcomer friendly Rails labels Jan 16, 2024

Joice-crypto mentioned this issue Feb 26, 2024

The correction of how the 'error_count' is calculated in 'flags['update_logs']'. #5673

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix how to calculate `error_count` in `flags['update_logs']` for courses #5580

Fix how to calculate `error_count` in `flags['update_logs']` for courses #5580

gabina commented Jan 11, 2024

gabina commented Jan 12, 2024

mit-27 commented Jan 22, 2024

Joice-crypto commented Feb 20, 2024

gabina commented Feb 20, 2024

Fix how to calculate error_count in flags['update_logs'] for courses #5580

Fix how to calculate error_count in flags['update_logs'] for courses #5580

Comments

gabina commented Jan 11, 2024

What is happening?

Some context

The problem

To Reproduce

Expected behavior

gabina commented Jan 12, 2024

mit-27 commented Jan 22, 2024

Joice-crypto commented Feb 20, 2024

gabina commented Feb 20, 2024

Fix how to calculate `error_count` in `flags['update_logs']` for courses #5580

Fix how to calculate `error_count` in `flags['update_logs']` for courses #5580