Automatically cut off database-readable logs by id & job_id age #108

meatballhat · 2017-04-24T16:15:45Z

The idea here is to introduce an automatic window of time for reading logs from the logs database for purposes of being able to drop older records. With this change, records with id or job_id lower than the cutoff will automatically be assumed to be "archived", meaning they will be read from S3 by travis-api. In reality, this tends to happen within ~3h of job completion, so this change is mostly about defining a window of time within which we allow logs to be mutated, as is done via job restart.

other humans are OK with this idea
we have a plan for if/how to message this via web and cli

igorwwwwwwwwwwwwwwwwwwww · 2017-04-24T16:19:48Z

@meatballhat can you give a brief description of:

the existing behaviour
the proposed behaviour
the reason for the change

meatballhat · 2017-04-24T16:27:41Z

@igorwwwwwwwwwwwwwwwwwwww I was backfilling while you commented. Sorry about the delay.

renee-travisci

looks good to me. I'm a big fan of limiting the amount of back data we allow a user to display through the web UI. However, I think we should see how many customers try to get logs older than 6 months to understand better how many customers this will impact. I assume if Enterprise starts using the logs API their customers may want to set a longer back date, but they may also want to turn off the extra query here and allow everything - something for enterprise to answer.

lib/travis/logs/config.rb

@@ -52,6 +52,7 @@ class Config < Travis::Config
          log_parts_autovacuum_vacuum_scale_factor: 0.001,
          log_parts_autovacuum_vacuum_threshold: 0,
          min_messages: 'warning',
+          min_readable_cutoff_age: 60 * 60 * 24 * 180,


meatballhat · 2017-04-24T16:43:22Z

@renee-travisci I'm sorry for not being more explicit about this, but this change is not intended to alter reading logs data via web/cli, but rather only to change how long we'll allow it to be mutated.

acnagy · 2017-04-24T17:13:25Z

@meatballhat @renee-travisci Ah.... this is interesting, and it makes more sense (thanks for clarifying @meatballhat!). From my experience looking at users' job stats, people generally don't mutate/restart a job more than a few days to a week. Generally, old jobs are hard to find because they get buried on the web ui.

However, conceivably, someone who's not been using Travis much will want to restart a very old job when they resume working on a project. I'm not sure how frequently that happens, if at all, but I assume if we documented the behavior pretty clearly in the docs, people would understand

meatballhat · 2017-04-24T17:33:04Z

@acnagy I have a lot of sympathy for folks who need/want to restart a job that ran more than a few months in the past.

We could decide to start with a cutoff of something like 2 years, then maybe tighten it up over time? I suspect much of the potential pain could be avoided if we were to change our default mode of mutating build, job, and log records to instead create new records, but I think that's a more involved change.

svenfuchs · 2017-04-24T18:15:48Z

Personally I only ever restart jobs when they error, and I gotta get CI green.

@acnagy I think in the example of picking up a project after a number of months it would be pretty unlikely I'm interested in restarting the old stuff. I can't think of a single case like that. Instead I'd move forward and create new commits/builds?

acnagy · 2017-04-24T19:52:37Z

@meatballhat @svenfuchs I think it's a matter of workflow... some people create new commits, and I feel like I've emailed with someone who just jumped in and restarted... Can't remember exactly though...

That said, I'm not sure it's worth supporting the restarting-very-old-builds workflow very much. The problem is, if they restart after a long time, the image/dependencies could have changed, and then they could get new errors... which means they'll end up needing to do more commits anyway. I know the #reproducibility-study people run into these issues... So, I think 6 months is probably a fairly smart cut-off, we just need to document it

emdantrim · 2017-06-15T20:35:02Z

Bumping this in the interest of getting documentation sorted out and getting this PR merged.

Where in the docs do you think this belongs?

Dan Buch added 3 commits April 17, 2017 18:03

Implement automatic age-based cutoff exclusion

023a40e

Merge remote-tracking branch 'origin/master' into meat-automatic-cutoff

7421915

Merge remote-tracking branch 'origin/master' into meat-automatic-cutoff

781d47a

BanzaiMan added the in progress label Apr 24, 2017

meatballhat requested review from svenfuchs, joshk and renee-travisci April 24, 2017 16:16

meatballhat self-assigned this Apr 24, 2017

meatballhat requested a review from igorwwwwwwwwwwwwwwwwwwww April 24, 2017 16:16

Remove hstore remnants

4bca076

renee-travisci approved these changes Apr 24, 2017

View reviewed changes

lib/travis/logs/config.rb Outdated

@@ -52,6 +52,7 @@ class Config < Travis::Config

log_parts_autovacuum_vacuum_scale_factor: 0.001,

log_parts_autovacuum_vacuum_threshold: 0,

min_messages: 'warning',

min_readable_cutoff_age: 60 * 60 * 24 * 180,

This comment was marked as spam.

Sign in to view

meatballhat and others added 5 commits April 25, 2017 22:19

Merge branch 'master' into meat-automatic-cutoff

6c22c7c

Use numeric and time helpers to clarify some config settings

e718990

Merge branch 'master' into meat-automatic-cutoff

75dfbac

Merge branch 'master' into meat-automatic-cutoff

8eba2cc

Merge branch 'master' into meat-automatic-cutoff

aa143e5

meatballhat added the awaiting-review label Jun 15, 2017

Merge branch 'master' into meat-automatic-cutoff

7ef30a8

solarce removed the awaiting-review label Jul 19, 2017

vitalied force-pushed the master branch 3 times, most recently from f411adb to 943babc Compare January 12, 2023 20:11

vitalied force-pushed the master branch 3 times, most recently from 2b9d5ee to d611620 Compare January 17, 2023 14:19

vitalied force-pushed the master branch from d611620 to fc60172 Compare July 4, 2023 07:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Automatically cut off database-readable logs by id & job_id age #108

Automatically cut off database-readable logs by id & job_id age #108

meatballhat commented Apr 24, 2017 •

edited

igorwwwwwwwwwwwwwwwwwwww commented Apr 24, 2017 •

edited

meatballhat commented Apr 24, 2017

renee-travisci left a comment

This comment was marked as spam.

meatballhat commented Apr 24, 2017 •

edited

acnagy commented Apr 24, 2017

meatballhat commented Apr 24, 2017

svenfuchs commented Apr 24, 2017

acnagy commented Apr 24, 2017

emdantrim commented Jun 15, 2017

Automatically cut off database-readable logs by id & job_id age #108

Are you sure you want to change the base?

Automatically cut off database-readable logs by id & job_id age #108

Conversation

meatballhat commented Apr 24, 2017 • edited

igorwwwwwwwwwwwwwwwwwwww commented Apr 24, 2017 • edited

meatballhat commented Apr 24, 2017

renee-travisci left a comment

Choose a reason for hiding this comment

This comment was marked as spam.

meatballhat commented Apr 24, 2017 • edited

acnagy commented Apr 24, 2017

meatballhat commented Apr 24, 2017

svenfuchs commented Apr 24, 2017

acnagy commented Apr 24, 2017

emdantrim commented Jun 15, 2017

meatballhat commented Apr 24, 2017 •

edited

igorwwwwwwwwwwwwwwwwwwww commented Apr 24, 2017 •

edited

meatballhat commented Apr 24, 2017 •

edited