Add a user id to the collector end point (was record ip/browser agent) #22

danmarsden · 2017-12-04T23:56:04Z

When we send the headers we are not yet logged in, so we'll send generic ones.
Later in the bootstrap, when we are logged in, let's see if we can overwrite the headers. If we can, replace the CSP report URI with a collector.php?user=xxx.
When that is present and a violation comes in, record it too. If it is not, we can fall back to some other method like IP or user agent.
Recording the user / IP / UA against each record will cause a combinatorial explosion in the amount of data we collect, so I'm not sure whether we stash this behind a setting or just wear it.
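A minimal sketch of the header idea above, in Python for illustration only (the plugin itself is not written in Python). The root-level `/collector.php` path, the `user` parameter name, and the `build_csp_header` helper are assumptions made for this example; only the `collector.php?user=xxx` shape comes from the comment.

```python
from typing import Optional, Tuple
from urllib.parse import urlencode

# Assumed location of the collector; the thread only names "collector.php".
COLLECTOR_URL = "/collector.php"


def build_csp_header(policy: str, user_id: Optional[int] = None) -> Tuple[str, str]:
    """Return a (name, value) pair for a report-only CSP header.

    Before login there is no user id, so the generic collector URL is used.
    Once a user id is known, it is appended as a query parameter.
    """
    report_uri = COLLECTOR_URL
    if user_id is not None:
        report_uri += "?" + urlencode({"user": user_id})
    # Strip any trailing "; " so the directive list is joined cleanly.
    value = f"{policy.rstrip('; ')}; report-uri {report_uri}"
    return ("Content-Security-Policy-Report-Only", value)
```

Calling `build_csp_header(policy)` early in the request and `build_csp_header(policy, user_id)` again after login mirrors the "send generic, then overwrite" flow described above.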

Original:

It would be really nice if the CSP logs also recorded the IP and browser agent strings.

brendanheywood · 2017-12-05T01:01:00Z

By design we didn't want to collect a ton of data; it's just updating counts of errors so you can see what's most important to fix first. If we record IPs and agents then this feels more like something to dump into the normal Moodle log, or maybe into a special log file or stderr with a fixed prefix so it can be easily filtered?
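As a rough illustration of the "stderr with a fixed prefix" option, here is a small Python sketch. The prefix string, the field names, and both function names are hypothetical choices for this example, not anything the plugin defines.

```python
import sys

# Hypothetical fixed prefix so lines can be filtered, e.g. with grep.
CSP_LOG_PREFIX = "CSP-VIOLATION"


def format_violation(blocked_uri: str, document_uri: str, ip: str, user_agent: str) -> str:
    """Build one filterable log line for a CSP violation."""
    return (
        f"{CSP_LOG_PREFIX} blocked={blocked_uri} document={document_uri} "
        f"ip={ip} ua={user_agent!r}"
    )


def log_violation(blocked_uri: str, document_uri: str, ip: str, user_agent: str) -> None:
    """Write the violation to stderr instead of storing it in the database."""
    print(format_violation(blocked_uri, document_uri, ip, user_agent), file=sys.stderr)
```

Keeping the per-request detail in a log stream like this would leave the database holding only aggregate counts, which matches the design goal stated above.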

danmarsden · 2017-12-05T01:57:52Z

We've noticed some strange behaviour related to a specific user (probably a hacked browser). It would be nice to have a way to identify this a bit better and match it to the specific user that triggered the event by looking at the CSP logs (and matching against IP/browser agent).

brendanheywood · 2019-10-01T07:05:07Z

I'm gonna close this. This feels weird to be related to the csp plugin. If it's still an issue please dump some data or more context so I can get a handle on why we'd want this

danmarsden · 2019-10-02T22:30:55Z

The CSP report was showing some extremely suspicious JavaScript being included in a page in violation of the CSP policy: obviously malicious JS that was being injected by a user's browser. It turned out the user's browser had been hacked, but tracking the specific user down was difficult because all the CSP report recorded was the URL accessed. If more information had been recorded at the time of the report, it would have been much easier to identify the user so that IT support could make contact and arrange for password resets of all their logins, a virus checker install, etc.

brendanheywood · 2019-10-03T06:50:49Z

That makes sense. You tracked it down via the access log then?

Also was that in report mode or in the real mode? Once we have settled on a policy and are enforcing it we could treat this more as a thing which alerts or collects data. I'm just worried about crazy amounts of data when you first turn this on, as a typical page could have 10 reports on it, multiplied by every page multiplied by every user.

danmarsden · 2019-10-03T07:34:01Z

only showed in report mode - in real mode you don't get the report at all, because the browser just blocks it.

Was a real pain to track down because the site was under heavy use, and lots of people were hitting the same pages/courses - we had to analyse the list of pages and then try and compare it against the current active users to find a similar pattern of access.

brendanheywood · 2019-11-21T02:49:25Z

I've softened up on this front so I'll reopen this. I've just come across the exact same scenario and now I can see properly how this is inadequate.

I think when you start out and still don't have a good policy in place you'll be spammed a lot, and I don't want to record everything. So I'm thinking about putting this behind a setting, so that it only records once you have a good policy in place and only a very small number of violations are being reported.

I think we can actually instrument this in a way to directly collect the user by adding a user id of some sort into the collection URI in the header.

brendanheywood · 2020-10-25T23:25:43Z

@danmarsden I've just pushed a micro change 10c52a6 which should get us most of the way. Strictly I'm not recording the uid for the user, I'm just tacking it onto the collector endpoint. I'm not doing anything with it after that, but this does let us easily trace these back via the normal access logs. There will still be some more matching up of things to fully link it, but this is about as good as we can get generically without a crazy performance hit.
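For anyone doing that tracing by hand, here is a hedged Python sketch of pulling collector hits out of a web server access log. It assumes the common/combined log format and a query parameter called `user`; the actual parameter name used in 10c52a6 is not stated in this thread, so adjust `param` to match. The `collector_hits` helper is hypothetical.

```python
import re
from typing import Iterable, List, Tuple
from urllib.parse import parse_qs, urlsplit

# Matches the leading part of a common/combined access log line:
# client IP, timestamp in brackets, then the quoted request line.
REQUEST_RE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] "(?P<method>\S+) (?P<target>\S+) [^"]*"'
)


def collector_hits(
    lines: Iterable[str], path_suffix: str = "collector.php", param: str = "user"
) -> List[Tuple[str, str, str]]:
    """Return (timestamp, ip, user id) for each request to the collector."""
    hits = []
    for line in lines:
        match = REQUEST_RE.match(line)
        if not match:
            continue  # Not a log line in the assumed format.
        target = urlsplit(match.group("target"))
        if not target.path.endswith(path_suffix):
            continue  # Some other page, not the CSP collector.
        values = parse_qs(target.query).get(param)
        if values:
            hits.append((match.group("time"), match.group("ip"), values[0]))
    return hits
```

The timestamps returned here still need to be lined up against the stored violation records, which is the "more matching up of things" mentioned above.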

brendanheywood · 2020-12-01T00:35:33Z

Just adding some more thoughts here. In #40 we want to give out finer-grained reports to course managers so they can see and fix their own content. However, we also want to clearly distinguish between content-level violations, which need to be fixed by the manager, and violations caused by browser plugins or other issues. The latter tend to be specific to individuals. If we collect data for everyone then we get this massive log explosion, so I'm thinking of a hybrid:

- we keep it as a single db record for each class of violation
- we have a small field which records a list of affected userids, could be comma sep and contain no more than say 5 ids
- if we collect more than 5 we just ignore it
- if a report has < 5 ids then we can assume that this is an issue related to a plugin rather than the content
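The hybrid above could be sketched roughly as follows, in Python for illustration. The `ViolationRecord` class, its field names, and `MAX_TRACKED_USERS` are hypothetical; the cap of 5 and the "< 5 ids" heuristic are taken from the list above.

```python
from dataclasses import dataclass, field
from typing import List, Optional

MAX_TRACKED_USERS = 5  # "no more than say 5 ids"


@dataclass
class ViolationRecord:
    """One row per class of violation, with a small capped list of user ids."""

    signature: str
    count: int = 0
    user_ids: List[int] = field(default_factory=list)

    def record(self, user_id: Optional[int]) -> None:
        """Count the violation and remember the user if there is still room."""
        self.count += 1
        if user_id is None or user_id in self.user_ids:
            return
        if len(self.user_ids) < MAX_TRACKED_USERS:
            self.user_ids.append(user_id)
        # Once the cap is reached, further user ids are simply ignored.

    def likely_user_specific(self) -> bool:
        """Fewer than the cap suggests a browser plugin rather than content."""
        return len(self.user_ids) < MAX_TRACKED_USERS

    def user_ids_csv(self) -> str:
        """Comma-separated form, suitable for a single small text field."""
        return ",".join(str(uid) for uid in self.user_ids)
```

Because the list stops growing at the cap, the stored size per violation class stays bounded no matter how many users trigger it.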

It's not perfect but feels like the right balance of performance and removing noise from the reports. These violations will still be visible to admins and we could also expose them to the course managers too via some warnings around how to interpret them properly.

brendanheywood added the wontfix label Oct 1, 2019

brendanheywood closed this as completed Oct 1, 2019

brendanheywood reopened this Nov 21, 2019

brendanheywood added enhancement and removed wontfix labels Nov 21, 2019

brendanheywood changed the title ~~record ip/browser agent~~ Add a user id to the collector end point (was record ip/browser agent) Nov 21, 2019

brendanheywood assigned Peterburnett Nov 25, 2019

brendanheywood added a commit that referenced this issue Oct 25, 2020

Append uid to collector endpoint #22

10c52a6

