Skip to content

Zatura/clickstream-content-sessionization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Clickstream Content Sessionization

This is an undergraduate thesis for Bachelor in Computer Engineering

The idea of this work is to identify account sharing and classify anonymised user click streams by content, dividing a click stream into sub-sessions and attributing each session to it's current anonymised user.

Useful files

The data used for this work was the globo news features from globo dataset, t-SNE was used for dimentionality reduction, affinity propagation for clustering and euclidean distance as threshold.