Skip to content

amybalmforth/similarity-score

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

Task 1

In the data folder of this repo there is a CSV file called reactions.csv. It contains data corresponding to how users have reacted to (saved or skipped) jobs on the platform.

The reaction data consists of four columns:

  • user_id - the integer ID of the user who liked or disliked the job
  • job_id - the integer ID of the job the user interacted with
  • direction - whether the user liked (true) or disliked (false) the job
  • time - the timestamp corresponding to when they reacted to the job

Task: The similarity score between two users is the number of jobs which they both like. Find the two users with the highest similarity.

Answer: [(1791, 5121), 103)]

Task 2

In the data folder there is an additional CSV file called jobs.csv. It contains unique integer IDs for over 12,000 jobs, along with integer IDs for the job's associated company.

Task: The similarity score between two companies is the number of users who like at least one job at both companies. Using both the reactions.csv and jobs.csv data, find the two companies with the highest similarity score.

Answer: [(46, 92), 92)]

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages