Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Understand processing times #162

Open
2 of 3 tasks
dcsw2 opened this issue Sep 29, 2022 · 4 comments
Open
2 of 3 tasks

Understand processing times #162

dcsw2 opened this issue Sep 29, 2022 · 4 comments
Assignees

Comments

@dcsw2
Copy link
Collaborator

dcsw2 commented Sep 29, 2022

Goal: document processing times for different settings in the pipeline (esp. DeezyMatch vs. perfectmatch)

Use sample set of articles from The Sun for this test.

TASKS

Structure of "report":

  • end to end time
  • for a smaller sample, timings for each step
@dcsw2 dcsw2 created this issue from a note in Applications (To do) Sep 29, 2022
@dcsw2
Copy link
Collaborator Author

dcsw2 commented Oct 4, 2022

Kaspar's random sample from The Sun:
randsample0002194.csv

@fedenanni
Copy link
Contributor

The four versions of the pipelines are here as jupyter notebooks: https://github.com/Living-with-machines/toponym-resolution/tree/dev/experiments

@kmcdono2 kmcdono2 moved this from To do to In progress in Applications Oct 4, 2022
@kmcdono2
Copy link
Collaborator

Now waiting for updates to including candidates

@dcsw2 dcsw2 moved this from In progress to To review in Applications Feb 21, 2023
@kmcdono2
Copy link
Collaborator

@npedrazzini has done a nice analysis of processing times across the different T-Res methods: https://docs.google.com/spreadsheets/d/1ymjGPubsjq93VmCakBDxYOQW_TCXXxECG1OLG5n1YV8/edit#gid=175352557

@kmcdono2 kmcdono2 moved this from To review to Done in Applications Feb 24, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Development

No branches or pull requests

4 participants