
Publicly Visible Simulation Results Archive for Benchmark Datasets from Synergy #1448

Open
rohitgarud opened this issue May 23, 2023 · 2 comments


@rohitgarud
Contributor

Feature Request

Is your feature request related to a problem? Please describe.
Researchers repeatedly run the same simulations on the benchmark datasets with the available models, wasting time and compute. These simulations play an important role in choosing models for their own datasets, whether those come from the same domains as the benchmark datasets or from different ones.

Describe the solution you'd like
An archive of simulation results (perhaps a GitHub repository or a page in the documentation) would make it easy to review the performance of different models on the benchmark datasets. This would support decision-making and spare new researchers who want to use ASReview on their own datasets from rerunning simulations on the benchmarks.

Teachability, Documentation, Adoption, Migration Strategy
I think a filterable table is the ideal way to present the simulation results. It should contain fields for the model configuration (feature extractor, classifier, balancer, and query strategy) and for the results (recall at different levels, WSS, ERF, and ATD), along with dataset information such as the dataset name, topic(s), number of records, and the number and percentage of included records. Other useful details could include who performed the simulation, the random seed, and the time required (with information about the hardware used). Adding the recall plots would be a plus.
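A minimal sketch of what one archived record could look like, in Python. All field names here are illustrative assumptions on my part, not a committed schema:

```python
from dataclasses import dataclass, field


@dataclass
class SimulationRecord:
    """One archived simulation run on a Synergy benchmark dataset.

    Every field name below is illustrative; the real schema would be
    whatever the archive project settles on.
    """

    # Dataset information
    dataset: str                          # e.g. "van_de_Schoot_2017"
    topics: list[str] = field(default_factory=list)
    n_records: int = 0
    n_included: int = 0                   # number of relevant records

    # Model configuration
    feature_extractor: str = ""           # e.g. "tfidf"
    classifier: str = ""                  # e.g. "nb"
    balancer: str = ""                    # e.g. "double"
    query_strategy: str = ""              # e.g. "max"

    # Results
    recall_at: dict[int, float] = field(default_factory=dict)  # recall@k
    wss_95: float = 0.0                   # Work Saved over Sampling at 95% recall
    erf: float = 0.0                      # Extra Relevant records Found
    atd: float = 0.0                      # Average Time to Discovery

    # Provenance
    performed_by: str = ""
    random_seed: int | None = None
    runtime_seconds: float = 0.0
    hardware: str = ""

    @property
    def pct_included(self) -> float:
        """Percentage of included (relevant) records."""
        return 100 * self.n_included / self.n_records if self.n_records else 0.0
```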

Such a table would let researchers quickly see which models to try first for their own simulations, depending on their domain and on factors such as the number of records and the expected number of relevant records.
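As a sketch of the "filterable table" idea: assuming the archive were also exported as a flat CSV with columns matching the record above (an assumption, not an existing file), a researcher could narrow it down to datasets resembling their own like this:

```python
import pandas as pd

# Hypothetical flat export of the archive; the filename and column
# names are assumptions based on the fields proposed above.
results = pd.read_csv("synergy_simulation_results.csv")

# Keep runs on datasets roughly the size of the user's own review,
# with a similarly low prevalence of relevant records.
similar = results[
    (results["n_records"].between(2_000, 10_000))
    & (results["pct_included"] < 5.0)
]

# Rank model configurations by average WSS@95 across those datasets.
ranking = (
    similar.groupby(["feature_extractor", "classifier", "query_strategy"])
    ["wss_95"]
    .mean()
    .sort_values(ascending=False)
)
print(ranking.head(10))
```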

@jteijema
Member

Hi @rohitgarud. We've been playing with this idea for a while and would love your input. Let's set this up as a collaboration!

@rohitgarud
Contributor Author

Great! How are you planning to develop the platform? We can discuss further details in the meeting.
