Skip to content

josephcappadona/spatialQA

Repository files navigation

spatialQA

Setup

python3 -m pip install -r requirements.txt

Usage

# generate data
python scripts/generate.py

# test model
python scripts/test.py t5-small

# analyze results
python scripts/analyze.py t5-small

generate.py outputs data.tsv.

test.py outputs a results TSV to results/results-MODEL-NAME.tsv.

analyze.py outputs a summary TSV to summary/summary-MODEL-NAME.tsv and an analysis TSV to anlysis/analysis-MODEL-NAME.tsv.

Computing graphs

# Combines result of different model
python scripts/make_dataframes.py analysis/ summary/

# Generate figures
python scripts/make_figures.py analysis/df_analysis.csv summary/df_summary.csv 

make_dataframes.py outputs a dataframe of combined analysis results to analysis/df_analysis.csv and a dataframe of combined summary results to summary/df_summary.csv

make_figures.py output the figures to figures

TODO

  • Refactor generation to make it simpler and more scalable

About

Analyzing the spatial reasoning skills of language models

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published