Download

Our files are hosted on Google Drive. The easiest way to download them is with gdown (installable with pip install gdown). Alternatively, you can download them with a browser from here.

Please see DATA.md for detailed data formats.

Dataset Files

Extract the data files into FGVE/data.

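cd FGVE/data
# Download and unpack the dataset archive
gdown 1MYdHhBEribsIrkQkYoBYeSQa5jQTSdRS
unzip data.zip
# The archive unpacks into a nested data/ folder; move its contents up one level
mv data/* .
# Remove the now-empty folder and the archive
rm -r data
rm data.zip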

Pre-Extracted Image Features

We extract the Flickr30K image features with a pretrained Faster R-CNN ResNeXt152-C4 detector, following the instructions in pzzhang/VinVL and microsoft/scene_graph_benchmark, and then convert the resulting files to .npy format.

Note that the Flickr30K dataset includes images obtained from Flickr. Use of the images must abide by the Flickr Terms of Use.

You can download pre-extracted image features with the following script.

cd $FEATURE_DIR
gdown 1hqyeBTKeDbjUwSJWSoxAUV6q4lT7WdqZ
unzip f30k_features.zip
rm f30k_features.zip
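
The script above, and the checkpoint scripts below, assume that $FEATURE_DIR and $CHECKPOINT_DIR already point to existing directories; neither variable is defined in this file. A minimal sketch of setting them up before running the download commands follows. The default paths are hypothetical placeholders, so substitute any locations with enough free space.

```shell
# Assumption: these default paths are placeholders, not paths required by the project.
# Values already exported in the environment take precedence.
export FEATURE_DIR="${FEATURE_DIR:-$PWD/features}"
export CHECKPOINT_DIR="${CHECKPOINT_DIR:-$PWD/checkpoints}"

# Create both directories if they do not exist yet.
mkdir -p "$FEATURE_DIR" "$CHECKPOINT_DIR"
```

If the training or evaluation scripts expect these directories elsewhere, export the variables with those paths instead of relying on the defaults.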

Model Checkpoints

You need the checkpoint pretrained by Oscar+ whether or not you plan to train. If you only want to run evaluation, only its vocab (tokenizer) files are needed. Please see here for details.

We modified this checkpoint to add a third token type embedding, initialized from the text's token type embedding. You can download the modified checkpoint with the script below.

cd $CHECKPOINT_DIR
gdown 1Edxw-PTFNJU9koJ5AiJGiMDZRNGGBR-O
unzip pretrained_ckpt.zip
rm pretrained_ckpt.zip

We also release our best model's checkpoint (denoted Ours or Ours+CLS in the paper).

cd $CHECKPOINT_DIR
gdown 1er6A9kr1zVzyxV6HJU_1zAelghfkQwK0
unzip final_ckpt.zip
rm final_ckpt.zip
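
Once the downloads finish, a quick check can confirm that the feature archive was actually unpacked. The sketch below counts the .npy files under the feature directory. The .npy extension comes from the feature description above, while the count_npy helper and the fallback to the current directory are assumptions made for illustration. The expected number of files is not stated in this document, so the script only reports the count.

```shell
# Hypothetical helper: count .npy files under a directory, including subdirectories.
count_npy() {
    find "$1" -type f -name '*.npy' | wc -l | tr -d ' '
}

# Falls back to the current directory if FEATURE_DIR is unset.
echo "feature files found: $(count_npy "${FEATURE_DIR:-.}")"
```

A count of 0 usually means the archive was unpacked somewhere else or the download did not complete.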