Road Network kNN Experimental Framework

This project contains implementations of several kNN algorithms for road networks, together with the experimental framework used to compare them. It has been released primarily so that readers can reproduce the results of our paper, to appear at VLDB 2016 (details to be updated), and so that it can be used in future studies. If you use the code, please cite our paper. Please follow the Requirements below carefully. If you run into any issues, please check the FAQ and contact us.

Requirements

Software

Before the executables can be compiled, the following packages/libraries must be installed. If you are using Ubuntu, note that we were able to install METIS from the Ubuntu repositories.

g++ version 4.9 or higher
Boost version 1.57 or higher for serialization
[METIS](http://glaros.dtc.umn.edu/gkhome/metis/metis/download) version 5.1 or higher
CMake version 2.8 or higher
gnuplot for figure generation
Optional: [Google Sparsehash](http://code.google.com/p/sparsehash/) for comparison of G-tree hashtable implementations

Note: We found that Boost versions lower than 1.57 did not correctly support serialization of some STL data structures (such as std::unordered_map), so they cannot be used here. This means you need to remove any existing Boost version, e.g. by uninstalling libboost-all-dev through your package manager.
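
As a quick sanity check before compiling, the minimum versions listed above can be compared against what is installed. The following is a minimal sketch and not part of the repository: the helper name version_ge is hypothetical, and it assumes a sort that supports -V (as in GNU coreutils). Boost and METIS are left out because how their versions are exposed depends on how they were installed.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): succeeds when
# version $1 is greater than or equal to version $2.
# ASSUMPTION: `sort -V` (version sort) is available, as in GNU coreutils.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Report on g++ only if it is on the PATH.
if command -v g++ >/dev/null 2>&1; then
    gxx_version=$(g++ -dumpversion)
    if version_ge "$gxx_version" 4.9; then
        echo "g++ $gxx_version: OK"
    else
        echo "g++ $gxx_version: too old, need 4.9 or higher"
    fi
fi

# Report on CMake only if it is on the PATH.
if command -v cmake >/dev/null 2>&1; then
    # The first line of the output looks like "cmake version X.Y.Z".
    cmake_version=$(cmake --version | head -n 1 | awk '{print $3}')
    if version_ge "$cmake_version" 2.8; then
        echo "CMake $cmake_version: OK"
    else
        echo "CMake $cmake_version: too old, need 2.8 or higher"
    fi
fi
```

If several g++ versions are installed, the check only covers the one found first on the PATH, which may differ from the compiler configured in CMakeLists.txt.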

System & Hardware

We recommend a 64-bit OS due to the size of indexes (and we have not tested on 32-bit). Furthermore 32GB RAM is required to reproduce all experiments as they are in the paper. Results can be reproduced using a smaller amount of RAM by omitting some of the larger datasets (see below). Also ensure that there is 200GB of disk space freely available on the hard disk storing indexes (for both travel distance and travel time experiments).
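
To see how a machine compares with these figures, total RAM and free disk space can be read from the system. This is a minimal sketch under stated assumptions, not part of the repository: the function names are hypothetical, the RAM check relies on the Linux-specific /proc/meminfo, and "." stands in for whatever directory will hold the indexes.

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not part of the repository).

# Reads /proc/meminfo-style text on stdin and prints total RAM in whole GiB.
mem_total_gib() {
    awk '/^MemTotal:/ { printf "%d\n", $2 / 1024 / 1024 }'
}

# Prints the free space, in whole GiB, on the file system holding directory $1.
disk_free_gib() {
    df -Pk "$1" | awk 'NR == 2 { printf "%d\n", $4 / 1024 / 1024 }'
}

# Linux only: /proc/meminfo does not exist on other systems.
# The kernel reserves some memory, so a 32GB machine usually reports
# slightly less than 32 here.
if [ -r /proc/meminfo ]; then
    echo "Total RAM: $(mem_total_gib < /proc/meminfo) GiB (32GB needed for all experiments)"
fi

# "." is a placeholder; pass the directory that will store the indexes.
echo "Free disk: $(disk_free_gib .) GiB (200GB needed for all indexes)"
```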

Files

The C++ source code can be found in the top-level RN-kNN-Exp directory, and in the subdirectories command, external, processing, queue, tuple and utility. All bash scripts for various tasks such as setting up the directories, running experiments, and generating figures are found in the scripts subdirectory.

Compilation

Change CMAKE_C_COMPILER and CMAKE_CXX_COMPILER in CMakeLists.txt to point to the correct g++ version (i.e. 4.9 or higher). CMakeLists.txt can be found in the RN-kNN-Exp directory.
Create a directory called build in the top-level (e.g. RN-kNN-Exp/) directory
```
mkdir build
```
Change into this directory and generate makefiles using CMake
```
cd build
cmake -G "Unix Makefiles" ..
```
Compile all executables using make
```
make
```

Setup

Open the globalVariables bash script in the scripts directory in your favourite editor
Change the output_path variable to the full path of an existing location (i.e. create it first) where you wish to store all data (e.g. indexes, object sets, figures, etc.)

Note: This can be different to the path containing the code (recommended)
Change the exe_path variable to the full path of the build directory created above (e.g. /home/user/Downloads/rn_knn/build)
Run resetExperimentalSetup in the scripts directory to create all necessary sub-directories
```
cd scripts
bash resetExperimentalSetup
```
Download the DIMACS distance edge-weight graph files (with extension .gr.gz and prefixed by USA-road-d) and coordinate files (with extension .co.gz) for DE, VT, ME and NW from https://github.com/tenindra/RN-kNN-Exp-Data to the output_path/data/dimacs directory
Download the DIMACS distance edge-weight graph files and coordinate files for COL, CAL, E, W, CTR, USA from http://www.dis.uniroma1.it/challenge9/ to the output_path/data/dimacs directory
Download the node (with extension .cnode) and edge (with extension .cedge) files for North America (NA) from http://www.cs.utah.edu/~lifeifei/SpatialDataset.htm to the output_path/data/tpq directory

Note: You must gzip these files so that output_path/data/tpq directory contains NA.cnode.gz and NA.cedge.gz
Download the real_world_pois.tar.gz from https://github.com/tenindra/RN-kNN-Exp-Data to the output_path directory
Extract the real_world_pois.tar.gz archive (this should create an output_path/real_world_pois directory populated with subdirectories containing POI sets for several road networks)
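
Before moving on, it can help to confirm that the downloaded files ended up where the steps above say they should be. The following is a minimal sketch and not part of the repository: the function name check_input_files is hypothetical, and the exact DIMACS file names (USA-road-d.NET.gr.gz and USA-road-d.NET.co.gz for a network NET) are an assumption based on the prefixes and extensions described above. Adjust them if your downloads are named differently.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): prints every expected
# input file that is missing under the output_path given as $1 and returns
# non-zero if anything is missing. Remaining arguments are network names.
# ASSUMPTION: DIMACS files are named USA-road-d.<NET>.gr.gz / .co.gz.
check_input_files() {
    local output_path=$1 missing=0 net file
    shift
    for net in "$@"; do
        for file in "data/dimacs/USA-road-d.$net.gr.gz" \
                    "data/dimacs/USA-road-d.$net.co.gz"; do
            if [ ! -f "$output_path/$file" ]; then
                echo "missing: $output_path/$file"
                missing=1
            fi
        done
    done
    # North America node and edge files, gzipped as required above.
    for file in data/tpq/NA.cnode.gz data/tpq/NA.cedge.gz; do
        if [ ! -f "$output_path/$file" ]; then
            echo "missing: $output_path/$file"
            missing=1
        fi
    done
    # Directory created by extracting real_world_pois.tar.gz.
    if [ ! -d "$output_path/real_world_pois" ]; then
        echo "missing: $output_path/real_world_pois/"
        missing=1
    fi
    return "$missing"
}
```

For example, `check_input_files /path/to/output DE VT ME NW COL CAL E W CTR USA` would list anything still to be downloaded, where /path/to/output is a placeholder for your output_path.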

Less Than 32GB RAM

Experiments can still be run with less than 32GB RAM, but for fewer datasets. To do this go to the globalVariables file and modify the road_networks array. The road networks are in size order, so simply remove the road networks from the end. E.g. to run experiments up to the Eastern US dataset change it to road_networks=("DE" "VT" "ME" "COL" "FLA" "CAL" "E").
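
Because the array is in size order, trimming it amounts to cutting it after the largest network you can afford. A minimal sketch of that idea, not part of the repository: the helper name keep_up_to is hypothetical, the array below is only a placeholder for the one actually defined in your copy of globalVariables, and mapfile requires bash 4 or later.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): prints the given
# network names, one per line, up to and including the name in $1.
keep_up_to() {
    local last=$1 net
    shift
    for net in "$@"; do
        echo "$net"
        if [ "$net" = "$last" ]; then
            break
        fi
    done
}

# ASSUMPTION: placeholder list in size order; use the array that is
# actually defined in your copy of globalVariables.
road_networks=("DE" "VT" "ME" "COL" "FLA" "CAL" "E" "W" "CTR" "USA")

# Keep everything up to and including the Eastern US dataset.
mapfile -t road_networks < <(keep_up_to "E" "${road_networks[@]}")
echo "road_networks=(${road_networks[*]})"
```

Editing the array by hand in globalVariables achieves the same result; the sketch only illustrates which entries remain.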

Note 1: None of the in-depth experiments on the USA dataset will be possible, of course. In this case it is best to comment out those experiments in the runPaperExperiments script.

Note 2: SILC (the index used by Distance Browsing) can only be built up to the default NW dataset. Building SILC on NW requires at least 20GB of memory; with less than this, all default experiments will be missing Distance Browsing comparisons. In this case we suggest changing the default road network to COL (for which SILC only requires 8GB) by setting default_network and default_parameters to COL.

Running Travel Distance Experiments

The following instructions can be used to re-create all figures from the paper. Assuming you are already in the scripts directory (otherwise cd into scripts):

Clean the DIMACS and TPQ datasets for errors and redundancy
```
bash transformInputData
```
Build the binary files for the basic graph representations
```
bash buildBinaryGraphs
```
Build all road network indexes (this may take a while... ~15 hours on our machine)
```
bash buildRoadNetworkIndexes
```
Generate query sets
```
bash generateQuerySets
```
Generate random object sets and build corresponding object indexes
```
bash buildObjectIndexes
```
Note: You can run resetExperimentalSetup again to remove all object indexes and query sets
Run all paper experiments and produce figures
```
bash runPaperExperiments
```
Note: The above command also creates figures, but if you wish to recreate figures without running the experiments again (which can take some time), you can use:
```
bash createPaperFigures
```

Note: All of the above commands may be batched in the shell; just enter them on one line, separated by semicolons (";")
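
One caveat with ";" is that a later step still runs even if an earlier one failed, which matters when the index build takes many hours. A minimal sketch of a stricter alternative, not part of the repository: the function name run_steps is hypothetical, while the script names in the usage comment are the ones from the steps above.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): runs each named script
# with bash, in order, from the current directory, and stops at the first
# one that exits with a non-zero status.
run_steps() {
    local step
    for step in "$@"; do
        echo "=== $step ==="
        if ! bash "$step"; then
            echo "step failed: $step" >&2
            return 1
        fi
    done
}

# Usage, from the scripts directory:
# run_steps transformInputData buildBinaryGraphs buildRoadNetworkIndexes \
#           generateQuerySets buildObjectIndexes runPaperExperiments
```

Joining the commands with "&&" instead of ";" has the same stop-on-failure effect for a one-off run. Either approach only helps if the scripts themselves exit with a non-zero status on failure, which is an assumption here.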

Running Travel Time Experiments

Travel time experiments must be reproduced independently, as they require different indexes. They can be reproduced using the following procedure:

Create a new directory somewhere to store data for the travel time experiments (e.g. figures, etc.)
Change the output_path variable in globalVariables to the full path of this new location
Change the edge_type variable to edge_type=t

Note: The edge_type variable must be changed back to be "d" to run travel distance experiments again
Run resetExperimentalSetup to create all necessary sub-directories
```
bash resetExperimentalSetup
```
Download the DIMACS travel time edge-weight graph files (with extension .gr.gz and prefixed by USA-road-t) and coordinate files (with extension .co.gz) for DE, VT, ME and NW from https://github.com/tenindra/RN-kNN-Exp-Data to the output_path/data/dimacs directory
Download the DIMACS travel time edge-weight graph files and coordinate files for COL, CAL, E, W, CTR, USA from http://www.dis.uniroma1.it/challenge9/ to the output_path/data/dimacs directory
Download the real_world_pois.tar.gz from https://github.com/tenindra/RN-kNN-Exp-Data to the output_path/ directory
Extract the real_world_pois.tar.gz archive (this should create an output_path/real_world_pois directory populated with subdirectories containing POI sets for several road networks)
Clean the DIMACS datasets for errors and redundancy for travel time edge weights
```
bash transformInputData
```
Build the binary files for the basic graph representations for travel time edge weights
```
bash buildBinaryGraphs
```

As in steps 3-5 of "Running Travel Distance Experiments" (building the road network indexes, generating query sets, and building object indexes), execute:

bash buildRoadNetworkIndexes
bash generateQuerySets
bash buildObjectIndexes

Run all travel time experiments and produce figures
```
bash runTravelTimeExperiments
```
Note: The above command also creates figures, but if you wish to recreate figures without running the experiments again (which can take some time), you can use:
```
bash createTravelTimeFigures
```
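
Since edge_type has to be switched to "t" for these experiments and back to "d" for travel distance, the change can also be scripted. A minimal sketch, not part of the repository: the function name set_edge_type is hypothetical, and it assumes the variable is assigned on a line of globalVariables that starts with edge_type= (quoted or unquoted).

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): rewrites the edge_type
# assignment in the file $1 to the value $2, which must be "d" or "t".
# ASSUMPTION: the assignment sits on a line starting with "edge_type=".
set_edge_type() {
    local file=$1 value=$2 tmp
    case "$value" in
        d|t) ;;
        *) echo "edge_type must be d or t" >&2; return 1 ;;
    esac
    tmp=$(mktemp) || return 1
    # Write via a temporary file, then copy back so the original file
    # keeps its permissions.
    sed "s/^edge_type=.*/edge_type=$value/" "$file" > "$tmp" \
        && cat "$tmp" > "$file"
    local status=$?
    rm -f "$tmp"
    return "$status"
}

# Usage, from the scripts directory:
# set_edge_type globalVariables t   # before travel time experiments
# set_edge_type globalVariables d   # back to travel distance
```

The output_path variable still has to be pointed at the matching data directory by hand, since each edge type uses its own set of indexes.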

Acknowledgements

Parts of open source projects associated with the following publications were used in our project:

Takuya Akiba, Yoichi Iwata, Ken-ichi Kawarabayashi, Yuki Kawata: Fast Shortest-path Distance Queries on Road Networks by Pruned Highway Labeling. ALENEX 2014.
Lingkun Wu, Xiaokui Xiao, Dingxiong Deng, Gao Cong, Andy Diwen Zhu, Shuigeng Zhou: Shortest Path and Distance Queries on Road Networks: An Experimental Evaluation. PVLDB 5(5): 406-417, 2012.

Licence

Road Network kNN Experimental Evaluation is free software; you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation; either version 3 of the License, or (at your option) any later version.

Road Network kNN Experimental Evaluation is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.

You should have received a copy of the GNU Affero General Public License along with Road Network kNN Experimental Evaluation; see LICENSE.txt; if not, see http://www.gnu.org/licenses/.
