Machine learning for research on climate change adaptation policy integration

This is the software repository accompanying the paper Machine learning for research on climate change adaptation policy integration: an explorative UK case study, by Robbert Biesbroek, Shashi Badloe, Ioannis N. Athanasiadis, published in Regional Environmental Change (2020).

This reposiroty contains all the software for reproducing the paper, and some sample documents. It can be executed online as a binder notebook, simply from your browser by clicking this button .

Due to its size, the full dataset has been made available separately via Zenodo, here. The software comes with a subset of the full dataset, and can be demonstrated without any additional changes.

Notes

Do not change the names of any folders or files before finishing the pipeline, as the scripts look in folders with these specific names.

Step 1:

In the folder 'PDF_files' the following PDF documents are contained.
'Adaptation policy documents' - Training data for adaptation policies
'Mitigation policy documents' - Training data for mitigation policies
'Non-climate policy documents' - Training data for non-climate documents
'Mixed policy documents' - Testing data, any PDF document(s) you want to predict on.

Step 2:
In the folder 'Python Scripts' every script in the pipeline is contained.

Main pipeline

pdf_parser.py - Extract raw text from PDf documents (parsed_files)
text_cleanup.py - Filters, cleansand structurizes data into 'bags-of-words' (structured_files)
sqlite_db.py - Builds database and inserts cleaned data (climate.db)
numberizer.py - Builds vocabulary from training data (conversion_dictionary.txt)
TF_classification_BW.py - Builds neural network and stores it (tensorflow/logdir)
TF_classification_predict.py - Uses stored model to predict on new data. Results stored in database.

Optional scripts

pipeline.py - Runs every script in the main pipeline in order.
web_scraper.py - Retrieves documents from gov.uk website (PDF_files\Scraped documents) -- OUTDATED
blocklength_dist.py - Plots distribution of block lengths (Plots)
document_prediction.py - Plots histogram of test set blocks and their labels. WARNING: Only use few documents
confidence_analysis.py - Visualization for fraction of high confidence blocks
tensorboard_launch.py - Launch tensorboard from stored model

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
PDF_files		PDF_files
Python Scripts		Python Scripts
parsed_files		parsed_files
structured_files		structured_files
README.md		README.md
apt.txt		apt.txt
environment.yml		environment.yml
index.ipynb		index.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PDF_files

PDF_files

Python Scripts

Python Scripts

parsed_files

parsed_files

structured_files

structured_files

README.md

README.md

apt.txt

apt.txt

environment.yml

environment.yml

index.ipynb

index.ipynb

Repository files navigation

Machine learning for research on climate change adaptation policy integration

Notes

Main pipeline

Optional scripts

About

Releases

Packages

Contributors 2

Languages

BigDataWUR/ML4ClimateAdaptationPolicy

Folders and files

Latest commit

History

Repository files navigation

Machine learning for research on climate change adaptation policy integration

Notes

Main pipeline

Optional scripts

About

Resources

Stars

Watchers

Forks

Languages