Skip to content

cabinetofcuriosity/knb_explore

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

85 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Basic Info

Author: Yuqing Lu

Database: KNB

Language: R

Contents:

Exploring KNB

This repo has the data, code and reports for my exploratary analysis on KNB, which is a website that aggregates ecology related datasets.

Accessing the data in KNB

In order to access the data in KNB programmatically, I downloaded their API(notebook1). Then following Ciera's suggestion, in notebook3, I was able to find the most popular headers in their database, under the help of the KNB staff. Playing with the headers, I decided to work on the datasets from PISCO and I need combine all the datasets first, which are around 200GB in total.

Understanding PISCO datasets

In notebook3, I downloaded one PISCO dataset's xml file and data frame. From the metadata file I understood the general information of that one PISCO dataset, including the purpose, location, organization, attribute definition, etc. Then I made a few plots of the attributes for that dataset to see how each attribute varies over time.

Merging species and PISCO data

In location.md and sckat.md, I aim to merge species and PISCO data using location and time in order for a relation between species and ocean temperature.

Releases

No releases published

Packages

No packages published