Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

scraped names mismatch URL #19

Open
jwiggi18 opened this issue Oct 21, 2018 · 0 comments
Open

scraped names mismatch URL #19

jwiggi18 opened this issue Oct 21, 2018 · 0 comments

Comments

@jwiggi18
Copy link

jwiggi18 commented Oct 21, 2018

@dimus

Using the following code to scrape names from Wikipedia inconsistencies indicated with *.

library(rphylotastic)
#Alaska
#scrape names from wikipedia
names_Alaska <- url_get_scientific_names(URL = "https://en.wikipedia.org/wiki/List_of_Alaska_state_symbols")

print(names_Alaska)
*#returns list()

#Check the scientific names
Alaska_resolved_otol <- taxa_resolve_names_with_otol(taxa = names_Alaska)

print(Alaska_resolved_otol)

Alaska_resolved_gnr <- taxa_resolve_names_with_gnr(taxa = names_Alaska)

print(Alaska_resolved_gnr)

#Arizona
#scrape names from wikipedia
names_Arizona <- url_get_scientific_names(URL = "https://en.wikipedia.org/wiki/List_of_Arizona_state_symbols")

print(names_Arizona)
*#name given in URL = "Hyla eximia" rphylotastic scrapes = "Hyla"
*# "Bassariscus astutus" "Bassariscus"
*# "Oncorhynchus gilae apache" "Oncorhynchus gilae"
*# "Crotalus willardi willardi" "Crotalus willardi"

#Check the scientific names
Arizona_resolved_otol <- taxa_resolve_names_with_otol(taxa = names_Arizona)

print(Arizona_resolved_otol)
*#removed Harpagornis incendei (the cacuts wren) and kept Peoria and Maricopa (county in *Arizona, also a genus of moth)

Arizona_resolved_gnr <- taxa_resolve_names_with_gnr(taxa = names_Arizona)

print(Arizona_resolved_gnr)
*#scraped list = "Harpagornis incendei" gnr = "Harpagornis"
*#kept Peoria and Maricopa

#Arkansas
#scrape names from wikipedia
names_Arkansas <- url_get_scientific_names(URL = "https://en.wikipedia.org/wiki/List_of_Arkansas_state_symbols")

print(names_Arkansas)
*#Populus = part of state motto
*#Ouachita = state region

#Check the scientific names
Arkansas_resolved_otol <- taxa_resolve_names_with_otol(taxa = names_Arkansas)

print(Arkansas_resolved_otol)
*#prints "Apis mellifera" Vitis aestivalis" "Pyrus malus" multiple times
*#keeps Populus
*#eliminates Ouachita

Arkansas_resolved_gnr <- taxa_resolve_names_with_gnr(taxa = names_Arkansas)

print(Arkansas_resolved_gnr)
*#keeps Populus
*#eliminates Ouachita

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant