Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clarification on the "standard concepts from Athena" #45

Open
smondet opened this issue Apr 5, 2018 · 4 comments
Open

Clarification on the "standard concepts from Athena" #45

smondet opened this issue Apr 5, 2018 · 4 comments

Comments

@smondet
Copy link

smondet commented Apr 5, 2018

The doc says:

The standard concepts from Athena have been downloaded and are available somewhere (including running the extra script to download CPT code definitions)

what is meant by "standard"? the pre-selected ones on the webapp? or is there a list somewhere?

@alistairewj
Copy link
Member

Yeah not very well documented apologies. I am pretty sure the default concepts which are checked for download on the website were the only ones used. I successfully ran the ETL with the following files in my vocab folder:

-rw-rw-r-- 1 alistairewj alistairewj  475 Jun 15  2016 readme.txt
drwxrwxr-x 2 alistairewj alistairewj 4.0K Jun 15  2016 lib
-rw-rw-r-- 1 alistairewj alistairewj   32 Jun 15  2016 cpt.sh
-rw-rw-r-- 1 alistairewj alistairewj   31 Jun 15  2016 cpt.bat
-rw-rw-r-- 1 alistairewj alistairewj 7.2M Apr  7  2017 cpt4.jar
-rw-rw-r-- 1 alistairewj alistairewj 1.3G Sep 14  2017 vocab_download_v5_{9DBE59FA-D92B-0BB8-77A8-198AB3FE4736}.zip
-rw-r--r-- 1 alistairewj alistairewj 118M Sep 14  2017 DRUG_STRENGTH.csv
-rw-r--r-- 1 alistairewj alistairewj 896K Sep 14  2017 CONCEPT_CPT4.csv
-rw-r--r-- 1 alistairewj alistairewj 1.3G Sep 14  2017 CONCEPT_RELATIONSHIP.csv
-rw-r--r-- 1 alistairewj alistairewj 2.3G Sep 14  2017 CONCEPT_ANCESTOR.csv
-rw-r--r-- 1 alistairewj alistairewj 4.8K Sep 14  2017 VOCABULARY.csv
-rw-r--r-- 1 alistairewj alistairewj  30K Sep 14  2017 RELATIONSHIP.csv
-rw-r--r-- 1 alistairewj alistairewj 1.2K Sep 14  2017 DOMAIN.csv
-rw-r--r-- 1 alistairewj alistairewj 373M Sep 14  2017 CONCEPT_SYNONYM.csv
-rw-r--r-- 1 alistairewj alistairewj  13K Sep 14  2017 CONCEPT_CLASS.csv
-rw-r--r-- 1 alistairewj alistairewj 493M Feb 16 09:35 CONCEPT.csv

If you can figure this out please do let us know it would save us a bit of effort in re-downloading and testing.

@parisni
Copy link
Contributor

parisni commented Apr 5, 2018 via email

@parisni
Copy link
Contributor

parisni commented Nov 19, 2021 via email

@stevenbedrick
Copy link

Yeah, I figured it out the second after I added the comment! 🤣 In the Athena vocabulary distribution directory there's a JAR file and shell script that pull down CPT4 from UMLS. My confusion was due to the wording in omop/build-omop/postgresql/README.md, which I found a bit unclear as to where to find the Java file that it mentions- whether it is part of the MIMIC/OMOP EETL codebase, or part of the Athena distribution. The answer, of course, is that it's part of the Athena distribution.

I'd be happy to take a stab at clarifying that part of the instructions, if a PR would be welcome!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants