Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Target word token pattern annotator #135

Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
31 commits
Select commit Hold shift + click to select a range
2bf6f53
added initial changes for using expander functionality for multitoken…
LydiaMennesHealth Jan 10, 2024
aac856b
improved naming of base config, separated processor loader code, fina…
LydiaMennesHealth Jan 11, 2024
0c339df
small cleanup of comments
LydiaMennesHealth Jan 18, 2024
7868d23
formatting
LydiaMennesHealth Jan 18, 2024
3653eee
formatting issues
LydiaMennesHealth Jan 18, 2024
95feeb2
flake finally happy
LydiaMennesHealth Jan 18, 2024
03c706d
docformatter
LydiaMennesHealth Jan 18, 2024
6df3507
made recall booster args function static
LydiaMennesHealth Jan 18, 2024
097ebcc
decided on config structure for two element dates
LydiaMennesHealth Jan 18, 2024
a0227fe
undid changes on this branch
LydiaMennesHealth Jan 18, 2024
56bbd2f
added the recall booster principle to the readme and complete overvie…
LydiaMennesHealth Jan 18, 2024
ae8c6b5
structure for implementing two element dates in config
LydiaMennesHealth Jan 18, 2024
42d0b0a
changed unneccessary usage of double slashes in regexes for readabili…
LydiaMennesHealth Jan 18, 2024
8ffff50
added first adapted regex and test implementation for two element dates
LydiaMennesHealth Jan 18, 2024
7de9985
added first replacement pattern in date_dmy_1 including testing, adde…
LydiaMennesHealth Jan 18, 2024
c513aa2
added recall booster functionality for all dates
LydiaMennesHealth Jan 19, 2024
809f8d0
added description in tutorial
LydiaMennesHealth Jan 19, 2024
d8017a9
formatting fixed
LydiaMennesHealth Jan 19, 2024
c11b1c9
final formatting
LydiaMennesHealth Jan 19, 2024
a2a6240
found issue with month only mentions during validation, fixed with wo…
LydiaMennesHealth Jan 24, 2024
937153c
functioning token pattern annotator with target word
LydiaMennesHealth Feb 2, 2024
45c6a15
small change in docstring
LydiaMennesHealth Feb 2, 2024
ac19886
expanded list
LydiaMennesHealth Feb 5, 2024
81629b9
ignore vscode stuff
LydiaMennesHealth Feb 14, 2024
43aba48
fixed merge conflicts
LydiaMennesHealth Feb 14, 2024
0cb1fac
last merge conflict
LydiaMennesHealth Feb 14, 2024
7e93d1b
typo
LydiaMennesHealth Feb 14, 2024
5f9abdd
small expansion pat env list
LydiaMennesHealth Feb 14, 2024
696d4ce
removed failing test that goes on other branch
LydiaMennesHealth Feb 14, 2024
4ffafb6
not using | for backward compatibility
LydiaMennesHealth Feb 14, 2024
f0b050a
small expansion patient environment
LydiaMennesHealth Feb 14, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Jump to
Jump to file
Failed to load files.
Diff view
Diff view
5 changes: 4 additions & 1 deletion .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -124,4 +124,7 @@ ENV/
# mypy
.mypy_cache/

.idea
.idea

#vscode
.vscode/
63 changes: 63 additions & 0 deletions base_config.json
Original file line number Diff line number Diff line change
Expand Up @@ -186,6 +186,69 @@
"tag": "_"
}
},
"patient_environment1": {
"annotator_type": "deduce.annotator.TargetWordTokenPatternAnnotator",
"group": "names",
"args": {
"tag": "naam",
"skip": ["(", ")", "zijn", "haar"],
"target_words_lookup": "patient_environment",
"annotate_pattern_indices": [1],
"pattern": [
{
"lookup": "patient_environment"
},
{
"or": [
{
"like_name": true
},
{
"title_case_lookup": "first_name"
},
{
"title_case_lookup": "interfix_surname"
},
{
"title_case_lookup": "surname"
}
]
}

]
}
},
"patient_environment2": {
"annotator_type": "deduce.annotator.TargetWordTokenPatternAnnotator",
"group": "names",
"args": {
"tag": "naam",
"skip": ["(", ")", "zijn", "haar", "hun"],
"target_words_lookup": "patient_environment",
"annotate_pattern_indices": [0],
"pattern": [
{
"or": [
{
"like_name": true
},
{
"title_case_lookup": "first_name"
},
{
"title_case_lookup": "interfix_surname"
},
{
"title_case_lookup": "surname"
}
]
},
{
"lookup": "patient_environment"
}
]
}
},
"name_context": {
"annotator_type": "deduce.annotator.ContextAnnotator",
"group": "names",
Expand Down