Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove debug data normalization for span analysis #13203

Merged
merged 3 commits into from
Feb 6, 2024

Conversation

adrianeboyd
Copy link
Contributor

Description

As a result of this normalization, debug data could show a user tokens that do not exist in their data.

Types of change

?

Checklist

  • I confirm that I have the right to submit this contribution under the project's MIT license.
  • I ran the tests, and all new and existing tests passed.
  • My changes don't require a change to the documentation, or if they do, I've added all required information.

As a result of this normalization, `debug data` could show a user tokens
that do not exist in their data.
@adrianeboyd adrianeboyd added the feat / cli Feature: Command-line interface label Dec 19, 2023
@adrianeboyd
Copy link
Contributor Author

I don't really like it, but I don't know whether the lowercasing is something that's typically used in related work. We can at least remove the confusing quote normalization. Not that it really matters that much either way...

spacy/cli/debug_data.py Outdated Show resolved Hide resolved
@svlandeg svlandeg deleted the branch explosion:main January 29, 2024 09:15
@svlandeg svlandeg closed this Jan 29, 2024
@svlandeg svlandeg reopened this Jan 29, 2024
@svlandeg svlandeg changed the base branch from v4 to v5 January 29, 2024 09:34
@svlandeg svlandeg changed the base branch from v5 to main January 29, 2024 09:35
@svlandeg
Copy link
Member

Yea I think it makes sense to remove the quote normalization.

@svlandeg svlandeg merged commit afb22ad into explosion:main Feb 6, 2024
8 checks passed
jordankanter pushed a commit to jordankanter/spaCy that referenced this pull request Mar 14, 2024
* Remove debug data normalization for span analysis

As a result of this normalization, `debug data` could show a user tokens
that do not exist in their data.

* Update spacy/cli/debug_data.py

---------

Co-authored-by: svlandeg <svlandeg@github.com>
jordankanter pushed a commit to jordankanter/spaCy that referenced this pull request Mar 14, 2024
* Remove debug data normalization for span analysis

As a result of this normalization, `debug data` could show a user tokens
that do not exist in their data.

* Update spacy/cli/debug_data.py

---------

Co-authored-by: svlandeg <svlandeg@github.com>
jordankanter pushed a commit to jordankanter/spaCy that referenced this pull request Mar 29, 2024
* Remove debug data normalization for span analysis

As a result of this normalization, `debug data` could show a user tokens
that do not exist in their data.

* Update spacy/cli/debug_data.py

---------

Co-authored-by: svlandeg <svlandeg@github.com>
jordankanter pushed a commit to jordankanter/spaCy that referenced this pull request Apr 17, 2024
* Remove debug data normalization for span analysis

As a result of this normalization, `debug data` could show a user tokens
that do not exist in their data.

* Update spacy/cli/debug_data.py

---------

Co-authored-by: svlandeg <svlandeg@github.com>
jordankanter pushed a commit to jordankanter/spaCy that referenced this pull request May 10, 2024
* Remove debug data normalization for span analysis

As a result of this normalization, `debug data` could show a user tokens
that do not exist in their data.

* Update spacy/cli/debug_data.py

---------

Co-authored-by: svlandeg <svlandeg@github.com>
jordankanter pushed a commit to jordankanter/spaCy that referenced this pull request May 10, 2024
* Remove debug data normalization for span analysis

As a result of this normalization, `debug data` could show a user tokens
that do not exist in their data.

* Update spacy/cli/debug_data.py

---------

Co-authored-by: svlandeg <svlandeg@github.com>
jordankanter pushed a commit to jordankanter/spaCy that referenced this pull request May 21, 2024
* Remove debug data normalization for span analysis

As a result of this normalization, `debug data` could show a user tokens
that do not exist in their data.

* Update spacy/cli/debug_data.py

---------

Co-authored-by: svlandeg <svlandeg@github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feat / cli Feature: Command-line interface
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants