RFE: move away from deprecated html5lib #57932

Open
1 of 3 tasks
kloczek opened this issue Mar 20, 2024 · 2 comments
Labels
Dependencies Required and optional dependencies Deprecate Functionality to remove in pandas IO HTML read_html, to_html, Styler.apply, Styler.applymap Needs Info Clarification about behavior needed to assess issue

Comments

@kloczek

kloczek commented Mar 20, 2024

Feature Type

  • Adding new functionality to pandas

  • Changing existing functionality in pandas

  • Removing existing functionality in pandas

Problem Description

Is your feature request related to a problem? Please describe.
It would be nice to cut the tail of some legacy module dependencies.
One of those modules is html5lib.

pandas only optionally depends on html5lib, but I think that at the moment even that optional dependency can be removed, as was done about 2 years ago in pip (pypa/pip#11259).

Feature Description

Cut the tail of some legacy module dependencies.

Alternative Solutions

N/A

Additional Context

No response

@kloczek kloczek added Enhancement Needs Triage Issue that has not been reviewed by a pandas team member labels Mar 20, 2024
@lithomas1
Member

Can you elaborate more?

Has Python's stdlib html.parser caught up to html5lib?

Looking at bs4's docs, it looks like html5lib is more lenient than the stdlib/lxml (which might be one reason people want to use it)
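To make the leniency question concrete, here is a minimal sketch using only the stdlib `html.parser`. The `EventRecorder` class and `tag_events` helper are hypothetical names introduced for illustration. It records the raw tag events for a small piece of invalid markup, so you can see that the stdlib tokenizer reports a stray closing tag as-is rather than repairing the document the way an HTML5-conformant parser would.

```python
from html.parser import HTMLParser


class EventRecorder(HTMLParser):
    """Hypothetical helper: collect the start/end tag events html.parser emits."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))


def tag_events(markup):
    """Feed markup to the stdlib parser and return the recorded tag events."""
    parser = EventRecorder()
    parser.feed(markup)
    parser.close()
    return parser.events


# Invalid markup: <a> is never closed and </p> has no matching <p>.
# The events come back exactly as written in the input; nothing is repaired.
print(tag_events("<a></p>"))
```

Any tree repair on top of these events is left to the caller (for example bs4's tree builder), which is where parser choice can change the result for malformed tables.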

@lithomas1 lithomas1 added IO HTML read_html, to_html, Styler.apply, Styler.applymap Deprecate Functionality to remove in pandas Needs Info Clarification about behavior needed to assess issue Dependencies Required and optional dependencies and removed Enhancement Needs Triage Issue that has not been reviewed by a pandas team member labels Mar 20, 2024
@GAuravY19

Can you clarify what is supposed to be done in this issue?
