Issues: adbar/trafilatura
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Extract text from buttons for semantic elements
question
Further information is requested
#573
opened Apr 23, 2024 by
zirkelc
Question: check if page is readable?
question
Further information is requested
#572
opened Apr 23, 2024 by
zirkelc
No timeout for some URLs when using focused_crawler
enhancement
New feature or request
#566
opened Apr 19, 2024 by
JER-CE
Use New feature or request
with_metadata
parameter to decide whether to run metadata extraction
enhancement
Why lzma for data compression?
question
Further information is requested
#559
opened Apr 15, 2024 by
Yomguithereal
Scraping websites which are protected by WAF
question
Further information is requested
#558
opened Apr 15, 2024 by
thebigbone
Preserve horizontal space in code blocks
enhancement
New feature or request
#553
opened Apr 9, 2024 by
mittsommer
Make cascade of different content extractors explicit and configurable
enhancement
New feature or request
#538
opened Apr 3, 2024 by
adbar
Downloads: Add ZStandard as optional Accept-Encoding header
enhancement
New feature or request
#537
opened Apr 3, 2024 by
adbar
List element inside a table is lost
bug
Something isn't working
#531
opened Mar 29, 2024 by
mikhainin
Link proportion heuristic fails for link paragraph
bug
Something isn't working
#529
opened Mar 27, 2024 by
adbar
Include links and Include formatting do not work together properly
bug
Something isn't working
#511
opened Feb 21, 2024 by
ibestvina
OVERALL_DISCARD_XPATH not discarding in some cases
question
Further information is requested
#510
opened Feb 19, 2024 by
felipehertzer
include_links option mixes texts and links
bug
Something isn't working
#476
opened Jan 12, 2024 by
hugoobauer
Add support for Netscape cookies file format
enhancement
New feature or request
#473
opened Jan 11, 2024 by
adbar
Configure pre-commit for this repository and update documentation
documentation
Docs in need of update or extension
up for grabs
Good for (first) contributors
#466
opened Jan 2, 2024 by
adbar
Here is an interesting example... any tips?
question
Further information is requested
#459
opened Dec 19, 2023 by
krstp
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.