Issues: adbar/trafilatura
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
focused_crawl returns nothing
feedback
Feedback from users requested
#589
opened May 7, 2024 by
bezir
<main> Content gets missed out
feedback
Feedback from users requested
#588
opened May 6, 2024 by
alroythalus
Extracting content from an URl is getting none
question
Further information is requested
#586
opened May 5, 2024 by
Fabiha15
Wrong links position in text from telegram post
question
Further information is requested
#585
opened May 4, 2024 by
RedHotUnicorn
Removing related links at end of article/sidebar on news websites?
bug
Something isn't working
#584
opened May 3, 2024 by
rahulbot
Update XML-TEI reference data
maintenance
Software compability and continuity
#577
opened Apr 29, 2024 by
adbar
Extract text from buttons for semantic elements
question
Further information is requested
#573
opened Apr 23, 2024 by
zirkelc
Question: check if page is readable?
question
Further information is requested
#572
opened Apr 23, 2024 by
zirkelc
Use New feature or request
with_metadata
parameter to decide whether to run metadata extraction
enhancement
Why lzma for data compression?
question
Further information is requested
#559
opened Apr 15, 2024 by
Yomguithereal
Preserve horizontal space in code blocks
enhancement
New feature or request
#553
opened Apr 9, 2024 by
mittsommer
Make cascade of different content extractors explicit and configurable
enhancement
New feature or request
#538
opened Apr 3, 2024 by
adbar
Downloads: Add ZStandard as optional Accept-Encoding header
enhancement
New feature or request
#537
opened Apr 3, 2024 by
adbar
List element inside a table is lost
bug
Something isn't working
#531
opened Mar 29, 2024 by
mikhainin
Link proportion heuristic fails for link paragraph
bug
Something isn't working
#529
opened Mar 27, 2024 by
adbar
Include links and Include formatting do not work together properly
bug
Something isn't working
#511
opened Feb 21, 2024 by
ibestvina
OVERALL_DISCARD_XPATH not discarding in some cases
question
Further information is requested
#510
opened Feb 19, 2024 by
felipehertzer
include_links option mixes texts and links
bug
Something isn't working
#476
opened Jan 12, 2024 by
hugoobauer
Add support for Netscape cookies file format
enhancement
New feature or request
#473
opened Jan 11, 2024 by
adbar
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.