html2text

Star

Here are 27 public repositories matching this topic...

puhoy / readability_cli

Star

a cli tool to fetch webpages main content and print it as markdown

markdown html-to-markdown python3 readability html2text readability-lxml readability-cli fetch-webpages

Updated Oct 31, 2020
Python

LukaszNiewinski / Microservice-for-retrieving-img-and-text

Star

Microservice for text and images collection for data science purposes.

python api docker flask service docker-compose scrapy html2text

Updated Nov 22, 2022
Python

C'est un projet de web scraping qui utilise Streamlit, BeautifulSoup, et html2text pour extraire, convertir en Markdown, et afficher le contenu de toutes les pages liées à une URL donnée. Il fournit un sommaire interactif des URL visitées et permet d'afficher le contenu extrait dans un format facile à lire.

markdown open-source interactive python3 web-application web-scraping data-extraction html2text beautifulsoup4 streamlit

Updated May 23, 2023
Python

hcq0618 / html-files-to-markdown-files

Star

batch convert html files to mardown files

html html2text mardown

Updated May 17, 2019
Python

pH-7 / Html2Text

Sponsor

Star

A very simple (but efficient) "HTML to plain text" converter ✍️

php converter php7 text plain-text html2text convertor text-converter email-text-parsing htmltotext symfony-mailer text-convertor

Updated Jun 11, 2023
PHP

gereoffy / deepspam2

Star

DeepSpam milter v2

nlp email-parsing spam-filtering html2text spam-detection neural

Updated Feb 17, 2024
Python

masroore / php-html2text

Star

A PHP package to convert HTML into a plain text format

html html-parser html2text

Updated Jun 13, 2022
PHP

zacanger / html2txt

Star

html2text but in node

html markdown cli node html2text

Updated Sep 24, 2023
JavaScript

cycloidio / docker-image-html2text

Star

Dockerized html2text command-line tool

docker tool html2text

Updated Mar 18, 2019
Makefile

cycloidio / docker-image-python-html2text

Star

Dockerized Python html2text command-line tool

html docker tool text html2text

Updated Mar 15, 2019
Makefile

BrenoFariasdaSilva / Python

Star

My Python Codes.

python adb python3 pip shellscript html2text pip3 dagster pydriller ppadb

Updated May 3, 2024
Python

sophiaken / Web-Scraping-Project-Python

Star

Scraped Web using an automated python script that acted as scrapper to extract content from Wikipedia pages and created a clean dataset from it.

pandas-dataframe python3 html2text beautifulsoup4 scrapper-script

Updated Jun 19, 2020
Python

afeiship / next-html2text

Star

Strip html to text for next.

html text strip html2text

Updated Mar 5, 2021
JavaScript

importcjj / go-readability

Star

Go package that cleans a HTML page for better readability.

go html golang text extractor text-extraction readability html2text html-extractor

Updated Aug 1, 2023
HTML

AbdellatifCHE / Collect_Store_Search

Star

The goal is to create a solution that crawls for articles from a news website (Theguardian), cleanses the response, stores it in a hosted mongo database (MongoDB Atlas), then makes it available to search via an API.

python mongodb pymongo nltk scrapy html2text lemmatization