Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
-
Updated
May 24, 2024 - Python
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Parse markdown article, download images and replace images URL's with local paths
Extract article or news by url or html, parse the title and content, output in markdown format.
Modern OpenAI GPT-4 Article Summarizer
To extract main article from given URL with Node.js
SmartReader is a library to extract the main content of a web page, based on a port of the Readability library by Mozilla
디시인사이드 Client-Side 글 검색기 입니다.
Article title, authors, date and body extraction dataset.
A Google Docs HTML Cleaner: This program transforms messy HTML from Google Docs into clean code primarily using LXML with a modular mixin design pattern.
Sumz is your go-to solution for effortlessly transforming lengthy articles into concise and clear summaries. With Sumz, you can say goodbye to information overload and hello to quick, digestible insights. Whether you're a student looking to condense research material or a professional staying up-to-date with the latest news, Sumz has you covered.
The main goal of an AI-Powered News Summarizer is to assist users in quickly understanding the main points and essential information from a large volume of news articles or textual content. By automatically summarizing news articles, it saves time and effort by providing users with a brief overview without having to read the entire article.
This is a small and easy-to-use desktop application that allows exporting Web of Science API Expanded and InCites API data in Excel/CSV/JSON/XML with a configurable and flexible data export structure.
displays article information, if scopus eid is known
Simplify your reading with Summarizer, an open-source article summarizer that transforms lengthy articles into clear and concise summaries.
Readability / Html Content / Article Extractor & Web Scrapping library written in PHP
displays article information if trdizin article number is known
Build and deployed my Own GPT AI Website with React Vite.js and Turn it Into a SaaS Business ($$$)
This automation, which provides automatic Font, Size, Line Spacing, Page Margins, Paragraph Indents info and Citation Controls, has been developed using the "DocumentFormat.OpenXml" library.
Arachnio client library for Java 11+
Add a description, image, and links to the article-extractor topic page so that developers can more easily learn about it.
To associate your repository with the article-extractor topic, visit your repo's landing page and select "manage topics."