eliias/webtext

webtext

Uniform web text processing

The webtext library offers a uniform interface for processing text from online sources. It lets developers abstract text from a source such as a website, so that it can be easily pre-processed, analysed, filtered, and transformed. A common use case is pre-processing and filtering text for NLP pipelines and LLMs.
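
To make "abstracting text from a website" concrete, here is a minimal sketch of turning HTML into plain text using only the Python standard library. It is not part of webtext and does not show how webtext works internally; `VisibleTextExtractor` and `html_to_text` are hypothetical names introduced for illustration, and treating every text node as its own paragraph is a simplifying assumption.

```python
from html.parser import HTMLParser


class VisibleTextExtractor(HTMLParser):
    """Hypothetical helper: collects text nodes, skipping <script> and <style>."""

    SKIPPED = {"script", "style"}

    def __init__(self) -> None:
        super().__init__()
        self.chunks = []       # visible text fragments, in document order
        self._skip_depth = 0   # > 0 while inside a skipped element

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside skipped elements.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def html_to_text(html: str) -> str:
    """Return the visible text of an HTML string, one text node per paragraph.

    Assumption: each text node is its own paragraph, so inline tags such as
    <b> or <a> would split a paragraph. Real pages need more careful handling.
    """
    parser = VisibleTextExtractor()
    parser.feed(html)
    parser.close()
    return "\n\n".join(parser.chunks)


page = (
    "<html><head><style>p{}</style></head>"
    "<body><p>This is some text.</p><p>Another paragraph.</p></body></html>"
)
print(html_to_text(page))
```

The resulting string uses blank lines between paragraphs, which is the same shape as the input string in the Parts example.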

Usage

pip install webtext

Parts

from webtext import WebText

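# Wrap a raw string; the blank line separates two paragraphs.
text = WebText("This is some text.\n\nAnother paragraph.")
text.paragraphs()  # the text split into paragraphs
text.sentences()   # the text split into sentences
text.words()       # the text split into words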
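
The README does not say how the three methods split text or what they return. As a rough illustration of the idea only, the following sketch uses a hypothetical stand-in class, `SketchText`, built on naive regular expressions. The splitting rules and the list-of-strings return type are assumptions, not the webtext implementation.

```python
import re
from typing import List


class SketchText:
    """Hypothetical stand-in for illustration; not the webtext implementation."""

    def __init__(self, raw: str) -> None:
        self.raw = raw

    def paragraphs(self) -> List[str]:
        # Assumption: paragraphs are separated by one or more blank lines.
        parts = re.split(r"\n\s*\n", self.raw)
        return [p.strip() for p in parts if p.strip()]

    def sentences(self) -> List[str]:
        # Naive rule: a sentence ends at '.', '!' or '?' followed by whitespace.
        # Abbreviations such as "e.g." would be split incorrectly.
        found: List[str] = []
        for paragraph in self.paragraphs():
            found.extend(s for s in re.split(r"(?<=[.!?])\s+", paragraph) if s)
        return found

    def words(self) -> List[str]:
        # Words are runs of word characters (letters, digits, underscore)
        # or apostrophes; punctuation is dropped.
        return re.findall(r"[\w']+", self.raw)


text = SketchText("This is some text.\n\nAnother paragraph.")
print(text.paragraphs())
print(text.sentences())
print(text.words())
```

Sentence splitting is the weakest part of this sketch; a production tokenizer would need to handle abbreviations, quotes, and language-specific rules.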

Contributing

Tests

python -m pytest

Release

poetry build
poetry publish
