Skip to content

Cybercubist/Delphine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Delphine

The task: Create a small lightweighted platform for fast 1-page texts analysis. Texts are going to be on financial topic. It should be handy for everyday use.

The solution: We've created a simple platform based on spaCy library / API. There are few features:

1. Input module. Allows you copy and paste text into platform.
2. Data extraction module. Returns following data as dict:
    a) Noun counting. Extracts all nouns from text and rank them by their frequency.
    b) Entities exctraction. Finds all the enteties in text (dates, organizations, money, percents).
    c) Chunks. Finds all word chunks.
3. Visualization module. Returns entered text with some spaCy-based visualization (with highlighted entities).
4. Text analysis module. Allows you to find sentences in text cointaining information you are interested in.

All the modules are made in MVP style, and may be significantly improved later. We've created a clear and simple architecture for future feature improvement.

Current version: Delphine 1.0.0. Delphine has it's own version number since 27.11.2023.

Updates:

17.11.2023:

  1. Added "find sentence" function. Now you can find a sentence which contains words, entities, numbers or chunks you are interested in.
  2. Small architecture improvements. Data extraction part and visualization part are now in separated cells. Lists (for example trash token list) are now in a separated file.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published