Testing and benchmarking some of the existing NLP libraries in Apache Spark
-
Updated
Jan 11, 2019 - Scala
Testing and benchmarking some of the existing NLP libraries in Apache Spark
Tutorial for Topic Modelling using PySpark and Spark NLP
Text summarization algorithms using PySpark
Models and Pipelines for the Spark NLP library
Final project of "Big Data Analytics and Business Intelligence" course.
An implementation of NLP Sandbox PHI Annotator API based on Spark NLP
Compilation of NLP notebooks from various sources that address several technical challenges.
SparkNLP and Healthcare SparkNLP based analysis of scientific literature on equine colic.
Project that captures information about all Dark Souls 3 (DS3) weapons and performs textual analysis on.
This project is a Spark ML pipeline using Pyspark for NLP, using annotators: DocumentAssembler, Tokenizer, WordEmbeddingsModel, PerceptronModel & NerCrfModel. It prints a transformed DataFrame showing POS & NER columns, and analyzes any relationship between found entities & their POS attributes. Hands-on experience with Spark, Pyspark & Spark-NLP.
NLP functions with John Snow's Spark NLP in the Java language
Instructions and code for the workshop "From Big Data to NLP Insights: Unlocking the Power of PySpark and Spark NLP"
contains notebooks on topic modeling, spark and pandas implementation
Add a description, image, and links to the spark-nlp topic page so that developers can more easily learn about it.
To associate your repository with the spark-nlp topic, visit your repo's landing page and select "manage topics."