Skip to content

gcdunn/ntc_analytics_2020

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

This project was created for the 2020 Nashville Analytics Summit hosted by the Nashville Technology Council.

Abstract: In natural language processing, topic models are used to extract meaningful and human-interpretable topics from a corpus. However, tuning topic models for large corpora can be time consuming and computationally expensive. By monitoring topic coherence as a function of corpus size, we can determine how to efficiently create a high quality topic model. In this talk, we will demonstrate this technique using the English Wikipedia corpus.

I used the following papers, packages, and tutorials in this work:

About

NTC Analytics Summit 2020

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published