Topic Modelling Workshop for Beginners

Latent Dirichlet Allocation (LDA) is a type of topic modelling.
(Not original work; the content is collected from various sources, as described at the bottom of this page.)

  • LDA assumes that each document in a corpus is a mixture of a fixed number of topics.
  • Each topic is a probability distribution over words, where the vocabulary is all the observed words in the corpus.
  • These ‘hidden’ topics are then surfaced based on the likelihood of word co-occurrence.
LDA Model Visualization
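
The assumptions in the list above can be illustrated by running LDA's generative story forwards. The following is a minimal sketch using only the Python standard library, not a fitted model: the vocabulary, the two topics ("sports" and "politics"), their word probabilities, and the helper names `sample_dirichlet` and `generate_document` are all hypothetical and chosen purely for illustration.

```python
import random


def sample_dirichlet(alpha, rng):
    """Draw one sample from a Dirichlet distribution via normalised Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]


def generate_document(topic_word_probs, vocab, alpha, n_words, rng):
    """Generate one document by following LDA's generative assumptions."""
    # Each document gets its own mixture over the fixed set of topics.
    theta = sample_dirichlet(alpha, rng)
    assignments, words = [], []
    for _ in range(n_words):
        # Pick a hidden topic for this word position from the document mixture.
        z = rng.choices(range(len(theta)), weights=theta)[0]
        # Pick the observed word from that topic's distribution over the vocabulary.
        w = rng.choices(vocab, weights=topic_word_probs[z])[0]
        assignments.append(z)
        words.append(w)
    return theta, assignments, words


# Hypothetical toy vocabulary and topics (illustrative values only).
vocab = ["goal", "match", "team", "vote", "party", "election"]
topic_word_probs = [
    [0.4, 0.3, 0.3, 0.0, 0.0, 0.0],  # topic 0: "sports"
    [0.0, 0.0, 0.0, 0.3, 0.3, 0.4],  # topic 1: "politics"
]

rng = random.Random(0)  # fixed seed so the run is repeatable
theta, assignments, words = generate_document(
    topic_word_probs, vocab, alpha=[0.5, 0.5], n_words=8, rng=rng
)

print("topic mixture:", [round(p, 3) for p in theta])
print("hidden topics:", assignments)
print("observed words:", words)
```

Fitting LDA works in the opposite direction: only the words are observed, and the topic mixtures and topic-word distributions are inferred from which words tend to co-occur across documents.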

Datasets

Articles

GitHub Repos