Class: Treat::Workers::Extractors::TopicWords::LDA

Inherits:

Object

Object
Treat::Workers::Extractors::TopicWords::LDA

show all

Defined in:: lib/treat/workers/extractors/topic_words/lda.rb

Overview

Topic word retrieval using a thin wrapper over a C implementation of Latent Dirichlet Allocation (LDA), a statistical model that posits each document is a mixture of a small number of topics and that each word’s creation is attributable to one of the document’s topics.

Original paper: Blei, David, Ng, Andrew, and Jordan, Michael. 2003. Latent dirichlet allocation. Journal of Machine Learning Research. 3 (Mar. 2003), 993-1022.

Constant Summary collapse

DefaultOptions = Default options for the LDA algorithm.

{
  :num_topics => 20,
  :words_per_topic => 10,
  :iterations => 20,
  :vocabulary => nil
}

Class Method Summary collapse

.topic_words(collection, options = {}) ⇒ Object

Retrieve the topic words of a collection.

Class Method Details

.topic_words(collection, options = {}) ⇒ `Object`

Retrieve the topic words of a collection.