Module: Company

Defined in:
lib/company/mapping/document_utils/basic_tokenizer.rb,
lib/company/mapping.rb,
lib/company/mapping/version.rb,
lib/company/mapping/tfidf/tfidf.rb,
lib/company/mapping/company_mapper.rb,
lib/company/mapping/document_utils/corpus.rb,
lib/company/mapping/tfidf/tf/term_frequency.rb,
lib/company/mapping/document_utils/text_document.rb,
lib/company/mapping/document_utils/company_corpus.rb,
lib/company/mapping/tfidf/tf/normalized_term_frequency.rb,
lib/company/mapping/vector_similarity/cosine_similarity.rb,
lib/company/mapping/tfidf/idf/inverse_document_frequency.rb

Overview

BasicTokenizer breaks given strings to a set of tokens. As tokens are regarded the words and the sequences of the numbers the string contains.

Defined Under Namespace

Modules: Mapping