Module: Company
- Defined in:
- lib/company/mapping/document_utils/basic_tokenizer.rb,
lib/company/mapping.rb,
lib/company/mapping/version.rb,
lib/company/mapping/tfidf/tfidf.rb,
lib/company/mapping/company_mapper.rb,
lib/company/mapping/document_utils/corpus.rb,
lib/company/mapping/tfidf/tf/term_frequency.rb,
lib/company/mapping/document_utils/text_document.rb,
lib/company/mapping/document_utils/company_corpus.rb,
lib/company/mapping/tfidf/tf/normalized_term_frequency.rb,
lib/company/mapping/vector_similarity/cosine_similarity.rb,
lib/company/mapping/tfidf/idf/inverse_document_frequency.rb
Overview
BasicTokenizer breaks given strings to a set of tokens. As tokens are regarded the words and the sequences of the numbers the string contains.
Defined Under Namespace
Modules: Mapping