Class: PragmaticSegmenter::Languages::Common::Cleaner

Inherits:
Cleaner
  • Object
show all
Defined in:
lib/pragmatic_segmenter/languages/common.rb

Constant Summary

Constants inherited from Cleaner

Cleaner::ConsecutiveForwardSlashRule, Cleaner::ConsecutivePeriodsRule, Cleaner::DoubleNewLineRule, Cleaner::DoubleNewLineWithSpaceRule, Cleaner::EscapedCarriageReturnRule, Cleaner::EscapedNewLineRule, Cleaner::InlineFormattingRule, Cleaner::NEWLINE_IN_MIDDLE_OF_SENTENCE_REGEX, Cleaner::NO_SPACE_BETWEEN_SENTENCES_DIGIT_REGEX, Cleaner::NO_SPACE_BETWEEN_SENTENCES_REGEX, Cleaner::NewLineFollowedByBulletRule, Cleaner::NewLineFollowedByPeriodRule, Cleaner::NewLineInMiddleOfWordRule, Cleaner::NoSpaceBetweenSentencesDigitRule, Cleaner::NoSpaceBetweenSentencesRule, Cleaner::PDF_NewLineInMiddleOfSentenceNoSpacesRule, Cleaner::PDF_NewLineInMiddleOfSentenceRule, Cleaner::QuotationsFirstRule, Cleaner::QuotationsSecondRule, Cleaner::ReplaceNewlineWithCarriageReturnRule, Cleaner::TableOfContentsRule, Cleaner::TypoEscapedCarriageReturnRule, Cleaner::TypoEscapedNewLineRule, Cleaner::URL_EMAIL_KEYWORDS

Constants included from Rules

Rules::AbbreviationsWithMultiplePeriodsAndEmailRule, Rules::ExtraWhiteSpaceRule, Rules::GeoLocationRule, Rules::QuestionMarkInQuotationRule, Rules::SingleNewLineRule, Rules::SubSingleQuoteRule

Instance Attribute Summary

Attributes inherited from Cleaner

#doc_type, #text

Method Summary

Methods inherited from Cleaner

#clean, #initialize

Constructor Details

This class inherits a constructor from PragmaticSegmenter::Cleaner