Module: PragmaticSegmenter::Rules
- Included in: Cleaner, Process, PunctuationReplacer
- Defined in: lib/pragmatic_segmenter/rules.rb, lib/pragmatic_segmenter/cleaner.rb, lib/pragmatic_segmenter/ellipsis.rb
Defined Under Namespace
Modules: DoublePunctuationRules, EllipsisRules, EscapeRegexReservedCharacters, ExclamationPointRules, HtmlRules, ReinsertEllipsisRules, SubEscapedRegexReservedCharacters, SubSymbolsRules
Constant Summary
- AbbreviationsWithMultiplePeriodsAndEmailRule =
Rubular: rubular.com/r/EUbZCNfgei
Rule.new(/(\w)(\.)(\w)/, '\1∮\3')
- GeoLocationRule =
Rubular: rubular.com/r/G2opjedIm9
Rule.new(/(?<=[a-zA-z]°)\.(?=\s*\d+)/, '∯')
- SingleNewLineRule =
Rule.new(/\n/, 'ȹ')
- SubSingleQuoteRule =
Rule.new(/&⎋&/, "'")
- ExtraWhiteSpaceRule =
Rule.new(/\s{3,}/, ' ')
- QuestionMarkInQuotationRule =
Rubular: rubular.com/r/aXPUGm6fQh
Rule.new(/\?(?=(\'|\"))/, '&ᓷ&')
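
The constants above each pair a regular expression with a replacement string. The sketch below shows what those pairs do to sample text. It is an illustration under stated assumptions, not the library's implementation: the `Rule` struct and the `apply_rules` helper are hypothetical stand-ins defined here so the example runs without the gem, and it assumes each rule is applied with `String#gsub`. The patterns and replacements are copied verbatim from the listing.

```ruby
# Hypothetical stand-in for the library's Rule: a pattern/replacement pair.
Rule = Struct.new(:pattern, :replacement)

# Hypothetical helper (assumption): apply each rule in order via String#gsub.
def apply_rules(text, *rules)
  rules.reduce(text) { |str, rule| str.gsub(rule.pattern, rule.replacement) }
end

# Patterns and replacements reproduced as listed in the Constant Summary.
abbreviation_rule      = Rule.new(/(\w)(\.)(\w)/, '\1∮\3')
geo_location_rule      = Rule.new(/(?<=[a-zA-z]°)\.(?=\s*\d+)/, '∯')
single_new_line_rule   = Rule.new(/\n/, 'ȹ')
sub_single_quote_rule  = Rule.new(/&⎋&/, "'")
extra_white_space_rule = Rule.new(/\s{3,}/, ' ')
question_mark_rule     = Rule.new(/\?(?=(\'|\"))/, '&ᓷ&')

# A period between two word characters becomes a placeholder,
# so periods inside an email address no longer look like sentence ends.
email = apply_rules('Mail jane.doe@example.com today.', abbreviation_rule)

# A period after "<letter>°" and before digits becomes a placeholder.
geo = apply_rules('N°. 1026.253.553', geo_location_rule)

# Newlines are swapped for a marker character.
newline = apply_rules("first\nsecond", single_new_line_rule)

# The single-quote placeholder is turned back into an apostrophe.
quote = apply_rules('it&⎋&s', sub_single_quote_rule)

# Runs of three or more whitespace characters collapse to one space.
spaces = apply_rules('too    wide', extra_white_space_rule)

# A question mark directly before a quote character is masked.
question = apply_rules('She asked "why?" twice.', question_mark_rule)

puts email, geo, newline, quote, spaces, question
```

Note that the placeholder characters (`∮`, `∯`, `ȹ`, `&ᓷ&`) appear to be chosen because they are unlikely to occur in ordinary text, which lets a later step swap the original punctuation back in. `SubSingleQuoteRule` is an example of such a reverse substitution.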