Class: PragmaticSegmenter::Processor

Inherits:
Object
  • Object
show all
Defined in:
lib/pragmatic_segmenter/processor.rb

Overview

This class processing segmenting the text.

Direct Known Subclasses

Languages::Deutsch::Processor

Instance Attribute Summary collapse

Instance Method Summary collapse

Constructor Details

#initialize(language: Languages::Common) ⇒ Processor

Returns a new instance of Processor.



15
16
17
# File 'lib/pragmatic_segmenter/processor.rb', line 15

def initialize(language: Languages::Common)
  @language = language
end

Instance Attribute Details

#textObject (readonly)

Returns the value of attribute text.



14
15
16
# File 'lib/pragmatic_segmenter/processor.rb', line 14

def text
  @text
end

Instance Method Details

#process(text:) ⇒ Object



19
20
21
22
23
24
25
26
27
# File 'lib/pragmatic_segmenter/processor.rb', line 19

def process(text:)
  @text = List.new(text: text).add_line_break
  replace_abbreviations
  replace_numbers
  replace_continuous_punctuation
  @text.apply(@language::Abbreviations::WithMultiplePeriodsAndEmailRule)
  @text.apply(@language::GeoLocationRule)
  split_into_segments
end