Class: PragmaticSegmenter::Process
- Inherits:
-
Object
- Object
- PragmaticSegmenter::Process
- Defined in:
- lib/pragmatic_segmenter/process.rb
Overview
This class processing segmenting the text.
Direct Known Subclasses
Languages::Arabic::Process, Languages::Common::Process, Languages::Deutsch::Process, Languages::Japanese::Process, Languages::Persian::Process, Languages::Russian::Process
Instance Attribute Summary collapse
-
#text ⇒ Object
readonly
Returns the value of attribute text.
Instance Method Summary collapse
-
#initialize(text:, language: Languages::Common) ⇒ Process
constructor
A new instance of Process.
- #process ⇒ Object
Constructor Details
Instance Attribute Details
#text ⇒ Object (readonly)
Returns the value of attribute text.
14 15 16 |
# File 'lib/pragmatic_segmenter/process.rb', line 14 def text @text end |
Instance Method Details
#process ⇒ Object
20 21 22 23 24 25 26 27 28 |
# File 'lib/pragmatic_segmenter/process.rb', line 20 def process reformatted_text = List.new(text: text).add_line_break reformatted_text = replace_abbreviations(reformatted_text) reformatted_text = replace_numbers(reformatted_text) reformatted_text = replace_continuous_punctuation(reformatted_text) reformatted_text.apply(@language::AbbreviationsWithMultiplePeriodsAndEmailRule) reformatted_text.apply(@language::GeoLocationRule) split_into_segments(reformatted_text) end |