Class: PragmaticTokenizer::Languages::French::SingleQuotes

Inherits:
Object
Defined in:
lib/pragmatic_tokenizer/languages/french.rb

Constant Summary

ALNUM_QUOTE =
PragmaticTokenizer::Languages::Common::SingleQuotes::ALNUM_QUOTE
(Source note: why can’t we directly reference constants from Languages::Common?)
QUOTE_WORD =
PragmaticTokenizer::Languages::Common::SingleQuotes::QUOTE_WORD
L_APOSTROPHE =
/l'/i

Instance Method Summary

#handle_single_quotes(text) ⇒ Object

Instance Method Details

#handle_single_quotes(text) ⇒ Object



# File 'lib/pragmatic_tokenizer/languages/french.rb', line 16

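def handle_single_quotes(text)
  # Look up the mapped replacement for the apostrophe character.
  replacement = PragmaticTokenizer::Languages::Common::PUNCTUATION_MAP["'".freeze]
  # L_APOSTROPHE has no capture group, so '\1' expands to an empty string here.
  text.gsub!(L_APOSTROPHE, '\1 l' << replacement << ' ')
  text.gsub!(ALNUM_QUOTE,  '\1 '  << replacement << ' ')
  text.gsub!(QUOTE_WORD,   ' '    << replacement)
  # gsub! modifies the argument in place; the same string object is returned.
  text
end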
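
The first substitution in the method above can be tried in isolation. The following is a minimal sketch, not the library's code: APOSTROPHE_PLACEHOLDER is a hypothetical stand-in for PUNCTUATION_MAP["'"] (its real value is not shown on this page), split_l_apostrophe is a hypothetical helper name, and the sketch uses the non-mutating gsub instead of gsub!. Only the L_APOSTROPHE pattern is taken from the constants listed above.

```ruby
# Hypothetical stand-in for PUNCTUATION_MAP["'"]; the real value lives in
# PragmaticTokenizer::Languages::Common and is not shown on this page.
APOSTROPHE_PLACEHOLDER = '<APOS>'

# Same pattern as the L_APOSTROPHE constant documented above.
L_APOSTROPHE = /l'/i

# Hypothetical helper: a non-mutating version of the first gsub! call.
# The pattern has no capture group, so '\1' contributes an empty string.
def split_l_apostrophe(text)
  text.gsub(L_APOSTROPHE, '\1 l' + APOSTROPHE_PLACEHOLDER + ' ')
end

lower = split_l_apostrophe("l'homme")
upper = split_l_apostrophe("L'arbre")

puts lower.inspect
puts upper.inspect
```

Under these assumptions the elided article is separated from the following word by spaces, and because the pattern is case-insensitive while the replacement text hard-codes a lowercase "l", an uppercase "L'" comes out lowercased.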