Class: TwitterCldr::Segmentation::KoreanBreakEngine

Inherits:
CjBreakEngine show all
Includes:
Singleton
Defined in:
lib/twitter_cldr/segmentation/korean_break_engine.rb

Constant Summary

Constants inherited from CjBreakEngine

CjBreakEngine::KATAKANA_COSTS, CjBreakEngine::LARGE_NUMBER, CjBreakEngine::MAX_KATAKANA_COST, CjBreakEngine::MAX_KATAKANA_GROUP_LENGTH, CjBreakEngine::MAX_KATAKANA_LENGTH, CjBreakEngine::MAX_SNLP, CjBreakEngine::MAX_WORD_SIZE

Class Method Summary collapse

Methods inherited from DictionaryBreakEngine

#each_boundary

Class Method Details

.word_setObject



14
15
16
17
18
19
20
# File 'lib/twitter_cldr/segmentation/korean_break_engine.rb', line 14

def self.word_set
  @word_set ||= begin
    uset = TwitterCldr::Shared::UnicodeSet.new
    uset.add_range(0xAC00..0xD7A3)
    uset.to_set
  end
end