Class: CSKit::Parsers::ScienceHealth::ScienceHealthTokenizer

Inherits:
Tokenizer
  • Object
show all
Defined in:
lib/cskit/parsers/science_health/science_health_tokenizer.rb

Constant Summary collapse

PATTERNS =
{
  left_paren:  /\A\(/,
  right_paren: /\A\)/,
  dash:        /\A-/,
  colon:       /\A:/,
  comma:       /\A,/,
  to:          /\Ato/,
  only:        /\Aonly(?=\))/,
  cardinality: /\A(1st|2nd|3rd|4th)/,
  page_number: /\A(vii|viii|ix|x|xi|xii)(?=:)/,  # must precede a colon
  number:      /\A\d+/,
  text:        /\A[^\s\(\):,]+/,
  space:       /\A[\s\t]+/
}

Instance Attribute Summary

Attributes inherited from Tokenizer

#citation

Method Summary

Methods inherited from Tokenizer

#each_token, #initialize

Constructor Details

This class inherits a constructor from CSKit::Parsers::Tokenizer