Class: CorpusProcessor::Parsers::Lampada

Inherits:
Object
Defined in:
lib/corpus-processor/parsers/lampada.rb

Overview

Parser for corpora in the LâMPADA format.

Instance Method Summary

Constructor Details

#initialize(categories = CorpusProcessor::Categories.default) ⇒ Lampada

Returns a new instance of Lampada.

Parameters:

  • categories (Hash) (defaults to: CorpusProcessor::Categories.default)

    the category definitions loaded by Categories; the hash must contain an :input key.



# File 'lib/corpus-processor/parsers/lampada.rb', line 5

def initialize categories = CorpusProcessor::Categories.default
  self.categories = categories.fetch :input
end
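
The constructor keeps only the :input entry of the categories hash, and it reads that entry with Hash#fetch. A minimal sketch of that behavior in plain Ruby follows. The sample_categories hash is a hypothetical stand-in for whatever CorpusProcessor::Categories.default returns, and its keys and values are assumptions made for illustration only.

```ruby
# Hypothetical stand-in for the hash returned by
# CorpusProcessor::Categories.default; the real contents may differ.
sample_categories = {
  input:  { 'PESSOA' => :person },
  output: { person: 'PERSON' }
}

# Hash#fetch returns the value stored under :input.
input_categories = sample_categories.fetch :input

# Unlike Hash#[], which returns nil, Hash#fetch raises KeyError
# when the key is absent.
incomplete_categories = { output: {} }
missing_input_error =
  begin
    incomplete_categories.fetch :input
    nil
  rescue KeyError => error
    error
  end
```

Because fetch is used rather than [], a categories hash without an :input key fails at construction time instead of producing a parser with nil categories.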

Instance Method Details

#parse(corpus) ⇒ Array<CorpusProcessor::Token>

Parses a corpus in LâMPADA format into tokens.

Parameters:

  • corpus (String)

    the original corpus, as XML text.

Returns:

  • (Array<CorpusProcessor::Token>)


# File 'lib/corpus-processor/parsers/lampada.rb', line 13

def parse corpus
  process_nodes Nokogiri::XML(corpus).css('P')
end
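
The first step of parse is to select every P element in the XML document. The sketch below illustrates only that selection step, and it swaps Nokogiri for REXML so that it runs without installing any gem. The sample markup is a hypothetical fragment, not an excerpt from a real LâMPADA corpus, and process_nodes is not reproduced here because its source is not shown on this page.

```ruby
require 'rexml/document'

# Hypothetical sample; real LâMPADA corpora may use other tags and attributes.
sample_corpus = <<~XML
  <DOC>
    <P>First paragraph.</P>
    <P>Second paragraph.</P>
  </DOC>
XML

document = REXML::Document.new(sample_corpus)

# '//P' selects all P elements at any depth, which plays the role of
# Nokogiri's css('P') in this stand-in.
paragraphs = REXML::XPath.match(document, '//P')

paragraphs.each { |paragraph| puts paragraph.text }
```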