Class: LanguageParser::TokenStream

Inherits:

Array

Object
Array
LanguageParser::TokenStream

show all

Defined in:: lib/cgialib/lp/Tokenizer.rb

Overview

class : TokenStream

An array of tokens with many useful helper methods

Instance Method Summary collapse

#code_only ⇒ Object

code_only().
#comments_only ⇒ Object

comments_only().
#find(code_value) ⇒ Object

find( code_value ).
#find_and_remove(code_value) ⇒ Object

find_and_remove( code_value ).
#find_pattern(pattern) ⇒ Object

find_pattern( pattern ).
#get_comments(start) ⇒ Object

get_comments( start ).
#initialize ⇒ TokenStream constructor

initialize().
#strip! ⇒ Object

strip!().
#to_s ⇒ Object

to_s().

Constructor Details

#initialize ⇒ `TokenStream`

initialize()

Initalizes the array

# File 'lib/cgialib/lp/Tokenizer.rb', line 95

def initialize()
  
  super()
  
end

Instance Method Details

#code_only ⇒ `Object`

code_only()

Returns a new token stream with the code tokens only

# File 'lib/cgialib/lp/Tokenizer.rb', line 132

def code_only()
  
  out = TokenStream.new()
  
  each { |tok| out.push( tok ) if ( tok.is_a?( CodeToken ) ) }
  
  out
  
end

#comments_only ⇒ `Object`

comments_only()

Returns a new token stream with the comments only

# File 'lib/cgialib/lp/Tokenizer.rb', line 146

def comments_only()
  
  out = TokenStream.new()
  
  each { |tok| out.push( tok ) if ( tok.is_a?( CommentToken ) ) }
  
  out
  
end

#find(code_value) ⇒ `Object`

find( code_value )

code_value - The code token text as a string

This finds a CodeToken with the same text as the input (case insensitive) and returns the index.

# File 'lib/cgialib/lp/Tokenizer.rb', line 250

def find( code_value )
  
  each_index { |index|
  
    next unless ( at( index ).is_a?( CodeToken ) )
  
    return index if ( at( index ).to_s.downcase == code_value.to_s.downcase )
  
  }
  
  nil
  
end

#find_and_remove(code_value) ⇒ `Object`

find_and_remove( code_value )

code_value - The code token text as a string

This is the same as find, but it removes the item. It returns true if it found something and false if not.

# File 'lib/cgialib/lp/Tokenizer.rb', line 271

def find_and_remove( code_value )
  
  index = find( code_value )
  
  delete_at( index ) if ( index != nil )
  
  ( index == nil ) ? false : true
  
end

#find_pattern(pattern) ⇒ `Object`

find_pattern( pattern )

pattern - A pattern array

This searches the set of tokens for a pattern of strings. The array should contain a series of strings and lambdas. In the first pass the pattern finder looks for the strings. If a match is found the lambdas are called with the token in the original string for that position.

For example:

find_pattern( “primary”, “key”, “(”, lambda { |name| print name }, “)” )

Would print “myname” if the original sequence was “primary key ( myname )”

# File 'lib/cgialib/lp/Tokenizer.rb', line 171

def find_pattern( pattern )
  
  code = code_only
  
  delta = ( code.length - pattern.length ) + 1
  
  delta.times { |start|
  
    found = true
  
    pattern.each_index { |index|
  
      if ( pattern[ index ].is_a?( String ) )
  
        unless ( pattern[ index ].downcase == code[ start + index ].to_s.downcase )
          found = false 
        end
  
      end
  
    }
  
    next unless ( found )
  
    pattern.each_index { |index|
  
      unless ( pattern[ index ].is_a?( String ) )
  
        pattern[index].call( code[ start + index ].to_s.downcase )
  
      end
  
    }
  
    return true
  
  }
  
  false
  
end

#get_comments(start) ⇒ `Object`

get_comments( start )

start - The start index

This method looks backwards from the starting index to find all of the comments and to put them together into a comment stream. It stops if it finds new CodeTokens.

# File 'lib/cgialib/lp/Tokenizer.rb', line 221

def get_comments( start )
  
  commentStream = TokenStream.new()
  
  index = start - 1
  
  while ( index > -1 )
  
    break if ( at( index ).is_a?( CodeToken ) )
  
    commentStream.unshift( at( index ) )
  
    index -= 1
  
  end
  
  comments = commentStream.map { |tok| tok.to_s }.join( "" )
  
  comments
  
end

#strip! ⇒ `Object`

strip!()

Deletes any leading or trailing whitespace

# File 'lib/cgialib/lp/Tokenizer.rb', line 117

def strip!()
  
  while( first.is_a?( WhitespaceToken ) )
    shift
  end
  while( last.is_a?( WhitespaceToken ) )
    pop
  end
  
end

#to_s ⇒ `Object`

to_s()

Converts all of the tokens back to text

# File 'lib/cgialib/lp/Tokenizer.rb', line 105

def to_s()
  
  text = ""
  each { |tok| text += tok.to_s }
  text
  
end

Class: LanguageParser::TokenStream

Overview

Instance Method Summary collapse

Constructor Details

#initialize ⇒ TokenStream

Instance Method Details

#code_only ⇒ Object

#comments_only ⇒ Object

#find(code_value) ⇒ Object

#find_and_remove(code_value) ⇒ Object

#find_pattern(pattern) ⇒ Object

#get_comments(start) ⇒ Object

#strip! ⇒ Object

#to_s ⇒ Object