Class: Spacy::Span
Overview
See also spaCy Python API document for Span.
Instance Attribute Summary collapse
-
#doc ⇒ Doc
readonly
The document to which the span belongs.
-
#py_span ⇒ Object
readonly
A Python
Spaninstance accessible viaPyCall.
Instance Method Summary collapse
-
#[](range) ⇒ Object
Returns a span if a range object is given or a token if an integer representing the position of the doc is given.
-
#as_doc ⇒ Doc
Creates a document instance from the span.
-
#conjuncts ⇒ Array<Token>
Returns tokens conjugated to the root of the span.
-
#each ⇒ Object
Iterates over the elements in the span yielding a token instance each time.
-
#ents ⇒ Array<Span>
Returns an array of spans that represents named entities.
-
#initialize(doc, py_span: nil, start_index: nil, end_index: nil, options: {}) ⇒ Span
constructor
It is recommended to use Doc#span method to create a span.
-
#label ⇒ String
Returns the label.
-
#lefts ⇒ Array<Token>
Returns tokens that are to the left of the span, whose heads are within the span.
-
#method_missing(name, *args) ⇒ Object
Methods defined in Python but not wrapped in ruby-spacy can be called by this dynamic method handling mechanism.
-
#noun_chunks ⇒ Array<Span>
Returns an array of spans of noun chunks.
-
#rights ⇒ Array<Token>
Returns Tokens that are to the right of the span, whose heads are within the span.
-
#root ⇒ Token
Returns the head token.
-
#sent ⇒ Span
Returns a span that represents the sentence that the given span is part of.
-
#sents ⇒ Array<Span>
Returns an array of spans that represents sentences.
-
#similarity(other) ⇒ Float
Returns a semantic similarity estimate.
-
#subtree ⇒ Array<Token>
Returns Tokens that are within the span and tokens that descend from them.
-
#tokens ⇒ Array<Token>
Returns an array of tokens contained in the span.
Constructor Details
#initialize(doc, py_span: nil, start_index: nil, end_index: nil, options: {}) ⇒ Span
It is recommended to use Doc#span method to create a span. If you need to
create one using #initialize, there are two method signatures:
Span.new(doc, py_span: Object) or Span.new(doc, start_index: Integer, end_index: Integer, options: Hash).
366 367 368 369 370 371 372 373 |
# File 'lib/ruby-spacy.rb', line 366 def initialize(doc, py_span: nil, start_index: nil, end_index: nil, options: {}) @doc = doc if py_span @py_span = py_span else @py_span = PySpan.(@doc.py_doc, start_index, end_index + 1, ) end end |
Dynamic Method Handling
This class handles dynamic methods through the method_missing method
#method_missing(name, *args) ⇒ Object
Methods defined in Python but not wrapped in ruby-spacy can be called by this dynamic method handling mechanism.
508 509 510 |
# File 'lib/ruby-spacy.rb', line 508 def method_missing(name, *args) @py_span.send(name, *args) end |
Instance Attribute Details
#doc ⇒ Doc (readonly)
Returns the document to which the span belongs.
351 352 353 |
# File 'lib/ruby-spacy.rb', line 351 def doc @doc end |
#py_span ⇒ Object (readonly)
Returns a Python Span instance accessible via PyCall.
348 349 350 |
# File 'lib/ruby-spacy.rb', line 348 def py_span @py_span end |
Instance Method Details
#[](range) ⇒ Object
Returns a span if a range object is given or a token if an integer representing the position of the doc is given.
439 440 441 442 443 444 445 446 |
# File 'lib/ruby-spacy.rb', line 439 def [](range) if range.is_a?(Range) py_span = @py_span[range] return Span.new(@doc, start_index: py_span.start, end_index: py_span.end - 1) else return Token.new(@py_span[range]) end end |
#as_doc ⇒ Doc
Creates a document instance from the span
457 458 459 |
# File 'lib/ruby-spacy.rb', line 457 def as_doc Doc.new(@doc.py_nlp, text: self.text) end |
#conjuncts ⇒ Array<Token>
Returns tokens conjugated to the root of the span.
463 464 465 466 467 468 469 |
# File 'lib/ruby-spacy.rb', line 463 def conjuncts conjunct_array = [] PyCall::List.(@py_span.conjuncts).each do |py_conjunct| conjunct_array << Token.new(py_conjunct) end conjunct_array end |
#each ⇒ Object
Iterates over the elements in the span yielding a token instance each time.
386 387 388 389 390 |
# File 'lib/ruby-spacy.rb', line 386 def each PyCall::List.(@py_span).each do |py_token| yield Token.new(py_token) end end |
#ents ⇒ Array<Span>
Returns an array of spans that represents named entities.
422 423 424 425 426 427 428 |
# File 'lib/ruby-spacy.rb', line 422 def ents ent_array = [] PyCall::List.(@py_span.ents).each do |py_span| ent_array << Span.new(@doc, py_span: py_span) end ent_array end |
#label ⇒ String
Returns the label
503 504 505 |
# File 'lib/ruby-spacy.rb', line 503 def label @py_span.label_ end |
#lefts ⇒ Array<Token>
Returns tokens that are to the left of the span, whose heads are within the span.
473 474 475 476 477 478 479 |
# File 'lib/ruby-spacy.rb', line 473 def lefts left_array = [] PyCall::List.(@py_span.lefts).each do |py_left| left_array << Token.new(py_left) end left_array end |
#noun_chunks ⇒ Array<Span>
Returns an array of spans of noun chunks.
394 395 396 397 398 399 400 401 |
# File 'lib/ruby-spacy.rb', line 394 def noun_chunks chunk_array = [] py_chunks = PyCall::List.(@py_span.noun_chunks) py_chunks.each do |py_span| chunk_array << Span.new(@doc, py_span: py_span) end chunk_array end |
#rights ⇒ Array<Token>
Returns Tokens that are to the right of the span, whose heads are within the span.
483 484 485 486 487 488 489 |
# File 'lib/ruby-spacy.rb', line 483 def rights right_array = [] PyCall::List.(@py_span.rights).each do |py_right| right_array << Token.new(py_right) end right_array end |
#root ⇒ Token
Returns the head token
405 406 407 |
# File 'lib/ruby-spacy.rb', line 405 def root Token.new(@py_span.root) end |
#sent ⇒ Span
Returns a span that represents the sentence that the given span is part of.
432 433 434 435 |
# File 'lib/ruby-spacy.rb', line 432 def sent py_span = @py_span.sent return Span.new(@doc, py_span: py_span) end |
#sents ⇒ Array<Span>
Returns an array of spans that represents sentences.
411 412 413 414 415 416 417 418 |
# File 'lib/ruby-spacy.rb', line 411 def sents sentence_array = [] py_sentences = PyCall::List.(@py_span.sents) py_sentences.each do |py_span| sentence_array << Span.new(@doc, py_span: py_span) end sentence_array end |
#similarity(other) ⇒ Float
Returns a semantic similarity estimate.
451 452 453 |
# File 'lib/ruby-spacy.rb', line 451 def similarity(other) py_span.similarity(other.py_span) end |
#subtree ⇒ Array<Token>
Returns Tokens that are within the span and tokens that descend from them.
493 494 495 496 497 498 499 |
# File 'lib/ruby-spacy.rb', line 493 def subtree subtree_array = [] PyCall::List.(@py_span.subtree).each do |py_subtree| subtree_array << Token.new(py_subtree) end subtree_array end |