Class: Sphinx::Client

Inherits:

Object

Object
Sphinx::Client

show all

Defined in:: lib/sphinx/sphinx/client.rb

Overview

:startdoc:

Direct Known Subclasses

Zinx::Client

Constant Summary collapse

SEARCHD_COMMAND_SEARCH = search command

SEARCHD_COMMAND_EXCERPT = excerpt command

SEARCHD_COMMAND_UPDATE = update command

SEARCHD_COMMAND_KEYWORDS = keywords command

VER_COMMAND_SEARCH = search command version

0x119

VER_COMMAND_EXCERPT = excerpt command version

0x102

VER_COMMAND_UPDATE = update command version

0x102

VER_COMMAND_KEYWORDS = keywords command version

0x100

SEARCHD_OK = general success, command-specific reply follows

SEARCHD_ERROR = general failure, command-specific reply may follow

SEARCHD_RETRY = temporaty failure, client should retry later

SEARCHD_WARNING = general success, warning message and command-specific reply follow

SPH_MATCH_ALL = match all query words

SPH_MATCH_ANY = match any query word

SPH_MATCH_PHRASE = match this exact phrase

SPH_MATCH_BOOLEAN = match this boolean query

SPH_MATCH_EXTENDED = match this extended query

SPH_MATCH_FULLSCAN = match all document IDs w/o fulltext query, apply filters

SPH_MATCH_EXTENDED2 = extended engine V2 (TEMPORARY, WILL BE REMOVED IN 0.9.8-RELEASE)

SPH_RANK_PROXIMITY_BM25 = default mode, phrase proximity major factor and BM25 minor one

SPH_RANK_BM25 = statistical mode, BM25 ranking only (faster but worse quality)

SPH_RANK_NONE = no ranking, all matches get a weight of 1

SPH_RANK_WORDCOUNT = simple word-count weighting, rank is a weighted sum of per-field keyword occurence counts

SPH_RANK_PROXIMITY = phrase proximity

SPH_SORT_RELEVANCE = sort by document relevance desc, then by date

SPH_SORT_ATTR_DESC = sort by document date desc, then by relevance desc

SPH_SORT_ATTR_ASC = sort by document date asc, then by relevance desc

SPH_SORT_TIME_SEGMENTS = sort by time segments (hour/day/week/etc) desc, then by relevance desc

SPH_SORT_EXTENDED = sort by SQL-like expression (eg. “@relevance DESC, price ASC, @id DESC”)

SPH_SORT_EXPR = sort by arithmetic expression in descending order (eg. “@id + max(@weight,1000)*boost + log(price)”)

SPH_FILTER_VALUES = filter by integer values set

SPH_FILTER_RANGE = filter by integer range

SPH_FILTER_FLOATRANGE = filter by float range

SPH_ATTR_INTEGER = this attr is just an integer

SPH_ATTR_TIMESTAMP = this attr is a timestamp

SPH_ATTR_ORDINAL = this attr is an ordinal string number (integer at search time, specially handled at indexing time)

SPH_ATTR_BOOL = this attr is a boolean bit field

SPH_ATTR_FLOAT = this attr is a float

SPH_ATTR_BIGINT = signed 64-bit integer

SPH_ATTR_STRING = string

SPH_ATTR_MULTI = this attr has multiple values (0 or more)

0x40000001

SPH_ATTR_MULTI64 =

0x40000002

SPH_GROUPBY_DAY = group by day

SPH_GROUPBY_WEEK = group by week

SPH_GROUPBY_MONTH = group by month

SPH_GROUPBY_YEAR = group by year

SPH_GROUPBY_ATTR = group by attribute value

SPH_GROUPBY_ATTRPAIR = group by sequential attrs pair

Instance Method Summary collapse

#AddQuery(query, index = '*', comment = '') ⇒ Object

Add query to batch.
#BuildExcerpts(docs, index, words, opts = {}) ⇒ Object

Connect to searchd server and generate exceprts from given documents.
#BuildKeywords(query, index, hits) ⇒ Object

Connect to searchd server, and generate keyword list for a given query.
#GetLastError ⇒ Object

Get last error message.
#GetLastWarning ⇒ Object

Get last warning message.
#initialize ⇒ Client constructor

Constructs the Sphinx::Client object and sets options to their default values.
#Query(query, index = '*', comment = '') ⇒ Object

index is index name (or names) to query.
#ResetFilters ⇒ Object

Clear all filters (for multi-queries).
#ResetGroupBy ⇒ Object

Clear groupby settings (for multi-queries).
#ResetOverrides ⇒ Object

Clear all attribute value overrides (for multi-queries).
#RunQueries ⇒ Object

Run queries batch.
#SetFieldWeights(weights) ⇒ Object

Bind per-field weights by name.
#SetFilter(attribute, values, exclude = false) ⇒ Object

Set values filter.
#SetFilterFloatRange(attribute, min, max, exclude = false) ⇒ Object

Set float range filter.
#SetFilterRange(attribute, min, max, exclude = false) ⇒ Object

Set range filter.
#SetGeoAnchor(attrlat, attrlong, lat, long) ⇒ Object

Setup anchor point for geosphere distance calculations.
#SetGroupBy(attribute, func, groupsort = '@group desc') ⇒ Object

Set grouping attribute and function.
#SetGroupDistinct(attribute) ⇒ Object

Set count-distinct attribute for group-by queries.
#SetIDRange(min, max) ⇒ Object

Set IDs range to match.
#SetIndexWeights(weights) ⇒ Object

Bind per-index weights by name.
#SetLimits(offset, limit, max = 0, cutoff = 0) ⇒ Object

Set offset and count into result set, and optionally set max-matches and cutoff limits.
#SetMatchMode(mode) ⇒ Object

Set matching mode.
#SetMaxQueryTime(max) ⇒ Object

Set maximum query time, in milliseconds, per-index, integer, 0 means “do not limit”.
#SetOverride(attrname, attrtype, values) ⇒ Object

There can be only one override per attribute.
#SetRankingMode(ranker) ⇒ Object

Set ranking mode.
#SetRetries(count, delay = 0) ⇒ Object

Set distributed retries count and delay.
#SetSelect(select) ⇒ Object

Set select-list (attributes or expressions), SQL-like syntax.
#SetServer(host, port) ⇒ Object

Set searchd host name (string) and port (integer).
#SetSortMode(mode, sortby = '') ⇒ Object

Set matches sorting mode.
#SetWeights(weights) ⇒ Object

Bind per-field weights by order.
#UpdateAttributes(index, attrs, values, mva = false) ⇒ Object

Batch update given attributes in given rows in given indexes.

Constructor Details

#initialize ⇒ `Client`

Constructs the Sphinx::Client object and sets options to their default values.

# File 'lib/sphinx/sphinx/client.rb', line 172

def initialize
  # per-client-object settings
  @host          = 'localhost'             # searchd host (default is "localhost")
  @port          = 9312                    # searchd port (default is 9312)
  
  # per-query settings
  @offset        = 0                       # how many records to seek from result-set start (default is 0)
  @limit         = 20                      # how many records to return from result-set starting at offset (default is 20)
  @mode          = SPH_MATCH_ALL           # query matching mode (default is SPH_MATCH_ALL)
  @weights       = []                      # per-field weights (default is 1 for all fields)
  @sort          = SPH_SORT_RELEVANCE      # match sorting mode (default is SPH_SORT_RELEVANCE)
  @sortby        = ''                      # attribute to sort by (defualt is "")
  @min_id        = 0                       # min ID to match (default is 0, which means no limit)
  @max_id        = 0                       # max ID to match (default is 0, which means no limit)
  @filters       = []                      # search filters
  @groupby       = ''                      # group-by attribute name
  @groupfunc     = SPH_GROUPBY_DAY         # function to pre-process group-by attribute value with
  @groupsort     = '@group desc'           # group-by sorting clause (to sort groups in result set with)
  @groupdistinct = ''                      # group-by count-distinct attribute
  @maxmatches    = 1000                    # max matches to retrieve
  @cutoff        = 0                       # cutoff to stop searching at (default is 0)
  @retrycount    = 0                       # distributed retries count
  @retrydelay    = 0                       # distributed retries delay
  @anchor        = []                      # geographical anchor point
  @indexweights  = []                      # per-index weights
  @ranker        = SPH_RANK_PROXIMITY_BM25 # ranking mode (default is SPH_RANK_PROXIMITY_BM25)
  @maxquerytime  = 0                       # max query time, milliseconds (default is 0, do not limit) 
  @fieldweights  = {}                      # per-field-name weights
  @overrides     = []                      # per-query attribute values overrides
  @select        = '*'                     # select-list (attributes or expressions, with optional aliases)

  # per-reply fields (for single-query case)
  @error         = ''                      # last error message
  @warning       = ''                      # last warning message
  
  @reqs          = []                      # requests storage (for multi-query case)
  @mbenc         = ''                      # stored mbstring encoding
end

Instance Method Details

#AddQuery(query, index = '*', comment = '') ⇒ `Object`

Add query to batch.

Batch queries enable searchd to perform internal optimizations, if possible; and reduce network connection overheads in all cases.

For instance, running exactly the same query with different groupby settings will enable searched to perform expensive full-text search and ranking operation only once, but compute multiple groupby results from its output.

Parameters are exactly the same as in Query call. Returns index to results array returned by RunQueries call.

# File 'lib/sphinx/sphinx/client.rb', line 565

def AddQuery(query, index = '*', comment = '')
  # build request
  
  # mode and limits
  request = Request.new
  request.put_int @offset, @limit, @mode, @ranker, @sort
  request.put_string @sortby
  # query itself
  request.put_string query
  # weights
  request.put_int_array @weights
  # indexes
  request.put_string index
  # id64 range marker
  request.put_int 1
  # id64 range
  request.put_int64 @min_id.to_i, @max_id.to_i 
  
  # filters
  request.put_int @filters.length
  @filters.each do |filter|
    request.put_string filter['attr']
    request.put_int filter['type']

    case filter['type']
      when SPH_FILTER_VALUES
        request.put_int64_array filter['values']
      when SPH_FILTER_RANGE
        request.put_int64 filter['min'], filter['max']
      when SPH_FILTER_FLOATRANGE
        request.put_float filter['min'], filter['max']
      else
        raise SphinxInternalError, 'Internal error: unhandled filter type'
    end
    request.put_int filter['exclude'] ? 1 : 0
  end
  
  # group-by clause, max-matches count, group-sort clause, cutoff count
  request.put_int @groupfunc
  request.put_string @groupby
  request.put_int @maxmatches
  request.put_string @groupsort
  request.put_int @cutoff, @retrycount, @retrydelay
  request.put_string @groupdistinct
  
  # anchor point
  if @anchor.empty?
    request.put_int 0
  else
    request.put_int 1
    request.put_string @anchor['attrlat'], @anchor['attrlong']
    request.put_float @anchor['lat'], @anchor['long']
  end
  
  # per-index weights
  request.put_int @indexweights.length
  @indexweights.each do |idx, weight|
    request.put_string idx
    request.put_int weight
  end
  
  # max query time
  request.put_int @maxquerytime
  
  # per-field weights
  request.put_int @fieldweights.length
  @fieldweights.each do |field, weight|
    request.put_string field
    request.put_int weight
  end
  
  # comment
  request.put_string comment
  
  # attribute overrides
  request.put_int @overrides.length
  for entry in @overrides do
    request.put_string entry['attr']
    request.put_int entry['type'], entry['values'].size
    entry['values'].each do |id, val|
      assert { id.instance_of?(Fixnum) || id.instance_of?(Bignum) }
      assert { val.instance_of?(Fixnum) || val.instance_of?(Bignum) || val.instance_of?(Float) }
      
      request.put_int64 id
      case entry['type']
        when SPH_ATTR_FLOAT
          request.put_float val
        when SPH_ATTR_BIGINT
          request.put_int64 val
        else
          request.put_int val
      end
    end
  end
  
  # select-list
  request.put_string @select
  
  # store request to requests array
  @reqs << request.to_s;
  return @reqs.length - 1
end

#BuildExcerpts(docs, index, words, opts = {}) ⇒ `Object`

Connect to searchd server and generate exceprts from given documents.

docs – an array of strings which represent the documents’ contents
index – a string specifiying the index which settings will be used

for stemming, lexing and case folding

words – a string which contains the words to highlight
opts is a hash which contains additional optional highlighting parameters.

You can use following parameters:

'before_match' – a string to insert before a set of matching words, default is “<b>”
'after_match' – a string to insert after a set of matching words, default is “<b>”
'chunk_separator' – a string to insert between excerpts chunks, default is “ … ”
'limit' – max excerpt size in symbols (codepoints), default is 256
'around' – how much words to highlight around each match, default is 5
'exact_phrase' – whether to highlight exact phrase matches only, default is false
'single_passage' – whether to extract single best passage only, default is false
'use_boundaries' – whether to extract passages by phrase boundaries setup in tokenizer
'weight_order' – whether to order best passages in document (default) or weight order

Returns false on failure. Returns an array of string excerpts on success.

# File 'lib/sphinx/sphinx/client.rb', line 830

def BuildExcerpts(docs, index, words, opts = {})
  assert { docs.instance_of? Array }
  assert { index.instance_of? String }
  assert { words.instance_of? String }
  assert { opts.instance_of? Hash }

  # fixup options
  opts['before_match'] ||= '<b>';
  opts['after_match'] ||= '</b>';
  opts['chunk_separator'] ||= ' ... ';
	  opts['html_strip_mode'] ||= 'index';
  opts['limit'] ||= 256;
	  opts['limit_passages'] ||= 0;
	  opts['limit_words'] ||= 0;
  opts['around'] ||= 5;
	  opts['start_passage_id'] ||= 1;
  opts['exact_phrase'] ||= false
  opts['single_passage'] ||= false
  opts['use_boundaries'] ||= false
  opts['weight_order'] ||= false
	  opts['load_files'] ||= false
	  opts['allow_empty'] ||= false
  
  # build request
  
  # v.1.0 req
  flags = 1
  flags |= 2  if opts['exact_phrase']
  flags |= 4  if opts['single_passage']
  flags |= 8  if opts['use_boundaries']
  flags |= 16 if opts['weight_order']
	  flags |= 32 if opts['query_mode']
	  flags |= 64 if opts['force_all_words']
	  flags |= 128 if opts['load_files']
	  flags |= 256 if opts['allow_empty']
  
  request = Request.new
  request.put_int 0, flags # mode=0, flags=1 (remove spaces)
  # req index
  request.put_string index
  # req words
  request.put_string words
  
  # options
  request.put_string opts['before_match']
  request.put_string opts['after_match']
  request.put_string opts['chunk_separator']
  request.put_int opts['limit'].to_i, opts['around'].to_i
	  
	  # options v1.2
	  request.put_int opts['limit_passages'].to_i
	  request.put_int opts['limit_words'].to_i
	  request.put_int opts['start_passage_id'].to_i
	  request.put_string opts['html_strip_mode']
  
  # documents
  request.put_int docs.size
  docs.each do |doc|
    assert { doc.instance_of? String }

    request.put_string doc
  end
  
  response = PerformRequest(:excerpt, request)
  
  # parse response
  begin
    res = []
    docs.each do |doc|
      res << response.get_string
    end
  rescue EOFError
    @error = 'incomplete reply'
    raise SphinxResponseError, @error
  end
  return res
end

#BuildKeywords(query, index, hits) ⇒ `Object`

Connect to searchd server, and generate keyword list for a given query.

Returns an array of words on success.

# File 'lib/sphinx/sphinx/client.rb', line 911

def BuildKeywords(query, index, hits)
  assert { query.instance_of? String }
  assert { index.instance_of? String }
  assert { hits.instance_of?(TrueClass) || hits.instance_of?(FalseClass) }
  
  # build request
  request = Request.new
  # v.1.0 req
  request.put_string query # req query
  request.put_string index # req index
  request.put_int hits ? 1 : 0

  response = PerformRequest(:keywords, request)
  
  # parse response
  begin
    res = []
    nwords = response.get_int
    0.upto(nwords - 1) do |i|
      tokenized = response.get_string
      normalized = response.get_string
      
      entry = { 'tokenized' => tokenized, 'normalized' => normalized }
      entry['docs'], entry['hits'] = response.get_ints(2) if hits
      
      res << entry
    end
  rescue EOFError
    @error = 'incomplete reply'
    raise SphinxResponseError, @error
  end
  
  return res
end

#GetLastError ⇒ `Object`

Get last error message.



212
213
214

# File 'lib/sphinx/sphinx/client.rb', line 212

def GetLastError
  @error
end

#GetLastWarning ⇒ `Object`

Get last warning message.



217
218
219

# File 'lib/sphinx/sphinx/client.rb', line 217

def GetLastWarning
  @warning
end

#Query(query, index = '*', comment = '') ⇒ `Object`

index is index name (or names) to query. default value is “*” which means to query all indexes. Accepted characters for index names are letters, numbers, dash, and underscore; everything else is considered a separator. Therefore, all the following calls are valid and will search two indexes:

sphinx.Query('test query', 'main delta')
sphinx.Query('test query', 'main;delta')
sphinx.Query('test query', 'main, delta')

Index order matters. If identical IDs are found in two or more indexes, weight and attribute values from the very last matching index will be used for sorting and returning to client. Therefore, in the example above, matches from “delta” index will always “win” over matches from “main”.

Returns false on failure. Returns hash which has the following keys on success:

'matches' – array of hashes ‘group’, ‘id’, where ‘id’ is document_id.
'total' – total amount of matches retrieved (upto SPH_MAX_MATCHES, see sphinx.h)
'total_found' – total amount of matching documents in index
'time' – search time
'words' – hash which maps query terms (stemmed!) to (‘docs’, ‘hits’) hash

# File 'lib/sphinx/sphinx/client.rb', line 536

def Query(query, index = '*', comment = '')
  assert { @reqs.empty? }
  @reqs = []
  
  self.AddQuery(query, index, comment)
  results = self.RunQueries
  
  # probably network error; error message should be already filled
  return false unless results.instance_of?(Array)
  
  @error = results[0]['error']
  @warning = results[0]['warning']
  
  return false if results[0]['status'] == SEARCHD_ERROR
  return results[0]
end

#ResetFilters ⇒ `Object`

Clear all filters (for multi-queries).

# File 'lib/sphinx/sphinx/client.rb', line 492

def ResetFilters
  @filters = []
  @anchor = []
end

#ResetGroupBy ⇒ `Object`

Clear groupby settings (for multi-queries).

# File 'lib/sphinx/sphinx/client.rb', line 498

def ResetGroupBy
  @groupby       = ''
  @groupfunc     = SPH_GROUPBY_DAY
  @groupsort     = '@group desc'
  @groupdistinct = ''
end

#ResetOverrides ⇒ `Object`

Clear all attribute value overrides (for multi-queries).



506
507
508

# File 'lib/sphinx/sphinx/client.rb', line 506

def ResetOverrides
  @overrides = []
end

#RunQueries ⇒ `Object`

Run queries batch.

Returns an array of result sets on success. Returns false on network IO failure.

Each result set in returned array is a hash which containts the same keys as the hash returned by Query, plus:

'error' – search error for this query
'words' – hash which maps query terms (stemmed!) to ( “docs”, “hits” ) hash

# File 'lib/sphinx/sphinx/client.rb', line 678

def RunQueries
  if @reqs.empty?
    @error = 'No queries defined, issue AddQuery() first'
    return false
  end

  req = @reqs.join('')
  nreqs = @reqs.length
  @reqs = []
  response = PerformRequest(:search, req, nreqs)
 
  # parse response
  begin
    results = []
    ires = 0
    while ires < nreqs
      ires += 1
      result = {}
      
      result['error'] = ''
      result['warning'] = ''
      
      # extract status
      status = result['status'] = response.get_int
      if status != SEARCHD_OK
        message = response.get_string
        if status == SEARCHD_WARNING
          result['warning'] = message
        else
          result['error'] = message
          results << result
          next
        end
      end
  
      # read schema
      fields = []
      attrs = {}
      attrs_names_in_order = []
      
      nfields = response.get_int
      while nfields > 0
        nfields -= 1
        fields << response.get_string
      end
      result['fields'] = fields
  
      nattrs = response.get_int
      while nattrs > 0
        nattrs -= 1
        attr = response.get_string
        type = response.get_int
        attrs[attr] = type
        attrs_names_in_order << attr
      end
      result['attrs'] = attrs
      
      # read match count
      count = response.get_int
      id64 = response.get_int
      
      # read matches
      result['matches'] = []
      while count > 0
        count -= 1
        
        if id64 != 0
          doc = response.get_int64
          weight = response.get_int
        else
          doc, weight = response.get_ints(2)
        end
  
        r = {} # This is a single result put in the result['matches'] array
        r['id'] = doc
        r['weight'] = weight
        attrs_names_in_order.each do |a|
          r['attrs'] ||= {}
  
          case attrs[a]
            when SPH_ATTR_BIGINT
              # handle 64-bit ints
              r['attrs'][a] = response.get_int64
            when SPH_ATTR_FLOAT
              # handle floats
              r['attrs'][a] = response.get_float
when SPH_ATTR_STRING
  # handle string
  r['attrs'][a] = response.get_string
            else
              # handle everything else as unsigned ints
              val = response.get_int
              if attrs[a]==SPH_ATTR_MULTI
                r['attrs'][a] = []
                1.upto(val) do
                  r['attrs'][a] << response.get_int
                end
              elsif attrs[a]==SPH_ATTR_MULTI64
                r['attrs'][a] = []
	val = val/2
                1.upto(val) do
                  r['attrs'][a] << response.get_int64
                end
              else
                r['attrs'][a] = val
              end
          end
        end
        result['matches'] << r
      end
      result['total'], result['total_found'], msecs, words = response.get_ints(4)
      result['time'] = '%.3f' % (msecs / 1000.0)
  
      result['words'] = {}
      while words > 0
        words -= 1
        word = response.get_string
        docs, hits = response.get_ints(2)
        result['words'][word] = { 'docs' => docs, 'hits' => hits }
      end
      
      results << result
    end
  #rescue EOFError
  #  @error = 'incomplete reply'
  #  raise SphinxResponseError, @error
  end
  
  return results
end

#SetFieldWeights(weights) ⇒ `Object`

Bind per-field weights by name.

Takes string (field name) to integer name (field weight) hash as an argument.

Takes precedence over SetWeights().
Unknown names will be silently ignored.
Unbound fields will be silently given a weight of 1.

# File 'lib/sphinx/sphinx/client.rb', line 311

def SetFieldWeights(weights)
  assert { weights.instance_of? Hash }
  weights.each do |name, weight|
    assert { name.instance_of? String }
    assert { weight.instance_of? Fixnum }
  end

  @fieldweights = weights
end

#SetFilter(attribute, values, exclude = false) ⇒ `Object`

Set values filter.

Only match those records where attribute column values are in specified set.

# File 'lib/sphinx/sphinx/client.rb', line 348

def SetFilter(attribute, values, exclude = false)
  assert { attribute.instance_of? String }
  assert { values.instance_of? Array }
  assert { !values.empty? }

  if values.instance_of?(Array) && values.size > 0
    values.each do |value|
      assert { value.instance_of? Fixnum }
    end
  
    @filters << { 'type' => SPH_FILTER_VALUES, 'attr' => attribute, 'exclude' => exclude, 'values' => values }
  end
end

#SetFilterFloatRange(attribute, min, max, exclude = false) ⇒ `Object`

Set float range filter.

Only match those records where attribute column value is beetwen min and max (including min and max).

# File 'lib/sphinx/sphinx/client.rb', line 379

def SetFilterFloatRange(attribute, min, max, exclude = false)
  assert { attribute.instance_of? String }
  assert { min.instance_of? Float }
  assert { max.instance_of? Float }
  assert { min <= max }

  @filters << { 'type' => SPH_FILTER_FLOATRANGE, 'attr' => attribute, 'exclude' => exclude, 'min' => min, 'max' => max }
end

#SetFilterRange(attribute, min, max, exclude = false) ⇒ `Object`

Set range filter.

Only match those records where attribute column value is beetwen min and max (including min and max).

# File 'lib/sphinx/sphinx/client.rb', line 366

def SetFilterRange(attribute, min, max, exclude = false)
  assert { attribute.instance_of? String }
  assert { min.instance_of? Fixnum or min.instance_of? Bignum }
  assert { max.instance_of? Fixnum or max.instance_of? Bignum }
  assert { min <= max }

  @filters << { 'type' => SPH_FILTER_RANGE, 'attr' => attribute, 'exclude' => exclude, 'min' => min, 'max' => max }
end

#SetGeoAnchor(attrlat, attrlong, lat, long) ⇒ `Object`

Setup anchor point for geosphere distance calculations.

Required to use @geodist in filters and sorting distance will be computed to this point. Latitude and longitude must be in radians.

attrlat – is the name of latitude attribute
attrlong – is the name of longitude attribute
lat – is anchor point latitude, in radians
long – is anchor point longitude, in radians

# File 'lib/sphinx/sphinx/client.rb', line 398

def SetGeoAnchor(attrlat, attrlong, lat, long)
  assert { attrlat.instance_of? String }
  assert { attrlong.instance_of? String }
  assert { lat.instance_of? Float }
  assert { long.instance_of? Float }

  @anchor = { 'attrlat' => attrlat, 'attrlong' => attrlong, 'lat' => lat, 'long' => long }
end

#SetGroupBy(attribute, func, groupsort = '@group desc') ⇒ `Object`

Set grouping attribute and function.

In grouping mode, all matches are assigned to different groups based on grouping function value.

Each group keeps track of the total match count, and the best match (in this group) according to current sorting function.

The final result set contains one best match per group, with grouping function value and matches count attached.

Groups in result set could be sorted by any sorting clause, including both document attributes and the following special internal Sphinx attributes:

@id - match document ID;
@weight, @rank, @relevance - match weight;
@group - groupby function value;
@count - amount of matches in group.

the default mode is to sort by groupby value in descending order, ie. by ‘@group desc’.

‘total_found’ would contain total amount of matching groups over the whole index.

WARNING: grouping is done in fixed memory and thus its results are only approximate; so there might be more groups reported in total_found than actually present. @count might also be underestimated.

For example, if sorting by relevance and grouping by “published” attribute with SPH_GROUPBY_DAY function, then the result set will contain one most relevant match per each day when there were any matches published, with day number and per-day match count attached, and sorted by day number in descending order (ie. recent days first).

# File 'lib/sphinx/sphinx/client.rb', line 443

def SetGroupBy(attribute, func, groupsort = '@group desc')
  assert { attribute.instance_of? String }
  assert { groupsort.instance_of? String }
  assert { func == SPH_GROUPBY_DAY \
        || func == SPH_GROUPBY_WEEK \
        || func == SPH_GROUPBY_MONTH \
        || func == SPH_GROUPBY_YEAR \
        || func == SPH_GROUPBY_ATTR \
        || func == SPH_GROUPBY_ATTRPAIR }

  @groupby = attribute
  @groupfunc = func
  @groupsort = groupsort
end

#SetGroupDistinct(attribute) ⇒ `Object`

Set count-distinct attribute for group-by queries.

# File 'lib/sphinx/sphinx/client.rb', line 459

def SetGroupDistinct(attribute)
  assert { attribute.instance_of? String }
  @groupdistinct = attribute
end

#SetIDRange(min, max) ⇒ `Object`

Set IDs range to match.

Only match records if document ID is beetwen min_id and max_id (inclusive).

# File 'lib/sphinx/sphinx/client.rb', line 335

def SetIDRange(min, max)
  assert { min.instance_of?(Fixnum) or min.instance_of?(Bignum) }
  assert { max.instance_of?(Fixnum) or max.instance_of?(Bignum) }
  assert { min <= max }

  @min_id = min
  @max_id = max
end

#SetIndexWeights(weights) ⇒ `Object`

Bind per-index weights by name.

# File 'lib/sphinx/sphinx/client.rb', line 322

def SetIndexWeights(weights)
  assert { weights.instance_of? Hash }
  weights.each do |index, weight|
    assert { index.instance_of? String }
    assert { weight.instance_of? Fixnum }
  end
  
  @indexweights = weights
end

#SetLimits(offset, limit, max = 0, cutoff = 0) ⇒ `Object`

Set offset and count into result set, and optionally set max-matches and cutoff limits.

# File 'lib/sphinx/sphinx/client.rb', line 232

def SetLimits(offset, limit, max = 0, cutoff = 0)
  assert { offset.instance_of? Fixnum }
  assert { limit.instance_of? Fixnum }
  assert { max.instance_of? Fixnum }
  assert { offset >= 0 }
  assert { limit > 0 }
  assert { max >= 0 }

  @offset = offset
  @limit = limit
  @maxmatches = max if max > 0
  @cutoff = cutoff if cutoff > 0
end

#SetMatchMode(mode) ⇒ `Object`

Set matching mode.

# File 'lib/sphinx/sphinx/client.rb', line 255

def SetMatchMode(mode)
  assert { mode == SPH_MATCH_ALL \
        || mode == SPH_MATCH_ANY \
        || mode == SPH_MATCH_PHRASE \
        || mode == SPH_MATCH_BOOLEAN \
        || mode == SPH_MATCH_EXTENDED \
        || mode == SPH_MATCH_FULLSCAN \
        || mode == SPH_MATCH_EXTENDED2 }

  @mode = mode
end

#SetMaxQueryTime(max) ⇒ `Object`

Set maximum query time, in milliseconds, per-index, integer, 0 means “do not limit”

# File 'lib/sphinx/sphinx/client.rb', line 248

def SetMaxQueryTime(max)
  assert { max.instance_of? Fixnum }
  assert { max >= 0 }
  @maxquerytime = max
end

#SetOverride(attrname, attrtype, values) ⇒ `Object`

There can be only one override per attribute. values must be a hash that maps document IDs to attribute values.

# File 'lib/sphinx/sphinx/client.rb', line 477

def SetOverride(attrname, attrtype, values)
   assert { attrname.instance_of? String }
   assert { [SPH_ATTR_INTEGER, SPH_ATTR_TIMESTAMP, SPH_ATTR_BOOL, SPH_ATTR_FLOAT, SPH_ATTR_BIGINT].include?(attrtype) }
   assert { values.instance_of? Hash }

   @overrides << { 'attr' => attrname, 'type' => attrtype, 'values' => values }
end

#SetRankingMode(ranker) ⇒ `Object`

Set ranking mode.

# File 'lib/sphinx/sphinx/client.rb', line 268

def SetRankingMode(ranker)
  assert { ranker == SPH_RANK_PROXIMITY_BM25 \
        || ranker == SPH_RANK_BM25 \
        || ranker == SPH_RANK_NONE \
        || ranker == SPH_RANK_WORDCOUNT \
        || ranker == SPH_RANK_PROXIMITY }

  @ranker = ranker
end

#SetRetries(count, delay = 0) ⇒ `Object`

Set distributed retries count and delay.

# File 'lib/sphinx/sphinx/client.rb', line 465

def SetRetries(count, delay = 0)
  assert { count.instance_of? Fixnum }
  assert { delay.instance_of? Fixnum }
  
  @retrycount = count
  @retrydelay = delay
end

#SetSelect(select) ⇒ `Object`

Set select-list (attributes or expressions), SQL-like syntax.

# File 'lib/sphinx/sphinx/client.rb', line 486

def SetSelect(select)
  assert { select.instance_of? String }
  @select = select
end

#SetServer(host, port) ⇒ `Object`

Set searchd host name (string) and port (integer).

# File 'lib/sphinx/sphinx/client.rb', line 222

def SetServer(host, port)
  assert { host.instance_of? String }
  assert { port.instance_of? Fixnum }

  @host = host
  @port = port
end

#SetSortMode(mode, sortby = '') ⇒ `Object`

Set matches sorting mode.

# File 'lib/sphinx/sphinx/client.rb', line 279

def SetSortMode(mode, sortby = '')
  assert { mode == SPH_SORT_RELEVANCE \
        || mode == SPH_SORT_ATTR_DESC \
        || mode == SPH_SORT_ATTR_ASC \
        || mode == SPH_SORT_TIME_SEGMENTS \
        || mode == SPH_SORT_EXTENDED \
        || mode == SPH_SORT_EXPR }
  assert { sortby.instance_of? String }
  assert { mode == SPH_SORT_RELEVANCE || !sortby.empty? }

  @sort = mode
  @sortby = sortby
end

#SetWeights(weights) ⇒ `Object`

Bind per-field weights by order.

DEPRECATED; use SetFieldWeights() instead.

# File 'lib/sphinx/sphinx/client.rb', line 296

def SetWeights(weights)
  assert { weights.instance_of? Array }
  weights.each do |weight|
    assert { weight.instance_of? Fixnum }
  end

  @weights = weights
end

#UpdateAttributes(index, attrs, values, mva = false) ⇒ `Object`

Batch update given attributes in given rows in given indexes.

index is a name of the index to be updated
attrs is an array of attribute name strings.
values is a hash where key is document id, and value is an array of
mva identifies whether update MVA

new attribute values

Returns number of actually updated documents (0 or more) on success. Returns -1 on failure.

Usage example:

sphinx.UpdateAttributes('test1', ['group_id'], { 1 => [456] })

# File 'lib/sphinx/sphinx/client.rb', line 959

def UpdateAttributes(index, attrs, values, mva = false)
  # verify everything
  assert { index.instance_of? String }
  assert { mva.instance_of?(TrueClass) || mva.instance_of?(FalseClass) }
  
  assert { attrs.instance_of? Array }
  attrs.each do |attr|
    assert { attr.instance_of? String }
  end
  
  assert { values.instance_of? Hash }
  values.each do |id, entry|
    assert { id.instance_of? Fixnum }
    assert { entry.instance_of? Array }
    assert { entry.length == attrs.length }
    entry.each do |v|
      if mva
        assert { v.instance_of? Array }
        v.each { |vv| assert { vv.instance_of? Fixnum } }
      else
        assert { v.instance_of? Fixnum }
      end
    end
  end
  
  # build request
  request = Request.new
  request.put_string index
  
  request.put_int attrs.length
  for attr in attrs
    request.put_string attr
    request.put_int mva ? 1 : 0
  end
  
  request.put_int values.length
  values.each do |id, entry|
    request.put_int64 id
    if mva
      entry.each { |v| request.put_int_array v }
    else
      request.put_int(*entry)
    end
  end
  
  response = PerformRequest(:update, request)
  
  # parse response
  begin
    return response.get_int
  rescue EOFError
    @error = 'incomplete reply'
    raise SphinxResponseError, @error
  end
end

Class: Sphinx::Client

Overview

Direct Known Subclasses

Constant Summary collapse

Instance Method Summary collapse

Constructor Details

#initialize ⇒ Client

Instance Method Details

#AddQuery(query, index = '*', comment = '') ⇒ Object

#BuildExcerpts(docs, index, words, opts = {}) ⇒ Object

#BuildKeywords(query, index, hits) ⇒ Object

#GetLastError ⇒ Object

#GetLastWarning ⇒ Object

#Query(query, index = '*', comment = '') ⇒ Object

#ResetFilters ⇒ Object

#ResetGroupBy ⇒ Object

#ResetOverrides ⇒ Object

#RunQueries ⇒ Object

#SetFieldWeights(weights) ⇒ Object

#SetFilter(attribute, values, exclude = false) ⇒ Object

#SetFilterFloatRange(attribute, min, max, exclude = false) ⇒ Object

#SetFilterRange(attribute, min, max, exclude = false) ⇒ Object

#SetGeoAnchor(attrlat, attrlong, lat, long) ⇒ Object

#SetGroupBy(attribute, func, groupsort = '@group desc') ⇒ Object

#SetGroupDistinct(attribute) ⇒ Object

#SetIDRange(min, max) ⇒ Object

#SetIndexWeights(weights) ⇒ Object

#SetLimits(offset, limit, max = 0, cutoff = 0) ⇒ Object

#SetMatchMode(mode) ⇒ Object

#SetMaxQueryTime(max) ⇒ Object

#SetOverride(attrname, attrtype, values) ⇒ Object

#SetRankingMode(ranker) ⇒ Object

#SetRetries(count, delay = 0) ⇒ Object

#SetSelect(select) ⇒ Object

#SetServer(host, port) ⇒ Object

#SetSortMode(mode, sortby = '') ⇒ Object

#SetWeights(weights) ⇒ Object

#UpdateAttributes(index, attrs, values, mva = false) ⇒ Object