Class: ElasticGraph::Elasticsearch::Client

Inherits: Object

Defined in: lib/elastic_graph/elasticsearch/client.rb

Instance Attribute Summary

#cluster_name ⇒ Object readonly

Returns the value of attribute cluster_name.

Instance Method Summary

#bulk(body:, refresh: false) ⇒ Object

#create_index(index:, body:) ⇒ Object

#delete_all_documents(index: "_all") ⇒ Object
  Synchronously deletes all documents in the cluster.

#delete_index_template(index_template_name) ⇒ Object

#delete_indices(*index_names) ⇒ Object

#delete_script(id:) ⇒ Object

#get_cluster_health ⇒ Object
  Cluster APIs.

#get_flat_cluster_settings ⇒ Object

#get_index(index_name) ⇒ Object
  Index APIs.

#get_index_template(index_template_name) ⇒ Object
  Index Template APIs.

#get_node_os_stats ⇒ Object

#get_script(id:) ⇒ Object
  Gets the script with the given ID.

#initialize(cluster_name, url:, faraday_adapter: nil, retry_on_failure: 3, logger: nil) ⇒ Client (constructor)
  A new instance of Client.

#list_indices_matching(index_expression) ⇒ Object

#msearch(body:, headers: nil) ⇒ Object
  Document APIs.

#put_index_mapping(index:, body:) ⇒ Object

#put_index_settings(index:, body:) ⇒ Object

#put_index_template(name:, body:) ⇒ Object

#put_persistent_cluster_settings(settings) ⇒ Object
  We only support persistent settings here because Elasticsearch docs recommend against using transient settings: www.elastic.co/guide/en/elasticsearch/reference/8.13/cluster-update-settings.html.

#put_script(id:, body:, context:) ⇒ Object

Constructor Details

#initialize(cluster_name, url:, faraday_adapter: nil, retry_on_failure: 3, logger: nil) ⇒ `Client`

Returns a new instance of Client.

# File 'lib/elastic_graph/elasticsearch/client.rb', line 24

def initialize(cluster_name, url:, faraday_adapter: nil, retry_on_failure: 3, logger: nil)
  @cluster_name = cluster_name

  @raw_client = ::Elasticsearch::Client.new(
    adapter: faraday_adapter,
    url: url,
    retry_on_failure: retry_on_failure,
    # We use `logger` for both the tracer and logger to log everything we can. While the trace and log output do overlap, one is
    # not a strict superset of the other (for example, warnings go to `logger`, while full request bodies go to `tracer`).
    logger: logger,
    tracer: logger
  ) do |faraday|
    faraday.use Support::FaradayMiddleware::MSearchUsingGetInsteadOfPost
    faraday.use Support::FaradayMiddleware::SupportTimeouts

    # Note: this overrides the default retry exceptions, which includes `Faraday::TimeoutError`.
    # That's important because we do NOT want a retry on timeout -- a timeout indicates a slow,
    # expensive query, and is transformed to a `RequestExceededDeadlineError` by `SupportTimeouts`,
    # anyway.
    #
    # In addition, it's worth noting that the retry middleware ONLY retries known idempotent HTTP
    # methods (e.g. get/put/delete/head/options). POST requests will not be retried. We could
    # configure it to make it retry POSTs but we'd need to do an analysis of all ElasticGraph requests to
    # make sure all POST requests are truly idempotent, and at least for now, it's sufficient to skip
    # any POST requests we make.
    faraday.request :retry,
      exceptions: [::Faraday::ConnectionFailed, ::Faraday::RetriableResponse],
      max: retry_on_failure,
      retry_statuses: [500, 502, 503] # Internal Server Error, Bad Gateway, Service Unavailable

    yield faraday if block_given?
  end

  # Here we call `app` on each Faraday connection as a way to force it to resolve
  # all configured middlewares and adapters. If it cannot load a required dependency
  # (e.g. `httpx`), it'll fail fast with a clear error.
  #
  # Without this, we would instead get an error when the client was used to make
  # a request for the first time, which isn't as ideal.
  @raw_client.transport.connections.each { |c| c.connection.app }
end
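
The retry rules described in the constructor comments can be modeled in plain Ruby. The following is only a simplified sketch of the status-based decision, not Faraday's retry middleware: `IDEMPOTENT_METHODS`, `RETRY_STATUSES`, and `retriable?` are hypothetical names introduced for illustration, and the sketch ignores the `max` attempt limit and the connection-failure case.

```ruby
require "set"

# Hypothetical model of the retry decision described above; not Faraday's code.
# Only these HTTP methods are treated as idempotent, so POST is never retried.
IDEMPOTENT_METHODS = Set[:get, :put, :delete, :head, :options].freeze

# Internal Server Error, Bad Gateway, Service Unavailable.
RETRY_STATUSES = Set[500, 502, 503].freeze

# Returns true when a response with the given status, for a request made with
# the given HTTP method, would be eligible for a retry under these rules.
def retriable?(http_method, status)
  IDEMPOTENT_METHODS.include?(http_method) && RETRY_STATUSES.include?(status)
end

retriable?(:get, 503)  # eligible: idempotent method, retryable status
retriable?(:post, 503) # not eligible: POST is skipped
retriable?(:get, 404)  # not eligible: 404 is not a retry status
```

Timeouts are deliberately absent from this model: per the comments above, a timeout is surfaced as a `RequestExceededDeadlineError` rather than retried.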

Instance Attribute Details

#cluster_name ⇒ `Object` (readonly)

Returns the value of attribute cluster_name.



# File 'lib/elastic_graph/elasticsearch/client.rb', line 22

def cluster_name
  @cluster_name
end

Instance Method Details

#bulk(body:, refresh: false) ⇒ `Object`



# File 'lib/elastic_graph/elasticsearch/client.rb', line 181

def bulk(body:, refresh: false)
  transform_errors { |c| c.bulk(body: body, filter_path: DATASTORE_BULK_FILTER_PATH, refresh: refresh).body }
end

#create_index(index:, body:) ⇒ `Object`



# File 'lib/elastic_graph/elasticsearch/client.rb', line 149

def create_index(index:, body:)
  transform_errors { |c| c.indices.create(index: index, body: body).body }
end

#delete_all_documents(index: "_all") ⇒ `Object`

Synchronously deletes all documents in the cluster. Intended for tests to give ourselves a clean slate. Supports an `index` argument so the caller can limit the deletion to a specific “scope” (e.g. a set of indices with a common prefix).

Overrides `scroll` to `10s` to avoid getting a “Trying to create too many scroll contexts” error, as discussed here: discuss.elastic.co/t/too-many-scroll-contexts-with-update-by-query-and-or-delete-by-query/282325/1

# File 'lib/elastic_graph/elasticsearch/client.rb', line 190

def delete_all_documents(index: "_all")
  transform_errors do |client|
    client.delete_by_query(index: index, body: {query: {match_all: _ = {}}}, refresh: true, scroll: "10s").body
  end
end

#delete_index_template(index_template_name) ⇒ `Object`



# File 'lib/elastic_graph/elasticsearch/client.rb', line 124

def delete_index_template(index_template_name)
  transform_errors { |c| c.indices.delete_index_template(name: [index_template_name], ignore: [404]).body }
end

#delete_indices(*index_names) ⇒ `Object`

# File 'lib/elastic_graph/elasticsearch/client.rb', line 161

def delete_indices(*index_names)
  # `allow_no_indices: true` is needed when we attempt to delete a non-existing index to avoid errors. For rollover indices,
  # when we delete the actual indices, we will always perform a wildcard deletion, and `allow_no_indices: true` is needed.
  #
  # Note that the Elasticsearch API documentation[^1] says that `allow_no_indices` defaults to `true` but an Elasticsearch Ruby
  # client code comment[^2] says it defaults to `false`. Regardless, we don't want to rely on default behavior that could change.
  #
  # [^1]: https://www.elastic.co/guide/en/elasticsearch/reference/8.12/indices-delete-index.html#delete-index-api-query-params
  # [^2]: https://github.com/elastic/elasticsearch-ruby/blob/8.12/elasticsearch-api/lib/elasticsearch/api/actions/indices/delete.rb#L31
  transform_errors do |client|
    client.indices.delete(index: index_names, ignore_unavailable: true, allow_no_indices: true).body
  end
end

#delete_script(id:) ⇒ `Object`

# File 'lib/elastic_graph/elasticsearch/client.rb', line 102

def delete_script(id:)
  transform_errors { |c| c.delete_script(id: id).body }
rescue ::Elastic::Transport::Transport::Errors::NotFound
  # it's ok if it's already not there.
end

#get_cluster_health ⇒ `Object`

Cluster APIs



# File 'lib/elastic_graph/elasticsearch/client.rb', line 68

def get_cluster_health
  transform_errors { |c| c.cluster.health.body }
end

#get_flat_cluster_settings ⇒ `Object`



# File 'lib/elastic_graph/elasticsearch/client.rb', line 76

def get_flat_cluster_settings
  transform_errors { |c| c.cluster.get_settings(flat_settings: true).body }
end

#get_index(index_name) ⇒ `Object`

Index APIs

# File 'lib/elastic_graph/elasticsearch/client.rb', line 130

def get_index(index_name)
  transform_errors do |client|
    client.indices.get(
      index: index_name,
      ignore_unavailable: true,
      flat_settings: true
    )[index_name] || {}
  end
end

#get_index_template(index_template_name) ⇒ `Object`

Index Template APIs

# File 'lib/elastic_graph/elasticsearch/client.rb', line 110

def get_index_template(index_template_name)
  transform_errors do |client|
    client.indices.get_index_template(name: index_template_name, flat_settings: true).fetch("index_templates").to_h do |entry|
      [entry.fetch("name"), entry.fetch("index_template")]
    end.dig(index_template_name) || {}
  end
rescue ::Elastic::Transport::Transport::Errors::NotFound
  {}
end
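
To make the response handling in `#get_index_template` easier to follow, here is a standalone sketch of the same `to_h`/`dig` transformation applied to an in-memory hash. The sample response, the template names, and the `extract_template` helper are assumptions made up for illustration; only the shape (`"index_templates"` entries with `"name"` and `"index_template"` keys) is taken from the method above.

```ruby
# Hypothetical sample response; real responses contain more fields.
response = {
  "index_templates" => [
    {"name" => "widgets", "index_template" => {"index_patterns" => ["widgets_rollover__*"]}},
    {"name" => "gadgets", "index_template" => {"index_patterns" => ["gadgets_rollover__*"]}}
  ]
}

# Hypothetical helper mirroring the block body of #get_index_template:
# index the entries by name, then pick out the requested template,
# falling back to an empty hash when the name is absent.
def extract_template(response, index_template_name)
  response.fetch("index_templates").to_h do |entry|
    [entry.fetch("name"), entry.fetch("index_template")]
  end.dig(index_template_name) || {}
end

widgets = extract_template(response, "widgets")
missing = extract_template(response, "does_not_exist")
```

Here `widgets` holds the template definition for `"widgets"`, while `missing` is `{}`, matching the empty hash the method returns when the datastore responds with a not-found error.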

#get_node_os_stats ⇒ `Object`



# File 'lib/elastic_graph/elasticsearch/client.rb', line 72

def get_node_os_stats
  transform_errors { |c| c.nodes.stats(metric: "os").body }
end

#get_script(id:) ⇒ `Object`

Gets the script with the given ID. Returns `nil` if the script does not exist.

# File 'lib/elastic_graph/elasticsearch/client.rb', line 92

def get_script(id:)
  transform_errors { |c| c.get_script(id: id).body }
rescue ::Elastic::Transport::Transport::Errors::NotFound
  nil
end
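
The "rescue not-found and return `nil`" pattern used by `#get_script` can be tried without a running cluster. In this sketch everything is a stand-in: `NotFound` plays the role of `::Elastic::Transport::Transport::Errors::NotFound`, and `SCRIPTS` and `fetch_script!` are a hypothetical in-memory replacement for the datastore call.

```ruby
# Hypothetical stand-in for ::Elastic::Transport::Transport::Errors::NotFound.
class NotFound < StandardError; end

# Hypothetical in-memory script store standing in for the datastore.
SCRIPTS = {
  "my_script" => {"script" => {"lang" => "painless", "source" => "return 1;"}}
}.freeze

# Raises NotFound for unknown IDs, like a 404 from the datastore would.
def fetch_script!(id)
  SCRIPTS.fetch(id) { raise NotFound, "script #{id} not found" }
end

# Same shape as #get_script: a missing script becomes nil, not an exception.
def get_script(id:)
  fetch_script!(id)
rescue NotFound
  nil
end

found = get_script(id: "my_script")
absent = get_script(id: "unknown_script")
```

Callers can then branch on `nil` (for example `if (script = get_script(id: "my_script"))`) instead of rescuing a transport error themselves.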

#list_indices_matching(index_expression) ⇒ `Object`

# File 'lib/elastic_graph/elasticsearch/client.rb', line 140

def list_indices_matching(index_expression)
  transform_errors do |client|
    client
      .cat
      .indices(index: index_expression, format: "json", h: ["index"])
      .map { |index_hash| index_hash.fetch("index") }
  end
end
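
As a rough illustration of the last step in `#list_indices_matching`, the sketch below applies the same `map`/`fetch` to a hand-written array shaped like a JSON `cat indices` response restricted to the `index` column. The index names are invented for the example.

```ruby
# Hypothetical rows, shaped like a JSON `cat indices` response with h: ["index"].
rows = [
  {"index" => "widgets_rollover__2024-01"},
  {"index" => "widgets_rollover__2024-02"}
]

# Same transformation as in #list_indices_matching: one index name per row.
# `fetch` raises KeyError if a row lacks the "index" key instead of yielding nil.
index_names = rows.map { |index_hash| index_hash.fetch("index") }
```

Using `fetch` rather than `[]` means an unexpected response shape fails loudly instead of producing an array containing `nil`.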

#msearch(body:, headers: nil) ⇒ `Object`

Document APIs



# File 'lib/elastic_graph/elasticsearch/client.rb', line 177

def msearch(body:, headers: nil)
  transform_errors { |c| c.msearch(body: body, headers: headers).body }
end
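
The documentation above does not describe what `body:` should contain. The Elasticsearch `_msearch` API takes newline-delimited JSON in which header lines and query lines alternate, so one plausible shape is an array of alternating hashes. The sketch below assumes that shape and shows how such an array maps to the wire format; the index name, the queries, and the `to_ndjson` helper are hypothetical.

```ruby
require "json"

# Hypothetical msearch body: each search is a header hash followed by a query hash.
body = [
  {index: "widgets"},
  {query: {match_all: {}}, size: 10},
  {index: "widgets"},
  {query: {term: {color: "red"}}, size: 5}
]

# Hypothetical helper: serializes each hash on its own line, ending with a
# trailing newline as newline-delimited JSON requires.
def to_ndjson(lines)
  lines.map { |line| JSON.generate(line) }.join("\n") + "\n"
end

ndjson = to_ndjson(body)
```

Two searches therefore produce four lines, and the order of the responses corresponds to the order of the header/query pairs.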

#put_index_mapping(index:, body:) ⇒ `Object`



# File 'lib/elastic_graph/elasticsearch/client.rb', line 153

def put_index_mapping(index:, body:)
  transform_errors { |c| c.indices.put_mapping(index: index, body: body).body }
end

#put_index_settings(index:, body:) ⇒ `Object`



# File 'lib/elastic_graph/elasticsearch/client.rb', line 157

def put_index_settings(index:, body:)
  transform_errors { |c| c.indices.put_settings(index: index, body: body).body }
end

#put_index_template(name:, body:) ⇒ `Object`



# File 'lib/elastic_graph/elasticsearch/client.rb', line 120

def put_index_template(name:, body:)
  transform_errors { |c| c.indices.put_index_template(name: name, body: body).body }
end

#put_persistent_cluster_settings(settings) ⇒ `Object`

We only support persistent settings here because Elasticsearch docs recommend against using transient settings: www.elastic.co/guide/en/elasticsearch/reference/8.13/cluster-update-settings.html

> We no longer recommend using transient cluster settings. Use persistent cluster settings instead. If a cluster becomes unstable, transient settings can clear unexpectedly, resulting in a potentially undesired cluster configuration.



# File 'lib/elastic_graph/elasticsearch/client.rb', line 85

def put_persistent_cluster_settings(settings)
  transform_errors { |c| c.cluster.put_settings(body: {persistent: settings}).body }
end
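
To show what the method sends, this small sketch builds the same request body from a settings hash. `persistent_settings_body` is a hypothetical helper named for illustration, and the setting shown is just an example value.

```ruby
# Hypothetical helper mirroring the body built by #put_persistent_cluster_settings:
# the caller's settings are always nested under :persistent, never :transient.
def persistent_settings_body(settings)
  {persistent: settings}
end

body = persistent_settings_body({"indices.recovery.max_bytes_per_sec" => "50mb"})
```

Because the wrapper key is fixed inside the method, callers pass only the flat settings hash and cannot request a transient update through this API.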

#put_script(id:, body:, context:) ⇒ `Object`



# File 'lib/elastic_graph/elasticsearch/client.rb', line 98

def put_script(id:, body:, context:)
  transform_errors { |c| c.put_script(id: id, body: body, context: context).body }
end
