Class: Google::Cloud::Bigquery::QueryJob::Updater

Inherits:
Google::Cloud::Bigquery::QueryJob
Defined in:
lib/google/cloud/bigquery/query_job.rb

Overview

Yielded to a block to accumulate changes for a patch request.


Methods inherited from Google::Cloud::Bigquery::QueryJob

#batch?, #bytes_processed, #cache?, #cache_hit?, #clustering?, #clustering_fields, #data, #ddl?, #ddl_operation_performed, #ddl_target_table, #destination, #dml?, #dryrun?, #encryption, #flatten?, #interactive?, #large_results?, #legacy_sql?, #maximum_billing_tier, #maximum_bytes_billed, #num_dml_affected_rows, #query_plan, #standard_sql?, #statement_type, #time_partitioning?, #time_partitioning_expiration, #time_partitioning_field, #time_partitioning_require_filter?, #time_partitioning_type, #udfs, #wait_until_done!

Methods inherited from Job

#cancel, #configuration, #created_at, #done?, #ended_at, #error, #errors, #failed?, #job_id, #labels, #location, #pending?, #project_id, #reload!, #rerun!, #running?, #started_at, #state, #statistics, #status, #user_email, #wait_until_done!

Instance Method Details

#cache=(value) ⇒ Object

Specifies to look in the query cache for results.

Parameters:

  • value (Boolean)

    Whether to look for the result in the query cache. The query cache is a best-effort cache that will be flushed whenever tables in the query are modified. The default value is true. For more information, see query caching.



# File 'lib/google/cloud/bigquery/query_job.rb', line 696

def cache= value
  @gapi.configuration.query.use_query_cache = value
end

#clustering_fields=(fields) ⇒ Object

Sets one or more fields on which the destination table should be clustered. Must be used together with time-based partitioning: the data in the table will first be partitioned and subsequently clustered.

Only top-level, non-repeated, simple-type fields are supported. When you cluster a table using multiple columns, the order of columns you specify is important. The order of the specified columns determines the sort order of the data.

See Google::Cloud::Bigquery::QueryJob#clustering_fields.

Examples:

require "google/cloud/bigquery"

bigquery = Google::Cloud::Bigquery.new
dataset = bigquery.dataset "my_dataset"
destination_table = dataset.table "my_destination_table",
                                  skip_lookup: true

job = dataset.query_job "SELECT * FROM my_table" do |job|
  job.table = destination_table
  job.time_partitioning_type = "DAY"
  job.time_partitioning_field = "dob"
  job.clustering_fields = ["last_name", "first_name"]
end

job.wait_until_done!
job.done? #=> true

Parameters:

  • fields (Array<String>)

    The clustering fields. Only top-level, non-repeated, simple-type fields are supported.

# File 'lib/google/cloud/bigquery/query_job.rb', line 1145

def clustering_fields= fields
  @gapi.configuration.query.clustering ||= \
    Google::Apis::BigqueryV2::Clustering.new
  @gapi.configuration.query.clustering.fields = fields
end

#create=(value) ⇒ Object

Sets the create disposition for creating the query results table.

Parameters:

  • value (String)

    Specifies whether the job is allowed to create new tables. The default value is needed.

    The following values are supported:

    • needed - Create the table if it does not exist.
    • never - The table must already exist. A 'notFound' error is raised if the table does not exist.



# File 'lib/google/cloud/bigquery/query_job.rb', line 785

def create= value
  @gapi.configuration.query.create_disposition =
    Convert.create_disposition value
end
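
The friendly values above are normalized by the gem's Convert helper into the underlying API enum. A minimal pure-Ruby sketch of that mapping (the hash is a hypothetical stand-in for Convert.create_disposition; CREATE_IF_NEEDED and CREATE_NEVER are the BigQuery REST API's create-disposition values):

```ruby
# Hypothetical mapping from the gem's friendly values to the
# BigQuery API's create-disposition enum values.
CREATE_DISPOSITIONS = {
  "needed" => "CREATE_IF_NEEDED", # create the table if it does not exist
  "never"  => "CREATE_NEVER"      # the table must already exist
}.freeze

def create_disposition value
  # Accept strings or symbols, as the setter does elsewhere in the gem.
  CREATE_DISPOSITIONS.fetch String(value)
end

create_disposition :needed # => "CREATE_IF_NEEDED"
```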

#dataset=(value) ⇒ Object

Sets the default dataset of tables referenced in the query.

Parameters:

  • value (Dataset)

    The default dataset to use for unqualified table names in the query.



# File 'lib/google/cloud/bigquery/query_job.rb', line 733

def dataset= value
  @gapi.configuration.query.default_dataset =
    @service.dataset_ref_from value
end

#dryrun=(value) ⇒ Object Also known as: dry_run=

Sets the dry run flag for the query job.

Parameters:

  • value (Boolean)

    If set, don't actually run this job. A valid query will return a mostly empty response with some processing statistics, while an invalid query will return the same error it would if it weren't a dry run.



# File 'lib/google/cloud/bigquery/query_job.rb', line 818

def dryrun= value
  @gapi.configuration.dry_run = value
end

#encryption=(val) ⇒ Object

Sets the encryption configuration of the destination table.

Examples:

require "google/cloud/bigquery"

bigquery = Google::Cloud::Bigquery.new
dataset = bigquery.dataset "my_dataset"

key_name = "projects/a/locations/b/keyRings/c/cryptoKeys/d"
encrypt_config = bigquery.encryption kms_key: key_name
job = bigquery.query_job "SELECT 1;" do |job|
  job.table = dataset.table "my_table", skip_lookup: true
  job.encryption = encrypt_config
end

Parameters:

  • val (Google::Cloud::Bigquery::EncryptionConfiguration)

    Custom encryption configuration (e.g., Cloud KMS keys).



# File 'lib/google/cloud/bigquery/query_job.rb', line 952

def encryption= val
  @gapi.configuration.query.update!(
    destination_encryption_configuration: val.to_gapi
  )
end

#external=(value) ⇒ Object

Sets definitions for external tables used in the query.

Parameters:

  • value (Hash<String|Symbol, External::DataSource>)

    A Hash that represents the mapping of the external tables to the table names used in the SQL query. The hash keys are the table names, and the hash values are the external table objects.



# File 'lib/google/cloud/bigquery/query_job.rb', line 907

def external= value
  external_table_pairs = value.map do |name, obj|
    [String(name), obj.to_gapi]
  end
  external_table_hash = Hash[external_table_pairs]
  @gapi.configuration.query.table_definitions = external_table_hash
end
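
As the body above shows, the setter accepts string or symbol keys and normalizes each with String(name), while values are converted via #to_gapi. A pure-Ruby sketch of that normalization (StubSource is a hypothetical stand-in for an External::DataSource):

```ruby
# Hypothetical stand-in for an External::DataSource; anything
# responding to #to_gapi would be treated the same way.
StubSource = Struct.new(:uri) do
  def to_gapi
    uri # placeholder for the real conversion to a gapi object
  end
end

sources = { sales: StubSource.new("gs://my-bucket/sales.csv") }

# Mirror the setter's normalization: string keys, gapi values.
table_definitions = sources.to_h { |name, obj| [String(name), obj.to_gapi] }
table_definitions # => { "sales" => "gs://my-bucket/sales.csv" }
```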

#flatten=(value) ⇒ Object

Flatten nested and repeated fields in legacy SQL queries.

Parameters:

  • value (Boolean)

    This option is specific to Legacy SQL. Flattens all nested and repeated fields in the query results. The default value is true. If this is set to false, the large_results parameter must be true.



# File 'lib/google/cloud/bigquery/query_job.rb', line 722

def flatten= value
  @gapi.configuration.query.flatten_results = value
end

#labels=(value) ⇒ Object

Sets the labels to use for the job.

Parameters:

  • value (Hash)

    A hash of user-provided labels associated with the job. You can use these to organize and group your jobs. Label keys and values can be no longer than 63 characters and can contain only lowercase letters, numeric characters, underscores, and dashes. International characters are allowed. Label values are optional. Label keys must start with a letter, and each label in the list must have a different key.



# File 'lib/google/cloud/bigquery/query_job.rb', line 861

def labels= value
  @gapi.configuration.update! labels: value
end
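
The key constraints above can be checked client-side before submitting the job. A hedged sketch (the regex encodes the documented ASCII rules for keys; BigQuery also permits international characters, which this simple check deliberately omits):

```ruby
# Keys: start with a lowercase letter; lowercase letters, digits,
# underscores and dashes only; at most 63 characters total.
LABEL_KEY = /\A[a-z][a-z0-9_-]{0,62}\z/

def valid_label_key?(key)
  key.match?(LABEL_KEY)
end

valid_label_key?("env")    # => true
valid_label_key?("9lives") # => false (must start with a letter)
valid_label_key?("a" * 64) # => false (too long)
```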

#large_results=(value) ⇒ Object

Allow large results for a legacy SQL query.

Parameters:

  • value (Boolean)

    This option is specific to Legacy SQL. If true, allows the query to produce arbitrarily large result tables at a slight cost in performance. Requires the table parameter to be set.



# File 'lib/google/cloud/bigquery/query_job.rb', line 709

def large_results= value
  @gapi.configuration.query.allow_large_results = value
end

#legacy_sql=(value) ⇒ Object

Sets the query syntax to legacy SQL.

Parameters:

  • value (Boolean)

    Specifies whether to use BigQuery's legacy SQL dialect for this query. If set to false, the query will use BigQuery's standard SQL dialect. Optional. The default value is false.



# File 'lib/google/cloud/bigquery/query_job.rb', line 877

def legacy_sql= value
  @gapi.configuration.query.use_legacy_sql = value
end

#location=(value) ⇒ Object

Sets the geographic location where the job should run. Required except for US and EU.

Examples:

require "google/cloud/bigquery"

bigquery = Google::Cloud::Bigquery.new
dataset = bigquery.dataset "my_dataset"

job = bigquery.query_job "SELECT 1;" do |query|
  query.table = dataset.table "my_table", skip_lookup: true
  query.location = "EU"
end

Parameters:

  • value (String)

    A geographic location, such as "US", "EU" or "asia-northeast1". Required except for US and EU.



# File 'lib/google/cloud/bigquery/query_job.rb', line 666

def location= value
  @gapi.job_reference.location = value
  return unless value.nil?

  # Treat assigning value of nil the same as unsetting the value.
  unset = @gapi.job_reference.instance_variables.include? :@location
  @gapi.job_reference.remove_instance_variable :@location if unset
end
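
The nil branch above matters because the generated API object serializes every instance variable that is set: assigning nil is not enough to omit the field from the request, so the setter removes the variable entirely. A pure-Ruby illustration of the pattern (JobRef is a hypothetical stand-in for the generated job reference class):

```ruby
class JobRef
  attr_accessor :location
end

ref = JobRef.new
ref.location = "EU"
ref.location = nil
# @location still exists (set to nil), so it would be serialized.
ref.instance_variables.include?(:@location) # => true

# Unset it the way the setter does.
ref.remove_instance_variable(:@location)
ref.instance_variables.include?(:@location) # => false
```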

#maximum_bytes_billed=(value) ⇒ Object

Sets the maximum bytes billed for the query.

Parameters:

  • value (Integer)

    Limits the bytes billed for this job. Queries that will have bytes billed beyond this limit will fail (without incurring a charge). Optional. If unspecified, this will be set to your project default.



# File 'lib/google/cloud/bigquery/query_job.rb', line 844

def maximum_bytes_billed= value
  @gapi.configuration.query.maximum_bytes_billed = value
end
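
The value is a plain byte count, so a readable way to set a cap is to compute it. For example, to cap a job at five terabytes (decimal definition, 10^12 bytes; whether current BigQuery pricing is quoted in TB or TiB is an assumption worth checking against the pricing docs):

```ruby
ONE_TB = 10**12 # one terabyte in bytes (decimal)

max_bytes = 5 * ONE_TB
max_bytes # => 5000000000000
```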

#params=(params) ⇒ Object

Sets the query parameters. Standard SQL only.

Parameters:

  • params (Array, Hash)

    Used to pass query arguments when the query string contains either positional (?) or named (@myparam) query parameters. If value passed is an array ["foo"], the query must use positional query parameters. If value passed is a hash { myparam: "foo" }, the query must use named query parameters. When set, legacy_sql will automatically be set to false and standard_sql to true.



# File 'lib/google/cloud/bigquery/query_job.rb', line 750

def params= params
  case params
  when Array then
    @gapi.configuration.query.use_legacy_sql = false
    @gapi.configuration.query.parameter_mode = "POSITIONAL"
    @gapi.configuration.query.query_parameters = params.map do |param|
      Convert.to_query_param param
    end
  when Hash then
    @gapi.configuration.query.use_legacy_sql = false
    @gapi.configuration.query.parameter_mode = "NAMED"
    @gapi.configuration.query.query_parameters =
      params.map do |name, param|
        Convert.to_query_param(param).tap do |named_param|
          named_param.name = String name
        end
      end
  else
    raise "Query parameters must be an Array or a Hash."
  end
end
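
The two parameter styles pair with two query syntaxes, and as the body above shows, the setter picks the parameter mode from the value's class. A minimal sketch of the shapes involved (the SQL strings are illustrative):

```ruby
# Positional: the SQL uses ?, and params is an Array in the same order.
positional_sql    = "SELECT word FROM corpus WHERE word = ? AND count > ?"
positional_params = ["hello", 100]

# Named: the SQL uses @name, and params is a Hash keyed by those names.
named_sql    = "SELECT word FROM corpus WHERE word = @word AND count > @min"
named_params = { word: "hello", min: 100 }

# Mirror the setter's class-based dispatch.
mode = lambda do |params|
  case params
  when Array then "POSITIONAL"
  when Hash  then "NAMED"
  else raise ArgumentError, "Query parameters must be an Array or a Hash."
  end
end

mode.call(positional_params) # => "POSITIONAL"
mode.call(named_params)      # => "NAMED"
```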

#priority=(value) ⇒ Object

Sets the priority of the query.

Parameters:

  • value (String)

    Specifies a priority for the query. Possible values include INTERACTIVE and BATCH.



# File 'lib/google/cloud/bigquery/query_job.rb', line 682

def priority= value
  @gapi.configuration.query.priority = priority_value value
end

#standard_sql=(value) ⇒ Object

Sets the query syntax to standard SQL.

Parameters:

  • value (Boolean)

    Specifies whether to use BigQuery's standard SQL dialect for this query. If set to true, the query will use standard SQL rather than the legacy SQL dialect. Optional. The default value is true.



# File 'lib/google/cloud/bigquery/query_job.rb', line 893

def standard_sql= value
  @gapi.configuration.query.use_legacy_sql = !value
end

#table=(value) ⇒ Object

Sets the destination for the query results table.

Parameters:

  • value (Table)

    The destination table where the query results should be stored. If not present, a new table will be created according to the create disposition to store the results.



# File 'lib/google/cloud/bigquery/query_job.rb', line 831

def table= value
  @gapi.configuration.query.destination_table = table_ref_from value
end

#time_partitioning_expiration=(expiration) ⇒ Object

Sets the partition expiration for the destination table. See Partitioned Tables.

The destination table must also be partitioned. See #time_partitioning_type=.

Examples:

require "google/cloud/bigquery"

bigquery = Google::Cloud::Bigquery.new
dataset = bigquery.dataset "my_dataset"
destination_table = dataset.table "my_destination_table",
                                  skip_lookup: true

job = dataset.query_job "SELECT * FROM UNNEST(" \
                        "GENERATE_TIMESTAMP_ARRAY(" \
                        "'2018-10-01 00:00:00', " \
                        "'2018-10-10 00:00:00', " \
                        "INTERVAL 1 DAY)) AS dob" do |job|
  job.table = destination_table
  job.time_partitioning_type = "DAY"
  job.time_partitioning_expiration = 86_400
end

job.wait_until_done!
job.done? #=> true

Parameters:

  • expiration (Integer)

    An expiration time, in seconds, for data in partitions.



# File 'lib/google/cloud/bigquery/query_job.rb', line 1078

def time_partitioning_expiration= expiration
  @gapi.configuration.query.time_partitioning ||= \
    Google::Apis::BigqueryV2::TimePartitioning.new
  @gapi.configuration.query.time_partitioning.update! \
    expiration_ms: expiration * 1000
end
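
As the body above shows, the expiration is given to this setter in seconds but stored in the API field expiration_ms, so the value is multiplied by 1000:

```ruby
expiration_seconds = 86_400 # one day, as in the example above
expiration_ms = expiration_seconds * 1000
expiration_ms # => 86400000
```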

#time_partitioning_field=(field) ⇒ Object

Sets the field on which to partition the destination table. If not set, the destination table is partitioned by pseudo column _PARTITIONTIME; if set, the table is partitioned by this field. See Partitioned Tables.

The destination table must also be partitioned. See #time_partitioning_type=.

You can only set the partitioning field while creating a table. BigQuery does not allow you to change partitioning on an existing table.

Examples:

require "google/cloud/bigquery"

bigquery = Google::Cloud::Bigquery.new
dataset = bigquery.dataset "my_dataset"
destination_table = dataset.table "my_destination_table",
                                  skip_lookup: true

job = dataset.query_job "SELECT * FROM UNNEST(" \
                        "GENERATE_TIMESTAMP_ARRAY(" \
                        "'2018-10-01 00:00:00', " \
                        "'2018-10-10 00:00:00', " \
                        "INTERVAL 1 DAY)) AS dob" do |job|
  job.table = destination_table
  job.time_partitioning_type  = "DAY"
  job.time_partitioning_field = "dob"
end

job.wait_until_done!
job.done? #=> true

Parameters:

  • field (String)

    The partition field. The field must be a top-level TIMESTAMP or DATE field. Its mode must be NULLABLE or REQUIRED.



# File 'lib/google/cloud/bigquery/query_job.rb', line 1038

def time_partitioning_field= field
  @gapi.configuration.query.time_partitioning ||= \
    Google::Apis::BigqueryV2::TimePartitioning.new
  @gapi.configuration.query.time_partitioning.update! field: field
end

#time_partitioning_require_filter=(val) ⇒ Object

If set to true, queries over the destination table must specify a partition filter that can be used for partition elimination. See Partitioned Tables.

Parameters:

  • val (Boolean)

    Indicates if queries over the destination table will require a partition filter. The default value is false.



# File 'lib/google/cloud/bigquery/query_job.rb', line 1096

def time_partitioning_require_filter= val
  @gapi.configuration.query.time_partitioning ||= \
    Google::Apis::BigqueryV2::TimePartitioning.new
  @gapi.configuration.query.time_partitioning.update! \
    require_partition_filter: val
end

#time_partitioning_type=(type) ⇒ Object

Sets the partitioning for the destination table. See Partitioned Tables.

You can only set the partitioning field while creating a table. BigQuery does not allow you to change partitioning on an existing table.

Examples:

require "google/cloud/bigquery"

bigquery = Google::Cloud::Bigquery.new
dataset = bigquery.dataset "my_dataset"
destination_table = dataset.table "my_destination_table",
                                  skip_lookup: true

job = dataset.query_job "SELECT * FROM UNNEST(" \
                        "GENERATE_TIMESTAMP_ARRAY(" \
                        "'2018-10-01 00:00:00', " \
                        "'2018-10-10 00:00:00', " \
                        "INTERVAL 1 DAY)) AS dob" do |job|
  job.table = destination_table
  job.time_partitioning_type = "DAY"
end

job.wait_until_done!
job.done? #=> true

Parameters:

  • type (String)

    The partition type. Currently the only supported value is "DAY".



# File 'lib/google/cloud/bigquery/query_job.rb', line 991

def time_partitioning_type= type
  @gapi.configuration.query.time_partitioning ||= \
    Google::Apis::BigqueryV2::TimePartitioning.new
  @gapi.configuration.query.time_partitioning.update! type: type
end

#udfs=(value) ⇒ Object

Sets user defined functions for the query.

Parameters:

  • value (Array<String>, String)

    User-defined function resources used in the query. May be either a code resource to load from a Google Cloud Storage URI (gs://bucket/path), or an inline resource that contains code for a user-defined function (UDF). Providing an inline code resource is equivalent to providing a URI for a file containing the same code. See User-Defined Functions.



# File 'lib/google/cloud/bigquery/query_job.rb', line 927

def udfs= value
  @gapi.configuration.query.user_defined_function_resources =
    udfs_gapi_from value
end
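
A UDF entry is treated as a Cloud Storage code resource when it looks like a gs:// URI, and as inline code otherwise. A hedged sketch of that split (the gs:// prefix check mirrors documented behavior; the URI and code snippet are illustrative):

```ruby
udfs = [
  "gs://my-bucket/lib/my_udf.js", # code resource loaded from GCS
  "return x * 2;"                 # inline code
]

uris, inline = udfs.partition { |udf| udf.start_with?("gs://") }
uris   # => ["gs://my-bucket/lib/my_udf.js"]
inline # => ["return x * 2;"]
```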

#write=(value) ⇒ Object

Sets the write disposition for when the query results table exists.

Parameters:

  • value (String)

    Specifies the action that occurs if the destination table already exists. The default value is empty.

    The following values are supported:

    • truncate - BigQuery overwrites the table data.
    • append - BigQuery appends the data to the table.
    • empty - A 'duplicate' error is returned in the job result if the table exists and contains data.


# File 'lib/google/cloud/bigquery/query_job.rb', line 804

def write= value
  @gapi.configuration.query.write_disposition =
    Convert.write_disposition value
end
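
As with the create disposition, the friendly values are normalized by the gem's Convert helper into the API enum. A minimal sketch of that mapping (the hash is a hypothetical stand-in for Convert.write_disposition; WRITE_TRUNCATE, WRITE_APPEND and WRITE_EMPTY are the BigQuery REST API's write-disposition values):

```ruby
# Hypothetical mapping from the gem's friendly values to the
# BigQuery API's write-disposition enum values.
WRITE_DISPOSITIONS = {
  "truncate" => "WRITE_TRUNCATE", # overwrite existing table data
  "append"   => "WRITE_APPEND",   # append to existing table data
  "empty"    => "WRITE_EMPTY"     # error if the table contains data
}.freeze

def write_disposition value
  WRITE_DISPOSITIONS.fetch String(value)
end

write_disposition :truncate # => "WRITE_TRUNCATE"
```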