Class: Gcloud::Bigquery::Table
Inherits: Object
Defined in:
  lib/gcloud/bigquery/table.rb,
  lib/gcloud/bigquery/table/list.rb,
  lib/gcloud/bigquery/table/schema.rb
Overview
# Table
A named resource representing a BigQuery table that holds zero or more records. Every table is defined by a schema that may contain nested and repeated fields.
Defined Under Namespace
Classes: List, Schema
Instance Attribute Summary

- #connection ⇒ Object
- #gapi ⇒ Object

Attributes
- #api_url ⇒ Object
  A URL that can be used to access the table using the REST API.
- #created_at ⇒ Object
  The time when this table was created.
- #dataset_id ⇒ Object
  The ID of the `Dataset` containing this table.
- #description ⇒ Object
  The description of the table.
- #description=(new_description) ⇒ Object
  Updates the description of the table.
- #etag ⇒ Object
  A string hash of the table.
- #expires_at ⇒ Object
  The time when this table expires.
- #fields ⇒ Object
  The fields of the table.
- #headers ⇒ Object
  The names of the columns in the table.
- #id ⇒ Object
  The combined Project ID, Dataset ID, and Table ID for this table, in the format specified by the [Query Reference](https://cloud.google.com/bigquery/query-reference#from): `project_name:datasetId.tableId`.
- #location ⇒ Object
  The geographic location where the table should reside.
- #modified_at ⇒ Object
  The time when this table was last modified.
- #name ⇒ Object
  The name of the table.
- #name=(new_name) ⇒ Object
  Updates the name of the table.
- #project_id ⇒ Object
  The ID of the `Project` containing this table.
- #query_id ⇒ Object
  The value returned by #id, wrapped in square brackets if the Project ID contains dashes, as specified by the [Query Reference](https://cloud.google.com/bigquery/query-reference#from).
- #schema(replace: false) {|schema| ... } ⇒ Object
  Returns the table's schema as a hash containing the keys and values returned by the Google Cloud BigQuery [REST API](https://cloud.google.com/bigquery/docs/reference/v2/tables#resource).
- #schema=(new_schema) ⇒ Object
  Updates the schema of the table.
- #table? ⇒ Boolean
  Checks if the table's type is "TABLE".
- #table_id ⇒ Object
  A unique ID for this table.
- #table_ref ⇒ Object
  The gapi fragment containing the Project ID, Dataset ID, and Table ID as a camel-cased hash.
- #view? ⇒ Boolean
  Checks if the table's type is "VIEW".
Data

- #bytes_count ⇒ Object
  The number of bytes in the table.
- #copy(destination_table, create: nil, write: nil, dryrun: nil) ⇒ Gcloud::Bigquery::CopyJob
  Copies the data from the table to another table.
- #data(token: nil, max: nil, start: nil) ⇒ Gcloud::Bigquery::Data
  Retrieves data from the table.
- #extract(extract_url, format: nil, compression: nil, delimiter: nil, header: nil, dryrun: nil) ⇒ Gcloud::Bigquery::ExtractJob
  Extracts the data from the table to a Google Cloud Storage file.
- #insert(rows, skip_invalid: nil, ignore_unknown: nil) ⇒ Gcloud::Bigquery::InsertResponse
  Inserts data into the table for near-immediate querying, without the need to complete a #load operation before the data can appear in query results.
- #link(source_url, create: nil, write: nil, dryrun: nil) ⇒ Gcloud::Bigquery::Job
  Links the table to a source table identified by a URI.
- #load(file, format: nil, create: nil, write: nil, projection_fields: nil, jagged_rows: nil, quoted_newlines: nil, encoding: nil, delimiter: nil, ignore_unknown: nil, max_bad_records: nil, quote: nil, skip_leading: nil, dryrun: nil) ⇒ Gcloud::Bigquery::LoadJob
  Loads data into the table.
- #rows_count ⇒ Object
  The number of rows in the table.
Lifecycle

- .from_gapi(gapi, conn) ⇒ Object
- #delete ⇒ Boolean
  Permanently deletes the table.
- #reload! ⇒ Object (also: #refresh!)
  Reloads the table with current data from the BigQuery service.
Instance Method Summary

- #initialize ⇒ Table (constructor)
  A new instance of Table.
Constructor Details
#initialize ⇒ Table
Returns a new instance of Table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 77

def initialize
  @connection = nil
  @gapi = {}
end
```
Instance Attribute Details
#connection ⇒ Object
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 69

def connection
  @connection
end
```
#gapi ⇒ Object
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 73

def gapi
  @gapi
end
```
Class Method Details
.from_gapi(gapi, conn) ⇒ Object
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 857

def self.from_gapi gapi, conn
  klass = class_for gapi
  klass.new.tap do |f|
    f.gapi = gapi
    f.connection = conn
  end
end
```
Instance Method Details
#api_url ⇒ Object
A URL that can be used to access the table using the REST API.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 189

def api_url
  ensure_full_data!
  @gapi["selfLink"]
end
```
#bytes_count ⇒ Object
The number of bytes in the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 218

def bytes_count
  ensure_full_data!
  @gapi["numBytes"]
end
```
#copy(destination_table, create: nil, write: nil, dryrun: nil) ⇒ Gcloud::Bigquery::CopyJob
Copies the data from the table to another table. The destination table argument can also be a string identifier as specified by the [Query Reference](https://cloud.google.com/bigquery/query-reference#from): `project_name:datasetId.tableId`. This is useful for referencing tables in other projects and datasets.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 492

def copy destination_table, create: nil, write: nil, dryrun: nil
  ensure_connection!
  options = { create: create, write: write, dryrun: dryrun }
  resp = connection.copy_table table_ref,
                               get_table_ref(destination_table),
                               options
  if resp.success?
    Job.from_gapi resp.data, connection
  else
    fail ApiError.from_response(resp)
  end
end
```
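Example (a sketch; the project, dataset, and table names are hypothetical):

```ruby
require "gcloud"

gcloud   = Gcloud.new
bigquery = gcloud.bigquery
dataset  = bigquery.dataset "my_dataset"          # hypothetical dataset
table    = dataset.table "my_table"               # hypothetical source table
destination = dataset.table "my_destination_table"

# Copy the table's data into another table in the same dataset...
copy_job = table.copy destination

# ...or reference the destination as a string identifier.
copy_job = table.copy "other-project:other_dataset.other_table"
```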
#created_at ⇒ Object
The time when this table was created.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 238

def created_at
  ensure_full_data!
  Time.at(@gapi["creationTime"] / 1000.0)
end
```
#data(token: nil, max: nil, start: nil) ⇒ Gcloud::Bigquery::Data
Retrieves data from the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 428

def data token: nil, max: nil, start: nil
  ensure_connection!
  options = { token: token, max: max, start: start }
  resp = connection.list_tabledata dataset_id, table_id, options
  if resp.success?
    Data.from_response resp, self
  else
    fail ApiError.from_response(resp)
  end
end
```
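Example (a sketch; names and the column used below are hypothetical, assuming each returned row behaves as a hash keyed by column name):

```ruby
require "gcloud"

gcloud   = Gcloud.new
bigquery = gcloud.bigquery
dataset  = bigquery.dataset "my_dataset"   # hypothetical names
table    = dataset.table "my_table"

# Fetch up to 100 rows from the table.
data = table.data max: 100
data.each do |row|
  puts row["first_name"]   # hypothetical column
end
```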
#dataset_id ⇒ Object
The ID of the `Dataset` containing this table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 98

def dataset_id
  @gapi["tableReference"]["datasetId"]
end
```
#delete ⇒ Boolean
Permanently deletes the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 829

def delete
  ensure_connection!
  resp = connection.delete_table dataset_id, table_id
  if resp.success?
    true
  else
    fail ApiError.from_response(resp)
  end
end
```
#description ⇒ Object
The description of the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 199

def description
  ensure_full_data!
  @gapi["description"]
end
```
#description=(new_description) ⇒ Object
Updates the description of the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 209

def description= new_description
  patch_gapi! description: new_description
end
```
#etag ⇒ Object
A string hash of the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 179

def etag
  ensure_full_data!
  @gapi["etag"]
end
```
#expires_at ⇒ Object
The time when this table expires. If not present, the table will persist indefinitely. Expired tables will be deleted and their storage reclaimed.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 250

def expires_at
  ensure_full_data!
  return nil if @gapi["expirationTime"].nil?
  Time.at(@gapi["expirationTime"] / 1000.0)
end
```
#extract(extract_url, format: nil, compression: nil, delimiter: nil, header: nil, dryrun: nil) ⇒ Gcloud::Bigquery::ExtractJob
Extracts the data from the table to a Google Cloud Storage file.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 584

def extract extract_url, format: nil, compression: nil, delimiter: nil,
            header: nil, dryrun: nil
  ensure_connection!
  options = { format: format, compression: compression,
              delimiter: delimiter, header: header, dryrun: dryrun }
  resp = connection.extract_table table_ref, extract_url, options
  if resp.success?
    Job.from_gapi resp.data, connection
  else
    fail ApiError.from_response(resp)
  end
end
```
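Example (a sketch; the dataset, table, and Storage bucket names are hypothetical):

```ruby
require "gcloud"

gcloud   = Gcloud.new
bigquery = gcloud.bigquery
dataset  = bigquery.dataset "my_dataset"   # hypothetical names
table    = dataset.table "my_table"

# Extract the table to a CSV file in a Google Cloud Storage bucket.
extract_job = table.extract "gs://my-bucket/my_table.csv",
                            format: "csv", header: true
```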
#fields ⇒ Object
The fields of the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 385

def fields
  f = schema["fields"]
  f = f.to_hash if f.respond_to? :to_hash
  f = [] if f.nil?
  f
end
```
#headers ⇒ Object
The names of the columns in the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 397

def headers
  fields.map { |f| f["name"] }
end
```
#id ⇒ Object
The combined Project ID, Dataset ID, and Table ID for this table, in the format specified by the [Query Reference](https://cloud.google.com/bigquery/query-reference#from): `project_name:datasetId.tableId`. To use this value in queries see #query_id.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 130

def id
  @gapi["id"]
end
```
#insert(rows, skip_invalid: nil, ignore_unknown: nil) ⇒ Gcloud::Bigquery::InsertResponse
Inserts data into the table for near-immediate querying, without the need to complete a #load operation before the data can appear in query results.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 800

def insert rows, skip_invalid: nil, ignore_unknown: nil
  rows = [rows] if rows.is_a? Hash
  ensure_connection!
  options = { skip_invalid: skip_invalid, ignore_unknown: ignore_unknown }
  resp = connection.insert_tabledata dataset_id, table_id, rows, options
  if resp.success?
    InsertResponse.from_gapi rows, resp.data
  else
    fail ApiError.from_response(resp)
  end
end
```
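Example (a sketch; the dataset, table, and column names are hypothetical):

```ruby
require "gcloud"

gcloud   = Gcloud.new
bigquery = gcloud.bigquery
dataset  = bigquery.dataset "my_dataset"   # hypothetical names
table    = dataset.table "my_table"

# Rows are hashes keyed by column name; a single Hash is also accepted.
rows = [
  { "first_name" => "Alice", "age" => 21 },
  { "first_name" => "Bob",   "age" => 22 }
]
insert_response = table.insert rows
```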
#link(source_url, create: nil, write: nil, dryrun: nil) ⇒ Gcloud::Bigquery::Job
Links the table to a source table identified by a URI.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 532

def link source_url, create: nil, write: nil, dryrun: nil
  ensure_connection!
  options = { create: create, write: write, dryrun: dryrun }
  resp = connection.link_table table_ref, source_url, options
  if resp.success?
    Job.from_gapi resp.data, connection
  else
    fail ApiError.from_response(resp)
  end
end
```
#load(file, format: nil, create: nil, write: nil, projection_fields: nil, jagged_rows: nil, quoted_newlines: nil, encoding: nil, delimiter: nil, ignore_unknown: nil, max_bad_records: nil, quote: nil, skip_leading: nil, dryrun: nil) ⇒ Gcloud::Bigquery::LoadJob
Loads data into the table. You can pass a Google Cloud Storage file path or a Gcloud::Storage::File instance, or you can upload a local file directly. See [Loading Data with a POST Request](https://cloud.google.com/bigquery/loading-data-post-request#multipart).
### A note about large direct uploads
You may encounter a Broken pipe (Errno::EPIPE) error when attempting to upload large files. To avoid this problem, add the [httpclient](https://rubygems.org/gems/httpclient) gem to your project, and the line (or lines) of configuration shown below. These lines must execute after you require gcloud but before you make your first gcloud connection. The first statement configures [Faraday](https://rubygems.org/gems/faraday) to use httpclient. The second statement, which should only be added if you are using a version of Faraday at or above 0.9.2, is a workaround for [this gzip issue](https://github.com/GoogleCloudPlatform/gcloud-ruby/issues/367).
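A sketch of the configuration matching the description above (the exact gzip middleware registration is an assumption based on the linked issue):

```ruby
require "gcloud"

# Use the httpclient adapter to avoid Errno::EPIPE on large uploads.
Faraday.default_adapter = :httpclient

# Only add this if using Faraday >= 0.9.2: register a pass-through "gzip"
# response middleware so responses are not decompressed twice.
Faraday::Response.register_middleware gzip: Faraday::Response::Middleware
```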
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 748

def load file, format: nil, create: nil, write: nil,
         projection_fields: nil, jagged_rows: nil, quoted_newlines: nil,
         encoding: nil, delimiter: nil, ignore_unknown: nil,
         max_bad_records: nil, quote: nil, skip_leading: nil, dryrun: nil
  ensure_connection!
  options = { format: format, create: create, write: write,
              projection_fields: projection_fields,
              jagged_rows: jagged_rows, quoted_newlines: quoted_newlines,
              encoding: encoding, delimiter: delimiter,
              ignore_unknown: ignore_unknown,
              max_bad_records: max_bad_records, quote: quote,
              skip_leading: skip_leading, dryrun: dryrun }
  return load_storage(file, options) if storage_url? file
  return load_local(file, options) if local_file? file
  fail Gcloud::Bigquery::Error, "Don't know how to load #{file}"
end
```
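Example (a sketch; the dataset, table, bucket, and file names are hypothetical):

```ruby
require "gcloud"

gcloud   = Gcloud.new
bigquery = gcloud.bigquery
dataset  = bigquery.dataset "my_dataset"   # hypothetical names
table    = dataset.table "my_table"

# Load from a Google Cloud Storage path...
load_job = table.load "gs://my-bucket/file-name.csv"

# ...or upload a local file directly, skipping a header row.
file = File.open "data/file-name.csv"
load_job = table.load file, format: "csv", skip_leading: 1
```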
#location ⇒ Object
The geographic location where the table should reside. Possible values include EU and US. The default value is US.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 290

def location
  ensure_full_data!
  @gapi["location"]
end
```
#modified_at ⇒ Object
The time when this table was last modified.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 261

def modified_at
  ensure_full_data!
  Time.at(@gapi["lastModifiedTime"] / 1000.0)
end
```
#name ⇒ Object
The name of the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 161

def name
  @gapi["friendlyName"]
end
```
#name=(new_name) ⇒ Object
Updates the name of the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 170

def name= new_name
  patch_gapi! name: new_name
end
```
#project_id ⇒ Object
The ID of the `Project` containing this table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 107

def project_id
  @gapi["tableReference"]["projectId"]
end
```
#query_id ⇒ Object
The value returned by #id, wrapped in square brackets if the Project ID contains dashes, as specified by the [Query Reference](https://cloud.google.com/bigquery/query-reference#from). Useful in queries.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 152

def query_id
  project_id["-"] ? "[#{id}]" : id
end
```
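Example (a sketch; the dataset, table, and column names are hypothetical):

```ruby
require "gcloud"

gcloud   = Gcloud.new
bigquery = gcloud.bigquery
dataset  = bigquery.dataset "my_dataset"   # hypothetical names
table    = dataset.table "my_table"

# #query_id adds square brackets when the Project ID contains dashes,
# so it can be interpolated directly into a query string.
data = bigquery.query "SELECT first_name FROM #{table.query_id}"
```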
#reload! ⇒ Object Also known as: refresh!
Reloads the table with current data from the BigQuery service.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 844

def reload!
  ensure_connection!
  resp = connection.get_table dataset_id, table_id
  if resp.success?
    @gapi = resp.data
  else
    fail ApiError.from_response(resp)
  end
end
```
#rows_count ⇒ Object
The number of rows in the table.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 228

def rows_count
  ensure_full_data!
  @gapi["numRows"]
end
```
#schema(replace: false) {|schema| ... } ⇒ Object
Returns the table's schema as a hash containing the keys and values returned by the Google Cloud BigQuery [REST API](https://cloud.google.com/bigquery/docs/reference/v2/tables#resource). This method can also be used to set, replace, or add to the schema by passing a block. See Schema for available methods. To set the schema by passing a hash instead, use #schema=.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 329

def schema replace: false
  ensure_full_data!
  g = @gapi
  g = g.to_hash if g.respond_to? :to_hash
  s = g["schema"] ||= {}
  return s unless block_given?
  s = nil if replace
  schema_builder = Schema.new s
  yield schema_builder
  self.schema = schema_builder.schema if schema_builder.changed?
end
```
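A sketch of the block form, assuming the Schema builder exposes field helpers such as `string`, `integer`, and `record` (the dataset, table, and field names below are hypothetical):

```ruby
require "gcloud"

gcloud   = Gcloud.new
bigquery = gcloud.bigquery
dataset  = bigquery.dataset "my_dataset"   # hypothetical names
table    = dataset.table "my_table"

# Replace the existing schema with the one built in the block.
table.schema replace: true do |schema|
  schema.string  "first_name", mode: :required
  schema.integer "age"
  schema.record  "cities_lived", mode: :repeated do |nested|
    nested.string  "place"
    nested.integer "number_of_years"
  end
end
```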
#schema=(new_schema) ⇒ Object
Updates the schema of the table. To update the schema using a block instead, use #schema.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 376

def schema= new_schema
  patch_gapi! schema: new_schema
end
```
#table? ⇒ Boolean
Checks if the table’s type is “TABLE”.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 271

def table?
  @gapi["type"] == "TABLE"
end
```
#table_id ⇒ Object
A unique ID for this table. The ID must contain only letters (a-z, A-Z), numbers (0-9), or underscores (_). The maximum length is 1,024 characters.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 89

def table_id
  @gapi["tableReference"]["tableId"]
end
```
#table_ref ⇒ Object
The gapi fragment containing the Project ID, Dataset ID, and Table ID as a camel-cased hash.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 115

def table_ref
  table_ref = @gapi["tableReference"]
  table_ref = table_ref.to_hash if table_ref.respond_to? :to_hash
  table_ref
end
```
#view? ⇒ Boolean
Checks if the table’s type is “VIEW”.
```ruby
# File 'lib/gcloud/bigquery/table.rb', line 280

def view?
  @gapi["type"] == "VIEW"
end
```