Class: Aws::Firehose::Types::ParquetSerDe

Inherits:
  Struct
    • Object
Includes:
  Aws::Structure
Defined in:
  lib/aws-sdk-firehose/types.rb

Overview

Note:

When making an API call, you may pass ParquetSerDe data as a hash:

{
  block_size_bytes: 1,
  page_size_bytes: 1,
  compression: "UNCOMPRESSED", # accepts UNCOMPRESSED, GZIP, SNAPPY
  enable_dictionary_compression: false,
  max_padding_bytes: 1,
  writer_version: "V1", # accepts V1, V2
}

A serializer to use for converting data to the Parquet format before storing it in Amazon S3. For more information, see [Apache Parquet][1].

[1]: https://parquet.apache.org/documentation/latest/
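The hash above is typically passed inside a delivery stream's data format conversion configuration. A minimal sketch of where it fits, assuming an extended S3 destination and a Glue schema; the stream name, ARNs, database, and table below are placeholders, and unrelated settings are omitted:

require 'aws-sdk-firehose'

firehose = Aws::Firehose::Client.new(region: 'us-east-1')

# Sketch of a delivery stream that converts incoming JSON to Parquet.
firehose.create_delivery_stream(
  delivery_stream_name: 'example-parquet-stream',
  extended_s3_destination_configuration: {
    role_arn: 'arn:aws:iam::123456789012:role/example-firehose-role',
    bucket_arn: 'arn:aws:s3:::example-bucket',
    data_format_conversion_configuration: {
      enabled: true,
      input_format_configuration: {
        deserializer: { open_x_json_ser_de: {} }
      },
      output_format_configuration: {
        serializer: {
          parquet_ser_de: {
            block_size_bytes: 268_435_456, # 256 MiB, the default
            page_size_bytes: 1_048_576,    # 1 MiB, the default
            compression: "SNAPPY",
            enable_dictionary_compression: false,
            max_padding_bytes: 0,
            writer_version: "V1",
          },
        },
      },
      schema_configuration: {
        role_arn: 'arn:aws:iam::123456789012:role/example-firehose-role',
        database_name: 'example_db',  # Glue Data Catalog database (placeholder)
        table_name: 'example_table',  # Glue Data Catalog table (placeholder)
      },
    },
  }
)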

Constant Summary

SENSITIVE = []

Instance Attribute Summary

  • #block_size_bytes ⇒ Integer
  • #compression ⇒ String
  • #enable_dictionary_compression ⇒ Boolean
  • #max_padding_bytes ⇒ Integer
  • #page_size_bytes ⇒ Integer
  • #writer_version ⇒ String

Instance Attribute Details

#block_size_bytes ⇒ Integer

The Hadoop Distributed File System (HDFS) block size. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 256 MiB and the minimum is 64 MiB. Kinesis Data Firehose uses this value for padding calculations.

Returns:

  • (Integer)

# File 'lib/aws-sdk-firehose/types.rb', line 3910

class ParquetSerDe < Struct.new(
  :block_size_bytes,
  :page_size_bytes,
  :compression,
  :enable_dictionary_compression,
  :max_padding_bytes,
  :writer_version)
  SENSITIVE = []
  include Aws::Structure
end
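As a worked illustration (plain arithmetic, not SDK constants), the documented default and minimum translate to these byte counts:

# 256 MiB default and 64 MiB minimum, expressed in bytes.
default_block_size = 256 * 1024 * 1024 # => 268_435_456
minimum_block_size = 64 * 1024 * 1024  # => 67_108_864

parquet_ser_de = { block_size_bytes: default_block_size }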

#compression ⇒ String

The compression code to use over data blocks. The possible values are `UNCOMPRESSED`, `SNAPPY`, and `GZIP`, with the default being `SNAPPY`. Use `SNAPPY` for higher decompression speed. Use `GZIP` if the compression ratio is more important than speed.

Returns:

  • (String)
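A minimal sketch of that trade-off; prefer_smaller_output is a hypothetical application-level flag, not an SDK parameter:

prefer_smaller_output = false # hypothetical flag (assumption)

# SNAPPY decompresses faster; GZIP usually compresses smaller.
parquet_ser_de = {
  compression: prefer_smaller_output ? "GZIP" : "SNAPPY"
}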


#enable_dictionary_compression ⇒ Boolean

Indicates whether to enable dictionary compression.

Returns:

  • (Boolean)


#max_padding_bytes ⇒ Integer

The maximum amount of padding to apply. This is useful if you intend to copy the data from Amazon S3 to HDFS before querying. The default is 0.

Returns:

  • (Integer)
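If you do plan to copy to HDFS, a hedged sketch of pairing padding with the block size; the 8 MiB figure is illustrative, not a recommendation:

parquet_ser_de = {
  block_size_bytes: 256 * 1024 * 1024, # align with the HDFS block size
  max_padding_bytes: 8 * 1024 * 1024   # allow up to 8 MiB of padding (illustrative)
}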


#page_size_bytes ⇒ Integer

The Parquet page size. Column chunks are divided into pages. A page is conceptually an indivisible unit (in terms of compression and encoding). The minimum value is 64 KiB and the default is 1 MiB.

Returns:

  • (Integer)
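For reference, the stated minimum and default as byte counts (plain arithmetic, not SDK constants):

minimum_page_size = 64 * 1024   # 64 KiB => 65_536
default_page_size = 1024 * 1024 # 1 MiB  => 1_048_576

parquet_ser_de = { page_size_bytes: default_page_size }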


#writer_version ⇒ String

Indicates the version of row format to output. The possible values are `V1` and `V2`. The default is `V1`.

Returns:

  • (String)
