Class: Aws::Comprehend::Types::DatasetInputDataConfig

Inherits:
Struct
  • Object
Includes:
Structure
Defined in:
lib/aws-sdk-comprehend/types.rb

Overview

Specifies the format and location of the input data for the dataset.

Constant Summary

SENSITIVE = []

Instance Attribute Summary

Instance Attribute Details

#augmented_manifests ⇒ Array<Types::DatasetAugmentedManifestsListItem>

A list of augmented manifest files that provide training data for your custom model. An augmented manifest file is a labeled dataset that is produced by Amazon SageMaker Ground Truth.



# File 'lib/aws-sdk-comprehend/types.rb', line 1737

class DatasetInputDataConfig < Struct.new(
  :augmented_manifests,
  :data_format,
  :document_classifier_input_data_config,
  :entity_recognizer_input_data_config)
  SENSITIVE = []
  include Aws::Structure
end
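
As a rough illustration of how a struct with these four members behaves, the sketch below uses a stand-in built on plain Ruby `Struct`, so it runs without the gem. The name `DatasetInputDataConfigSketch` and the sample value are hypothetical, and the stand-in does not reproduce whatever `Aws::Structure` adds to the real class.

```ruby
# Stand-in for illustration only: same four members as the listing above,
# but built on plain Struct (no Aws::Structure mixin).
DatasetInputDataConfigSketch = Struct.new(
  :augmented_manifests,
  :data_format,
  :document_classifier_input_data_config,
  :entity_recognizer_input_data_config,
  keyword_init: true
)

config = DatasetInputDataConfigSketch.new(data_format: "COMPREHEND_CSV")

# Members that were not supplied default to nil, so filter them out
# to see which attributes were actually set.
set_members = config.to_h.reject { |_name, value| value.nil? }

puts config.data_format
puts set_members.keys.inspect
```

Filtering out `nil` members is one simple way to check which of the optional attributes a given instance actually carries.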

#data_format ⇒ String

`COMPREHEND_CSV`: The data format is a two-column CSV file, where the first column contains labels and the second column contains documents.

`AUGMENTED_MANIFEST`: The data format is a labeled dataset that is produced by Amazon SageMaker Ground Truth.

Returns:

  • (String)
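
Because `data_format` is a plain String, a caller may want to check a value against the two formats named above before building a request. The helper below is a minimal sketch of that idea; `normalize_data_format` and `VALID_DATA_FORMATS` are hypothetical names and are not part of the SDK.

```ruby
# Hypothetical helper, not part of aws-sdk-comprehend.
# The two accepted values are the ones documented for data_format.
VALID_DATA_FORMATS = %w[COMPREHEND_CSV AUGMENTED_MANIFEST].freeze

def normalize_data_format(value)
  # Accept symbols or mixed-case strings and compare in upper case.
  format = value.to_s.strip.upcase
  unless VALID_DATA_FORMATS.include?(format)
    raise ArgumentError, "unsupported data_format: #{value.inspect}"
  end
  format
end

puts normalize_data_format("comprehend_csv")
```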



#document_classifier_input_data_config ⇒ Types::DatasetDocumentClassifierInputDataConfig

The input properties for training a document classifier model.

For more information on how the input file is formatted, see [Preparing training data][1] in the Comprehend Developer Guide.

[1]: docs.aws.amazon.com/comprehend/latest/dg/prep-classifier-data.html
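
To make the shape of this attribute concrete, the sketch below assembles a plain Hash that pairs `data_format` with a document classifier input configuration. The helper name `classifier_dataset_input` and the S3 path are hypothetical, and the nested `s3_uri` and `label_delimiter` keys are assumptions about `Types::DatasetDocumentClassifierInputDataConfig` that should be checked against that type's own reference page.

```ruby
# Hypothetical helper: assembles a plain Hash shaped like this structure.
# The s3_uri and label_delimiter keys are assumptions; verify them against
# Types::DatasetDocumentClassifierInputDataConfig before relying on them.
def classifier_dataset_input(s3_uri:, label_delimiter: nil)
  classifier = { s3_uri: s3_uri }
  # Only include the delimiter when the caller supplied one.
  classifier[:label_delimiter] = label_delimiter if label_delimiter

  {
    data_format: "COMPREHEND_CSV",
    document_classifier_input_data_config: classifier
  }
end

# The bucket and key below are placeholders.
input = classifier_dataset_input(s3_uri: "s3://example-bucket/training/labels.csv")
puts input.inspect
```

Keeping the optional key out of the Hash when it is not set avoids sending an explicit `nil` for a value the caller never chose.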




#entity_recognizer_input_data_config ⇒ Types::DatasetEntityRecognizerInputDataConfig

The input properties for training an entity recognizer model.


