Class: Aws::SageMaker::Types::ProductionVariantServerlessConfig

Inherits:
Struct
  • Object
Includes:
Aws::Structure
Defined in:
lib/aws-sdk-sagemaker/types.rb

Overview

Specifies the serverless configuration for an endpoint variant.
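As a rough illustration, the three attributes of this structure can be written as a plain Ruby hash and nested under the `serverless_config` key of a production variant, for example when assembling a request for `create_endpoint_config`. The variant name, model name, and numeric values below are hypothetical placeholders, and the sketch makes no AWS call.

```ruby
# Hypothetical values chosen for illustration only.
serverless_config = {
  memory_size_in_mb: 2048,       # one of the documented 1 GB increments
  max_concurrency: 20,           # upper bound on concurrent invocations
  provisioned_concurrency: 5     # kept <= max_concurrency
}

# "AllTraffic" and "my-model" are placeholder names, not values from this page.
production_variant = {
  variant_name: "AllTraffic",
  model_name: "my-model",
  serverless_config: serverless_config
}
```

A hash like this would typically be one element of the `production_variants` array in the request.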

Constant Summary

SENSITIVE =
[]

Instance Attribute Summary

  • #max_concurrency ⇒ Integer
  • #memory_size_in_mb ⇒ Integer
  • #provisioned_concurrency ⇒ Integer

Instance Attribute Details

#max_concurrency ⇒ Integer

The maximum number of concurrent invocations your serverless endpoint can process.

Returns:

  • (Integer)

# File 'lib/aws-sdk-sagemaker/types.rb', line 37619

class ProductionVariantServerlessConfig < Struct.new(
  :memory_size_in_mb,
  :max_concurrency,
  :provisioned_concurrency)
  SENSITIVE = []
  include Aws::Structure
end

#memory_size_in_mb ⇒ Integer

The memory size of your serverless endpoint. Valid values are in 1 GB increments: 1024 MB, 2048 MB, 3072 MB, 4096 MB, 5120 MB, or 6144 MB.

Returns:

  • (Integer)
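The allowed memory sizes listed above can be checked locally before a request is sent. The following is a minimal sketch; `VALID_MEMORY_SIZES_MB` and `valid_memory_size?` are hypothetical names introduced here and are not part of the SDK.

```ruby
# Allowed sizes from the attribute description: 1024 MB to 6144 MB in 1 GB steps.
VALID_MEMORY_SIZES_MB = (1024..6144).step(1024).to_a.freeze

# Hypothetical helper (not an SDK method): true when the size is one of the
# documented increments.
def valid_memory_size?(size_in_mb)
  VALID_MEMORY_SIZES_MB.include?(size_in_mb)
end

valid_memory_size?(3072) # => true
valid_memory_size?(1536) # => false (not a 1 GB increment)
```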


#provisioned_concurrency ⇒ Integer

The amount of provisioned concurrency to allocate for the serverless endpoint. Should be less than or equal to `MaxConcurrency`.

Note: This field is not supported for serverless endpoint recommendations for Inference Recommender jobs. For more information about creating an Inference Recommender job, see [CreateInferenceRecommendationsJobs][1].

[1]: https://docs.aws.amazon.com/sagemaker/latest/APIReference/API_CreateInferenceRecommendationsJob.html

Returns:

  • (Integer)
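The constraint that provisioned concurrency should not exceed `MaxConcurrency` can also be checked on a plain hash before it is sent. This is a sketch under two assumptions: `provisioned_concurrency_ok?` is a hypothetical helper, not an SDK method, and an absent `provisioned_concurrency` is treated as acceptable.

```ruby
# Hypothetical helper (not part of the SDK). Expects a hash with a
# :max_concurrency key and, optionally, a :provisioned_concurrency key.
def provisioned_concurrency_ok?(config)
  provisioned = config[:provisioned_concurrency]
  # Assumption: a missing value means no provisioned concurrency was requested.
  return true if provisioned.nil?

  provisioned <= config[:max_concurrency]
end

provisioned_concurrency_ok?(max_concurrency: 20, provisioned_concurrency: 5)  # => true
provisioned_concurrency_ok?(max_concurrency: 20, provisioned_concurrency: 25) # => false
```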
