Class: Aws::SageMaker::Types::InferenceComponentCapacitySize

Inherits:
Struct
  • Object
Includes:
Aws::Structure
Defined in:
lib/aws-sdk-sagemaker/types.rb

Overview

Specifies the type and size of the endpoint capacity to activate for a rolling deployment or a rollback strategy. You can specify your batches as either of the following:

  • A count of inference component copies

  • The overall percentage of your fleet

For a rollback strategy, if you don't specify the fields in this object, or if you set the `Value` parameter to 100%, then SageMaker AI uses a blue/green rollback strategy and rolls all traffic back to the blue fleet.
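
The two batch styles above can be sketched as plain hashes with `type` and `value` keys, mirroring the two members of this structure. The helper name `capacity_size` and its validation rules are hypothetical and not part of the SDK; in particular, the upper bound of 100 for percentages is an assumption rather than a documented constraint.

```ruby
# Hypothetical helper (not part of aws-sdk-sagemaker): builds a hash with the
# same two members as InferenceComponentCapacitySize.
VALID_CAPACITY_TYPES = %w[COPY_COUNT CAPACITY_PERCENT].freeze

def capacity_size(type, value)
  unless VALID_CAPACITY_TYPES.include?(type)
    raise ArgumentError, "unknown capacity type: #{type.inspect}"
  end
  unless value.is_a?(Integer) && value.positive?
    raise ArgumentError, "value must be a positive Integer"
  end
  # Assumption: a percentage above 100 is not meaningful.
  if type == "CAPACITY_PERCENT" && value > 100
    raise ArgumentError, "percentage must not exceed 100"
  end

  { type: type, value: value }
end

# Batches expressed as a count of inference component copies.
by_copies = capacity_size("COPY_COUNT", 2)

# Batches expressed as a percentage of the fleet.
by_percent = capacity_size("CAPACITY_PERCENT", 25)
```

A hash of this shape would typically be supplied wherever a request expects this type; consult the relevant client method's documentation for the exact parameter names.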

Constant Summary

SENSITIVE = []

Instance Attribute Summary

Instance Attribute Details

#type ⇒ String

Specifies the endpoint capacity type.

COPY_COUNT
: The endpoint activates based on the number of inference component copies.

CAPACITY_PERCENT
: The endpoint activates based on the specified percentage of capacity.

Returns:

  • (String)


# File 'lib/aws-sdk-sagemaker/types.rb', line 25888

class InferenceComponentCapacitySize < Struct.new(
  :type,
  :value)
  SENSITIVE = []
  include Aws::Structure
end

#value ⇒ Integer

Defines the capacity size, either as a number of inference component copies or a capacity percentage.

Returns:

  • (Integer)
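
Because the class is a `Struct` with `type` and `value` members, the meaning of `value` depends on `type`. The sketch below illustrates that pairing with a stand-in struct so it runs without the SDK. The names `CapacitySizeSketch` and `describe_capacity` are hypothetical, the stand-in omits `Aws::Structure`, and the "per batch" wording is an interpretation of the overview rather than SDK output.

```ruby
# Stand-in for the real class: same two members, no Aws::Structure mixin.
CapacitySizeSketch = Struct.new(:type, :value)

# Hypothetical helper: renders a capacity size as a human-readable string,
# interpreting value according to type.
def describe_capacity(size)
  case size.type
  when "COPY_COUNT"
    "#{size.value} inference component copies per batch"
  when "CAPACITY_PERCENT"
    "#{size.value}% of capacity per batch"
  else
    "unrecognized capacity type"
  end
end

percent_size = CapacitySizeSketch.new("CAPACITY_PERCENT", 50)
copy_size    = CapacitySizeSketch.new("COPY_COUNT", 3)

puts describe_capacity(percent_size)
puts describe_capacity(copy_size)
```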

