Class: Google::Cloud::AIPlatform::V1::DeployRequest

Inherits: Object

Extended by: Google::Protobuf::MessageExts::ClassMethods

Includes: Google::Protobuf::MessageExts

Defined in: proto_docs/google/cloud/aiplatform/v1/model_garden_service.rb

Overview

Request message for ModelGardenService.Deploy.

Defined Under Namespace

Classes: DeployConfig, EndpointConfig, ModelConfig

Instance Attribute Summary

#deploy_config ⇒ ::Google::Cloud::AIPlatform::V1::DeployRequest::DeployConfig
Optional.
#destination ⇒ ::String
Required.
#endpoint_config ⇒ ::Google::Cloud::AIPlatform::V1::DeployRequest::EndpointConfig
Optional.
#hugging_face_model_id ⇒ ::String
The Hugging Face model to deploy.
#model_config ⇒ ::Google::Cloud::AIPlatform::V1::DeployRequest::ModelConfig
Optional.
#publisher_model_name ⇒ ::String
The Model Garden model to deploy.

Instance Attribute Details

#deploy_config ⇒ `::Google::Cloud::AIPlatform::V1::DeployRequest::DeployConfig`

Optional. The deploy config to use for the deployment. If not specified, the default deploy config is used.

Returns:

(::Google::Cloud::AIPlatform::V1::DeployRequest::DeployConfig) —
Optional. The deploy config to use for the deployment. If not specified, the default deploy config will be used.

# File 'proto_docs/google/cloud/aiplatform/v1/model_garden_service.rb', line 82

class DeployRequest
  include ::Google::Protobuf::MessageExts
  extend ::Google::Protobuf::MessageExts::ClassMethods

  # The model config to use for the deployment.
  # @!attribute [rw] accept_eula
  #   @return [::Boolean]
  #     Optional. Whether the user accepts the End User License Agreement (EULA)
  #     for the model.
  # @!attribute [rw] hugging_face_access_token
  #   @return [::String]
  #     Optional. The Hugging Face read access token used to access the model
  #     artifacts of gated models.
  # @!attribute [rw] hugging_face_cache_enabled
  #   @return [::Boolean]
  #     Optional. If true, the model will deploy with a cached version instead of
  #     directly downloading the model artifacts from Hugging Face. This is
  #     suitable for VPC-SC users with limited internet access.
  # @!attribute [rw] model_display_name
  #   @return [::String]
  #     Optional. The user-specified display name of the uploaded model. If not
  #     set, a default name will be used.
  # @!attribute [rw] container_spec
  #   @return [::Google::Cloud::AIPlatform::V1::ModelContainerSpec]
  #     Optional. The specification of the container that is to be used when
  #     deploying. If not set, the default container spec will be used.
  # @!attribute [rw] model_user_id
  #   @return [::String]
  #     Optional. The ID to use for the uploaded Model, which will become the
  #     final component of the model resource name. When not provided, Vertex AI
  #     will generate a value for this ID. When Model Registry model is provided,
  #     this field will be ignored.
  #
  #     This value may be up to 63 characters, and valid characters are
  #     `[a-z0-9_-]`. The first character cannot be a number or hyphen.
  class ModelConfig
    include ::Google::Protobuf::MessageExts
    extend ::Google::Protobuf::MessageExts::ClassMethods
  end

  # The endpoint config to use for the deployment.
  # @!attribute [rw] endpoint_display_name
  #   @return [::String]
  #     Optional. The user-specified display name of the endpoint. If not set, a
  #     default name will be used.
  # @!attribute [rw] dedicated_endpoint_enabled
  #   @deprecated This field is deprecated and may be removed in the next major version update.
  #   @return [::Boolean]
  #     Optional. Deprecated. Use dedicated_endpoint_disabled instead.
  #     If true, the endpoint will be exposed through a
  #     dedicated DNS [Endpoint.dedicated_endpoint_dns]. Your requests to the
  #     dedicated DNS will be isolated from other users' traffic and will have
  #     better performance and reliability. Note: Once you enable a dedicated
  #     endpoint, you won't be able to send requests to the shared DNS
  #     \\{region}-aiplatform.googleapis.com. The limitations will be removed soon.
  # @!attribute [rw] dedicated_endpoint_disabled
  #   @return [::Boolean]
  #     Optional. By default, if a dedicated endpoint is enabled, the endpoint
  #     will be exposed through a dedicated DNS [Endpoint.dedicated_endpoint_dns].
  #     Your requests to the dedicated DNS will be isolated from other users'
  #     traffic and will have better performance and reliability. Note: Once you
  #     enable a dedicated endpoint, you won't be able to send requests to the
  #     shared DNS \\{region}-aiplatform.googleapis.com. The limitations will be
  #     removed soon.
  #
  #     If this field is set to true, the dedicated endpoint will be disabled
  #     and the deployed model will be exposed through the shared DNS
  #     \\{region}-aiplatform.googleapis.com.
  # @!attribute [rw] endpoint_user_id
  #   @return [::String]
  #     Optional. Immutable. The ID to use for endpoint, which will become the
  #     final component of the endpoint resource name. If not provided, Vertex AI
  #     will generate a value for this ID.
  #
  #     If the first character is a letter, this value may be up to 63
  #     characters, and valid characters are `[a-z0-9-]`. The last character must
  #     be a letter or number.
  #
  #     If the first character is a number, this value may be up to 9 characters,
  #     and valid characters are `[0-9]` with no leading zeros.
  #
  #     When using HTTP/JSON, this field is populated
  #     based on a query string argument, such as `?endpoint_id=12345`. This is
  #     the fallback for fields that are not included in either the URI or the
  #     body.
  class EndpointConfig
    include ::Google::Protobuf::MessageExts
    extend ::Google::Protobuf::MessageExts::ClassMethods
  end

  # The deploy config to use for the deployment.
  # @!attribute [rw] dedicated_resources
  #   @return [::Google::Cloud::AIPlatform::V1::DedicatedResources]
  #     Optional. The dedicated resources to use for the endpoint. If not set,
  #     the default resources will be used.
  # @!attribute [rw] fast_tryout_enabled
  #   @return [::Boolean]
  #     Optional. If true, enable the QMT fast tryout feature for this model if
  #     possible.
  # @!attribute [rw] system_labels
  #   @return [::Google::Protobuf::Map{::String => ::String}]
  #     Optional. System labels for Model Garden deployments.
  #     These labels are managed by Google and for tracking purposes only.
  class DeployConfig
    include ::Google::Protobuf::MessageExts
    extend ::Google::Protobuf::MessageExts::ClassMethods

    # @!attribute [rw] key
    #   @return [::String]
    # @!attribute [rw] value
    #   @return [::String]
    class SystemLabelsEntry
      include ::Google::Protobuf::MessageExts
      extend ::Google::Protobuf::MessageExts::ClassMethods
    end
  end
end
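
The ID rules in the `model_user_id` and `endpoint_user_id` comments above can be checked before a request is sent. The following is a minimal sketch, not part of the library: the constant and method names are hypothetical, and the regular expressions are one reading of the documented rules (in particular, a lone `0` is treated as a leading zero and rejected). The service performs its own validation and remains authoritative.

```ruby
# Hypothetical client-side checks for the documented ID rules.

# model_user_id: up to 63 characters from [a-z0-9_-];
# the first character cannot be a number or hyphen.
MODEL_USER_ID = /\A[a-z_][a-z0-9_-]{0,62}\z/

# endpoint_user_id, first character is a letter: up to 63 characters
# from [a-z0-9-]; the last character must be a letter or number.
ENDPOINT_ALPHA_ID = /\A[a-z](?:[a-z0-9-]{0,61}[a-z0-9])?\z/

# endpoint_user_id, first character is a number: up to 9 digits,
# no leading zeros.
ENDPOINT_NUMERIC_ID = /\A[1-9][0-9]{0,8}\z/

def valid_model_user_id?(id)
  MODEL_USER_ID.match?(id)
end

def valid_endpoint_user_id?(id)
  ENDPOINT_ALPHA_ID.match?(id) || ENDPOINT_NUMERIC_ID.match?(id)
end
```

For example, `valid_endpoint_user_id?("my-endpoint-1")` passes, while `"my-endpoint-"` (trailing hyphen) and `"012"` (leading zero) do not.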

#destination ⇒ `::String`

Required. The resource name of the Location to deploy the model in. Format: `projects/{project}/locations/{location}`.

Returns:

(::String) —
Required. The resource name of the Location to deploy the model in. Format: projects/{project}/locations/{location}
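
The `destination` format can be assembled from its two parts. This is a small sketch only: `deploy_destination` is a hypothetical helper, not a library method, and the rejection of empty segments or segments containing `/` is an assumption made to keep the result well-formed.

```ruby
# Hypothetical helper that builds the documented destination format:
#   projects/{project}/locations/{location}
def deploy_destination(project:, location:)
  [project, location].each do |segment|
    unless segment.is_a?(String) && !segment.empty? && !segment.include?("/")
      raise ArgumentError, "segment must be a non-empty String without '/'"
    end
  end
  "projects/#{project}/locations/#{location}"
end

puts deploy_destination(project: "my-project", location: "us-central1")
```

The project and location values shown are placeholders.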

#endpoint_config ⇒ `::Google::Cloud::AIPlatform::V1::DeployRequest::EndpointConfig`

Optional. The endpoint config to use for the deployment. If not specified, the default endpoint config is used.

Returns:

(::Google::Cloud::AIPlatform::V1::DeployRequest::EndpointConfig) —
Optional. The endpoint config to use for the deployment. If not specified, the default endpoint config will be used.
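
As an illustration of the `EndpointConfig` fields, the sketch below builds a plain Hash that uses `dedicated_endpoint_disabled` rather than the deprecated `dedicated_endpoint_enabled`. Both method names and the `use_shared_dns` keyword are hypothetical; only the field names and the shared DNS pattern `{region}-aiplatform.googleapis.com` come from the class documentation.

```ruby
# Shared DNS host pattern documented for EndpointConfig.
def shared_dns_host(region)
  "#{region}-aiplatform.googleapis.com"
end

# Hypothetical builder: returns a Hash keyed by EndpointConfig field names.
# Fields left out of the Hash fall back to the service defaults.
def endpoint_config(display_name: nil, use_shared_dns: false)
  config = {}
  config[:endpoint_display_name] = display_name if display_name
  # true disables the dedicated endpoint, so the model is served
  # through the shared DNS instead.
  config[:dedicated_endpoint_disabled] = true if use_shared_dns
  config
end

p endpoint_config(display_name: "demo-endpoint", use_shared_dns: true)
puts shared_dns_host("us-central1")
```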

#hugging_face_model_id ⇒ `::String`

The Hugging Face model to deploy, given as a Hugging Face model ID such as `google/gemma-2-2b-it`.

Note: The following fields are mutually exclusive: hugging_face_model_id, publisher_model_name. If a field in that set is populated, all other fields in the set will automatically be cleared.

Returns:

(::String) —
The Hugging Face model to deploy. Format: Hugging Face model ID like google/gemma-2-2b-it.

Note: The following fields are mutually exclusive: hugging_face_model_id, publisher_model_name. If a field in that set is populated, all other fields in the set will automatically be cleared.
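
Because setting one of the two model fields clears the other, it can help to reject ambiguous input before a request is built. The sketch below assembles a plain Hash keyed by the `DeployRequest` field names; `deploy_request` is a hypothetical helper, and raising on ambiguous input is a design choice of this sketch, not library behavior.

```ruby
# Hypothetical builder for a Hash shaped like DeployRequest.
# Exactly one of hugging_face_model_id / publisher_model_name must be given,
# so the caller never relies on one field silently clearing the other.
def deploy_request(destination:, hugging_face_model_id: nil,
                   publisher_model_name: nil, **configs)
  sources = {
    hugging_face_model_id: hugging_face_model_id,
    publisher_model_name: publisher_model_name
  }.compact
  unless sources.size == 1
    raise ArgumentError,
          "set exactly one of hugging_face_model_id or publisher_model_name"
  end

  # Only the three optional config messages are accepted as extra keys.
  unknown = configs.keys - %i[model_config endpoint_config deploy_config]
  raise ArgumentError, "unknown fields: #{unknown.join(', ')}" unless unknown.empty?

  { destination: destination }.merge(sources).merge(configs)
end

p deploy_request(
  destination: "projects/my-project/locations/us-central1",
  hugging_face_model_id: "google/gemma-2-2b-it",
  model_config: { accept_eula: true }
)
```

The project and location in the usage call are placeholders.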

#model_config ⇒ `::Google::Cloud::AIPlatform::V1::DeployRequest::ModelConfig`

Optional. The model config to use for the deployment. If not specified, the default model config is used.

Returns:

(::Google::Cloud::AIPlatform::V1::DeployRequest::ModelConfig) —
Optional. The model config to use for the deployment. If not specified, the default model config will be used.

#publisher_model_name ⇒ `::String`

The Model Garden model to deploy. Format: `publishers/{publisher}/models/{publisher_model}@{version_id}`, or `publishers/hf-{hugging-face-author}/models/{hugging-face-model-name}@001`.

Note: The following fields are mutually exclusive: publisher_model_name, hugging_face_model_id. If a field in that set is populated, all other fields in the set will automatically be cleared.

Returns:

(::String) —
The Model Garden model to deploy. Format: `publishers/{publisher}/models/{publisher_model}@{version_id}`, or `publishers/hf-{hugging-face-author}/models/{hugging-face-model-name}@001`.

Note: The following fields are mutually exclusive: `publisher_model_name`, `hugging_face_model_id`. If a field in that set is populated, all other fields in the set will automatically be cleared.
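
The two documented name formats can be assembled with a small helper before the value is assigned to `publisher_model_name`. The following is a minimal sketch, not part of the gem: the method names `publisher_model_name` and `hugging_face_publisher_model_name`, the constant `PUBLISHER_MODEL_NAME`, and the loose character classes in the pattern are all assumptions made for illustration. The service remains the authority on which names are valid.

```ruby
# Hypothetical helpers; they are not part of google-cloud-ai_platform-v1.

# Loose pattern covering the documented formats. The character classes are
# an assumption: each segment is any run without "/", "@", or whitespace.
PUBLISHER_MODEL_NAME = %r{\Apublishers/[^/@\s]+/models/[^/@\s]+@[^/@\s]+\z}

# Builds "publishers/{publisher}/models/{publisher_model}@{version_id}".
def publisher_model_name(publisher:, model:, version:)
  name = "publishers/#{publisher}/models/#{model}@#{version}"
  unless PUBLISHER_MODEL_NAME.match?(name)
    raise ArgumentError, "malformed publisher model name: #{name}"
  end
  name
end

# Builds the Hugging Face variant, which is documented with the "hf-"
# publisher prefix and the version "001".
def hugging_face_publisher_model_name(author:, model:)
  publisher_model_name(publisher: "hf-#{author}", model: model, version: "001")
end

# Placeholder values only; substitute a real publisher and model.
puts publisher_model_name(publisher: "example-publisher",
                          model: "example-model",
                          version: "001")
puts hugging_face_publisher_model_name(author: "example-author",
                                       model: "example-model")
```

Because `publisher_model_name` and `hugging_face_model_id` are mutually exclusive, only one of them should be set on a given request; per the note above, populating one clears the other.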

# File 'proto_docs/google/cloud/aiplatform/v1/model_garden_service.rb', line 82

class DeployRequest
  include ::Google::Protobuf::MessageExts
  extend ::Google::Protobuf::MessageExts::ClassMethods

  # The model config to use for the deployment.
  # @!attribute [rw] accept_eula
  #   @return [::Boolean]
  #     Optional. Whether the user accepts the End User License Agreement (EULA)
  #     for the model.
  # @!attribute [rw] hugging_face_access_token
  #   @return [::String]
  #     Optional. The Hugging Face read access token used to access the model
  #     artifacts of gated models.
  # @!attribute [rw] hugging_face_cache_enabled
  #   @return [::Boolean]
  #     Optional. If true, the model will deploy with a cached version instead of
  #     directly downloading the model artifacts from Hugging Face. This is
  #     suitable for VPC-SC users with limited internet access.
  # @!attribute [rw] model_display_name
  #   @return [::String]
  #     Optional. The user-specified display name of the uploaded model. If not
  #     set, a default name will be used.
  # @!attribute [rw] container_spec
  #   @return [::Google::Cloud::AIPlatform::V1::ModelContainerSpec]
  #     Optional. The specification of the container that is to be used when
  #     deploying. If not set, the default container spec will be used.
  # @!attribute [rw] model_user_id
  #   @return [::String]
  #     Optional. The ID to use for the uploaded Model, which will become the
  #     final component of the model resource name. When not provided, Vertex AI
  #     will generate a value for this ID. When a Model Registry model is
  #     provided, this field will be ignored.
  #
  #     This value may be up to 63 characters, and valid characters are
  #     `[a-z0-9_-]`. The first character cannot be a number or hyphen.
  class ModelConfig
    include ::Google::Protobuf::MessageExts
    extend ::Google::Protobuf::MessageExts::ClassMethods
  end

  # The endpoint config to use for the deployment.
  # @!attribute [rw] endpoint_display_name
  #   @return [::String]
  #     Optional. The user-specified display name of the endpoint. If not set, a
  #     default name will be used.
  # @!attribute [rw] dedicated_endpoint_enabled
  #   @deprecated This field is deprecated and may be removed in the next major version update.
  #   @return [::Boolean]
  #     Optional. Deprecated. Use dedicated_endpoint_disabled instead.
  #     If true, the endpoint will be exposed through a dedicated DNS
  #     [Endpoint.dedicated_endpoint_dns]. Requests to the dedicated DNS will be
  #     isolated from other users' traffic and will have better performance and
  #     reliability. Note: Once you enable the dedicated endpoint, you won't be
  #     able to send requests to the shared DNS
  #     \\{region}-aiplatform.googleapis.com. This limitation will be removed
  #     soon.
  # @!attribute [rw] dedicated_endpoint_disabled
  #   @return [::Boolean]
  #     Optional. By default, if the dedicated endpoint is enabled, the endpoint
  #     will be exposed through a dedicated DNS
  #     [Endpoint.dedicated_endpoint_dns]. Requests to the dedicated DNS will be
  #     isolated from other users' traffic and will have better performance and
  #     reliability. Note: Once you enable the dedicated endpoint, you won't be
  #     able to send requests to the shared DNS
  #     \\{region}-aiplatform.googleapis.com. This limitation will be removed
  #     soon.
  #
  #     If this field is set to true, the dedicated endpoint will be disabled
  #     and the deployed model will be exposed through the shared DNS
  #     \\{region}-aiplatform.googleapis.com.
  # @!attribute [rw] endpoint_user_id
  #   @return [::String]
  #     Optional. Immutable. The ID to use for the endpoint, which will become
  #     the final component of the endpoint resource name. If not provided,
  #     Vertex AI will generate a value for this ID.
  #
  #     If the first character is a letter, this value may be up to 63
  #     characters, and valid characters are `[a-z0-9-]`. The last character must
  #     be a letter or number.
  #
  #     If the first character is a number, this value may be up to 9 characters,
  #     and valid characters are `[0-9]` with no leading zeros.
  #
  #     When using HTTP/JSON, this field is populated
  #     based on a query string argument, such as `?endpoint_id=12345`. This is
  #     the fallback for fields that are not included in either the URI or the
  #     body.
  class EndpointConfig
    include ::Google::Protobuf::MessageExts
    extend ::Google::Protobuf::MessageExts::ClassMethods
  end

  # The deploy config to use for the deployment.
  # @!attribute [rw] dedicated_resources
  #   @return [::Google::Cloud::AIPlatform::V1::DedicatedResources]
  #     Optional. The dedicated resources to use for the endpoint. If not set,
  #     the default resources will be used.
  # @!attribute [rw] fast_tryout_enabled
  #   @return [::Boolean]
  #     Optional. If true, enable the QMT fast tryout feature for this model if
  #     possible.
  # @!attribute [rw] system_labels
  #   @return [::Google::Protobuf::Map{::String => ::String}]
  #     Optional. System labels for Model Garden deployments.
  #     These labels are managed by Google and are for tracking purposes only.
  class DeployConfig
    include ::Google::Protobuf::MessageExts
    extend ::Google::Protobuf::MessageExts::ClassMethods

    # @!attribute [rw] key
    #   @return [::String]
    # @!attribute [rw] value
    #   @return [::String]
    class SystemLabelsEntry
      include ::Google::Protobuf::MessageExts
      extend ::Google::Protobuf::MessageExts::ClassMethods
    end
  end
end
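
The ID rules stated in the comments for `model_user_id` and `endpoint_user_id` can be checked on the client before a request is sent. The sketch below is one reading of those rules in plain Ruby. The method names `valid_model_user_id?` and `valid_endpoint_user_id?` are hypothetical and not provided by the gem, and treating a lone `"0"` as a leading zero is an assumption. Server-side validation remains authoritative.

```ruby
# Hypothetical validators; the gem does not ship these.

# model_user_id: up to 63 characters from [a-z0-9_-]; the first character
# cannot be a number or hyphen, which leaves [a-z_] for the first position.
def valid_model_user_id?(id)
  id.match?(/\A[a-z_][a-z0-9_-]{0,62}\z/)
end

# endpoint_user_id has two documented forms, selected by the first character.
def valid_endpoint_user_id?(id)
  case id
  when /\A[a-z]/
    # Letter first: up to 63 characters from [a-z0-9-], and the last
    # character must be a letter or number.
    id.match?(/\A[a-z](?:[a-z0-9-]{0,61}[a-z0-9])?\z/)
  when /\A[0-9]/
    # Number first: up to 9 digits with no leading zeros.
    # Assumption: a lone "0" is rejected as a leading zero.
    id.match?(/\A[1-9][0-9]{0,8}\z/)
  else
    false
  end
end

# Print the result for a few sample IDs.
%w[my-endpoint my-endpoint- 12345 012345].each do |id|
  puts "#{id}: #{valid_endpoint_user_id?(id)}"
end
```

Checking IDs locally only gives earlier feedback; since `endpoint_user_id` is documented as immutable, it is worth catching a malformed value before the deployment is created.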
