Class: OpenAI::Models::Realtime::RealtimeTruncationRetentionRatio

Inherits:
Internal::Type::BaseModel
Defined in:
lib/openai/models/realtime/realtime_truncation_retention_ratio.rb

Defined Under Namespace

Classes: TokenLimits

Instance Attribute Summary

Instance Method Summary

Methods inherited from Internal::Type::BaseModel

==, #==, #[], coerce, #deconstruct_keys, #deep_to_h, dump, fields, hash, #hash, inherited, inspect, #inspect, known_fields, optional, recursively_to_h, required, #to_h, #to_json, #to_s, to_sorbet_type, #to_yaml

Methods included from Internal::Type::Converter

#coerce, coerce, #dump, dump, inspect, #inspect, meta_info, new_coerce_state, type_info

Methods included from Internal::Util::SorbetRuntimeSupport

#const_missing, #define_sorbet_constant!, #sorbet_constant_defined?, #to_sorbet_type, to_sorbet_type

Constructor Details

#initialize(retention_ratio:, token_limits: nil, type: :retention_ratio) ⇒ Object

Some parameter documentation has been truncated; see OpenAI::Models::Realtime::RealtimeTruncationRetentionRatio for more details.

Retain a fraction of the conversation tokens when the conversation exceeds the input token limit. This allows you to amortize truncations across multiple turns, which can help improve cached token usage.

Parameters:

  • retention_ratio (Float)

    Fraction of post-instruction conversation tokens to retain (`0.0` - `1.0`) when the conversation exceeds the input token limit.

  • token_limits (OpenAI::Models::Realtime::RealtimeTruncationRetentionRatio::TokenLimits) (defaults to: nil)

    Optional custom token limits for this truncation strategy. If not provided, the model's default token limits will be used.

  • type (Symbol, :retention_ratio) (defaults to: :retention_ratio)

    Use retention ratio truncation.



# File 'lib/openai/models/realtime/realtime_truncation_retention_ratio.rb', line 29
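The constructor above takes a required `retention_ratio`, an optional `token_limits` override, and a fixed `type`. As a rough, gem-independent sketch of the payload shape this class models (a plain Hash stands in for the model object; the gem's own BaseModel API is not invoked here, and the range check mirrors the documented `0.0` - `1.0` constraint):

```ruby
# Illustrative sketch only: the shape of the truncation configuration,
# built as a plain Hash rather than through the gem's BaseModel machinery.
truncation = {
  type: :retention_ratio, # fixed discriminator for this strategy
  retention_ratio: 0.8    # retain 80% of post-instruction tokens after truncation
  # token_limits: ...     # optional TokenLimits override; omitted here
}

# Mirror the documented constraint that retention_ratio lies in 0.0..1.0.
unless (0.0..1.0).cover?(truncation[:retention_ratio])
  raise ArgumentError, "retention_ratio must be within 0.0..1.0"
end
```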

Instance Attribute Details

#retention_ratio ⇒ Float

Fraction of post-instruction conversation tokens to retain (`0.0` - `1.0`) when the conversation exceeds the input token limit. Setting this to `0.8` means that messages will be dropped until 80% of the maximum allowed tokens are used. This helps reduce the frequency of truncations and improve cache rates.

Returns:

  • (Float)


# File 'lib/openai/models/realtime/realtime_truncation_retention_ratio.rb', line 14

required :retention_ratio, Float
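The drop-messages-until-under-target behavior described above can be sketched in plain Ruby. This is an illustrative approximation, not the server's actual algorithm: the `truncate` helper, the message list, and the token counts are all invented for the example.

```ruby
# Sketch of retention-ratio truncation: when the conversation exceeds
# max_tokens, drop the oldest messages until usage falls to
# retention_ratio * max_tokens (e.g. 80% of the limit).
def truncate(messages, max_tokens:, retention_ratio:)
  total = messages.sum { |m| m[:tokens] }
  return messages if total <= max_tokens # under the limit: nothing to do

  target = (retention_ratio * max_tokens).floor
  kept = messages.dup
  while kept.sum { |m| m[:tokens] } > target && kept.size > 1
    kept.shift # drop the oldest message first
  end
  kept
end

messages = [
  { role: :user, tokens: 400 },
  { role: :assistant, tokens: 300 },
  { role: :user, tokens: 200 },
  { role: :assistant, tokens: 150 },
]

truncate(messages, max_tokens: 1000, retention_ratio: 0.8)
# => drops the oldest 400-token message, keeping 650 tokens across 3 messages
```

Because the target sits below the hard limit, several turns can elapse before the next truncation is needed, which is the amortization effect the class documentation describes.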

#token_limits ⇒ OpenAI::Models::Realtime::RealtimeTruncationRetentionRatio::TokenLimits?

Optional custom token limits for this truncation strategy. If not provided, the model’s default token limits will be used.

Returns:

  • (OpenAI::Models::Realtime::RealtimeTruncationRetentionRatio::TokenLimits, nil)



# File 'lib/openai/models/realtime/realtime_truncation_retention_ratio.rb', line 27

optional :token_limits, -> { OpenAI::Realtime::RealtimeTruncationRetentionRatio::TokenLimits }

#type ⇒ Symbol, :retention_ratio

Use retention ratio truncation.

Returns:

  • (Symbol, :retention_ratio)


# File 'lib/openai/models/realtime/realtime_truncation_retention_ratio.rb', line 20

required :type, const: :retention_ratio
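The `const: :retention_ratio` declaration pins this attribute to a single value, so it can act as the discriminator identifying this truncation variant in serialized form. A rough plain-Ruby analogue (a hypothetical Struct, not the gem's BaseModel machinery) illustrates the idea:

```ruby
# Hypothetical analogue of a const-valued discriminator field:
# `type` always returns :retention_ratio and is always serialized.
TruncationRatio = Struct.new(:retention_ratio, :token_limits, keyword_init: true) do
  def type
    :retention_ratio # constant: callers cannot override it
  end

  def to_h
    # compact drops the optional token_limits when it was not provided
    { type: type, retention_ratio: retention_ratio, token_limits: token_limits }.compact
  end
end

TruncationRatio.new(retention_ratio: 0.5).to_h
# => { type: :retention_ratio, retention_ratio: 0.5 }
```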