Class: OpenAI::Resources::Audio::Transcriptions

Inherits:
Object
Defined in:
lib/openai/resources/audio/transcriptions.rb

Instance Method Summary

Constructor Details

#initialize(client:) ⇒ Transcriptions

This method is part of a private API. You should avoid using this method if possible, as it may be removed or changed in the future.

Returns a new instance of Transcriptions.

Parameters:

  • client (OpenAI::Client)

# File 'lib/openai/resources/audio/transcriptions.rb', line 118

def initialize(client:)
  @client = client
end
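
Since this constructor is private API, the resource is normally reached through the client rather than built directly. A minimal sketch, assuming the standard OpenAI::Client entry point and an API key in the environment:

require "openai"

# The client assembles the resource tree; use the accessor chain instead
# of calling Transcriptions.new(client:) yourself.
client = OpenAI::Client.new(api_key: ENV["OPENAI_API_KEY"])
transcriptions = client.audio.transcriptions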

Instance Method Details

#create(file:, model:, chunking_strategy: nil, include: nil, known_speaker_names: nil, known_speaker_references: nil, language: nil, prompt: nil, response_format: nil, temperature: nil, timestamp_granularities: nil, request_options: {}) ⇒ OpenAI::Models::Audio::Transcription, ...

See #create_streaming for the streaming counterpart.

Some parameter documentation has been truncated; see Models::Audio::TranscriptionCreateParams for full details.

Transcribes audio into the input language.

Parameters:

  • file (Pathname, StringIO, IO, String, OpenAI::FilePart)

    The audio file object (not file name) to transcribe, in one of these formats: fl…

  • model (String, Symbol, OpenAI::Models::AudioModel)

    ID of the model to use. The options are `gpt-4o-transcribe`, `gpt-4o-mini-transc…

  • chunking_strategy (Symbol, :auto, OpenAI::Models::Audio::TranscriptionCreateParams::ChunkingStrategy::VadConfig, nil)

    Controls how the audio is cut into chunks. When set to `"auto"`, the server firs…

  • include (Array<Symbol, OpenAI::Models::Audio::TranscriptionInclude>)

    Additional information to include in the transcription response.

  • known_speaker_names (Array<String>)

    Optional list of speaker names that correspond to the audio samples provided in…

  • known_speaker_references (Array<String>)

    Optional list of audio samples (as [data URLs](developer.mozilla.org/en-…

  • language (String)

    The language of the input audio. Supplying the input language in [ISO-639-1](htt…

  • prompt (String)

    An optional text to guide the model’s style or continue a previous audio segment…

  • response_format (Symbol, OpenAI::Models::AudioResponseFormat)

    The format of the output, in one of these options: `json`, `text`, `srt`, `verbo…

  • temperature (Float)

    The sampling temperature, between 0 and 1. Higher values like 0.8 will make the…

  • timestamp_granularities (Array<Symbol, OpenAI::Models::Audio::TranscriptionCreateParams::TimestampGranularity>)

    The timestamp granularities to populate for this transcription. `response_format…

  • request_options (OpenAI::RequestOptions, Hash{Symbol=>Object}, nil)

Returns:

  • (OpenAI::Models::Audio::Transcription, ...)

See Also:

  • Models::Audio::TranscriptionCreateParams
  • #create_streaming



# File 'lib/openai/resources/audio/transcriptions.rb', line 44

def create(params)
  parsed, options = OpenAI::Audio::TranscriptionCreateParams.dump_request(params)
  if parsed[:stream]
    message = "Please use `#create_streaming` for the streaming use case."
    raise ArgumentError.new(message)
  end
  @client.request(
    method: :post,
    path: "audio/transcriptions",
    headers: {"content-type" => "multipart/form-data"},
    body: parsed,
    model: OpenAI::Models::Audio::TranscriptionCreateResponse,
    options: options
  )
end
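
For illustration, a minimal non-streaming call; the file name and model choice here are placeholders, and the client is assumed to be built as in the constructor sketch above.

require "pathname"

# One-shot transcription of a local file; the default `json` response
# format parses into a model with a #text accessor.
transcription = client.audio.transcriptions.create(
  file: Pathname("speech.mp3"),
  model: "whisper-1"
)
puts transcription.text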

#create_streaming(file:, model:, chunking_strategy: nil, include: nil, known_speaker_names: nil, known_speaker_references: nil, language: nil, prompt: nil, response_format: nil, temperature: nil, timestamp_granularities: nil, request_options: {}) ⇒ OpenAI::Internal::Stream<OpenAI::Models::Audio::TranscriptionTextSegmentEvent, OpenAI::Models::Audio::TranscriptionTextDeltaEvent, OpenAI::Models::Audio::TranscriptionTextDoneEvent>

See #create for the non-streaming counterpart.

Some parameter documentation has been truncated; see Models::Audio::TranscriptionCreateParams for full details.

Transcribes audio into the input language.

Parameters:

  • file (Pathname, StringIO, IO, String, OpenAI::FilePart)

    The audio file object (not file name) to transcribe, in one of these formats: fl…

  • model (String, Symbol, OpenAI::Models::AudioModel)

    ID of the model to use. The options are `gpt-4o-transcribe`, `gpt-4o-mini-transc…

  • chunking_strategy (Symbol, :auto, OpenAI::Models::Audio::TranscriptionCreateParams::ChunkingStrategy::VadConfig, nil)

    Controls how the audio is cut into chunks. When set to `"auto"`, the server firs…

  • include (Array<Symbol, OpenAI::Models::Audio::TranscriptionInclude>)

    Additional information to include in the transcription response.

  • known_speaker_names (Array<String>)

    Optional list of speaker names that correspond to the audio samples provided in…

  • known_speaker_references (Array<String>)

    Optional list of audio samples (as [data URLs](developer.mozilla.org/en-…

  • language (String)

    The language of the input audio. Supplying the input language in [ISO-639-1](htt…

  • prompt (String)

    An optional text to guide the model’s style or continue a previous audio segment…

  • response_format (Symbol, OpenAI::Models::AudioResponseFormat)

    The format of the output, in one of these options: `json`, `text`, `srt`, `verbo…

  • temperature (Float)

    The sampling temperature, between 0 and 1. Higher values like 0.8 will make the…

  • timestamp_granularities (Array<Symbol, OpenAI::Models::Audio::TranscriptionCreateParams::TimestampGranularity>)

    The timestamp granularities to populate for this transcription. `response_format…

  • request_options (OpenAI::RequestOptions, Hash{Symbol=>Object}, nil)

Returns:

  • (OpenAI::Internal::Stream<OpenAI::Models::Audio::TranscriptionTextSegmentEvent, OpenAI::Models::Audio::TranscriptionTextDeltaEvent, OpenAI::Models::Audio::TranscriptionTextDoneEvent>)

See Also:

  • Models::Audio::TranscriptionCreateParams
  • #create



# File 'lib/openai/resources/audio/transcriptions.rb', line 97

def create_streaming(params)
  parsed, options = OpenAI::Audio::TranscriptionCreateParams.dump_request(params)
  unless parsed.fetch(:stream, true)
    message = "Please use `#create` for the non-streaming use case."
    raise ArgumentError.new(message)
  end
  parsed.store(:stream, true)
  @client.request(
    method: :post,
    path: "audio/transcriptions",
    headers: {"content-type" => "multipart/form-data", "accept" => "text/event-stream"},
    body: parsed,
    stream: OpenAI::Internal::Stream,
    model: OpenAI::Audio::TranscriptionStreamEvent,
    options: options
  )
end
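
For illustration, a sketch of consuming the event stream, reusing the client and file from the examples above. The event classes come from the return type in the signature; iterating the stream with #each and reading #delta off delta events are assumptions based on the gem's conventions.

stream = client.audio.transcriptions.create_streaming(
  file: Pathname("speech.mp3"),
  model: "gpt-4o-transcribe"
)

# Print incremental text as delta events arrive; the done event marks
# the end of the transcript.
stream.each do |event|
  case event
  when OpenAI::Models::Audio::TranscriptionTextDeltaEvent
    print event.delta
  when OpenAI::Models::Audio::TranscriptionTextDoneEvent
    puts
  end
end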