Ruby OpenAI

Gem Version GitHub license CircleCI Build Status Maintainability

Use the OpenAI API with Ruby! 🤖❤️

Generate text with ChatGPT, transcribe and translate audio with Whisper, or create images with DALL·E...

Check out Ruby AI Builders on Discord!


Add this line to your application's Gemfile:

gem "ruby-openai"

And then execute:

$ bundle install

Gem install

Or install with:

$ gem install ruby-openai

and require with:

require "openai"


The ::Ruby::OpenAI module has been removed and all classes have been moved under the top level ::OpenAI module. To upgrade, change require 'ruby/openai' to require 'openai' and change all references to Ruby::OpenAI to OpenAI.



For a quick test you can pass your token directly to a new client:

client = "access_token_goes_here")

With Config

For a more robust setup, you can configure the gem with your API keys, for example in an openai.rb initializer file. Never hardcode secrets into your codebase - instead use something like dotenv to pass the keys safely into your environments.

OpenAI.configure do |config|
    config.access_token = ENV.fetch('OPENAI_ACCESS_TOKEN')
    config.organization_id = ENV.fetch('OPENAI_ORGANIZATION_ID') # Optional.

Then you can create a client like this:

client =

Custom timeout or base URI

The default timeout for any OpenAI request is 120 seconds. You can change that passing the request_timeout when initializing the client. You can also change the base URI used for all requests, eg. to use observability tools like Helicone:

    client =
        access_token: "access_token_goes_here",
        uri_base: "",
        request_timeout: 240

or when configuring the gem:

    OpenAI.configure do |config|
        config.access_token = ENV.fetch("OPENAI_ACCESS_TOKEN")
        config.organization_id = ENV.fetch("OPENAI_ORGANIZATION_ID") # Optional
        config.uri_base = "" # Optional
        config.request_timeout = 240 # Optional


There are different models that can be used to generate text. For a full list and to retrieve information about a single models:

client.models.retrieve(id: "text-ada-001")



ChatGPT is a model that can be used to generate text in a conversational style. You can use it to generate a response to a sequence of messages:

response =
    parameters: {
        model: "gpt-3.5-turbo", # Required.
        messages: [{ role: "user", content: "Hello!"}], # Required.
        temperature: 0.7,
puts response.dig("choices", 0, "message", "content")
# => "Hello! How may I assist you today?"


Hit the OpenAI API for a completion using other GPT-3 models:

response = client.completions(
    parameters: {
        model: "text-davinci-001",
        prompt: "Once upon a time",
        max_tokens: 5
puts response["choices"].map { |c| c["text"] }
# => [", there lived a great"]


Send a string and some instructions for what to do to the string:

response = client.edits(
    parameters: {
        model: "text-davinci-edit-001",
        input: "What day of the wek is it?",
        instruction: "Fix the spelling mistakes"
puts response.dig("choices", 0, "text")
# => What day of the week is it?


You can use the embeddings endpoint to get a vector of numbers representing an input. You can then compare these vectors for different inputs to efficiently check how similar the inputs are.

    parameters: {
        model: "babbage-similarity",
        input: "The food was delicious and the waiter..."


Put your data in a .jsonl file like this:

{"prompt":"Overjoyed with my new phone! ->", "completion":" positive"}
{"prompt":"@lakers disappoint for a third straight night ->", "completion":" negative"}

and pass the path to client.files.upload to upload it to OpenAI, and then interact with it:

client.files.upload(parameters: { file: "path/to/sentiment.jsonl", purpose: "fine-tune" })
client.files.retrieve(id: 123)
client.files.content(id: 123)
client.files.delete(id: 123)


Upload your fine-tuning data in a .jsonl file as above and get its ID:

response = client.files.upload(parameters: { file: "path/to/sentiment.jsonl", purpose: "fine-tune" })
file_id = JSON.parse(response.body)["id"]

You can then use this file ID to create a fine-tune model:

response = client.finetunes.create(
    parameters: {
    training_file: file_id,
    model: "text-ada-001"
fine_tune_id = JSON.parse(response.body)["id"]

That will give you the fine-tune ID. If you made a mistake you can cancel the fine-tune model before it is processed:

client.finetunes.cancel(id: fine_tune_id)

You may need to wait a short time for processing to complete. Once processed, you can use list or retrieve to get the name of the fine-tuned model:

response = client.finetunes.retrieve(id: fine_tune_id)
fine_tuned_model = JSON.parse(response.body)["fine_tuned_model"]

This fine-tuned model name can then be used in completions:

response = client.completions(
    parameters: {
        model: fine_tuned_model,
        prompt: "I love Mondays!"
JSON.parse(response.body)["choices"].map { |c| c["text"] }

You can delete the fine-tuned model when you are done with it:

client.finetunes.delete(fine_tuned_model: fine_tuned_model)

Image Generation

Generate an image using DALL·E! The size of any generated images must be one of 256x256, 512x512 or 1024x1024 - if not specified the image will default to 1024x1024.

response = client.images.generate(parameters: { prompt: "A baby sea otter cooking pasta wearing a hat of some sort", size: "256x256" })
puts response.dig("data", 0, "url")
# => ""


Image Edit

Fill in the transparent part of an image, or upload a mask with transparent sections to indicate the parts of an image that can be changed according to your prompt...

response = client.images.edit(parameters: { prompt: "A solid red Ruby on a blue background", image: "image.png", mask: "mask.png" })
puts response.dig("data", 0, "url")
# => ""


Image Variations

Create n variations of an image.

response = client.images.variations(parameters: { image: "image.png", n: 2 })
puts response.dig("data", 0, "url")
# => ""

Ruby Ruby


Pass a string to check if it violates OpenAI's Content Policy:

response = client.moderations(parameters: { input: "I'm worried about that." })
puts response.dig("results", 0, "category_scores", "hate")
# => 5.505014632944949e-05


Whisper is a speech to text model that can be used to generate text based on audio files:


The translations API takes as input the audio file in any of the supported languages and transcribes the audio into English.

response = client.translate(
    parameters: {
        model: "whisper-1",
        file:'path_to_file', 'rb'),
puts response.parsed_response['text']
# => "Translation of the text"


The transcriptions API takes as input the audio file you want to transcribe and returns the text in the desired output file format.

response = client.transcribe(
    parameters: {
        model: "whisper-1",
        file:'path_to_file', 'rb'),
puts response.parsed_response['text']
# => "Transcription of the text"


After checking out the repo, run bin/setup to install dependencies. You can run bin/console for an interactive prompt that will allow you to experiment.

To install this gem onto your local machine, run bundle exec rake install.


First run the specs without VCR so they actually hit the API. This will cost about 2 cents. You'll need to add your OPENAI_ACCESS_TOKEN= in .env.

  NO_VCR=true bundle exec rspec

Then update the version number in version.rb, update, run bundle install to update Gemfile.lock, and then run bundle exec rake release, which will create a git tag for the version, push git commits and tags, and push the .gem file to


Bug reports and pull requests are welcome on GitHub at This project is intended to be a safe, welcoming space for collaboration, and contributors are expected to adhere to the code of conduct.


The gem is available as open source under the terms of the MIT License.

Code of Conduct

Everyone interacting in the Ruby OpenAI project's codebases, issue trackers, chat rooms and mailing lists is expected to follow the code of conduct.