RubyDSP | Documentation
🚧 Status: This (HOBBY) project is currently in early development. It is hopefully functional, but API changes are expected. There is no warranty regarding anything 🗿.
RubyDSP is an audio processing and DSP Ruby gem. Ultimately, it aims to be librosa-wannabe for Ruby in some far utopian future which might never come. It uses C++ under the hood, utilizing miniaudio and Rice.
I made this gem to try Rice, as I would like to be able to bring some C++ speed to Ruby, oh my beloved....
Features
Fast: Basically all of the code is written in C++. While not extremely optimized currently, it still absolutely shreds native Ruby.
Format Agnostic Loading: Automatically decodes standard audio formats (WAV, MP3, FLAC) via
miniaudio.Note: While the loading of these formats is supported,
miniaudioencodes only in.wav. While other encodings might be considered in the future, they would require more dependencies and thus are not available right now.Zero-Dependency Native Build: No need to install
ffmpegorlibsndfileon your system.YARD Support: Includes pure-Ruby stubs (in
stubs, duh) for IDE autocomplete and inline documentation.
Installation
Add this line to your application's Gemfile:
gem 'ruby_dsp'
And then execute:
$ bundle install
Or install it yourself directly via:
$ gem install ruby_dsp
(Note: Installing this gem requires a modern C++ compiler, as it builds the native extensions directly on your machine upon installation. It requires Ruby 3.0+).
Quick Start
Here is a quick look at what you can do with a loaded AudioTrack:
require 'ruby_dsp'
# Load an audio file
track = RubyDSP::AudioTrack.new("raw_vocals.wav")
puts track
# => ['raw_vocals.wav', 12.450s duration, 2 channel(s), 48000Hz sample rate]
# Do stuff!
track.to_mono! # Averages channels into mono
track.resample!(44100) # Linearly resamples to target rate
track.trim_silence!(-60.0) # Strips leading/trailing silence below -60dB
# Edit & Manipulate
track.normalize!(-1.0) # Scales audio to target peak dBFS
track.fade_in!(0.5) # Adds a 0.5s linear fade-in
track.fade_out!(0.5) # Adds a 0.5s linear fade-out
track.pad!(1.0, 1.0) # Pads 1s of silence to both head and tail
track.pad_to_duration!(15.0) # Centers audio evenly into a 15s window
# Analysis & Math
puts "Peak Amp: #{track.peak_amp}"
puts "Overall RMS: #{track.rms}"
puts "Overall ZCR: #{track.zcr}"
# You can also get framed analysis for time-series data:
# framed_rms_data = track.framed_rms(2048, 512) also works
framed_rms_data = track.framed_rms(frame_length: 2048, hop_length: 512)
# Save the results
track.save_track("processed_vocals.wav")
Development
If you want to clone the repo and work on C++ guts, start with:
- Clone the repo and run
bundle installto grab the development dependencies. - Run
rake test— this will automatically compile the C++extconf.rband run the Minitest suite. - Run
rake doc:generate ; rake doc:server— this will compile the YARD stubs into HTML and boot a live-reloading local web server athttp://localhost:8808so you can read the docs!
License
The gem is available as open source under the terms of the MIT License.
Cheers! - RC