Fast data processing with Ruby and Apache Arrow

I introduced Ruby and Apache Arrow integration including the “super fast large data interchange and processing” Apache Arrow feature at RubyKaigi Takeout 2021.

This talk introduces how we can use the “super fast large data interchange and processing” Apache Arrow feature in Ruby. Here are some use cases:

  • Fast data retrieval (fast ((pluck))) from DB such as MySQL and PostgreSQL for batch processes in a Ruby on Rails application

  • Fast data interchange with JavaScript for dynamic visualization in a Ruby on Rails application

  • Fast OLAP with in-process DB such as DuckDB and Apache Arrow DataFusion in a Ruby on Rails application or irb session

License

Slide

CC BY-SA 4.0

Use the followings for notation of the author:

* Sutou Kouhei

CC BY-SA 4.0

Author: ClearCode Inc.

It is used in page header and some pages in the slide.

For author

Show

rake

Publish

rake publish

For viewers

Install

gem install rabbit-slide-kou-rubykaigi-2022

Show

rabbit rabbit-slide-kou-rubykaigi-2022.gem