Athena input plugin for Embulk

Athena input plugin for Embulk loads records from Amazon Athena (AWS).

Overview

  • Plugin type: input
  • Resume supported: no
  • Cleanup supported: no
  • Guess supported: no

Configuration

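  • database: name of the Athena database to query (string, required)
  • athena_url: JDBC URL of the Athena endpoint for your region (string, required)
  • s3_staging_dir: S3 location where Athena writes query results (string, required)
  • access_key: AWS access key ID (string, required)
  • secret_key: AWS secret access key (string, required)
  • query: SQL query to run on Athena (string, required)
  • columns: name and type of each column returned by the query (array, required)
  • options: additional key-value options (hash, default: {})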

Example

in:
  type: athena
  database: log_test
  athena_url: "jdbc:awsathena://athena.ap-northeast-1.amazonaws.com:443"
  s3_staging_dir: "s3://aws-athena-query-results-11111111111-ap-northeast-1/"
  access_key: ""
  secret_key: ""
  query: |
    select uid, created_at from log_test.sample
  columns:
    - {name: uid, type: string}
    - {name: created_at, type: timestamp}
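
To keep credentials out of the config file, one option is Embulk's Liquid template support: a config file whose name ends in .yml.liquid can read environment variables through `env`. The following is a minimal sketch, not a tested configuration. The environment variable names AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY are assumptions (use whatever names your environment defines), and the stdout output is included only so the result can be inspected.

```yaml
# config.yml.liquid (the .liquid suffix enables template expansion)
in:
  type: athena
  database: log_test
  athena_url: "jdbc:awsathena://athena.ap-northeast-1.amazonaws.com:443"
  s3_staging_dir: "s3://aws-athena-query-results-11111111111-ap-northeast-1/"
  # Assumed variable names; values are substituted when Embulk loads the file.
  access_key: "{{ env.AWS_ACCESS_KEY_ID }}"
  secret_key: "{{ env.AWS_SECRET_ACCESS_KEY }}"
  query: |
    select uid, created_at from log_test.sample
  columns:
    - {name: uid, type: string}
    - {name: created_at, type: timestamp}
out:
  # Built-in Embulk output that prints records to standard output.
  type: stdout
```

With the variables exported, `embulk preview config.yml.liquid` should show a sample of the records, and `embulk run config.yml.liquid` runs the full load.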

Build

$ docker-compose up -d
$ docker-compose exec embulk bash
embulk>$ ./gradlew gem  # add -t to watch for file changes and rebuild continuously