Hakoy

Parse and organize CSV data into timestamp-sliced directories.

Usage

conf = {
  db_dir:        'path/to/results', # directory where results are stored
  output_format: 'csv', # default
  timestamp_key: 'timestamp',
  required_keys: [
    'order_id',
    'customer_id',
    'product_id',
    'price',
    'timestamp'
  ]
}

# `.()` is Ruby shorthand for `.call`
Hakoy.('data/order.csv', conf)
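
To make "timestamp-sliced" concrete, here is a minimal sketch of how a row's timestamp could be mapped to a directory path. It is not Hakoy's actual implementation: the `sliced_path` helper and the year/month/day layout are assumptions made for illustration only.

```ruby
require 'time'

# Hypothetical helper, not part of Hakoy's API.
# Assumes a year/month/day layout under db_dir; the real layout may differ.
def sliced_path(db_dir, timestamp, ext = 'csv')
  t = Time.parse(timestamp)
  File.join(db_dir, t.strftime('%Y'), t.strftime('%m'), "#{t.strftime('%d')}.#{ext}")
end

puts sliced_path('db', '2014-06-01 12:30:00')
```

Under this assumed layout, all rows from the same day end up in the same file, so a day's worth of orders can be read without scanning the whole data set.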

TODO

  • Queue up multiple rows per output file and write them in one pass; currently every row triggers its own file open/close.
  • Improve unique key generation; the current algorithm is too primitive.
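
One possible approach to the first item is sketched below. It is only an illustration, not existing Hakoy code: the `RowBuffer` class, its `limit` parameter, and the flush policy are all hypothetical. Rows are grouped by target file in memory and appended with a single open/close per file once the buffer fills up.

```ruby
require 'csv'
require 'fileutils'

# Hypothetical buffer, not part of Hakoy.
# Collects rows per target file and appends each group in one open/close.
class RowBuffer
  def initialize(limit = 1000)
    @limit = limit                          # total buffered rows before an automatic flush
    @rows  = Hash.new { |h, k| h[k] = [] }  # path => array of rows
    @count = 0
  end

  # Queue a row (an array of fields) for the given file path.
  def add(path, row)
    @rows[path] << row
    @count += 1
    flush if @count >= @limit
  end

  # Write all queued rows, opening each file once in append mode.
  def flush
    @rows.each do |path, rows|
      FileUtils.mkdir_p(File.dirname(path))
      CSV.open(path, 'a') { |csv| rows.each { |row| csv << row } }
    end
    @rows.clear
    @count = 0
  end
end
```

A caller would need to invoke `flush` once more after the last row, since rows below the limit stay in memory until then.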