Hi! - Don't use this gem yet, it's still very much being developed.

Jules

A data mining scraper with a high level of abstraction. It's capable of finding lists, menus, titles and contact data.

Jules uses semantics, patterns and NLP to find data, so you don't have to specify exactly where it is. You'll no longer have to make different scrapers for every new website you want to scrape.

gem 'jules'

Examples

The following examples show you how to use Jules.

Lists

html = File.open('web-page.html', 'rb') { |f| f.read }
j = Jules::HTML(html)
lists = j.lists

The following example gets lists only when they contain certain data types.

j = Jules::HTML(html)
lists = j.lists(
  required: [:date, :price],
  optional: [:download_link]
)

Jules Abstractions

  • Lists
  • Titles
  • Menus

Jules Data Types

  • Date :date
  • Price :price
  • Filesize :filesize
  • Download url :download_url
  • Telephone number :telephone_number