Hi! - Don't use this gem yet, it's still very much being developed.
Jules
A data mining scraper with a high level of abstraction. It's capable of finding lists, menus, titles and contact data.
Jules uses semantics, patterns and NLP to find data, so you don't have to specify exactly where it is. You'll no longer have to make different scrapers for every new website you want to scrape.
gem 'jules'
Examples
The following examples show you how to use Jules.
Lists
html = File.open('web-page.html', 'rb') { |f| f.read }
j = Jules::HTML(html)
lists = j.lists
The following example gets lists only when they contain certain data types.
j = Jules::HTML(html)
lists = j.lists(
required: [:date, :price],
optional: [:download_link]
)
Jules Abstractions
- Lists
- Titles
- Menus
Jules Data Types
- Date :date
- Price :price
- Filesize :filesize
- Download url :download_url
- Telephone number :telephone_number