ATTENTION

This library is under heavy development and currently not working properly. Please be patient for a usable version.

SemanticCrawler

This project encapsulates data gathering from different sources. It simplifies the extension of internal data with public available knowledge. The major strength is the use of semantic technology to bypass complex NLP (natural language processing).

Supported Sources

TODO

  • LinkedGeoData - LGD(see linkedgeodata.org)

  • Freebase

  • Geonames

  • DBPedia

  • Different Government Sources

Installation

$ git clone git://github.com/obale/semantic_crawler.git
$ cd semantic_crawler
$ bundle install
$ rake build
$ rake install pkg/semantic-crawler-*.gem

If the library is stable enough it will be published at rubygems.org. That simplifies the installation.

You can add this library also as dependency in your Gemfile:

gem "semantic-crawler", :git => "git://github.com/obale/semantic_crawler.git"                   # for the master branch or
gem "semantic-crawler", :git => "git://github.com/obale/semantic_crawler.git", :tags => "xyz"   # for the xyz tag

Examples

Fetch Factbook information about Austria:

>> require "semantic_crawler"
>> austria = SemanticCrawler::Factbook::Country.new("austria")
>> puts austria.background
>> puts austria.climate

Changelog

see CHANGELOG.rdoc

Tested with

  • Ruby 1.8.7-p358 and Rails 3.2.2

  • Ruby 1.9.3-p125 and Rails 3.2.2

License

© 2012 by Alex Oberhauser for Sigimera, published under MIT license.

Warranty

This software is provided “as is” and without any express or implied warranties, including, without limitation, the implied warranties of merchantibility and fitness for a particular purpose.