Class: GovukSeedCrawler::Indexer
- Inherits:
-
Object
- Object
- GovukSeedCrawler::Indexer
- Defined in:
- lib/govuk_seed_crawler/indexer.rb
Instance Attribute Summary collapse
-
#urls ⇒ Object
readonly
Returns the value of attribute urls.
Instance Method Summary collapse
-
#initialize(site_root) ⇒ Indexer
constructor
A new instance of Indexer.
Constructor Details
#initialize(site_root) ⇒ Indexer
Returns a new instance of Indexer.
7 8 9 10 11 12 13 14 15 16 |
# File 'lib/govuk_seed_crawler/indexer.rb', line 7 def initialize(site_root) raise "No site_root defined" unless site_root GovukSeedCrawler.logger.info("Retrieving list of URLs for #{site_root}") sitemap = SitemapParser.new("#{site_root}/sitemap.xml", { recurse: true }) @urls = sitemap.to_a GovukSeedCrawler.logger.info("Found #{@urls.count} URLs") end |
Instance Attribute Details
#urls ⇒ Object (readonly)
Returns the value of attribute urls.
5 6 7 |
# File 'lib/govuk_seed_crawler/indexer.rb', line 5 def urls @urls end |