Class: GovukSeedCrawler::Indexer

Inherits:
Object
  • Object
show all
Defined in:
lib/govuk_seed_crawler/indexer.rb

Instance Attribute Summary collapse

Instance Method Summary collapse

Constructor Details

#initialize(site_root) ⇒ Indexer

Returns a new instance of Indexer.



7
8
9
10
11
12
13
14
15
16
# File 'lib/govuk_seed_crawler/indexer.rb', line 7

def initialize(site_root)
  raise "No site_root defined" unless site_root

  GovukSeedCrawler.logger.info("Retrieving list of URLs for #{site_root}")

  sitemap = SitemapParser.new("#{site_root}/sitemap.xml", { recurse: true })
  @urls = sitemap.to_a

  GovukSeedCrawler.logger.info("Found #{@urls.count} URLs")
end

Instance Attribute Details

#urlsObject (readonly)

Returns the value of attribute urls.



5
6
7
# File 'lib/govuk_seed_crawler/indexer.rb', line 5

def urls
  @urls
end