Module: OcrFile::ImageEngines::Pdftoppm

Extended by:
Pdftoppm
Included in:
Pdftoppm
Defined in:
lib/ocr-file/image_engines/pdftoppm.rb

Instance Method Summary collapse

Instance Method Details

#images_from_pdf(pdf_path, save_path, filename: 'image', filetype: 'png', quality: 100, dpi: 300, verbose: true) ⇒ Object

TODO: other options www.xpdfreader.com/pdftoppm-man.html password −mono Generate a monochrome PBM file (instead of an RGB PPM file). −gray Generate a grayscale PGM file (instead of an RGB PPM file). −cmyk Generate a CMYK PAM file (instead of an RGB PPM file).



12
13
14
15
16
17
18
19
20
21
22
23
24
# File 'lib/ocr-file/image_engines/pdftoppm.rb', line 12

def images_from_pdf(pdf_path, save_path, filename: 'image', filetype: 'png', quality: 100, dpi: 300, verbose: true)
  print 'Generating screenshots of each PDF page ... '

  if filetype == 'jpg'
    `pdftoppm -jpeg -jpegopt quality=#{quality} -r #{dpi} "#{pdf_path}" "#{save_path}/#{filename}"`
  else
    `pdftoppm -#{filetype} -r #{dpi} #{pdf_path} #{save_path}/#{filename}`
  end

  puts 'Complete!'

  OcrFile::FileHelpers.fetch_temp_image_paths(save_path, filename, filetype)
end