Module: OcrFile::ImageEngines::Pdftoppm
Instance Method Summary
#images_from_pdf(pdf_path, save_path, filename: 'image', filetype: 'png', quality: 100, dpi: 300, verbose: true) ⇒ Object
TODO: support other pdftoppm options (see www.xpdfreader.com/pdftoppm-man.html), e.g. password and -mono (generate a monochrome PBM file instead of an RGB PPM file).
Instance Method Details
#images_from_pdf(pdf_path, save_path, filename: 'image', filetype: 'png', quality: 100, dpi: 300, verbose: true) ⇒ Object
TODO: support other pdftoppm options (see www.xpdfreader.com/pdftoppm-man.html):
password
-mono  Generate a monochrome PBM file (instead of an RGB PPM file).
-gray  Generate a grayscale PGM file (instead of an RGB PPM file).
-cmyk  Generate a CMYK PAM file (instead of an RGB PPM file).
# File 'lib/ocr-file/image_engines/pdftoppm.rb', line 12

def images_from_pdf(pdf_path, save_path, filename: 'image', filetype: 'png', quality: 100, dpi: 300, verbose: true)
  # NOTE: `verbose` is accepted but not consulted; the progress messages always print.
  print 'Generating screenshots of each PDF page ... '
  if filetype == 'jpg'
    `pdftoppm -jpeg -jpegopt quality=#{quality} -r #{dpi} "#{pdf_path}" "#{save_path}/#{filename}"`
  else
    # Quote the paths here as well, so that paths containing spaces work.
    `pdftoppm -#{filetype} -r #{dpi} "#{pdf_path}" "#{save_path}/#{filename}"`
  end
  puts 'Complete!'
  OcrFile::FileHelpers.fetch_temp_image_paths(save_path, filename, filetype)
end
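The TODO on this method mentions further pdftoppm options (password, -mono, -gray). One possible way to support them is sketched below. This is only an illustration, not part of OcrFile: the helper name `pdftoppm_args`, the `COLOR_FLAGS` constant, and the `color:` and `password:` keywords are all hypothetical. It assumes that `-upw` is the user-password flag and that a colour-mode flag replaces the image-format flag, as the man page text quoted in the TODO suggests. `-cmyk` is left out because not every pdftoppm build accepts it.

```ruby
# Hypothetical helper (not part of OcrFile): builds the pdftoppm argument
# list as an array, so it can be passed to Kernel#system without a shell.
COLOR_FLAGS = { mono: '-mono', gray: '-gray' }.freeze

def pdftoppm_args(pdf_path, output_prefix, filetype: 'png', quality: 100,
                  dpi: 300, color: nil, password: nil)
  args = ['pdftoppm']

  if color
    # Assumption: a colour mode selects PBM/PGM output instead of PNG/JPEG.
    args << COLOR_FLAGS.fetch(color) do
      raise ArgumentError, "unsupported color mode: #{color.inspect}"
    end
  elsif filetype == 'jpg'
    args.push('-jpeg', '-jpegopt', "quality=#{quality}")
  else
    args << "-#{filetype}"
  end

  # Assumption: -upw supplies the PDF user password.
  args.push('-upw', password) if password

  args.push('-r', dpi.to_s, pdf_path, output_prefix)
end

# Example: grayscale output for a path that contains a space.
example_args = pdftoppm_args('my file.pdf', 'out/image', color: :gray)
p example_args
```

Passing the array to `system(*example_args)` would run the command without shell interpolation, so paths with spaces or quotes need no manual escaping. Which output files a colour mode produces depends on the installed pdftoppm, so a lookup such as `fetch_temp_image_paths` would need a matching file extension.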