Class: Pdfcrowd::PdfToHtmlClient
- Inherits:
-
Object
- Object
- Pdfcrowd::PdfToHtmlClient
- Defined in:
- lib/pdfcrowd.rb
Overview
Conversion from PDF to HTML.
Instance Method Summary collapse
-
#convertFile(file) ⇒ Object
Convert a local file.
-
#convertFileToFile(file, file_path) ⇒ Object
Convert a local file and write the result to a local file.
-
#convertFileToStream(file, out_stream) ⇒ Object
Convert a local file and write the result to an output stream.
-
#convertRawData(data) ⇒ Object
Convert raw data.
-
#convertRawDataToFile(data, file_path) ⇒ Object
Convert raw data to a file.
-
#convertRawDataToStream(data, out_stream) ⇒ Object
Convert raw data and write the result to an output stream.
-
#convertStream(in_stream) ⇒ Object
Convert the contents of an input stream.
-
#convertStreamToFile(in_stream, file_path) ⇒ Object
Convert the contents of an input stream and write the result to a local file.
-
#convertStreamToStream(in_stream, out_stream) ⇒ Object
Convert the contents of an input stream and write the result to an output stream.
-
#convertUrl(url) ⇒ Object
Convert a PDF.
-
#convertUrlToFile(url, file_path) ⇒ Object
Convert a PDF and write the result to a local file.
-
#convertUrlToStream(url, out_stream) ⇒ Object
Convert a PDF and write the result to an output stream.
-
#getConsumedCreditCount ⇒ Object
Get the number of credits consumed by the last conversion.
-
#getDebugLogUrl ⇒ Object
Get the URL of the debug log for the last conversion.
-
#getJobId ⇒ Object
Get the job id.
-
#getOutputSize ⇒ Object
Get the size of the output in bytes.
-
#getPageCount ⇒ Object
Get the number of pages in the output document.
-
#getRemainingCreditCount ⇒ Object
Get the number of conversion credits available in your account.
-
#getVersion ⇒ Object
Get the version details.
-
#initialize(user_name, api_key) ⇒ PdfToHtmlClient
constructor
Constructor for the Pdfcrowd API client.
-
#isZippedOutput ⇒ Object
A helper method to determine if the output file is a zip archive.
-
#setAuthor(author) ⇒ Object
Set the HTML author.
-
#setClientUserAgent(agent) ⇒ Object
Specifies the User-Agent HTTP header that the client library will use when interacting with the API.
-
#setConverterVersion(version) ⇒ Object
Set the converter version.
-
#setCssMode(mode) ⇒ Object
Specifies where the style sheets are stored.
-
#setCustomCss(css) ⇒ Object
Apply custom CSS to the output HTML document.
-
#setDebugLog(value) ⇒ Object
Turn on the debug logging.
-
#setDpi(dpi) ⇒ Object
Set the output graphics DPI.
-
#setFontMode(mode) ⇒ Object
Specifies where the fonts are stored.
-
#setForceZip(value) ⇒ Object
Enforces the zip output format.
-
#setHtmlNamespace(prefix) ⇒ Object
Add the specified prefix to all id and class attributes in the HTML content, creating a namespace for safe integration into another HTML document.
-
#setHttpProxy(proxy) ⇒ Object
A proxy server used by Pdfcrowd conversion process for accessing the source URLs with HTTP scheme.
-
#setHttpsProxy(proxy) ⇒ Object
A proxy server used by Pdfcrowd conversion process for accessing the source URLs with HTTPS scheme.
-
#setImageFormat(image_format) ⇒ Object
Specifies the format for the output images.
-
#setImageMode(mode) ⇒ Object
Specifies where the images are stored.
-
#setKeywords(keywords) ⇒ Object
Associate keywords with the HTML document.
-
#setPdfPassword(password) ⇒ Object
Password to open the encrypted PDF file.
-
#setPrintPageRange(pages) ⇒ Object
Set the page range to print.
-
#setProxy(host, port, user_name, password) ⇒ Object
Specifies an HTTP proxy that the API client library will use to connect to the internet.
-
#setRetryCount(count) ⇒ Object
Specifies the number of automatic retries when the 502 or 503 HTTP status code is received.
-
#setScaleFactor(factor) ⇒ Object
Set the scaling factor (zoom) for the main page area.
-
#setSplitLigatures(value) ⇒ Object
Converts ligatures, two or more letters combined into a single glyph, back into their individual ASCII characters.
-
#setSubject(subject) ⇒ Object
Set the HTML subject.
-
#setTag(tag) ⇒ Object
Tag the conversion with a custom value.
-
#setTitle(title) ⇒ Object
Set the HTML title.
-
#setType3Mode(mode) ⇒ Object
Sets the processing mode for handling Type 3 fonts.
-
#setUseHttp(value) ⇒ Object
Specifies if the client communicates over HTTP or HTTPS with Pdfcrowd API.
-
#setUserAgent(agent) ⇒ Object
Set a custom user agent HTTP header.
Constructor Details
#initialize(user_name, api_key) ⇒ PdfToHtmlClient
Constructor for the Pdfcrowd API client.
-
user_name- Your username at Pdfcrowd. -
api_key- Your API key.
5478 5479 5480 5481 5482 5483 5484 5485 5486 5487 |
# File 'lib/pdfcrowd.rb', line 5478 def initialize(user_name, api_key) @helper = ConnectionHelper.new(user_name, api_key) @fields = { 'input_format'=>'pdf', 'output_format'=>'html' } @file_id = 1 @files = {} @raw_data = {} end |
Instance Method Details
#convertFile(file) ⇒ Object
Convert a local file.
-
file- The path to a local file to convert. The file must exist and not be empty. -
Returns - Byte array containing the conversion output.
5543 5544 5545 5546 5547 5548 5549 5550 |
# File 'lib/pdfcrowd.rb', line 5543 def convertFile(file) if (!(File.file?(file) && !File.zero?(file))) raise Error.new(Pdfcrowd.(file, "convertFile", "pdf-to-html", "The file must exist and not be empty.", "convert_file"), 470); end @files['file'] = file @helper.post(@fields, @files, @raw_data) end |
#convertFileToFile(file, file_path) ⇒ Object
Convert a local file and write the result to a local file.
-
file- The path to a local file to convert. The file must exist and not be empty. -
file_path- The output file path. The string must not be empty. The converter generates an HTML or ZIP file. If ZIP file is generated, the file path must have a ZIP or zip extension.
5569 5570 5571 5572 5573 5574 5575 5576 5577 5578 5579 5580 5581 5582 5583 5584 5585 5586 5587 |
# File 'lib/pdfcrowd.rb', line 5569 def convertFileToFile(file, file_path) if (!(!file_path.nil? && !file_path.empty?)) raise Error.new(Pdfcrowd.(file_path, "convertFileToFile::file_path", "pdf-to-html", "The string must not be empty.", "convert_file_to_file"), 470); end if (!(isOutputTypeValid(file_path))) raise Error.new(Pdfcrowd.(file_path, "convertFileToFile::file_path", "pdf-to-html", "The converter generates an HTML or ZIP file. If ZIP file is generated, the file path must have a ZIP or zip extension.", "convert_file_to_file"), 470); end output_file = open(file_path, "wb") begin convertFileToStream(file, output_file) output_file.close() rescue Error => why output_file.close() FileUtils.rm(file_path) raise end end |
#convertFileToStream(file, out_stream) ⇒ Object
Convert a local file and write the result to an output stream.
-
file- The path to a local file to convert. The file must exist and not be empty. -
out_stream- The output stream that will contain the conversion output.
5556 5557 5558 5559 5560 5561 5562 5563 |
# File 'lib/pdfcrowd.rb', line 5556 def convertFileToStream(file, out_stream) if (!(File.file?(file) && !File.zero?(file))) raise Error.new(Pdfcrowd.(file, "convertFileToStream::file", "pdf-to-html", "The file must exist and not be empty.", "convert_file_to_stream"), 470); end @files['file'] = file @helper.post(@fields, @files, @raw_data, out_stream) end |
#convertRawData(data) ⇒ Object
Convert raw data.
-
data- The raw content to be converted. -
Returns - Byte array with the output.
5593 5594 5595 5596 |
# File 'lib/pdfcrowd.rb', line 5593 def convertRawData(data) @raw_data['file'] = data @helper.post(@fields, @files, @raw_data) end |
#convertRawDataToFile(data, file_path) ⇒ Object
Convert raw data to a file.
-
data- The raw content to be converted. -
file_path- The output file path. The string must not be empty. The converter generates an HTML or ZIP file. If ZIP file is generated, the file path must have a ZIP or zip extension.
5611 5612 5613 5614 5615 5616 5617 5618 5619 5620 5621 5622 5623 5624 5625 5626 5627 5628 5629 |
# File 'lib/pdfcrowd.rb', line 5611 def convertRawDataToFile(data, file_path) if (!(!file_path.nil? && !file_path.empty?)) raise Error.new(Pdfcrowd.(file_path, "convertRawDataToFile::file_path", "pdf-to-html", "The string must not be empty.", "convert_raw_data_to_file"), 470); end if (!(isOutputTypeValid(file_path))) raise Error.new(Pdfcrowd.(file_path, "convertRawDataToFile::file_path", "pdf-to-html", "The converter generates an HTML or ZIP file. If ZIP file is generated, the file path must have a ZIP or zip extension.", "convert_raw_data_to_file"), 470); end output_file = open(file_path, "wb") begin convertRawDataToStream(data, output_file) output_file.close() rescue Error => why output_file.close() FileUtils.rm(file_path) raise end end |
#convertRawDataToStream(data, out_stream) ⇒ Object
Convert raw data and write the result to an output stream.
-
data- The raw content to be converted. -
out_stream- The output stream that will contain the conversion output.
5602 5603 5604 5605 |
# File 'lib/pdfcrowd.rb', line 5602 def convertRawDataToStream(data, out_stream) @raw_data['file'] = data @helper.post(@fields, @files, @raw_data, out_stream) end |
#convertStream(in_stream) ⇒ Object
Convert the contents of an input stream.
-
in_stream- The input stream with source data. -
Returns - Byte array containing the conversion output.
5635 5636 5637 5638 |
# File 'lib/pdfcrowd.rb', line 5635 def convertStream(in_stream) @raw_data['stream'] = in_stream.read @helper.post(@fields, @files, @raw_data) end |
#convertStreamToFile(in_stream, file_path) ⇒ Object
Convert the contents of an input stream and write the result to a local file.
-
in_stream- The input stream with source data. -
file_path- The output file path. The string must not be empty. The converter generates an HTML or ZIP file. If ZIP file is generated, the file path must have a ZIP or zip extension.
5653 5654 5655 5656 5657 5658 5659 5660 5661 5662 5663 5664 5665 5666 5667 5668 5669 5670 5671 |
# File 'lib/pdfcrowd.rb', line 5653 def convertStreamToFile(in_stream, file_path) if (!(!file_path.nil? && !file_path.empty?)) raise Error.new(Pdfcrowd.(file_path, "convertStreamToFile::file_path", "pdf-to-html", "The string must not be empty.", "convert_stream_to_file"), 470); end if (!(isOutputTypeValid(file_path))) raise Error.new(Pdfcrowd.(file_path, "convertStreamToFile::file_path", "pdf-to-html", "The converter generates an HTML or ZIP file. If ZIP file is generated, the file path must have a ZIP or zip extension.", "convert_stream_to_file"), 470); end output_file = open(file_path, "wb") begin convertStreamToStream(in_stream, output_file) output_file.close() rescue Error => why output_file.close() FileUtils.rm(file_path) raise end end |
#convertStreamToStream(in_stream, out_stream) ⇒ Object
Convert the contents of an input stream and write the result to an output stream.
-
in_stream- The input stream with source data. -
out_stream- The output stream that will contain the conversion output.
5644 5645 5646 5647 |
# File 'lib/pdfcrowd.rb', line 5644 def convertStreamToStream(in_stream, out_stream) @raw_data['stream'] = in_stream.read @helper.post(@fields, @files, @raw_data, out_stream) end |
#convertUrl(url) ⇒ Object
Convert a PDF.
-
url- The address of the PDF to convert. The supported protocols are http:// and https://. -
Returns - Byte array containing the conversion output.
5493 5494 5495 5496 5497 5498 5499 5500 |
# File 'lib/pdfcrowd.rb', line 5493 def convertUrl(url) unless /(?i)^https?:\/\/.*$/.match(url) raise Error.new(Pdfcrowd.(url, "convertUrl", "pdf-to-html", "The supported protocols are http:// and https://.", "convert_url"), 470); end @fields['url'] = url @helper.post(@fields, @files, @raw_data) end |
#convertUrlToFile(url, file_path) ⇒ Object
Convert a PDF and write the result to a local file.
-
url- The address of the PDF to convert. The supported protocols are http:// and https://. -
file_path- The output file path. The string must not be empty. The converter generates an HTML or ZIP file. If ZIP file is generated, the file path must have a ZIP or zip extension.
5519 5520 5521 5522 5523 5524 5525 5526 5527 5528 5529 5530 5531 5532 5533 5534 5535 5536 5537 |
# File 'lib/pdfcrowd.rb', line 5519 def convertUrlToFile(url, file_path) if (!(!file_path.nil? && !file_path.empty?)) raise Error.new(Pdfcrowd.(file_path, "convertUrlToFile::file_path", "pdf-to-html", "The string must not be empty.", "convert_url_to_file"), 470); end if (!(isOutputTypeValid(file_path))) raise Error.new(Pdfcrowd.(file_path, "convertUrlToFile::file_path", "pdf-to-html", "The converter generates an HTML or ZIP file. If ZIP file is generated, the file path must have a ZIP or zip extension.", "convert_url_to_file"), 470); end output_file = open(file_path, "wb") begin convertUrlToStream(url, output_file) output_file.close() rescue Error => why output_file.close() FileUtils.rm(file_path) raise end end |
#convertUrlToStream(url, out_stream) ⇒ Object
Convert a PDF and write the result to an output stream.
-
url- The address of the PDF to convert. The supported protocols are http:// and https://. -
out_stream- The output stream that will contain the conversion output.
5506 5507 5508 5509 5510 5511 5512 5513 |
# File 'lib/pdfcrowd.rb', line 5506 def convertUrlToStream(url, out_stream) unless /(?i)^https?:\/\/.*$/.match(url) raise Error.new(Pdfcrowd.(url, "convertUrlToStream::url", "pdf-to-html", "The supported protocols are http:// and https://.", "convert_url_to_stream"), 470); end @fields['url'] = url @helper.post(@fields, @files, @raw_data, out_stream) end |
#getConsumedCreditCount ⇒ Object
Get the number of credits consumed by the last conversion.
-
Returns - The number of credits.
5894 5895 5896 |
# File 'lib/pdfcrowd.rb', line 5894 def getConsumedCreditCount() return @helper.getConsumedCreditCount() end |
#getDebugLogUrl ⇒ Object
Get the URL of the debug log for the last conversion.
-
Returns - The link to the debug log.
5879 5880 5881 |
# File 'lib/pdfcrowd.rb', line 5879 def getDebugLogUrl() return @helper.getDebugLogUrl() end |
#getJobId ⇒ Object
Get the job id.
-
Returns - The unique job identifier.
5900 5901 5902 |
# File 'lib/pdfcrowd.rb', line 5900 def getJobId() return @helper.getJobId() end |
#getOutputSize ⇒ Object
Get the size of the output in bytes.
-
Returns - The count of bytes.
5912 5913 5914 |
# File 'lib/pdfcrowd.rb', line 5912 def getOutputSize() return @helper.getOutputSize() end |
#getPageCount ⇒ Object
Get the number of pages in the output document.
-
Returns - The page count.
5906 5907 5908 |
# File 'lib/pdfcrowd.rb', line 5906 def getPageCount() return @helper.getPageCount() end |
#getRemainingCreditCount ⇒ Object
Get the number of conversion credits available in your account. This method can only be called after a call to one of the convertXtoY methods. The returned value can differ from the actual count if you run parallel conversions. The special value 999999 is returned if the information is not available.
-
Returns - The number of credits.
5888 5889 5890 |
# File 'lib/pdfcrowd.rb', line 5888 def getRemainingCreditCount() return @helper.getRemainingCreditCount() end |
#getVersion ⇒ Object
Get the version details.
-
Returns - API version, converter version, and client version.
5918 5919 5920 |
# File 'lib/pdfcrowd.rb', line 5918 def getVersion() return "client " + CLIENT_VERSION + ", API v2, converter " + @helper.getConverterVersion() end |
#isZippedOutput ⇒ Object
A helper method to determine if the output file is a zip archive. The output of the conversion may be either an HTML file or a zip file containing the HTML and its external assets.
-
Returns - True if the conversion output is a zip file, otherwise False.
5819 5820 5821 |
# File 'lib/pdfcrowd.rb', line 5819 def isZippedOutput() @fields.fetch('image_mode', '') == 'separate' || @fields.fetch('css_mode', '') == 'separate' || @fields.fetch('font_mode', '') == 'separate' || @fields.fetch('force_zip', false) == true end |
#setAuthor(author) ⇒ Object
Set the HTML author. The author from the input PDF is used by default.
-
author- The HTML author. -
Returns - The converter object.
5854 5855 5856 5857 |
# File 'lib/pdfcrowd.rb', line 5854 def setAuthor() @fields['author'] = self end |
#setClientUserAgent(agent) ⇒ Object
Specifies the User-Agent HTTP header that the client library will use when interacting with the API.
-
agent- The user agent string. -
Returns - The converter object.
5984 5985 5986 5987 |
# File 'lib/pdfcrowd.rb', line 5984 def setClientUserAgent(agent) @helper.setUserAgent(agent) self end |
#setConverterVersion(version) ⇒ Object
Set the converter version. Different versions may produce different output. Choose which one provides the best output for your case.
-
version- The version identifier. Allowed values are 24.04, 20.10, 18.10, latest. -
Returns - The converter object.
5961 5962 5963 5964 5965 5966 5967 5968 |
# File 'lib/pdfcrowd.rb', line 5961 def setConverterVersion(version) unless /(?i)^(24.04|20.10|18.10|latest)$/.match(version) raise Error.new(Pdfcrowd.(version, "setConverterVersion", "pdf-to-html", "Allowed values are 24.04, 20.10, 18.10, latest.", "set_converter_version"), 470); end @helper.setConverterVersion(version) self end |
#setCssMode(mode) ⇒ Object
Specifies where the style sheets are stored.
-
mode- The style sheet storage mode. Allowed values are embed, separate. -
Returns - The converter object.
5747 5748 5749 5750 5751 5752 5753 5754 |
# File 'lib/pdfcrowd.rb', line 5747 def setCssMode(mode) unless /(?i)^(embed|separate)$/.match(mode) raise Error.new(Pdfcrowd.(mode, "setCssMode", "pdf-to-html", "Allowed values are embed, separate.", "set_css_mode"), 470); end @fields['css_mode'] = mode self end |
#setCustomCss(css) ⇒ Object
Apply custom CSS to the output HTML document. It allows you to modify the visual appearance and layout. Tip: Using !important in custom CSS provides a way to prioritize and override conflicting styles.
-
css- A string containing valid CSS. The string must not be empty. -
Returns - The converter object.
5795 5796 5797 5798 5799 5800 5801 5802 |
# File 'lib/pdfcrowd.rb', line 5795 def setCustomCss(css) if (!(!css.nil? && !css.empty?)) raise Error.new(Pdfcrowd.(css, "setCustomCss", "pdf-to-html", "The string must not be empty.", "set_custom_css"), 470); end @fields['custom_css'] = css self end |
#setDebugLog(value) ⇒ Object
Turn on the debug logging. Details about the conversion are stored in the debug log. The URL of the log can be obtained from the getDebugLogUrl method or available in conversion statistics.
-
value- Set to true to enable the debug logging. -
Returns - The converter object.
5872 5873 5874 5875 |
# File 'lib/pdfcrowd.rb', line 5872 def setDebugLog(value) @fields['debug_log'] = value self end |
#setDpi(dpi) ⇒ Object
Set the output graphics DPI.
-
dpi- The DPI value. -
Returns - The converter object.
5712 5713 5714 5715 |
# File 'lib/pdfcrowd.rb', line 5712 def setDpi(dpi) @fields['dpi'] = dpi self end |
#setFontMode(mode) ⇒ Object
Specifies where the fonts are stored.
-
mode- The font storage mode. Allowed values are embed, separate. -
Returns - The converter object.
5760 5761 5762 5763 5764 5765 5766 5767 |
# File 'lib/pdfcrowd.rb', line 5760 def setFontMode(mode) unless /(?i)^(embed|separate)$/.match(mode) raise Error.new(Pdfcrowd.(mode, "setFontMode", "pdf-to-html", "Allowed values are embed, separate.", "set_font_mode"), 470); end @fields['font_mode'] = mode self end |
#setForceZip(value) ⇒ Object
Enforces the zip output format.
-
value- Set to true to get the output as a zip archive. -
Returns - The converter object.
5827 5828 5829 5830 |
# File 'lib/pdfcrowd.rb', line 5827 def setForceZip(value) @fields['force_zip'] = value self end |
#setHtmlNamespace(prefix) ⇒ Object
Add the specified prefix to all id and class attributes in the HTML content, creating a namespace for safe integration into another HTML document. This ensures unique identifiers, preventing conflicts when merging with other HTML.
-
prefix- The prefix to add before each id and class attribute name. Start with a letter or underscore, and use only letters, numbers, hyphens, underscores, or colons. -
Returns - The converter object.
5808 5809 5810 5811 5812 5813 5814 5815 |
# File 'lib/pdfcrowd.rb', line 5808 def setHtmlNamespace(prefix) unless /(?i)^[a-z_][a-z0-9_:-]*$/.match(prefix) raise Error.new(Pdfcrowd.(prefix, "setHtmlNamespace", "pdf-to-html", "Start with a letter or underscore, and use only letters, numbers, hyphens, underscores, or colons.", "set_html_namespace"), 470); end @fields['html_namespace'] = prefix self end |
#setHttpProxy(proxy) ⇒ Object
A proxy server used by Pdfcrowd conversion process for accessing the source URLs with HTTP scheme. It can help to circumvent regional restrictions or provide limited access to your intranet.
-
proxy- The value must have format DOMAIN_OR_IP_ADDRESS:PORT. -
Returns - The converter object.
5935 5936 5937 5938 5939 5940 5941 5942 |
# File 'lib/pdfcrowd.rb', line 5935 def setHttpProxy(proxy) unless /(?i)^([a-z0-9]+(-[a-z0-9]+)*\.)+[a-z0-9]{1,}:\d+$/.match(proxy) raise Error.new(Pdfcrowd.(proxy, "setHttpProxy", "pdf-to-html", "The value must have format DOMAIN_OR_IP_ADDRESS:PORT.", "set_http_proxy"), 470); end @fields['http_proxy'] = proxy self end |
#setHttpsProxy(proxy) ⇒ Object
A proxy server used by Pdfcrowd conversion process for accessing the source URLs with HTTPS scheme. It can help to circumvent regional restrictions or provide limited access to your intranet.
-
proxy- The value must have format DOMAIN_OR_IP_ADDRESS:PORT. -
Returns - The converter object.
5948 5949 5950 5951 5952 5953 5954 5955 |
# File 'lib/pdfcrowd.rb', line 5948 def setHttpsProxy(proxy) unless /(?i)^([a-z0-9]+(-[a-z0-9]+)*\.)+[a-z0-9]{1,}:\d+$/.match(proxy) raise Error.new(Pdfcrowd.(proxy, "setHttpsProxy", "pdf-to-html", "The value must have format DOMAIN_OR_IP_ADDRESS:PORT.", "set_https_proxy"), 470); end @fields['https_proxy'] = proxy self end |
#setImageFormat(image_format) ⇒ Object
Specifies the format for the output images.
-
image_format- The image format. Allowed values are png, jpg, svg. -
Returns - The converter object.
5734 5735 5736 5737 5738 5739 5740 5741 |
# File 'lib/pdfcrowd.rb', line 5734 def setImageFormat(image_format) unless /(?i)^(png|jpg|svg)$/.match(image_format) raise Error.new(Pdfcrowd.(image_format, "setImageFormat", "pdf-to-html", "Allowed values are png, jpg, svg.", "set_image_format"), 470); end @fields['image_format'] = image_format self end |
#setImageMode(mode) ⇒ Object
Specifies where the images are stored.
-
mode- The image storage mode. Allowed values are embed, separate, none. -
Returns - The converter object.
5721 5722 5723 5724 5725 5726 5727 5728 |
# File 'lib/pdfcrowd.rb', line 5721 def setImageMode(mode) unless /(?i)^(embed|separate|none)$/.match(mode) raise Error.new(Pdfcrowd.(mode, "setImageMode", "pdf-to-html", "Allowed values are embed, separate, none.", "set_image_mode"), 470); end @fields['image_mode'] = mode self end |
#setKeywords(keywords) ⇒ Object
Associate keywords with the HTML document. Keywords from the input PDF are used by default.
-
keywords- The string containing the keywords. -
Returns - The converter object.
5863 5864 5865 5866 |
# File 'lib/pdfcrowd.rb', line 5863 def setKeywords(keywords) @fields['keywords'] = keywords self end |
#setPdfPassword(password) ⇒ Object
Password to open the encrypted PDF file.
-
password- The input PDF password. -
Returns - The converter object.
5677 5678 5679 5680 |
# File 'lib/pdfcrowd.rb', line 5677 def setPdfPassword(password) @fields['pdf_password'] = password self end |
#setPrintPageRange(pages) ⇒ Object
Set the page range to print.
-
pages- A comma separated list of page numbers or ranges. -
Returns - The converter object.
5699 5700 5701 5702 5703 5704 5705 5706 |
# File 'lib/pdfcrowd.rb', line 5699 def setPrintPageRange(pages) unless /^(?:\s*(?:\d+|(?:\d*\s*\-\s*\d+)|(?:\d+\s*\-\s*\d*))\s*,\s*)*\s*(?:\d+|(?:\d*\s*\-\s*\d+)|(?:\d+\s*\-\s*\d*))\s*$/.match(pages) raise Error.new(Pdfcrowd.(pages, "setPrintPageRange", "pdf-to-html", "A comma separated list of page numbers or ranges.", "set_print_page_range"), 470); end @fields['print_page_range'] = pages self end |
#setProxy(host, port, user_name, password) ⇒ Object
Specifies an HTTP proxy that the API client library will use to connect to the internet.
-
host- The proxy hostname. -
port- The proxy port. -
user_name- The username. -
password- The password. -
Returns - The converter object.
6005 6006 6007 6008 |
# File 'lib/pdfcrowd.rb', line 6005 def setProxy(host, port, user_name, password) @helper.setProxy(host, port, user_name, password) self end |
#setRetryCount(count) ⇒ Object
Specifies the number of automatic retries when the 502 or 503 HTTP status code is received. The status code indicates a temporary network issue. This feature can be disabled by setting to 0.
-
count- Number of retries. -
Returns - The converter object.
6014 6015 6016 6017 |
# File 'lib/pdfcrowd.rb', line 6014 def setRetryCount(count) @helper.setRetryCount(count) self end |
#setScaleFactor(factor) ⇒ Object
Set the scaling factor (zoom) for the main page area.
-
factor- The percentage value. Must be a positive integer number. -
Returns - The converter object.
5686 5687 5688 5689 5690 5691 5692 5693 |
# File 'lib/pdfcrowd.rb', line 5686 def setScaleFactor(factor) if (!(Integer(factor) > 0)) raise Error.new(Pdfcrowd.(factor, "setScaleFactor", "pdf-to-html", "Must be a positive integer number.", "set_scale_factor"), 470); end @fields['scale_factor'] = factor self end |
#setSplitLigatures(value) ⇒ Object
Converts ligatures, two or more letters combined into a single glyph, back into their individual ASCII characters.
-
value- Set to true to split ligatures. -
Returns - The converter object.
5786 5787 5788 5789 |
# File 'lib/pdfcrowd.rb', line 5786 def setSplitLigatures(value) @fields['split_ligatures'] = value self end |
#setSubject(subject) ⇒ Object
Set the HTML subject. The subject from the input PDF is used by default.
-
subject- The HTML subject. -
Returns - The converter object.
5845 5846 5847 5848 |
# File 'lib/pdfcrowd.rb', line 5845 def setSubject(subject) @fields['subject'] = subject self end |
#setTag(tag) ⇒ Object
Tag the conversion with a custom value. The tag is used in conversion statistics. A value longer than 32 characters is cut off.
-
tag- A string with the custom tag. -
Returns - The converter object.
5926 5927 5928 5929 |
# File 'lib/pdfcrowd.rb', line 5926 def setTag(tag) @fields['tag'] = tag self end |
#setTitle(title) ⇒ Object
Set the HTML title. The title from the input PDF is used by default.
-
title- The HTML title. -
Returns - The converter object.
5836 5837 5838 5839 |
# File 'lib/pdfcrowd.rb', line 5836 def setTitle(title) @fields['title'] = title self end |
#setType3Mode(mode) ⇒ Object
Sets the processing mode for handling Type 3 fonts.
-
mode- The type3 font mode. Allowed values are raster, convert. -
Returns - The converter object.
5773 5774 5775 5776 5777 5778 5779 5780 |
# File 'lib/pdfcrowd.rb', line 5773 def setType3Mode(mode) unless /(?i)^(raster|convert)$/.match(mode) raise Error.new(Pdfcrowd.(mode, "setType3Mode", "pdf-to-html", "Allowed values are raster, convert.", "set_type3_mode"), 470); end @fields['type3_mode'] = mode self end |
#setUseHttp(value) ⇒ Object
Specifies if the client communicates over HTTP or HTTPS with Pdfcrowd API. Warning: Using HTTP is insecure as data sent over HTTP is not encrypted. Enable this option only if you know what you are doing.
-
value- Set to true to use HTTP. -
Returns - The converter object.
5975 5976 5977 5978 |
# File 'lib/pdfcrowd.rb', line 5975 def setUseHttp(value) @helper.setUseHttp(value) self end |
#setUserAgent(agent) ⇒ Object
Set a custom user agent HTTP header. It can be useful if you are behind a proxy or a firewall.
-
agent- The user agent string. -
Returns - The converter object.
5993 5994 5995 5996 |
# File 'lib/pdfcrowd.rb', line 5993 def setUserAgent(agent) @helper.setUserAgent(agent) self end |