Class: RubyTikaApp
- Inherits:
-
Object
show all
- Defined in:
- lib/ruby_tika_app.rb
Defined Under Namespace
Classes: CommandFailedError, Error
Instance Method Summary
collapse
Constructor Details
#initialize(document) ⇒ RubyTikaApp
Returns a new instance of RubyTikaApp.
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
|
# File 'lib/ruby_tika_app.rb', line 17
def initialize(document)
@document = if (document =~ %r{https?:\/\/[\S]+}) == 0
document
else
"file://#{document}"
end
java_cmd = 'java'
java_args = '-server -Djava.awt.headless=true'
ext_dir = File.join(File.dirname(__FILE__))
tika_path = "#{ext_dir}/../ext/tika-app-1.19.1.jar"
tika_config_path = "#{ext_dir}/../ext/tika-config.xml"
@tika_cmd = "#{java_cmd} #{java_args} -jar '#{tika_path}' --config='#{tika_config_path}'"
end
|
Instance Method Details
#to_html ⇒ Object
37
38
39
|
# File 'lib/ruby_tika_app.rb', line 37
def to_html
run_tika('--html')
end
|
#to_json ⇒ Object
41
42
43
|
# File 'lib/ruby_tika_app.rb', line 41
def to_json
run_tika('--json')
end
|
53
54
55
|
# File 'lib/ruby_tika_app.rb', line 53
def to_metadata
run_tika('--metadata')
end
|
#to_text ⇒ Object
45
46
47
|
# File 'lib/ruby_tika_app.rb', line 45
def to_text
run_tika('--text')
end
|
#to_text_main ⇒ Object
49
50
51
|
# File 'lib/ruby_tika_app.rb', line 49
def to_text_main
run_tika('--text-main')
end
|
#to_xml ⇒ Object
33
34
35
|
# File 'lib/ruby_tika_app.rb', line 33
def to_xml
run_tika('--xml')
end
|