Class: Subprocess::Process

Inherits:

Object

Object
Subprocess::Process

Defined in:: lib/subprocess.rb

Overview

A child process. The preferred way of spawning a subprocess is through the functions on Subprocess (especially check_call and check_output).

Instance Attribute Summary collapse

#command ⇒ Array<String> readonly
The command this process was invoked with.
#pid ⇒ Integer readonly
The process ID of the spawned process.
#status ⇒ ::Process::Status^? readonly
The exit status code of the process.
#stderr ⇒ IO readonly
The IO that is connected to this process's stderr.
#stdin ⇒ IO readonly
The IO that is connected to this process's stdin.
#stdout ⇒ IO readonly
The IO that is connected to this process's stdout.

Instance Method Summary collapse

#communicate(input = nil, timeout_s = nil) {|stdout, stderr| ... } ⇒ Array(String, String)^?
Write the (optional) input to the process's stdin and read the contents of stdout and stderr.
#drain_fd(fd, buf = nil) ⇒ true, false
Do nonblocking reads from fd, appending all data read into buf.
#initialize(cmd, opts = {}) {|process| ... } ⇒ Process constructor
Create a new process.
#poll ⇒ ::Process::Status^?
Poll the child, setting (and returning) its status.
#send_signal(signal) ⇒ Integer
Does exactly what it says on the box.
#terminate ⇒ Integer
Sends SIGTERM to the process.
#wait ⇒ ::Process::Status
Wait for the child to return, setting and returning the status of the child.

Constructor Details

#initialize(cmd, opts = {}) {|process| ... } ⇒ `Process`

Create a new process.

Parameters:

cmd (Array<String>) —
The command to run and its arguments (in the style of an argv array). Unlike Python's subprocess module, cmd cannot be a String.
opts (Hash) (defaults to: {}) —
a customizable set of options

Options Hash (opts):

:stdin (IO, Integer, String, Subprocess::PIPE, nil) —
The IO, file descriptor number, or file name to use for the process's standard input. If the magic value Subprocess::PIPE is passed, a new pipe will be opened.
:stdout (IO, Integer, String, Subprocess::PIPE, nil) —
The IO, file descriptor number, or file name to use for the process's standard output. If the magic value Subprocess::PIPE is passed, a pipe will be opened and attached to the process.
:stderr (IO, Integer, String, Subprocess::PIPE, Subprocess::STDOUT, nil) —
The IO, file descriptor number, or file name to use for the process's standard error. If the special value Subprocess::PIPE is passed, a pipe will be opened and attached to the process. If the special value STDOUT is passed, the process's stderr will be redirected to its stdout (much like bash's 2>&1).
:cwd (String) —
The directory to change to before executing the child process.
:env (Hash<String, String>) —
The environment to use in the child process.
:retain_fds (Array<Integer>) —
An array of file descriptor numbers that should not be closed before executing the child process. Note that, unlike Python (which has :close_fds defaulting to false), all file descriptors not specified here will be closed.
:exec_opts (Hash) —
A hash that will be merged into the options hash of the call to Kernel#exec.
:preexec_fn (Proc) —
A function that will be called in the child process immediately before executing cmd.

Yields:

(process) —
Yields the just-spawned Subprocess::Process to the optional block. This occurs after all of Subprocess::Process's error handling has been completed, and is a great place to call #communicate, especially when used in conjunction with check_call.

Yield Parameters:

process (Process) —
The process that was just spawned.

Raises:

(ArgumentError)

# File 'lib/subprocess.rb', line 268

def initialize(cmd, opts={}, &blk)
  raise ArgumentError, "cmd must be an Array of strings" unless Array === cmd
  raise ArgumentError, "cmd cannot be empty" if cmd.empty?

  @command = cmd

  # Figure out what file descriptors we should pass on to the child (and
  # make externally visible ourselves)
  @child_stdin, @stdin = parse_fd(opts[:stdin], 'r')
  @child_stdout, @stdout = parse_fd(opts[:stdout], 'w')
  unless opts[:stderr] == STDOUT
    @child_stderr, @stderr = parse_fd(opts[:stderr], 'w')
  end

  retained_fds = Set.new(opts[:retain_fds] || [])

  # A control pipe for ferrying errors back from the child
  control_r, control_w = IO.pipe

  @pid = fork do
    begin
      ::STDIN.reopen(@child_stdin) if @child_stdin
      ::STDOUT.reopen(@child_stdout) if @child_stdout
      if opts[:stderr] == STDOUT
        ::STDERR.reopen(::STDOUT)
      else
        ::STDERR.reopen(@child_stderr) if @child_stderr
      end

      # Set up a new environment if we're requested to do so.
      if opts[:env]
        ENV.clear
        begin
          ENV.update(opts[:env])
        rescue TypeError => e
          raise ArgumentError, "`env` option must be a hash where all keys and values are strings (#{e})"
        end
      end

      # Call the user back, maybe?
      if opts[:preexec_fn]
        if opts[:cwd]
          Dir.chdir(opts[:cwd], &opts[:preexec_fn])
        else
          opts[:preexec_fn].call
        end
      end

      options = {close_others: true}.merge(opts.fetch(:exec_opts, {}))
      if opts[:retain_fds]
        retained_fds.each { |fd| options[fd] = fd }
      end
      if opts[:cwd]
        # We use the chdir option to `exec` since wrapping the
        # `exec` in a Dir.chdir block caused these sporadic errors on macOS:
        # Too many open files - getcwd (Errno::EMFILE)
        options[:chdir] = opts[:cwd]
      end

      begin
        # Ruby's Kernel#exec will call an exec(3) variant if called with two
        # or more arguments, but when called with just a single argument will
        # spawn a subshell with that argument as the command. Since we always
        # want to call exec(3), we use the third exec form, which passes a
        # [cmdname, argv0] array as its first argument and never invokes a
        # subshell.
        exec([cmd[0], cmd[0]], *cmd[1..-1], options)
      rescue TypeError => e
        raise ArgumentError, "cmd must be an Array of strings (#{e})"
      end

    rescue Exception => e
      # Dump all errors up to the parent through the control socket
      Marshal.dump(e, control_w)
      control_w.flush
    end

    # Something has gone terribly, terribly wrong if we're hitting this :(
    exit!(1)
  end

  # Meanwhile, in the parent process...

  # First, let's close some things we shouldn't have access to
  @child_stdin.close if our_fd?(opts[:stdin])
  @child_stdout.close if our_fd?(opts[:stdout])
  @child_stderr.close if our_fd?(opts[:stderr])
  control_w.close

  # Any errors during the spawn process? We'll get past this point when the
  # child execs and the OS closes control_w
  begin
    e = Marshal.load(control_r)
    e = "Unknown Failure" unless e.is_a?(Exception) || e.is_a?(String)
    # Because we're throwing an exception and not returning a
    # Process, we need to make sure the child gets reaped
    wait
    raise e
  rescue EOFError # Nothing to read? Great!
  ensure
    control_r.close
  end

  # Everything is okay. Good job, team!
  blk.call(self) if blk
end

Instance Attribute Details

#command ⇒ `Array<String>` (readonly)

Returns The command this process was invoked with.

Returns:

(Array<String>) —
The command this process was invoked with.



219
220
221

# File 'lib/subprocess.rb', line 219

def command
  @command
end

#pid ⇒ `Integer` (readonly)

Returns The process ID of the spawned process.

Returns:

(Integer) —
The process ID of the spawned process.



222
223
224

# File 'lib/subprocess.rb', line 222

def pid
  @pid
end

#status ⇒ `::Process::Status`^? (readonly)

Returns The exit status code of the process. Only set after the process has exited.

Returns:

(::Process::Status, nil) —
The exit status code of the process. Only set after the process has exited.



226
227
228

# File 'lib/subprocess.rb', line 226

def status
  @status
end

#stderr ⇒ `IO` (readonly)

Returns The IO that is connected to this process's stderr.

Returns:

(IO) —
The IO that is connected to this process's stderr.



216
217
218

# File 'lib/subprocess.rb', line 216

def stderr
  @stderr
end

#stdin ⇒ `IO` (readonly)

Returns The IO that is connected to this process's stdin.

Returns:

(IO) —
The IO that is connected to this process's stdin.



210
211
212

# File 'lib/subprocess.rb', line 210

def stdin
  @stdin
end

#stdout ⇒ `IO` (readonly)

Returns The IO that is connected to this process's stdout.

Returns:

(IO) —
The IO that is connected to this process's stdout.



213
214
215

# File 'lib/subprocess.rb', line 213

def stdout
  @stdout
end

Instance Method Details

#communicate(input = nil, timeout_s = nil) {|stdout, stderr| ... } ⇒ `Array(String, String)`^?

Write the (optional) input to the process's stdin and read the contents of stdout and stderr. If a block is provided, stdout and stderr are yielded as they are read. Otherwise they are buffered in memory and returned when the process exits. Do this all using IO::select, so we don't deadlock due to full pipe buffers.

This is only really useful if you set some of :stdin, :stdout, and :stderr to Subprocess::PIPE.

Parameters:

input (String) (defaults to: nil) —
A string to feed to the child's standard input.
timeout_s (Numeric) (defaults to: nil) —
Raise CommunicateTimeout if communicate does not finish after timeout_s seconds.

Yield Parameters:

stdout (String) —
Data read from stdout since the last yield
stderr (String) —
Data read from stderr since the last yield

Returns:

(Array(String, String), nil) —
An array of two elements: the data read from the child's standard output and standard error, respectively. Returns nil if a block is provided.

Raises:

(ArgumentError)

# File 'lib/subprocess.rb', line 428

def communicate(input=nil, timeout_s=nil)
  raise ArgumentError if !input.nil? && @stdin.nil?

  stdout, stderr = "", ""

  # NB: Always force encoding to binary so we handle unicode or binary input
  # correctly across multiple write_nonblock calls, since we manually slice
  # the input depending on how many bytes were written
  input = input.dup.force_encoding('BINARY') unless input.nil?

  @stdin.close if (input.nil? || input.empty?) && !@stdin.nil?

  timeout_at = Time.now + timeout_s if timeout_s

  self.class.catching_sigchld(pid) do |global_read, self_read|
    wait_r = [@stdout, @stderr, self_read, global_read].compact
    wait_w = [input && @stdin].compact
    done = false
    while !done
      # If the process has exited, we want to drain any remaining output before returning
      if poll
        ready_r = wait_r - [self_read, global_read]
        ready_w = []
        done = true
      else
        ready_r, ready_w = select_until(wait_r, wait_w, [], timeout_at)
        raise CommunicateTimeout.new(@command, stdout, stderr) if ready_r.nil?
      end

      if ready_r.include?(@stdout)
        if drain_fd(@stdout, stdout)
          wait_r.delete(@stdout)
        end
      end

      if ready_r.include?(@stderr)
        if drain_fd(@stderr, stderr)
          wait_r.delete(@stderr)
        end
      end

      if ready_r.include?(global_read)
        if drain_fd(global_read)
          raise "Unexpected internal error -- someone closed the global self-pipe!"
        end
        self.class.wakeup_sigchld
      end

      if ready_r.include?(self_read)
        if drain_fd(self_read)
          raise "Unexpected internal error -- someone closed our self-pipe!"
        end
      end

      if ready_w.include?(@stdin)
        written = 0
        begin
          written = @stdin.write_nonblock(input)
        rescue EOFError # Maybe I shouldn't catch this...
        rescue Errno::EINTR
        rescue IO::WaitWritable
          # On OS X, a pipe can raise EAGAIN even after select indicates
          # that it is writable. Once the process consumes from the pipe,
          # the next write should succeed and we should make forward progress.
          # Until then, treat this as not writing any bytes and continue looping.
          # For details see: https://github.com/stripe/subprocess/pull/22
          nil
        rescue Errno::EPIPE
          # The other side of the pipe closed before we could
          # write all of our input. This can happen if the
          # process exits prematurely.
          @stdin.close
          wait_w.delete(@stdin)
        end
        input = input[written..input.length]
        if input.empty?
          @stdin.close
          wait_w.delete(@stdin)
        end
      end

      if block_given? && !(stderr.empty? && stdout.empty?)
        yield stdout, stderr
        stdout, stderr = "", ""
      end
    end
  end

  wait

  if block_given?
    nil
  else
    [stdout, stderr]
  end
end

#drain_fd(fd, buf = nil) ⇒ `true`, `false`

Do nonblocking reads from fd, appending all data read into buf.

Parameters:

fd (IO) —
The file to read from.
buf (String) (defaults to: nil) —
A buffer to append the read data to.

Returns:

(true, false) —
Whether fd was closed due to an exceptional condition (EOFError or EPIPE).

# File 'lib/subprocess.rb', line 398

def drain_fd(fd, buf=nil)
  loop do
    tmp = fd.read_nonblock(4096).force_encoding(fd.external_encoding)
    buf << tmp unless buf.nil?
  end
rescue EOFError, Errno::EPIPE
  fd.close
  true
rescue Errno::EINTR
rescue Errno::EWOULDBLOCK, Errno::EAGAIN
  false
end

#poll ⇒ `::Process::Status`^?

Poll the child, setting (and returning) its status. If the child has not terminated, return nil and exit immediately

Returns:

(::Process::Status, nil) —
The exit status of the process



379
380
381

# File 'lib/subprocess.rb', line 379

def poll
  @status ||= (::Process.waitpid2(@pid, ::Process::WNOHANG) || []).last
end

#send_signal(signal) ⇒ `Integer`

Does exactly what it says on the box.

Parameters:

signal (String, Symbol, Integer) —
The signal to send to the child process. Accepts all the same arguments as Ruby's built-in Process::kill, for instance a string like "INT" or "SIGINT", or a signal number like 2.

Returns:

(Integer) —
See Process.kill

#terminate ⇒ `Integer`

Sends SIGTERM to the process.

Returns:

(Integer) —
See #send_signal

#wait ⇒ `::Process::Status`

Wait for the child to return, setting and returning the status of the child.

Returns:

(::Process::Status) —
The exit status of the process



387
388
389

# File 'lib/subprocess.rb', line 387

def wait
  @status ||= ::Process.waitpid2(@pid).last
end

Class: Subprocess::Process

Overview

Instance Attribute Summary collapse

Instance Method Summary collapse

Constructor Details

#initialize(cmd, opts = {}) {|process| ... } ⇒ Process

Instance Attribute Details

#command ⇒ Array<String> (readonly)

#pid ⇒ Integer (readonly)

#status ⇒ ::Process::Status? (readonly)

#stderr ⇒ IO (readonly)

#stdin ⇒ IO (readonly)

#stdout ⇒ IO (readonly)

Instance Method Details

#communicate(input = nil, timeout_s = nil) {|stdout, stderr| ... } ⇒ Array(String, String)?

#drain_fd(fd, buf = nil) ⇒ true, false

#poll ⇒ ::Process::Status?

#send_signal(signal) ⇒ Integer

#terminate ⇒ Integer

#wait ⇒ ::Process::Status

#initialize(cmd, opts = {}) {|process| ... } ⇒ `Process`

#command ⇒ `Array<String>` (readonly)

#pid ⇒ `Integer` (readonly)

#status ⇒ `::Process::Status`^? (readonly)

#stderr ⇒ `IO` (readonly)

#stdin ⇒ `IO` (readonly)

#stdout ⇒ `IO` (readonly)

#communicate(input = nil, timeout_s = nil) {|stdout, stderr| ... } ⇒ `Array(String, String)`^?

#drain_fd(fd, buf = nil) ⇒ `true`, `false`

#poll ⇒ `::Process::Status`^?

#send_signal(signal) ⇒ `Integer`

#terminate ⇒ `Integer`

#wait ⇒ `::Process::Status`