Ever wanted to do terrible, terrible things with DER-encoded ASN.1 data structures? Now you can!

While there are plenty of DER parsers out there, they tend to suffer from the misguided assumption that the data they're parsing is actually valid. Wild idea, it'll never catch on. In contrast, derparse makes the opposite assumption -- that everything is awful. This means that it makes an attempt to parse incomplete and, to a limited degree, corrupted DER-encoded data structures.

Why would you ever want this, you might ask? It's a niche requirement, undoubtedly, and if you're asking the question, that's a good indication that derparse probably isn't what you're looking for -- OpenSSL::ASN1 does a good job of parsing well-formed DER data.

Installation

It's a gem:

gem install derparse

There's also the wonders of the Gemfile:

gem 'derparse'

If you're the sturdy type that likes to run from git:

rake install

Or, if you've eschewed the convenience of Rubygems entirely, then you presumably know what to do already.

Usage

Instantiate a new DerParse object, giving it the DER-encoded data you want to parse:

p = DerParse.new("0\x81\x80\x02\x01\x2a\x04\x44\x65\x72\x50\x61\x72\x73\x65\x02\x01\x45\x6e\x69\x63\x65")

DerParse has a number of instance methods to help you examine and work with the parsed ASN.1 data. For example, you can get a string description of the data and its structure:

p.to_s   # => "long, multi-line string of something something"

which means, of course, that you can print it to screen:

puts p

If you want to process the data, you can use #traverse to walk through every element of the DER, which will yield a <code>DerParse<code>DerParse::Node</code> for each ASN.1 object that is encountered, along with info about the parser state.

p.traverse do |node, depth, offset|
  puts "Parsing depth: #{depth} Offset: #{offset}"
  puts "Header length: #{node.header_length} Data length: #{node.data_length}"
  puts "Tag number: #{node.tag} Tag class: #{node.tag_class} Constructed: #{node.constructed?}"
  puts "Value: #{node.value}"
  puts "INCOMPLETE!" unless node.complete?
end

There are a whole bunch of methods which <code>DerParse<code>DerParse::Node</code> has to help you query what sort of ASN.1 object you're dealing with. It also handles a string in which there are multiple ASN.1 objects encoded sequentially.

It's is particularly important to note the <code>DerParse<code>DerParse::Node<code>DerParse::Node#complete?</code> method. Because DerParse tries to give you as much data as it can, even if the DER is incomplete or corrupt, it needs to let you know when the data it's passing back is known to not be "correct", in a strict sense. What counts as an "incomplete" value, and what impact it has on the value of a given node, can get complicated. For full details, see the documentation for DerParse#traverse.

If you call #traverse without a block, it'll return an iterator, which allows you to be more flexible with your iteration, like constructing complicated handlers for parsed DER structures.

Finally, you can try <code>DerParse<code>DerParse#to_a</code>. This attempts to turn a DER blob into a collection of Ruby objects. It handles many common ASN.1 data types, including integers, octet and bit strings, sequences and sets (turns them into arrays), and so on. If it hits an ASN.1 type it can't handle, it'll give up and raise <code>DerParse<code>DerParse::IncompatibleDataTypeError</code>.

Compatibility and Coverage

The parts of ASN.1 that DerParse best supports are those which I've needed for my own purposes -- mainly sequences, integers, octet strings, and to a lesser extent OIDs. In particular, it's important to note that DerParse is for DER structures, so any BER-specific things like indefinite lengths are not, and probably will not, be supported. Support for additional ASN.1 types are welcomed via a well-tested PR.

Security

ASN.1 has a long and glorious history of being difficult to parse safely. While parsing in a memory-safe language reduces the attack surface, I don't recommend unconstrained parsing of arbitrary input using DerParse. Reasonable care has been taken to not produce any absolute bone-headed bugs, but this code has not been intensively vetted for potential security issues.

That being said, DerParse is intended to be secure, and everyone wants it to be secure. If you do find anything in DerParse that is security sensitive, please e-mail [email protected].

Contributing

See CONTRIBUTING.md.

Licence

Unless otherwise stated, everything in this repo is covered by the following copyright notice:

Copyright (C) 2020  Matt Palmer <[email protected]>

This program is free software: you can redistribute it and/or modify it
under the terms of the GNU General Public License version 3, as
published by the Free Software Foundation.

This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
GNU General Public License for more details.

You should have received a copy of the GNU General Public License
along with this program.  If not, see <http://www.gnu.org/licenses/>.