Why DataMapper?

Open Development

DataMapper sports a very accessible code-base and a welcoming community. Outside contributions and feedback are welcome and encouraged, especially constructive criticism. Make your voice heard! Submit a ticket or patch, speak up on our mailing-list, chat with us on irc, write a spec, get it reviewed, ask for commit rights. It’s as easy as that to become a contributor.

Identity Map

One row in the data-store should equal one object reference. Pretty simple idea. Pretty profound impact. If you run the following code in ActiveRecord you’ll see all false results. Do the same in DataMapper and it’s true all the way down.

repository do
  @parent = Tree.first(:name => 'bob')

  @parent.children.each do |child|
    puts @parent.equal?(child.parent)  # => true
  end
end

This makes DataMapper faster and allocate less resources to get things done.

Dirty Tracking

When you save a model back to your data-store, DataMapper will only write the fields that actually changed. So it plays well with others. You can use it in an Integration data-store without worrying that your application will be a bad actor causing trouble for all of your other processes.

Eager Loading

Ready for something amazing? The following example executes only two queries regardless of how many rows the inner and outer queries return.

repository do
  Zoo.all.each { |zoo| zoo.exhibits.to_a }
end

Pretty impressive huh? The idea is that you aren’t going to load a set of objects and use only an association in just one of them. This should hold up pretty well against a 99% rule. When you don’t want it to work like this, just load the item you want in it’s own set. So the DataMapper thinks ahead. We like to call it “performant by default”. This feature single-handedly wipes out the “N+1 Query Problem”. No need to specify an :include option in your finders.

Laziness Can Be A Virtue

Text fields are expensive in data-stores. They’re generally stored in a different place than the rest of your data. So instead of a fast sequential read from your hard-drive, your data-store server has to hop around all over the place to get what it needs. Since ActiveRecord returns everything by default, adding a text field to a table slows everything down drastically, across the board.

Not so with the DataMapper. Text fields are lazily loaded, meaning they only load when you need them. If you want more control you can enable or disable this feature for any field (not just text-fields) by passing a :lazy option to your field mapping with a value of true or false.

class Animal
  include DataMapper::Resource

  property :name,        String
  property :description, Text, :lazy => false
end

Plus, lazy-loading of Text fields happens automatically and intelligently when working with associations. The following only issues 2 queries to load up all of the notes fields on each animal:

repository do
  Animal.all.each { |animal| animal.description.to_a }
end

Did you notice the #to_a call in the above example? That was necessary because even DataMapper collections are lazy. If you don’t iterate over them, or in this case ask them to become Arrays, they won’t execute until you need them. We needed to call #to_a to force the lazy load because without it, the above example would have only executed one query. This extra bit of laziness can come in very handy, for example:

animals     = Animal.all
description = 'foo'

animals.each do |animal|
  animal.update(:description => description)
end

In the above example, the Animals won’t be retrieved until you actually need them. This comes in handy in cases where you initialize the collection before you know if you need it, like in a web app controller.

Collection Chaining

DataMapper’s lazy collections are also handy because you can get the same effect as named scopes, without any special syntax, eg:

class Animal
  # ... setup ...

  def self.mammals
    all(:mammal => true)
  end

  def self.zoo(zoo)
    all(:zoo => zoo)
  end
end

zoo = Zoo.first(:name => 'Greater Vancouver Zoo')

Animal.mammals.zoo(zoo).to_a  # => executes one query

In the above example, we ask the Animal model for all the mammals, and then all the animals in a specific zoo, and DataMapper will chain the collection queries together and execute a single query to retrieve the matching records. There’s no special syntax, and no custom DSLs to learn, it’s just plain ruby all the way down.

You can even use this on association collections, eg:

zoo.animals.mammals.to_a  # => executes one query

Custom Properties

With DataMapper it is possible to create custom properties for your models. Consider this example:

module DataMapper
  class Property
    class Email < String
      required true
      format   /^([\w\.%\+\-]+)@([\w\-]+\.)+([\w]{2,})$/i
    end
  end
end

class User
  include DataMapper::Resource

  property :id,    Serial
  property :email, Email
end

This way there won’t be a need to repeat same property options every time you add an email to a model. In the example above we create an Email property which is just a String with additional pre-configured options: required and format. Please note that it is possible to override these options when declaring a property, like this:

class Member
  include DataMapper::Resource

  property :id,    Serial
  property :email, Email, :required => false
end

Plays Well With Others

In ActiveRecord, all your fields are mapped, whether you want them or not. This slows things down. In the DataMapper you define your mappings in your model. So instead of an _ALTER TABLE ADD field_ in your data-store, you simply add a property :name, String to your model. DRY. No schema.rb. No migration files to conflict or die without reverting changes. Your model drives the data-store, not the other way around.

Unless of course you want to map to a legacy data-store. Raise your hand if you like seeing a method called col2Name on your model just because that’s what it’s called in an old data-store you can’t afford to change right now? In DataMapper you control the mappings:

class Fruit
  include DataMapper::Resource

  storage_names[:repo] = 'frt'

  property :name, String, :field => 'col2Name'
end

All Ruby, All The Time

It’s great that ActiveRecord allows you to write SQL when you need to, but should we have to so often?

DataMapper supports issuing your own query, but it also provides more helpers and a unique hash-based condition syntax to cover more of the use-cases where issuing your own SQL would have been the only way to go. For example, any finder option that’s non-standard is considered a condition. So you can write Zoo.all(:name => 'Dallas') and DataMapper will look for zoos with the name of ‘Dallas’.

It’s just a little thing, but it’s so much nicer than writing Zoo.find(:all, :conditions => ['name = ?', 'Dallas']). What if you need other comparisons though? Try these:

# 'gt' means greater-than. We also do 'lt'.
Person.all(:age.gt => 30)

# 'gte' means greather-than-or-equal-to. We also do 'lte'.
Person.all(:age.gte => 30)

# 'not' allows you to match all people without the name "bob"
Person.all(:name.not => 'bob')

# If the value of a pair is an Array, we do an IN-clause for you.
Person.all(:name.like => 'S%', :id => [ 1, 2, 3, 4, 5 ])

# Does a NOT IN () clause for you.
Person.all(:name.not => [ 'bob', 'rick', 'steve' ])

See? Fewer SQL fragments dirtying your Ruby code. And that’s just a few of the nice syntax tweaks DataMapper delivers out of the box…

Note on Patches/Pull Requests

  • Fork the project.

  • Make your feature addition or bug fix.

  • Add tests for it. This is important so I don’t break it in a future version unintentionally.

  • Commit, do not mess with rakefile, version, or history. (if you want to have your own version, that is fine but bump version in a commit by itself I can ignore when I pull)

  • Send me a pull request. Bonus points for topic branches.

Copyright © 2011 Dan Kubb. See LICENSE for details.