DUI Build Status

Delete, Update, and Insert

The purpose of this gem is to make it easy to compare two datasets to see what operations would be necessary to merge the new data into the old.

Way back in my stored procedure days, we'd write this type of update by:

1.) Deleting any current records that aren't in the new set, 2.) Updating the current records with any matches in the new set, and 3.) Inserting any new records into the current set.

This gem helps to run this process by doing the comparison for you, giving you three sets of data that you can use to make the appropriate data changes.

Warning: All of the work is done in memory, so I would not run it against millions of records. But for small-to-medium sets of data, it might be of some help.

My inspiration?

Two things:

1.) I love that Ruby's syntax makes it easy to write code that executes a process, regardless of the objects being used (and without going crazy with things like generics). We'd used to write these DUI sprocs by hand, different for each table. I guess object-oriented languages have some benefits...

2.) I've seen a number of imports that start with an call to destroy_all, so an import is essentially a DELETE EVERYTHING, THEN IMPORT EVERYTHING AS NEW process. Yikes. After having to fix a few issues where the system would fail after the data was deleted but before the import was finished, I had to take action.

Simple Example:

require 'DUI'
require 'awesome_print'

class Person
  attr_accessor :name, :email
  def initialize(hash = {})
    hash.each_pair { |k,v| self.send((k.to_s+"=").to_sym, v)}
  end
end

current_data = [Person.new(:email => '[email protected]', :name => "JG"),
                Person.new(:email => '[email protected]', :name => "H Roark"),
                Person.new(:email => '[email protected]', :name => "Peter Keating")]

new_data = [{:email => "[email protected]", :name => "Howard Roark"},
            {:email => "[email protected]", :name => "John Galt"},
            {:email => "[email protected]", :name => "Hank Rearden"}]

a_way_to_match_the_records = lambda {|current, new| current.email == new[:email]}

# create a matcher, provide it with a way to compare the two records
matcher = DUI::Matcher.new(&a_way_to_match_the_records)

# get the results of the matches between the two sets
results = matcher.get_results current_data, new_data

# now we know which records to delete, update, or insert, so let's do it

results.records_to_delete.each do |delete_me|
  current_data.delete delete_me # bye peter!
end

results.records_to_update.each do |match|
  match.current.name = match.new[:name] #fix those names!
end

results.records_to_insert.each do |add_me|
  new_person = Person.new :email => add_me[:email], :name => add_me[:name]
  current_data << new_person # hello hank!
end

ap current_data

Here's the final result:

[
    [0] #<Person:0x7ffae4937920
        attr_accessor :email = "[email protected]",
        attr_accessor :name = "John Galt"
    >,
    [1] #<Person:0x7ffae49376c8
        attr_accessor :email = "[email protected]",
        attr_accessor :name = "Howard Roark"
    >,
    [2] #<Person:0x7ffae4966bf8
        attr_accessor :email = "[email protected]",
        attr_accessor :name = "Hank Rearden"
    >
]