Module: Sequel::Model::Associations::DatasetMethods

Defined in:
lib/sequel/model/associations.rb

Overview

Eager loading makes it so that you can load all associated records for a set of objects in a single query, instead of a separate query for each object.

Two separate implementations are provided. eager should be used most of the time, as it loads associated records using one query per association. However, it does not allow you the ability to filter or order based on columns in associated tables. eager_graph loads all records in a single query using JOINs, allowing you to filter or order based on columns in associated tables. However, eager_graph is usually slower than eager, especially if multiple one_to_many or many_to_many associations are joined.

You can cascade the eager loading (loading associations on associated objects) with no limit to the depth of the cascades. You do this by passing a hash to eager or eager_graph with the keys being associations of the current model and values being associations of the model associated with the current model via the key.

The arguments can be symbols or hashes with symbol keys (for cascaded eager loading). Examples:

Album.eager(:artist).all
Album.eager_graph(:artist).all
Album.eager(:artist, :genre).all
Album.eager_graph(:artist, :genre).all
Album.eager(:artist).eager(:genre).all
Album.eager_graph(:artist).eager(:genre).all
Artist.eager(:albums=>:tracks).all
Artist.eager_graph(:albums=>:tracks).all
Artist.eager(:albums=>{:tracks=>:genre}).all
Artist.eager_graph(:albums=>{:tracks=>:genre}).all

You can also pass a callback as a hash value in order to customize the dataset being eager loaded at query time, analogous to the way the :eager_block association option allows you to customize it at association definition time. For example, if you wanted artists with their albums since 1990:

Artist.eager(:albums => proc{|ds| ds.where{year > 1990}})

Or if you needed albums and their artist’s name only, using a single query:

Albums.eager_graph(:artist => proc{|ds| ds.select(:name)})

To cascade eager loading while using a callback, you substitute the cascaded associations with a single entry hash that has the proc callback as the key and the cascaded associations as the value. This will load artists with their albums since 1990, and also the tracks on those albums and the genre for those tracks:

Artist.eager(:albums => {proc{|ds| ds.where{year > 1990}}=>{:tracks => :genre}})

Instance Method Summary collapse

Instance Method Details

#association_join(*associations) ⇒ Object

Adds one or more INNER JOINs to the existing dataset using the keys and conditions specified by the given association. The following methods also exist for specifying a different type of JOIN:

association_full_join

FULL JOIN

association_inner_join

INNER JOIN

association_left_join

LEFT JOIN

association_right_join

RIGHT JOIN



2119
2120
2121
# File 'lib/sequel/model/associations.rb', line 2119

def association_join(*associations)
  association_inner_join(*associations)
end

#complex_expression_sql_append(sql, op, args) ⇒ Object

If the expression is in the form x = y where y is a Sequel::Model instance, array of Sequel::Model instances, or a Sequel::Model dataset, assume x is an association symbol and look up the association reflection via the dataset’s model. From there, return the appropriate SQL based on the type of association and the values of the foreign/primary keys of y. For most association types, this is a simple transformation, but for many_to_many associations this creates a subquery to the join table.



2130
2131
2132
2133
2134
2135
2136
2137
2138
2139
2140
2141
2142
2143
2144
2145
2146
2147
2148
2149
2150
2151
2152
2153
2154
2155
2156
2157
2158
2159
2160
2161
2162
2163
2164
2165
2166
2167
2168
2169
2170
2171
# File 'lib/sequel/model/associations.rb', line 2130

def complex_expression_sql_append(sql, op, args)
  r = args.at(1)
  if (((op == :'=' || op == :'!=') and r.is_a?(Sequel::Model)) ||
      (multiple = ((op == :IN || op == :'NOT IN') and ((is_ds = r.is_a?(Sequel::Dataset)) or r.all?{|x| x.is_a?(Sequel::Model)}))))
    l = args.at(0)
    if ar = model.association_reflections[l]
      if multiple
        klass = ar.associated_class
        if is_ds
          if r.respond_to?(:model)
            unless r.model <= klass
              # A dataset for a different model class, could be a valid regular query
              return super
            end
          else
            # Not a model dataset, could be a valid regular query
            return super
          end
        else
          unless r.all?{|x| x.is_a?(klass)}
            raise Sequel::Error, "invalid association class for one object for association #{l.inspect} used in dataset filter for model #{model.inspect}, expected class #{klass.inspect}"
          end
        end
      elsif !r.is_a?(ar.associated_class)
        raise Sequel::Error, "invalid association class #{r.class.inspect} for association #{l.inspect} used in dataset filter for model #{model.inspect}, expected class #{ar.associated_class.inspect}"
      end

      if exp = association_filter_expression(op, ar, r)
        literal_append(sql, exp)
      else
        raise Sequel::Error, "invalid association type #{ar[:type].inspect} for association #{l.inspect} used in dataset filter for model #{model.inspect}"
      end
    elsif multiple && (is_ds || r.empty?)
      # Not a query designed for this support, could be a valid regular query
      super
    else
      raise Sequel::Error, "invalid association #{l.inspect} used in dataset filter for model #{model.inspect}"
    end
  else
    super
  end
end

#eager(*associations) ⇒ Object

The preferred eager loading method. Loads all associated records using one query for each association.

The basic idea for how it works is that the dataset is first loaded normally. Then it goes through all associations that have been specified via eager. It loads each of those associations separately, then associates them back to the original dataset via primary/foreign keys. Due to the necessity of all objects being present, you need to use all to use eager loading, as it can’t work with each.

This implementation avoids the complexity of extracting an object graph out of a single dataset, by building the object graph out of multiple datasets, one for each association. By using a separate dataset for each association, it avoids problems such as aliasing conflicts and creating cartesian product result sets if multiple one_to_many or many_to_many eager associations are requested.

One limitation of using this method is that you cannot filter the dataset based on values of columns in an associated table, since the associations are loaded in separate queries. To do that you need to load all associations in the same query, and extract an object graph from the results of that query. If you need to filter based on columns in associated tables, look at eager_graph or join the tables you need to filter on manually.

Each association’s order, if defined, is respected. If the association uses a block or has an :eager_block argument, it is used.



2198
2199
2200
2201
2202
2203
2204
2205
2206
2207
2208
2209
2210
2211
2212
2213
2214
# File 'lib/sequel/model/associations.rb', line 2198

def eager(*associations)
  opt = @opts[:eager]
  opt = opt ? opt.dup : {}
  associations.flatten.each do |association|
    case association
    when Symbol
      check_association(model, association)
      opt[association] = nil
    when Hash
      association.keys.each{|assoc| check_association(model, assoc)}
      opt.merge!(association)
    else
      raise(Sequel::Error, 'Associations must be in the form of a symbol or hash')
    end
  end
  clone(:eager=>opt)
end

#eager_graph(*associations) ⇒ Object

The secondary eager loading method. Loads all associations in a single query. This method should only be used if you need to filter or order based on columns in associated tables.

This method uses Dataset#graph to create appropriate aliases for columns in all the tables. Then it uses the graph’s metadata to build the associations from the single hash, and finally replaces the array of hashes with an array model objects inside all.

Be very careful when using this with multiple one_to_many or many_to_many associations, as you can create large cartesian products. If you must graph multiple one_to_many and many_to_many associations, make sure your filters are narrow if you have a large database.

Each association’s order, if definied, is respected. eager_graph probably won’t work correctly on a limited dataset, unless you are only graphing many_to_one, one_to_one, and one_through_one associations.

Does not use the block defined for the association, since it does a single query for all objects. You can use the :graph_* association options to modify the SQL query.

Like eager, you need to call all on the dataset for the eager loading to work. If you just call each, it will yield plain hashes, each containing all columns from all the tables.



2236
2237
2238
# File 'lib/sequel/model/associations.rb', line 2236

def eager_graph(*associations)
  eager_graph_with_options(associations)
end

#eager_graph_with_options(associations, opts = OPTS) ⇒ Object

Run eager_graph with some options specific to just this call. Unlike eager_graph, this takes the associations as a single argument instead of multiple arguments.

Options:

:join_type

Override the join type specified in the association

:limit_strategy

Use a strategy for handling limits on associations. Appropriate :limit_strategy values are:

true

Pick the most appropriate based on what the database supports

:distinct_on

Force use of DISTINCT ON stategy (*_one associations only)

:window_function

Force use of window function strategy

:ruby

Don’t modify the SQL, implement limits/offsets with array slicing

This can also be a hash with association name symbol keys and one of the above values, to use different strategies per association.

The default is the :ruby strategy. Choosing a different strategy can make your code significantly slower in some cases (perhaps even the majority of cases), so you should only use this if you have benchmarked that it is faster for your use cases.



2259
2260
2261
2262
2263
2264
2265
2266
2267
2268
2269
2270
2271
2272
2273
2274
2275
2276
2277
# File 'lib/sequel/model/associations.rb', line 2259

def eager_graph_with_options(associations, opts=OPTS)
  associations = [associations] unless associations.is_a?(Array)
  if eg = @opts[:eager_graph]
    eg = eg.dup
    [:requirements, :reflections, :reciprocals, :limits].each{|k| eg[k] = eg[k].dup}
    eg[:local] = opts
    ds = clone(:eager_graph=>eg)
    ds.eager_graph_associations(ds, model, ds.opts[:eager_graph][:master], [], *associations)
  else
    # Each of the following have a symbol key for the table alias, with the following values: 
    # :reciprocals :: the reciprocal value to use for this association
    # :reflections :: AssociationReflection instance related to this association
    # :requirements :: array of requirements for this association
    # :limits :: Any limit/offset array slicing that need to be handled in ruby land after loading
    opts = {:requirements=>{}, :master=>alias_symbol(first_source), :reflections=>{}, :reciprocals=>{}, :limits=>{}, :local=>opts, :cartesian_product_number=>0, :row_proc=>row_proc}
    ds = clone(:eager_graph=>opts)
    ds.eager_graph_associations(ds, model, ds.opts[:eager_graph][:master], [], *associations).naked
  end
end

#ungraphedObject

Do not attempt to split the result set into associations, just return results as simple objects. This is useful if you want to use eager_graph as a shortcut to have all of the joins and aliasing set up, but want to do something else with the dataset.



2283
2284
2285
2286
2287
2288
2289
# File 'lib/sequel/model/associations.rb', line 2283

def ungraphed
  ds = super.clone(:eager_graph=>nil)
  if (eg = @opts[:eager_graph]) && (rp = eg[:row_proc])
    ds.row_proc = rp
  end
  ds
end