Class: CTioga2::Data::Dataset

Inherits:

Object

Object
CTioga2::Data::Dataset

Includes:: Log

Defined in:: lib/ctioga2/data/dataset.rb

Overview

This is the central class of the data manipulation in ctioga. It is a series of ‘Y’ DataColumn indexed on a unique ‘X’ DataColumn. This can be used to represent multiple XY data sets, but also XYZ and even more complex data. The actual signification of the various ‘Y’ columns are left to the user.

Instance Attribute Summary collapse

#name ⇒ Object

The name of the Dataset, such as one that could be used in a legend (like for the –auto-legend option of ctioga).
#x ⇒ Object

The X DataColumn.
#ys ⇒ Object

All Y DataColumn (an Array of DataColumn).

Class Method Summary collapse

.create(name, number) ⇒ Object

Creates a.
.dataset_from_spec(name, spec) ⇒ Object

Creates a new Dataset from a specification.
.homogenenous_deltas_indices(indices, vector, tolerance = 1e-3) ⇒ Object

Takes a list of indices, the corresponding vector (ie mapping the indices to the vector gives the actual coordinates) and returns a list of arrays of indices with homogeneous deltas.
.subdivise(x, y, x_idx, y_idx) ⇒ Object

Takes a list of x and y values, and subdivise into non-overlapping groups.

Instance Method Summary collapse

#<<(dataset) ⇒ Object

Concatenates another Dataset to this one.
#all_columns ⇒ Object

Returns all DataColumn objects held by this Dataset.
#apply_formulas(formula) ⇒ Object

Applies formulas to values.
#average_duplicates!(mode = :avg) ⇒ Object

Average all the non-X values of successive data points that have the same X values.
#column_names ⇒ Object

Returns an array with Column names.
#each_values(with_errors = false, expand_nil = true) ⇒ Object

Iterates over all the values of the Dataset.
#has_xy_errors? ⇒ Boolean

Returns true if X or Y columns have errors.
#homogeneous_dtables ⇒ Object

Returns a series of IndexedDTable representing the XYZ data.
#index_on_cols(cols = [2]) ⇒ Object

Returns a hash of Datasets indexed on the values of the columns cols.
#indexed_table ⇒ Object

Returns an IndexedDTable representing the XYZ data.
#initialize(name, columns) ⇒ Dataset constructor

Creates a new Dataset object with the given data columns (Dvector or DataColumn).
#make_contour(level) ⇒ Object

Returns a x,y Function.
#merge_datasets_in(datasets, columns = [0], precision = nil) ⇒ Object

Merges one or more other data sets into this one; one or more columns are designated as “master” columns and their values must match in all datasets.
#naive_smooth!(number) ⇒ Object

Smooths the data using a naive gaussian-like convolution (but not exactly).
#push_only_values(values) ⇒ Object

Almost the same thing as #push_values, but when you don’t care about the min/max things.
#push_values(*values) ⇒ Object

Appends the given values (as yielded by each_values(true)) to the stack.
#reglin(options = {}) ⇒ Object

Massive linear regressions over all X and Y values corresponding to a unique set of all the other Y2…
#select!(evaluator) ⇒ Object

Modifies the dataset to only keep the data for which the block returns true.
#select_formula!(formula) ⇒ Object

Same as #select!, but you give it a text formula instead of a block.
#size ⇒ Object

The overall number of columns.
#sort! ⇒ Object

Sorts all columns according to X values.
#trim!(nb) ⇒ Object

Trims all data columns.
#y ⇒ Object

The main Y column (ie, the first one).
#z ⇒ Object

The Z column, if applicable.
#z_columns ⇒ Object

The number of Z columns.

Methods included from Log

context, counts, debug, error, fatal, #format_exception, #identify, info, init_logger, log_to, logger, set_level, #spawn, warn

Constructor Details

#initialize(name, columns) ⇒ `Dataset`

Creates a new Dataset object with the given data columns (Dvector or DataColumn). #x is the first one

# File 'lib/ctioga2/data/dataset.rb', line 50

def initialize(name, columns)
  columns.each_index do |i|
    if columns[i].is_a? Dobjects::Dvector
      columns[i] = DataColumn.new(columns[i])
    end
  end
  @x = columns[0]
  @ys = columns[1..-1]
  @name = name

  # Cache for the indexed dtable
  @indexed_dtable = nil

  # Cache for the homogeneous dtables
  @homogeneous_dtables = nil
end

Instance Attribute Details

#name ⇒ `Object`

The name of the Dataset, such as one that could be used in a legend (like for the –auto-legend option of ctioga).



44
45
46

# File 'lib/ctioga2/data/dataset.rb', line 44

def name
  @name
end

#x ⇒ `Object`

The X DataColumn



37
38
39

# File 'lib/ctioga2/data/dataset.rb', line 37

def x
  @x
end

#ys ⇒ `Object`

All Y DataColumn (an Array of DataColumn)



40
41
42

# File 'lib/ctioga2/data/dataset.rb', line 40

def ys
  @ys
end

Class Method Details

.create(name, number) ⇒ `Object`

Creates a

# File 'lib/ctioga2/data/dataset.rb', line 68

def self.create(name, number)
  cols = []
  number.times do
    cols << Dobjects::Dvector.new()
  end
  return self.new(name, cols)
end

.dataset_from_spec(name, spec) ⇒ `Object`

Creates a new Dataset from a specification. This function parses a specification in the form of:

a:b:c+
spec=a:spec2=b+

It yields each of the unprocessed text, not necessarily in the order they were read, and expects a Dvector as a return value.

It then builds a suitable Dataset object with these values, and returns it.

It is strongly recommended to use this function for reimplementations of Backends::Backend#query_dataset.

# File 'lib/ctioga2/data/dataset.rb', line 89

def self.dataset_from_spec(name, spec)
  specs = []
  i = 0
  for s in spec.split_at_toplevel(/:/)
    if s =~ /^(x|y\d*|z)(#{DataColumn::ColumnSpecsRE})=(.*)/i
      which, mod, s = $1.downcase,($2 && $2.downcase) || "value",$3
      
      case which
      when /x/
        idx = 0
      when /y(\d+)?/
        if $1
          idx = $1.to_i
        else
          idx = 1
        end
      when /z/
        idx = 2
      end
      specs[idx] ||= {}
      specs[idx][mod] = yield s
    else
      specs[i] = {"value" =>  yield(s)}
      i += 1
    end
  end
  columns = []
  for s in specs
    columns << DataColumn.from_hash(s)
  end
  return Dataset.new(name, columns)
end

.homogenenous_deltas_indices(indices, vector, tolerance = 1e-3) ⇒ `Object`

Takes a list of indices, the corresponding vector (ie mapping the indices to the vector gives the actual coordinates) and returns a list of arrays of indices with homogeneous deltas.

# File 'lib/ctioga2/data/dataset.rb', line 427

def self.homogenenous_deltas_indices(indices, vector, tolerance = 1e-3)
  vct = indices.map do |i|
    vector[i]
  end
  subdiv = Utils::split_homogeneous_deltas(vct, tolerance)
  rv = []
  idx = 0
  for s in subdiv
    rv << indices[idx..idx+s.size-1]
    idx += s.size
  end
  if idx != indices.size
    error { "blundered ?" }
  end
  return rv
end

.subdivise(x, y, x_idx, y_idx) ⇒ `Object`

Takes a list of x and y values, and subdivise into non-overlapping groups.

# File 'lib/ctioga2/data/dataset.rb', line 348

def self.subdivise(x,y, x_idx, y_idx)

  # We make a list of sets. Each element of the list represent
  # one column, and in each set we store the index of of lines
  # that contain data.

  cols = []
  
  x.each_index do |i|
    ix = x_idx[x[i]]
    iy = y_idx[y[i]]

    cols[ix] ||= Set.new
    cols[ix].add(iy)
  end

  # The return value is an array of [ [xindices] [yindices]]
  ret = []

  # Now, the hard part.

  # We run for as long as there are sets ?
  fc = 0
  while fc < cols.size
    # We start with the set of the current column
    st = cols[fc]
    # Empty, go to next column
    if st.size == 0
      fc += 1
      next
    end

    # Set columns that contain the set
    set_cols = [fc]
    # Now, we look for restrictions on the set.
    fc2 = fc + 1
    while fc2 < cols.size
      # if non-void intersection, we stick to that
      inter = st.intersection(cols[fc2])
      # p [fc, fc2, st, inter]
      if inter.size > 0
        st = inter
        set_cols << fc2
        fc2 += 1
        break
      end

      fc2 += 1
      # Try to implement other kinds of restrictions?
    end

    # Now, we have a decent set, we go on until the intersection
    # with the set is not the set.
    while fc2 < cols.size
      inter = st.intersection(cols[fc2])
      if inter.size > 0
        if inter.size == st.size
          set_cols << fc2
        else
          break
        end
      end
      fc2 += 1
    end

    # Now, we have a set and all the indices that match.
    ret << [ set_cols.dup.sort, st.to_a.sort ]
    # And, now, go again through all the columns and remove the set
    for c in set_cols
      cols[c].subtract(st)
    end
  end

  return ret
end

Instance Method Details

#<<(dataset) ⇒ `Object`

Concatenates another Dataset to this one

# File 'lib/ctioga2/data/dataset.rb', line 183

def <<(dataset)
  if dataset.size != self.size
    raise "Can't concatenate datasets that don't have the same number of columns: #{self.size} vs #{dataset.size}"
  end
  @x << dataset.x
  @ys.size.times do |i|
    @ys[i] << dataset.ys[i]
  end
end

#all_columns ⇒ `Object`

Returns all DataColumn objects held by this Dataset



760
761
762

# File 'lib/ctioga2/data/dataset.rb', line 760

def all_columns
  return [@x, *@ys]
end

#apply_formulas(formula) ⇒ `Object`

Applies formulas to values. Formulas are like text-backend specification: “:”-separated specs of the target

# File 'lib/ctioga2/data/dataset.rb', line 311

def apply_formulas(formula)
  columns = []
  columns << Dobjects::Dvector.new(@x.size) do |i|
    i
  end
  columns << @x.values
  for y in @ys
    columns << y.values
  end

  # Names:
  heads = {
    'x' => 1,
    'y' => 2,
    'z' => 3,
  }
  i = 1
  for f in @ys
    heads["y#{i}"] = i+1
    i += 1
  end

  result = []
  for f in formula.split(/:/) do
    fm = Utils::parse_formula(f, nil, heads)
    debug { 
      "Using formula #{fm} for column spec: #{f} (##{result.size})" 
    }
    result << DataColumn.new(Dobjects::Dvector.
                             compute_formula(fm, 
                                             columns))
  end
  return Dataset.new(name + "_mod", result)
end

#average_duplicates!(mode = :avg) ⇒ `Object`

Average all the non-X values of successive data points that have the same X values. It is a naive version that also averages the error columns.

# File 'lib/ctioga2/data/dataset.rb', line 272

def average_duplicates!(mode = :avg)
  last_x = nil
  last_x_first_idx = 0
  xv = @x.values
  i = 0
  vectors = all_vectors
  nb_x = 0
  while i < xv.size
    x = xv[i]
    if ((last_x == x) && (i != (xv.size - 1)))
      # Do nothing
    else
      if last_x_first_idx <= (i - 1)  || 
          ((last_x == x) && (i == (xv.size - 1)))
        if i == (xv.size - 1)
          e = i
        else
          e = i-1
        end                 # The end of the slice.

        # Now, we delegate to the columns the task of averaging.
        @x.average_over(last_x_first_idx, e, nb_x, :avg)
        for c in @ys
          c.average_over(last_x_first_idx, e, nb_x, mode)
        end
        nb_x += 1
      end
      last_x = x
      last_x_first_idx = i
    end
    i += 1
  end
  for c in all_columns
    c.resize!(nb_x)
  end
end

#column_names ⇒ `Object`

Returns an array with Column names.

# File 'lib/ctioga2/data/dataset.rb', line 152

def column_names
  retval = @x.column_names("x")
  @ys.each_index do |i|
    retval += @ys[i].column_names("y#{i+1}")
  end
  return retval
end

#each_values(with_errors = false, expand_nil = true) ⇒ `Object`

Iterates over all the values of the Dataset. Values of optional arguments are those of DataColumn::values_at.

# File 'lib/ctioga2/data/dataset.rb', line 162

def each_values(with_errors = false, expand_nil = true)
  @x.size.times do |i|
    v = @x.values_at(i,with_errors, expand_nil)
    for y in @ys
      v += y.values_at(i,with_errors, expand_nil)
    end
    yield i, *v
  end
end

#has_xy_errors? ⇒ `Boolean`

Returns true if X or Y columns have errors

Returns:

(Boolean)



133
134
135

# File 'lib/ctioga2/data/dataset.rb', line 133

def has_xy_errors?
  return self.y.has_errors? || self.x.has_errors?
end

#homogeneous_dtables ⇒ `Object`

Returns a series of IndexedDTable representing the XYZ data.

# File 'lib/ctioga2/data/dataset.rb', line 445

def homogeneous_dtables()
  if @homogeneous_dtables
    return @homogeneous_dtables
  end
  if @ys.size < 2
    raise "Need at least 3 data columns in dataset '#{@name}'"
  end
  # We convert the index into three x,y and z arrays
  x = @x.values.dup
  y = @ys[0].values.dup
  z = @ys[1].values.dup
  
  xvals = x.sort.uniq
  yvals = y.sort.uniq
  
  # Now building reverse hashes to speed up the conversion:
  x_index = {}
  i = 0
  xvals.each do |v|
    x_index[v] = i
    i += 1
  end

  y_index = {}
  i = 0
  yvals.each do |v|
    y_index[v] = i
    i += 1
  end

  fgrps = []
  if x.size != xvals.size * yvals.size
    # This is definitely not a homogeneous map
    fgrps = Dataset.subdivise(x, y, x_index, y_index)
  else
    fgrps = [ [ x_index.values, y_index.values ] ]
  end

  # Now, we resplit according to the deltas:
  grps = []
  for grp in fgrps
    xv, yv = *grp

    xv_list = Dataset.homogenenous_deltas_indices(xv, xvals)
    yv_list = Dataset.homogenenous_deltas_indices(yv, yvals)

    for cxv in xv_list
      for cyv in yv_list
        grps << [ cxv, cyv]
      end
    end
  end

  # Now we construct a list of indexed dtables
  rv = []
  for grp in grps
    xv = grp[0].sort
    yv = grp[1].sort

    # Build up intermediate hashes
    xvh = {}
    xvl = []
    idx = 0
    for xi in xv
      val = xvals[xi]
      xvh[val] = idx
      xvl << val
      idx += 1
    end

    yvh = {}
    yvl = []
    idx = 0
    for yi in yv
      val = yvals[yi]
      yvh[val] = idx
      yvl << val
      idx += 1
    end
    
    table = Dobjects::Dtable.new(xv.size, yv.size)
    # We initialize all the values to NaN
    table.set(0.0/0.0)
  
    x.each_index do |i|
      ix = xvh[x[i]]
      next unless ix
      iy = yvh[y[i]]
      next unless iy
      # Y first !
      table[iy, ix] = z[i]
    end
    rv << IndexedDTable.new(xvl, yvl, table)
  end
  @homogeneous_dtables = rv
  return rv
end

#index_on_cols(cols = [2]) ⇒ `Object`

Returns a hash of Datasets indexed on the values of the columns cols. Datasets contain the same number of columns.

# File 'lib/ctioga2/data/dataset.rb', line 624

def index_on_cols(cols = [2])
  # Transform column number into index in the each_values call
  cols.map! do |i|
    i*3 
  end

  datasets = {}
  self.each_values(true) do |i,*values|
    signature = cols.map do |i|
      values[i]
    end
    datasets[signature] ||= Dataset.create(name, self.size)
    datasets[signature].push_values(*values)
  end
  return datasets
end

#indexed_table ⇒ `Object`

TODO:

For performance, this will have to be turned into a real

TODO:

The cache should be invalidated when the contents of the

Returns an IndexedDTable representing the XYZ data. Information about errors are not included.

Dtable or Dvector class function. This function is just going to be bad ;-)

Dataset changes (but that will be real hard !)

# File 'lib/ctioga2/data/dataset.rb', line 554

def indexed_table
  if @indexed_dtable
    return @indexed_dtable
  end
  if @ys.size < 2
    raise "Need at least 3 data columns in dataset '#{@name}'"
  end
  # We convert the index into three x,y and z arrays
  x = @x.values.dup
  y = @ys[0].values.dup
  z = @ys[1].values.dup
  
  xvals = x.sort.uniq
  yvals = y.sort.uniq
  
  # Now building reverse hashes to speed up the conversion:
  x_index = {}
  i = 0
  xvals.each do |v|
    x_index[v] = i
    i += 1
  end

  y_index = {}
  i = 0
  yvals.each do |v|
    y_index[v] = i
    i += 1
  end

  if x.size != xvals.size * yvals.size
    error {"Heterogeneous, stopping here for now"}
  end

  table = Dobjects::Dtable.new(xvals.size, yvals.size)
  # We initialize all the values to NaN
  table.set(0.0/0.0)
  
  x.each_index do |i|
    ix = x_index[x[i]]
    iy = y_index[y[i]]
    # Y first !
    table[iy, ix] = z[i]
  end
  @indexed_dtable = IndexedDTable.new(xvals, yvals, table)
  return @indexed_dtable
end

#make_contour(level) ⇒ `Object`

TODO:

add algorithm

Returns a x,y Function

# File 'lib/ctioga2/data/dataset.rb', line 605

def make_contour(level)
  table = indexed_table
  return table.make_contour(level, {'ret' => 'func'} )
end

#merge_datasets_in(datasets, columns = [0], precision = nil) ⇒ `Object`

TODO:

update column names.

TODO:

write provisions for column names, actually ;-)…

Merges one or more other data sets into this one; one or more columns are designated as “master” columns and their values must match in all datasets. Extra columns are simply appended, in the order in which the datasets are given

Comparisons between the values are made in abritrary precision unless precision is given, in which case values only have to match to this given number of digits.

# File 'lib/ctioga2/data/dataset.rb', line 700

def merge_datasets_in(datasets, columns = [0], precision = nil)
  # First thing, the data precision block:

  prec = if precision then
           proc do |x|
      ("%.#{@precision}g" % x) # This does not need to be a Float
    end
         else
           proc {|x| x}   # For exact comparisons
         end

  # First, we build an index of the master columns of the first
  # dataset.

  hash = {}
  self.each_values(false) do |i, *cols|
    signature = columns.map {|j|
      prec.call(cols[j])
    }
    hash[signature] = i
  end

  remove_indices = columns.sort.reverse

  for set in datasets
    old_columns = set.all_columns
    for i in remove_indices
      old_columns.slice!(i)
    end

    # Now, we got rid of the master columns, we add the given
    # number of columns

    new_columns = []
    old_columns.each do |c|
      new_columns << DataColumn.create(@x.size, c.has_errors?)
    end

    set.each_values(false) do |i, *cols|
      signature = columns.map {|j|
        prec.call(cols[j])
      }
      idx = hash[signature]
      if idx
        old_columns.each_index  { |j|
          new_columns[j].
          set_values_at(idx, 
                        * old_columns[j].values_at(i, true, true))
        }
      else
        # Data points are lost
      end
    end
    @ys.concat(new_columns)
  end

end

#naive_smooth!(number) ⇒ `Object`

Smooths the data using a naive gaussian-like convolution (but not exactly). Not for use for reliable data filtering.

# File 'lib/ctioga2/data/dataset.rb', line 612

def naive_smooth!(number)
  kernel = Dobjects::Dvector.new(number) { |i|
    Utils.cnk(number,i)
  }
  mid = number - number/2 - 1
  for y in @ys
    y.convolve!(kernel, mid)
  end
end

#push_only_values(values) ⇒ `Object`

Almost the same thing as #push_values, but when you don’t care about the min/max things.

# File 'lib/ctioga2/data/dataset.rb', line 214

def push_only_values(values)
  @x.push_values(values[0])
  @ys.size.times do |i|
    @ys[i].push_values(values[i+1])
  end
end

#push_values(*values) ⇒ `Object`

Appends the given values (as yielded by each_values(true)) to the stack. Elements of values laying after the last DataColumn in the Dataset are simply ignored. Giving less than there should be will give interesting results.

# File 'lib/ctioga2/data/dataset.rb', line 205

def push_values(*values)
  @x.push_values(*(values[0..2]))
  @ys.size.times do |i|
    @ys[i].push_values(*(values.slice(3*(i+1),3)))
  end
end

#reglin(options = {}) ⇒ `Object`

TODO:

Have the possibility to elaborate on the regression side

Massive linear regressions over all X and Y values corresponding to a unique set of all the other Y2… Yn values.

Returns the [coeffs, lines]

(in particular force b to 0)

# File 'lib/ctioga2/data/dataset.rb', line 650

def reglin(options = {})
  cols = []
  2.upto(self.size-1) do |i|
    cols << i
  end
  datasets = index_on_cols(cols)

  # Create two new datasets:
  # * one that collects the keys and a,b
  # * another that collects the keys and x1,y1, x2y2
  coeffs = Dataset.create("coefficients", self.size)
  lines = Dataset.create("lines", self.size)

  for k,v in datasets
    f = Dobjects::Function.new(v.x.values, v.y.values)
    if options['linear']  # Fit to y = a*x
      d = f.x.dup
      d.mul!(f.x)
      sxx = d.sum
      d.replace(f.x)
      d.mul!(f.y)
      sxy = d.sum
      a = sxy/sxx
      coeffs.push_only_values(k + [a,0])
      lines.push_only_values(k + [f.x.min, a * f.x.min])
      lines.push_only_values(k + [f.x.max, a * f.x.max])
    else
      a,b = f.reglin
      coeffs.push_only_values(k + [a, b])
      lines.push_only_values(k + [f.x.min, b + a * f.x.min])
      lines.push_only_values(k + [f.x.max, b + a * f.x.max])
    end
    
  end

  return [coeffs, lines]
end

#select!(evaluator) ⇒ `Object`

Modifies the dataset to only keep the data for which the block returns true. The block should take the following arguments, in order:

x, xmin, xmax, y, ymin, ymax, y1, y1min, y1max,

_z_, _zmin_, _zmax_, _y2_, _y2min_, _y2max_, _y3_, _y3min_, _y3max_

# File 'lib/ctioga2/data/dataset.rb', line 228

def select!(evaluator)
  target = []
  @x.size.times do |i|
    args = @x.values_at(i, true)
    args.concat(@ys[0].values_at(i, true) * 2)
    if @ys[1]
      args.concat(@ys[1].values_at(i, true) * 2)
      for yvect in @ys[2..-1]
        args.concat(yvect.values_at(i, true))
      end
    end
    if evaluator.compute_unsafe(*args)
      target << i
    end
  end
  for col in all_columns
    col.reindex(target)
  end
end

#select_formula!(formula) ⇒ `Object`

Same as #select!, but you give it a text formula instead of a block. It internally calls #select!, by the way ;-)…

# File 'lib/ctioga2/data/dataset.rb', line 250

def select_formula!(formula)
  names = @x.column_names('x', true)
  names.concat(@x.column_names('y', true))
  names.concat(@x.column_names('y1', true))
  if @ys[1]
    names.concat(@x.column_names('z', true))
    names.concat(@x.column_names('y2', true))
    i = 3
    for yvect in @ys[2..-1]
      names.concat(@x.column_names("y#{i}", true))
      i += 1
    end
  end
  evaluator = Ruby.make_evaluator(formula, names)
  select!(evaluator)
end

#size ⇒ `Object`

The overall number of columns



173
174
175

# File 'lib/ctioga2/data/dataset.rb', line 173

def size
  return 1 + @ys.size
end

#sort! ⇒ `Object`

Sorts all columns according to X values

# File 'lib/ctioga2/data/dataset.rb', line 138

def sort!
  idx_vector = Dobjects::Dvector.new(@x.values.size) do |i|
    i
  end
  f = Dobjects::Function.new(@x.values.dup, idx_vector)
  f.sort
  # Now, idx_vector contains the indices that make X values
  # sorted.
  for col in all_columns
    col.reindex(idx_vector)
  end
end

#trim!(nb) ⇒ `Object`

Trims all data columns. See DataColumn#trim!

# File 'lib/ctioga2/data/dataset.rb', line 195

def trim!(nb)
  for col in all_columns
    col.trim!(nb)
  end
end

#y ⇒ `Object`

The main Y column (ie, the first one)



123
124
125

# File 'lib/ctioga2/data/dataset.rb', line 123

def y
  return @ys[0]
end

#z ⇒ `Object`

The Z column, if applicable



128
129
130

# File 'lib/ctioga2/data/dataset.rb', line 128

def z
  return @ys[1]
end

#z_columns ⇒ `Object`

The number of Z columns



178
179
180

# File 'lib/ctioga2/data/dataset.rb', line 178

def z_columns
  return @ys.size - 1
end

Class: CTioga2::Data::Dataset

Overview

Instance Attribute Summary collapse

Class Method Summary collapse

Instance Method Summary collapse

Methods included from Log

Constructor Details

#initialize(name, columns) ⇒ Dataset

Instance Attribute Details

#name ⇒ Object

#x ⇒ Object

#ys ⇒ Object

Class Method Details

.create(name, number) ⇒ Object

.dataset_from_spec(name, spec) ⇒ Object

.homogenenous_deltas_indices(indices, vector, tolerance = 1e-3) ⇒ Object

.subdivise(x, y, x_idx, y_idx) ⇒ Object

Instance Method Details

#<<(dataset) ⇒ Object

#all_columns ⇒ Object

#apply_formulas(formula) ⇒ Object

#average_duplicates!(mode = :avg) ⇒ Object

#column_names ⇒ Object

#each_values(with_errors = false, expand_nil = true) ⇒ Object

#has_xy_errors? ⇒ Boolean

#homogeneous_dtables ⇒ Object

#index_on_cols(cols = [2]) ⇒ Object

#indexed_table ⇒ Object

#make_contour(level) ⇒ Object

#merge_datasets_in(datasets, columns = [0], precision = nil) ⇒ Object

#naive_smooth!(number) ⇒ Object

#push_only_values(values) ⇒ Object

#push_values(*values) ⇒ Object

#reglin(options = {}) ⇒ Object

#select!(evaluator) ⇒ Object

#select_formula!(formula) ⇒ Object

#size ⇒ Object

#sort! ⇒ Object

#trim!(nb) ⇒ Object

#y ⇒ Object

#z ⇒ Object

#z_columns ⇒ Object

#initialize(name, columns) ⇒ `Dataset`

#name ⇒ `Object`

#x ⇒ `Object`

#ys ⇒ `Object`

.create(name, number) ⇒ `Object`

.dataset_from_spec(name, spec) ⇒ `Object`

.homogenenous_deltas_indices(indices, vector, tolerance = 1e-3) ⇒ `Object`

.subdivise(x, y, x_idx, y_idx) ⇒ `Object`

#<<(dataset) ⇒ `Object`

#all_columns ⇒ `Object`

#apply_formulas(formula) ⇒ `Object`

#average_duplicates!(mode = :avg) ⇒ `Object`

#column_names ⇒ `Object`

#each_values(with_errors = false, expand_nil = true) ⇒ `Object`

#has_xy_errors? ⇒ `Boolean`

#homogeneous_dtables ⇒ `Object`

#index_on_cols(cols = [2]) ⇒ `Object`

#indexed_table ⇒ `Object`

#make_contour(level) ⇒ `Object`

#merge_datasets_in(datasets, columns = [0], precision = nil) ⇒ `Object`

#naive_smooth!(number) ⇒ `Object`

#push_only_values(values) ⇒ `Object`

#push_values(*values) ⇒ `Object`

#reglin(options = {}) ⇒ `Object`

#select!(evaluator) ⇒ `Object`

#select_formula!(formula) ⇒ `Object`

#size ⇒ `Object`

#sort! ⇒ `Object`

#trim!(nb) ⇒ `Object`

#y ⇒ `Object`

#z ⇒ `Object`

#z_columns ⇒ `Object`