Class: SpatialStats::Local::Stat

Inherits:

Object

Object
SpatialStats::Local::Stat

Defined in:: lib/spatial_stats/local/stat.rb

Overview

Stat is the abstract base class for local stats. It defines the methods that are common between all classes and will raise a NotImplementedError on those that are specific for each type of statistic.

Direct Known Subclasses

BivariateMoran, Geary, GetisOrd, Moran, MultivariateGeary

Instance Attribute Summary collapse

#field ⇒ Object

Returns the value of attribute field.
#scope ⇒ Object

Returns the value of attribute scope.
#weights ⇒ Object

Returns the value of attribute weights.

Class Method Summary collapse

.from_observations(x, weights) ⇒ Stat

A new instance of Stat, from vector and weights.

Instance Method Summary collapse

#crand(permutations, rng) ⇒ Numo::Int32

Conditional randomization algorithm used in permutation testing.
#expectation ⇒ Object
#initialize(scope, field, weights) ⇒ Stat constructor

Base class for local stats.
#mc(permutations = 99, seed = nil) ⇒ Array

Permutation test to determine a pseudo p-values of the #stat method.
#mc_bv(permutations, seed) ⇒ Array

Permutation test to determine a pseudo p-values of the #stat method.
#quads ⇒ Array

Determines what quadrant an observation is in.
#stat ⇒ Object
#summary(permutations = 99, seed = nil) ⇒ Array

Summary of the statistic.
#variance ⇒ Object
#x=(values) ⇒ Object (also: #z=)
#y=(values) ⇒ Object
#z_score ⇒ Array

Z-score for each observation of the statistic.

Constructor Details

#initialize(scope, field, weights) ⇒ `Stat`

Base class for local stats

# File 'lib/spatial_stats/local/stat.rb', line 12

def initialize(scope, field, weights)
  @scope = scope
  @field = field
  @weights = weights.standardize
end

Instance Attribute Details

#field ⇒ `Object`

Returns the value of attribute field.



17
18
19

# File 'lib/spatial_stats/local/stat.rb', line 17

def field
  @field
end

#scope ⇒ `Object`

Returns the value of attribute scope.



17
18
19

# File 'lib/spatial_stats/local/stat.rb', line 17

def scope
  @scope
end

#weights ⇒ `Object`

Returns the value of attribute weights.



17
18
19

# File 'lib/spatial_stats/local/stat.rb', line 17

def weights
  @weights
end

Class Method Details

.from_observations(x, weights) ⇒ `Stat`

A new instance of Stat, from vector and weights.

Parameters:

x (Array) —

observations of dataset
weights (WeightsMatrix) —

to define relationships between observations

Returns:

(Stat)

Raises:

(ArgumentError)

# File 'lib/spatial_stats/local/stat.rb', line 26

def self.from_observations(x, weights)
  raise ArgumentError, 'Data size != weights.n' if x.size != weights.n

  instance = new(nil, nil, weights.standardize)
  instance.x = x
  instance
end

Instance Method Details

#crand(permutations, rng) ⇒ `Numo::Int32`

Conditional randomization algorithm used in permutation testing. Returns a matrix with permuted index values that will be used for selecting values from the original data set.

The width of the matrix is the max number of neighbors + 1 which is way less than it would be if the original vector was shuffled in full.

This is super important because most weight matrices are very sparse so the amount of shuffling/multiplication that is done is reduced drastically.

Returns:

(Numo::Int32) —

matrix of shape perms x wc_max + 1

#expectation ⇒ `Object`

Raises:

(NotImplementedError)



38
39
40

# File 'lib/spatial_stats/local/stat.rb', line 38

def expectation
  raise NotImplementedError, 'method expectation not implemented'
end

#mc(permutations = 99, seed = nil) ⇒ `Array`

Permutation test to determine a pseudo p-values of the #stat method. Shuffles x values, recomputes #stat for each variation, then compares to the computed one. The ratio of more extreme values to permutations is returned for each observation.

Parameters:

permutations (Integer) (defaults to: 99) —

to run. Last digit should be 9 to produce round numbers.
seed (Integer) (defaults to: nil) —

used in random number generator for shuffles.

Returns:

(Array) —

of p-values

#mc_bv(permutations, seed) ⇒ `Array`

Permutation test to determine a pseudo p-values of the #stat method. Shuffles y values, hold x values, recomputes #stat for each variation, then compares to the computed one. The ratio of more extreme values to permutations is returned for each observation.

Parameters:

permutations (Integer) —

to run. Last digit should be 9 to produce round numbers.
seed (Integer) —

used in random number generator for shuffles.

Returns:

(Array) —

of p-values

#quads ⇒ `Array`

Determines what quadrant an observation is in. Based on its value compared to its neighbors. This does not work for all stats, since it requires that values be negative.

In a standardized array of z, high values are values greater than 0 and it’s neighbors are determined by the spatial lag and if that is positive then it’s neighbors would be high, low otherwise.

Quadrants are:

HH: a high value surrounded by other high values
LH: a low value surrounded by high values
LL: a low value surrounded by low values
HL: a high value surrounded by low values

Returns:

(Array) —

of labels

# File 'lib/spatial_stats/local/stat.rb', line 228

def quads
  # https://github.com/pysal/esda/blob/master/esda/moran.py#L925
  z_lag = SpatialStats::Utils::Lag.neighbor_average(weights, z)
  zp = z.map(&:positive?)
  lp = z_lag.map(&:positive?)

  # hh = zp & lp
  # lh = zp ^ true & lp
  # ll = zp ^ true & lp ^ true
  # hl = zp next to lp ^ true
  hh = zp.each_with_index.map { |v, idx| v & lp[idx] }
  lh = zp.each_with_index.map { |v, idx| (v ^ true) & lp[idx] }
  ll = zp.each_with_index.map { |v, idx| (v ^ true) & (lp[idx] ^ true) }
  hl = zp.each_with_index.map { |v, idx| v & (lp[idx] ^ true) }

  # now zip lists and map them to proper terms
  quad_terms = %w[HH LH LL HL]
  hh.zip(lh, ll, hl).map do |feature|
    quad_terms[feature.index(true)]
  end
end

#stat ⇒ `Object`

Raises:

(NotImplementedError)



34
35
36

# File 'lib/spatial_stats/local/stat.rb', line 34

def stat
  raise NotImplementedError, 'method stat not defined'
end

#summary(permutations = 99, seed = nil) ⇒ `Array`

Summary of the statistic. Computes stat, mc, and groups then returns the values in a hash array.

Parameters:

permutations (Integer) (defaults to: 99) —

to run. Last digit should be 9 to produce round numbers.
seed (Integer) (defaults to: nil) —

used in random number generator for shuffles.

Returns:

(Array)

# File 'lib/spatial_stats/local/stat.rb', line 258

def summary(permutations = 99, seed = nil)
  p_vals = mc(permutations, seed)
  data = weights.keys.zip(stat, p_vals, groups)
  data.map do |row|
    { key: row[0], stat: row[1], p: row[2], group: row[3] }
  end
end

#variance ⇒ `Object`

Raises:

(NotImplementedError)



42
43
44

# File 'lib/spatial_stats/local/stat.rb', line 42

def variance
  raise NotImplementedError, 'method variance not implemented'
end

#x=(values) ⇒ `Object` Also known as: z=



46
47
48

# File 'lib/spatial_stats/local/stat.rb', line 46

def x=(values)
  @x = values.standardize
end

#y=(values) ⇒ `Object`



51
52
53

# File 'lib/spatial_stats/local/stat.rb', line 51

def y=(values)
  @y = values.standardize
end

#z_score ⇒ `Array`

Z-score for each observation of the statistic.

Returns:

(Array) —

of the number of deviations from the mean

# File 'lib/spatial_stats/local/stat.rb', line 59

def z_score
  numerators = stat.map { |v| v - expectation }
  denominators = variance.map { |v| Math.sqrt(v) }
  numerators.each_with_index.map do |numerator, idx|
    numerator / denominators[idx]
  end
end

Class: SpatialStats::Local::Stat

Overview

Direct Known Subclasses

Instance Attribute Summary collapse

Class Method Summary collapse

Instance Method Summary collapse

Constructor Details

#initialize(scope, field, weights) ⇒ Stat

Instance Attribute Details

#field ⇒ Object

#scope ⇒ Object

#weights ⇒ Object

Class Method Details

.from_observations(x, weights) ⇒ Stat

Instance Method Details

#crand(permutations, rng) ⇒ Numo::Int32

#expectation ⇒ Object

#mc(permutations = 99, seed = nil) ⇒ Array

#mc_bv(permutations, seed) ⇒ Array

#quads ⇒ Array

#stat ⇒ Object

#summary(permutations = 99, seed = nil) ⇒ Array

#variance ⇒ Object

#x=(values) ⇒ Object Also known as: z=

#y=(values) ⇒ Object

#z_score ⇒ Array

#initialize(scope, field, weights) ⇒ `Stat`

#field ⇒ `Object`

#scope ⇒ `Object`

#weights ⇒ `Object`

.from_observations(x, weights) ⇒ `Stat`

#crand(permutations, rng) ⇒ `Numo::Int32`

#expectation ⇒ `Object`

#mc(permutations = 99, seed = nil) ⇒ `Array`

#mc_bv(permutations, seed) ⇒ `Array`

#quads ⇒ `Array`

#stat ⇒ `Object`

#summary(permutations = 99, seed = nil) ⇒ `Array`

#variance ⇒ `Object`

#x=(values) ⇒ `Object` Also known as: z=

#y=(values) ⇒ `Object`

#z_score ⇒ `Array`