org.platanios.tensorflow.api.ops.training.distribute.strategies
Combines multiple `reduce` calls into one for faster execution.
Reduction method to use.
Sequence of pairs of values to reduce and destinations to copy the corresponding reduced values to.
Reduced values.
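A minimal usage sketch (the method name `batchReduce`, the parameter encoding, and the `Reduction.Mean` value are assumptions; the description above only fixes the semantics):

```scala
// Sketch only: one batched call instead of two separate `reduce` calls.
// `perTowerLoss` and `perTowerGrads` stand for per-tower values produced
// by `forEachTower`; the destination device strings are hypothetical.
val Seq(meanLoss, meanGrads) = distributionStrategy.batchReduce(
  Reduction.Mean,
  Seq(
    perTowerLoss  -> Set("/device:CPU:0"),
    perTowerGrads -> Set("/device:GPU:0", "/device:GPU:1")))
```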
Mirrors `value` to all worker devices.
Value to broadcast.
Destination devices.
Mirrored value.
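For illustration (assuming the method is named `broadcast`; only its behavior is documented above), mirroring a value onto a set of devices might look like:

```scala
// Sketch: copy `globalStep` onto both worker GPUs so that every tower
// can read the same value locally. The device strings are hypothetical.
val mirroredStep = distributionStrategy.broadcast(
  globalStep, Set("/device:GPU:0", "/device:GPU:1"))
```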
Executes `block` within a scope that controls which devices variables will be created on.
No operations should be added to the graph inside this scope; it should only be used when creating variables (some implementations work by changing variable creation and others work by using a `colocateWith` scope). This may only be used inside `DistributionStrategy.scope`.
For example:
```scala
distributionStrategy.scope {
  val variable1 = tf.variable(...)
  distributionStrategy.colocateVariablesWith(Set(variable1.op)) {
    // `variable2` and `variable3` will be created on the same device(s) as `variable1`.
    val variable2 = tf.variable(...)
    val variable3 = tf.variable(...)
  }

  def fn(v1: Variable, v2: Variable, v3: Variable): Unit = {
    // Operates on `v1` from `variable1`, `v2` from `variable2`, and `v3` from `variable3`.
  }

  // `fn` runs on every device `v1` is on, and `v2` and `v3` will be there too.
  distributionStrategy.update(variable1, fn, variable2, variable3)
}
```
Variables created in `block` will be on the same set of devices as these ops.
Code block to execute in this scope.
Value returned by `block`.
Finds and sets the best configuration for the provided TensorFlow session configuration.
Returns a copy of `fn(variable.value)` on `destination`. This is useful for getting a mirrored variable value onto a device. The method will attempt to avoid a copy by checking if the value is already on the destination device.
Variable (which may be mirrored) to copy and fetch.
Device to copy the variable value to.
Optional function to apply to the value on the source device, before copying.
Fetched value on the destination device.
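For example, a sketch of fetching a mirrored variable's value onto the CPU (the argument form is an assumption; the optional pre-copy function is omitted here):

```scala
// Sketch: bring the (possibly mirrored) `accuracy` variable's value onto
// the CPU. If the value already lives on the destination device, the
// implementation avoids the copy, per the description above.
val hostAccuracy = distributionStrategy.fetch(accuracy, "/device:CPU:0")
```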
Runs `fn` once per tower.
`fn` may call `tf.currentTowerContext` to access fields and methods such as `towerID` and `mergeCall()`.
`mergeCall()` is used to communicate between the towers and re-enter the cross-tower context. All towers pause their execution when they encounter a `mergeCall()` call. After that, the `mergeFn` function is executed. Its results are then unwrapped and given back to each tower call, and execution resumes until `fn` is complete or another `mergeCall()` is encountered.
For example:
```scala
// Called once in "cross-tower" context.
def mergeFn(distributionStrategy: DistributionStrategy, threePlusTowerID: Int): tf.Output = {
  // Sum the values across towers.
  tf.addN(distributionStrategy.unwrap(threePlusTowerID))
}

// Called once per tower in `distributionStrategy`, in a "tower" context.
def fn(three: Int): Output = {
  val towerContext = tf.currentTowerContext
  val v = three + towerContext.towerID
  // Computes the sum of the `v` values across all towers.
  val s = towerContext.mergeCall(mergeFn(_, v))
  s + v
}

distributionStrategy.scope {
  // In "cross-tower" context ...
  val mergedResults = distributionStrategy.forEachTower(() => fn(3))
  // `mergedResults` has the values from every tower execution of `fn`.
  val resultsList = distributionStrategy.unwrap(mergedResults)
}
```
Function that will be run once per tower.
Wrapped values that will be unwrapped when invoking `fn` on each tower.
Merged return value of `fn` across all towers.
Acts as a shortcut for `tf.group(distributionStrategy.unwrap(value))`.
A value returned by `forEachTower()`, or a variable created in `scope`.
Name for the created op.
Grouped unwrapped `value`.
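As a sketch (assuming the method is named `group`, with the op name passed as shown), this is handy for turning per-device update results into a single op to run:

```scala
// Sketch: `updates` is a per-device value returned by `forEachTower`.
// Grouping bundles all of its unwrapped component ops into one op,
// equivalent to `tf.group(distributionStrategy.unwrap(updates))`.
val trainOp = distributionStrategy.group(updates, name = "TrainOp")
```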
Returns `true` if there is only a single tower, and `false` otherwise.
If `true`, `forEachTower(fn)` will only call `fn` once. If `false`, `forEachTower(fn)` may call `fn` multiple times.
Merges arguments across towers and runs `mergeFn` in a cross-tower context.
This allows communication and coordination when there are multiple calls to a model function triggered by a call to `forEachTower(modelFn, ...)`. See `MirroredDistribution.forEachTower()` for an explanation.
Otherwise, this is equivalent to:
```scala
val strategy = tf.distribute.currentStrategy
strategy.scope {
  mergeFn(strategy)
}
```
Merge function to invoke from within a cross-tower context.
Result of the `mergeFn` call, except for per-device values which are unpacked.
Returns the devices used for non-slot variables.
Create variables on these devices in a `colocateVariablesWith(nonSlotDevices(...))` block, and then update them using `updateNonSlot()`.
Variables being optimized.
Colocation ops for non-slot variables.
Returns the number of towers, for purposes of averaging across towers.
Returns the devices used for variable and updates placement.
Combines values across towers into one value.
Reduction method to use.
Value to reduce.
Optional destination on which to copy the reduced value.
Reduced value.
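A sketch of reducing a per-tower loss (the `Reduction.Mean` value shown is an assumption; the destination argument is omitted since it is optional per the description above):

```scala
// Sketch: average the per-tower losses into a single value. Since no
// destination is given, the result stays wherever the implementation
// places it.
val meanLoss = distributionStrategy.reduce(Reduction.Mean, perTowerLoss)
```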
Executes `block` within a scope where new variables will not be mirrored.
There will still be one component variable per tower, but there is no requirement that they stay in sync. Instead, when saving them or calling `fetch()`, we use the value that results from calling `reduce()` on all the towers' variables. Note that tower-local implies not trainable. Instead, it is expected that each tower will directly update (e.g., using `assignAdd()`) its local variable instance, but only the aggregated value (accessible using `fetch()`) will be exported from the model. When it is acceptable to only aggregate on export, we greatly reduce communication overhead by using tower-local variables.
Note that all component variables will be initialized to the same value, using the initialization expression from the first tower. The values will match even if the initialization expression uses random numbers.
Reduction method used to get the value to save when creating checkpoints.
Code block to execute in this scope.
Value returned by `block`.
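A sketch of the pattern described above (the scope method's name, `towerLocalVariableScope`, and the `Reduction.Sum` value are assumptions; `assignAdd()` and `fetch()` are named in the text):

```scala
// Sketch: a tower-local counter. Each tower bumps its own un-synced copy
// with `assignAdd`, and only the summed value is read out via `fetch`.
distributionStrategy.towerLocalVariableScope(Reduction.Sum) {
  val examplesSeen = tf.variable(...)  // one component variable per tower
  examplesSeen.assignAdd(batchSize)    // direct, per-tower update
}
```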
Returns the list of all per-device values contained in `value`.
A value returned by `forEachTower()`, or a variable created in `scope`.
Sequence of values contained in `value`.
Runs `fn` to update `variable` using inputs mirrored to the same devices.
If `variable` is mirrored across multiple devices, then this method implements logic like:
```scala
val results = variable.index.map { case (deviceSpec, variable) =>
  tf.createWith(device = deviceSpec.toString) {
    fn(variable)
  }
}
merged(results)
```
Otherwise, this returns `fn(variable)` colocated with `variable`.
Variable to update.
Update function to use.
Mirrored arguments that should be passed to `fn`.
Merged return value of `fn` across all towers.
Runs `fn` on the devices specified by `colocateWith`, with the provided arguments.
Destination on which to execute `fn`.
Function to use for the update.
Mirrored arguments that should be passed to `fn`.
Merged return value of `fn` across all towers.
`InvalidArgumentException`: If the provided `colocateWith` argument is invalid (e.g., too many devices).
Returns a map from worker devices to indices.
TODO: [DISTRIBUTE] Settle on the interface of `forEachTower()` first.
This map might be passed as an argument to `forEachTower()`, as in:
```scala
distributionStrategy.scope {
  def fn(deviceIndex: Int): Unit = {
    // `fn` is being executed on device `distributionStrategy.workerDevices(deviceIndex)`.
  }

  distributionStrategy.forEachTower(fn, distributionStrategy.workerDeviceIndex)
}
```
Returns the devices used to run `forEachTower()` calls.