Package

scala.tools.nsc.backend.jvm

analysis

Permalink

package analysis

Summary on the ASM analyzer framework --------------------------------------

Value

Interpreter

Frame

Analyzer

MaxLocals and MaxStack ----------------------

At the JVM level, long and double values occupy two slots, both as local variables and on the stack, as specified in the JVM spec 2.6.2: "At any point in time, an operand stack has an associated depth, where a value of type long or double contributes two units to the depth and a value of any other type contributes one unit."

For example, a method class A { def f(a: Long, b: Long) = a + b } has MAXSTACK=4 in the classfile. This value is computed by the ClassWriter / MethodWriter when generating the classfile (we always pass COMPUTE_MAXS to the ClassWriter).

For running an ASM Analyzer, long and double values occupy two local variable slots, but only a single slot on the call stack, as shown by the following snippet:

import scala.tools.nsc.backend.jvm._ import scala.tools.nsc.backend.jvm.opt.BytecodeUtils._ import scala.collection.convert.decorateAsScala._ import scala.tools.asm.tree.analysis._

val cn = AsmUtils.readClass("/Users/luc/scala/scala/sandbox/A.class") val m = cn.methods.iterator.asScala.find(_.name == "f").head

// the value is read from the classfile, so it's 4 println(s"maxLocals: ${m.maxLocals}, maxStack: ${m.maxStack}") // maxLocals: 5, maxStack: 4

// we can safely set it to 2 for running the analyzer. m.maxStack = 2

val a = new Analyzer(new BasicInterpreter) a.analyze(cn.name, m) val addInsn = m.instructions.iterator.asScala.find(_.getOpcode == 97).get // LADD Opcode val addFrame = a.frameAt(addInsn, m)

addFrame.getStackSize // 2: the two long values only take one slot each addFrame.getLocals // 5: this takes one slot, the two long parameters take 2 slots each

While running the optimizer, we need to make sure that the maxStack value of a method is large enough for running an ASM analyzer. We don't need to worry if the value is incorrect in the JVM perspective: the value will be re-computed and overwritten in the ClassWriter.

Lessons learnt while benchmarking the alias tracking analysis -------------------------------------------------------------

Profiling

ASM analyzer insights

To benchmark an analysis, instead of benchmarking analysis while it runs in the compiler backend, one can easily run it from a separate program (or the repl). The bytecode to analyze can simply be parsed from a classfile. See example at the end of this comment.

Nullness Analysis in Miguel's Optimizer ---------------------------------------

Miguel implemented alias tracking for nullness analysis differently [1]. Remember that every frame has an array of values. Miguel's idea was to represent aliasing using reference equality in the values array: if two entries in the array point to the same value object, the two entries are aliases in the frame of the given instruction.

While this idea seems elegant at first sight, Miguel's implementation does not merge frames correctly when it comes to aliasing. Assume in frame 1, values (a, b, c) are aliases, while in frame 2 (a, b) are aliases. When merging the second into the first, we have to make sure that c is removed as an alias of (a, b).

It would be possible to implement correct alias set merging in Miguel's approach. However, frame merging is the main hot spot of analysis. The computational complexity of implementing alias set merging by traversing the values array and comparing references is too high. The concrete alias set representation that is used in the current implementation (see class AliasingFrame) makes alias set merging more efficient.

[1] https://github.com/scala-opt/scala/blob/opt/rebase/src/compiler/scala/tools/nsc/backend/bcode/NullnessPropagator.java

Complexity and scaling of analysis ----------------------------------

The time complexity of a data flow analysis depends on:

I measured the running time of an analysis for two examples:

I measured nullness analysis (which tracks aliases) and a SimpleValue analysis. Nullness runs roughly 5x slower (because of alias tracking) at every problem size - this factor doesn't change.

The numbers below are for nullness. Note that the the last column is constant, i.e., the running time is proportional to #ins * #loc^2. Therefore we use this factor when limiting the maximal method size for running an analysis.

#insns #locals time (ms) time / #ins * #loc2 * 106 1305 156 34 1.07 2610 311 165 0.65 3915 466 490 0.57 5220 621 1200 0.59 6525 776 2220 0.56 7830 931 3830 0.56 9135 1086 6570 0.60 10440 1241 9700 0.60 11745 1396 13800 0.60

As a second experiment, nullness analysis was run with varying #insns but constant #locals. The last column shows linear complexity with respect to the method size (linearOffset = 2279):

#insns #locals time (ms) (time + linearOffset) / #insns 5220 621 1090 0.645 6224 621 1690 0.637 7226 621 2280 0.630 8228 621 2870 0.625 9230 621 3530 0.629 10232 621 4130 0.626 11234 621 4770 0.627 12236 621 5520 0.637 13238 621 6170 0.638

When running a BasicValue analysis, the complexity observation is the same (time is proportional to #ins * #loc^2).

Measuring analysis execution time ---------------------------------

See code below.

Source
package.scala
Linear Supertypes
Content Hierarchy
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. analysis
  2. AnyRef
  3. Any
  1. Hide All
  2. Show all
Visibility
  1. Public
  2. All

Type Members

  1. class AliasSet extends AnyRef

    Permalink

    An efficient mutable bit set.

  2. class AliasingAnalyzer[V <: Value] extends Analyzer[V]

    Permalink

    An analyzer that uses AliasingFrames instead of bare Frames.

    An analyzer that uses AliasingFrames instead of bare Frames. This can be used when an analysis needs to track aliases, but doesn't require a more specific Frame subclass.

  3. class AliasingFrame[V <: Value] extends Frame[V]

    Permalink

    A subclass of Frame that tracks aliasing of values stored in local variables and on the stack.

    A subclass of Frame that tracks aliasing of values stored in local variables and on the stack.

    Note: an analysis tracking aliases is roughly 5x slower than a usual analysis (assuming a simple value domain with a fast merge function). For example, nullness analysis is roughly 5x slower than a BasicValue analysis.

    See the doc of package object analysis for some notes on the performance of alias analysis.

  4. class BackendUtils[BT <: BTypes] extends AnyRef

    Permalink

    This component hosts tools and utilities used in the backend that require access to a BTypes instance.

    This component hosts tools and utilities used in the backend that require access to a BTypes instance.

    One example is the AsmAnalyzer class, which runs computeMaxLocalsMaxStack on the methodNode to be analyzed. This method in turn lives inside the BTypes assembly because it queries the per-run cache maxLocalsMaxStackComputed defined in there.

  5. case class ExceptionProducer[V <: Value](handlerFrame: Frame[V]) extends InitialProducer with Product with Serializable

    Permalink
  6. abstract class InitialProducer extends AbstractInsnNode

    Permalink

    A class for pseudo-instructions representing the initial producers of local values that have no producer instruction in the method:

    A class for pseudo-instructions representing the initial producers of local values that have no producer instruction in the method:

    • parameters, including this
    • uninitialized local variables
    • exception values in handlers

    The ASM built-in SourceValue analysis yields an empty producers set for such values. This leads to ambiguities. Example (in Java one can re-assign parameter):

    void foo(int a) { if (a == 0) a = 1; return a; }

    In the first frame of the method, the SoruceValue for parameter a gives an empty set of producer instructions.

    In the frame of the IRETURN instruction, the SoruceValue for parameter a lists a single producer instruction: the ISTORE 1. This makes it look as if there was a single producer for a, where in fact it might still hold the parameter's initial value.

  7. class InitialProducerSourceInterpreter extends SourceInterpreter

    Permalink
  8. abstract class IntIterator extends Iterator[Int]

    Permalink

    An iterator over Int (required to prevent boxing the result of next).

  9. class NullnessAnalyzer extends Analyzer[NullnessValue]

    Permalink

    This class is required to override the newFrame methods, which makes makes sure the analyzer uses NullnessFrames.

  10. class NullnessFrame extends AliasingFrame[NullnessValue]

    Permalink
  11. final class NullnessInterpreter extends Interpreter[NullnessValue]

    Permalink
  12. sealed abstract class NullnessValue extends Value

    Permalink

    Represents the nullness state for a local variable or stack value.

    Represents the nullness state for a local variable or stack value.

    Note that nullness of primitive values is not tracked, it will be always unknown.

  13. case class ParameterProducer(local: Int) extends InitialProducer with Product with Serializable

    Permalink
  14. trait ProdConsAnalyzerImpl extends AnyRef

    Permalink

    This class provides additional queries over ASM's built-in SourceValue analysis.

    This class provides additional queries over ASM's built-in SourceValue analysis.

    The analysis computes for each value in a frame a set of source instructions, which are the potential producers. Most instructions produce either nothing or a stack value. For example, a LOAD instruction is the producer of the value pushed onto the stack. The exception are STORE instructions, which produce a new value for a local variable slot, so they are used as producers for the value they stored.

    Note that pseudo-instructions are used as initial producers for parameters and local variables. See the documentation on class InitialProducer.

    This class implements the following queries over the data computed by the SourceValue analysis:

    • producersForValueAt(insn, slot)
    • consumersOfValueAt(insn, slot)
    • producersForInputsOf(insn)
    • consumersOfOutputsFrom(insn)
    • initialProducersForValueAt(insn, slot)
    • ultimateConsumersOfValueAt(insn, slot)
    • initialProducersForInputsOf(insn)
    • ultimateConsumersOfOutputsFrom(insn)

    The following operations are considered as copying operations:

    • xLOAD, xSTORE
    • DUP, DUP2, DUP_X1, DUP_X2, DUP2_X1, DUP2_X2
    • SWAP
    • CHECKCAST

    If ever needed, we could introduce a mode where primitive conversions (l2i) are considered as copying operations.

    Note on performance: thee data flow analysis (SourceValue / SourceInterpreter, provided by ASM) is roughly 2-3x slower than a simple analysis (like BasicValue). The reason is that the merge function (merging producer sets) is more complex than merging simple basic values. See also the doc comment in the package object analysis.

  15. case class UninitializedLocalProducer(local: Int) extends InitialProducer with Product with Serializable

    Permalink

Inherited from AnyRef

Inherited from Any

Ungrouped