com.twitter.scalding.examples

MergeTest

class MergeTest extends Job

This example job does not yet work. It is a test for Kyro serialization

Linear Supertypes
Job, Serializable, FieldConversions, LowPriorityFieldConversions, AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. MergeTest
  2. Job
  3. Serializable
  4. FieldConversions
  5. LowPriorityFieldConversions
  6. AnyRef
  7. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new MergeTest(args: Args)

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. def anyToFieldArg(f: Any): Comparable[_]

    Attributes
    protected
    Definition Classes
    LowPriorityFieldConversions
  7. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  8. def asList(f: Fields): List[Comparable[_]]

    Definition Classes
    FieldConversions
  9. def asSet(f: Fields): Set[Comparable[_]]

    Definition Classes
    FieldConversions
  10. def buildFlow: Flow[_]

    combine the config, flowDef and the Mode to produce a flow

    combine the config, flowDef and the Mode to produce a flow

    Definition Classes
    Job
  11. lazy val classIdentifier: String

    Definition Classes
    Job
  12. def clear: Unit

    Definition Classes
    Job
  13. def clone(nextargs: Args): Job

    Copy this job By default, this uses reflection and the single argument Args constructor

    Copy this job By default, this uses reflection and the single argument Args constructor

    Definition Classes
    Job
  14. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  15. def cmpTup(t1: (String, Long), t2: (String, Long)): Int

  16. def config: Map[AnyRef, AnyRef]

    This is the exact config that is passed to the Cascading FlowConnector.

    This is the exact config that is passed to the Cascading FlowConnector. By default: if there are no spill thresholds in mode.config, we replace with defaultSpillThreshold we overwrite io.serializations with ioSerializations we overwrite cascading.tuple.element.comparator.default to defaultComparator we add some scalding keys for debugging/logging

    Tip: override this method, call super, and ++ your additional map to add or overwrite more options

    Definition Classes
    Job
  17. implicit def dateParser: DateParser

    Override this to control how dates are parsed

    Override this to control how dates are parsed

    Definition Classes
    Job
  18. def defaultComparator: Option[Class[_ <: Comparator[_]]]

    Override this if you want to customize comparisons/hashing for your job the config method overwrites using this before sending to cascading

    Override this if you want to customize comparisons/hashing for your job the config method overwrites using this before sending to cascading

    Definition Classes
    Job
  19. def defaultMode(fromFields: Fields, toFields: Fields): Fields

    Rather than give the full power of cascading's selectors, we have a simpler set of rules encoded below: 1) if the input is non-definite (ALL, GROUP, ARGS, etc.

    Rather than give the full power of cascading's selectors, we have a simpler set of rules encoded below: 1) if the input is non-definite (ALL, GROUP, ARGS, etc...) ALL is the output. Perhaps only fromFields=ALL will make sense 2) If one of from or to is a strict super set of the other, SWAP is used. 3) If they are equal, REPLACE is used. 4) Otherwise, ALL is used.

    Definition Classes
    FieldConversions
  20. def defaultSpillThreshold: Int

    Keep 100k tuples in memory by default before spilling Turn this up as high as you can without getting OOM.

    Keep 100k tuples in memory by default before spilling Turn this up as high as you can without getting OOM.

    This is ignored if there is a value set in the incoming mode.config

    Definition Classes
    Job
  21. final def ensureUniqueFields(left: Fields, right: Fields, rightPipe: Pipe): (Fields, Pipe)

    Definition Classes
    FieldConversions
  22. implicit def enumValueToFields(x: Value): Fields

    Definition Classes
    FieldConversions
  23. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  24. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  25. implicit def fieldFields[T <: TraversableOnce[Field[_]]](f: T): RichFields

    Definition Classes
    FieldConversions
  26. implicit def fieldToFields(f: Field[_]): RichFields

    Definition Classes
    FieldConversions
  27. implicit def fields[T <: TraversableOnce[Symbol]](f: T): Fields

    Definition Classes
    FieldConversions
  28. implicit def fieldsToRichFields(fields: Fields): RichFields

    We can't set the field Manifests because cascading doesn't (yet) expose field type information in the Fields API.

    We can't set the field Manifests because cascading doesn't (yet) expose field type information in the Fields API.

    Definition Classes
    FieldConversions
  29. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  30. implicit val flowDef: FlowDef

    Attributes
    protected
    Definition Classes
    Job
  31. implicit def fromEnum[T <: Enumeration](enumeration: T): Fields

    Multi-entry fields.

    Multi-entry fields. This are higher priority than Product conversions so that List will not conflict with Product.

    Definition Classes
    FieldConversions
  32. def fromInputStream(s: InputStream): Array[Byte]

    Definition Classes
    Job
  33. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  34. def getField(f: Fields, idx: Int): Fields

    Definition Classes
    FieldConversions
  35. def hasInts(f: Fields): Boolean

    Definition Classes
    FieldConversions
  36. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  37. implicit def intFields[T <: TraversableOnce[Int]](f: T): Fields

    Definition Classes
    FieldConversions
  38. implicit def intToFields(x: Int): Fields

    Definition Classes
    FieldConversions
  39. implicit def integerToFields(x: Integer): Fields

    Definition Classes
    FieldConversions
  40. def ioSerializations: List[Class[_ <: Serialization[_]]]

    The exact list of Hadoop serializations passed into the config These replace the config serializations Cascading tuple serialization should be in this list, and probably before any custom code

    The exact list of Hadoop serializations passed into the config These replace the config serializations Cascading tuple serialization should be in this list, and probably before any custom code

    Definition Classes
    Job
  41. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  42. implicit def iterableToRichPipe[T](iter: Iterable[T])(implicit set: TupleSetter[T], conv: TupleConverter[T]): RichPipe

    Definition Classes
    Job
  43. def keepAlive: Unit

    Definition Classes
    Job
  44. def listeners: List[FlowListener]

    Definition Classes
    Job
  45. def md5Hex(bytes: Array[Byte]): String

    Definition Classes
    Job
  46. def mergeSort2[T](v1: List[T], v2: List[T], k: Int, cmp: (T, T) ⇒ Int): List[T]

  47. implicit def mode: Mode

    Definition Classes
    Job
  48. def name: String

    Definition Classes
    Job
  49. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  50. final def newSymbol(avoid: Set[Symbol], guess: Symbol, trial: Int = 0): Symbol

    Definition Classes
    FieldConversions
    Annotations
    @tailrec()
  51. def next: Option[Job]

    Implement this method if you want some other jobs to run after the current job.

    Implement this method if you want some other jobs to run after the current job. These will not execute until the current job has run successfully.

    Definition Classes
    Job
  52. final def notify(): Unit

    Definition Classes
    AnyRef
  53. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  54. implicit def parseAnySeqToFields[T <: TraversableOnce[Any]](anyf: T): Fields

    Useful to convert f : Any* to Fields.

    Useful to convert f : Any* to Fields. This handles mixed cases ("hey", 'you). Not sure we should be this flexible, but given that Cascading will throw an exception before scheduling the job, I guess this is okay.

    Definition Classes
    FieldConversions
  55. implicit def pipeToRichPipe(pipe: Pipe): RichPipe

    you should never call this directly, it is here to make the DSL work.

    you should never call this directly, it is here to make the DSL work. Just know, you can treat a Pipe as a RichPipe within a Job

    Definition Classes
    Job
  56. implicit def productToFields(f: Product): Fields

    Handles treating any TupleN as a Fields object.

    Handles treating any TupleN as a Fields object. This is low priority because List is also a Product, but this method will not work for List (because List is Product2(head, tail) and so productIterator won't work as expected. Lists are handled by an implicit in FieldConversions, which have higher priority.

    Definition Classes
    LowPriorityFieldConversions
  57. implicit def read(src: Source): Pipe

    This is implicit so that a Source can be used as the argument to a join or other method that accepts Pipe.

    This is implicit so that a Source can be used as the argument to a join or other method that accepts Pipe.

    Definition Classes
    Job
  58. def run: Boolean

    Definition Classes
    Job
  59. def runFlow: Flow[_]

    Definition Classes
    Job
  60. def skipStrategy: Option[FlowSkipStrategy]

    Definition Classes
    Job
  61. implicit def sourceToRichPipe(src: Source): RichPipe

    This implicit is to enable RichPipe methods directly on Source objects, such as map/flatMap, etc.

    This implicit is to enable RichPipe methods directly on Source objects, such as map/flatMap, etc...

    Note that Mappable is a subclass of Source, and Mappable already has mapTo and flatMapTo BUT WITHOUT incoming fields used (see the Mappable trait). This creates some confusion when using these methods (this is an unfortuate mistake in our design that was not noticed until later). To remove ambiguity, explicitly call .read on any Source that you begin operating with a mapTo/flatMapTo.

    Definition Classes
    Job
  62. def stepStrategy: Option[FlowStepStrategy[_]]

    Definition Classes
    Job
  63. implicit def strFields[T <: TraversableOnce[String]](f: T): Fields

    Definition Classes
    FieldConversions
  64. implicit def stringToFields(x: String): Fields

    Definition Classes
    FieldConversions
  65. implicit def symbolToFields(x: Symbol): Fields

    '* means Fields.

    '* means Fields.ALL, otherwise we take the .name

    Definition Classes
    FieldConversions
  66. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  67. def timeout[T](timeout: AbsoluteDuration)(t: ⇒ T): Option[T]

    Definition Classes
    Job
  68. def toHexString(bytes: Array[Byte]): String

    Definition Classes
    Job
  69. implicit def toPipe[T](iter: Iterable[T])(implicit set: TupleSetter[T], conv: TupleConverter[T]): Pipe

    Definition Classes
    Job
  70. def toString(): String

    Definition Classes
    AnyRef → Any
  71. implicit def tuple2ToFieldsPair[T, U](pair: (T, U))(implicit tf: (T) ⇒ Fields, uf: (U) ⇒ Fields): (Fields, Fields)

    Definition Classes
    FieldConversions
  72. implicit final val uniqueId: UniqueID

    Definition Classes
    Job
  73. implicit def unitToFields(u: Unit): Fields

    Definition Classes
    FieldConversions
  74. def validate: Unit

    Definition Classes
    Job
  75. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  76. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  77. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  78. def write(pipe: Pipe, src: Source): Unit

    This is only here for Java jobs which cannot automatically access the implicit Pipe => RichPipe which makes: pipe.

    This is only here for Java jobs which cannot automatically access the implicit Pipe => RichPipe which makes: pipe.write( ) convenient

    Definition Classes
    Job

Inherited from Job

Inherited from Serializable

Inherited from FieldConversions

Inherited from AnyRef

Inherited from Any

Ungrouped