com.twitter.scalding.source

DailyPrefixSuffixMostRecentSource

abstract class DailyPrefixSuffixMostRecentSource extends MostRecentGoodSource

Linear Supertypes
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. DailyPrefixSuffixMostRecentSource
  2. MostRecentGoodSource
  3. TimePathedSource
  4. TimeSeqPathedSource
  5. FileSource
  6. HfsTapProvider
  7. LocalSourceOverride
  8. SchemedSource
  9. Source
  10. Serializable
  11. AnyRef
  12. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new DailyPrefixSuffixMostRecentSource(prefixTemplate: String, suffixTemplate: String, dateRange: DateRange)

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. def allPaths: Iterable[String]

    These are all the paths we will read for this data completely enumerated

    These are all the paths we will read for this data completely enumerated

    Definition Classes
    TimeSeqPathedSource
  7. def allPathsFor(pattern: String): Iterable[String]

    Attributes
    protected
    Definition Classes
    TimeSeqPathedSource
  8. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  9. def checkFlowDefNotNull()(implicit flowDef: FlowDef, mode: Mode): Unit

    Attributes
    protected
    Definition Classes
    Source
  10. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  11. def createHdfsReadTap(hdfsMode: Hdfs): Tap[JobConf, _, _]

    Attributes
    protected
    Definition Classes
    FileSource
  12. def createHfsTap(scheme: Scheme[JobConf, RecordReader[_, _], OutputCollector[_, _], _, _], path: String, sinkMode: SinkMode): Hfs

    Definition Classes
    HfsTapProvider
  13. def createLocalTap(sinkMode: SinkMode): Tap[JobConf, _, _]

    Creates a local tap.

    Creates a local tap.

    sinkMode

    The mode for handling output conflicts.

    Definition Classes
    LocalSourceOverride
  14. def createTap(readOrWrite: AccessMode)(implicit mode: Mode): Tap[_, _, _]

    Subclasses of Source MUST override this method.

    Subclasses of Source MUST override this method. They may call out to TestTapFactory for making Taps suitable for testing.

    Definition Classes
    FileSourceSource
  15. def defaultDurationFor(pattern: String): Option[Duration]

    Override this if you have for instance an hourly pattern but want to run every 6 hours.

    Override this if you have for instance an hourly pattern but want to run every 6 hours. By default, we call TimePathedSource.stepSize(pattern, tz)

    Attributes
    protected
    Definition Classes
    TimeSeqPathedSource
  16. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  17. def equals(that: Any): Boolean

    Definition Classes
    TimeSeqPathedSource → AnyRef → Any
  18. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  19. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  20. def getPathStatuses(conf: Configuration): Iterable[(String, Boolean)]

    Get path statuses based on daterange.

    Get path statuses based on daterange. This tests each path with pathIsGood (which by default checks that there is at least on file in that directory)

    Definition Classes
    TimeSeqPathedSource
  21. def goodHdfsPaths(hdfsMode: Hdfs): Iterable[String]

    Attributes
    protected
    Definition Classes
    MostRecentGoodSourceFileSource
  22. def hashCode(): Int

    Definition Classes
    TimeSeqPathedSource → AnyRef → Any
  23. def hdfsPaths: Iterable[String]

    Definition Classes
    TimeSeqPathedSourceFileSource
  24. def hdfsReadPathsAreGood(conf: Configuration): Boolean

  25. def hdfsScheme: Scheme[JobConf, RecordReader[_, _], OutputCollector[_, _], _, _]

    The scheme to use if the source is on hdfs.

    The scheme to use if the source is on hdfs.

    Definition Classes
    SchemedSource
  26. def hdfsWritePath: String

    Definition Classes
    TimePathedSourceFileSource
  27. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  28. def localPaths: Iterable[String]

    A path to use for the local tap.

    A path to use for the local tap.

    Definition Classes
    TimePathedSourceLocalSourceOverride
  29. def localScheme: Scheme[Properties, InputStream, OutputStream, _, _]

    The scheme to use if the source is local.

    The scheme to use if the source is local.

    Definition Classes
    SchemedSource
  30. def localWritePath: String

    Definition Classes
    LocalSourceOverride
  31. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  32. final def notify(): Unit

    Definition Classes
    AnyRef
  33. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  34. def pathIsGood(p: String, conf: Configuration): Boolean

    Determines if a path is 'valid' for this source.

    Determines if a path is 'valid' for this source. In strict mode all paths must be valid. In non-strict mode, all invalid paths will be filtered out.

    Subclasses can override this to validate paths.

    The default implementation is a quick sanity check to look for missing or empty directories. It is necessary but not sufficient -- there are cases where this will return true but there is in fact missing data.

    TODO: consider writing a more in-depth version of this method in TimePathedSource that looks for TODO: missing days / hours etc.

    Attributes
    protected
    Definition Classes
    FileSource
  35. val pattern: String

    Definition Classes
    TimePathedSource
  36. val patterns: Seq[String]

    Definition Classes
    TimeSeqPathedSource
  37. def read(implicit flowDef: FlowDef, mode: Mode): Pipe

    Definition Classes
    Source
  38. val sinkMode: SinkMode

    Definition Classes
    SchemedSource
  39. def sourceId: String

    This is a name the refers to this exact instance of the source (put another way, if s1.

    This is a name the refers to this exact instance of the source (put another way, if s1.sourceId == s2.sourceId, the job should work the same if one is replaced with the other

    Definition Classes
    Source
  40. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  41. def toString(): String

    Definition Classes
    MostRecentGoodSourceTimeSeqPathedSource → AnyRef → Any
  42. def transformForRead(pipe: Pipe): Pipe

    Attributes
    protected
    Definition Classes
    Source
  43. def transformForWrite(pipe: Pipe): Pipe

    Attributes
    protected
    Definition Classes
    Source
  44. def transformInTest: Boolean

    The mock passed in to scalding.

    The mock passed in to scalding.JobTest may be considered as a mock of the Tap or the Source. By default, as of 0.9.0, it is considered as a Mock of the Source. If you set this to true, the mock in TestMode will be considered to be a mock of the Tap (which must be transformed) and not the Source.

    Definition Classes
    Source
  45. val tz: TimeZone

    Definition Classes
    TimeSeqPathedSource
  46. def validateTaps(mode: Mode): Unit

    Definition Classes
    FileSourceSource
  47. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  48. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  49. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  50. def writeFrom(pipe: Pipe)(implicit flowDef: FlowDef, mode: Mode): Pipe

    write the pipe but return the input so it can be chained into the next operation

    write the pipe but return the input so it can be chained into the next operation

    Definition Classes
    Source

Deprecated Value Members

  1. def readAtSubmitter[T](implicit mode: Mode, conv: TupleConverter[T]): Stream[T]

    Definition Classes
    Source
    Annotations
    @deprecated
    Deprecated

    (Since version 0.9.0) replace with Mappable.toIterator

Inherited from MostRecentGoodSource

Inherited from TimePathedSource

Inherited from TimeSeqPathedSource

Inherited from FileSource

Inherited from HfsTapProvider

Inherited from LocalSourceOverride

Inherited from SchemedSource

Inherited from Source

Inherited from Serializable

Inherited from AnyRef

Inherited from Any

Ungrouped