com.twitter.scalding.examples

WeightedPageRankFromMatrix

class WeightedPageRankFromMatrix extends Job

A weighted PageRank implementation using the Scalding Matrix API. This assumes that all rows and columns are of type Int and values or egde weights are Double. If you want an unweighted PageRank, simply set the weights on the edges to 1.

Input arguments:

d -- damping factor n -- number of nodes in the graph currentIteration -- start with 0 probably maxIterations -- stop after n iterations convergenceThreshold -- using the sum of the absolute difference between iteration solutions, iterating stops once we reach this threshold rootDir -- the root directory holding all starting, intermediate and final data/output

The expected structure of the rootDir is:

rootDir |- iterations | |- 0 <-- a TSV of (row, value) of size n, value can be 1/n (generate this) | |- n <-- holds future iterations/solutions |- edges <-- a TSV of (row, column, value) for edges in the graph |- onesVector <-- a TSV of (row, 1) of size n (generate this) |- diff <-- a single line representing the difference between the last iterations |- constants <-- built at iteration 0, these are constant for any given matrix/graph |- M_hat |- priorVector

Don't forget to set the number of reducers for this job: -D mapred.reduce.tasks=n

Linear Supertypes
Job, Serializable, FieldConversions, LowPriorityFieldConversions, AnyRef, Any
Ordering
  1. Alphabetic
  2. By inheritance
Inherited
  1. WeightedPageRankFromMatrix
  2. Job
  3. Serializable
  4. FieldConversions
  5. LowPriorityFieldConversions
  6. AnyRef
  7. Any
  1. Hide All
  2. Show all
Learn more about member selection
Visibility
  1. Public
  2. All

Instance Constructors

  1. new WeightedPageRankFromMatrix(args: Args)

Value Members

  1. final def !=(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  2. final def !=(arg0: Any): Boolean

    Definition Classes
    Any
  3. final def ##(): Int

    Definition Classes
    AnyRef → Any
  4. final def ==(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  5. final def ==(arg0: Any): Boolean

    Definition Classes
    Any
  6. def M_hat: Matrix[Int, Int, Double]

    Load or generate on first iteration the matrix M^ given A.

  7. implicit def _implicitJobArgs: Args

    Attributes
    protected
    Definition Classes
    Job
  8. def anyToFieldArg(f: Any): Comparable[_]

    Attributes
    protected
    Definition Classes
    LowPriorityFieldConversions
  9. final def asInstanceOf[T0]: T0

    Definition Classes
    Any
  10. def asList(f: Fields): List[Comparable[_]]

    Definition Classes
    FieldConversions
  11. def asSet(f: Fields): Set[Comparable[_]]

    Definition Classes
    FieldConversions
  12. def buildFlow: Flow[_]

    Definition Classes
    Job
  13. def classIdentifier: String

    Definition Classes
    Job
  14. def clear(): Unit

    Definition Classes
    Job
  15. def clone(nextargs: Args): Job

    Definition Classes
    Job
  16. def clone(): AnyRef

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  17. def colVectorFromTsv(input: String): ColVector[Int, Double]

  18. def config: Map[AnyRef, AnyRef]

    Definition Classes
    Job
  19. val convergenceThreshold: Double

  20. val currentIteration: Int

  21. val d: Double

  22. implicit def dateParser: DateParser

    Definition Classes
    Job
  23. def defaultComparator: Option[Class[_ <: Comparator[_]]]

    Definition Classes
    Job
  24. def defaultMode(fromFields: Fields, toFields: Fields): Fields

    Definition Classes
    FieldConversions
  25. def defaultSpillThreshold: Int

    Definition Classes
    Job
  26. val diffLoc: String

  27. val edgesLoc: String

  28. final def ensureUniqueFields(left: Fields, right: Fields, rightPipe: Pipe): (Fields, Pipe)

    Definition Classes
    FieldConversions
  29. implicit def enumValueToFields(x: Value): Fields

    Definition Classes
    FieldConversions
  30. final def eq(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  31. def equals(arg0: Any): Boolean

    Definition Classes
    AnyRef → Any
  32. implicit def fieldFields[T <: TraversableOnce[Field[_]]](f: T): RichFields

    Definition Classes
    FieldConversions
  33. implicit def fieldToFields(f: Field[_]): RichFields

    Definition Classes
    FieldConversions
  34. implicit def fields[T <: TraversableOnce[Symbol]](f: T): Fields

    Definition Classes
    FieldConversions
  35. implicit def fieldsToRichFields(fields: Fields): RichFields

    Definition Classes
    FieldConversions
  36. def finalize(): Unit

    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  37. implicit val flowDef: FlowDef

    Attributes
    protected
    Definition Classes
    Job
  38. implicit def fromEnum[T <: Enumeration](enumeration: T): Fields

    Definition Classes
    FieldConversions
  39. final def getClass(): Class[_]

    Definition Classes
    AnyRef → Any
  40. def getField(f: Fields, idx: Int): Fields

    Definition Classes
    FieldConversions
  41. def handleStats(statsData: CascadingStats): Unit

    Attributes
    protected
    Definition Classes
    Job
  42. def hasInts(f: Fields): Boolean

    Definition Classes
    FieldConversions
  43. def hashCode(): Int

    Definition Classes
    AnyRef → Any
  44. implicit def intFields[T <: TraversableOnce[Int]](f: T): Fields

    Definition Classes
    FieldConversions
  45. implicit def intToFields(x: Int): Fields

    Definition Classes
    FieldConversions
  46. implicit def integerToFields(x: Integer): Fields

    Definition Classes
    FieldConversions
  47. def ioSerializations: List[Class[_ <: Serialization[_]]]

    Definition Classes
    Job
  48. final def isInstanceOf[T0]: Boolean

    Definition Classes
    Any
  49. implicit def iterableToRichPipe[T](iter: Iterable[T])(implicit set: TupleSetter[T], conv: TupleConverter[T]): RichPipe

    Definition Classes
    Job
  50. val iterationsDir: String

  51. def keepAlive(): Unit

    Definition Classes
    Job
  52. def listeners: List[FlowListener]

    Definition Classes
    Job
  53. def matrixFromTsv(input: String): Matrix[Int, Int, Double]

  54. val maxIterations: Int

  55. def measureConvergenceAndStore(): Unit

    Measure convergence by calculating the total of the absolute difference between the previous and next vectors.

    Measure convergence by calculating the total of the absolute difference between the previous and next vectors. This stores the result after calculation.

  56. implicit def mode: Mode

    Definition Classes
    Job
  57. val n: Int

  58. def name: String

    Definition Classes
    Job
  59. final def ne(arg0: AnyRef): Boolean

    Definition Classes
    AnyRef
  60. final def newSymbol(avoid: Set[Symbol], guess: Symbol, trial: Int): Symbol

    Definition Classes
    FieldConversions
    Annotations
    @tailrec()
  61. def next: Option[Job]

    Recurse and iterate again iff we are under the max number of iterations and vector has not converged.

    Recurse and iterate again iff we are under the max number of iterations and vector has not converged.

    Definition Classes
    WeightedPageRankFromMatrix → Job
  62. val nextVector: ColVector[Int, Double]

  63. val nextVectorLoc: String

  64. final def notify(): Unit

    Definition Classes
    AnyRef
  65. final def notifyAll(): Unit

    Definition Classes
    AnyRef
  66. val onesVectorLoc: String

  67. implicit def parseAnySeqToFields[T <: TraversableOnce[Any]](anyf: T): Fields

    Definition Classes
    FieldConversions
  68. implicit def pipeToRichPipe(pipe: Pipe): RichPipe

    Definition Classes
    Job
  69. val previousVector: ColVector[Int, Double]

  70. val previousVectorLoc: String

  71. def priorVector: ColVector[Int, Double]

    Load or generate on first iteration the prior vector given d and n.

  72. implicit def productToFields(f: Product): Fields

    Definition Classes
    LowPriorityFieldConversions
  73. implicit def read(src: Source): Pipe

    Definition Classes
    Job
  74. val rootDir: String

  75. def run: Boolean

    Definition Classes
    Job
  76. implicit def scaldingConfig: Config

    Attributes
    protected
    Definition Classes
    Job
  77. def skipStrategy: Option[FlowSkipStrategy]

    Definition Classes
    Job
  78. implicit def sourceToRichPipe(src: Source): RichPipe

    Definition Classes
    Job
  79. def stepListeners: List[FlowStepListener]

    Definition Classes
    Job
  80. def stepStrategy: Option[FlowStepStrategy[_]]

    Definition Classes
    Job
  81. implicit def strFields[T <: TraversableOnce[String]](f: T): Fields

    Definition Classes
    FieldConversions
  82. implicit def stringToFields(x: String): Fields

    Definition Classes
    FieldConversions
  83. implicit def symbolToFields(x: Symbol): Fields

    Definition Classes
    FieldConversions
  84. final def synchronized[T0](arg0: ⇒ T0): T0

    Definition Classes
    AnyRef
  85. def timeout[T](timeout: AbsoluteDuration)(t: ⇒ T): Option[T]

    Definition Classes
    Job
  86. implicit def toPipe[T](iter: Iterable[T])(implicit set: TupleSetter[T], conv: TupleConverter[T]): Pipe

    Definition Classes
    Job
  87. def toString(): String

    Definition Classes
    AnyRef → Any
  88. implicit def tuple2ToFieldsPair[T, U](pair: (T, U))(implicit tf: (T) ⇒ Fields, uf: (U) ⇒ Fields): (Fields, Fields)

    Definition Classes
    FieldConversions
  89. implicit def unitToFields(u: Unit): Fields

    Definition Classes
    FieldConversions
  90. def validate(): Unit

    Definition Classes
    Job
  91. final def wait(): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  92. final def wait(arg0: Long, arg1: Int): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  93. final def wait(arg0: Long): Unit

    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  94. def write(pipe: Pipe, src: Source): Unit

    Definition Classes
    Job

Inherited from Job

Inherited from Serializable

Inherited from FieldConversions

Inherited from LowPriorityFieldConversions

Inherited from AnyRef

Inherited from Any

Ungrouped