Class

com.twitter.scalding.examples

WeightedPageRankFromMatrix

Related Doc: package examples

Permalink

class WeightedPageRankFromMatrix extends Job

A weighted PageRank implementation using the Scalding Matrix API. This assumes that all rows and columns are of type Int and values or egde weights are Double. If you want an unweighted PageRank, simply set the weights on the edges to 1.

Input arguments:

d -- damping factor n -- number of nodes in the graph currentIteration -- start with 0 probably maxIterations -- stop after n iterations convergenceThreshold -- using the sum of the absolute difference between iteration solutions, iterating stops once we reach this threshold rootDir -- the root directory holding all starting, intermediate and final data/output

The expected structure of the rootDir is:

rootDir |- iterations | |- 0 <-- a TSV of (row, value) of size n, value can be 1/n (generate this) | |- n <-- holds future iterations/solutions |- edges <-- a TSV of (row, column, value) for edges in the graph |- onesVector <-- a TSV of (row, 1) of size n (generate this) |- diff <-- a single line representing the difference between the last iterations |- constants <-- built at iteration 0, these are constant for any given matrix/graph |- M_hat |- priorVector

Don't forget to set the number of reducers for this job: -D mapred.reduce.tasks=n

Linear Supertypes
Job, Serializable, FieldConversions, LowPriorityFieldConversions, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. WeightedPageRankFromMatrix
  2. Job
  3. Serializable
  4. FieldConversions
  5. LowPriorityFieldConversions
  6. AnyRef
  7. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new WeightedPageRankFromMatrix(args: Args)

    Permalink

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. def M_hat: Matrix[Int, Int, Double]

    Permalink

    Load or generate on first iteration the matrix M^ given A.

  5. implicit def _implicitJobArgs: Args

    Permalink
    Attributes
    protected
    Definition Classes
    Job
  6. def anyToFieldArg(f: Any): Comparable[_]

    Permalink
    Attributes
    protected
    Definition Classes
    LowPriorityFieldConversions
  7. val args: Args

    Permalink
    Definition Classes
    Job
  8. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  9. def asList(f: Fields): List[Comparable[_]]

    Permalink
    Definition Classes
    FieldConversions
  10. def asSet(f: Fields): Set[Comparable[_]]

    Permalink
    Definition Classes
    FieldConversions
  11. def buildFlow: Flow[_]

    Permalink
    Definition Classes
    Job
  12. def classIdentifier: String

    Permalink
    Definition Classes
    Job
  13. def clear(): Unit

    Permalink
    Definition Classes
    Job
  14. def clone(nextargs: Args): Job

    Permalink
    Definition Classes
    Job
  15. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  16. def colVectorFromTsv(input: String): ColVector[Int, Double]

    Permalink
  17. def config: Map[AnyRef, AnyRef]

    Permalink
    Definition Classes
    Job
  18. val convergenceThreshold: Double

    Permalink
  19. val currentIteration: Int

    Permalink
  20. val d: Double

    Permalink
  21. implicit def dateParser: DateParser

    Permalink
    Definition Classes
    Job
  22. def defaultComparator: Option[Class[_ <: Comparator[_]]]

    Permalink
    Definition Classes
    Job
  23. def defaultMode(fromFields: Fields, toFields: Fields): Fields

    Permalink
    Definition Classes
    FieldConversions
  24. def defaultSpillThreshold: Int

    Permalink
    Definition Classes
    Job
  25. val diffLoc: String

    Permalink
  26. val edgesLoc: String

    Permalink
  27. final def ensureUniqueFields(left: Fields, right: Fields, rightPipe: Pipe): (Fields, Pipe)

    Permalink
    Definition Classes
    FieldConversions
  28. implicit def enumValueToFields(x: Value): Fields

    Permalink
    Definition Classes
    FieldConversions
  29. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  30. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  31. implicit def fieldFields[T <: TraversableOnce[Field[_]]](f: T): RichFields

    Permalink
    Definition Classes
    FieldConversions
  32. implicit def fieldToFields(f: Field[_]): RichFields

    Permalink
    Definition Classes
    FieldConversions
  33. implicit def fields[T <: TraversableOnce[Symbol]](f: T): Fields

    Permalink
    Definition Classes
    FieldConversions
  34. implicit def fieldsToRichFields(fields: Fields): RichFields

    Permalink
    Definition Classes
    FieldConversions
  35. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  36. implicit val flowDef: FlowDef

    Permalink
    Attributes
    protected
    Definition Classes
    Job
  37. implicit def fromEnum[T <: Enumeration](enumeration: T): Fields

    Permalink
    Definition Classes
    FieldConversions
  38. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  39. def getField(f: Fields, idx: Int): Fields

    Permalink
    Definition Classes
    FieldConversions
  40. def handleStats(statsData: CascadingStats): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Job
  41. def hasInts(f: Fields): Boolean

    Permalink
    Definition Classes
    FieldConversions
  42. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  43. implicit def intFields[T <: TraversableOnce[Int]](f: T): Fields

    Permalink
    Definition Classes
    FieldConversions
  44. implicit def intToFields(x: Int): Fields

    Permalink
    Definition Classes
    FieldConversions
  45. implicit def integerToFields(x: Integer): Fields

    Permalink
    Definition Classes
    FieldConversions
  46. def ioSerializations: List[Class[_ <: Serialization[_]]]

    Permalink
    Definition Classes
    Job
  47. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  48. implicit def iterableToRichPipe[T](iter: Iterable[T])(implicit set: TupleSetter[T], conv: TupleConverter[T]): RichPipe

    Permalink
    Definition Classes
    Job
  49. val iterationsDir: String

    Permalink
  50. def keepAlive(): Unit

    Permalink
    Definition Classes
    Job
  51. def listeners: List[FlowListener]

    Permalink
    Definition Classes
    Job
  52. def matrixFromTsv(input: String): Matrix[Int, Int, Double]

    Permalink
  53. val maxIterations: Int

    Permalink
  54. def measureConvergenceAndStore(): Unit

    Permalink

    Measure convergence by calculating the total of the absolute difference between the previous and next vectors.

    Measure convergence by calculating the total of the absolute difference between the previous and next vectors. This stores the result after calculation.

  55. implicit def mode: Mode

    Permalink
    Definition Classes
    Job
  56. val n: Int

    Permalink
  57. def name: String

    Permalink
    Definition Classes
    Job
  58. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  59. final def newSymbol(avoid: Set[Symbol], guess: Symbol, trial: Int): Symbol

    Permalink
    Definition Classes
    FieldConversions
    Annotations
    @tailrec()
  60. def next: Option[Job]

    Permalink

    Recurse and iterate again iff we are under the max number of iterations and vector has not converged.

    Recurse and iterate again iff we are under the max number of iterations and vector has not converged.

    Definition Classes
    WeightedPageRankFromMatrix → Job
  61. val nextVector: ColVector[Int, Double]

    Permalink
  62. val nextVectorLoc: String

    Permalink
  63. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  64. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  65. val onesVectorLoc: String

    Permalink
  66. implicit def parseAnySeqToFields[T <: TraversableOnce[Any]](anyf: T): Fields

    Permalink
    Definition Classes
    FieldConversions
  67. implicit def pipeToRichPipe(pipe: Pipe): RichPipe

    Permalink
    Definition Classes
    Job
  68. val previousVector: ColVector[Int, Double]

    Permalink
  69. val previousVectorLoc: String

    Permalink
  70. def priorVector: ColVector[Int, Double]

    Permalink

    Load or generate on first iteration the prior vector given d and n.

  71. implicit def productToFields(f: Product): Fields

    Permalink
    Definition Classes
    LowPriorityFieldConversions
  72. implicit def read(src: Source): Pipe

    Permalink
    Definition Classes
    Job
  73. val rootDir: String

    Permalink
  74. def run(): Boolean

    Permalink
    Definition Classes
    Job
  75. implicit def scaldingConfig: Config

    Permalink
    Attributes
    protected
    Definition Classes
    Job
  76. def skipStrategy: Option[FlowSkipStrategy]

    Permalink
    Definition Classes
    Job
  77. implicit def sourceToRichPipe(src: Source): RichPipe

    Permalink
    Definition Classes
    Job
  78. def stepListeners: List[FlowStepListener]

    Permalink
    Definition Classes
    Job
  79. def stepStrategy: Option[FlowStepStrategy[_]]

    Permalink
    Definition Classes
    Job
  80. implicit def strFields[T <: TraversableOnce[String]](f: T): Fields

    Permalink
    Definition Classes
    FieldConversions
  81. implicit def stringToFields(x: String): Fields

    Permalink
    Definition Classes
    FieldConversions
  82. implicit def symbolToFields(x: Symbol): Fields

    Permalink
    Definition Classes
    FieldConversions
  83. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  84. def timeout[T](timeout: AbsoluteDuration)(t: ⇒ T): Option[T]

    Permalink
    Definition Classes
    Job
  85. implicit def toPipe[T](iter: Iterable[T])(implicit set: TupleSetter[T], conv: TupleConverter[T]): Pipe

    Permalink
    Definition Classes
    Job
  86. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  87. implicit def tuple2ToFieldsPair[T, U](pair: (T, U))(implicit tf: (T) ⇒ Fields, uf: (U) ⇒ Fields): (Fields, Fields)

    Permalink
    Definition Classes
    FieldConversions
  88. implicit def unitToFields(u: Unit): Fields

    Permalink
    Definition Classes
    FieldConversions
  89. def validate(): Unit

    Permalink
    Definition Classes
    Job
  90. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  91. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  92. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  93. def write(pipe: Pipe, src: Source): Unit

    Permalink
    Definition Classes
    Job

Inherited from Job

Inherited from Serializable

Inherited from FieldConversions

Inherited from LowPriorityFieldConversions

Inherited from AnyRef

Inherited from Any

Ungrouped