object RaykarMulti

Type Members

case class AnnotationWithClassProb(example: Long, clas: Int, prob: Double, annotator: Long, annotation: Int) extends Product with Serializable

Dataset with both annotations and class probability estimated in the previous step, for obtaining the soft frequency matrices.

Version
0.1
class AnnotationsLikelihoodAggregator extends Aggregator[EStepEstimationPoint, (Double, Double), Double]

Obtains the likelihood for each example given a class (grouping keys)

Version
0.1
case class AnnotationsWithLogisticPrediction(example: Long, clas: Int, prediction: Double, annotator: Long, annotation: Int) extends Product with Serializable

Annotations with logistic prediction information for EStep

Version
0.1
case class AnnotatorClassCombination(annotator: Long, clas: Int, k: Int) extends Product with Serializable

Combinations of annotator and classes, used so that the frequency calculation takes every combination into account.

Version
0.1
case class AnnotatorClassFrequency(annotator: Long, clas: Int, frequency: Double) extends Product with Serializable

Dataset with the soft frequency of (annotator,c) in the annotations dataset. Represents the denominator in the corresponding element of the precision matrices.

Version
0.1
case class AnnotatorFrequency(annotator: Long, clas: Int, k: Int, frequency: Double) extends Product with Serializable

Dataset with the soft frequency of (annotator,c,k) in the annotations dataset. Represents the numerator in the corresponding element of the precision matrices.

Version
0.1
class ClassFrequencyAggregator extends Aggregator[(AnnotatorClassCombination, AnnotationWithClassProb), Double, Double]

Obtains the soft frequency of appearance of the key (j,c)

Version
0.1
case class EStepEstimationPoint(example: Long, clas: Int, prediction: Double, annotator: Long, annotation: Int, annotationProb: Double) extends Product with Serializable

EStep estimation point with information about annotation probability and the logistic prediction.

Version
0.1
class FrequencyAggregator extends Aggregator[(AnnotatorClassCombination, AnnotationWithClassProb), Double, Double]

Obtains the soft frequency of appearance of the key (j,c,k)

Version
0.1
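The soft frequencies described above can be illustrated with plain Scala collections. This is a minimal sketch, not the library's Spark implementation: it assumes the soft frequency of a key is the sum of `prob` over the matching rows, it ignores the Dirichlet priors, and the object name `SoftFrequencySketch` and its method names are hypothetical.

```scala
object SoftFrequencySketch {
  // Mirrors the AnnotationWithClassProb case class documented above.
  case class AnnotationWithClassProb(example: Long, clas: Int, prob: Double,
                                     annotator: Long, annotation: Int)

  // Numerator: soft frequency of the key (annotator j, class c, annotation k).
  def softFrequency(rows: Seq[AnnotationWithClassProb]): Map[(Long, Int, Int), Double] =
    rows.groupBy(r => (r.annotator, r.clas, r.annotation))
      .map { case (key, rs) => key -> rs.map(_.prob).sum }

  // Denominator: soft frequency of the key (annotator j, class c).
  def softClassFrequency(rows: Seq[AnnotationWithClassProb]): Map[(Long, Int), Double] =
    rows.groupBy(r => (r.annotator, r.clas))
      .map { case (key, rs) => key -> rs.map(_.prob).sum }

  // Element (j, c, k) of the precision matrices as numerator / denominator.
  // Assumption: no prior is added; a zero denominator yields 0.0.
  def precisions(rows: Seq[AnnotationWithClassProb]): Map[(Long, Int, Int), Double] = {
    val num = softFrequency(rows)
    val den = softClassFrequency(rows)
    num.map { case ((j, c, k), f) =>
      val d = den((j, c))
      (j, c, k) -> (if (d > 0.0) f / d else 0.0)
    }
  }
}
```

For a fixed annotator and class, the resulting values sum to 1 over the annotations k that appear in the data.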
case class LikelihoodPoint(example: Long, clas: Int, mu: Double, annotationsLikelihood: Double) extends Product with Serializable

Likelihood estimation point with the annotation likelihood as well as the true class estimation from the E step.

Version
0.1
case class LogisticAnnotatorParams(a: Double, b: Double) extends Product with Serializable

Logistic Annotator params for the LogisticParams aggregator

Version
0.1
case class LogisticMultiPrediction(example: Long, clas: Int, prob: Double) extends Product with Serializable

Logistic prediction for the full multiclass problem

Version
0.1
class LogisticParamAggregator extends Aggregator[(MulticlassAnnotation, DiscreteAnnotatorPrecision), LogisticAnnotatorParams, LogisticAnnotatorParams]

Aggregates the logistic annotator parameters (a, b) from annotations paired with annotator precisions.

Version
0.1
case class LogisticParams(example: Long, a: Double, b: Double) extends Product with Serializable

Aggregation of annotator parameters for each example in the one vs all approach for logistic regression.

Version
0.1
case class LogisticPrediction(example: Long, prob: Double) extends Product with Serializable

Logistic prediction for the one vs all approach

Version
0.1
case class MuWithLogisticParams(example: Long, mu: Double, a: Double, b: Double) extends Product with Serializable

Mu estimate with logistic params for the example

Version
0.1
case class Normalizer(example: Long, norm: Double) extends Product with Serializable

Normalizer for logistic predictions

Version
0.1
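One plausible reading of the normalizer is that the one vs all logistic predictions of an example are divided by their sum so that they form a probability distribution over classes. The sketch below illustrates that reading with plain collections; the assumption that `norm` is the per-example sum, and the names `NormalizerSketch`, `normalizers` and `normalize`, are not taken from the library.

```scala
object NormalizerSketch {
  // Mirror the case classes documented above.
  case class LogisticMultiPrediction(example: Long, clas: Int, prob: Double)
  case class Normalizer(example: Long, norm: Double)

  // Assumption: the normalizer of an example is the sum of its class predictions.
  def normalizers(preds: Seq[LogisticMultiPrediction]): Seq[Normalizer] =
    preds.groupBy(_.example)
      .map { case (example, ps) => Normalizer(example, ps.map(_.prob).sum) }
      .toSeq

  // Divides every prediction by the normalizer of its example.
  def normalize(preds: Seq[LogisticMultiPrediction]): Seq[LogisticMultiPrediction] = {
    val normByExample = normalizers(preds).map(n => n.example -> n.norm).toMap
    preds.map(p => p.copy(prob = p.prob / normByExample(p.example)))
  }
}
```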
class RaykarMultiGradient extends Gradient

Computes the gradient for the SGD algorithm
case class RaykarMultiPartialModel(dataset: DataFrame, annotations: Dataset[MulticlassAnnotation], mu: Dataset[MulticlassSoftProb], annotatorPrecision: Dataset[DiscreteAnnotatorPrecision], logisticWeights: Array[Array[Double]], logisticPrediction: Dataset[LogisticMultiPrediction], annotationsLikelihood: Dataset[MulticlassSoftProb], annotatorPriorMatrix: Broadcast[Array[Array[Array[Double]]]], weightsPriorMatrix: Array[Broadcast[Array[Array[Double]]]], likelihood: Double, improvement: Double, annotatorClassCombination: Dataset[AnnotatorClassCombination], nFeatures: Int, nClasses: Int, nAnnotators: Int) extends Product with Serializable

Partial model object that carries the data from one step of the algorithm to the next.

Version
0.1
class RaykarMultiUpdater extends Updater

Computes the updater for the SGD algorithm. Adds the regularization priors.

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def apply(dataset: DataFrame, annDataset: Dataset[MulticlassAnnotation], eMIters: Int = 3, eMThreshold: Double = 0.001, gradIters: Int = 100, gradThreshold: Double = 0.1, gradLearning: Double = 0.1, k_prior: Option[Array[Array[Array[Double]]]] = None, w_prior: Option[Array[Array[Array[Double]]]] = None): RaykarMultiModel

Applies the learning algorithm
dataset
the dataset with feature vectors.
annDataset
the dataset with the annotations.
gradIters
maximum number of iterations for the GradientDescent algorithm
gradThreshold
threshold for the log likelihood variability for the gradient descent algorithm
gradLearning
learning rate for the gradient descent algorithm
k_prior
prior (Dirichlet distribution hyperparameters) for the estimation of the probability that an annotator annotates a class given another
w_prior
prior for the weights of the logistic regression model
returns
com.enriquegrodrigo.spark.crowd.types.RaykarMultiModel

Version
0.1
final def asInstanceOf[T0]: T0

Definition Classes
Any
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
def computePointLoss(mui: Double, pi: Double, ai: Double, bi: Double): Double

Computes the negative likelihood of a point (loss)
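As an illustration of what a per-point negative likelihood can look like, the sketch below uses the expected complete-data log-likelihood of a binary (one vs all) Raykar-style model. This form is an assumption, not confirmed by this page: `mui` is read as the current estimate that the true class is positive, `pi` as the logistic prediction, and `ai` and `bi` as the likelihood of the observed annotations under the positive and negative class. The helper `prodLog` is hypothetical.

```scala
object PointLossSketch {
  // x * log(y), with the convention that 0 * log(0) = 0.
  private def prodLog(x: Double, y: Double): Double =
    if (x == 0.0) 0.0 else x * math.log(y)

  // Assumed form: -( mu * log(a * p) + (1 - mu) * log(b * (1 - p)) ).
  def computePointLoss(mui: Double, pi: Double, ai: Double, bi: Double): Double =
    -(prodLog(mui, ai * pi) + prodLog(1.0 - mui, bi * (1.0 - pi)))
}
```

The loss is zero when the model is certain and agrees with the estimate (for example `mui = 1`, `pi = 1`, `ai = 1`) and grows as the prediction or the annotation likelihood moves away from it.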
def computeSigmoid(x: Array[Double], w: Array[Double]): Double

Computes the logistic function for a data point
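A minimal sketch of the logistic function applied to a data point, assuming it is the sigmoid of the dot product of features and weights. Whether the library handles an intercept term separately is not stated here, so the sketch treats `x` and `w` as vectors of equal length.

```scala
object SigmoidSketch {
  // sigmoid(w . x) = 1 / (1 + exp(-(w . x)))
  def computeSigmoid(x: Array[Double], w: Array[Double]): Double = {
    require(x.length == w.length, "features and weights must have the same length")
    val z = x.zip(w).map { case (xi, wi) => xi * wi }.sum
    1.0 / (1.0 + math.exp(-z))
  }
}
```

A dot product of zero gives 0.5; positive dot products give values above 0.5 and negative ones give values below it.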
def eStep(model: RaykarMultiPartialModel): RaykarMultiPartialModel

E Step of the EM algorithm.

Version
0.1
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
def hashCode(): Int

Definition Classes
AnyRef → Any
def initialization(dataset: DataFrame, annotatorData: Dataset[MulticlassAnnotation], k_prior: Option[Array[Array[Array[Double]]]], w_prior: Option[Array[Array[Array[Double]]]]): RaykarMultiPartialModel

Initializes the parameters. The first ground truth estimation is done using the majority voting algorithm.

Version
0.1
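The majority voting initialization can be pictured as follows: for each example, the initial estimate for a class is the fraction of annotators that chose it. This is a sketch under assumptions. The field names of `MulticlassAnnotation` and `MulticlassSoftProb` are not shown on this page and are guessed here, and the function name `softMajorityVoting` is hypothetical.

```scala
object MajorityVotingSketch {
  // Assumed shapes; the real case classes live elsewhere in the library.
  case class MulticlassAnnotation(example: Long, annotator: Long, value: Int)
  case class MulticlassSoftProb(example: Long, clas: Int, prob: Double)

  // For each example and class, the fraction of its annotations equal to that class.
  def softMajorityVoting(anns: Seq[MulticlassAnnotation],
                         nClasses: Int): Seq[MulticlassSoftProb] =
    anns.groupBy(_.example).toSeq.sortBy(_._1).flatMap { case (example, as) =>
      (0 until nClasses).map { c =>
        MulticlassSoftProb(example, c, as.count(_.value == c).toDouble / as.size)
      }
    }
}
```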
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
def logLikelihood(model: RaykarMultiPartialModel): RaykarMultiPartialModel

Obtains the likelihood of the partial model.

Version
0.1
def mStep(model: RaykarMultiPartialModel, gradIters: Int, gradThreshold: Double, gradLearning: Double): RaykarMultiPartialModel

M Step of the EM algorithm.

Version
0.1
def matMult(mat: Array[Array[Double]], v: Array[Double]): Array[Double]

Matrix-vector multiplication (TODO: improve using libraries)
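Given the signature, the operation multiplies a matrix stored as an array of rows by a vector. A minimal sketch with plain arrays, assuming every row has the same length as `v`:

```scala
object MatMultSketch {
  // Each output entry is the dot product of one matrix row with the vector.
  def matMult(mat: Array[Array[Double]], v: Array[Double]): Array[Double] =
    mat.map { row =>
      require(row.length == v.length, "row length must match vector length")
      row.zip(v).map { case (a, b) => a * b }.sum
    }
}
```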
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
def step(gradIters: Int, gradThreshold: Double, gradLearning: Double)(model: RaykarMultiPartialModel, i: Int): RaykarMultiPartialModel

Step of the iterative algorithm

Version
0.1
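The curried shape of `step`, with the settings in the first parameter list and `(model, i)` in the second, fits a `foldLeft` over the iteration indices. The toy sketch below shows that pattern only. `ToyModel`, the update rule and the convergence check are invented for illustration, and the library's actual loop may differ.

```scala
object EmLoopSketch {
  // Toy stand-in for RaykarMultiPartialModel: only the fields the loop needs.
  case class ToyModel(likelihood: Double, improvement: Double, iterations: Int)

  // Curried like the documented step: settings first, then (model, iteration index).
  def step(rate: Double, threshold: Double)(model: ToyModel, i: Int): ToyModel =
    if (math.abs(model.improvement) <= threshold) model // converged: pass through unchanged
    else {
      // Toy update: move the likelihood a fraction of the way towards 0.
      val next = model.likelihood - rate * model.likelihood
      ToyModel(next, next - model.likelihood, i)
    }

  // Folds the partially applied step over the iteration indices 1..iters.
  def run(init: ToyModel, iters: Int, rate: Double, threshold: Double): ToyModel =
    (1 to iters).foldLeft(init)(step(rate, threshold))
}
```

Because the fold always visits every index, early stopping is modelled by returning the model unchanged once the improvement falls within the threshold.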
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def toString(): String

Definition Classes
AnyRef → Any
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
