Creates a new dataset that keeps only the features in the given set.
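A minimal sketch of the feature-retention idea, not the library's implementation. It assumes each datum is just a list of feature names; the class `RetainFeaturesSketch` and the method `retainFeatures` are hypothetical names introduced for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Hypothetical stand-in for a dataset: each row is the list of features of one datum.
final class RetainFeaturesSketch {

    // Builds a new list of rows that contain only features present in `keep`.
    // The input rows are copied, so the original data is left untouched.
    public static List<List<String>> retainFeatures(List<List<String>> rows, Set<String> keep) {
        List<List<String>> result = new ArrayList<>();
        for (List<String> row : rows) {
            List<String> kept = new ArrayList<>(row); // copy before filtering
            kept.retainAll(keep);                     // drop every feature not in the set
            result.add(kept);
        }
        return result;
    }
}
```

Copying each row before filtering mirrors the "creates a new dataset" wording: the caller's data is not modified.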
Returns the Datum for the given row. These datums are always represented as RVFDatums.
Removes features that appear fewer than threshold times in this dataset.
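The count threshold can be sketched as a two-pass filter: count every feature occurrence, then drop the features whose count falls below the threshold. This is an illustration under stated assumptions, not the library's code. Rows are modeled as lists of feature names, and `CountThresholdSketch` and `applyFeatureCountThreshold` are hypothetical names.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a dataset: each row is the list of features of one datum.
final class CountThresholdSketch {

    // Returns new rows in which every feature seen fewer than `threshold` times
    // across the whole input has been removed.
    public static List<List<String>> applyFeatureCountThreshold(List<List<String>> rows, int threshold) {
        // First pass: count how many times each feature appears in the dataset.
        Map<String, Integer> counts = new HashMap<>();
        for (List<String> row : rows) {
            for (String feature : row) {
                counts.merge(feature, 1, Integer::sum);
            }
        }
        // Second pass: keep only the features whose count reaches the threshold.
        List<List<String>> pruned = new ArrayList<>();
        for (List<String> row : rows) {
            List<String> kept = new ArrayList<>();
            for (String feature : row) {
                if (counts.get(feature) >= threshold) {
                    kept.add(feature);
                }
            }
            pruned.add(kept);
        }
        return pruned;
    }
}
```

For the rows `[a, b]`, `[a, c]`, `[a, b]` and a threshold of 2, feature `c` appears only once and is dropped, giving `[[a, b], [a], [a, b]]`.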
Removes features by information gain.
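The description above does not say how information gain is computed or how the cutoff is chosen. As a hedged illustration, the sketch below uses the textbook definition for a binary "feature present" event: the entropy of the labels minus the weighted entropy of the labels once the data is split by presence of the feature. The names `InfoGainSketch`, `entropy`, and `informationGain`, and the list-based row and label representation, are assumptions made for this example.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: rows are lists of feature names, labels are strings.
final class InfoGainSketch {

    // Shannon entropy, in bits, of a distribution given as label counts.
    static double entropy(Map<String, Integer> labelCounts, int total) {
        if (total == 0) {
            return 0.0; // an empty partition contributes no uncertainty
        }
        double h = 0.0;
        for (int count : labelCounts.values()) {
            double p = (double) count / total;
            h -= p * (Math.log(p) / Math.log(2));
        }
        return h;
    }

    // Information gain of "feature is present in the row" with respect to the label.
    public static double informationGain(List<List<String>> rows, List<String> labels, String feature) {
        int n = rows.size();
        if (n == 0) {
            return 0.0;
        }
        Map<String, Integer> all = new HashMap<>();
        Map<String, Integer> present = new HashMap<>();
        Map<String, Integer> absent = new HashMap<>();
        int nPresent = 0;
        for (int i = 0; i < n; i++) {
            String label = labels.get(i);
            all.merge(label, 1, Integer::sum);
            if (rows.get(i).contains(feature)) {
                present.merge(label, 1, Integer::sum);
                nPresent++;
            } else {
                absent.merge(label, 1, Integer::sum);
            }
        }
        int nAbsent = n - nPresent;
        // Weighted average of the label entropy within each partition.
        double conditional = ((double) nPresent / n) * entropy(present, nPresent)
                           + ((double) nAbsent / n) * entropy(absent, nAbsent);
        return entropy(all, n) - conditional;
    }
}
```

A feature that perfectly separates two equally frequent labels scores 1 bit, while a feature present in every row scores 0, so ranking features by this value and discarding the lowest ones is one plausible way to prune.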
The number of training examples.
Converts this dataset to a CounterDataset.
A dataset that represents datums as explicit counters. This is more efficient for training various algorithms, such as random forests.