Class FilterInvalidValues

    • Constructor Detail

      • FilterInvalidValues

        public FilterInvalidValues()
        Filters out examples that have an invalid value in any column.
      • FilterInvalidValues

        public FilterInvalidValues(String... columnsToFilterIfInvalid)
        Parameters:
        columnsToFilterIfInvalid - Columns to check for invalid values
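        The difference between the two constructors can be sketched with plain Java collections. This is an illustration only and does not use DataVec: the class `InvalidValueFilterSketch`, its `isInvalid` rule (null or NaN counts as invalid), and the representation of an example as a `List<Object>` are all assumptions. The real class works on `Writable` values and a `Schema`, and its validity rules may differ.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for illustration only; not part of DataVec.
class InvalidValueFilterSketch {
    private final List<String> schemaColumns;  // all column names, in order
    private final List<String> columnsToCheck; // columns that get validated

    // Mirrors the no-argument constructor: every column is checked.
    InvalidValueFilterSketch(List<String> schemaColumns) {
        this.schemaColumns = schemaColumns;
        this.columnsToCheck = schemaColumns;
    }

    // Mirrors the varargs constructor: only the named columns are checked.
    InvalidValueFilterSketch(List<String> schemaColumns, String... columnsToFilterIfInvalid) {
        this.schemaColumns = schemaColumns;
        this.columnsToCheck = Arrays.asList(columnsToFilterIfInvalid);
    }

    // Assumed validity rule for this sketch: null and NaN are invalid.
    static boolean isInvalid(Object value) {
        return value == null || (value instanceof Double && ((Double) value).isNaN());
    }

    // Returns true if the example should be removed, false to keep it.
    boolean removeExample(List<Object> example) {
        for (String name : columnsToCheck) {
            int idx = schemaColumns.indexOf(name);
            if (idx < 0) {
                throw new IllegalArgumentException("Unknown column: " + name);
            }
            if (isInvalid(example.get(idx))) {
                return true;
            }
        }
        return false;
    }
}
```

        With columns `id` and `score`, an example whose `id` is null is removed by the all-columns variant but kept when only `score` is checked.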
    • Method Detail

      • transform

        public Schema transform(Schema inputSchema)
        Get the output schema for this transformation, given an input schema.
        Specified by:
        transform in interface Operation<Schema,Schema>
        Parameters:
        inputSchema - Input schema
      • removeExample

        public boolean removeExample(Object writables)
        Specified by:
        removeExample in interface Filter
        Parameters:
        writables - Example
        Returns:
        true if example should be removed, false to keep
      • removeSequence

        public boolean removeSequence(Object sequence)
        Specified by:
        removeSequence in interface Filter
        Parameters:
        sequence - sequence example
        Returns:
        true if example should be removed, false to keep
      • removeExample

        public boolean removeExample(List<Writable> writables)
        Specified by:
        removeExample in interface Filter
        Parameters:
        writables - Example
        Returns:
        true if example should be removed, false to keep
      • removeSequence

        public boolean removeSequence(List<List<Writable>> sequence)
        Specified by:
        removeSequence in interface Filter
        Parameters:
        sequence - sequence example
        Returns:
        true if example should be removed, false to keep
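        One plausible way the sequence overload relates to the single-example overload is sketched below. The rule shown (remove the whole sequence if any time step would be removed) is an assumption for illustration, not something this page states, and `SequenceFilterSketch` is a hypothetical helper rather than DataVec code.

```java
import java.util.List;
import java.util.function.Predicate;

// Hypothetical helper for illustration only; not part of DataVec.
class SequenceFilterSketch {
    // Assumed rule: the sequence is removed as soon as one time step
    // fails the per-example check supplied by the caller.
    static <T> boolean removeSequence(List<List<T>> sequence,
                                      Predicate<List<T>> removeExample) {
        for (List<T> step : sequence) {
            if (removeExample.test(step)) {
                return true;
            }
        }
        return false;
    }
}
```

        Passing the per-example check as a `Predicate` keeps the sketch independent of any particular validity rule.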
      • outputColumnName

        public String outputColumnName()
        The output column name after the operation has been applied
        Specified by:
        outputColumnName in interface ColumnOp
        Returns:
        the output column name
      • outputColumnNames

        public String[] outputColumnNames()
        The output column names. This will often be the same as the input.
        Specified by:
        outputColumnNames in interface ColumnOp
        Returns:
        the output column names
      • columnNames

        public String[] columnNames()
        Returns column names this op is meant to run on
        Specified by:
        columnNames in interface ColumnOp
        Returns:
        the column names this op is meant to run on
      • columnName

        public String columnName()
        Returns a singular column name this op is meant to run on
        Specified by:
        columnName in interface ColumnOp
        Returns:
        the single column name this op is meant to run on