Class/Object

io.projectglow.gff

GffDataSource

Related Docs: object GffDataSource | package gff

Permalink

class GffDataSource extends RelationProvider with SchemaRelationProvider with CreatableRelationProvider with DataSourceRegister

Gff data source to read GFF3 files.

The data source is able to infer the schema or accept a user-specified schema. It flattens the attributes field by creating a column for each tag that appears in the attributes column of the gff file.

The inferred schema will have base fields corresponding to the first 8 columns of gff3 called seqId, source, type, start, end, score, strand, and phase, followed by any official attribute field among id, name, alias, parent, target, gap, derivesfrom, note, dbxref, ontologyterm, and iscircular that appears in the gff tags followed by any unofficial attribute field that appears in the tags. In the inferred schema, the base and official fields will be in the same order as listed above. The unofficial fields will be in alphabetical order.

Any user-specified schema can have any subset of fields corresponding to the 9 columns of gff3 (named seqId, source, type, start, end, score, strand, phase, and attributes), the official attribute fields, and the unofficial attribute fields. The name of the official and unofficial fields should match the tag name in a case-and-underscore-insensitive fashion.

Linear Supertypes
DataSourceRegister, CreatableRelationProvider, SchemaRelationProvider, RelationProvider, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. GffDataSource
  2. DataSourceRegister
  3. CreatableRelationProvider
  4. SchemaRelationProvider
  5. RelationProvider
  6. AnyRef
  7. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new GffDataSource()

    Permalink

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  4. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  5. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  6. def createRelation(sqlContext: SQLContext, mode: SaveMode, parameters: Map[String, String], data: DataFrame): BaseRelation

    Permalink

    Saves a DataFrame to a destination (not supported in this DataSource.)

    Saves a DataFrame to a destination (not supported in this DataSource.)

    Definition Classes
    GffDataSource → CreatableRelationProvider
  7. def createRelation(sqlContext: SQLContext, parameters: Map[String, String], schema: StructType): BaseRelation

    Permalink

    Creates relation with user schema.

    Creates relation with user schema. User-specified schema can have a subset of attribute columns as they will be parsed out of "attributes" column

    sqlContext

    spark sql context

    parameters

    parameters for job

    schema

    user defined schema

    returns

    Base relation

    Definition Classes
    GffDataSource → SchemaRelationProvider
  8. def createRelation(sqlContext: SQLContext, parameters: Map[String, String]): BaseRelation

    Permalink

    Creates relation

    Creates relation

    sqlContext

    spark sql context

    parameters

    parameters for job

    returns

    Base relation

    Definition Classes
    GffDataSource → RelationProvider
  9. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  10. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  11. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  12. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  13. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  14. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  15. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  16. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  17. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  18. def shortName(): String

    Permalink
    Definition Classes
    GffDataSource → DataSourceRegister
  19. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  20. def toString(): String

    Permalink
    Definition Classes
    AnyRef → Any
  21. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  22. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  23. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )

Inherited from DataSourceRegister

Inherited from CreatableRelationProvider

Inherited from SchemaRelationProvider

Inherited from RelationProvider

Inherited from AnyRef

Inherited from Any

Ungrouped