package api
- Alphabetic
- Public
- Protected
Type Members
- sealed abstract final class Accuracy extends Enum[Accuracy] with TEnum
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @Generated()
- class Aggregation extends TBase[Aggregation, _Fields] with Serializable with Cloneable with Comparable[Aggregation]
Chronon provides a powerful aggregations primitive - that takes the familiar aggregation operation, via groupBy in SQL and extends it with three things - windowing, bucketing and auto-explode.
Chronon provides a powerful aggregations primitive - that takes the familiar aggregation operation, via groupBy in SQL and extends it with three things - windowing, bucketing and auto-explode.
:param inputColumn: The column as specified in source.query.selects - on which we need to aggregate with.
:param operation: The type of aggregation that needs to be performed on the inputColumn.
:param argMap: Extra arguments that needs to be passed to some of the operations like LAST_K, APPROX_PERCENTILE.
:param windows: For TEMPORAL case windows are sawtooth. Meaning head slides ahead continuously in time, whereas, the tail only hops ahead, at discrete points in time. Hop is determined by the window size automatically. The maximum hop size is 1/12 of window size. You can specify multiple such windows at once.
Window > 12 days -> Hop Size = 1 day
Window > 12 hours -> Hop Size = 1 hr
Window > 1hr -> Hop Size = 5 minutes
:param buckets: This is an additional layer of aggregation. You can key a group_by by user, and bucket a "item_view" count by "item_category". This will produce one row per user, with column containing map of "item_category" to "view_count". You can specify multiple such buckets at once.
- Annotations
- @SuppressWarnings() @Generated()
- class AggregationPart extends TBase[AggregationPart, _Fields] with Serializable with Cloneable with Comparable[AggregationPart]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- class AggregationSelector extends TBase[AggregationSelector, _Fields] with Serializable with Cloneable with Comparable[AggregationSelector]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- class DataField extends TBase[DataField, _Fields] with Serializable with Cloneable with Comparable[DataField]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- sealed abstract final class DataKind extends Enum[DataKind] with TEnum
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @Generated()
- sealed trait DataType extends Serializable
- class EntitySource extends TBase[EntitySource, _Fields] with Serializable with Cloneable with Comparable[EntitySource]
Entity Sources represent data that gets mutated over-time - at row-level.
Entity Sources represent data that gets mutated over-time - at row-level. This is a group of three data elements. snapshotTable, mutationTable and mutationTopic. mutationTable and mutationTopic are only necessary if we are trying to create realtime or point-in-time aggregations over these sources. Entity sources usually map 1:1 with a database tables in your OLTP store that typically serves live application traffic. When mutation data is absent they map 1:1 to
dim
tables in star schema.:param snapshotTable: Table currently needs to be a 'ds' (date string - yyyy-MM-dd) partitioned hive table. :param mutationTable: Topic is a kafka table. The table contains all the events historically came through this topic. :param mutationTopic: The logic used to scan both the table and the topic. Contains row level transformations and filtering expressed as Spark SQL statements. :param isCumulative: If each new hive partition contains not just the current day's events but the entire set of events since the begininng. The key property is that the events are not mutated across partitions.
- Annotations
- @SuppressWarnings() @Generated()
- class EventSource extends TBase[EventSource, _Fields] with Serializable with Cloneable with Comparable[EventSource]
Event source captures data that is essentially immutable - like user clicks, impressions etc.
Event source captures data that is essentially immutable - like user clicks, impressions etc. It has two parts, an offline table and an online topic. The term
fact
table from star-schema also maps to this concepts. But lacks a notion of topic.:param table: Table currently needs to be a 'ds' (date string - yyyy-MM-dd) partitioned hive table. :param topic: Topic is a kafka table. The table contains all the events historically came through this topic. :param query: The logic used to scan both the table and the topic. Contains row level transformations and filtering expressed as Spark SQL statements. :param isCumulative: If each new hive partition contains not just the current day's events but the entire set of events since the begininng. The key property is that the events are not mutated across partitions.
- Annotations
- @SuppressWarnings() @Generated()
- class ExternalPart extends TBase[ExternalPart, _Fields] with Serializable with Cloneable with Comparable[ExternalPart]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- class ExternalSource extends TBase[ExternalSource, _Fields] with Serializable with Cloneable with Comparable[ExternalSource]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- class GroupBy extends TBase[GroupBy, _Fields] with Serializable with Cloneable with Comparable[GroupBy]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- class GroupByServingInfo extends TBase[GroupByServingInfo, _Fields] with Serializable with Cloneable with Comparable[GroupByServingInfo]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- class Join extends TBase[Join, _Fields] with Serializable with Cloneable with Comparable[Join]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- class JoinPart extends TBase[JoinPart, _Fields] with Serializable with Cloneable with Comparable[JoinPart]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- case class ListType(elementType: DataType) extends DataType with Product with Serializable
- case class MapType(keyType: DataType, valueType: DataType) extends DataType with Product with Serializable
- class MetaData extends TBase[MetaData, _Fields] with Serializable with Cloneable with Comparable[MetaData]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- sealed abstract final class Operation extends Enum[Operation] with TEnum
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @Generated()
- case class PartitionSpec(format: String, spanMillis: Long) extends Product with Serializable
- class Query extends TBase[Query, _Fields] with Serializable with Cloneable with Comparable[Query]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- trait Row extends AnyRef
- class Source extends TUnion[Source, _Fields]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- class StagingQuery extends TBase[StagingQuery, _Fields] with Serializable with Cloneable with Comparable[StagingQuery]
Staging Query encapsulates arbitrary spark computation.
Staging Query encapsulates arbitrary spark computation. One key feature is that the computation follows a "fill-what's-missing" pattern. Basically instead of explicitly specifying dates you specify two macros.
{{ start_date }}
and{{end_date}}
. Chronon will pass in earliest-missing-partition forstart_date
and execution-date / today forend_date
. So the query will compute multiple partitions at once.:param metaData: Contains name, team, output_namespace, execution parameters etc. Things that don't change the semantics of the computation itself. :param query: The spark sql query with date templates. :param startPartition: This is where start_date will be applied from. We expect the query to also produce output hive tables partitioned data starting from startPartition. :param setups: Spark SQL setup statements. Used typically to register UDFs.
- Annotations
- @SuppressWarnings() @Generated()
- case class StructField(name: String, fieldType: DataType) extends Product with Serializable
- case class StructType(name: String, fields: Array[StructField]) extends DataType with Seq[StructField] with Product with Serializable
- class TDataType extends TBase[TDataType, _Fields] with Serializable with Cloneable with Comparable[TDataType]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
- sealed abstract final class TimeUnit extends Enum[TimeUnit] with TEnum
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @Generated()
- case class UnknownType(any: Any) extends DataType with Product with Serializable
- class Window extends TBase[Window, _Fields] with Serializable with Cloneable with Comparable[Window]
Autogenerated by Thrift Compiler (0.13.0)
Autogenerated by Thrift Compiler (0.13.0)
DO NOT EDIT UNLESS YOU ARE SURE THAT YOU KNOW WHAT YOU ARE DOING
- Annotations
- @SuppressWarnings() @Generated()
Value Members
- case object BinaryType extends DataType with Product with Serializable
- case object BooleanType extends DataType with Product with Serializable
- object Builders
- case object ByteType extends DataType with Product with Serializable
- object Constants
- object DataModel extends Enumeration
- object DataType extends Serializable
- case object DateType extends DataType with Product with Serializable
- case object DoubleType extends DataType with Product with Serializable
- object Extensions
- case object FloatType extends DataType with Product with Serializable
- object HashUtils
- case object IntType extends DataType with Product with Serializable
- case object LongType extends DataType with Product with Serializable
- object QueryUtils
- object Row
- case object ShortType extends DataType with Product with Serializable
- case object StringType extends DataType with Product with Serializable
- object StructType extends Serializable
- object ThriftJsonCodec
- case object TimestampType extends DataType with Product with Serializable