HiveTypeCoercion

Value Members

final def !=(arg0: Any): Boolean

Definition Classes
AnyRef → Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: Any): Boolean

Definition Classes
AnyRef → Any
object BooleanCasts extends Rule[LogicalPlan]

Casts to/from BooleanType are transformed into comparisons since the JVM does not consider Booleans to be numeric types.
object BooleanComparisons extends Rule[LogicalPlan]

Changes Boolean values to Bytes so that expressions like true < false can be Evaluated.
object CaseWhenCoercion extends Rule[LogicalPlan]

Coerces the type of different branches of a CASE WHEN statement to a common type.
object ConvertNaNs extends Rule[LogicalPlan]

Converts string "NaN"s that are in binary operators with a NaN-able types (Float / Double) to the appropriate numeric equivalent.
object DecimalPrecision extends Rule[LogicalPlan]

Calculates and propagates precision for fixed-precision decimals.
Calculates and propagates precision for fixed-precision decimals. Hive has a number of rules for this based on the SQL standard and MS SQL: https://cwiki.apache.org/confluence/download/attachments/27362075/Hive_Decimal_Precision_Scale_Support.pdf https://msdn.microsoft.com/en-us/library/ms190476.aspx
In particular, if we have expressions e1 and e2 with precision/scale p1/s2 and p2/s2 respectively, then the following operations have the following precision / scale:
Operation Result Precision Result Scale ------------------------------------------------------------------------ e1 + e2 max(s1, s2) + max(p1-s1, p2-s2) + 1 max(s1, s2) e1 - e2 max(s1, s2) + max(p1-s1, p2-s2) + 1 max(s1, s2) e1 * e2 p1 + p2 + 1 s1 + s2 e1 / e2 p1 - s1 + s2 + max(6, s1 + p2 + 1) max(6, s1 + p2 + 1) e1 % e2 min(p1-s1, p2-s2) + max(s1, s2) max(s1, s2) e1 union e2 max(s1, s2) + max(p1-s1, p2-s2) max(s1, s2) sum(e1) p1 + 10 s1 avg(e1) p1 + 4 s1 + 4
Catalyst also has unlimited-precision decimals. For those, all ops return unlimited precision.
To implement the rules for fixed-precision types, we introduce casts to turn them to unlimited precision, do the math on unlimited-precision numbers, then introduce casts back to the required fixed precision. This allows us to do all rounding and overflow handling in the cast-to-fixed-precision operator.
In addition, when mixing non-decimal types with decimals, we use the following rules: - BYTE gets turned into DECIMAL(3, 0) - SHORT gets turned into DECIMAL(5, 0) - INT gets turned into DECIMAL(10, 0) - LONG gets turned into DECIMAL(20, 0) - FLOAT and DOUBLE
1. Union operation: FLOAT gets turned into DECIMAL(7, 7), DOUBLE gets turned into DECIMAL(15, 15) (this is the same as Hive) 2. Other operation: FLOAT and DOUBLE cause fixed-length decimals to turn into DOUBLE (this is the same as Hive, but note that unlimited decimals are considered bigger than doubles in WidenTypes)
object Division extends Rule[LogicalPlan]

Hive only performs integral division with the DIV operator.
Hive only performs integral division with the DIV operator. The arguments to / are always converted to fractional types.
object ExpectedInputConversion extends Rule[LogicalPlan]

Casts types according to the expected input types for Expressions that have the trait ExpectsInputTypes.
object FunctionArgumentConversion extends Rule[LogicalPlan]

This ensure that the types for various functions are as expected.
object InConversion extends Rule[LogicalPlan]

Convert all expressions in in() list to the left operator type
object PromoteStrings extends Rule[LogicalPlan]

Promotes strings that appear in arithmetic expressions.
object PropagateTypes extends Rule[LogicalPlan]

Applies any changes to AttributeReference data types that are made by other rules to instances higher in the query tree.
object StringToIntegralCasts extends Rule[LogicalPlan]

When encountering a cast from a string representing a valid fractional number to an integral type the jvm will throw a java.lang.NumberFormatException.
When encountering a cast from a string representing a valid fractional number to an integral type the jvm will throw a java.lang.NumberFormatException. Hive, in contrast, returns the truncated version of this number.
object WidenTypes extends Rule[LogicalPlan]

Widens numeric types and converts strings to numbers when appropriate.
Widens numeric types and converts strings to numbers when appropriate.
Loosely based on rules from "Hadoop: The Definitive Guide" 2nd edition, by Tom White
The implicit conversion rules can be summarized as follows:
- Any integral numeric type can be implicitly converted to a wider type.
- All the integral numeric types, FLOAT, and (perhaps surprisingly) STRING can be implicitly converted to DOUBLE.
- TINYINT, SMALLINT, and INT can all be converted to FLOAT.
- BOOLEAN types cannot be converted to any other type.
Additionally, all types when UNION-ed with strings will be promoted to strings. Other string conversions are handled by PromoteStrings.
Widening types might result in loss of precision in the following cases: - IntegerType to FloatType - LongType to FloatType - LongType to DoubleType
final def asInstanceOf[T0]: T0

Definition Classes
Any
def clone(): AnyRef

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( ... )
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def finalize(): Unit

Attributes
protected[java.lang]
Definition Classes
AnyRef
Annotations
@throws( classOf[java.lang.Throwable] )
final def getClass(): Class[_]

Definition Classes
AnyRef → Any
def hashCode(): Int

Definition Classes
AnyRef → Any
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def toString(): String

Definition Classes
AnyRef → Any
val typeCoercionRules: List[Rule[LogicalPlan]]
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws( ... )

Related Docs: object HiveTypeCoercion | package analysis

trait HiveTypeCoercion extends AnyRef

Value Members

final def !=(arg0: Any): Boolean

final def ##(): Int

final def ==(arg0: Any): Boolean

object BooleanCasts extends Rule[LogicalPlan]

object BooleanComparisons extends Rule[LogicalPlan]

object CaseWhenCoercion extends Rule[LogicalPlan]

object ConvertNaNs extends Rule[LogicalPlan]

object DecimalPrecision extends Rule[LogicalPlan]

object Division extends Rule[LogicalPlan]

object ExpectedInputConversion extends Rule[LogicalPlan]

object FunctionArgumentConversion extends Rule[LogicalPlan]

object InConversion extends Rule[LogicalPlan]

object PromoteStrings extends Rule[LogicalPlan]

object PropagateTypes extends Rule[LogicalPlan]

object StringToIntegralCasts extends Rule[LogicalPlan]

object WidenTypes extends Rule[LogicalPlan]

final def asInstanceOf[T0]: T0

def clone(): AnyRef

final def eq(arg0: AnyRef): Boolean

def equals(arg0: Any): Boolean

def finalize(): Unit

final def getClass(): Class[_]

def hashCode(): Int

final def isInstanceOf[T0]: Boolean

final def ne(arg0: AnyRef): Boolean

final def notify(): Unit

final def notifyAll(): Unit

final def synchronized[T0](arg0: ⇒ T0): T0

def toString(): String

val typeCoercionRules: List[Rule[LogicalPlan]]

final def wait(): Unit

final def wait(arg0: Long, arg1: Int): Unit

final def wait(arg0: Long): Unit

Inherited from AnyRef

Inherited from Any

Ungrouped