class ApplyInPandasWithStatePythonRunner extends BasePythonRunner[InType, OutType] with PythonArrowInput[InType] with PythonArrowOutput[OutType]

A variant implementation of ArrowPythonRunner to serve the operation applyInPandasWithState.

Unlike the normal ArrowPythonRunner, where both input and output (executor <-> Python worker) are InternalRow, applyInPandasWithState carries side data (state information) alongside the data in both input and output, which requires a different struct layout in the Arrow RecordBatch.
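
The idea of carrying state information next to the data can be pictured with a minimal sketch. It assumes a simplified, hypothetical schema model: `Field`, `Schema`, `withStateColumn`, and the column name `state_info` are illustrative names only and are not Spark's StructType or the runner's actual column layout.

```scala
object SideDataSketch {
  // Hypothetical, simplified stand-ins for a schema; not Spark's StructType.
  final case class Field(name: String, dataType: String)

  final case class Schema(fields: Vector[Field]) {
    // Returns a new schema with one more field appended at the end.
    def add(name: String, dataType: String): Schema =
      Schema(fields :+ Field(name, dataType))

    def fieldNames: Vector[String] = fields.map(_.name)
  }

  // Assumed name for the side-data column; the real runner may use another.
  val StateColumn: String = "state_info"

  // A batch for a stateful operation needs the data columns plus one
  // extra struct column that carries the state information.
  def withStateColumn(dataSchema: Schema): Schema =
    dataSchema.add(StateColumn, "struct")

  def main(args: Array[String]): Unit = {
    val dataSchema = Schema(Vector(Field("id", "long"), Field("value", "double")))
    println(withStateColumn(dataSchema).fieldNames.mkString(", "))
  }
}
```

The point of the sketch is only that the batch schema is no longer the plain data schema, which is why a dedicated runner is needed on both the write and read paths.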

Linear Supertypes
PythonArrowOutput[OutType], PythonArrowInput[InType], BasePythonRunner[InType, OutType], Logging, AnyRef, Any

Instance Constructors

  1. new ApplyInPandasWithStatePythonRunner(funcs: Seq[(ChainedPythonFunctions, Long)], evalType: Int, argOffsets: Array[Array[Int]], inputSchema: StructType, _timeZoneId: String, initialWorkerConf: Map[String, String], stateEncoder: ExpressionEncoder[Row], keySchema: StructType, outputSchema: StructType, stateValueSchema: StructType, pythonMetrics: Map[String, SQLMetric], jobArtifactUUID: Option[String])

Type Members

  1. implicit class LogStringContext extends AnyRef
    Definition Classes
    Logging
  2. class MonitorThread extends Thread
    Definition Classes
    BasePythonRunner
  3. class ReaderInputStream extends InputStream
    Definition Classes
    BasePythonRunner
  4. abstract class ReaderIterator extends Iterator[OUT]
    Definition Classes
    BasePythonRunner
  5. abstract class Writer extends AnyRef
    Definition Classes
    BasePythonRunner

Value Members

  1. final def !=(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  2. final def ##: Int
    Definition Classes
    AnyRef → Any
  3. final def ==(arg0: Any): Boolean
    Definition Classes
    AnyRef → Any
  4. val accumulator: PythonAccumulator
    Attributes
    protected
    Definition Classes
    BasePythonRunner
  5. val argOffsets: Array[Array[Int]]
    Attributes
    protected
    Definition Classes
    BasePythonRunner
  6. final def asInstanceOf[T0]: T0
    Definition Classes
    Any
  7. val authSocketTimeout: Long
    Attributes
    protected
    Definition Classes
    BasePythonRunner
  8. val bufferSize: Int
    Definition Classes
    ApplyInPandasWithStatePythonRunner → BasePythonRunner
  9. def clone(): AnyRef
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.CloneNotSupportedException]) @IntrinsicCandidate() @native()
  10. def close(): Unit
    Attributes
    protected
    Definition Classes
    PythonArrowInput
  11. def compute(inputIterator: Iterator[InType], partitionIndex: Int, context: TaskContext): Iterator[OutType]
    Definition Classes
    BasePythonRunner
  12. def deserializeColumnarBatch(batch: ColumnarBatch, schema: StructType): OutType

    Deserialize ColumnarBatch received from the Python worker to produce the output. Schema info for the given ColumnarBatch is also provided.

    Attributes
    protected
    Definition Classes
    ApplyInPandasWithStatePythonRunner → PythonArrowOutput
  13. val envVars: Map[String, String]
    Attributes
    protected
    Definition Classes
    BasePythonRunner
  14. final def eq(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  15. def equals(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef → Any
  16. val errorOnDuplicatedFieldNames: Boolean
    Definition Classes
    ApplyInPandasWithStatePythonRunner → PythonArrowInput
  17. val evalType: Int
    Attributes
    protected
    Definition Classes
    BasePythonRunner
  18. val faultHandlerEnabled: Boolean
    Definition Classes
    ApplyInPandasWithStatePythonRunner → BasePythonRunner
  19. val funcs: Seq[ChainedPythonFunctions]
    Attributes
    protected
    Definition Classes
    BasePythonRunner
  20. final def getClass(): Class[_ <: AnyRef]
    Definition Classes
    AnyRef → Any
    Annotations
    @IntrinsicCandidate() @native()
  21. def handleMetadataAfterExec(stream: DataInputStream): Unit
    Attributes
    protected
    Definition Classes
    PythonArrowOutput
  22. def handleMetadataBeforeExec(stream: DataOutputStream): Unit

    This method sends out the additional metadata before sending out actual data.

    Specifically, this class overrides this method to also write the schema for the state value.

    Attributes
    protected
    Definition Classes
    ApplyInPandasWithStatePythonRunner → PythonArrowInput
  23. def hashCode(): Int
    Definition Classes
    AnyRef → Any
    Annotations
    @IntrinsicCandidate() @native()
  24. def initializeLogIfNecessary(isInterpreter: Boolean, silent: Boolean): Boolean
    Attributes
    protected
    Definition Classes
    Logging
  25. def initializeLogIfNecessary(isInterpreter: Boolean): Unit
    Attributes
    protected
    Definition Classes
    Logging
  26. final def isInstanceOf[T0]: Boolean
    Definition Classes
    Any
  27. def isTraceEnabled(): Boolean
    Attributes
    protected
    Definition Classes
    Logging
  28. val jobArtifactUUID: Option[String]
    Attributes
    protected
    Definition Classes
    BasePythonRunner
  29. val largeVarTypes: Boolean
    Attributes
    protected
    Definition Classes
    ApplyInPandasWithStatePythonRunner → PythonArrowInput
  30. def log: Logger
    Attributes
    protected
    Definition Classes
    Logging
  31. def logDebug(msg: => String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  32. def logDebug(entry: LogEntry, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  33. def logDebug(entry: LogEntry): Unit
    Attributes
    protected
    Definition Classes
    Logging
  34. def logDebug(msg: => String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  35. def logError(msg: => String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  36. def logError(entry: LogEntry, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  37. def logError(entry: LogEntry): Unit
    Attributes
    protected
    Definition Classes
    Logging
  38. def logError(msg: => String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  39. def logInfo(msg: => String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  40. def logInfo(entry: LogEntry, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  41. def logInfo(entry: LogEntry): Unit
    Attributes
    protected
    Definition Classes
    Logging
  42. def logInfo(msg: => String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  43. def logName: String
    Attributes
    protected
    Definition Classes
    Logging
  44. def logTrace(msg: => String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  45. def logTrace(entry: LogEntry, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  46. def logTrace(entry: LogEntry): Unit
    Attributes
    protected
    Definition Classes
    Logging
  47. def logTrace(msg: => String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  48. def logWarning(msg: => String, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  49. def logWarning(entry: LogEntry, throwable: Throwable): Unit
    Attributes
    protected
    Definition Classes
    Logging
  50. def logWarning(entry: LogEntry): Unit
    Attributes
    protected
    Definition Classes
    Logging
  51. def logWarning(msg: => String): Unit
    Attributes
    protected
    Definition Classes
    Logging
  52. final def ne(arg0: AnyRef): Boolean
    Definition Classes
    AnyRef
  53. def newReaderIterator(stream: DataInputStream, writer: Writer, startTime: Long, env: SparkEnv, worker: PythonWorker, pid: Option[Int], releasedOrClosed: AtomicBoolean, context: TaskContext): Iterator[OutType]
    Attributes
    protected
    Definition Classes
    PythonArrowOutput
  54. def newWriter(env: SparkEnv, worker: PythonWorker, inputIterator: Iterator[InType], partitionIndex: Int, context: TaskContext): Writer
    Attributes
    protected
    Definition Classes
    PythonArrowInput
  55. final def notify(): Unit
    Definition Classes
    AnyRef
    Annotations
    @IntrinsicCandidate() @native()
  56. final def notifyAll(): Unit
    Definition Classes
    AnyRef
    Annotations
    @IntrinsicCandidate() @native()
  57. val pythonExec: String
    Definition Classes
    ApplyInPandasWithStatePythonRunner → BasePythonRunner
  58. val pythonMetrics: Map[String, SQLMetric]
    Definition Classes
    ApplyInPandasWithStatePythonRunner → PythonArrowOutput → PythonArrowInput
  59. val pythonVer: String
    Attributes
    protected
    Definition Classes
    BasePythonRunner
  60. val root: VectorSchemaRoot
    Attributes
    protected
    Definition Classes
    PythonArrowInput
  61. lazy val schema: StructType
    Attributes
    protected
    Definition Classes
    ApplyInPandasWithStatePythonRunner → PythonArrowInput
  62. val simplifiedTraceback: Boolean
    Definition Classes
    ApplyInPandasWithStatePythonRunner → BasePythonRunner
  63. final def synchronized[T0](arg0: => T0): T0
    Definition Classes
    AnyRef
  64. lazy val timeZoneId: String
    Attributes
    protected
    Definition Classes
    ApplyInPandasWithStatePythonRunner → PythonArrowInput
  65. val timelyFlushEnabled: Boolean
    Attributes
    protected
    Definition Classes
    BasePythonRunner
  66. val timelyFlushTimeoutNanos: Long
    Attributes
    protected
    Definition Classes
    BasePythonRunner
  67. def toString(): String
    Definition Classes
    AnyRef → Any
  68. final def wait(arg0: Long, arg1: Int): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  69. final def wait(arg0: Long): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException]) @native()
  70. final def wait(): Unit
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.InterruptedException])
  71. def withLogContext(context: HashMap[String, String])(body: => Unit): Unit
    Attributes
    protected
    Definition Classes
    Logging
  72. val workerConf: Map[String, String]
    Attributes
    protected
    Definition Classes
    ApplyInPandasWithStatePythonRunner → PythonArrowInput
  73. def writeNextInputToArrowStream(root: VectorSchemaRoot, writer: ArrowStreamWriter, dataOut: DataOutputStream, inputIterator: Iterator[InType]): Boolean

    Read (key, state, values) from the input iterator, construct Arrow RecordBatches, and write the constructed RecordBatches to the writer.

    See ApplyInPandasWithStateWriter for more details.

    Attributes
    protected
    Definition Classes
    ApplyInPandasWithStatePythonRunner → PythonArrowInput
  74. def writeUDF(dataOut: DataOutputStream): Unit
    Attributes
    protected
    Definition Classes
    ApplyInPandasWithStatePythonRunner → PythonArrowInput
  75. val writer: ArrowStreamWriter
    Attributes
    protected
    Definition Classes
    PythonArrowInput
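
As an illustration of what handleMetadataBeforeExec describes (sending extra metadata, such as a schema, ahead of the data), here is a minimal sketch that uses only java.io. It assumes the schema travels as a length-prefixed UTF-8 JSON string; that wire format, the object name `MetadataSketch`, and its helper methods are assumptions for illustration, not the runner's confirmed implementation.

```scala
import java.io.{ByteArrayInputStream, ByteArrayOutputStream, DataInputStream, DataOutputStream}
import java.nio.charset.StandardCharsets

object MetadataSketch {
  // Write a string as a 4-byte big-endian length followed by its UTF-8 bytes.
  def writeLengthPrefixed(text: String, out: DataOutputStream): Unit = {
    val bytes = text.getBytes(StandardCharsets.UTF_8)
    out.writeInt(bytes.length)
    out.write(bytes)
  }

  // Read back a string written by writeLengthPrefixed.
  def readLengthPrefixed(in: DataInputStream): String = {
    val length = in.readInt()
    val bytes = new Array[Byte](length)
    in.readFully(bytes)
    new String(bytes, StandardCharsets.UTF_8)
  }

  // Send the metadata through an in-memory stream and read it on the other side.
  def roundTrip(text: String): String = {
    val buffer = new ByteArrayOutputStream()
    val out = new DataOutputStream(buffer)
    writeLengthPrefixed(text, out)
    out.flush()
    readLengthPrefixed(new DataInputStream(new ByteArrayInputStream(buffer.toByteArray)))
  }

  def main(args: Array[String]): Unit = {
    // Hypothetical schema JSON for a state value with one integer field.
    val stateValueSchemaJson = """{"type":"struct","fields":[{"name":"count","type":"integer"}]}"""
    println(roundTrip(stateValueSchemaJson) == stateValueSchemaJson)
  }
}
```

Writing the length first lets the receiving side know exactly how many bytes belong to the metadata before the data stream starts.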

Deprecated Value Members

  1. def finalize(): Unit
    Attributes
    protected[lang]
    Definition Classes
    AnyRef
    Annotations
    @throws(classOf[java.lang.Throwable]) @Deprecated
    Deprecated

    (Since version 9)
