Data
functions
DataAccessor
access
DataEnrichRoot
model
DataLoad
dataloads
DataLoadPointer
pointers
DataSpec
dataspecs
Debug
LogLevels
DecoderRegistry
HttpMessage
DefaultCharset
StringContent Sparkling
DefaultLineBuffer
HdfsIO
DefaultProps
EntitiesConstants
DefaultRetries
HttpClient
DefaultSleepMillis
HttpClient
DefaultTagFieldMapping
EntitiesConstants
DefaultTimeoutMillis
HttpClient
DependentFieldPointer
pointers
Derivatives
model
DigestUtil
util
DistributedConfig
archivespark
Dynamic
LoadingStrategy
data
WarcRecord
dataLoad
EnrichRootCompanion WarcRecord WaybackRecord
dataPath
HadoopDataSpec TextDataSpec TextFileDataSpec CdxHdfsSpec WarcCdxHdfsSpec WaybackCdxHdfsSpec
dataloads
model
dataspecs
archivespark
debug
Log
decompress
GzipUtil IOUtil
decompressConcatenated
GzipUtil
decompressConcatenatedWithPosition
GzipUtil
defaultDigestHash
WarcRecord
defaultField
Entities EnrichFunc GlobalEnrichFunc HttpPayload WarcPayload
defaultLoadingStrategy
HdfsIO
delete
HdfsIO
derive
Entities HtmlAttribute HtmlTag HtmlTags StringContent Values EnrichFunc GlobalEnrichFunc HttpPayload WarcPayload
digest
CdxRecord
dir
HdfsIO
disableCheckpointing
Sparkling
distinct
IteratorUtil RddUtil
distinctByValue
GenericHelpersRDD
distinctOrdered
IteratorUtil
distinctValue
EnrichableRDD
doPartitions
RddUtil
drop
IteratorUtil
dropUntil
IteratorUtil
dropWhile
IteratorUtil
dynamicCopyLocalThreshold
HdfsIO