FieldPointer
pointers
FileLocalityInputFormat
util
FileLocalityRecordReader
FileLocalityInputFormat
FilePathMap
util
FileStreamRecord
raw
field
Enrichable
fieldName
EnrichFunc DependentFieldPointer GenericNamedFieldPointer NamedFieldPointer
fields
Entities HtmlAttribute HtmlTag HtmlTags StringContent Values Derivatives EnrichFunc GlobalEnrichFunc HttpPayload WarcPayload
fileBuffer
IOUtil
filePathMap
WarcHdfsCdxSpecBase
fileSize
HdfsBlockStream
files
HdfsIO
filterExists
EnrichableRDD
filterNoException
EnrichableRDD
filterNonEmpty
EnrichableRDD
filterPrefixes
StringUtil
filterValue
EnrichableRDD
first
Html
fix
Time14Util
flatMap
CleanupIterator
flatMapValues
EnrichableRDD
flush
HdfsFileWriter NonClosingOutputStream TypedInOutWriter
format
HadoopDataSpec
formatNumber
StringUtil
fromBytes
StringUtil
fromFiles
WarcSpec
fromFilesWithCdx
WarcSpec
fromInputStream
StringUtil
fromPath
CdxSpec
fromString
CdxRecord
fromUrl
SurtUtil
fromWayback
WarcSpec
fromWaybackByCdxQuery
WarcSpec
fromWaybackWithLocalCdx
WarcSpec
fs
HdfsIO
func
AbsoluteUrl Data HtmlText LowerCase SURT GlobalEnrichFunc DependentFieldPointer
functions
archivespark warc