A B C D E F G H I J K L M N O P Q R S T U V
All Classes All Packages
A
- add(RdfSource) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceCollection
- add(RdfSource) - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- aggregateUsingJavaCollector(JavaRDD<? extends T>, Collector<? super T, A, R>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOps
-
Aggregate a JavaRDD using a serializable Collector.
- allowOverwriteFiles - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- andThen(JavaPairRddFunction<K, V, KO, VO>) - Method in interface net.sansa_stack.spark.rdd.function.ToJavaPairRddFunction
- andThen(JavaPairRddFunction<KO, VO, KX, VX>) - Method in interface net.sansa_stack.spark.rdd.function.JavaPairRddFunction
- andThen(JavaRddFunction<O, X>) - Method in interface net.sansa_stack.spark.rdd.function.JavaRddFunction
- andThen(JavaRddFunction<O, X>) - Method in interface net.sansa_stack.spark.rdd.function.ToJavaRddFunction
- applyDefaults() - Method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- asDatasets() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSource
-
A stream of datasets having one named graph each
- asDatasets() - Method in class net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromRdd
- asDatasets() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- asDatasets() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
- asJavaRdd(RDD<Tuple2<LongWritable, T>>) - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaders
-
Tiny helper to get the desired JavaRDD
- asJavaRddFunction(RxFunction<I, O>) - Static method in class net.sansa_stack.spark.rdd.op.rx.JavaRddRxOps
-
Wrap an RxFunction as a JavaRddFunction
- asModels() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSource
- asModels() - Method in class net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromRdd
- asModels() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- asModels() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
- asQuads() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSource
- asQuads() - Method in class net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromRdd
- asQuads() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- asQuads() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
- asTriples() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSource
- asTriples() - Method in class net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromRdd
- asTriples() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- asTriples() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
- AsyncRdfParserHadoop - Class in net.sansa_stack.spark.io.rdf.loader
-
Asynchronous parsing of RDF on a single node using Hadoop
- AsyncRdfParserHadoop() - Constructor for class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop
- AsyncRdfParserHadoop.Builder<T> - Class in net.sansa_stack.spark.io.rdf.loader
B
- Builder(Configuration, Path, InputFormat<?, T>, RefSupplier<ExecutorService>, StreamRDF, BiConsumer<T, StreamRDF>) - Constructor for class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
C
- clazz - Variable in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderImpl
- clearDataset() - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- clone() - Method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- close() - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- conf - Variable in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- conf - Variable in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- configureFrom(RddRdfWriterSettings<?>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- consoleOutSupplier - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- convertToNode - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- convertToNode(JavaRDD<T>) - Method in interface net.sansa_stack.spark.io.rdf.output.RddRdfOps
- convertToNode(JavaRDD<T>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- convertToQuad - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- convertToQuad(JavaRDD<T>) - Method in interface net.sansa_stack.spark.io.rdf.output.RddRdfOps
- convertToQuad(JavaRDD<T>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- convertToTriple - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- convertToTriple(JavaRDD<T>) - Method in interface net.sansa_stack.spark.io.rdf.output.RddRdfOps
- convertToTriple(JavaRDD<T>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- copyFrom(RdfPostProcessingSettings) - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsMutable
- copyInto(RdfPostProcessingSettingsMutable) - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettings
- create() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
- create(boolean, SerializableBiConsumer<T, StreamRDF>, JavaRddFunction<T, Triple>, JavaRddFunction<T, Quad>, JavaRddFunction<T, Node>, SerializableFunction<? super T, Comparable<?>>) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
-
Create method.
- create(Class<T>, Class<? extends FileInputFormat<LongWritable, T>>) - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaders
-
Create a default RddRdfLoader
- create(Configuration, SerializableSupplier<LinkSparqlUpdate>) - Static method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- create(Path, FileSystem, Lang) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceFactory
- create(Path, FileSystem, Lang) - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFactoryImpl
- create(InputFormat<?, X>, BiConsumer<X, StreamRDF>) - Static method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- createBinding(List<Var>, List<String>) - Static method in class net.sansa_stack.spark.io.csv.input.CsvDataSources
-
Util method to create a binding from a list of variables and a list of strings.
- createDefault() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry
- createForDataset() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- createForDataset() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- createForDatasetGraph() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- createForDatasetGraph() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- createForGraph() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- createForGraph() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- createForModel() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- createForModel() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- createForQuad() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- createForQuad() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- createForTriple() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- createForTriple() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- createJavaRdd(SparkContext, String, Class<T>, Class<? extends FileInputFormat<LongWritable, T>>) - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaders
- createRdd(SparkContext, String, Class<T>, Class<? extends FileInputFormat<LongWritable, T>>) - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaders
- createRddOfBindings(JavaSparkContext, String, UnivocityHadoopConf) - Static method in class net.sansa_stack.spark.io.csv.input.CsvDataSources
- createRddOfBindings(JavaSparkContext, String, UnivocityHadoopConf, Function<List<String>, Binding>) - Static method in class net.sansa_stack.spark.io.csv.input.CsvDataSources
- createRddOfDatasetCore(SparkContext, String, Class<T>, Class<? extends FileInputFormat<LongWritable, T>>) - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaders
- createStreamRDFFactory(RDFFormat, boolean, PrefixMapping) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
-
Create a function that can create a StreamRDF instance that is backed by the given OutputStream.
- CsvDataSources - Class in net.sansa_stack.spark.io.csv.input
- CsvDataSources() - Constructor for class net.sansa_stack.spark.io.csv.input.CsvDataSources
D
- declaredPrefixes - Variable in class net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromRdd
- deferOutputForUsedPrefixes - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
-
Only for console output: Instead of writing tuples out immediately, collect up to this number of tuples in order to derive the used prefixes.
- delegate - Variable in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- delete() - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- delete(Node) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- deletePartitionFolderAfterMerge - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- dispatcher - Variable in class net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromRdd
- dispatcher - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- distinct - Variable in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- distinctPartitions - Variable in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
E
- enrichRddWithRowNumIfNeeded(JavaRDD<Binding>, Query) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfBindingsOps
-
If the given query mentions a variable ?ROWNUM (upper case), then the input RDD of bindings is zipped with an index
- executorServiceRef - Variable in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
F
- fileInputFormatClass - Variable in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderImpl
- find(Lang, Class<T>) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RddRdfLoaderRegistry
-
Search the registry for a loader that can supply a resource of language 'lang' as records of type rdfType.
- find(Lang, Class<T>) - Method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderRegistryImpl
- flatMapToNamedModels(JavaRDD<? extends Dataset>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfDatasetsOps
- forDataset(JavaRDD<? extends DatasetOneNg>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
- forDatasetGraph(JavaRDD<? extends DatasetGraphOneNg>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
- forGraph(JavaRDD<? extends Graph>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
- FormatEntry(Class<?>, Class<?>, Class<? extends OutputFormat>) - Constructor for class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry.FormatEntry
- forModel(JavaRDD<? extends Model>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
- forQuad() - Static method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- forQuad(JavaRDD<? extends Quad>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
- forTriple() - Static method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- forTriple(JavaRDD<? extends Triple>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
- from(SparkSession) - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFactoryImpl
- fromRdd(JavaRDD<?>) - Static method in class net.sansa_stack.spark.util.JavaSparkContextUtils
G
- get() - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderRegistryImpl
- get() - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- get(String) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceFactory
- get(String, FileSystem) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceFactory
- get(String, Lang) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceFactory
- get(Path, FileSystem, Lang) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceFactory
- get(Node) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- get(Lang) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry
- getConsoleOutSupplier() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- getDataset() - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- getDefaultFileSystem() - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFactoryImpl
- getDefaults() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry
-
The default registry for elephas output formats based on Lang.
- getDelegate() - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- getDistinct() - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettings
- getDistinct() - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- getDistinctPartitions() - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettings
- getDistinctPartitions() - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- getEffectiveRdd(RdfPostProcessingSettings) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
-
Create the effective RDD w.r.t.
- getGlobalPrefixMapping() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- getInputFormat() - Method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- getInstance() - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry
- getKeyClass() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry.FormatEntry
- getKeyFunction() - Method in interface net.sansa_stack.spark.io.rdf.output.RddRdfOps
- getKeyFunction() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- getLang() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromResource
-
Get the language of the source; may have been probed for by an RdfSourceFactory
- getLang() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
- getMembers() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceCollection
- getMembers() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- getOptimizePrefixes() - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettings
- getOptimizePrefixes() - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- getOutputFormat() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- getOutputFormatClass() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry.FormatEntry
- getPartitionFolder() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- getPostProcessingSettings() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- getRdd() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- getSort() - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettings
- getSort() - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- getSortAscending() - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettings
- getSortAscending() - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- getSortPartitions() - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettings
- getSortPartitions() - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- getTargetFile() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- getValueClass() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry.FormatEntry
- globalPrefixMapping - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- GRAPH - net.sansa_stack.spark.io.rdf.input.api.RdfSourceRecordType
- groupBy(JavaRDD<Triple>, SerializableFunction<? super Triple, K>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
- groupByNamedGraph(JavaRDD<Quad>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfQuadsOps
-
Group quads by graph IRI into pairs (graphIri, Model)
- groupByObjectNodes(JavaRDD<Triple>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
- groupByObjects(JavaRDD<Triple>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
- groupByPredicateNodes(JavaRDD<Triple>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
-
This function mainly exists for completeness
- groupByPredicates(JavaRDD<Triple>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
-
This function mainly exists for completeness
- groupBySubjectNodes(JavaRDD<Triple>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
- groupBySubjects(JavaRDD<Triple>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
- groupKeysAndReduceValues(JavaPairRDD<K, V>, boolean, boolean, int, Function2<V, V, V>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOps
-
Convenience helper to group values by keys, optionally sort them and reduce the values.
- groupNamedGraphsByGraphIri(JavaRDD<? extends Dataset>, boolean, boolean, int) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfDatasetsOps
-
Group all graphs by their named graph IRIs.
- groupNamedModels(JavaPairRDD<K, Model>, boolean, boolean, int) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfNamedModelsOps
-
Group and/or sort named models by their graph IRI
- groupTriplesIntoModels(JavaPairRDD<K, Triple>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
H
- hadoopConfiguration - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
I
- identity() - Static method in interface net.sansa_stack.spark.rdd.function.JavaPairRddFunction
- identity() - Static method in interface net.sansa_stack.spark.rdd.function.JavaRddFunction
- inputFile - Variable in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- inputFormat - Variable in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- isAllowOverwriteFiles() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- isClosed() - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- isConsoleOutput() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
-
If neither partition folder nor target file is set, the output goes to the console
- isDeletePartitionFolderAfterMerge() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- isEmpty() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceCollection
- isEmpty() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- isMapQuadsToTriplesForTripleLangs() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- isPartitionsAsIndependentFiles() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- isUseCoalesceOne() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- isUseElephas() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
J
- JavaPairRddFunction<KI,VI,KO,VO> - Interface in net.sansa_stack.spark.rdd.function
-
Interface for building chains of transformations over JavaRDDs and JavaPairRDDs.
- JavaRddConsumer<T> - Interface in net.sansa_stack.spark.rdd.function
-
Consumer interface for JavaRDDs.
- JavaRddFunction<I,O> - Interface in net.sansa_stack.spark.rdd.function
-
Interface for building chains of transformations over JavaRDDs and JavaPairRDDs.
- JavaRddOfBindingsOps - Class in net.sansa_stack.spark.rdd.op.rdf
- JavaRddOfBindingsOps() - Constructor for class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfBindingsOps
- JavaRddOfDatasetsOps - Class in net.sansa_stack.spark.rdd.op.rdf
- JavaRddOfDatasetsOps() - Constructor for class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfDatasetsOps
- JavaRddOfNamedModelsOps - Class in net.sansa_stack.spark.rdd.op.rdf
-
Operations on the RDD[(String, Model)] type.
- JavaRddOfNamedModelsOps() - Constructor for class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfNamedModelsOps
- JavaRddOfQuadsOps - Class in net.sansa_stack.spark.rdd.op.rdf
- JavaRddOfQuadsOps() - Constructor for class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfQuadsOps
- JavaRddOfResourcesOps - Class in net.sansa_stack.spark.rdd.op.rdf
- JavaRddOfResourcesOps() - Constructor for class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfResourcesOps
- JavaRddOfTriplesOps - Class in net.sansa_stack.spark.rdd.op.rdf
- JavaRddOfTriplesOps() - Constructor for class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
- JavaRddOps - Class in net.sansa_stack.spark.rdd.op.rdf
- JavaRddOps() - Constructor for class net.sansa_stack.spark.rdd.op.rdf.JavaRddOps
- JavaRddRxOps - Class in net.sansa_stack.spark.rdd.op.rx
- JavaRddRxOps() - Constructor for class net.sansa_stack.spark.rdd.op.rx.JavaRddRxOps
- JavaSparkContextUtils - Class in net.sansa_stack.spark.util
- JavaSparkContextUtils() - Constructor for class net.sansa_stack.spark.util.JavaSparkContextUtils
- JenaKryoRegistrator - Class in net.sansa_stack.spark.io.rdf.kryo
- JenaKryoRegistrator() - Constructor for class net.sansa_stack.spark.io.rdf.kryo.JenaKryoRegistrator
K
- keyClass - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry.FormatEntry
- keyFunction - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
L
- lang - Variable in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
- LinkDatasetGraphSansa - Class in net.sansa_stack.spark.io.rdf.loader
-
A LinkDatasetGraph implementation that loads files via the sansa parser and sends the data (triples and quads) to a sink.
- LinkDatasetGraphSansa(Configuration, SerializableSupplier<StreamRDF>, Transactional) - Constructor for class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- load(String) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- load(Graph) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- load(Node, String) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- load(Node, Graph) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- load(SparkContext, String) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RddRdfLoader
- load(SparkContext, String) - Method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderImpl
- loadDataset(String) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- loadDataset(DatasetGraph) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- loadDefaults(RddRdfLoaderRegistry) - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderRegistryImpl
M
- mapPartitions(JavaPairRDD<K, V>, RxFunction<Tuple2<K, V>, O>) - Static method in class net.sansa_stack.spark.rdd.op.rx.JavaRddRxOps
- mapPartitions(JavaRDD<I>, RxFunction<I, O>) - Static method in class net.sansa_stack.spark.rdd.op.rx.JavaRddRxOps
-
Map operation based on a flowable transformer
- mapQuadsToTriplesForTripleLangs - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
-
Whether to convert quads to triples if a triple-based output format is requested
- mapToDatasets(JavaPairRDD<String, Model>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfNamedModelsOps
-
Map each (name, model) pair to a dataset with the same information
- mapToDatasets(JavaRDD<Resource>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfResourcesOps
-
Map every IRI resource to a dataset having a single named graph matching the IRI.
- mapToNamedModels(JavaRDD<Resource>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfResourcesOps
-
Map IRI resources to a named model.
- mapToResourceInDataset(JavaPairRDD<String, Model>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfNamedModelsOps
- mapToResources(JavaPairRDD<String, Model>) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfNamedModelsOps
- members - Variable in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- mergeFolder(Path, Path, String, Comparator<? super Path>) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- MIXED_QUAD - Static variable in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- MIXED_TRIPLE - Static variable in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- mutate(Consumer<RddRdfWriter<T>>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
-
Pass this object to a consumer.
N
- NAMED_GRAPH - net.sansa_stack.spark.io.rdf.input.api.RdfSourceRecordType
- net.sansa_stack.spark.io.csv.input - package net.sansa_stack.spark.io.csv.input
- net.sansa_stack.spark.io.rdf.input.api - package net.sansa_stack.spark.io.rdf.input.api
- net.sansa_stack.spark.io.rdf.input.impl - package net.sansa_stack.spark.io.rdf.input.impl
- net.sansa_stack.spark.io.rdf.kryo - package net.sansa_stack.spark.io.rdf.kryo
- net.sansa_stack.spark.io.rdf.loader - package net.sansa_stack.spark.io.rdf.loader
- net.sansa_stack.spark.io.rdf.output - package net.sansa_stack.spark.io.rdf.output
- net.sansa_stack.spark.rdd.function - package net.sansa_stack.spark.rdd.function
- net.sansa_stack.spark.rdd.op.rdf - package net.sansa_stack.spark.rdd.op.rdf
- net.sansa_stack.spark.rdd.op.rx - package net.sansa_stack.spark.rdd.op.rx
- net.sansa_stack.spark.util - package net.sansa_stack.spark.util
- newRdfSourceCollection() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceFactory
-
Return a collection to which RdfSources can be added and from which a union RDD can be obtained
- newRdfSourceCollection() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFactoryImpl
O
- ofQuads(JavaRDD<Quad>) - Static method in class net.sansa_stack.spark.io.rdf.input.api.RdfSources
- ofTriples(JavaRDD<Triple>) - Static method in class net.sansa_stack.spark.io.rdf.input.api.RdfSources
- optimizePrefixes - Variable in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- outputFormat - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- outputFormatClass - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry.FormatEntry
P
- parse(Path, Configuration, StreamRDF) - Static method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop
- parseRaw(Path, Configuration, InputFormat<?, T>, ExecutorService, StreamRDF, BiConsumer<T, StreamRDF>) - Static method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop
-
The sink must be started beforehand!
- partitionFolder - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- partitionMapperNQuads(Iterator<Quad>) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- partitionMapperNTriples(Iterator<Triple>) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
-
Save the RDD to a single file.
- partitionMapperRDFStream(Function<OutputStream, StreamRDF>, BiConsumer<? super T, StreamRDF>) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- partitionsAsIndependentFiles - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- path - Variable in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
- peekDeclaredPrefixes() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSource
-
At present this creates a model holding an RDF sample based on a file's starting bytes.
- peekDeclaredPrefixes() - Method in class net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromRdd
- peekDeclaredPrefixes() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- peekDeclaredPrefixes() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
- peekPrefixes(SparkContext, String) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RddRdfLoader
-
Peek prefixes w.r.t.
- peekPrefixes(SparkContext, String) - Method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderImpl
- postProcess(JavaRDD<Triple>, boolean, boolean, boolean, int) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
-
Sort quads by their string representation (relies on NodeFmtLib.str(org.apache.jena.graph.Triple))
- postProcess(JavaRDD<Quad>, boolean, boolean, boolean, int) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfQuadsOps
-
Post process RDF data - sort, distinct, repartition. Sort quads by their string representation (relies on NodeFmtLib.str(org.apache.jena.graph.Triple))
- postProcessingSettings - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- probeLang(Path, FileSystem) - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFactoryImpl
- put(String) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- put(Graph) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- put(Node, String) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- put(Node, Graph) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- putDataset(String) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- putDataset(DatasetGraph) - Method in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
Q
- QUAD - net.sansa_stack.spark.io.rdf.input.api.RdfSourceRecordType
R
- rdd - Variable in class net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromRdd
- rdd - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- RddRdfLoader<T> - Interface in net.sansa_stack.spark.io.rdf.input.api
-
An RddRdfLoader provides rdf-related methods to operate on paths w.r.t.
- RddRdfLoaderImpl<T> - Class in net.sansa_stack.spark.io.rdf.input.impl
- RddRdfLoaderImpl(Class<T>, Class<? extends FileInputFormat<LongWritable, T>>) - Constructor for class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderImpl
- RddRdfLoaderRegistry - Interface in net.sansa_stack.spark.io.rdf.input.api
- RddRdfLoaderRegistryImpl - Class in net.sansa_stack.spark.io.rdf.input.impl
-
A registry for RddRdfLoaders that can supply input of a specific Lang to an RDD of a requested type (Triples, Quads, Datasets, etc.).
- RddRdfLoaderRegistryImpl() - Constructor for class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderRegistryImpl
- RddRdfLoaders - Class in net.sansa_stack.spark.io.rdf.input.impl
- RddRdfLoaders() - Constructor for class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaders
- RddRdfOps<T> - Interface in net.sansa_stack.spark.io.rdf.output
-
Interface that captures common operations on RDD<T>
- RddRdfOpsImpl<T> - Class in net.sansa_stack.spark.io.rdf.output
- RddRdfOpsImpl(boolean, BiConsumer<T, StreamRDF>, JavaRddFunction<T, Triple>, JavaRddFunction<T, Quad>, JavaRddFunction<T, Node>, Function<? super T, Comparable<?>>) - Constructor for class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- RddRdfWriter<T> - Class in net.sansa_stack.spark.io.rdf.output
-
A fluent API for configuration of how to save an RDD of RDF data to disk.
- RddRdfWriter(RddRdfOpsImpl<T>) - Constructor for class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- RddRdfWriterFactory - Class in net.sansa_stack.spark.io.rdf.output
-
A factory for RddRdfWriter instances which enables validation of settings at an early stage.
- RddRdfWriterFactory() - Constructor for class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
- RddRdfWriterFormatRegistry - Class in net.sansa_stack.spark.io.rdf.output
-
Registry for mapping between jena's RDFFormat and hadoop's OutputFormat.
- RddRdfWriterFormatRegistry() - Constructor for class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry
- RddRdfWriterFormatRegistry.FormatEntry - Class in net.sansa_stack.spark.io.rdf.output
- RddRdfWriterSettings<SELF extends RddRdfWriterSettings> - Class in net.sansa_stack.spark.io.rdf.output
- RddRdfWriterSettings() - Constructor for class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- RdfPostProcessingSettings - Interface in net.sansa_stack.spark.io.rdf.output
- RdfPostProcessingSettingsBase - Class in net.sansa_stack.spark.io.rdf.output
- RdfPostProcessingSettingsBase() - Constructor for class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- RdfPostProcessingSettingsBase(Boolean, Integer, Boolean, Boolean, Integer, Boolean) - Constructor for class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- RdfPostProcessingSettingsMutable - Interface in net.sansa_stack.spark.io.rdf.output
- RdfSource - Interface in net.sansa_stack.spark.io.rdf.input.api
- RdfSourceCollection - Interface in net.sansa_stack.spark.io.rdf.input.api
- RdfSourceCollectionImpl - Class in net.sansa_stack.spark.io.rdf.input.impl
- RdfSourceCollectionImpl(SparkSession) - Constructor for class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- RdfSourceCollectionImpl(SparkSession, Collection<RdfSource>) - Constructor for class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- RdfSourceFactory - Interface in net.sansa_stack.spark.io.rdf.input.api
-
Turn source references into sources of RDF data.
- RdfSourceFactoryImpl - Class in net.sansa_stack.spark.io.rdf.input.impl
-
Implementation of a source factory based on spark/hadoop.
- RdfSourceFactoryImpl(SparkSession) - Constructor for class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFactoryImpl
- RdfSourceFromRdd<T> - Class in net.sansa_stack.spark.io.rdf.input.api
- RdfSourceFromRdd(JavaRDD<T>, RddRdfOps<T>, Model) - Constructor for class net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromRdd
- RdfSourceFromResource - Interface in net.sansa_stack.spark.io.rdf.input.api
-
Abstraction of a source of RDF.
- RdfSourceFromResourceImpl - Class in net.sansa_stack.spark.io.rdf.input.impl
- RdfSourceFromResourceImpl(SparkSession, Path, Lang) - Constructor for class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
- RdfSourceRecordType - Enum in net.sansa_stack.spark.io.rdf.input.api
-
The record type of an rdf source
- RdfSources - Class in net.sansa_stack.spark.io.rdf.input.api
- RdfSources() - Constructor for class net.sansa_stack.spark.io.rdf.input.api.RdfSources
- register(Lang, Class<T>, RddRdfLoader<T>) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RddRdfLoaderRegistry
-
Registration where the (underlying hadoop inputformat-based) loader loads items of type X and the result type is also X
- register(Lang, Class<T>, RddRdfLoader<T>) - Method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderRegistryImpl
- register(Lang, RddRdfWriterFormatRegistry.FormatEntry) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry
- registerClasses(Kryo) - Method in class net.sansa_stack.spark.io.rdf.kryo.JenaKryoRegistrator
- registerMapped(Lang, Class<T>, RddRdfLoader<X>) - Method in interface net.sansa_stack.spark.io.rdf.input.api.RddRdfLoaderRegistry
-
Registration where the (underlying hadoop inputformat-based) loader loads items of type X (such as triples) but the result is mapped to type Y (such as quads)
- registerMapped(Lang, Class<T>, RddRdfLoader<X>) - Method in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderRegistryImpl
- registry - Variable in class net.sansa_stack.spark.io.rdf.input.impl.RddRdfLoaderRegistryImpl
- registry - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry
- requireLoader(Lang, Class<T>) - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
- ROWNUM - Static variable in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfBindingsOps
-
The special ?ROWNUM variable supported by tarql
- run() - Method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- run() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- runActual() - Method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- runOutputToConsole() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- runSpark() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
-
Run the save action according to configuration
- runUnchecked() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
-
Same as run(), but without the checked IOException
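The relationship between run() and runUnchecked() noted above can be sketched in plain Java. This is only an illustration: SketchWriter is a hypothetical stand-in, not the RddRdfWriter class, and wrapping the IOException in UncheckedIOException is an assumption about how the checked exception is avoided.

```java
import java.io.IOException;
import java.io.UncheckedIOException;

// Hypothetical stand-in for a writer; this is NOT the SANSA RddRdfWriter.
class SketchWriter {
    private final boolean fail;

    SketchWriter(boolean fail) {
        this.fail = fail;
    }

    // Checked variant: callers must catch or declare IOException.
    void run() throws IOException {
        if (fail) {
            throw new IOException("simulated write failure");
        }
    }

    // Unchecked variant: performs the same action, but rethrows any
    // IOException as an unchecked exception (assumed behaviour).
    void runUnchecked() {
        try {
            run();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

An unchecked variant of this kind is convenient inside lambdas and stream pipelines, where a checked exception cannot be thrown without extra wrapping.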
S
- saveToFolder(JavaRDD<T>, String, RDFFormat, boolean, PrefixMapping, BiConsumer<T, StreamRDF>) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
-
Save the data in Trig/Turtle format or one of their sub-formats (n-quads/n-triples).
- saveUsingElephas(JavaRDD<T>, Path, Lang, SerializableFunction<? super T, ?>) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- self() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- self() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
- self() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- sendRecordToStreamRDF - Variable in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- sendRecordToStreamRDF - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- sendRecordToStreamRDF(T, StreamRDF) - Method in interface net.sansa_stack.spark.io.rdf.output.RddRdfOps
- sendRecordToStreamRDF(T, StreamRDF) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- sendToStreamRDF(JavaRDD<T>, SerializableBiConsumer<T, StreamRDF>, SerializableSupplier<StreamRDF>) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- setAllowOverwriteFiles(boolean) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setConf(Configuration) - Method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- setConsoleOutput() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setConsoleOutSupplier(Supplier<OutputStream>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setDeferOutputForUsedPrefixes(long) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setDeletePartitionFolderAfterMerge(boolean) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setDistinct(Boolean) - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- setDistinct(Boolean) - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsMutable
- setDistinctPartitions(Integer) - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- setDistinctPartitions(Integer) - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsMutable
- setGlobalPrefixMapping(Map<String, String>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setGlobalPrefixMapping(PrefixMapping) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
-
Set a prefix mapping to be used "globally" across all partitions.
- setInputFile(Path) - Method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- setMapQuadsToTriplesForTripleLangs(boolean) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
-
Whether to convert quads to triples if a triple-based output format is requested. Jena by default discards any quad outside of the default graph when writing to a triple format.
- setOptimizePrefixes(Boolean) - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- setOptimizePrefixes(Boolean) - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsMutable
- setOutputFormat(String) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
-
Raises an exception if the format is not found
- setOutputFormat(RDFFormat) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setPartitionFolder(String) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setPartitionFolder(Path) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setPartitionsAsIndependentFiles(boolean) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setPostProcessingSettings(RdfPostProcessingSettingsMutable) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setRdd(JavaRDD<? extends T>) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- setSink(StreamRDF) - Method in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- setSort(Boolean) - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- setSort(Boolean) - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsMutable
- setSortAscending(Boolean) - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- setSortAscending(Boolean) - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsMutable
- setSortPartitions(Integer) - Method in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- setSortPartitions(Integer) - Method in interface net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsMutable
- setTargetFile(String) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setTargetFile(Path) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setUseCoalesceOne(boolean) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- setUseElephas(boolean) - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- sink - Variable in class net.sansa_stack.spark.io.rdf.loader.AsyncRdfParserHadoop.Builder
- sinkFactory - Variable in class net.sansa_stack.spark.io.rdf.loader.LinkDatasetGraphSansa
- sort - Variable in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- sortAscending - Variable in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- sortPartitions - Variable in class net.sansa_stack.spark.io.rdf.output.RdfPostProcessingSettingsBase
- sparkContext - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- sparkSession - Variable in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- sparkSession - Variable in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFactoryImpl
- sparkSession - Variable in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceFromResourceImpl
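The self() methods and the SELF type parameter listed in this section suggest a self-typed fluent settings class. A minimal sketch of that general pattern follows. All class names here are hypothetical, only the two setter names are borrowed from the index, and the recursive bound on SELF is an assumption that is stricter than the bound shown for RddRdfWriterSettings.

```java
// Hypothetical base class illustrating the self-type pattern;
// these are NOT the SANSA classes.
abstract class SketchSettings<SELF extends SketchSettings<SELF>> {
    protected String targetFile;
    protected boolean allowOverwriteFiles;

    // Each concrete subclass returns itself, so inherited setters
    // keep the subclass type when calls are chained.
    protected abstract SELF self();

    public SELF setTargetFile(String targetFile) {
        this.targetFile = targetFile;
        return self();
    }

    public SELF setAllowOverwriteFiles(boolean allowOverwriteFiles) {
        this.allowOverwriteFiles = allowOverwriteFiles;
        return self();
    }
}

// Hypothetical subclass that adds its own method to the chain.
class SketchWriterConfig extends SketchSettings<SketchWriterConfig> {
    private boolean validated;

    @Override
    protected SketchWriterConfig self() {
        return this;
    }

    // Fails early if a required setting is missing.
    public SketchWriterConfig validate() {
        if (targetFile == null) {
            throw new IllegalStateException("no target file set");
        }
        validated = true;
        return this;
    }

    public String describe() {
        return targetFile + " overwrite=" + allowOverwriteFiles + " validated=" + validated;
    }
}
```

Because the inherited setters return SketchWriterConfig rather than the base type, a chain such as new SketchWriterConfig().setTargetFile("out.ttl").setAllowOverwriteFiles(true).validate() compiles without casts.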
T
- targetFile - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- tarqlDatasets(JavaRDD<Binding>, Query) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfBindingsOps
-
Each binding becomes its own dataset
- tarqlQuads(JavaRDD<Binding>, Query) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfBindingsOps
-
Use an RDD of bindings as initial bindings for a construct query in order to yield quads.
- tarqlTriples(JavaRDD<Binding>, Query) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfBindingsOps
-
Use an RDD of bindings as initial bindings for a construct query in order to yield triples.
- toGraphName(Node) - Static method in class net.sansa_stack.spark.rdd.op.rdf.JavaRddOfTriplesOps
-
Map a node losslessly to an IRI suitable for use as a graph name. This is needed to e.g.
- ToJavaPairRddFunction<I,K,V> - Interface in net.sansa_stack.spark.rdd.function
-
Interface for building chains of transformations over JavaRDDs and JavaPairRDDs.
- ToJavaRddFunction<K,V,O> - Interface in net.sansa_stack.spark.rdd.function
-
Interface for building chains of transformations over JavaRDDs and JavaPairRDDs.
- toPairRdd(ToJavaPairRddFunction<O, K, V>) - Method in interface net.sansa_stack.spark.rdd.function.JavaRddFunction
- toPairRdd(ToJavaPairRddFunction<O, KX, VX>) - Method in interface net.sansa_stack.spark.rdd.function.ToJavaRddFunction
- toRdd(ToJavaRddFunction<K, V, O>) - Method in interface net.sansa_stack.spark.rdd.function.ToJavaPairRddFunction
- toRdd(ToJavaRddFunction<KO, VO, X>) - Method in interface net.sansa_stack.spark.rdd.function.JavaPairRddFunction
- toString(PrefixMapping, RDFFormat) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
-
Convert a prefix mapping to a string
- TRIPLE - net.sansa_stack.spark.io.rdf.input.api.RdfSourceRecordType
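The "chains of transformations" mentioned for ToJavaPairRddFunction and ToJavaRddFunction can be illustrated with a small composition sketch. This is an assumption-laden stand-in: SketchListFunction is hypothetical, and a java.util.List plays the role of a JavaRDD so that the example runs without Spark.

```java
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical stand-in: List<T> plays the role of JavaRDD<T>.
@FunctionalInterface
interface SketchListFunction<I, O> {
    List<O> apply(List<I> input);

    // Compose this transformation with a following one, yielding a
    // single function from the original input to the final output.
    default <X> SketchListFunction<I, X> andThen(SketchListFunction<O, X> next) {
        return input -> next.apply(this.apply(input));
    }
}

class SketchChainDemo {
    // Builds a two-step chain and applies it to a small input.
    static List<String> demo() {
        SketchListFunction<Integer, Integer> doubled =
            xs -> xs.stream().map(x -> x * 2).collect(Collectors.toList());
        SketchListFunction<Integer, String> labelled =
            xs -> xs.stream().map(x -> "n" + x).collect(Collectors.toList());
        return doubled.andThen(labelled).apply(List.of(1, 2, 3));
    }
}
```

The chain is only a description of the steps until apply is called, which lets a pipeline be assembled and passed around before any data is available.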
U
- union(SparkSession, Collection<I>, Function<I, RDD<T>>) - Static method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- useCoalesceOne - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- useElephas - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterSettings
- usesQuads - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
- usesQuads() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSource
-
Whether this source is based on a quad model
- usesQuads() - Method in class net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromRdd
- usesQuads() - Method in interface net.sansa_stack.spark.io.rdf.input.api.RdfSourceFromResource
- usesQuads() - Method in class net.sansa_stack.spark.io.rdf.input.impl.RdfSourceCollectionImpl
- usesQuads() - Method in interface net.sansa_stack.spark.io.rdf.output.RddRdfOps
-
Whether the operations provided by this class are based on quads
- usesQuads() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfOpsImpl
V
- validate() - Method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFactory
- validate(RddRdfWriterSettings<?>) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- validateOutFolder(Path, Configuration, boolean) - Static method in class net.sansa_stack.spark.io.rdf.output.RddRdfWriter
- valueClass - Variable in class net.sansa_stack.spark.io.rdf.output.RddRdfWriterFormatRegistry.FormatEntry
- valueOf(String) - Static method in enum net.sansa_stack.spark.io.rdf.input.api.RdfSourceRecordType
-
Returns the enum constant of this type with the specified name.
- values() - Static method in enum net.sansa_stack.spark.io.rdf.input.api.RdfSourceRecordType
-
Returns an array containing the constants of this enum type, in the order they are declared.