Pipelines:
bulkload
- loads RDF into the triplestore.
statistics
- reads triples from the SPO index, generates statistics (aka Prospects Table) about the triples, and writes them to a separate index.
elasticsearch
- reads triples from the SPO index, generates the full-text index, and writes it to Elasticsearch.
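To illustrate the kind of statistics the second pipeline produces, here is a minimal sketch in plain Python. It is not the pipeline's actual code: the function name `prospect_counts`, the key layout, and the sample triples are all hypothetical, and the real Prospects Table may record more than these three counts.

```python
from collections import Counter
from typing import Iterable, Tuple

# A triple is (subject, predicate, object), all as plain strings here.
Triple = Tuple[str, str, str]


def prospect_counts(triples: Iterable[Triple]) -> Counter:
    """Count how often each value occurs as subject, predicate, or object.

    Hypothetical helper for illustration only; keys are (role, value) pairs.
    """
    counts: Counter = Counter()
    for s, p, o in triples:
        counts[("subject", s)] += 1
        counts[("predicate", p)] += 1
        counts[("object", o)] += 1
    return counts


# Made-up sample data standing in for a scan of the SPO index.
triples = [
    ("ex:alice", "ex:knows", "ex:bob"),
    ("ex:alice", "ex:knows", "ex:carol"),
    ("ex:bob", "ex:name", '"Bob"'),
]
counts = prospect_counts(triples)
```

Counts of this kind are typically used by a query planner to estimate how selective a triple pattern is. In a distributed runner, the same per-key counting would be expressed as a grouped sum rather than an in-memory `Counter`.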
At the moment, only DataFabric's fork of Apache Rya is supported.
The current implementations have been tested only with Google Dataflow.