Apache Beam Pipelines for Apache Rya

Pipelines:

bulkload - loads RDF in the triplestore,
statistics - reads triples from the SPO index, generates statistics (aka Prospects Table) about the triples and writes them to a separate index.
elasticsearch - reads triples from the SPO index, generates the full text index and writes it in Elasticsearch.

At the moment, only the DataFabric's fork of Apache Rya is supported.

Supported runners

Current implementations were tested with Google Dataflow only.