Skip to content

Latest commit

 

History

History
13 lines (8 loc) · 706 Bytes

README.md

File metadata and controls

13 lines (8 loc) · 706 Bytes

Apache Beam Pipelines for Apache Rya

Pipelines:

  • bulkload - loads RDF in the triplestore,
  • statistics - reads triples from the SPO index, generates statistics (aka Prospects Table) about the triples and writes them to a separate index.
  • elasticsearch - reads triples from the SPO index, generates the full text index and writes it in Elasticsearch.

At the moment, only the DataFabric's fork of Apache Rya is supported.

Supported runners

Current implementations were tested with Google Dataflow only.