PyKEEN Benchmarking Results

This repository contains the results from the reproducibility and benchmarking studies described in

Bringing Light Into the Dark: A Large-scale Evaluation of Knowledge Graph Embedding Models Under a Unified Framework.
Ali, M., Berrendorf, M., Hoyt, C. T., Vermue, L., Galkin, M., Sharifzadeh, S., Fischer, A., Tresp, V., & Lehmann, J. (2020).
arXiv, 2006.13365.

This repository itself is archived on Zenodo at .

Reproducibility Study

In this study, we use the KGEMs reimplemented in PyKEEN and the authors' best reported hyper-parameters to make reproductions of past experiments. The experimental artifacts from the reproducibility study can be found here.

Benchmarking (Ablation) Study

In this study, we conduct a large number of hyper-parameter optimizations to investigate the effects of certain aspects of models (training assumption, loss function, regularizer, optimizer, negative sampling strategy, HPO methodology, training strategy). The experimental artifacts from the ablation study can be found here.

We provide an additional tool to search through these configurations at ablation/search.py, by finding configurations with optimal validation H@10 for a number of different queries. You can also run this script without full installation, as long as click and pandas are available. General usage information can be obtained by python3 ablation/search.py --help. Moreover, here are a few examples:

the overall best configuration

python3 ablation/search.py

the best configuration for the dataset FB15k-237

python3 ablation/search.py --dataset fb15k237

the best configuration for the dataset FB15k-237 using the distmult model

python3 ablation/search.py --dataset fb15k237 --model distmult

the top-3 configuration for the dataset WN18-RR using LCWA training loop

python3 ablation/search.py --dataset wn18rr --training-loop lcwa --at-most 3

Regeneration of Charts

All configuration for installation of relevant code, collation of results, and generation of charts is included in the tox.ini configuration that can be run with:

$ pip install tox
$ tox

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

PyKEEN Benchmarking Results

Reproducibility Study

Benchmarking (Ablation) Study

Regeneration of Charts

Files

README.md

Latest commit

History

README.md

File metadata and controls

PyKEEN Benchmarking Results

Reproducibility Study

Benchmarking (Ablation) Study

Regeneration of Charts