Skip to content

Commit

Permalink
Continue work on paper.
Browse files Browse the repository at this point in the history
  • Loading branch information
willgryan committed Sep 29, 2023
1 parent e1e29e9 commit 8165544
Show file tree
Hide file tree
Showing 2 changed files with 12 additions and 2 deletions.
10 changes: 10 additions & 0 deletions joss/paper.bib
Original file line number Diff line number Diff line change
Expand Up @@ -438,4 +438,14 @@ @article{RN49
type = {Journal Article}
}

@software{william_ryan_2023_8156248,
author = {William Ryan},
title = {willgryan/PAVER: Pre-release to generate DOI},
month = jul,
year = 2023,
publisher = {Zenodo},
version = {pre-release},
doi = {10.5281/zenodo.8156248},
url = {https://doi.org/10.5281/zenodo.8156248}
}

4 changes: 2 additions & 2 deletions joss/paper.md
Original file line number Diff line number Diff line change
Expand Up @@ -46,11 +46,11 @@ Multiomics is used extensively in biological research today. However, the develo

# Overview

PAVER uses vector embeddings to help interpret pathway analyses. Embeddings encode the meaning of pathways into numerical representations which can then be clustered and visualized \autoref{fig:overview}. To identify which pathway is most representative of a cluster, PAVER first takes the average embedding of all pathways in a cluster, which captures it's overall meaning [@RN49]. It then finds which pathway is most similar to that average embedding and labels the cluster with that pathway. This allows PAVER to automatically curate long lists of pathways into groups and identify which pathway is most representative of each group.
PAVER uses vector embeddings to help interpret pathway analyses. Embeddings encode the meaning of pathways into numerical representations which can then be clustered and visualized \autoref{fig:overview}. To identify which pathway is most representative of a cluster, PAVER first takes the average embedding of all pathways in a cluster, which captures it's overall meaning [@RN49]. It then finds which pathway is most similar to the average embedding and labels the cluster with that pathway. This allows PAVER to automatically curate long lists of pathways into groups and identify which pathway is most representative of each group.

![PAVER uses numerical representations of pathways to find functionally related clusters.\label{fig:overview}](figures/overview.png)

PAVER was designed to be easy to use by researchers and students with minimal coding experience. It has already been using in a number of scientific publications to aid in the intepretation of pathway analyses [@RN78]. PAVER can use embeddings for any pathway database. We have pre-computed vector representations for Gene Ontology using the recent anc2vec model, available here: https://github.com/willgryan/PAVER_embeddings [@RN13].
PAVER was designed to be easy to use by researchers and students with minimal coding experience. PAVER has already been using in a number of scientific publications to aid in the intepretation of pathway analyses [@william_ryan_2023_8156248, @RN78]. We have pre-computed vector representations for Gene Ontology using the recent anc2vec model [@RN13], available here: https://github.com/willgryan/PAVER_embeddings. However, embeddings for any pathway database can be used with PAVER.

# Licensing and Availability

Expand Down

0 comments on commit 8165544

Please sign in to comment.