Core, dispensable, and private genome #200

beantkapoor786 · 2024-08-01T16:26:19Z

beantkapoor786
Aug 1, 2024

Once I have gone through the PHG build and load module, how can I calculate the core, dispensable, and private genome percentage?

btmonier · 2024-08-01T20:11:36Z

btmonier
Aug 1, 2024
Maintainer

If you are talking about unique and shared haplotypes across the genome, one option would be to use rPHG2 and generate a haplotype ID matrix to summarize unique IDs across each column (reference ranges):

library(rPHG2)

phgLib <- "path/to/your/phgv2/lib/dir"
initPhg(phgLib)

hvcfFiles <- hvcfDir |> list.files(pattern = ".h.vcf")

graph <- hvcfFiles |> 
    PhgLocalCon() |> 
    buildHaplotypeGraph()

hapIds <- graph |> readHapIds()

hapProfile <- data.frame(
    ref_range  = hapIds |> colnames(),
    n_uniq_ids = hapIds |> apply(2, \(it) it |> unique() |> length())
)

1 reply

beantkapoor786 Aug 1, 2024
Author

Thank you, will try this.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Core, dispensable, and private genome #200

{{title}}

Replies: 1 comment 1 reply

{{title}}

{{title}}

Select a reply

Core, dispensable, and private genome #200

beantkapoor786 Aug 1, 2024

Replies: 1 comment · 1 reply

btmonier Aug 1, 2024 Maintainer

beantkapoor786 Aug 1, 2024 Author

beantkapoor786
Aug 1, 2024

Replies: 1 comment 1 reply

btmonier
Aug 1, 2024
Maintainer

beantkapoor786 Aug 1, 2024
Author