Incorrect VST normalization #244

Open
bolak92 opened this issue Sep 1, 2023 · 0 comments


Describe the bug

While attempting to run VST (Variance Stabilizing Transformation) normalization using AlphaPept, I encountered several issues that suggest the normalization process might not be functioning as intended.

Issue 1: Axis of Normalization
Upon debugging, it appears that the normalization is being performed across proteins (columns in ds.mat) rather than across samples (rows). Below is a screenshot of a table that supports my hypothesis:
[image: screenshot of the table]

To Reproduce:
I used a standard ProteinGroups.txt file and preprocessed it using the following code:

ds.preprocess(
    remove_contaminations=True,
    normalization="vst"
)
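
The axis check behind Issue 1 can be sketched as follows. This is a minimal, self-contained illustration, not AlphaPept's actual code: the `axis_summary` helper and the toy matrix are hypothetical, the matrix is assumed to be laid out like ds.mat (samples as rows, proteins as columns), and a plain column-wise z-score stands in for whatever the "vst" option does internally. If each protein column is standardized on its own, per-protein means sit near 0 and standard deviations near 1, while per-sample statistics do not.

```python
import numpy as np
import pandas as pd


def axis_summary(mat: pd.DataFrame) -> dict:
    """Report how far each axis is from zero mean and unit standard deviation."""
    summary = {}
    # axis=0 aggregates over rows (one value per protein column),
    # axis=1 aggregates over columns (one value per sample row).
    for label, axis in (("per_protein", 0), ("per_sample", 1)):
        means = mat.mean(axis=axis)
        stds = mat.std(axis=axis, ddof=0)
        summary[label] = {
            "max_abs_mean": float(means.abs().max()),
            "max_std_dev_from_1": float((stds - 1).abs().max()),
        }
    return summary


rng = np.random.default_rng(0)
# Hypothetical toy data: 6 samples (rows) x 50 proteins (columns).
raw = pd.DataFrame(
    rng.lognormal(mean=10, sigma=1, size=(6, 50)),
    index=[f"sample_{i}" for i in range(6)],
    columns=[f"protein_{j}" for j in range(50)],
)

# Stand-in normalization (assumption): z-score every protein column separately.
col_normalized = (raw - raw.mean(axis=0)) / raw.std(axis=0, ddof=0)

summary = axis_summary(col_normalized)
print(summary)
```

Running the same helper on the real ds.mat after preprocessing would show which axis ends up standardized, provided ds.mat is a pandas DataFrame.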

Issue 2: Inconsistent PCA Graphs
The PCA graphs generated post-normalization are inconsistent, both in terms of axis scales and explained variance. Here's a screenshot for reference:
[image: dim_red_PCA_HealthStatus_group_circle]
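
To make the comparison of explained variance concrete, here is a small sketch that computes PCA explained-variance ratios with plain NumPy before and after a column-wise z-score. The `explained_variance_ratio` helper and the toy data are hypothetical and unrelated to AlphaPept's plotting code. It only illustrates that rescaling each protein separately can change how variance is distributed across components, which is one possible reason for PCA plots that differ in scale and explained variance.

```python
import numpy as np


def explained_variance_ratio(x: np.ndarray) -> np.ndarray:
    """Explained-variance ratio per principal component (samples as rows)."""
    centered = x - x.mean(axis=0)
    # Squared singular values of the centered matrix are proportional
    # to the variance captured by each principal component.
    singular_values = np.linalg.svd(centered, compute_uv=False)
    variances = singular_values**2
    return variances / variances.sum()


rng = np.random.default_rng(1)
# Hypothetical data: 8 samples x 20 proteins, one protein on a much larger scale.
data = rng.normal(size=(8, 20))
data[:, 0] *= 100.0

ratio_raw = explained_variance_ratio(data)

# Column-wise z-score (assumption: stand-in for a per-protein normalization).
scaled = (data - data.mean(axis=0)) / data.std(axis=0)
ratio_scaled = explained_variance_ratio(scaled)

print("PC1 share, raw:   ", round(float(ratio_raw[0]), 3))
print("PC1 share, scaled:", round(float(ratio_scaled[0]), 3))
```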

Issue 3: VST vs VSN Normalization
Is the VST normalization in AlphaPept intended to perform similarly to the VSN normalization method available in R? For reference, here is the VSN documentation.

Additional Information
Operating System: Windows 10
Python Environment: Conda

I would appreciate any guidance or fixes for these issues. Thank you!
