-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Understanding Prediction Results #11
Comments
Hi, If you only want to predict RT values for a handful of testing data (without training), I would suggest using:
where max_rt is the maximum RT value among your 78 items (see here for details). In "config.py" the only change would be:
Best, |
Hi there, thank you for the response! So in the config.py file should I specify the max_rt of my data in minutes and then the scale of 60 converts that to seconds? Like so: conv1_kernel = 12 And then when I run the script below you told me to run I put the max_rt in seconds?
I will also note that the data I am trying to predict is from an RPLC system. Am I using the right training files? Thanks, |
Hi, sorry for the delayed response. Yes, the model provided here is also from an RPLC system [1], however, this model usually cannot be directly applied to another dataset from RPLC, because the gradients are usually different. So directly running "prediction_emb_cpu.py" would only give estimated RT of peptides other than their precise retention times under your chromatographic condition. To obtain a more accurate prediction, a bunch of calibration peptides is typically needed (i.e. transfer learning). [1] A Repository of Assays to Quantify 10,000 Human Proteins by SWATH-MS. Sci. Data 2014, 1, 140031, DOI: 10.1038/sdata.2014.31 |
hello,
I am trying to make sense of some results I got from the model, but it seems like they are on different scales. Here is a sample output below. This result is
observed | predicted
0.06345 | 0.72494
0.07636 | 0.37529
0.082 | 0.66338
0.08482 | 0.5969
0.08264 | 0.46543
0.07091 | 0.43927
0.067 | 0.26192
0.07445 | 0.25262
0.06682 | 0.26192
0.05955 | 0.40488
`
I have a dataset of 78 peptides I would like to test on, so I put that dataset in the test_path folder. However, I left all the other parameters the same. The RT for the test data is in seconds.
train_path = 'data/mod_train_2.txt'
test_path = 'data/DeepRTtest.txt'
result_path = 'work/mod_pred_test.txt'
log_path = 'work/mod_test.log'
save_prefix = 'work/mod/2/3'
pretrain_path = ''
dict_path = ''
conv1_kernel = 12
conv2_kernel = 12
min_rt = 0
max_rt = 110
time_scale = 60
max_length = 50
Thank you for the help!
The text was updated successfully, but these errors were encountered: