-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GlycineMine: “experiment” maps are not really useable (?) #205
Comments
per @maxglycine:
|
@maxglycine with respect to the QTL info in the datastore, I think much of what is there for soybean may have resulted from some processing Sam did against info dumped from the mysql version of soybase. He probably assumed that if it was worth being in the database, it was worth going into the datastore. It's certainly OK with me if we decide to drop the experiment maps. However, looking at the data in one example: /usr/local/www/data/v2/Glycine/max/qtl/Young_x_PI416937.qtl.Lee_Bailey_1996a/glyma.Young_x_PI416937.qtl.Lee_Bailey_1996a.qtlmrk.tsv.gz I don't think the decision on using only Composite2003 positions needs to change our DS naming conventions, though I'm also not wedded to the conventions we have for QTLs. Not sure how much of the current code depends on our naming conventions here, but I doubt there's much code outside the intermine loaders that's trying to use it (maybe also cmap-js?) Maybe this could be an agenda item for Thursday's meeting. |
@maxglycine I've now dug into Sam's code a bit and I do think there's a bit of a bug in that he's only reporting a single qtl-linkage group relationship when there may be multiple. For example in the db we have:
but in the glyma.Young_x_PI416937.gen.Lee_Bailey_1996a.qtlmrk.tsv file we have only: Now, it looks like it is pretty easy to simply restrict the target map to GmComposite2003 and report those as being the associations which avoids this complication. But, I did notice that some of the QTL end up getting "left behind" using this approach, e.g. Seed protein 4-12 which has only an experimental map position:
Maybe this is OK, but I thought I should check and also verify that you weren't overlooking GmComposite1999; including it would be possible but would require that I actually fix the bug with multiple linkage group assignments rather than just avoid it by having only the one target map set. Let me know your thoughts when you get a chance. |
Andrew @adf-ncgr the SoyBase Classic has only been displaying the Composite2003 genetic map positions and suppressing any mention of the "other" maps, so doing that in the mines are OK with me. Yes, it means that some QTL will not have a position on the Composite2003 map. We have two ways to go. One way is to ignore anything W/O a Composite2003 position the other is to return the unique list of all QTL and any Composite2003 position related to them. Understanding that not all QTL will have a position. I would personally come down on option 2, the unique list of QTL with any Composite 2003 position for each QTL. Since I don't know the data model and path queries it would be up to you to tell me what is possible. @StevenCannon-USDA @jd-campbell |
per @maxglycine :
per @adf-ncgr :
The text was updated successfully, but these errors were encountered: