Add NER method to suggest ENVO triad from description #3

cmungall · 2021-07-12T23:35:05Z

https://github.com/cmungall/sample-annotator/tree/main/sample_annotator/text_mining

To start with, parse sample['description'], to populate sample['env_{broad_scale,local_scale,medium}'] if they are not already populated

I think this should be done by calling runner, but will need a pypi release monarch-initiative/ontorunner#9

or is it easier to just wrap oger directly for now

also for now we could just check in the nodes.tsv directly. See how we include mixs.json within the package

for now, be conservative and only use labels or exact synonyms

The text was updated successfully, but these errors were encountered:

cmungall · 2021-07-12T23:44:22Z

As a first pass, just hardcode ENVO for all 3 fields regardless of package

Then for next pass, we will have a curated configuration file like this:

-
field: env_broad_scale
packages:
  - soil
termsets:
  - ontology: envo
     branches:
       - ENVO:01000254 ## environment system
     exclude_descendants_of:
       - ENVO:01001788 ##  marine ecosystem
-
field: env_local_scale
package: host-associated
termsets:
  - ontology: UBERON
...

that will customize which ontologies are used where

hrshdhgd · 2021-07-14T16:20:57Z

Just an FYI, OGER does not have a PyPI release either.

hrshdhgd · 2021-07-15T16:57:26Z

@cmungall , how do you envision the input file coming in for NER to look like: A tsv file within the project (locally i.e. ./text_mining/data/input) or remotely located (url) ?

I'm guessing the input tsv (or db) will be generated by @turbomam through his parsing work from the large XML?

cmungall · 2021-07-22T15:59:24Z

I answered @hrshdhgd's questions on our 1-on1. It's clear now that he doesn't have to worry about formats, the goal is to implement functionality within the python framework all you care about is datamodel

cmungall assigned hrshdhgd and realmarcin Jul 12, 2021

hrshdhgd added a commit that referenced this issue Jul 15, 2021

dynamically create settings file #3

fc76d1f

hrshdhgd added a commit that referenced this issue Jul 15, 2021

minor edit #3

cc1a48c

hrshdhgd mentioned this issue Jul 28, 2021

NER #9

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add NER method to suggest ENVO triad from description #3

Add NER method to suggest ENVO triad from description #3

cmungall commented Jul 12, 2021 •

edited

Loading

cmungall commented Jul 12, 2021

hrshdhgd commented Jul 14, 2021

hrshdhgd commented Jul 15, 2021 •

edited

Loading

cmungall commented Jul 22, 2021

Add NER method to suggest ENVO triad from description #3

Add NER method to suggest ENVO triad from description #3

Comments

cmungall commented Jul 12, 2021 • edited Loading

cmungall commented Jul 12, 2021

hrshdhgd commented Jul 14, 2021

hrshdhgd commented Jul 15, 2021 • edited Loading

cmungall commented Jul 22, 2021

cmungall commented Jul 12, 2021 •

edited

Loading

hrshdhgd commented Jul 15, 2021 •

edited

Loading