Any idea how to improve the performance when handling large ontologies? #44

leonqli · 2018-06-13T20:32:12Z

No description provided.

lambdamusic · 2018-12-06T23:15:36Z

Do you have any sample ontology in mind? I'm trying to wrap my head around this problem so it'd be useful to have some sample models for testing.

leonqli · 2018-12-10T16:42:58Z

You could try this ontology: http://bioportal.bioontology.org/ontologies/NCBITAXON

lambdamusic · 2019-01-03T12:03:00Z

The main problem is that ontospy attempts to build the entire ontology model in memory, and that takes time if there are many classes and properties to correlate.

I've tried using threads, but saw no real performance improvement, as the main tasks (extracting classes, properties, concepts, etc.) tend to depend on each other.

For very large ontologies it may be more appropriate to use a triplestore. Otherwise I'm kind of out of ideas here.
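
To illustrate what avoiding the full in-memory model could look like, here is a minimal sketch that streams a file line by line and keeps only a running tally. It is not how ontospy works; it assumes the ontology has been exported as N-Triples, it only handles triples whose three terms are all IRIs, and the function name `count_direct_subclasses` is made up for this example.

```python
import re
from collections import Counter

RDFS_SUBCLASS_OF = "http://www.w3.org/2000/01/rdf-schema#subClassOf"

# Matches only N-Triples lines whose subject, predicate and object are all IRIs.
# Literals and blank nodes are deliberately ignored in this sketch.
IRI_TRIPLE = re.compile(r"^\s*<([^>]*)>\s+<([^>]*)>\s+<([^>]*)>\s*\.\s*$")


def count_direct_subclasses(lines):
    """Count direct subclasses per parent class from an iterable of N-Triples lines.

    Only one line is held in memory at a time, plus the counter itself.
    """
    counts = Counter()
    for line in lines:
        match = IRI_TRIPLE.match(line)
        if match and match.group(2) == RDFS_SUBCLASS_OF:
            counts[match.group(3)] += 1
    return counts
```

Passing an open file object (for example `open("ontology.nt", encoding="utf-8")`, where the file name is hypothetical) streams the file instead of loading it whole. This only answers one narrow question, so it is no substitute for a real model or a triplestore.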

leonqli · 2019-01-03T12:30:46Z

You may want to take a look at https://pythonhosted.org/Owlready2/. It seems to have better performance on large ontologies.

lambdamusic · 2019-01-03T12:45:06Z

Thanks! Looks like they use an ad-hoc back end, maybe that's it. Will look more into it.
Update: the back end is an optimized SQLite index, e.g. view here
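
For anyone wondering what an SQLite-backed index might look like in principle, here is a rough sketch using only the standard library. It is not Owlready2's actual schema or API; the table layout, index choice, and function names (`open_store`, `add_triples`, `direct_subclasses`) are assumptions made up for illustration.

```python
import sqlite3

RDFS_SUBCLASS_OF = "http://www.w3.org/2000/01/rdf-schema#subClassOf"


def open_store(path=":memory:"):
    """Open (or create) a triple table; pass a file path to keep data on disk."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS triples ("
        "s TEXT NOT NULL, p TEXT NOT NULL, o TEXT NOT NULL)"
    )
    # Index ordered for "who points at this object via this predicate" lookups.
    conn.execute("CREATE INDEX IF NOT EXISTS idx_pos ON triples (p, o, s)")
    # Index ordered for "what does this subject say" lookups.
    conn.execute("CREATE INDEX IF NOT EXISTS idx_sp ON triples (s, p)")
    return conn


def add_triples(conn, triples):
    """Insert an iterable of (subject, predicate, object) tuples in one transaction."""
    with conn:
        conn.executemany("INSERT INTO triples (s, p, o) VALUES (?, ?, ?)", triples)


def direct_subclasses(conn, class_iri):
    """Return the direct subclasses of class_iri, sorted by IRI."""
    rows = conn.execute(
        "SELECT s FROM triples WHERE p = ? AND o = ? ORDER BY s",
        (RDFS_SUBCLASS_OF, class_iri),
    )
    return [row[0] for row in rows]
```

The point of the design is that lookups go through the database index, so only the rows a query touches need to be read, rather than the whole ontology being materialised as Python objects up front.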

leonqli · 2019-01-10T17:01:06Z

Yes, they use SQLite as the back end. Do you think that would help improve the performance of ontospy?

jclerman · 2022-05-17T17:43:14Z

It's also not too difficult to load an ontology into Apache Jena Fuseki. The main issue is the non-Python dependency (Fuseki), but once the store is running it's easy to use rdflib to mediate querying.
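
As a rough sketch of what querying a running store involves, the snippet below sends a query over the standard SPARQL 1.1 Protocol. To keep it dependency-free it uses only the Python standard library rather than rdflib. The endpoint URL is an assumption (a local Fuseki on port 3030 with a dataset named `ontology`), and the helper names `build_request` and `select` are made up for this example.

```python
import json
import urllib.parse
import urllib.request

# Assumption: a local Fuseki server exposing a dataset named "ontology".
# Adjust host, port and dataset name to match your own setup.
ENDPOINT = "http://localhost:3030/ontology/sparql"

COUNT_CLASSES = """
PREFIX owl: <http://www.w3.org/2002/07/owl#>
SELECT (COUNT(?c) AS ?n) WHERE { ?c a owl:Class }
"""


def build_request(endpoint, query):
    """Build a SPARQL 1.1 Protocol request (query sent as a URL-encoded POST)."""
    body = urllib.parse.urlencode({"query": query}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=body,
        headers={
            "Accept": "application/sparql-results+json",
            "Content-Type": "application/x-www-form-urlencoded",
        },
        method="POST",
    )


def select(endpoint, query, timeout=30):
    """Run a SELECT query and return the JSON result bindings as a list of dicts."""
    request = build_request(endpoint, query)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        payload = json.load(response)
    return payload["results"]["bindings"]
```

With the server up, `select(ENDPOINT, COUNT_CLASSES)` would return the bindings for `?n`. The heavy lifting (storage, indexing, query planning) then happens in the store, and Python only handles the results.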

lambdamusic added the question label Dec 7, 2018

lambdamusic mentioned this issue Dec 7, 2018

get_rdf(path) retrieve an empty model #35

Open

raghav-kukreti mentioned this issue May 29, 2020

Ontology builder parallelization #92

Closed
