Home page

http://pdd.wangmengsd.com/

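Medical diagnosis and disease dataset

This project is the display front end for the medical dataset. It uses spring-boot for URL redirection, and the front end uses thymeleaf to receive the data passed through the MVC model.

About the dataset

The dataset is an RDF dataset about medical care. It contains diagnoses, medications, and more. The online display shows only part of the data; download the data in nt format to get the complete dataset.

With this dataset you can run SPARQL queries, perform entity and relation recognition, do medical data mining, and so on.

So what matters is the dataset, not this project!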
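As a rough illustration of working with the nt download, the sketch below parses a few simple N-Triples lines using only the Python standard library and filters them by predicate. The IRIs and predicate names (`http://example.org/...`, `diagnosed_with`, `takes`) are made up for the example and are not the actual PDD vocabulary; for real use, a full RDF library or the SPARQL endpoint is likely a better fit.

```python
import re

# Matches one simple N-Triples statement: <subject> <predicate> object .
# The object is either an IRI in angle brackets or a quoted literal
# (optionally followed by a datatype or language tag). Blank nodes and
# escaped quotes inside literals are deliberately not handled here.
TRIPLE_RE = re.compile(
    r'^\s*<([^>]*)>\s+<([^>]*)>\s+'
    r'(<[^>]*>|"[^"]*"(?:\^\^<[^>]*>|@[A-Za-z0-9-]+)?)\s*\.\s*$'
)


def parse_nt(lines):
    """Yield (subject, predicate, object) tuples from N-Triples lines.

    Subject and predicate are returned without angle brackets; the object
    keeps its brackets or quotes so IRIs and literals stay distinguishable.
    """
    for line in lines:
        if not line.strip() or line.lstrip().startswith("#"):
            continue  # skip blank lines and comments
        match = TRIPLE_RE.match(line)
        if match:
            yield match.groups()


# Hypothetical sample data; the real PDD IRIs and predicates may differ.
SAMPLE = """\
<http://example.org/patient/1> <http://example.org/diagnosed_with> <http://example.org/disease/4019> .
<http://example.org/patient/1> <http://example.org/takes> <http://example.org/drug/A> .
<http://example.org/disease/4019> <http://www.w3.org/2000/01/rdf-schema#label> "Hypertension" .
""".splitlines()

triples = list(parse_nt(SAMPLE))

# Keep only the (patient, disease) pairs linked by the assumed predicate.
diagnoses = [(s, o) for s, p, o in triples if p.endswith("diagnosed_with")]
print(diagnoses)
```

To try this on a downloaded file, pass an open file handle to `parse_nt` instead of `SAMPLE`; lines that do not match the simple pattern are silently skipped.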

What is PDD Graph (Patient-Disease-Drug Graph)

Electronic medical records (EMRs) contain multi-format electronic medical data that hold an abundance of medical knowledge. Faced with patients' symptoms, experienced caregivers make the right medical decisions based on professional knowledge that accurately captures the relationships between symptoms, diagnoses, and treatments. We aim to capture these relationships by constructing a large, high-quality heterogeneous graph linking patients, diseases, and drugs (PDD) in EMRs.

Specifically, we extract important medical entities from MIMIC-III (Medical Information Mart for Intensive Care III) and automatically link them with existing biomedical knowledge graphs, including the ICD-9 ontology and DrugBank. The PDD graph is accessible on the Web via the SPARQL endpoint and provides a pathway for medical discovery and applications, such as effective treatment recommendations.

A subgraph of PDD is illustrated in the following figure to help readers better understand the PDD graph.

Dataset download

See the Home page to learn how to download the dataset.

The server has limited bandwidth, so downloading from Datahub is recommended.

Update

  • V1.3

    We have added DDI (drug-drug interaction) triples in the latest version, 1.3. These DDI triples are extracted from DrugBank and can be used to conveniently retrieve potentially adverse drug combinations taken by the corresponding patients.

    For a specific example, please refer to Tutorial SPARQL Query Example5.

  • V1.2

    Fixed the bugs in "diagnose_icd_information.nt".

    In the new version, we have eliminated an engineering bug introduced during label matching of ICD-9 codes. This bug caused linking failures for 380 diseases in MIMIC-III.

    In the latest PDD version, the overall number of diseases is 6,985, of which 6,983 are connected to the ICD-9 ontology. The only two codes that fail to match are '71970' and 'NULL', which are not included in the ICD-9 ontology.

  • V1.1

    Added patient BMI data.
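The idea behind the DDI triples added in V1.3 can be sketched in a few lines of Python: given "patient takes drug" triples and "drug interacts with drug" triples, list the interacting pairs each patient is taking. This is only a toy illustration, not the query from the tutorial; the predicate names `takes` and `ddi` and all identifiers below are hypothetical placeholders rather than the real PDD vocabulary.

```python
from itertools import combinations

# Hypothetical predicate names and identifiers, for illustration only.
TAKES = "takes"
DDI = "ddi"

toy_triples = [
    ("patient/1", TAKES, "drug/A"),
    ("patient/1", TAKES, "drug/B"),
    ("patient/1", TAKES, "drug/C"),
    ("patient/2", TAKES, "drug/C"),
    ("drug/B", DDI, "drug/A"),
]


def interacting_pairs(triples):
    """Map each patient to the interacting drug pairs that patient takes."""
    drugs_by_patient = {}
    interactions = set()
    for s, p, o in triples:
        if p == TAKES:
            drugs_by_patient.setdefault(s, set()).add(o)
        elif p == DDI:
            # Treat an interaction as symmetric, so direction is ignored.
            interactions.add(frozenset((s, o)))

    result = {}
    for patient, drugs in drugs_by_patient.items():
        pairs = [
            pair
            for pair in combinations(sorted(drugs), 2)
            if frozenset(pair) in interactions
        ]
        if pairs:
            result[patient] = pairs
    return result


print(interacting_pairs(toy_triples))
```

Storing each interaction as a `frozenset` means a triple recorded in only one direction still matches whichever order the patient's drugs are paired in.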

Technologies used in this project

This refers to the display project, not the dataset.

  • spring-boot
  • thymeleaf
  • jquery

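Demo

When you want to look up an entity, just click it to see what it contains. For example:

The Patient-Disease-Drug dataset we published can be queried online, and the data looks like this:

Online query view of the dataset

When we click one of the records, we can see what the system currently displays: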

http://kmap.xjtudlc.com/pdd_data/resource/145834

What the system displays

License

This work is licensed under a Creative Commons Attribution 4.0 International License.