integrated Dietary Supplements Knowledge Base (iDISK)

This readme.txt file was generated on 2019-11-21

Title of Dataset
Integrated Dietary Supplement Knowledge Base (iDISK)

Author Information
Rizvi, Rubina F
Vasilakes, Jake A
Adam, Terrence J
Melton, Genevieve B
Bishop, Jeffrey R
Bian, Jiang
Tao, Cui
Zhang, Rui (zhan1386@umn.edu)

University of Minnesota Institute for Health Informatics, Natural Language Processing / Information Extraction (NLP/IE) Program

Funding information:
This research was supported by the National Center for Complementary and Integrative Health award (#R01AT009457) (Zhang) and the Agency for Healthcare Research and Quality grant (#1R01HS022085) (Melton).

SHARING/ACCESS INFORMATION

License:

CC BY-SA 3.0 US (Creative Commons Attribution-ShareAlike 3.0 United States)

Recommended citation for the data:

    Rizvi, Rubina F; Vasilakes, Jake A; Adam, Terrence J; Melton, Genevieve B; Bishop, Jeffrey R; Bian, Jiang; Tao, Cui; Zhang, Rui. (2019). Integrated Dietary Supplement Knowledge Base (iDISK). Retrieved from the Data Repository for the University of Minnesota, https://doi.org/10.13020/d6bm3v. 

** Description **
The integrated Dietary Supplements Knowledge Base (iDISK) covers a variety of
dietary supplements, including vitamins, herbs, minerals, etc. It was standardized
and integrated from the Dietary Supplements Label Database (DSLD), the “About Herbs”
database from Memorial Sloan Kettering Cancer Center (MSKCC), the Canadian Natural Health
Products and Ingredients database (NHP), and the Natural Medicines Comprehensive Database
(NMCD) developed by the Therapeutic Research Center (TRC). iDISK contains a variety of
attributes and relationships describing information about each dietary supplement such as
which products it is an ingredient of and what drugs it might interact with.
iDISK is available as a Neo4j graph database as well as UMLS-style RRF files.
See the README for details on installing and using iDISK.
The code used to build iDISK is publicly available at https://github.com/jvasilakes/idisk.

NMCD is a commercial database and we have obtained special permission to use and redistribute the data publicly.

General Schema

iDISK contains 7 concept types and 6 relation types:

Concept Types

Following the UMLS, concepts are collections of synonymous atoms. An atom is a term
(e.g. a possible name for a supplement, such as “Ginkgo” or “Ginkgo Biloba”) from a
given data source.
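
To make the concept/atom distinction concrete, here is a minimal Python sketch of one way to model it. It is not the data model used by the iDISK build code. The class names, the `concept_id` value, and the pairing of each term with a source are all invented for illustration.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Atom:
    """One term as it appears in one data source."""
    term: str    # a possible name for a supplement
    source: str  # abbreviation of the originating database


@dataclass
class Concept:
    """A collection of synonymous atoms."""
    concept_id: str  # hypothetical identifier, not a real iDISK ID
    atoms: list = field(default_factory=list)

    def terms(self):
        """Return the distinct terms of this concept in first-seen order."""
        seen = []
        for atom in self.atoms:
            if atom.term not in seen:
                seen.append(atom.term)
        return seen


# The term-to-source pairings below are made up for the example.
ginkgo = Concept("EXAMPLE_0001")
ginkgo.atoms.append(Atom("Ginkgo", "NMCD"))
ginkgo.atoms.append(Atom("Ginkgo Biloba", "MSKCC"))
ginkgo.atoms.append(Atom("Ginkgo", "DSLD"))

print(ginkgo.terms())
```

The same term can appear as several atoms, one per source, while still belonging to a single concept.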

Relation types

Neo4j Release

The Neo4j release idisk-neo4j-<version>.dump can be loaded into an existing Neo4j graph using
the command

bin/neo4j-admin load --from=/path/to/idisk_neo4j-<version>.dump --database=<database>

This is most easily done from the “Terminal” tab in the Neo4j Desktop.

RRF Release

The RRF release idisk-rrf-<version>.zip borrows heavily from the format of the UMLS Metathesaurus. There are four files: MRSTY.RRF, MRCONSO.RRF, MRSAT.RRF, and MRREL.RRF.

The RRF format is a flat-file pipe-delimited format similar to CSV.
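
Because the files are pipe-delimited, they can be read with a standard CSV parser. The sketch below uses Python's built-in csv module. It assumes the iDISK files follow the UMLS convention of ending each line with a trailing pipe, which has not been checked against the release. The helper name `read_rrf` and the sample rows are made up and do not reflect the real column layout.

```python
import csv
import io


def read_rrf(handle):
    """Yield each row of a pipe-delimited RRF file as a list of strings."""
    # QUOTE_NONE treats quote characters as ordinary text.
    reader = csv.reader(handle, delimiter="|", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row:
            continue  # skip blank lines
        # Assumption: as in UMLS RRF files, each line ends with a trailing
        # pipe, which yields one empty final field. Drop it if present.
        # (If a file has no trailing pipe, an empty last column is lost.)
        if row[-1] == "":
            row = row[:-1]
        yield row


# Made-up sample data standing in for a real file.
sample = io.StringIO("ID_1|Ginkgo|SRC_A|\nID_1|Ginkgo Biloba|SRC_B|\n")
rows = list(read_rrf(sample))
for row in rows:
    print(row)
```

For a real file, pass a handle opened with `open(path, newline="", encoding="utf-8")`, where `path` points to one of the extracted .RRF files.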
A description of the fields of each file follows:

MRSTY.RRF

MRCONSO.RRF

MRSAT.RRF

MRREL.RRF