HCLSIG/CDS/Datasets and ontologies
- 1 Datasets and ontologies relevant for the CDS task force
- 1.1 Datasets
- 1.1.1 Drug datasets
- 1.1.2 Datasets containing associations between genetic variation, associated phenotypes and genetic tests
- 18.104.22.168 Pharmacogenomics Knowledgebase / PharmGKB
- 22.214.171.124 GWAS Central (formerly called HGVbaseG2P)
- 126.96.36.199 SNPedia
- 188.8.131.52 GET-Evidence (evidence.personalgenomes.org)
- 184.108.40.206 Online Mendelian Inheritance in Man (OMIM)
- 220.127.116.11 dbGaP
- 18.104.22.168 HuGE Navigator
- 22.214.171.124 Genetic Association Database (GAD)
- 126.96.36.199 Genotator
- 188.8.131.52 NCBI GeneTests
- 184.108.40.206 Genetics Home Reference
- 1.1.3 Genome databases with general data about genetic variation and human genomes
- 1.1.4 Collections of personal genetic data
- 1.2 Relevant ontologies and taxonomies
- 1.2.1 Suggested Ontology for Pharmacogenomics (SO-PHARM)
- 1.2.2 Pharmacogenomics Ontology (PO)
- 1.2.3 Pharmacogenomics Relationship Ontology (PHARE)
- 1.2.4 Sequence Ontology (SO)
- 1.2.5 Disease Ontology
- 1.2.6 Human Phenotype Ontology (HPO)
- 1.2.7 Mammalian Phenotype Ontology
- 1.2.8 Phenotypic Quality Ontology (PATO)
- 1.2.9 Logical Observation Identifiers Names and Codes (LOINC)
- 1.3 Formats and schemas
- 1.1 Datasets
Datasets and ontologies relevant for the CDS task force
Drugbank.ca provides drug (i.e., chemical, pharmacological and pharmaceutical) data with comprehensive drug target (i.e., sequence, structure, and pathway) information. It also includes information on drug-drug and drug-food interactions.
A linked version of the NLM's RxNorm database that connects prescription drugs, ingredients, and NDC through RXCUI a concept unique identifier. RxNorm is a product developed by NIH’s National Library of Medicine. It currently interlinks 12 different drug vocabularies around a unique concept identifier. Due to licensing only six of the drug vocabularies are made available as part of the LODD cloud. This includes: Medical Subject Headings,, Metathesaurus FDA National Drug Code Directory, Metathesaurus FDA Structured Product Labels, National Drug File, RxNorm Vocabulary, Veterans Health Administration National Drug File
NDF-RT is the terminology used by FDA and the FedMed collaboration to code these essential pharmacologic properties of medications: Mechanism of Action Physiologic Effect Structural Class
Known and predicted metabolic inhibition drug-drug interactions with links to and summaries of evidence. HTML rendering: http://dbmi-icode-01.dbmi.pitt.edu/dikb-evidence/front-page.html D2R and SPARQL endpoint: http://dbmi-icode-01.dbmi.pitt.edu:2020/.
Datasets containing associations between genetic variation, associated phenotypes and genetic tests
A large database of curated knowledge and raw data about associations between genes, genetic variants, drug response and disease.
GWAS Central (formerly called HGVbaseG2P)
A database of genome-wide association studies that also provides summaries of study results.
A wiki-based platform containing information on phenotypes associated with SNP variants, population prevalence of genetic variants and SNP microarrays.
A large database of automatically annotated and then manually curated information about the impact of genetic variations. Example: http://evidence.personalgenomes.org/MYL2-A13T
Information about diseases with Mendelian inheritance, including references to the implicated genes.
Results of studies that have investigated the interaction of genotype and phenotype.
Information on population prevalence of genetic variants, gene-disease associations, gene-gene and gene- environment interactions, and evaluation of genetic tests.
Diseases associated with genetic variants.
Aggregated gene-disease relationship data containing an integrated view over other datasets.
Genome databases with general data about genetic variation and human genomes
An internationally recognized reference database, providing stable genomic DNA sequences and identifies for regions of the human genome.
Large-scale genetic structural variation data (e.g., insertions, deletions).
Collections of personal genetic data
Genome sequences of over 1000 volunteers
Database of the Estonian Genome Center, University of Tartu
A collection of genetic data associated with health and lifestyle data of over 50,000 persons.
Whole-genome data donated by volunteers.
Relevant ontologies and taxonomies
Suggested Ontology for Pharmacogenomics (SO-PHARM)
A complex ontology covering the representation of genetic variation and pharmacgenomics.
Pharmacogenomics Ontology (PO)
Represents PharmGKB data; ontology for measures and outcomes.
Pharmacogenomics Relationship Ontology (PHARE)
Proposes concepts and roles to represent relationships of pharmacogenomics interest. Used for representing findings extracted from texts.
Contains terms often used for the annotation of sequences and features, including detailed description of different types of sequence variations.
An ontology of human diseases.
Mammalian Phenotype Ontology
Phenotypic Quality Ontology (PATO)
An ontology of types of phenotypic properties.
An established coding system for clinical lab results. Contains many identifiers for results of genetic tests.
Formats and schemas
A simple XML schema for the representation of SNPs . Maintained by the Object Management Group (OMG).
===== Genomic Sequence Variation Markup Language (GSVML), ISO 25720:2009 An XML schema geared towa . Maintained by the International Organization for Standardization (ISO).