Skip to content

Class: Dataset

A reference to a publicly available omics or phenotype dataset

URI: dismech:Dataset

 classDiagram
    class Dataset
    click Dataset href "../Dataset/"
      Dataset : accession

      Dataset : conditions

      Dataset : data_type





        Dataset --> "0..1" DatasetTypeEnum : data_type
        click DatasetTypeEnum href "../DatasetTypeEnum/"



      Dataset : description

      Dataset : evidence





        Dataset --> "*" EvidenceItem : evidence
        click EvidenceItem href "../EvidenceItem/"



      Dataset : exposures





        Dataset --> "*" ExposureDescriptor : exposures
        click ExposureDescriptor href "../ExposureDescriptor/"



      Dataset : findings





        Dataset --> "*" Finding : findings
        click Finding href "../Finding/"



      Dataset : genes





        Dataset --> "*" GeneDescriptor : genes
        click GeneDescriptor href "../GeneDescriptor/"



      Dataset : notes

      Dataset : organism





        Dataset --> "0..1" OrganismDescriptor : organism
        click OrganismDescriptor href "../OrganismDescriptor/"



      Dataset : platform

      Dataset : publication

      Dataset : sample_count

      Dataset : sample_types





        Dataset --> "*" SampleTypeDescriptor : sample_types
        click SampleTypeDescriptor href "../SampleTypeDescriptor/"



      Dataset : title

Slots

Name Cardinality and Range Description Inheritance
accession 1
Uriorcurie
Dataset accession identifier as a CURIE (e direct
title 0..1
String
Title of the publication direct
description 0..1 recommended
String
A description of the dataset direct
organism 0..1
OrganismDescriptor
The organism from which samples were derived direct
data_type 0..1
DatasetTypeEnum
The type of omics or other data in the dataset direct
sample_types *
SampleTypeDescriptor
Types of biological samples in the dataset direct
sample_count 0..1
Integer
Total number of samples in the dataset direct
conditions *
String
Experimental conditions or disease states represented direct
exposures *
ExposureDescriptor
Environmental exposures studied in the dataset direct
genes *
GeneDescriptor
direct
platform 0..1
String
Sequencing or array platform used direct
publication 0..1
PMID
Associated publication (PMID) direct
findings *
Finding
Key findings or claims extracted from this source (publication or dataset) direct
evidence * recommended
EvidenceItem
direct
notes 0..1
String
direct

Usages

used by used in type used
Disease datasets range Dataset

Comments

  • Supports GEO, ArrayExpress, SRA, dbGaP, GTEx, ENCODE, phenopacket-store, and other repositories

Identifier and Mapping Information

Schema Source

  • from schema: https://w3id.org/monarch-initiative/dismech

Mappings

Mapping Type Mapped Value
self dismech:Dataset
native dismech:Dataset

LinkML Source

Direct

name: Dataset
description: A reference to a publicly available omics or phenotype dataset
comments:
- Supports GEO, ArrayExpress, SRA, dbGaP, GTEx, ENCODE, phenopacket-store, and other
  repositories
from_schema: https://w3id.org/monarch-initiative/dismech
slots:
- accession
- title
- description
- organism
- data_type
- sample_types
- sample_count
- conditions
- exposures
- genes
- platform
- publication
- findings
- evidence
- notes
slot_usage:
  description:
    name: description
    description: A description of the dataset. This may typically be redundant with
      the `title` slot, but the description is more human-readable and may be used
      to communicate nuances not captured by the rigid standardization of the title
      slot.
    recommended: true

Induced

name: Dataset
description: A reference to a publicly available omics or phenotype dataset
comments:
- Supports GEO, ArrayExpress, SRA, dbGaP, GTEx, ENCODE, phenopacket-store, and other
  repositories
from_schema: https://w3id.org/monarch-initiative/dismech
slot_usage:
  description:
    name: description
    description: A description of the dataset. This may typically be redundant with
      the `title` slot, but the description is more human-readable and may be used
      to communicate nuances not captured by the rigid standardization of the title
      slot.
    recommended: true
attributes:
  accession:
    name: accession
    implements:
    - linkml:authoritative_reference
    description: Dataset accession identifier as a CURIE (e.g., geo:GSE67472)
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    identifier: true
    alias: accession
    owner: Dataset
    domain_of:
    - Dataset
    range: uriorcurie
    required: true
  title:
    name: title
    implements:
    - linkml:title
    description: Title of the publication
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: title
    owner: Dataset
    domain_of:
    - Dataset
    - PublicationReference
    range: string
  description:
    name: description
    description: A description of the dataset. This may typically be redundant with
      the `title` slot, but the description is more human-readable and may be used
      to communicate nuances not captured by the rigid standardization of the title
      slot.
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: description
    owner: Dataset
    domain_of:
    - Descriptor
    - GeneticContext
    - Dataset
    - ClinicalTrial
    - ComputationalModel
    - DifferentialDiagnosis
    - Subtype
    - CausalEdge
    - TreatmentMechanismTarget
    - EpidemiologyInfo
    - Pathophysiology
    - Phenotype
    - HistopathologyFinding
    - Environmental
    - Disease
    - Stage
    - AgentLifeCycle
    - AgentLifeCycleStage
    - AnimalModel
    - Treatment
    - InfectiousAgent
    - Transmission
    - Assay
    - Diagnosis
    - Inheritance
    - Variant
    - FunctionalEffect
    - Mechanism
    - ModelingConsideration
    - Definition
    - CriteriaSet
    - ConditionDescriptor
    - GOEnrichment
    - ComorbidityHypothesis
    - UpstreamConditionHypothesis
    - MechanisticHypothesis
    range: string
    recommended: true
  organism:
    name: organism
    description: The organism from which samples were derived
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: organism
    owner: Dataset
    domain_of:
    - Dataset
    range: OrganismDescriptor
    inlined: true
  data_type:
    name: data_type
    description: The type of omics or other data in the dataset
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: data_type
    owner: Dataset
    domain_of:
    - Dataset
    range: DatasetTypeEnum
  sample_types:
    name: sample_types
    description: Types of biological samples in the dataset
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: sample_types
    owner: Dataset
    domain_of:
    - Dataset
    range: SampleTypeDescriptor
    multivalued: true
    inlined: true
    inlined_as_list: true
  sample_count:
    name: sample_count
    description: Total number of samples in the dataset
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: sample_count
    owner: Dataset
    domain_of:
    - Dataset
    range: integer
  conditions:
    name: conditions
    description: Experimental conditions or disease states represented
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: conditions
    owner: Dataset
    domain_of:
    - Dataset
    range: string
    multivalued: true
  exposures:
    name: exposures
    description: Environmental exposures studied in the dataset
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: exposures
    owner: Dataset
    domain_of:
    - Dataset
    range: ExposureDescriptor
    multivalued: true
    inlined: true
    inlined_as_list: true
  genes:
    name: genes
    examples:
    - value: '[{preferred_term: HLA-DQ2}, {preferred_term: INS}]'
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: genes
    owner: Dataset
    domain_of:
    - GeneticContext
    - Dataset
    - Subtype
    - Pathophysiology
    - AnimalModel
    range: GeneDescriptor
    multivalued: true
    inlined: true
    inlined_as_list: true
  platform:
    name: platform
    description: Sequencing or array platform used
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: platform
    owner: Dataset
    domain_of:
    - Dataset
    range: string
  publication:
    name: publication
    description: Associated publication (PMID)
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: publication
    owner: Dataset
    domain_of:
    - Dataset
    - ComputationalModel
    range: PMID
  findings:
    name: findings
    description: Key findings or claims extracted from this source (publication or
      dataset)
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: findings
    owner: Dataset
    domain_of:
    - Dataset
    - ComputationalModel
    - PublicationReference
    range: Finding
    multivalued: true
    inlined: true
    inlined_as_list: true
  evidence:
    name: evidence
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: evidence
    owner: Dataset
    domain_of:
    - PhenotypeContext
    - Dataset
    - ClinicalTrial
    - ComputationalModel
    - DifferentialDiagnosis
    - Subtype
    - CausalEdge
    - TreatmentMechanismTarget
    - Finding
    - Prevalence
    - ProgressionInfo
    - EpidemiologyInfo
    - Pathophysiology
    - Phenotype
    - Biochemical
    - HistopathologyFinding
    - Genetic
    - Environmental
    - Stage
    - AgentLifeCycle
    - AgentLifeCycleStage
    - AnimalModel
    - Treatment
    - InfectiousAgent
    - Transmission
    - Diagnosis
    - Inheritance
    - Variant
    - ModelingConsideration
    - ClassificationAssignment
    - Definition
    - CriteriaSet
    - AssociationSignal
    - AssociationStatistics
    - ComorbidityHypothesis
    - UpstreamConditionHypothesis
    - MechanisticHypothesis
    range: EvidenceItem
    recommended: true
    multivalued: true
    inlined: true
    inlined_as_list: true
  notes:
    name: notes
    examples:
    - value: Contagious stage where symptoms appear and the bacteria can be spread
        to others.
    from_schema: https://w3id.org/monarch-initiative/dismech
    rank: 1000
    alias: notes
    owner: Dataset
    domain_of:
    - GeneticContext
    - OnsetDescriptor
    - PhenotypeContext
    - Dataset
    - ClinicalTrial
    - ComputationalModel
    - DifferentialDiagnosis
    - Prevalence
    - ProgressionInfo
    - EpidemiologyInfo
    - Pathophysiology
    - Phenotype
    - Biochemical
    - HistopathologyFinding
    - Genetic
    - Environmental
    - Disease
    - Stage
    - AgentLifeCycle
    - AgentLifeCycleStage
    - Treatment
    - Transmission
    - Diagnosis
    - ClassificationAssignment
    - Definition
    - CriteriaSet
    - TermMapping
    - MappingConsistency
    - ComorbidityAssociation
    - AssociationSignal
    - AssociationMetric
    - AssociationStatistics
    - MechanisticHypothesis
    range: string