Class: Dataset
A reference to a publicly available omics or phenotype dataset
URI: dismech:Dataset
classDiagram
class Dataset
click Dataset href "../Dataset/"
Dataset : accession
Dataset : conditions
Dataset : data_type
Dataset --> "0..1" DatasetTypeEnum : data_type
click DatasetTypeEnum href "../DatasetTypeEnum/"
Dataset : description
Dataset : evidence
Dataset --> "*" EvidenceItem : evidence
click EvidenceItem href "../EvidenceItem/"
Dataset : exposures
Dataset --> "*" ExposureDescriptor : exposures
click ExposureDescriptor href "../ExposureDescriptor/"
Dataset : findings
Dataset --> "*" Finding : findings
click Finding href "../Finding/"
Dataset : genes
Dataset --> "*" GeneDescriptor : genes
click GeneDescriptor href "../GeneDescriptor/"
Dataset : notes
Dataset : organism
Dataset --> "0..1" OrganismDescriptor : organism
click OrganismDescriptor href "../OrganismDescriptor/"
Dataset : platform
Dataset : publication
Dataset : sample_count
Dataset : sample_types
Dataset --> "*" SampleTypeDescriptor : sample_types
click SampleTypeDescriptor href "../SampleTypeDescriptor/"
Dataset : title
Slots
| Name | Cardinality and Range | Description | Inheritance |
|---|---|---|---|
| accession | 1 Uriorcurie |
Dataset accession identifier as a CURIE (e | direct |
| title | 0..1 String |
Title of the publication | direct |
| description | 0..1 recommended String |
A description of the dataset | direct |
| organism | 0..1 OrganismDescriptor |
The organism from which samples were derived | direct |
| data_type | 0..1 DatasetTypeEnum |
The type of omics or other data in the dataset | direct |
| sample_types | * SampleTypeDescriptor |
Types of biological samples in the dataset | direct |
| sample_count | 0..1 Integer |
Total number of samples in the dataset | direct |
| conditions | * String |
Experimental conditions or disease states represented | direct |
| exposures | * ExposureDescriptor |
Environmental exposures studied in the dataset | direct |
| genes | * GeneDescriptor |
direct | |
| platform | 0..1 String |
Sequencing or array platform used | direct |
| publication | 0..1 PMID |
Associated publication (PMID) | direct |
| findings | * Finding |
Key findings or claims extracted from this source (publication or dataset) | direct |
| evidence | * recommended EvidenceItem |
direct | |
| notes | 0..1 String |
direct |
Usages
| used by | used in | type | used |
|---|---|---|---|
| Disease | datasets | range | Dataset |
Comments
- Supports GEO, ArrayExpress, SRA, dbGaP, GTEx, ENCODE, phenopacket-store, and other repositories
Identifier and Mapping Information
Schema Source
- from schema: https://w3id.org/monarch-initiative/dismech
Mappings
| Mapping Type | Mapped Value |
|---|---|
| self | dismech:Dataset |
| native | dismech:Dataset |
LinkML Source
Direct
name: Dataset
description: A reference to a publicly available omics or phenotype dataset
comments:
- Supports GEO, ArrayExpress, SRA, dbGaP, GTEx, ENCODE, phenopacket-store, and other
repositories
from_schema: https://w3id.org/monarch-initiative/dismech
slots:
- accession
- title
- description
- organism
- data_type
- sample_types
- sample_count
- conditions
- exposures
- genes
- platform
- publication
- findings
- evidence
- notes
slot_usage:
description:
name: description
description: A description of the dataset. This may typically be redundant with
the `title` slot, but the description is more human-readable and may be used
to communicate nuances not captured by the rigid standardization of the title
slot.
recommended: true
Induced
name: Dataset
description: A reference to a publicly available omics or phenotype dataset
comments:
- Supports GEO, ArrayExpress, SRA, dbGaP, GTEx, ENCODE, phenopacket-store, and other
repositories
from_schema: https://w3id.org/monarch-initiative/dismech
slot_usage:
description:
name: description
description: A description of the dataset. This may typically be redundant with
the `title` slot, but the description is more human-readable and may be used
to communicate nuances not captured by the rigid standardization of the title
slot.
recommended: true
attributes:
accession:
name: accession
implements:
- linkml:authoritative_reference
description: Dataset accession identifier as a CURIE (e.g., geo:GSE67472)
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
identifier: true
alias: accession
owner: Dataset
domain_of:
- Dataset
range: uriorcurie
required: true
title:
name: title
implements:
- linkml:title
description: Title of the publication
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: title
owner: Dataset
domain_of:
- Dataset
- PublicationReference
range: string
description:
name: description
description: A description of the dataset. This may typically be redundant with
the `title` slot, but the description is more human-readable and may be used
to communicate nuances not captured by the rigid standardization of the title
slot.
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: description
owner: Dataset
domain_of:
- Descriptor
- GeneticContext
- Dataset
- ClinicalTrial
- ComputationalModel
- DifferentialDiagnosis
- Subtype
- CausalEdge
- TreatmentMechanismTarget
- EpidemiologyInfo
- Pathophysiology
- Phenotype
- HistopathologyFinding
- Environmental
- Disease
- Stage
- AgentLifeCycle
- AgentLifeCycleStage
- AnimalModel
- Treatment
- InfectiousAgent
- Transmission
- Assay
- Diagnosis
- Inheritance
- Variant
- FunctionalEffect
- Mechanism
- ModelingConsideration
- Definition
- CriteriaSet
- ConditionDescriptor
- GOEnrichment
- ComorbidityHypothesis
- UpstreamConditionHypothesis
- MechanisticHypothesis
range: string
recommended: true
organism:
name: organism
description: The organism from which samples were derived
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: organism
owner: Dataset
domain_of:
- Dataset
range: OrganismDescriptor
inlined: true
data_type:
name: data_type
description: The type of omics or other data in the dataset
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: data_type
owner: Dataset
domain_of:
- Dataset
range: DatasetTypeEnum
sample_types:
name: sample_types
description: Types of biological samples in the dataset
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: sample_types
owner: Dataset
domain_of:
- Dataset
range: SampleTypeDescriptor
multivalued: true
inlined: true
inlined_as_list: true
sample_count:
name: sample_count
description: Total number of samples in the dataset
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: sample_count
owner: Dataset
domain_of:
- Dataset
range: integer
conditions:
name: conditions
description: Experimental conditions or disease states represented
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: conditions
owner: Dataset
domain_of:
- Dataset
range: string
multivalued: true
exposures:
name: exposures
description: Environmental exposures studied in the dataset
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: exposures
owner: Dataset
domain_of:
- Dataset
range: ExposureDescriptor
multivalued: true
inlined: true
inlined_as_list: true
genes:
name: genes
examples:
- value: '[{preferred_term: HLA-DQ2}, {preferred_term: INS}]'
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: genes
owner: Dataset
domain_of:
- GeneticContext
- Dataset
- Subtype
- Pathophysiology
- AnimalModel
range: GeneDescriptor
multivalued: true
inlined: true
inlined_as_list: true
platform:
name: platform
description: Sequencing or array platform used
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: platform
owner: Dataset
domain_of:
- Dataset
range: string
publication:
name: publication
description: Associated publication (PMID)
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: publication
owner: Dataset
domain_of:
- Dataset
- ComputationalModel
range: PMID
findings:
name: findings
description: Key findings or claims extracted from this source (publication or
dataset)
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: findings
owner: Dataset
domain_of:
- Dataset
- ComputationalModel
- PublicationReference
range: Finding
multivalued: true
inlined: true
inlined_as_list: true
evidence:
name: evidence
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: evidence
owner: Dataset
domain_of:
- PhenotypeContext
- Dataset
- ClinicalTrial
- ComputationalModel
- DifferentialDiagnosis
- Subtype
- CausalEdge
- TreatmentMechanismTarget
- Finding
- Prevalence
- ProgressionInfo
- EpidemiologyInfo
- Pathophysiology
- Phenotype
- Biochemical
- HistopathologyFinding
- Genetic
- Environmental
- Stage
- AgentLifeCycle
- AgentLifeCycleStage
- AnimalModel
- Treatment
- InfectiousAgent
- Transmission
- Diagnosis
- Inheritance
- Variant
- ModelingConsideration
- ClassificationAssignment
- Definition
- CriteriaSet
- AssociationSignal
- AssociationStatistics
- ComorbidityHypothesis
- UpstreamConditionHypothesis
- MechanisticHypothesis
range: EvidenceItem
recommended: true
multivalued: true
inlined: true
inlined_as_list: true
notes:
name: notes
examples:
- value: Contagious stage where symptoms appear and the bacteria can be spread
to others.
from_schema: https://w3id.org/monarch-initiative/dismech
rank: 1000
alias: notes
owner: Dataset
domain_of:
- GeneticContext
- OnsetDescriptor
- PhenotypeContext
- Dataset
- ClinicalTrial
- ComputationalModel
- DifferentialDiagnosis
- Prevalence
- ProgressionInfo
- EpidemiologyInfo
- Pathophysiology
- Phenotype
- Biochemical
- HistopathologyFinding
- Genetic
- Environmental
- Disease
- Stage
- AgentLifeCycle
- AgentLifeCycleStage
- Treatment
- Transmission
- Diagnosis
- ClassificationAssignment
- Definition
- CriteriaSet
- TermMapping
- MappingConsistency
- ComorbidityAssociation
- AssociationSignal
- AssociationMetric
- AssociationStatistics
- MechanisticHypothesis
range: string