Checklist: Minimum Information About a Single Amplified Genome (Misag)
Minimum Information About a Single Amplified Genome
Terms
MIXS ID | Name | Cardinality and Range | Description |
---|---|---|---|
MIXS:0001107 | samp_name | 1 String |
A local identifier or name that for the material sample used for extracting n... |
MIXS:0000017 | size_frac | 0..1 String |
Filtering pore size used in sample preparation |
MIXS:0000043 | lib_screen | 0..1 recommended String |
Specific enrichment or screening methods applied before and/or after creating... |
MIXS:0000062 | ref_db | 0..1 String |
List of database(s) used for ORF annotation, along with version number and re... |
MIXS:0000038 | nucl_acid_amp | 0..1 recommended String |
A link to a literature reference, electronic resource or a standard operating... |
MIXS:0000039 | lib_size | 0..1 recommended Integer |
Total number of clones in the library prepared for the project |
MIXS:0000005 | contam_screen_input | 0..1 ContamScreenInputEnum |
The type of sequence data used as input |
MIXS:0000047 | mid | 0..1 recommended String |
Molecular barcodes, called Multiplex Identifiers (MIDs), that are used to spe... |
MIXS:0000057 | assembly_name | 0..1 recommended String |
Name/version of the assembly provided by the submitter that is used in the ge... |
MIXS:0000113 | temp | 0..1 recommended String |
Temperature of the sample at the time of sampling |
MIXS:0000069 | compl_score | 1 String |
Completeness score is typically based on either the fraction of markers found... |
MIXS:0000067 | trnas | 0..1 String |
The total number of tRNAs identified from the SAG or MAG |
MIXS:0000037 | nucl_acid_ext | 0..1 recommended String |
A link to a literature reference, electronic resource or a standard operating... |
MIXS:0000001 | samp_size | 0..1 recommended String |
The total amount or size (volume (ml), mass (g) or area (m2) ) of sample coll... |
MIXS:0000094 | alt | 0..1 recommended String |
Heights of objects such as airplanes, space shuttles, rockets, atmospheric ba... |
MIXS:0000026 | source_mat_id | * recommended String |
A unique identifier assigned to a material sample (as defined by http://rs |
MIXS:0000111 | samp_vol_we_dna_ext | 0..1 String |
Volume (ml) or mass (g) of total collected sample processed for DNA extractio... |
MIXS:0000040 | lib_reads_seqd | 0..1 recommended Integer |
Total number of clones sequenced from the library |
MIXS:0000015 | rel_to_oxygen | 0..1 RelToOxygenEnum |
Is this organism an aerobe, anaerobe? Please note that aerobic and anaerobic ... |
MIXS:0000006 | wga_amp_kit | 0..1 String |
Kit used to amplify genomic DNA in preparation for sequencing |
MIXS:0000074 | decontam_software | 0..1 String |
Tool(s) used in contamination screening |
MIXS:0000002 | samp_collect_device | 0..1 recommended String |
The device used to collect an environmental sample |
MIXS:0000060 | number_contig | 0..1 Integer |
Total number of contigs in the cleaned/submitted assembly that makes up a giv... |
MIXS:0000068 | trna_ext_software | 0..1 String |
Tools used for tRNA identification |
MIXS:0000054 | sc_lysis_method | 0..1 String |
Name of the kit or standard protocol used for cell(s) or particle(s) lysis |
MIXS:0000041 | lib_layout | 0..1 recommended LibLayoutEnum |
Specify whether to expect single, paired, or other configuration of reads |
MIXS:0000073 | contam_screen_param | 0..1 String |
Specific parameters used in the decontamination sofware, such as reference da... |
MIXS:0000056 | assembly_qual | 1 AssemblyQualEnum |
The assembly quality category is based on sets of criteria outlined for each ... |
MIXS:0000025 | ref_biomaterial | 0..1 String |
Primary publication if isolated before genome publication; otherwise, primary... |
MIXS:0000092 | project_name | 1 String |
Name of the project within which the sequencing was organized |
MIXS:0000042 | lib_vector | 0..1 recommended String |
Cloning vector type(s) used in construction of libraries |
MIXS:0000048 | adapters | 0..1 recommended String |
Adapters provide priming sequences for both amplification and sequencing of t... |
MIXS:0001321 | neg_cont_type | 0..1 recommended NegContTypeEnum |
The substance or equipment used as a negative control in an investigation |
MIXS:0000058 | assembly_software | 1 String |
Tool(s) used for assembly, including version number and parameters |
MIXS:0000053 | tax_ident | 1 TaxIdentEnum |
The phylogenetic marker(s) used to assign an organism name to the SAG or MAG |
MIXS:0000072 | contam_score | 1 Float |
The contamination score is based on the fraction of single-copy genes that ar... |
MIXS:0000059 | annot | 0..1 String |
Tool used for annotation, or for cases where annotation was provided by a com... |
MIXS:0000066 | x16s_recover_software | 0..1 String |
Tools used for 16S rRNA gene extraction |
MIXS:0000065 | x16s_recover | 0..1 Boolean |
Can a 16S gene be recovered from the submitted SAG or MAG? |
MIXS:0001322 | pos_cont_type | 0..1 recommended String |
The substance, mixture, product, or apparatus used to verify that a process w... |
MIXS:0000061 | feat_pred | 0..1 String |
Method used to predict UViGs features such as ORFs, integration site, etc |
MIXS:0000070 | compl_software | 1 String |
Tools used for completion estimate, i |
MIXS:0000013 | env_local_scale | 1 String |
Report the entity or entities which are in the sample or specimen s local vic... |
MIXS:0000075 | sort_tech | 1 SortTechEnum |
Method used to sort/isolate cells or particles of interest |
MIXS:0000016 | samp_mat_process | 0..1 recommended String |
A brief description of any processing applied to the sample during or after r... |
MIXS:0000063 | sim_search_meth | 0..1 String |
Tool used to compare ORFs with database, along with version and cutoffs used |
MIXS:0000018 | depth | 0..1 recommended String |
The vertical distance below local surface |
MIXS:0001225 | samp_collect_method | 0..1 recommended String |
The method employed for collecting the sample |
MIXS:0000055 | wga_amp_appr | 1 WgaAmpApprEnum |
Method used to amplify genomic DNA in preparation for sequencing |
MIXS:0000071 | compl_appr | 0..1 ComplApprEnum |
The approach used to determine the completeness of a given genomic assembly, ... |
MIXS:0000014 | env_medium | 1 String |
Report the environmental material(s) immediately surrounding the sample or sp... |
MIXS:0001320 | samp_taxon_id | 1 String |
NCBI taxon id of the sample |
MIXS:0000010 | geo_loc_name | 1 String |
The geographical origin of the sample as defined by the country or sea name f... |
MIXS:0000076 | sc_lysis_approach | 1 ScLysisApproachEnum |
Method used to free DNA from interior of the cell(s) or particle(s) |
MIXS:0000011 | collection_date | 1 Datetime |
The time of sampling, either as an instance (single point in time) or interva... |
MIXS:0000050 | seq_meth | 1 String |
Sequencing machine used |
MIXS:0000009 | lat_lon | 1 String |
The geographical origin of the sample as defined by latitude and longitude |
MIXS:0000093 | elev | 0..1 recommended String |
Elevation of the sampling site is its height above a fixed reference point, m... |
MIXS:0000012 | env_broad_scale | 1 String |
Report the major environmental system the sample or specimen came from |
MIXS:0000064 | tax_class | 0..1 String |
Method used for taxonomic classification, along with reference database used,... |
MIXS:0000008 | experimental_factor | * recommended String |
Variable aspects of an experiment design that can be used to describe an expe... |
MIXS:0000091 | associated_resource | * recommended String |
A related resource that is referenced, cited, or otherwise associated to the ... |
MIXS:0000090 | sop | * recommended String |
Standard operating procedures used in assembly and/or annotation of genomes, ... |
Aliases
- misag
LinkML Source
Direct
name: Misag
description: Minimum Information About a Single Amplified Genome
title: Minimum Information About a Single Amplified Genome
from_schema: https://w3id.org/mixs
aliases:
- misag
is_a: Checklist
mixin: true
slots:
- samp_name
- size_frac
- lib_screen
- ref_db
- nucl_acid_amp
- lib_size
- contam_screen_input
- mid
- assembly_name
- temp
- compl_score
- trnas
- nucl_acid_ext
- samp_size
- alt
- source_mat_id
- samp_vol_we_dna_ext
- lib_reads_seqd
- rel_to_oxygen
- wga_amp_kit
- decontam_software
- samp_collect_device
- number_contig
- trna_ext_software
- sc_lysis_method
- lib_layout
- contam_screen_param
- assembly_qual
- ref_biomaterial
- project_name
- lib_vector
- adapters
- neg_cont_type
- assembly_software
- tax_ident
- contam_score
- annot
- x16s_recover_software
- x16s_recover
- pos_cont_type
- feat_pred
- compl_software
- env_local_scale
- sort_tech
- samp_mat_process
- sim_search_meth
- depth
- samp_collect_method
- wga_amp_appr
- compl_appr
- env_medium
- samp_taxon_id
- geo_loc_name
- sc_lysis_approach
- collection_date
- seq_meth
- lat_lon
- elev
- env_broad_scale
- tax_class
- experimental_factor
- associated_resource
- sop
slot_usage:
adapters:
name: adapters
recommended: true
alt:
name: alt
recommended: true
assembly_name:
name: assembly_name
recommended: true
assembly_qual:
name: assembly_qual
required: true
assembly_software:
name: assembly_software
required: true
compl_score:
name: compl_score
required: true
compl_software:
name: compl_software
required: true
contam_score:
name: contam_score
required: true
depth:
name: depth
examples:
- value: 10 meter
recommended: true
elev:
name: elev
recommended: true
experimental_factor:
name: experimental_factor
recommended: true
lib_layout:
name: lib_layout
recommended: true
lib_reads_seqd:
name: lib_reads_seqd
recommended: true
lib_screen:
name: lib_screen
recommended: true
lib_size:
name: lib_size
recommended: true
lib_vector:
name: lib_vector
recommended: true
mid:
name: mid
recommended: true
nucl_acid_amp:
name: nucl_acid_amp
recommended: true
nucl_acid_ext:
name: nucl_acid_ext
recommended: true
samp_collect_device:
name: samp_collect_device
examples:
- value: swab, biopsy, niskin bottle, push core, drag swab [GENEPIO:0002713]
recommended: true
samp_collect_method:
name: samp_collect_method
examples:
- value: swabbing
recommended: true
samp_mat_process:
name: samp_mat_process
recommended: true
samp_size:
name: samp_size
recommended: true
sc_lysis_approach:
name: sc_lysis_approach
required: true
sop:
name: sop
recommended: true
sort_tech:
name: sort_tech
required: true
source_mat_id:
name: source_mat_id
recommended: true
tax_ident:
name: tax_ident
required: true
temp:
name: temp
recommended: true
wga_amp_appr:
name: wga_amp_appr
required: true
class_uri: MIXS:0010010
Induced
name: Misag
description: Minimum Information About a Single Amplified Genome
title: Minimum Information About a Single Amplified Genome
from_schema: https://w3id.org/mixs
aliases:
- misag
is_a: Checklist
mixin: true
slot_usage:
adapters:
name: adapters
recommended: true
alt:
name: alt
recommended: true
assembly_name:
name: assembly_name
recommended: true
assembly_qual:
name: assembly_qual
required: true
assembly_software:
name: assembly_software
required: true
compl_score:
name: compl_score
required: true
compl_software:
name: compl_software
required: true
contam_score:
name: contam_score
required: true
depth:
name: depth
examples:
- value: 10 meter
recommended: true
elev:
name: elev
recommended: true
experimental_factor:
name: experimental_factor
recommended: true
lib_layout:
name: lib_layout
recommended: true
lib_reads_seqd:
name: lib_reads_seqd
recommended: true
lib_screen:
name: lib_screen
recommended: true
lib_size:
name: lib_size
recommended: true
lib_vector:
name: lib_vector
recommended: true
mid:
name: mid
recommended: true
nucl_acid_amp:
name: nucl_acid_amp
recommended: true
nucl_acid_ext:
name: nucl_acid_ext
recommended: true
samp_collect_device:
name: samp_collect_device
examples:
- value: swab, biopsy, niskin bottle, push core, drag swab [GENEPIO:0002713]
recommended: true
samp_collect_method:
name: samp_collect_method
examples:
- value: swabbing
recommended: true
samp_mat_process:
name: samp_mat_process
recommended: true
samp_size:
name: samp_size
recommended: true
sc_lysis_approach:
name: sc_lysis_approach
required: true
sop:
name: sop
recommended: true
sort_tech:
name: sort_tech
required: true
source_mat_id:
name: source_mat_id
recommended: true
tax_ident:
name: tax_ident
required: true
temp:
name: temp
recommended: true
wga_amp_appr:
name: wga_amp_appr
required: true
attributes:
samp_name:
name: samp_name
annotations:
Preferred_unit:
tag: Preferred_unit
value: ''
description: A local identifier or name that for the material sample used for
extracting nucleic acids, and subsequent sequencing. It can refer either to
the original material collected or to any derived sub-samples. It can have any
format, but we suggest that you make it concise, unique and consistent within
your lab, and as informative as possible. INSDC requires every sample name from
a single Submitter to be unique. Use of a globally unique identifier for the
field source_mat_id is recommended in addition to sample_name
title: sample name
examples:
- value: ISDsoil1
in_subset:
- investigation
from_schema: https://w3id.org/mixs
keywords:
- sample
slot_uri: MIXS:0001107
alias: samp_name
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Air
- BuiltEnvironment
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodFoodProductionFacility
- FoodHumanFoods
- HostAssociated
- HumanAssociated
- HumanGut
- HumanOral
- HumanSkin
- HumanVaginal
- HydrocarbonResourcesCores
- HydrocarbonResourcesFluidsSwabs
- MicrobialMatBiofilm
- MiscellaneousNaturalOrArtificialEnvironment
- PlantAssociated
- Sediment
- Soil
- SymbiontAssociated
- WastewaterSludge
- Water
range: string
required: true
size_frac:
name: size_frac
annotations:
Expected_value:
tag: Expected_value
value: filter size value range
description: Filtering pore size used in sample preparation
title: size fraction selected
examples:
- value: 0-0.22 micrometer
in_subset:
- nucleic acid sequence source
from_schema: https://w3id.org/mixs
keywords:
- fraction
- size
string_serialization: '{float}-{float} {unit}'
slot_uri: MIXS:0000017
alias: size_frac
owner: Misag
domain_of:
- Mimag
- MimarksS
- Mims
- Misag
- Miuvig
range: string
lib_screen:
name: lib_screen
annotations:
Expected_value:
tag: Expected_value
value: screening strategy name
description: Specific enrichment or screening methods applied before and/or after
creating libraries
title: library screening strategy
examples:
- value: enriched, screened, normalized
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- library
slot_uri: MIXS:0000043
alias: lib_screen
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: string
recommended: true
ref_db:
name: ref_db
annotations:
Expected_value:
tag: Expected_value
value: names, versions, and references of databases
description: List of database(s) used for ORF annotation, along with version number
and reference to website or publication
title: reference database(s)
examples:
- value: pVOGs;5;http://dmk-brain.ecn.uiowa.edu/pVOGs/ Grazziotin et al. 2017
doi:10.1093/nar/gkw975
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- database
string_serialization: '{database};{version};{reference}'
slot_uri: MIXS:0000062
alias: ref_db
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Mims
- Misag
- Miuvig
range: string
nucl_acid_amp:
name: nucl_acid_amp
description: A link to a literature reference, electronic resource or a standard
operating procedure (SOP), that describes the enzymatic amplification (PCR,
TMA, NASBA) of specific nucleic acids
title: nucleic acid amplification
examples:
- value: https://phylogenomics.me/protocols/16s-pcr-protocol/
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
slot_uri: MIXS:0000038
alias: nucl_acid_amp
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: string
recommended: true
pattern: ^^PMID:\d+$|^doi:10.\d{2,9}/.*$|^https?:\/\/(?:www\.)?[-a-zA-Z0-9@:%._\+~#=]{1,256}\.[a-zA-Z0-9()]{1,6}\b(?:[-a-zA-Z0-9()@:%_\+.~#?&\/=]*)$$
structured_pattern:
syntax: ^{PMID}|{DOI}|{URL}$
interpolated: true
partial_match: true
lib_size:
name: lib_size
description: Total number of clones in the library prepared for the project
title: library size
examples:
- value: '50'
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- library
- size
slot_uri: MIXS:0000039
alias: lib_size
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: integer
recommended: true
contam_screen_input:
name: contam_screen_input
description: The type of sequence data used as input
title: contamination screening input
examples:
- value: contigs
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
slot_uri: MIXS:0000005
alias: contam_screen_input
owner: Misag
domain_of:
- Mimag
- Misag
range: ContamScreenInputEnum
mid:
name: mid
description: Molecular barcodes, called Multiplex Identifiers (MIDs), that are
used to specifically tag unique samples in a sequencing run. Sequence should
be reported in uppercase letters
title: multiplex identifiers
examples:
- value: GTGAATAT
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- identifier
slot_uri: MIXS:0000047
alias: mid
owner: Misag
domain_of:
- Mimag
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: string
recommended: true
pattern: ^[ACGTRKSYMWBHDVN]+$
structured_pattern:
syntax: ^{ambiguous_nucleotides}$
interpolated: true
partial_match: true
assembly_name:
name: assembly_name
annotations:
Expected_value:
tag: Expected_value
value: name and version of assembly
description: Name/version of the assembly provided by the submitter that is used
in the genome browsers and in the community
title: assembly name
examples:
- value: HuRef, JCVI_ISG_i3_1.0
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
string_serialization: '{text} {text}'
slot_uri: MIXS:0000057
alias: assembly_name
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Mims
- Misag
- Miuvig
- Agriculture
range: string
recommended: true
temp:
name: temp
annotations:
Preferred_unit:
tag: Preferred_unit
value: degree Celsius
description: Temperature of the sample at the time of sampling
title: temperature
examples:
- value: 25 degree Celsius
in_subset:
- environment
from_schema: https://w3id.org/mixs
keywords:
- temperature
slot_uri: MIXS:0000113
alias: temp
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
- Air
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodHumanFoods
- HostAssociated
- HumanAssociated
- HumanGut
- HumanOral
- HumanSkin
- HumanVaginal
- HydrocarbonResourcesCores
- HydrocarbonResourcesFluidsSwabs
- MicrobialMatBiofilm
- MiscellaneousNaturalOrArtificialEnvironment
- PlantAssociated
- Sediment
- Soil
- SymbiontAssociated
- WastewaterSludge
- Water
range: string
recommended: true
pattern: ^[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?( *- *[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?)?
*([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{scientific_float}( *- *{scientific_float})? *{text}$
interpolated: true
partial_match: true
compl_score:
name: compl_score
annotations:
Expected_value:
tag: Expected_value
value: quality;percent completeness
description: 'Completeness score is typically based on either the fraction of
markers found as compared to a database or the percent of a genome found as
compared to a closely related reference genome. High Quality Draft: >90%, Medium
Quality Draft: >50%, and Low Quality Draft: < 50% should have the indicated
completeness scores'
title: completeness score
examples:
- value: med;60%
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- score
string_serialization: '[high|med|low];{percentage}'
slot_uri: MIXS:0000069
alias: compl_score
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Misag
- Miuvig
range: string
required: true
trnas:
name: trnas
annotations:
Expected_value:
tag: Expected_value
value: value from 0-21
description: The total number of tRNAs identified from the SAG or MAG
title: number of standard tRNAs extracted
examples:
- value: '18'
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- number
string_serialization: '{integer}'
slot_uri: MIXS:0000067
alias: trnas
owner: Misag
domain_of:
- Mimag
- Misag
- Miuvig
range: string
nucl_acid_ext:
name: nucl_acid_ext
description: A link to a literature reference, electronic resource or a standard
operating procedure (SOP), that describes the material separation to recover
the nucleic acid fraction from a sample
title: nucleic acid extraction
examples:
- value: https://mobio.com/media/wysiwyg/pdfs/protocols/12888.pdf
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
slot_uri: MIXS:0000037
alias: nucl_acid_ext
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodFoodProductionFacility
- FoodHumanFoods
range: string
recommended: true
pattern: ^^PMID:\d+$|^doi:10.\d{2,9}/.*$|^https?:\/\/(?:www\.)?[-a-zA-Z0-9@:%._\+~#=]{1,256}\.[a-zA-Z0-9()]{1,6}\b(?:[-a-zA-Z0-9()@:%_\+.~#?&\/=]*)$$
structured_pattern:
syntax: ^{PMID}|{DOI}|{URL}$
interpolated: true
partial_match: true
samp_size:
name: samp_size
description: The total amount or size (volume (ml), mass (g) or area (m2) ) of
sample collected
title: amount or size of sample collected
examples:
- value: 5 liter
in_subset:
- nucleic acid sequence source
from_schema: https://w3id.org/mixs
keywords:
- sample
- size
slot_uri: MIXS:0000001
alias: samp_size
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodFoodProductionFacility
- FoodHumanFoods
range: string
recommended: true
pattern: ^[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?( *- *[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?)?
*([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{scientific_float}( *- *{scientific_float})? *{text}$
interpolated: true
partial_match: true
alt:
name: alt
annotations:
Preferred_unit:
tag: Preferred_unit
value: meter
description: Heights of objects such as airplanes, space shuttles, rockets, atmospheric
balloons and heights of places such as atmospheric layers and clouds. It is
used to measure the height of an object which is above the earth's surface.
In this context, the altitude measurement is the vertical distance between the
earth's surface above sea level and the sampled position in the air
title: altitude
examples:
- value: 100 meter
in_subset:
- environment
from_schema: https://w3id.org/mixs
slot_uri: MIXS:0000094
alias: alt
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Air
- HostAssociated
- MiscellaneousNaturalOrArtificialEnvironment
- SymbiontAssociated
range: string
recommended: true
pattern: ^[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?( *- *[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?)?
*([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{scientific_float}( *- *{scientific_float})? *{text}$
interpolated: true
partial_match: true
source_mat_id:
name: source_mat_id
annotations:
Expected_value:
tag: Expected_value
value: 'for cultures of microorganisms: identifiers for two culture collections;
for other material a unique arbitrary identifer'
description: A unique identifier assigned to a material sample (as defined by
http://rs.tdwg.org/dwc/terms/materialSampleID, and as opposed to a particular
digital record of a material sample) used for extracting nucleic acids, and
subsequent sequencing. The identifier can refer either to the original material
collected or to any derived sub-samples. The INSDC qualifiers /specimen_voucher,
/bio_material, or /culture_collection may or may not share the same value as
the source_mat_id field. For instance, the /specimen_voucher qualifier and source_mat_id
may both contain 'UAM:Herps:14' , referring to both the specimen voucher and
sampled tissue with the same identifier. However, the /culture_collection qualifier
may refer to a value from an initial culture (e.g. ATCC:11775) while source_mat_id
would refer to an identifier from some derived culture from which the nucleic
acids were extracted (e.g. xatc123 or ark:/2154/R2)
title: source material identifiers
examples:
- value: MPI012345
in_subset:
- nucleic acid sequence source
from_schema: https://w3id.org/mixs
keywords:
- identifier
- material
- source
slot_uri: MIXS:0000026
alias: source_mat_id
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
- SymbiontAssociated
range: string
recommended: true
multivalued: true
samp_vol_we_dna_ext:
name: samp_vol_we_dna_ext
annotations:
Preferred_unit:
tag: Preferred_unit
value: milliliter, gram, milligram, square centimeter
description: 'Volume (ml) or mass (g) of total collected sample processed for
DNA extraction. Note: total sample collected should be entered under the term
Sample Size (MIXS:0000001)'
title: sample volume or weight for DNA extraction
examples:
- value: 1500 milliliter
in_subset:
- nucleic acid sequence source
from_schema: https://w3id.org/mixs
keywords:
- dna
- sample
- volume
- weight
slot_uri: MIXS:0000111
alias: samp_vol_we_dna_ext
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
- Air
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodFoodProductionFacility
- FoodHumanFoods
- HostAssociated
- HumanAssociated
- HumanGut
- HumanOral
- HumanSkin
- HumanVaginal
- HydrocarbonResourcesCores
- HydrocarbonResourcesFluidsSwabs
- MicrobialMatBiofilm
- MiscellaneousNaturalOrArtificialEnvironment
- PlantAssociated
- Sediment
- Soil
- SymbiontAssociated
- WastewaterSludge
- Water
range: string
pattern: ^[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?( *- *[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?)?
*([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{scientific_float}( *- *{scientific_float})? *{text}$
interpolated: true
partial_match: true
lib_reads_seqd:
name: lib_reads_seqd
description: Total number of clones sequenced from the library
title: library reads sequenced
examples:
- value: '20'
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- library
slot_uri: MIXS:0000040
alias: lib_reads_seqd
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: integer
recommended: true
rel_to_oxygen:
name: rel_to_oxygen
description: Is this organism an aerobe, anaerobe? Please note that aerobic and
anaerobic are valid descriptors for microbial environments
title: relationship to oxygen
examples:
- value: aerobe
in_subset:
- nucleic acid sequence source
from_schema: https://w3id.org/mixs
keywords:
- oxygen
- relationship
slot_uri: MIXS:0000015
alias: rel_to_oxygen
owner: Misag
domain_of:
- MigsBa
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
range: RelToOxygenEnum
wga_amp_kit:
name: wga_amp_kit
annotations:
Expected_value:
tag: Expected_value
value: kit name
description: Kit used to amplify genomic DNA in preparation for sequencing
title: WGA amplification kit
examples:
- value: qiagen repli-g
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- kit
slot_uri: MIXS:0000006
alias: wga_amp_kit
owner: Misag
domain_of:
- Misag
- Miuvig
range: string
decontam_software:
name: decontam_software
annotations:
Expected_value:
tag: Expected_value
value: enumeration
description: Tool(s) used in contamination screening
title: decontamination software
examples:
- value: anvi'o
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- software
string_serialization: '[checkm/refinem|anvi''o|prodege|bbtools:decontaminate.sh|acdc|combination]'
slot_uri: MIXS:0000074
alias: decontam_software
owner: Misag
domain_of:
- Mimag
- Misag
range: string
samp_collect_device:
name: samp_collect_device
annotations:
Expected_value:
tag: Expected_value
value: device name
description: The device used to collect an environmental sample. This field accepts
terms listed under environmental sampling device (http://purl.obolibrary.org/obo/ENVO).
This field also accepts terms listed under specimen collection device (http://purl.obolibrary.org/obo/GENEPIO_0002094)
title: sample collection device
examples:
- value: swab, biopsy, niskin bottle, push core, drag swab [GENEPIO:0002713]
in_subset:
- nucleic acid sequence source
from_schema: https://w3id.org/mixs
keywords:
- device
- sample
string_serialization: '{termLabel} [{termID}]|{text}'
slot_uri: MIXS:0000002
alias: samp_collect_device
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodFoodProductionFacility
- FoodHumanFoods
range: string
recommended: true
number_contig:
name: number_contig
description: Total number of contigs in the cleaned/submitted assembly that makes
up a given genome, SAG, MAG, or UViG
title: number of contigs
examples:
- value: '40'
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- number
slot_uri: MIXS:0000060
alias: number_contig
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Mims
- Misag
- Miuvig
range: integer
trna_ext_software:
name: trna_ext_software
description: Tools used for tRNA identification
title: tRNA extraction software
examples:
- value: infernal;v2;default parameters
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- software
slot_uri: MIXS:0000068
alias: trna_ext_software
owner: Misag
domain_of:
- Mimag
- Misag
- Miuvig
range: string
pattern: ^([^\s-]{1,2}|[^\s-]+.+[^\s-]+);([^\s-]{1,2}|[^\s-]+.+[^\s-]+);([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{software};{version};{parameters}$
interpolated: true
partial_match: true
sc_lysis_method:
name: sc_lysis_method
annotations:
Expected_value:
tag: Expected_value
value: kit, protocol name
description: Name of the kit or standard protocol used for cell(s) or particle(s)
lysis
title: single cell or viral particle lysis kit protocol
examples:
- value: ambion single cell lysis kit
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- kit
- particle
- protocol
- single
slot_uri: MIXS:0000054
alias: sc_lysis_method
owner: Misag
domain_of:
- Misag
- Miuvig
range: string
lib_layout:
name: lib_layout
description: Specify whether to expect single, paired, or other configuration
of reads
title: library layout
examples:
- value: paired
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- library
slot_uri: MIXS:0000041
alias: lib_layout
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: LibLayoutEnum
recommended: true
contam_screen_param:
name: contam_screen_param
annotations:
Expected_value:
tag: Expected_value
value: enumeration;value or name
description: Specific parameters used in the decontamination sofware, such as
reference database, coverage, and kmers. Combinations of these parameters may
also be used, i.e. kmer and coverage, or reference database and kmer
title: contamination screening parameters
examples:
- value: kmer
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- parameter
string_serialization: '[ref db|kmer|coverage|combination];{text|integer}'
slot_uri: MIXS:0000073
alias: contam_screen_param
owner: Misag
domain_of:
- Mimag
- Misag
range: string
assembly_qual:
name: assembly_qual
description: 'The assembly quality category is based on sets of criteria outlined
for each assembly quality category. For MISAG/MIMAG; Finished: Single, validated,
contiguous sequence per replicon without gaps or ambiguities with a consensus
error rate equivalent to Q50 or better. High Quality Draft:Multiple fragments
where gaps span repetitive regions. Presence of the large subunit (LSU) RNA,
small subunit (SSU) and the presence of 5.8S rRNA or 5S rRNA depending on whether
it is a eukaryotic or prokaryotic genome, respectively. Medium Quality Draft:Many
fragments with little to no review of assembly other than reporting of standard
assembly statistics. Low Quality Draft:Many fragments with little to no review
of assembly other than reporting of standard assembly statistics. Assembly statistics
include, but are not limited to total assembly size, number of contigs, contig
N50/L50, and maximum contig length. For MIUVIG; Finished: Single, validated,
contiguous sequence per replicon without gaps or ambiguities, with extensive
manual review and editing to annotate putative gene functions and transcriptional
units. High-quality draft genome: One or multiple fragments, totaling 90%
of the expected genome or replicon sequence or predicted complete. Genome fragment(s):
One or multiple fragments, totalling < 90% of the expected genome or replicon
sequence, or for which no genome size could be estimated'
title: assembly quality
examples:
- value: High-quality draft genome
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- quality
slot_uri: MIXS:0000056
alias: assembly_qual
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Mims
- Misag
- Miuvig
- Agriculture
range: AssemblyQualEnum
required: true
ref_biomaterial:
name: ref_biomaterial
description: Primary publication if isolated before genome publication; otherwise,
primary genome report
title: reference for biomaterial
examples:
- value: doi:10.1016/j.syapm.2018.01.009
in_subset:
- nucleic acid sequence source
from_schema: https://w3id.org/mixs
slot_uri: MIXS:0000025
alias: ref_biomaterial
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Mims
- Misag
- Miuvig
range: string
pattern: ^^PMID:\d+$|^doi:10.\d{2,9}/.*$|^https?:\/\/(?:www\.)?[-a-zA-Z0-9@:%._\+~#=]{1,256}\.[a-zA-Z0-9()]{1,6}\b(?:[-a-zA-Z0-9()@:%_\+.~#?&\/=]*)$$
structured_pattern:
syntax: ^{PMID}|{DOI}|{URL}$
interpolated: true
partial_match: true
project_name:
name: project_name
description: Name of the project within which the sequencing was organized
title: project name
examples:
- value: Forest soil metagenome
in_subset:
- investigation
from_schema: https://w3id.org/mixs
keywords:
- project
slot_uri: MIXS:0000092
alias: project_name
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Air
- BuiltEnvironment
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodFoodProductionFacility
- FoodHumanFoods
- HostAssociated
- HumanAssociated
- HumanGut
- HumanOral
- HumanSkin
- HumanVaginal
- HydrocarbonResourcesCores
- HydrocarbonResourcesFluidsSwabs
- MicrobialMatBiofilm
- MiscellaneousNaturalOrArtificialEnvironment
- PlantAssociated
- Sediment
- Soil
- SymbiontAssociated
- WastewaterSludge
- Water
range: string
required: true
lib_vector:
name: lib_vector
annotations:
Expected_value:
tag: Expected_value
value: vector
description: Cloning vector type(s) used in construction of libraries
title: library vector
examples:
- value: Bacteriophage P1
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- library
slot_uri: MIXS:0000042
alias: lib_vector
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: string
recommended: true
adapters:
name: adapters
description: Adapters provide priming sequences for both amplification and sequencing
of the sample-library fragments. Both adapters should be reported; in uppercase
letters
title: adapters
examples:
- value: AATGATACGGCGACCACCGAGATCTACACGCT;CAAGCAGAAGACGGCATACGAGAT
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
slot_uri: MIXS:0000048
alias: adapters
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: string
recommended: true
structured_pattern:
syntax: ^{adapter_A_DNA_sequence};{adapter_B_DNA_sequence}$
interpolated: true
partial_match: true
neg_cont_type:
name: neg_cont_type
annotations:
Expected_value:
tag: Expected_value
value: enumeration or text
description: The substance or equipment used as a negative control in an investigation
title: negative control type
in_subset:
- investigation
from_schema: https://w3id.org/mixs
keywords:
- type
slot_uri: MIXS:0001321
alias: neg_cont_type
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
range: NegContTypeEnum
recommended: true
assembly_software:
name: assembly_software
description: Tool(s) used for assembly, including version number and parameters
title: assembly software
examples:
- value: metaSPAdes;3.11.0;kmer set 21,33,55,77,99,121, default parameters otherwise
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- software
slot_uri: MIXS:0000058
alias: assembly_software
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: string
required: true
pattern: ^([^\s-]{1,2}|[^\s-]+.+[^\s-]+);([^\s-]{1,2}|[^\s-]+.+[^\s-]+);([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{software};{version};{parameters}$
interpolated: true
partial_match: true
tax_ident:
name: tax_ident
description: The phylogenetic marker(s) used to assign an organism name to the
SAG or MAG
title: taxonomic identity marker
examples:
- value: other
description: was other <colon> rpoB gene
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- identifier
- marker
- taxon
slot_uri: MIXS:0000053
alias: tax_ident
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Misag
- Miuvig
range: TaxIdentEnum
required: true
contam_score:
name: contam_score
description: 'The contamination score is based on the fraction of single-copy
genes that are observed more than once in a query genome. The following scores
are acceptable for; High Quality Draft: < 5%, Medium Quality Draft: < 10%, Low
Quality Draft: < 10%. Contamination must be below 5% for a SAG or MAG to be
deposited into any of the public databases'
title: contamination score
examples:
- value: '0.01'
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- score
slot_uri: MIXS:0000072
alias: contam_score
owner: Misag
domain_of:
- Mimag
- Misag
range: float
required: true
annot:
name: annot
annotations:
Expected_value:
tag: Expected_value
value: name of tool or pipeline used, or annotation source description
description: Tool used for annotation, or for cases where annotation was provided
by a community jamboree or model organism database rather than by a specific
submitter
title: annotation
examples:
- value: prokka
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
slot_uri: MIXS:0000059
alias: annot
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Mims
- Misag
- Miuvig
- Agriculture
range: string
x16s_recover_software:
name: x16s_recover_software
description: Tools used for 16S rRNA gene extraction
title: 16S recovery software
examples:
- value: rambl;v2;default parameters
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- recover
- software
slot_uri: MIXS:0000066
alias: x16s_recover_software
owner: Misag
domain_of:
- Mimag
- Misag
range: string
pattern: ^([^\s-]{1,2}|[^\s-]+.+[^\s-]+);([^\s-]{1,2}|[^\s-]+.+[^\s-]+);([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{software};{version};{parameters}$
interpolated: true
partial_match: true
x16s_recover:
name: x16s_recover
description: Can a 16S gene be recovered from the submitted SAG or MAG?
title: 16S recovered
examples:
- value: 'yes'
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- recover
slot_uri: MIXS:0000065
alias: x16s_recover
owner: Misag
domain_of:
- Mimag
- Misag
range: boolean
pos_cont_type:
name: pos_cont_type
description: The substance, mixture, product, or apparatus used to verify that
a process which is part of an investigation delivers a true positive
title: positive control type
in_subset:
- investigation
from_schema: https://w3id.org/mixs
keywords:
- type
string_serialization: '{term} or {text}'
slot_uri: MIXS:0001322
alias: pos_cont_type
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
range: string
recommended: true
feat_pred:
name: feat_pred
description: Method used to predict UViGs features such as ORFs, integration site,
etc
title: feature prediction
examples:
- value: Prodigal;2.6.3;default parameters
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- feature
- predict
slot_uri: MIXS:0000061
alias: feat_pred
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Mims
- Misag
- Miuvig
range: string
pattern: ^([^\s-]{1,2}|[^\s-]+.+[^\s-]+);([^\s-]{1,2}|[^\s-]+.+[^\s-]+);([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{software};{version};{parameters}$
interpolated: true
partial_match: true
compl_software:
name: compl_software
annotations:
Expected_value:
tag: Expected_value
value: names and versions of software(s) used
description: Tools used for completion estimate, i.e. checkm, anvi'o, busco
title: completeness software
examples:
- value: checkm
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- software
string_serialization: '{software};{version}'
slot_uri: MIXS:0000070
alias: compl_software
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Misag
- Miuvig
range: string
required: true
env_local_scale:
name: env_local_scale
annotations:
Expected_value:
tag: Expected_value
value: Environmental entities having causal influences upon the entity at
time of sampling
description: 'Report the entity or entities which are in the sample or specimen
s local vicinity and which you believe have significant causal influences on
your sample or specimen. We recommend using EnvO terms which are of smaller
spatial grain than your entry for env_broad_scale. Terms, such as anatomical
sites, from other OBO Library ontologies which interoperate with EnvO (e.g.
UBERON) are accepted in this field. EnvO documentation about how to use the
field: https://github.com/EnvironmentOntology/envo/wiki/Using-ENVO-with-MIxS'
title: local environmental context
examples:
- value: hillside [ENVO:01000333]
in_subset:
- environment
from_schema: https://w3id.org/mixs
keywords:
- context
- environmental
slot_uri: MIXS:0000013
alias: env_local_scale
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
range: string
required: true
structured_pattern:
syntax: ^{termLabel} \[{termID}\]$
interpolated: true
partial_match: true
sort_tech:
name: sort_tech
description: Method used to sort/isolate cells or particles of interest
title: sorting technology
examples:
- value: optical manipulation
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
slot_uri: MIXS:0000075
alias: sort_tech
owner: Misag
domain_of:
- Misag
- Miuvig
range: SortTechEnum
required: true
samp_mat_process:
name: samp_mat_process
description: A brief description of any processing applied to the sample during
or after retrieving the sample from environment, or a link to the relevant protocol(s)
performed
title: sample material processing
examples:
- value: filtering of seawater, storing samples in ethanol
in_subset:
- nucleic acid sequence source
from_schema: https://w3id.org/mixs
keywords:
- material
- process
- sample
slot_uri: MIXS:0000016
alias: samp_mat_process
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: string
recommended: true
sim_search_meth:
name: sim_search_meth
description: Tool used to compare ORFs with database, along with version and cutoffs
used
title: similarity search method
examples:
- value: HMMER3;3.1b2;hmmsearch, cutoff of 50 on score
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- method
slot_uri: MIXS:0000063
alias: sim_search_meth
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Mims
- Misag
- Miuvig
range: string
pattern: ^([^\s-]{1,2}|[^\s-]+.+[^\s-]+);([^\s-]{1,2}|[^\s-]+.+[^\s-]+);([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{software};{version};{parameters}$
interpolated: true
partial_match: true
depth:
name: depth
annotations:
Preferred_unit:
tag: Preferred_unit
value: meter
description: The vertical distance below local surface. For sediment or soil samples
depth is measured from sediment or soil surface, respectively. Depth can be
reported as an interval for subsurface samples
title: depth
examples:
- value: 10 meter
in_subset:
- environment
from_schema: https://w3id.org/mixs
keywords:
- depth
slot_uri: MIXS:0000018
alias: depth
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
- FoodFarmEnvironment
- HostAssociated
- MicrobialMatBiofilm
- MiscellaneousNaturalOrArtificialEnvironment
- PlantAssociated
- Sediment
- Soil
- SymbiontAssociated
- WastewaterSludge
- Water
range: string
recommended: true
pattern: ^[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?( *- *[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?)?
*([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{scientific_float}( *- *{scientific_float})? *{text}$
interpolated: true
partial_match: true
samp_collect_method:
name: samp_collect_method
description: The method employed for collecting the sample
title: sample collection method
examples:
- value: swabbing
in_subset:
- nucleic acid sequence source
from_schema: https://w3id.org/mixs
keywords:
- method
- sample
slot_uri: MIXS:0001225
alias: samp_collect_method
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
- FoodAnimalAndAnimalFeed
- FoodFoodProductionFacility
- FoodHumanFoods
range: string
recommended: true
pattern: ^^PMID:\d+$|^doi:10.\d{2,9}/.*$|^https?:\/\/(?:www\.)?[-a-zA-Z0-9@:%._\+~#=]{1,256}\.[a-zA-Z0-9()]{1,6}\b(?:[-a-zA-Z0-9()@:%_\+.~#?&\/=]*)$|([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{PMID}|{DOI}|{URL}|{text}$
interpolated: true
partial_match: true
wga_amp_appr:
name: wga_amp_appr
description: Method used to amplify genomic DNA in preparation for sequencing
title: WGA amplification approach
examples:
- value: mda based
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
slot_uri: MIXS:0000055
alias: wga_amp_appr
owner: Misag
domain_of:
- Misag
- Miuvig
range: WgaAmpApprEnum
required: true
compl_appr:
name: compl_appr
annotations:
Expected_value:
tag: Expected_value
value: text
description: The approach used to determine the completeness of a given genomic
assembly, which would typically make use of a set of conserved marker genes
or a closely related reference genome. For UViG completeness, include reference
genome or group used, and contig feature suggesting a complete genome
title: completeness approach
examples:
- value: other
description: was other <colon> UViG length compared to the average length of
reference genomes from the P22virus genus (NCBI RefSeq v83)
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
slot_uri: MIXS:0000071
alias: compl_appr
owner: Misag
domain_of:
- Mimag
- Misag
- Miuvig
range: ComplApprEnum
env_medium:
name: env_medium
description: 'Report the environmental material(s) immediately surrounding the
sample or specimen at the time of sampling. We recommend using subclasses of
''environmental material'' (http://purl.obolibrary.org/obo/ENVO_00010483). EnvO
documentation about how to use the field: https://github.com/EnvironmentOntology/envo/wiki/Using-ENVO-with-MIxS
. Terms from other OBO ontologies are permissible as long as they reference
mass/volume nouns (e.g. air, water, blood) and not discrete, countable entities
(e.g. a tree, a leaf, a table top)'
title: environmental medium
examples:
- value: bluegrass field soil [ENVO:00005789]
in_subset:
- environment
from_schema: https://w3id.org/mixs
keywords:
- environmental
slot_uri: MIXS:0000014
alias: env_medium
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
range: string
required: true
pattern: ^([^\s-]{1,2}|[^\s-]+.+[^\s-]+) \[[a-zA-Z]{2,}:[a-zA-Z0-9]\d+\]$
structured_pattern:
syntax: ^{termLabel} \[{termID}\]$
interpolated: true
partial_match: true
samp_taxon_id:
name: samp_taxon_id
description: NCBI taxon id of the sample. Maybe be a single taxon or mixed taxa
sample. Use 'synthetic metagenome for mock community/positive controls, or
'blank sample' for negative controls
title: taxonomy ID of DNA sample
examples:
- value: Gut Metagenome [NCBITaxon:749906]
in_subset:
- investigation
from_schema: https://w3id.org/mixs
keywords:
- dna
- identifier
- sample
- taxon
slot_uri: MIXS:0001320
alias: samp_taxon_id
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
range: string
required: true
pattern: ^([^\s-]{1,2}|[^\s-]+.+[^\s-]+) \[NCBITaxon:\d+\]$
structured_pattern:
syntax: ^{text} \[{NCBItaxon_id}\]$
interpolated: true
partial_match: true
geo_loc_name:
name: geo_loc_name
description: The geographical origin of the sample as defined by the country or
sea name followed by specific region name. Country or sea names should be chosen
from the INSDC country list (http://insdc.org/country.html), or the GAZ ontology
(http://purl.bioontology.org/ontology/GAZ)
title: geographic location (country and/or sea,region)
examples:
- value: 'USA: Maryland, Bethesda'
in_subset:
- environment
from_schema: https://w3id.org/mixs
keywords:
- geographic
- location
slot_uri: MIXS:0000010
alias: geo_loc_name
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodFoodProductionFacility
- FoodHumanFoods
- SymbiontAssociated
range: string
required: true
pattern: '^([^\s-]{1,2}|[^\s-]+.+[^\s-]+): ([^\s-]{1,2}|[^\s-]+.+[^\s-]+), ([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$'
structured_pattern:
syntax: '^{text}: {text}, {text}$'
interpolated: true
partial_match: true
sc_lysis_approach:
name: sc_lysis_approach
description: Method used to free DNA from interior of the cell(s) or particle(s)
title: single cell or viral particle lysis approach
examples:
- value: enzymatic
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- particle
- single
slot_uri: MIXS:0000076
alias: sc_lysis_approach
owner: Misag
domain_of:
- Misag
- Miuvig
range: ScLysisApproachEnum
required: true
collection_date:
name: collection_date
description: 'The time of sampling, either as an instance (single point in time)
or interval. In case no exact time is available, the date/time can be right
truncated i.e. all of these are valid times: 2008-01-23T19:23:10+00:00; 2008-01-23T19:23:10;
2008-01-23; 2008-01; 2008; Except: 2008-01; 2008 all are ISO8601 compliant'
title: collection date
examples:
- value: '2013-03-25T12:42:31+01:00'
in_subset:
- environment
from_schema: https://w3id.org/mixs
keywords:
- date
slot_uri: MIXS:0000011
alias: collection_date
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodFoodProductionFacility
- FoodHumanFoods
- SymbiontAssociated
range: datetime
required: true
seq_meth:
name: seq_meth
description: Sequencing machine used. Where possible the term should be taken
from the OBI list of DNA sequencers (http://purl.obolibrary.org/obo/OBI_0400103)
title: sequencing method
examples:
- value: 454 Genome Sequencer FLX [OBI:0000702]
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- method
slot_uri: MIXS:0000050
alias: seq_meth
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodFoodProductionFacility
- FoodHumanFoods
range: string
required: true
pattern: ^([^\s-]{1,2}|[^\s-]+.+[^\s-]+)|(([^\s-]{1,2}|[^\s-]+.+[^\s-]+) \[[a-zA-Z]{2,}:[a-zA-Z0-9]\d+\])$
structured_pattern:
syntax: ^{text}|({termLabel} \[{termID}\])$
interpolated: true
partial_match: true
lat_lon:
name: lat_lon
description: The geographical origin of the sample as defined by latitude and
longitude. The values should be reported in decimal degrees, limited to 8 decimal
points, and in WGS84 system
title: geographic location (latitude and longitude)
examples:
- value: 50.586825 6.408977
in_subset:
- environment
from_schema: https://w3id.org/mixs
keywords:
- geographic
- location
slot_uri: MIXS:0000009
alias: lat_lon
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- FoodAnimalAndAnimalFeed
- FoodFarmEnvironment
- FoodFoodProductionFacility
- FoodHumanFoods
- SymbiontAssociated
range: string
required: true
pattern: ^(-?((?:[0-8]?[0-9](?:\.\d{0,8})?)|90)) -?[0-9]+(?:\.[0-9]{0,8})?$|^-?(1[0-7]{1,2})$
structured_pattern:
syntax: ^{lat} {lon}$
interpolated: true
partial_match: true
elev:
name: elev
annotations:
Preferred_unit:
tag: Preferred_unit
value: meter
description: Elevation of the sampling site is its height above a fixed reference
point, most commonly the mean sea level. Elevation is mainly used when referring
to points on the earth's surface, while altitude is used for points above the
surface, such as an aircraft in flight or a spacecraft in orbit
title: elevation
examples:
- value: 100 meter
in_subset:
- environment
from_schema: https://w3id.org/mixs
keywords:
- elevation
slot_uri: MIXS:0000093
alias: elev
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
- Air
- HostAssociated
- HydrocarbonResourcesCores
- MicrobialMatBiofilm
- MiscellaneousNaturalOrArtificialEnvironment
- PlantAssociated
- Sediment
- Soil
- SymbiontAssociated
- Water
range: string
recommended: true
pattern: ^[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?( *- *[-+]?[0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?)?
*([^\s-]{1,2}|[^\s-]+.+[^\s-]+)$
structured_pattern:
syntax: ^{scientific_float}( *- *{scientific_float})? *{text}$
interpolated: true
partial_match: true
env_broad_scale:
name: env_broad_scale
description: 'Report the major environmental system the sample or specimen came
from. The system(s) identified should have a coarse spatial grain, to provide
the general environmental context of where the sampling was done (e.g. in the
desert or a rainforest). We recommend using subclasses of EnvO s biome class: http://purl.obolibrary.org/obo/ENVO_00000428.
EnvO documentation about how to use the field: https://github.com/EnvironmentOntology/envo/wiki/Using-ENVO-with-MIxS'
title: broad-scale environmental context
examples:
- value: rangeland biome [ENVO:01000247]
in_subset:
- environment
from_schema: https://w3id.org/mixs
keywords:
- context
- environmental
slot_uri: MIXS:0000012
alias: env_broad_scale
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
range: string
required: true
pattern: ^([^\s-]{1,2}|[^\s-]+.+[^\s-]+) \[[a-zA-Z]{2,}:[a-zA-Z0-9]\d+\]$
structured_pattern:
syntax: ^{termLabel} \[{termID}\]$
interpolated: true
partial_match: true
tax_class:
name: tax_class
description: Method used for taxonomic classification, along with reference database
used, classification rank, and thresholds used to classify new genomes
title: taxonomic classification
examples:
- value: vConTACT vContact2 (references from NCBI RefSeq v83, genus rank classification,
default parameters)
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- classification
- taxon
slot_uri: MIXS:0000064
alias: tax_class
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- Mims
- Misag
- Miuvig
range: string
experimental_factor:
name: experimental_factor
annotations:
Expected_value:
tag: Expected_value
value: text or EFO and/or OBI
description: Variable aspects of an experiment design that can be used to describe
an experiment, or set of experiments, in an increasingly detailed manner. This
field accepts ontology terms from Experimental Factor Ontology (EFO) and/or
Ontology for Biomedical Investigations (OBI)
title: experimental factor
examples:
- value: time series design [EFO:0001779]
in_subset:
- investigation
from_schema: https://w3id.org/mixs
keywords:
- experimental
- factor
string_serialization: '{termLabel} [{termID}]|{text}'
slot_uri: MIXS:0000008
alias: experimental_factor
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- FoodAnimalAndAnimalFeed
- FoodFoodProductionFacility
- FoodHumanFoods
range: string
recommended: true
multivalued: true
pattern: ^\S+.*\S+ \[[a-zA-Z]{2,}:\d+\]$
associated_resource:
name: associated_resource
annotations:
Expected_value:
tag: Expected_value
value: reference to resource
description: A related resource that is referenced, cited, or otherwise associated
to the sequence
title: relevant electronic resources
examples:
- value: http://www.earthmicrobiome.org/
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- resource
string_serialization: '{PMID}|{DOI}|{URL}'
slot_uri: MIXS:0000091
alias: associated_resource
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: string
recommended: true
multivalued: true
sop:
name: sop
annotations:
Expected_value:
tag: Expected_value
value: reference to SOP
description: Standard operating procedures used in assembly and/or annotation
of genomes, metagenomes or environmental sequences
title: relevant standard operating procedures
examples:
- value: http://press.igsb.anl.gov/earthmicrobiome/protocols-and-standards/its/
in_subset:
- sequencing
from_schema: https://w3id.org/mixs
keywords:
- procedures
string_serialization: '{PMID}|{DOI}|{URL}'
slot_uri: MIXS:0000090
alias: sop
owner: Misag
domain_of:
- MigsBa
- MigsEu
- MigsOrg
- MigsPl
- MigsVi
- Mimag
- MimarksC
- MimarksS
- Mims
- Misag
- Miuvig
- Agriculture
range: string
recommended: true
multivalued: true
class_uri: MIXS:0010010