| Title: | Glycan Structure Database |
|---|---|
| Description: | Provides a comprehensive database of glycan structures from GlyTouCan, including fully determined glycan structures with complete linkage, substituent, anomer, and monosaccharide information. This database serves as a foundational resource for the glycoverse ecosystem, enabling glycan structure analysis, comparison, and research applications. |
| Authors: | Bin Fu [aut, cre, cph] (ORCID: <https://orcid.org/0000-0001-8567-2997>) |
| Maintainer: | Bin Fu <[email protected]> |
| License: | MIT + file LICENSE |
| Version: | 0.5.0 |
| Built: | 2026-06-03 08:35:44 UTC |
| Source: | https://github.com/glycoverse/glydb |
Get unique glycan compositions from glydb_data as a glyrepr::glycan_composition() vector.
glydb_compositions( mono_type = "concrete", species = NULL, glycan_type = NULL, mono_range = NULL )glydb_compositions( mono_type = "concrete", species = NULL, glycan_type = NULL, mono_range = NULL )
mono_type |
Either "generic" or "concrete". Default is "concrete".
See |
species |
A string of specie names. See |
glycan_type |
A string of glycan types. Can be "N", "O-GalNAc", "O-GlcNAc", "O-Man", "O-Fuc", "O-Glc". Default is NULL, which means glycans of all types are included. |
mono_range |
A named list for filtering compositions by monosaccharide counts.
Each element should be an integer vector of length 2 specifying the minimum and maximum
count for that monosaccharide. Monosaccharides not specified will be excluded (count = 0).
Use |
A glyrepr::glycan_composition() vector, with a confidence attribute as a
numeric vector with the same length.
The returned value has a confidence attribute:
a numeric vector of the same length as the result containing log-transformed
citation counts for each glycan in glydb_data.
When multiple glycans are aggregated into lower-resolution structures or compositions,
the maximum confidence score is retained.
Note that the confidence attribute will be lost after any vector operation like subsetting.
Therefore, if used with glyanno, the returned value should not be modified manually.
glydb_compositions() glydb_compositions(mono_type = "generic") glydb_compositions(species = "Homo sapiens") glydb_compositions(glycan_type = "N") glydb_compositions(glycan_type = "N", mono_range = list(Hex = c(5L, 10L))) glydb_compositions(mono_range = list(Hex = c(3L, 9L), HexNAc = c(2L, 6L)))glydb_compositions() glydb_compositions(mono_type = "generic") glydb_compositions(species = "Homo sapiens") glydb_compositions(glycan_type = "N") glydb_compositions(glycan_type = "N", mono_range = list(Hex = c(5L, 10L))) glydb_compositions(mono_range = list(Hex = c(3L, 9L), HexNAc = c(2L, 6L)))
A curated dataset of fully determined glycans from GlyTouCan. "Fully determined" means that all linkages, substituents, anomers, and monosaccharides are fully specified. The dataset is derived from the GlyTouCan v2.11.1 release, with 7,125 glycan structures currently available.
glydb_dataglydb_data
A tibble with 7,125 rows and 5 variables:
glytoucan_ac: GlyTouCan accession.
glycan_structure: Glycan structure (glyrepr::glycan_structure()).
glycan_composition: Glycan composition (glyrepr::glycan_composition()).
species: Specie names, separated by semicolons. Unknown species are NAs.
glycan_type: Glycan type, one of "N", "O-GalNAc", "O-GlcNAc", "O-Man", "O-Fuc", "O-Glc".
Get a character vector of supported species from glydb_data.
glydb_species()glydb_species()
A character vector of supported species.
glydb_species()glydb_species()
Get unique glycan structures from glydb_data as a glyrepr::glycan_structure() vector.
glydb_structures( structure_level = "intact", species = NULL, glycan_type = NULL, mono_range = NULL )glydb_structures( structure_level = "intact", species = NULL, glycan_type = NULL, mono_range = NULL )
structure_level |
Either "intact", "topological", or "basic". Default is "intact".
See |
species |
A string of specie names. See |
glycan_type |
A string of glycan types. Can be "N", "O-GalNAc", "O-GlcNAc", "O-Man", "O-Fuc", "O-Glc". Default is NULL, which means glycans of all types are included. |
mono_range |
A named list for filtering structures by monosaccharide counts.
Each element should be an integer vector of length 2 specifying the minimum and maximum
count for that monosaccharide. Monosaccharides not specified will be excluded (count = 0).
Use |
A glyrepr::glycan_structure() vector, with a confidence attribute as a
numeric vector with the same length.
The returned value has a confidence attribute:
a numeric vector of the same length as the result containing log-transformed
citation counts for each glycan in glydb_data.
When multiple glycans are aggregated into lower-resolution structures or compositions,
the maximum confidence score is retained.
Note that the confidence attribute will be lost after any vector operation like subsetting.
Therefore, if used with glyanno, the returned value should not be modified manually.
glydb_structures() glydb_structures(structure_level = "topological") glydb_structures(structure_level = "basic") glydb_structures(species = "Homo sapiens") glydb_structures(glycan_type = "N") glydb_structures(glycan_type = "N", mono_range = list(Hex = c(5L, 10L))) glydb_structures(mono_range = list(Hex = c(3L, 9L), HexNAc = c(2L, 6L)))glydb_structures() glydb_structures(structure_level = "topological") glydb_structures(structure_level = "basic") glydb_structures(species = "Homo sapiens") glydb_structures(glycan_type = "N") glydb_structures(glycan_type = "N", mono_range = list(Hex = c(5L, 10L))) glydb_structures(mono_range = list(Hex = c(3L, 9L), HexNAc = c(2L, 6L)))
Fetch GlyTouCan accessions from the GlyGen API and parse the returned IUPAC
strings as glyrepr::glycan_structure() values.
glytoucan_to_struc(glytoucan_ac)glytoucan_to_struc(glytoucan_ac)
glytoucan_ac |
A character vector of GlyTouCan accessions. |
A glyrepr::glycan_structure() vector. Accessions that cannot be
fetched or parsed are returned as NA values in their original positions,
and a warning is emitted.
glytoucan_to_struc("G17689DH")glytoucan_to_struc("G17689DH")