Gene loci information

Transcript annotation

  • This transcript has been annotated as Neutral alpha-glucosidase AB.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g12218 g12218.t1 TSS g12218.t1 21788052 21788052
chr_1 g12218 g12218.t1 isoform g12218.t1 21788418 21791376
chr_1 g12218 g12218.t1 exon g12218.t1.exon1 21788418 21788521
chr_1 g12218 g12218.t1 cds g12218.t1.CDS1 21788418 21788521
chr_1 g12218 g12218.t1 exon g12218.t1.exon2 21788583 21788724
chr_1 g12218 g12218.t1 cds g12218.t1.CDS2 21788583 21788724
chr_1 g12218 g12218.t1 exon g12218.t1.exon3 21788788 21791102
chr_1 g12218 g12218.t1 cds g12218.t1.CDS3 21788788 21791102
chr_1 g12218 g12218.t1 exon g12218.t1.exon4 21791163 21791376
chr_1 g12218 g12218.t1 cds g12218.t1.CDS4 21791163 21791376
chr_1 g12218 g12218.t1 TTS g12218.t1 21791521 21791521

Sequences

>g12218.t1 Gene=g12218 Length=2775
ATGAGTCGTAGACTGTGTCTCCTCATTTTCACTCTTTTCATTATAAATGCATTAGCTGTA
GACAAAAATAATTTTAAGACTTGTGAACAAGCCAGTTTTTGCAAACGTTCCCGTAATGTA
AATTATGAGAAATCACCTTACGAAATTGTGCCGAACACATTAAATATTAATGCAAACTAC
ATCACAGCTGATATAGTCAATAAAGAAAATGGACACTTGTTCGTGCTTAAACTTGAAGGC
ATCAAAGGAAATACATTCCATTTTGAAATTGATGAGAAAACTCCACTTAAGCCACGTTAT
CGTGTTGTAGATGCATTGCAAAAAGAACCAACATATGAGGCTGTTAAAGTATCGTCAAGC
ACATCTGACATAACTGTGACATGTGGCGAGAATAAAGCAGTTATATCGACTGCACCATTC
AAAATAGATTTCTATCAAAAAGATGTTTTGTCAGTAACTGCAAACAGTAAAGCAAAAATG
CATTTCGAACATTTACGTAAAAAATTAGAAGGCGAACAACATCAGCAAAATAATGAAAAT
GAAGAAAAAGACAATGTAGTTGATAATAATCAACAAATACAAGATGAACCAGGAACTTGG
GAAGAAAATTTTAAAGGACATCATGATACCAAACCTAACGGACCAACAGCAGTCGCTCTT
GATTTCACTTTTCATCAATCAAACGTTCTCTTTGGAATTCCTGAACATGCAGACTCATTT
GCTCTTAAAACAACAGTTGGAAATGGTGATCCTTATCGCTTATACAATTTAGATGTGTTT
GAGTATGAATTGAATAGCAGAATGGCATTATATGGTTCAGTTCCAGTTATTTATGGTCAT
GGTGAAAAAGGAACTACTGGAATTTTCTGGCAAAATGCTGCTGAAACATGGATTGATATT
GCAGGCACTTCAAATGAACAAAATGTTATGGAGTCAATAGTAAACTTTGTTGCTGGATCT
CGTAATCAAGATGCTCCTTCTGCTCATTTTATCTCAGAGTCTGGAATTATGGATGTTTTC
GTTCTTCTTGGACCAACACCAAATGAAGCTTTCAAACAATATACTGATCTCATTGGGCAT
GGAAATTTGCCACAAATGTTTGCAATTGGCTATCATCAATGCCGTTGGAATTATAATGAT
GAACAAGATGTCATGAGCGTTGCTCACAAATTTGACGAACACGATATTCCTATGGATACA
ATGTGGCTTGATATTGAATATACTGATAGCAAAAAATATTTCACTTGGGATCATCAAAAA
TTTCCACATCCAGTTGAAATGATTAAAAATCTCACTGACATGGGCAGACATTTGACTTTT
ATTATTGATCCTCATATTAAACGTGATTCCGGATATTTTTTCCACAATGATTGTACTGAT
CGTGGATATTATATTAAAACAAAAGAAGGAAAAGACTACGAGGGATGGTGTTGGCCTGGT
TCTGCAAGTTATCCTGATTTCTTTAATCCTGAAATTCGCAAATATTATGCCGATCAATAT
TTGATGGAAAATTTCCATGAAAATTCACCTGAAACAGGAATTTGGAATGATATGAATGAA
CCTTCAGTTTTTAATGGACCTGAAATTACAATGTTAAAGGATAATCTTCATTATGGTGGC
TGGGAACATCGTGATGTTCACAATCTTTATGGACAAATGCAACTTGTTGGAACTTTTGAT
GGATTGATTAGACGCAGTCAAAATCAATTGCGTCCATTTATTCTCACTCGTGCTCACTTT
TCTGGAACACAACGTTATGCTGCGGTATGGACAGGTGATAATGCAGCAGAATGGGGACAT
TTAGAAGCCGCAATTCCTATGTGTTTATCACTATCTGTTGCTGGAATTTCATTTTGCGGA
TCAGATGTTGGTGGTTTCTTTGGAAATCCTGAAGCTGAGCTTTTCTCTCGTTGGTATCAA
ACTAGTGCATTCATGCCATTCTTCAGAAGTCATGCTCATATTGATACTAAACGTCGTGAA
CCATGGTTATTCCCAGAAGAAACTAAGTTGATTATTCGTGATGCAATTAGAAAGCGTTAT
TCCTTCTTGCCTTTCTGGTATACAATTTTCTATGAACATGAACGCACTGGTATGCCTATT
ATGAGACCATTACTCGCTCAATATCCATTAGATAAGAATGTCTTTTCAATTGACACACAT
TTCATGTTGAGTGATAAACTATTAATTCGTCCTGTTATGCAAAAAGGAGCAAGCAAAGTT
GATGTTTATTTCCCATCACGCAATGGTGATAAACAAGGAGACATTTGGTATGATATTGAT
GATTATCGTAAAATCGATCATGTTGGAAGTGAAAGTGTAACAGTTAATGCTTATAAAGTT
CCCGTCTTTCAACGTGGTGGTACAATCATTCCAAGGAAGATGAGAATTCGTCGTTCAGCA
ATTCTTATGCATGATGATCCAATTACATTAGTTGTTGCTCTTGATGCTAAAAAACATGCC
AAAGGAACATTATATATTGATGATGAGAAGAGTTATGGCTATCGCCAAGGAAAATATTTG
TACTTAGAATTTGAATTTAAAGATGGTGTTTTGTCAAGCAGACTGATTGATGAAAGTGCA
TCATATAACACAAAAACATGGCTTGAACGAGTTATTATTGTCGGCTTAGATAAAGTGCCA
ACTGAAGCAACATTAAGAACATCATCGAATCAATCGACAAAACTTGAAATTGTTGATGCC
TCAAAGACTTCTTTCACAATTAGAAAGCCTGGTGTTAGCATGATGGAACCAACGTGGTCG
ATTACACTCAACTAA

>g12218.t1 Gene=g12218 Length=924
MSRRLCLLIFTLFIINALAVDKNNFKTCEQASFCKRSRNVNYEKSPYEIVPNTLNINANY
ITADIVNKENGHLFVLKLEGIKGNTFHFEIDEKTPLKPRYRVVDALQKEPTYEAVKVSSS
TSDITVTCGENKAVISTAPFKIDFYQKDVLSVTANSKAKMHFEHLRKKLEGEQHQQNNEN
EEKDNVVDNNQQIQDEPGTWEENFKGHHDTKPNGPTAVALDFTFHQSNVLFGIPEHADSF
ALKTTVGNGDPYRLYNLDVFEYELNSRMALYGSVPVIYGHGEKGTTGIFWQNAAETWIDI
AGTSNEQNVMESIVNFVAGSRNQDAPSAHFISESGIMDVFVLLGPTPNEAFKQYTDLIGH
GNLPQMFAIGYHQCRWNYNDEQDVMSVAHKFDEHDIPMDTMWLDIEYTDSKKYFTWDHQK
FPHPVEMIKNLTDMGRHLTFIIDPHIKRDSGYFFHNDCTDRGYYIKTKEGKDYEGWCWPG
SASYPDFFNPEIRKYYADQYLMENFHENSPETGIWNDMNEPSVFNGPEITMLKDNLHYGG
WEHRDVHNLYGQMQLVGTFDGLIRRSQNQLRPFILTRAHFSGTQRYAAVWTGDNAAEWGH
LEAAIPMCLSLSVAGISFCGSDVGGFFGNPEAELFSRWYQTSAFMPFFRSHAHIDTKRRE
PWLFPEETKLIIRDAIRKRYSFLPFWYTIFYEHERTGMPIMRPLLAQYPLDKNVFSIDTH
FMLSDKLLIRPVMQKGASKVDVYFPSRNGDKQGDIWYDIDDYRKIDHVGSESVTVNAYKV
PVFQRGGTIIPRKMRIRRSAILMHDDPITLVVALDAKKHAKGTLYIDDEKSYGYRQGKYL
YLEFEFKDGVLSSRLIDESASYNTKTWLERVIIVGLDKVPTEATLRTSSNQSTKLEIVDA
SKTSFTIRKPGVSMMEPTWSITLN

Protein features from InterProScan

Transcript Database ID Name Start End E.value
19 g12218.t1 CDD cd14752 GH31_N 216 359 3.39372E-24
20 g12218.t1 CDD cd06603 GH31_GANC_GANAB_alpha 359 831 0.0
13 g12218.t1 Coils Coil Coil 162 196 -
12 g12218.t1 Gene3D G3DSA:3.20.20.80 Glycosidases 362 681 4.3E-110
10 g12218.t1 Gene3D G3DSA:2.60.40.1180 - 683 786 1.2E-24
11 g12218.t1 Gene3D G3DSA:2.60.40.1180 - 787 920 1.2E-15
23 g12218.t1 MobiDBLite mobidb-lite consensus disorder prediction 170 189 -
3 g12218.t1 PANTHER PTHR22762 ALPHA-GLUCOSIDASE 12 912 0.0
4 g12218.t1 PANTHER PTHR22762:SF54 BCDNA.GH04962 12 912 0.0
1 g12218.t1 Pfam PF13802 Galactose mutarotase-like 229 299 1.8E-22
2 g12218.t1 Pfam PF01055 Glycosyl hydrolases family 31 340 790 8.4E-143
15 g12218.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 19 -
16 g12218.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 4 -
17 g12218.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 5 15 -
18 g12218.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 16 19 -
14 g12218.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 20 924 -
22 g12218.t1 ProSitePatterns PS00129 Glycosyl hydrolases family 31 active site. 513 520 -
6 g12218.t1 SUPERFAMILY SSF74650 Galactose mutarotase-like 54 347 1.54E-16
7 g12218.t1 SUPERFAMILY SSF51445 (Trans)glycosidases 349 697 1.14E-82
5 g12218.t1 SUPERFAMILY SSF51011 Glycosyl hydrolase domain 699 786 8.63E-13
9 g12218.t1 SignalP_EUK SignalP-noTM SignalP-noTM 1 19 -
21 g12218.t1 SignalP_GRAM_NEGATIVE SignalP-noTM SignalP-noTM 1 19 -
8 g12218.t1 SignalP_GRAM_POSITIVE SignalP-TM SignalP-TM 1 19 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0030246 carbohydrate binding MF
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds MF
GO:0005975 carbohydrate metabolic process BP
GO:0003824 catalytic activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values