Gene loci information

Transcript annotation

  • This transcript has been annotated as Probable beta-hexosaminidase fdl.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g8590 g8590.t2 isoform g8590.t2 31610577 31613577
chr_2 g8590 g8590.t2 exon g8590.t2.exon1 31610577 31610767
chr_2 g8590 g8590.t2 cds g8590.t2.CDS1 31610577 31610767
chr_2 g8590 g8590.t2 TSS g8590.t2 31610606 31610606
chr_2 g8590 g8590.t2 exon g8590.t2.exon2 31611361 31611839
chr_2 g8590 g8590.t2 cds g8590.t2.CDS2 31611361 31611839
chr_2 g8590 g8590.t2 exon g8590.t2.exon3 31611907 31612063
chr_2 g8590 g8590.t2 cds g8590.t2.CDS3 31611907 31612063
chr_2 g8590 g8590.t2 exon g8590.t2.exon4 31612121 31612404
chr_2 g8590 g8590.t2 cds g8590.t2.CDS4 31612121 31612404
chr_2 g8590 g8590.t2 exon g8590.t2.exon5 31612465 31612553
chr_2 g8590 g8590.t2 cds g8590.t2.CDS5 31612465 31612553
chr_2 g8590 g8590.t2 exon g8590.t2.exon6 31612613 31612742
chr_2 g8590 g8590.t2 cds g8590.t2.CDS6 31612613 31612742
chr_2 g8590 g8590.t2 exon g8590.t2.exon7 31612804 31612861
chr_2 g8590 g8590.t2 cds g8590.t2.CDS7 31612804 31612861
chr_2 g8590 g8590.t2 exon g8590.t2.exon8 31612924 31613260
chr_2 g8590 g8590.t2 cds g8590.t2.CDS8 31612924 31613260
chr_2 g8590 g8590.t2 exon g8590.t2.exon9 31613326 31613577
chr_2 g8590 g8590.t2 cds g8590.t2.CDS9 31613326 31613577
chr_2 g8590 g8590.t2 TTS g8590.t2 NA NA

Sequences

>g8590.t2 Gene=g8590 Length=1977
ATGGCATTTACAACATCACTCAGAAGGCTTCTGATAGGTCTTTTATTTGTAATGGCATTA
TGTTTGATGTTATTATATTTTAATGAAAGTCAGAGTCTTCTGAAACCTGCGTCATCCTTT
CTCATCAATAGCTTTGATACAAGACACAATAAACTCAAGTTAGTGGCTGATGTAGCTGAT
AATTTTAAAATTCCATCAGAAAAAGCGTGGACGTACAAATGTGTTAATCAAAAATGCGTT
AGACAGCATTATTCAAATAAGAATGAAAAAAGAGTTCCATTTACGTCATGTAGTATGCTA
TGTGGAAGTTACTCAAAAATTTGGCCAGAGCCAACAAAATCTTACATTGGAACAAGTGCC
AATAGTTTTAATTTAAATGATGTTCAATTTAAAATCAAAACTCCTTTTAAGAATGTTGAG
AATTTATTAGAAAGTGCATTTTCAATTTTTATGGAAGAGATTAAGGATATTTTTCATTCG
TCATCGGATCGAAAGACTGAAGAAGTTAGTAAGAGCAGCACAACTGCTAATTATAAGGAG
CGTTCGCCATTAGACACTAAACATCGACGTCATAATCTTACTAATGTTAATATTTTTCTA
TATGTCATCAAAACTTCTGAAATTCATCTTTCATTAACTACAGACGAATGTTATAATTTA
ACTATGATAAATGAACATCAAACAGTTGAAGTAAGAATTACTGCCAATTCATTTTTTGGT
GCTCGCAATGGCCTCGCAACTTTGCAGCAATTAATTTGGTTTGATGATGAAGATGAGTCA
TTGAAAATTATAGGAAGTGCAAATATTGCTGATTGTCCCAAATTTAACTATCGAGGTCTA
ATGCTTGACACGTCACGGCATTATTTTTCAGTTGAATCAATTAAGAGAACATTAGTCGCT
ATGTCACATTCAAAATTAAATCGATTTCATTGGCATCTCACTGATTCACAGAGTTTTCCA
TTTGTATCAAAATATTATCCTGAGCTTGCACAGTATGGAGCTTACTCATCTAATGAGATT
TATACGCGTGATGATGTTAGAGATATTGCTAATTTTGCTCAAGTTAGAGGAATACAAATA
TTAGCAGAAATCGATGCACCTGCTCATGCTGGTAATGGTTGGCAGTGGGGTCCCGAAAAA
GGTTTAGGTGAATTGAGCTTATGTGTTAATCAACAGCCTTGGAGTAGTTATTGTGGAGAA
CCACCATGCGGTCAATTAAATCCAAAAAATAACAACACTTTTATAATTTTACAAAGACTT
TATCAAGAACTTTTAGAATTGACTGGTCCAACTGATATGTTTCATTTAGGTGGAGATGAA
GTTAATCTCGAATGCTGGGCACAATATTTTAACGATACAGATCTTCGTTCATTATGGTGT
GATTTTATGCAGCATGCATATCATCGATTAACAATTGCCAATGGTGGACAATCAATTAAG
ATGGCAAATGTTTGGTCGAGCGGTCTTACTAGCATGCCATGTCTATCAAATAAGAATTTT
GCTGTACAAGTTTGGGGTGGAAGTCAATGGTCTGAAAATTATCAGTTGATTGATGCCGGA
TATAATTTAATTTTATCGCATGTCGATGCTTGGTATCTCGATTGTGGTTTTGGCAATTGG
AGAGCGACAGGAGAGGCTGCATGTTCGCCTTATAGAACGTGGCAGCAAGTTTATAAACAT
AAACCTTGGCAAGCTATGAGACTTAATAAACAGCAAATGAAACAGATTCTTGGCGGAGAG
GTTTGCATGTGGAGTGAGCAAGTCGATGAAAGTTCATTGGATTCTCGTGTCTGGCCTAGA
GCTGCAGCTTTAGCTGAACGCTTATGGAGTGATCCACAAGATGAAAATGAATATGCAATT
TCCAAAGAAACATTTAATCGCTTTTCAATGTTTCGCAATAGATTAGTGCAATTGGGTTTA
AAAGCAGAACCAATATTTCCCAAATATTGCGCTCAAAATCAGGATGAATGTGTATGA

>g8590.t2 Gene=g8590 Length=658
MAFTTSLRRLLIGLLFVMALCLMLLYFNESQSLLKPASSFLINSFDTRHNKLKLVADVAD
NFKIPSEKAWTYKCVNQKCVRQHYSNKNEKRVPFTSCSMLCGSYSKIWPEPTKSYIGTSA
NSFNLNDVQFKIKTPFKNVENLLESAFSIFMEEIKDIFHSSSDRKTEEVSKSSTTANYKE
RSPLDTKHRRHNLTNVNIFLYVIKTSEIHLSLTTDECYNLTMINEHQTVEVRITANSFFG
ARNGLATLQQLIWFDDEDESLKIIGSANIADCPKFNYRGLMLDTSRHYFSVESIKRTLVA
MSHSKLNRFHWHLTDSQSFPFVSKYYPELAQYGAYSSNEIYTRDDVRDIANFAQVRGIQI
LAEIDAPAHAGNGWQWGPEKGLGELSLCVNQQPWSSYCGEPPCGQLNPKNNNTFIILQRL
YQELLELTGPTDMFHLGGDEVNLECWAQYFNDTDLRSLWCDFMQHAYHRLTIANGGQSIK
MANVWSSGLTSMPCLSNKNFAVQVWGGSQWSENYQLIDAGYNLILSHVDAWYLDCGFGNW
RATGEAACSPYRTWQQVYKHKPWQAMRLNKQQMKQILGGEVCMWSEQVDESSLDSRVWPR
AAALAERLWSDPQDENEYAISKETFNRFSMFRNRLVQLGLKAEPIFPKYCAQNQDECV

Protein features from InterProScan

Transcript Database ID Name Start End E.value
23 g8590.t2 CDD cd06562 GH20_HexA_HexB-like 275 639 5.11695E-165
15 g8590.t2 Gene3D G3DSA:3.30.379.10 Chitobiase 66 272 2.9E-41
16 g8590.t2 Gene3D G3DSA:3.20.20.80 Glycosidases 273 658 1.1E-125
25 g8590.t2 MobiDBLite mobidb-lite consensus disorder prediction 165 186 -
3 g8590.t2 PANTHER PTHR22600 BETA-HEXOSAMINIDASE 5 657 1.6E-261
4 g8590.t2 PANTHER PTHR22600:SF3 BETA-HEXOSAMINIDASE FDL-RELATED 5 657 1.6E-261
22 g8590.t2 PIRSF PIRSF001093 B-hxosamndse_ab_euk_ 49 658 1.2E-149
10 g8590.t2 PRINTS PR00738 Glycosyl hydrolase family 20 signature 231 251 1.1E-52
11 g8590.t2 PRINTS PR00738 Glycosyl hydrolase family 20 signature 269 286 1.1E-52
7 g8590.t2 PRINTS PR00738 Glycosyl hydrolase family 20 signature 298 319 1.1E-52
9 g8590.t2 PRINTS PR00738 Glycosyl hydrolase family 20 signature 352 369 1.1E-52
6 g8590.t2 PRINTS PR00738 Glycosyl hydrolase family 20 signature 406 424 1.1E-52
8 g8590.t2 PRINTS PR00738 Glycosyl hydrolase family 20 signature 429 442 1.1E-52
12 g8590.t2 PRINTS PR00738 Glycosyl hydrolase family 20 signature 576 592 1.1E-52
5 g8590.t2 PRINTS PR00738 Glycosyl hydrolase family 20 signature 593 610 1.1E-52
1 g8590.t2 Pfam PF14845 beta-acetyl hexosaminidase like 107 251 1.5E-18
2 g8590.t2 Pfam PF00728 Glycosyl hydrolase family 20, catalytic domain 275 611 4.3E-88
18 g8590.t2 Phobius SIGNAL_PEPTIDE Signal peptide region 1 30 -
19 g8590.t2 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 9 -
20 g8590.t2 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 10 25 -
21 g8590.t2 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 26 30 -
17 g8590.t2 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 31 658 -
13 g8590.t2 SUPERFAMILY SSF55545 beta-N-acetylhexosaminidase-like domain 106 274 9.12E-19
14 g8590.t2 SUPERFAMILY SSF51445 (Trans)glycosidases 275 619 7.58E-102
24 g8590.t2 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 7 26 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0004563 beta-N-acetylhexosaminidase activity MF
GO:0004553 hydrolase activity, hydrolyzing O-glycosyl compounds MF
GO:0005975 carbohydrate metabolic process BP

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values