Gene loci information

Transcript annotation

  • This transcript has been annotated as UDP-glucose:glycoprotein glucosyltransferase.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g4147 g4147.t1 TTS g4147.t1 30948674 30948674
chr_3 g4147 g4147.t1 isoform g4147.t1 30948882 30954007
chr_3 g4147 g4147.t1 exon g4147.t1.exon1 30948882 30948916
chr_3 g4147 g4147.t1 cds g4147.t1.CDS1 30948882 30948916
chr_3 g4147 g4147.t1 exon g4147.t1.exon2 30949000 30949196
chr_3 g4147 g4147.t1 cds g4147.t1.CDS2 30949000 30949196
chr_3 g4147 g4147.t1 exon g4147.t1.exon3 30949266 30949287
chr_3 g4147 g4147.t1 cds g4147.t1.CDS3 30949266 30949287
chr_3 g4147 g4147.t1 exon g4147.t1.exon4 30949357 30949398
chr_3 g4147 g4147.t1 cds g4147.t1.CDS4 30949357 30949398
chr_3 g4147 g4147.t1 exon g4147.t1.exon5 30949464 30949802
chr_3 g4147 g4147.t1 cds g4147.t1.CDS5 30949464 30949802
chr_3 g4147 g4147.t1 exon g4147.t1.exon6 30949892 30951798
chr_3 g4147 g4147.t1 cds g4147.t1.CDS6 30949892 30951798
chr_3 g4147 g4147.t1 exon g4147.t1.exon7 30951872 30953911
chr_3 g4147 g4147.t1 cds g4147.t1.CDS7 30951872 30953911
chr_3 g4147 g4147.t1 exon g4147.t1.exon8 30954003 30954007
chr_3 g4147 g4147.t1 cds g4147.t1.CDS8 30954003 30954007
chr_3 g4147 g4147.t1 TSS g4147.t1 30954136 30954136

Sequences

>g4147.t1 Gene=g4147 Length=4587
ATGAGAATGGGTGTAATTAAATCAATTTTTAGTATAGTGATTTTCATTATTAGTATAAGT
TTATCATATGCAGCAAAAAGCCATCCGATCTCAACTCTAATCAATGCGAAATTTTCATTG
ACACCAGTACAGCTGGAAATATCAGAATATTTGTCAGACAGTTCAAATCAAAAATTCTGG
ATTTTTATAGATGAGCTGACTAAAGTCAATCTTGATGGTTTACAGACAGACCAACAACGT
TATAAAACTGCGATCGATATTGCAGGAAAACATTTAAGTCATGCTCAAATCAAGTTATTG
AAGTTATCATTATCTTTAAGATCACTCACGCCACGCATACAATCACACTTTATGATTGCT
GATGACATTTTAAAGCGTGGTGATTGTGAATATGCTAAAGCATTTGTTATGAGTAGTAAT
GAGTTAATTTGTTCAGTGAATGATTTAAAGAAAAAATTTAAGGATTTCAAACCATCAAAT
GAAAATTCAGACCTGTATAGCTTTGATCACATTTTTCCTGGTACAGAAACTAACAATTTT
GTTAATGTGCTTTATGGAGAAGTCGGCAGTAAGGAATTTAACGAATTTCATAATCTTCTT
AAGACTGAAATTGCAAATGGCAAAAACATTAAATATGTAGCAAGGCATTTTATTCGTCAT
CGTTCTCAAACAAAAGTTCGTCTCAGCGGATATGGTGTCGAATTGCATTTGAAATCAACT
GAATACAAAGCACAAGATGATTCACCAAGGAAAGATAATGAGAAGCTCTTTGAATCGGAT
TCAGAAGATACACAAGTTGAAGGATTTAATTTTAAAAAATTAAAAGAGATTTACCCACAT
TTGTCACATTCTTTAGACAAGATGAGAGCCAGTTTGCTTGAAAAGAATGAAGAGATTTCT
CCTTTAAAAGCTTGGGAATTTCAAGAACTTGGTTTGCAAGCAGCAGCACGTGTAGCTTCC
ATTCAGGGTGAGGAGGCTCTCTCAATTCTTCAATTCACAGCTCAAAATTTTCCTAGTCAA
GCCAAATCTCTCATTCATACTAAAGTAAGTGATGATTTTAAGGCAGAAATGAAAAACAAT
ATTGAAGTTCTTGCAAGAAATTTGAACCTTCAACCACCTGATGCCGCACTCTTTATCAAT
GGTCTTTATTTTGATGCCGAAACTCTTGATGTTGAAACCCTTCTTGATACAATCAAAAAA
GAATCAATGATACTTGATGGATTAAATCAAATTGGATTGAAAGGAAGTGCATCAGCACCA
TTGCTTGCTCTTGATTTTGCATCTCAAGCTAAAGAATTTGCAATTGATATTCGCGATTCA
TCTATCATTTGGATCAACGATTTGGAAGTTGATAAGGAATATAAAAGATGGGGTAGCTCA
GTAATGGACATGTTAAGACCAACTTTTCCAGGTATGATGAGAAGCGTTAGAAAAAACTTT
TTCAACCTTTTATTGGTATTTGATCCTATCAAACCTGAAGCACGTGATATTATCCGCATG
TCTGAAAGTTTCATTGTTAACATGGCCCCAATTCGTCTTGGTTTAGTCTTTGAAACAACA
CGCTCAGTGAATAACGAAGGAGAAACGAATATTGTTTATCGTACCATTAATTGTGCCTTC
AATTACATGCATCAAAAGAAAGGAACACGTGAAGCATTGAGTTTTCTCTTAGATATTTTT
GCGTCGGTAGAAAAGGATAAAGATGTTGATTTAGAGACTATTCGTAAAGTTTTTAAAAAA
ACAAATACAAAGCTAAGCAGTGCAGAAATTGAAGATGTTTTAGGTCAAGATTCAGATTTT
GATTATGGACGACAATTGTCTGAAGAATTTATTGAACGATTGGGCGTAAAAGTAATTCCT
CAGGCGTTGCTTAATGGTGTCCTCTTAAATCAAAAATCACTTAATCGCGATGAATATGAA
GAGCTTATTTTAACAGAAATCATGCAACAGACACCCACACTTCAGAAGGCTATTTATCGT
GGAGAATTAAGTGATGGTGAAAATGTTATTGATTATTTAATGCAACAACCACATGTGATG
CTTAGATTAAATCAAAGAATTTTATCGAACGATAATCCACAAAACTTGGATATGCATAAT
GGCAAAGCATATCCTGATATTGAAGACGTTAAAATTCTTGCAACATTAAATAATGAGGAC
TTAACGGCAACATTATTGAAAAATATACATTATTTTGAACCGAAGAGTTCGGGTGAAAAA
TTTATGAAGAGCAGACTGCATTTTGTCACGATTAAAGTCGTTACTGATTTAAACACGAAA
CGTGGTAAAAATCTTCTCAGAAATGCTCTCGAATATTTAAAGGGAAGCAGTGGAACGCGA
TTAACTTTCATACCAAATGCTGATAAGTCAGAAGCTACTTCGAAAGATGAATATAATTTA
AATGCGATTGTCTGGAGCATTTTAAATACATATGAGGGAAAAGAAGCGACAGAACGTACA
CTTCGTATACTCAATGGAAAAGAAGAAATTTCTGACAGTGTTAAAGGATTCCTTAAAGCA
ACTGAATTGCATCTCAAAATGCTTCGAGTTTATTGTCAACGAGTTCTTAAAATGAAAAGT
AGTGAAACTAGTGTCATTGTTAATGGAAGAATATTTGGACCATTTGAGGATGAGGAAACA
TTTACAGTTGATGATTTCAATTTGATTGAGAAAATCAATCAACAACAGTACATAAATAAA
ATAAAACTCGCTTTCAAATCAATAAAAGTTGAAGATTTTGACCTTGAATTAAGCTCAGAT
TTAATGCTTCGACTTTTGTCACTTTTGATACCACGTCATTCATCAAAAAATCGTTTCAAT
ATTCCTGCAGAATTACGTGAAGATTTCACAGTTGTAAAATTATTACCAAAAGTCAAAAAT
GAGCCAAGTTTTAATTTAGTAGCAGTTGTTGATCCTGCATCACGTGGCGCTCAAAAACTC
TCACCACTTCTCAATTTGCTACGTCAAGTTGTCAATTGTGACTTAAAACTTTTCCTTTGT
GCTGTTGACAAACACTCAGACATGCCTGTGAAAAATTTCTATAGATATGTAATTGAACCA
GAGCTGCAGTTCACAACTGAAGGTAAATTAACAAATGGTCCAGCTGCCAAATTTGTAGGA
TTACCAGCTAAACATTTATTGACACAGAACCTCGCAGTGCCAGAAAATTGGATGGTTGAT
TTAATAAGATCTGTGTATGATTTAGACAATATACGTTTATCAGATATAGGCGGACCTGTT
CATAGTGAATATGAACTTGAATACTTACTTTTAGAGGGACACTGTTTCGATACAACGTCT
GGATCTCCGCCACGTGGTCTGCAATTCATTTTAGGAACAAAAGAACAAGAAGCAGTTGTT
GATACAATTGTGATGGCTAATTTAGGATACTTTCAACTTAAAGCAAATCCTGGTGCATGG
ACATTAAAATTACGGCATGGAAAAAGCTCAGAAATCTACGACATAACAAATGTTGATGGA
TTGAATACAATTCACTCTGTTGAAGATGGTTTTGTATCTGTAGTCGTCAATAGCTTCCGT
TCTCATGTCCTTAAAGTTCGTGTCACAAAAAAGCCAGAGATGCTAAATGTCGACTTATTG
GGTGATAATGATGAGCCATCAAGCGGTATTTGGAACTCAATTACGAGCACATTCAGTGGT
TCAAATGCAGAGTCAAATATTGAAACTATCAACATTTTCTCGGTCGCATCTGGCCATCTT
TATGAACGTCTTTTGCGCATTATGATGCTGTCGTTGTTGAAGCATACAAAATCTCCTGTG
AAATTTTGGTTCTTGAAAAACTATTTATCGCCACAATTCAAAGATTTCCTACCAGCAATG
AGCCGTGAATATAATTTTGACTATGAACTAGTGCAATATAAATGGCCCAGATGGCTTCAT
CAACAAACTGAAAAACAAAGAACAATCTGGGGATATAAAATTCTGTTTTTAGATGTTCTA
TTTCCTCTGAATGTAAAGAAGATTATTTTTGTTGATGCGGATCAAATTGTGAGAGCAGAC
ATGAAGGAACTTTATGAAATGGATTTGAATGGTGCACCATATGGCTACGTACCATTTTGT
GACTCGCGAAAAGAAATGGATGGTTTCCGATTTTGGAATTCAGGTTATTGGAGGAATCAT
TTACAAGGTCGCAAATACCATATTTCTGCACTATATGTTGTGGATTTGAAAAGATTTAGA
AAAATTGCTGCCGGTGATAGATTAAGAGGACAATATCAAGCTTTAAGTCAAGATCCTAAT
TCTCTTAGTAACCTTGATCAAGATTTGCCTAATAATATGGTTCATCAAGTTCAGATTAAA
TCTCTTCCACAAGAGTGGTTATGGTGTGAGACTTGGTGTAGCGATGATGGATTGAAACAT
GCCAAAACAATTGATTTATGTAATAATCCTTTGACAAAAGAAGCTAAATTAACGGCAGCT
CAAAGAATTGTACCAGAATGGAAAGATTATGATAATGAAATAAAAAACCTAATGGCACGA
ATTGATGATGACGAACATCAAGAGCATGTTACAATTCACAAACAATATGATGAGAAAACA
CCAAATGAGAAGCACATCGAGCTGTGA

>g4147.t1 Gene=g4147 Length=1528
MRMGVIKSIFSIVIFIISISLSYAAKSHPISTLINAKFSLTPVQLEISEYLSDSSNQKFW
IFIDELTKVNLDGLQTDQQRYKTAIDIAGKHLSHAQIKLLKLSLSLRSLTPRIQSHFMIA
DDILKRGDCEYAKAFVMSSNELICSVNDLKKKFKDFKPSNENSDLYSFDHIFPGTETNNF
VNVLYGEVGSKEFNEFHNLLKTEIANGKNIKYVARHFIRHRSQTKVRLSGYGVELHLKST
EYKAQDDSPRKDNEKLFESDSEDTQVEGFNFKKLKEIYPHLSHSLDKMRASLLEKNEEIS
PLKAWEFQELGLQAAARVASIQGEEALSILQFTAQNFPSQAKSLIHTKVSDDFKAEMKNN
IEVLARNLNLQPPDAALFINGLYFDAETLDVETLLDTIKKESMILDGLNQIGLKGSASAP
LLALDFASQAKEFAIDIRDSSIIWINDLEVDKEYKRWGSSVMDMLRPTFPGMMRSVRKNF
FNLLLVFDPIKPEARDIIRMSESFIVNMAPIRLGLVFETTRSVNNEGETNIVYRTINCAF
NYMHQKKGTREALSFLLDIFASVEKDKDVDLETIRKVFKKTNTKLSSAEIEDVLGQDSDF
DYGRQLSEEFIERLGVKVIPQALLNGVLLNQKSLNRDEYEELILTEIMQQTPTLQKAIYR
GELSDGENVIDYLMQQPHVMLRLNQRILSNDNPQNLDMHNGKAYPDIEDVKILATLNNED
LTATLLKNIHYFEPKSSGEKFMKSRLHFVTIKVVTDLNTKRGKNLLRNALEYLKGSSGTR
LTFIPNADKSEATSKDEYNLNAIVWSILNTYEGKEATERTLRILNGKEEISDSVKGFLKA
TELHLKMLRVYCQRVLKMKSSETSVIVNGRIFGPFEDEETFTVDDFNLIEKINQQQYINK
IKLAFKSIKVEDFDLELSSDLMLRLLSLLIPRHSSKNRFNIPAELREDFTVVKLLPKVKN
EPSFNLVAVVDPASRGAQKLSPLLNLLRQVVNCDLKLFLCAVDKHSDMPVKNFYRYVIEP
ELQFTTEGKLTNGPAAKFVGLPAKHLLTQNLAVPENWMVDLIRSVYDLDNIRLSDIGGPV
HSEYELEYLLLEGHCFDTTSGSPPRGLQFILGTKEQEAVVDTIVMANLGYFQLKANPGAW
TLKLRHGKSSEIYDITNVDGLNTIHSVEDGFVSVVVNSFRSHVLKVRVTKKPEMLNVDLL
GDNDEPSSGIWNSITSTFSGSNAESNIETINIFSVASGHLYERLLRIMMLSLLKHTKSPV
KFWFLKNYLSPQFKDFLPAMSREYNFDYELVQYKWPRWLHQQTEKQRTIWGYKILFLDVL
FPLNVKKIIFVDADQIVRADMKELYEMDLNGAPYGYVPFCDSRKEMDGFRFWNSGYWRNH
LQGRKYHISALYVVDLKRFRKIAAGDRLRGQYQALSQDPNSLSNLDQDLPNNMVHQVQIK
SLPQEWLWCETWCSDDGLKHAKTIDLCNNPLTKEAKLTAAQRIVPEWKDYDNEIKNLMAR
IDDDEHQEHVTIHKQYDEKTPNEKHIEL

Protein features from InterProScan

Transcript Database ID Name Start End E.value
16 g4147.t1 CDD cd06432 GT8_HUGT1_C_like 1230 1477 0.0
10 g4147.t1 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 1233 1483 3.2E-20
18 g4147.t1 MobiDBLite mobidb-lite consensus disorder prediction 242 261 -
7 g4147.t1 PANTHER PTHR11226 UDP-GLUCOSE GLYCOPROTEIN:GLUCOSYLTRANSFERASE 16 1509 0.0
4 g4147.t1 Pfam PF18400 Thioredoxin-like domain 41 225 2.3E-44
2 g4147.t1 Pfam PF18401 Thioredoxin-like domain 295 427 5.5E-30
6 g4147.t1 Pfam PF18402 Thioredoxin-like domain 435 687 4.2E-61
5 g4147.t1 Pfam PF18403 Thioredoxin-like domain 716 929 6.2E-39
3 g4147.t1 Pfam PF06427 UDP-glucose:Glycoprotein Glucosyltransferase 1092 1200 5.6E-38
1 g4147.t1 Pfam PF18404 Glucosyltransferase 24 1230 1497 1.2E-148
12 g4147.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 24 -
13 g4147.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 7 -
14 g4147.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 8 19 -
15 g4147.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 20 24 -
11 g4147.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 25 1528 -
8 g4147.t1 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 1230 1494 2.17E-49
9 g4147.t1 SignalP_EUK SignalP-TM SignalP-TM 1 24 -
17 g4147.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 5 24 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0006486 protein glycosylation BP
GO:0003980 UDP-glucose:glycoprotein glucosyltransferase activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values