Gene loci information

Transcript annotation

  • This transcript has been annotated as Exostosin-2.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g10825 g10825.t1 TTS g10825.t1 12150033 12150033
chr_1 g10825 g10825.t1 isoform g10825.t1 12150122 12155696
chr_1 g10825 g10825.t1 exon g10825.t1.exon1 12150122 12150481
chr_1 g10825 g10825.t1 cds g10825.t1.CDS1 12150122 12150481
chr_1 g10825 g10825.t1 exon g10825.t1.exon2 12150538 12150734
chr_1 g10825 g10825.t1 cds g10825.t1.CDS2 12150538 12150734
chr_1 g10825 g10825.t1 exon g10825.t1.exon3 12150795 12150947
chr_1 g10825 g10825.t1 cds g10825.t1.CDS3 12150795 12150947
chr_1 g10825 g10825.t1 exon g10825.t1.exon4 12151030 12151848
chr_1 g10825 g10825.t1 cds g10825.t1.CDS4 12151030 12151848
chr_1 g10825 g10825.t1 exon g10825.t1.exon5 12153467 12155022
chr_1 g10825 g10825.t1 cds g10825.t1.CDS5 12153467 12155022
chr_1 g10825 g10825.t1 exon g10825.t1.exon6 12155078 12155158
chr_1 g10825 g10825.t1 cds g10825.t1.CDS6 12155078 12155158
chr_1 g10825 g10825.t1 exon g10825.t1.exon7 12155221 12155696
chr_1 g10825 g10825.t1 cds g10825.t1.CDS7 12155221 12155696
chr_1 g10825 g10825.t1 TSS g10825.t1 12155782 12155782

Sequences

>g10825.t1 Gene=g10825 Length=3642
ATGAAAGATAAAAATAGTGCCATAAAAGTTTTCTTAAGTGACAAAGTTGGAATATCTGGT
GGACTTGTTTTAGGATTTATAGCTTTAGTTTTTATTTTTAATTCACAATTGAAGCATCCT
GAAGTTTTCAATTTAAACACTTATAATCATGTTCGCTTCGATAAACTATCAGAATTGAGT
CGTACACGTAATCCTAACTGCACAATATTCGATTGTTTCAATGTATATCGATGTGGAAGT
CATCAGAACAAAATTAGTATTTATATTTATCCTCTTACGGAATATTCTGATGACAACAAA
AAATCAAAAACATTTATAACAAAAGAATTTTATCAAATATTAAAGGCAATAATCAAAAGT
CCTTATTATACTTCAAATCCGCAAGAGGCTTGTTTATTCGTACCAAGTATAGATTTATTG
AACCAAAATCTTATTGATAAGAGTTTAGTTGAAAAAGCACTTGCATCATTAAATTATTGG
AATAATGGACGCAATCATATCTTGTTTAACATGCTCCCTGGAGAAGCTCCCAATTATTCT
ACAGTTTTTGATGTGAAATCAGGAGATGCGATAATTGCTGGAGCAGGGTTTAATACACAA
ACATATCGATATGGATTTGATTTTTCAATTCCGTTCTATAGTCCATTGTTGGACAATTAT
GTGAAGAAGACAGGTGATAAAACAACCAAGAACTATTTTCTAATTTCACCACAACTCAAC
ATGTATGACTATCATCGACGACTTATGCAAGAAATTGTATTAGAAGACTCTAATAATAAT
ATTCTTTTGCTTCAAAAATGTACTCAAGAGATTTTTGAAGATAAAATCTCATCTAATCTA
GAACTCATTGACCATGACATGCGATGCTCATTTCCTGCTTTAACAGCATTAGAGTATCCT
GAAATTTTAGAGCAAGGAATATTTTGTCTTGTTATTAGAGGCGTTAGACTTGCTCAACCA
TCTCTTCTCGAATCATTAGCAACAGGCTGTATTCCTGTTATTGTAGCCGACAACATTATA
CTGCCATTTTCTGAAATTATCGATTGGACATTAGCTTCAATTTCTGTACGTGAGGCTGAT
TTGCATTCAATTTCATCAGTATTAAAAGCAGTTTCACAGACTCGTATTGAAGAGTTACAA
CGACAAGGGCGCTTTTTATATGAAAAATATTTTAAAAATATTGAAACAATTGTTCATTCG
ATGTTGGAAGAATTAAATGATAGAGTATTTCCACATTTATCTCTTGACTACAATAATTGG
AATATAGAAAATCTGCCAAATTCTGCTCAAAATCCACTATTTTTGCCATTAATGGGATCA
AAATCACTTGGTTTCACTGCCGTCATTCTTACATATGACCGCGTTGAGAGTCTCTTTACA
CTTATTCAAAAATTGTCATTTGTACCATCTCTGCAAAAAATACTCGTCATTTGGAATAAT
CAGAAAAAGCAACCACCACCGATGTCATCATTTCCAAAAATAACTAAACCAATAAAAATT
ATTCAGACCAAGGCAAATAAATTGTCAAATCGCTTTTATCCATATCCTGAAATAGAAACA
GAAGCCGTGCTAACAATTGATGATGATATTACTATGCTAACTAGCGATGAACTCGATTTT
GGTTTTGAAGTATGGCGTGAATTTCCTGATCGCATTGTTGGATTTCCATCACGAACACAT
GTATTTGAAAACAATGAGTATAAATATGAATCAGAATGGACGAATTCATTATCAATGGTT
TTAACTGGTGTTGCATTTCATCACAAATATTGGAATTATGCTTATACAACTTCAATGCCT
GGTAATATCAAAGAATATGTTGATGAACATATGAATTGTGAAGATATTGCTATGAATTTT
TTGGTCTCAAATAGGACTGGAAAAGGTCCAATAAAAGTAACACCACGTAAAAAATTTAAA
TGTCATTCGAGACAATGTACAAATGCTGAAATGCTCTCAAGTGATCAAGGACATATGATC
GAGAGAAGTGAATGCATTAATTATTTTACAAAAATATATGGTGTAATGCCTCTAAAAACA
GTTGAATATCGAGCCGATCCTGTTCTTTTCCTCGATAATTTCCCAGAAAAATTAAAACGA
TTTAATGATGTAGAGAATCCCACACTTTCTTCAATTGGTATCAATCATGTTGAAGGTGGT
TGGCCAAAAGATATCAATCGTTTAGATGAAGAGCAAACAGCTCGTTATCGTAAGAAACAA
GAAAAGGATGAATCGTATGTGACCCAAATGAAAGGTCTTATAAAGTCATGCGAAAATGCT
ATTTATCAGAATAATGCTGTTAATTTATATGAAACATATTTCGAAGATATGGAGCAGATA
GATTTAAAAGAGGAATATTCTTCGAAAACTTTGAACATGTTTCGTGATAGAACTAATAGA
AATATTAGAAAAATAGCATGGAATCCAGATGATTCAACAAAATTTACAACAGCACATTGT
GGCAATCAGACATACTTTGATTATTTGAATGATGAGTCTAATACGATGCATATTTGGGAT
ATTGAGTATACTAAGAAGCCTATAAGAACTTTTGATAACCATGCGCAATCATTATGCTTC
GAATATAATCCAAAAGAACATTCGACCTTAGTCACTGGAATGATCACAGGGCAAATAGCT
ATATATGATACAAGACAGAATACTCAATTTCCACAAACAGTTTCATATCGAGAAAATTCA
CATAAAGACCAAGTTAATGCTGTTACATTTTATTGCTCAAAATCAAATATGGAGTTTTTT
AGTGGTAGCTCAAGCGGAGAAATTTATTGGTGGGATTCGAGAATGATTGAAAATGGACCT
ATTGAAGTTCTCGTATTGCAGCCAGAAGTAATGCAAGATAGTATAGCAAAAAAGGAAGAG
AAGGCTTTTGGTGTGGTAGCATTAGAGTATGAGAGCACTATTCCAACAAAATATATTGTA
GGAACAGATCATGGAGTGGTTTATATTTGCAATAAGCGTTTTAAGACACCGGCTGATAGA
ATTTATTCTAAAGTTCAATGCTATAATGGTGTAGTATCAGCAGTTCAAAGAAATCCTTCA
TTTTTGAAATTTTTTTTGAGTGTTGGTGATTTTCAAGCGAAACTTTGGTGTGAAGAGTTG
AAGGAGCAACCAATTTTTTGGACTAAAGAATATTCTTCTGAACTCACATTTGGTTGTTGG
AATGCTATTAGATGTTCGAGCTTTTATTTATGCAGAATGGATGGAGTTTTTGATGCATGG
GATATTATTCATAGAAGCGACAGGCCTGTTTTATCAACTAAAATTTCAGATCACACGCTG
TTAACATGTTCACCTCATAAGGAAGGAAAATTAATCCTTGTTGGCACAAGTGGTGGTGAT
ATACATTTATTACAGCTATCAGAGAACTTAGCCACAACAACAGTTAATGATAGACCACAT
ATGGCAGTCGTCCTCGATAGAGAGACACGAAGAGAAAAGCTTCTTGAAAATAAACAACGT
GAGGTGAAGTTAAAAGAAAAGGAGAAAGAGCGAGAACAAATTCGAATGAAGTTAGGCTTG
CCAGATCCTGAAGAAGTAGAGATGGCGTTAGCTAACGATTTGGCTAAACAAGCAGAAGTA
AATTTTCATTTACAGATTGAGCAGTTGAAATTACATTGTTAA

>g10825.t1 Gene=g10825 Length=1213
MKDKNSAIKVFLSDKVGISGGLVLGFIALVFIFNSQLKHPEVFNLNTYNHVRFDKLSELS
RTRNPNCTIFDCFNVYRCGSHQNKISIYIYPLTEYSDDNKKSKTFITKEFYQILKAIIKS
PYYTSNPQEACLFVPSIDLLNQNLIDKSLVEKALASLNYWNNGRNHILFNMLPGEAPNYS
TVFDVKSGDAIIAGAGFNTQTYRYGFDFSIPFYSPLLDNYVKKTGDKTTKNYFLISPQLN
MYDYHRRLMQEIVLEDSNNNILLLQKCTQEIFEDKISSNLELIDHDMRCSFPALTALEYP
EILEQGIFCLVIRGVRLAQPSLLESLATGCIPVIVADNIILPFSEIIDWTLASISVREAD
LHSISSVLKAVSQTRIEELQRQGRFLYEKYFKNIETIVHSMLEELNDRVFPHLSLDYNNW
NIENLPNSAQNPLFLPLMGSKSLGFTAVILTYDRVESLFTLIQKLSFVPSLQKILVIWNN
QKKQPPPMSSFPKITKPIKIIQTKANKLSNRFYPYPEIETEAVLTIDDDITMLTSDELDF
GFEVWREFPDRIVGFPSRTHVFENNEYKYESEWTNSLSMVLTGVAFHHKYWNYAYTTSMP
GNIKEYVDEHMNCEDIAMNFLVSNRTGKGPIKVTPRKKFKCHSRQCTNAEMLSSDQGHMI
ERSECINYFTKIYGVMPLKTVEYRADPVLFLDNFPEKLKRFNDVENPTLSSIGINHVEGG
WPKDINRLDEEQTARYRKKQEKDESYVTQMKGLIKSCENAIYQNNAVNLYETYFEDMEQI
DLKEEYSSKTLNMFRDRTNRNIRKIAWNPDDSTKFTTAHCGNQTYFDYLNDESNTMHIWD
IEYTKKPIRTFDNHAQSLCFEYNPKEHSTLVTGMITGQIAIYDTRQNTQFPQTVSYRENS
HKDQVNAVTFYCSKSNMEFFSGSSSGEIYWWDSRMIENGPIEVLVLQPEVMQDSIAKKEE
KAFGVVALEYESTIPTKYIVGTDHGVVYICNKRFKTPADRIYSKVQCYNGVVSAVQRNPS
FLKFFLSVGDFQAKLWCEELKEQPIFWTKEYSSELTFGCWNAIRCSSFYLCRMDGVFDAW
DIIHRSDRPVLSTKISDHTLLTCSPHKEGKLILVGTSGGDIHLLQLSENLATTTVNDRPH
MAVVLDRETRREKLLENKQREVKLKEKEKEREQIRMKLGLPDPEEVEMALANDLAKQAEV
NFHLQIEQLKLHC

Protein features from InterProScan

Transcript Database ID Name Start End E.value
9 g10825.t1 Coils Coil Coil 1151 1177 -
8 g10825.t1 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 442 696 1.4E-89
7 g10825.t1 Gene3D G3DSA:2.130.10.10 - 778 1168 1.7E-18
3 g10825.t1 PANTHER PTHR11062:SF128 EXOSTOSIN-2 23 654 6.0E-146
4 g10825.t1 PANTHER PTHR11062 EXOSTOSIN HEPARAN SULFATE GLYCOSYLTRANSFERASE -RELATED 23 654 6.0E-146
2 g10825.t1 Pfam PF03016 Exostosin family 82 370 4.2E-45
1 g10825.t1 Pfam PF09258 Glycosyl transferase family 64 domain 445 690 2.9E-88
10 g10825.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 1 11 -
12 g10825.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 12 33 -
11 g10825.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 34 1213 -
17 g10825.t1 SMART SM00320 WD40_4 787 840 270.0
16 g10825.t1 SMART SM00320 WD40_4 844 883 5.5
15 g10825.t1 SMART SM00320 WD40_4 891 932 1.6
14 g10825.t1 SMART SM00320 WD40_4 998 1037 170.0
18 g10825.t1 SMART SM00320 WD40_4 1088 1125 210.0
6 g10825.t1 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 445 691 8.44E-26
5 g10825.t1 SUPERFAMILY SSF50978 WD40 repeat-like 798 1127 2.88E-28
13 g10825.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 16 33 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0006486 protein glycosylation BP
GO:0016757 glycosyltransferase activity MF
GO:0005515 protein binding MF
GO:0016021 integral component of membrane CC
GO:0015020 glucuronosyltransferase activity MF
GO:0015012 heparan sulfate proteoglycan biosynthetic process BP
GO:0006024 glycosaminoglycan biosynthetic process BP

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values