Gene loci information

Transcript annotation

  • This transcript has been annotated as Bifunctional heparan sulfate N-deacetylase/N-sulfotransferase.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is indicated below. CDS regions are colored in green. TSS and TTS that were predicted with CTR-Seq data are indicated by solid circles and squares, respectively. Exact coordinates are listed in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g8720 g8720.t1 isoform g8720.t1 32550289 32566292
chr_2 g8720 g8720.t1 exon g8720.t1.exon1 32550289 32550638
chr_2 g8720 g8720.t1 cds g8720.t1.CDS1 32550289 32550638
chr_2 g8720 g8720.t1 exon g8720.t1.exon2 32551628 32551828
chr_2 g8720 g8720.t1 cds g8720.t1.CDS2 32551628 32551828
chr_2 g8720 g8720.t1 exon g8720.t1.exon3 32558103 32559238
chr_2 g8720 g8720.t1 cds g8720.t1.CDS3 32558103 32559238
chr_2 g8720 g8720.t1 exon g8720.t1.exon4 32560502 32560771
chr_2 g8720 g8720.t1 cds g8720.t1.CDS4 32560502 32560771
chr_2 g8720 g8720.t1 exon g8720.t1.exon5 32560901 32561388
chr_2 g8720 g8720.t1 cds g8720.t1.CDS5 32560901 32561388
chr_2 g8720 g8720.t1 exon g8720.t1.exon6 32561458 32561810
chr_2 g8720 g8720.t1 cds g8720.t1.CDS6 32561458 32561810
chr_2 g8720 g8720.t1 exon g8720.t1.exon7 32566190 32566292
chr_2 g8720 g8720.t1 cds g8720.t1.CDS7 32566190 32566292
chr_2 g8720 g8720.t1 TTS g8720.t1 32566589 32566589
chr_2 g8720 g8720.t1 TSS g8720.t1 NA NA
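
As a quick consistency check on the table above, the CDS segment lengths can be summed and compared with the transcript length given in the Sequences section. This is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); the function names are illustrative only.

```python
# CDS segments for g8720.t1, copied from the table above.
# ASSUMPTION: coordinates are 1-based and inclusive.
CDS_SEGMENTS = [
    (32550289, 32550638),
    (32551628, 32551828),
    (32558103, 32559238),
    (32560502, 32560771),
    (32560901, 32561388),
    (32561458, 32561810),
    (32566190, 32566292),
]


def segment_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def total_length(segments):
    """Summed length of all segments."""
    return sum(segment_length(start, end) for start, end in segments)


def intron_lengths(segments):
    """Gap sizes between consecutive segments, ordered by start position."""
    ordered = sorted(segments)
    return [nxt[0] - prev[1] - 1 for prev, nxt in zip(ordered, ordered[1:])]


cds_length = total_length(CDS_SEGMENTS)
print(cds_length)                    # summed CDS length in bp
print(intron_lengths(CDS_SEGMENTS))  # one value per intron
```

Under the inclusive-coordinate assumption, the summed CDS length agrees with the Length=2901 given in the nucleotide FASTA header.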

Sequences

>g8720.t1 Gene=g8720 Length=2901
ATGGAGCGGATCCCTATGTTGATGAAAACAGCTCCGAACGATATTACATTAACACCACTG
CCATCGTCCTCATTGTCGTCATTTAAGGATGAAAAGCCAAGGAAATTTTATGACATGCGA
CATTTGTTGCCGACAAAGTCACCAGTGAGCAGTAGTAGCAGCAGTACAAATGGACTTATA
ATAAACAATTATGGAATCAACAGTAAAAATAGCTTTATGACACGATTGTGTTGTCACATG
ATGAAAGGAGTACAGAGGAACATTCAAAAATGTGTTGCGGCACTTGTGCTCATATCATTT
TTCAGTATAATCTTTTTCACACAATATATGGATAGTAGTCCACTTGTTGGACTCATTCAT
CGTGATACAAAGCCAATGCCATTGATCCATTGTCAAACACTAAATAAATCACCGTCTGAT
CCATCGCAAACACATGATCACAGGTCAGAGGCGAGATTACGGATAGACTCGAAAGTGCTT
GTATTTGTTGAAACGACATATAGCAAGTTGGGTCGAGAGATAGCGGAAGAACTTGTGTAT
AATAGAATCAAATATAAAATTGAGGTGTCCGGAAAAAGCCTTCCTGTTCTCACAAATTTA
GACAAAGGTCGTTATGGTGTGATTGTATTTGAGAATCTCGATAAATATTTATCAATGGAC
AAATGGAATCGAGAATTACTTGATAAATATTGCCGTGAATATTCTGTTGGTATTGTTGGT
TTTATGAGTGCAAGTGAAGAAACTTTAGTTGGTGCACAATTGAAAGATTTTCCACTTTAT
GTTCATACAAATTTAAGATTGCGTGATGCAAGTCTTAATCCATTTTCACCAGTGTTGCGA
CTTACAAGAGCAGGTGATACAGCTTGGGGTCCACTTCCTGGCCAAGATTGGGCTGTATTT
CAACACAACCATTCGACTTATGAACCACTTGAATTTGCTCAAAAAAATACACTCGATTAT
CCAAATGATAATGGCATTCAACCACCACTTACGACAGTACTTCAAGATCATGGGAAACTT
GATAACATCCAAAGAATCTTTTTTGGAGCTGGTCTCAAATTTTGGCTTCATCGATTACTC
TTTCTTGATGCACTTTCATATTTAAGTCATGGACAATTAAGTGTCAGTTTAAATCGAATG
ATTTTAGTTGATATTGATGATATTTTTGTTGGTGAACGTGGAACACGTTTGAAACCTGAT
GATGTTCATGCATTAATATCAACACAAAGTAGAATAGCAGAAATGGTGCCTGGATTTAGA
TTTAACCTTGGCTTTTCTGGTAAATATTTTCATCATGGCACAAAAGAAGAAAACTTGGGT
GATGACATGCTGCTTCGTAATGTTGATAAATTTACTTGGTTTTCTCACATGTGGAATCAT
CAGCAACCACATCTATATGATAATCTCACTGTGCTTATGAATGACATGATGCTGAATAAA
GATTTTGCCAAGGTAAAGAATATTCCACTCGATTCTGGCTACTCAGTTTCACCACATCAC
TCTGGCGTCTATCCAGTTCATGAACTTCTTTATCAAGCATGGAAAAATGTATGGAATGTC
AGAGTAACATCAACTGAAGAATATCCACATCTGCGACCTGCTCGATTACGAAGGGGATTC
ATACATCGTAATATTATGGTGCTACCGCGTCAGACATGCGGTTTATTTACACATACAATG
TATATAGATCGTTATCCGGGCGGGCGTGATAAGCTCGATGAGTCAATACAAAGAGGAGAA
CTCTTCCAAACAATTGTTTATAATCAAATAAATATTTTCATGACACATATGTCCAACTAT
GGGAGCGATCGATTGGCTCTCTATACATTTGAATCTGTAATTAAGTTCCTACGATGCTGG
ACAAATCTAAAATTAACATCAGCACCGCCATTGCAATTAGGCGAACATTATTTTAAGTTG
CACCCTGAAGAAAGTGATCCTGTTTGGGGAAATCCATGCGAAGATGCAAGGCACCTCAAA
ATATGGTCGCGCAATAAGAGCTGTGATTCATTACCGAAGTTCATGATATTAGGTCCACAA
AAGACTGGAACAACTGCACTATATACTTTTTTAAGTATGCACCCAAGTCTCGCTAGCAAT
TTACCCAGTCCTGATACATTTGAGGAAATTCAATTTTTTAATGGCAATAATTATTATCGT
GGCCTCGATTGGTACATGAGTTTCTTTCCTCTTCAAAATGCAACATCATCGAATAGTGTA
CATATAGGTGGACCATCATCAACAGGCAGCACAAATGCACGATATTTCTTTGAAAAGTCT
GCCACGTATTTTGATGGTGACCTCGTGCCAAAGCGTGCACATCAATTATTACCAAATGCG
AAGCTCGTTGTGATACTAATATCACCAGCAAAACGTGCCTACAGTTGGTATCAACATACA
CGTGCACATGGAGACATCATAGCCAATAATTATAGCTTCTATCAGGTCATCACAGCAACA
GACACTGCACCAAAGCCTTTACGAGATTTACGAAATCGTTGCCTAAATCCTGGCAAATAT
GCGCAACACTTGGAGCGATGGCTTGCCTTTTATCCTGCACAGCAGCTACACATAATTGAC
GGTGAACAGTTGCGACTTAATCCGATTGACGTGATGAATGATTTACAAAGATTTTTGAAA
ATTGCGCCAATCATGGACTATTCAAATCATTTACGATATGATTCAAAGAAAGGCTTTTAT
TGTCAAGTTATTAATGAAACTAGAAATAAATGCTTGGGAAAATCAAAAGGTCGCATTTAT
CCTGAAATGGACGAAAAAAGCACAAAAATTTTACAAAGATATTACTTGAGCCACAATACA
GCTTTAGTGAAGTTATTGAAGAAGTTGGGTTCAAGACCAATTCCAACATGGCTCAAAAAT
GAGCTTTCGACAACGACATGA

>g8720.t1 Gene=g8720 Length=966
MERIPMLMKTAPNDITLTPLPSSSLSSFKDEKPRKFYDMRHLLPTKSPVSSSSSSTNGLI
INNYGINSKNSFMTRLCCHMMKGVQRNIQKCVAALVLISFFSIIFFTQYMDSSPLVGLIH
RDTKPMPLIHCQTLNKSPSDPSQTHDHRSEARLRIDSKVLVFVETTYSKLGREIAEELVY
NRIKYKIEVSGKSLPVLTNLDKGRYGVIVFENLDKYLSMDKWNRELLDKYCREYSVGIVG
FMSASEETLVGAQLKDFPLYVHTNLRLRDASLNPFSPVLRLTRAGDTAWGPLPGQDWAVF
QHNHSTYEPLEFAQKNTLDYPNDNGIQPPLTTVLQDHGKLDNIQRIFFGAGLKFWLHRLL
FLDALSYLSHGQLSVSLNRMILVDIDDIFVGERGTRLKPDDVHALISTQSRIAEMVPGFR
FNLGFSGKYFHHGTKEENLGDDMLLRNVDKFTWFSHMWNHQQPHLYDNLTVLMNDMMLNK
DFAKVKNIPLDSGYSVSPHHSGVYPVHELLYQAWKNVWNVRVTSTEEYPHLRPARLRRGF
IHRNIMVLPRQTCGLFTHTMYIDRYPGGRDKLDESIQRGELFQTIVYNQINIFMTHMSNY
GSDRLALYTFESVIKFLRCWTNLKLTSAPPLQLGEHYFKLHPEESDPVWGNPCEDARHLK
IWSRNKSCDSLPKFMILGPQKTGTTALYTFLSMHPSLASNLPSPDTFEEIQFFNGNNYYR
GLDWYMSFFPLQNATSSNSVHIGGPSSTGSTNARYFFEKSATYFDGDLVPKRAHQLLPNA
KLVVILISPAKRAYSWYQHTRAHGDIIANNYSFYQVITATDTAPKPLRDLRNRCLNPGKY
AQHLERWLAFYPAQQLHIIDGEQLRLNPIDVMNDLQRFLKIAPIMDYSNHLRYDSKKGFY
CQVINETRNKCLGKSKGRIYPEMDEKSTKILQRYYLSHNTALVKLLKKLGSRPIPTWLKN
ELSTTT
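
The two sequences above are related by translation: 2901 nucleotides correspond to 966 codons plus one stop codon. The sketch below shows one way to check this, assuming the standard genetic code (NCBI translation table 1), which the page does not state explicitly. `translate` is an illustrative helper, not part of the annotation pipeline.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# ASSUMPTION: the transcript is translated with the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_at_terminator=True):
    """Translate a coding sequence, reading codons from position 0."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*" and stop_at_terminator:
            break  # stop codon reached; it is not part of the protein
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nt and last 21 nt of the g8720.t1 nucleotide sequence above.
print(translate("ATGGAGCGGATCCCTATGTTGATGAAAACA"))
print(translate("GAGCTTTCGACAACGACATGA"))
```

The two fragments translate to the start (MERIPMLMKT) and end (ELSTTT) of the protein sequence listed above; passing the full nucleotide sequence works the same way.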

Protein features from InterProScan

Transcript Database ID Name Start End E-value
g8720.t1 Gene3D G3DSA:3.40.50.300 - 646 966 3.4E-95
g8720.t1 PANTHER PTHR10605:SF36 BIFUNCTIONAL HEPARAN SULFATE N-DEACETYLASE/N-SULFOTRANSFERASE 224 962 0.0
g8720.t1 PANTHER PTHR10605 HEPARAN SULFATE SULFOTRANSFERASE 224 962 0.0
g8720.t1 Pfam PF12062 heparan sulfate-N-deacetylase 98 582 4.2E-227
g8720.t1 Pfam PF00685 Sulfotransferase domain 672 936 2.5E-42
g8720.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 1 90 -
g8720.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 91 110 -
g8720.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 111 966 -
g8720.t1 SUPERFAMILY SSF52540 P-loop containing nucleoside triphosphate hydrolases 647 952 1.63E-84
g8720.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 91 110 -

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
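
The 0.5 threshold can be applied to a per-residue score list to extract contiguous disordered stretches. The following is a rough sketch; the input scores are made up for illustration, and `disordered_regions` and its `min_length` parameter are hypothetical names, not part of IUPred3.

```python
def disordered_regions(scores, threshold=0.5, min_length=1):
    """Return (start, end) residue ranges, 1-based and inclusive,
    where every score is strictly above the threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # a new disordered stretch begins here
        elif start is not None:
            if pos - start >= min_length:
                regions.append((start, pos - 1))
            start = None
    # Close a stretch that runs to the end of the sequence.
    if start is not None and len(scores) - start + 1 >= min_length:
        regions.append((start, len(scores)))
    return regions


# Illustrative scores only; not taken from g8720.t1.
example_scores = [0.1, 0.6, 0.7, 0.4, 0.9]
print(disordered_regions(example_scores))
```

Raising `min_length` filters out very short stretches, which may or may not be desirable depending on how the plot is interpreted.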

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0016787 hydrolase activity MF
GO:0008146 sulfotransferase activity MF
GO:0015016 [heparan sulfate]-glucosamine N-sulfotransferase activity MF

KEGG

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean +/- standard deviation.
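
For reference, a summary of this form could be computed from replicate TPM values as sketched below. The replicate values are invented, and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the page does not say which estimator was used.

```python
from statistics import mean, stdev


def summarize_tpm(replicates):
    """Return (mean, sample standard deviation) of replicate TPM values."""
    if len(replicates) < 2:
        raise ValueError("at least two replicates are needed for a standard deviation")
    return mean(replicates), stdev(replicates)


def format_tpm(replicates, digits=2):
    """Format replicate TPM values as 'mean +/- SD'."""
    average, sd = summarize_tpm(replicates)
    return f"{average:.{digits}f} +/- {sd:.{digits}f}"


# Hypothetical replicate TPM values for one condition.
print(format_tpm([10.0, 12.0, 14.0]))
```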

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was called differentially expressed when both of the following held: (1) FDR < 0.05 and (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.
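
The two criteria can be expressed as a simple filter. This is a sketch under stated assumptions: the fold-change cutoff is treated as symmetric (more than 2-fold up or down), fold change is supplied on the log2 scale as DESeq2 reports it, and the function and parameter names are illustrative rather than taken from the Trinity script.

```python
import math


def is_differentially_expressed(fdr, log2_fold_change,
                                fdr_cutoff=0.05, fold_cutoff=2.0):
    """Apply both criteria: FDR below the cutoff and an absolute
    fold change above the cutoff.

    ASSUMPTION: the fold-change criterion applies in both directions,
    i.e. |log2FC| > log2(fold_cutoff).
    """
    passes_fdr = fdr < fdr_cutoff
    passes_fold = abs(log2_fold_change) > math.log2(fold_cutoff)
    return passes_fdr and passes_fold


# Hypothetical results: (transcript ID, FDR, log2 fold change).
results = [
    ("tx_a", 0.01, 1.5),
    ("tx_b", 0.01, -1.5),
    ("tx_c", 0.20, 3.0),
    ("tx_d", 0.01, 0.5),
]
de_transcripts = [tid for tid, fdr, lfc in results
                  if is_differentially_expressed(fdr, lfc)]
print(de_transcripts)
```

Both comparisons are strict, matching the "<" and ">" in the criteria, so a transcript with exactly 2-fold change or FDR of exactly 0.05 is not called.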

Raw TPM values