Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative polypeptide N-acetylgalactosaminyltransferase 9.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g2200 g2200.t1 TSS g2200.t1 15942714 15942714
chr_3 g2200 g2200.t1 isoform g2200.t1 15942741 15947160
chr_3 g2200 g2200.t1 exon g2200.t1.exon1 15942741 15944119
chr_3 g2200 g2200.t1 cds g2200.t1.CDS1 15942741 15944119
chr_3 g2200 g2200.t1 exon g2200.t1.exon2 15944175 15944256
chr_3 g2200 g2200.t1 cds g2200.t1.CDS2 15944175 15944256
chr_3 g2200 g2200.t1 exon g2200.t1.exon3 15944313 15944742
chr_3 g2200 g2200.t1 cds g2200.t1.CDS3 15944313 15944742
chr_3 g2200 g2200.t1 exon g2200.t1.exon4 15944816 15944931
chr_3 g2200 g2200.t1 cds g2200.t1.CDS4 15944816 15944931
chr_3 g2200 g2200.t1 exon g2200.t1.exon5 15945010 15945058
chr_3 g2200 g2200.t1 cds g2200.t1.CDS5 15945010 15945058
chr_3 g2200 g2200.t1 exon g2200.t1.exon6 15945380 15945808
chr_3 g2200 g2200.t1 cds g2200.t1.CDS6 15945380 15945808
chr_3 g2200 g2200.t1 exon g2200.t1.exon7 15945878 15946338
chr_3 g2200 g2200.t1 cds g2200.t1.CDS7 15945878 15946338
chr_3 g2200 g2200.t1 exon g2200.t1.exon8 15946399 15946553
chr_3 g2200 g2200.t1 cds g2200.t1.CDS8 15946399 15946553
chr_3 g2200 g2200.t1 exon g2200.t1.exon9 15946620 15947160
chr_3 g2200 g2200.t1 cds g2200.t1.CDS9 15946620 15947160
chr_3 g2200 g2200.t1 TTS g2200.t1 NA NA

Sequences

>g2200.t1 Gene=g2200 Length=3642
ATGTTCGAATATCAACGTTATCGAAATCATCACCGTCGACAAATCAAAACAATATTGAAA
ATCATTATTTTTATCACGCTTTTATGGTGGCTTTTTATTTTTGGATTTAGGCAAAATGAA
AACGATATTGATAATGGCATCGTGGAACAATTAGATAGAAATCAAATTGAAAAGTCAAAT
CTGTACGCTGAAAATGAGGATCAAGATAAAGAAGCACTTGAATTAGAAAAAAATCAAGAT
GAAATTATGAAAGAGTTTCTACACAGACAAAAAGAAAATATGAAATTCGAAGAAGAAAAA
ATAAAAAAAGAAATTTCATTTGGTTCCATTAAATCAGCATTAAATAATATGGATCTTTTC
TCATTAAATGATTTATTAGGAGATTATGCTAAAGAAATAAGGAAAGAAGATTTTAAAGAA
ATTTTTGGTGATTTGAGTAATATAAATGTTTTCGATGTTGTTGGGAAAGCTAACAATTTC
ATAGATAATGCTAAAGAAAAAATTGAAAATGAATCCGAAAAACCAAAAGAAAAGGAAAAA
GCGATTGCTATAAAAATTCCACAGAATGAAATCATTATTCATAAGAAAGAAGAAACGACG
ATAAAAGATGAAGAACAAAATCTAGAAGATTCAAGACCAGATGAAATTGTTGATGAGGAA
AAAGAACCTTTTGGTCATTATGGAAAAGCTGTTACACTTCCATCAGATATTCCCGAAAAC
ATTAAGAAATTAGTTGAAGAAGGATGGAAAAATCATGAATTTAATGAATATGTTTCAAAT
TTAATTCCAATTAATCGAACTTTGCGTGATTTTCGTTCTGATTATTGTCGTGATATGCAG
AATGTTTATTCAAAAAATGTACCAAAAGTTTCGATTGTGATGGTCTTTCATAATGAAGCA
TGGAGCACTTTAATGCGTTCAATTCAGTCAATTCTCAATAGAACACCAGAAGAAGTGATT
CATGAGATAATTTTAGTTGATGATTGTTCAACTATGGAGCATTTGAAAGTGCAATTAGAT
GAATTTGTAAAAACTAACATCAAAATACGTTTGATTCGTCTTCCCGAGAGAAAAGGATTG
ATTTATGCACGCAATATTGGAGCTTCAGAAGCAAAAAGTGAAATTTTAGTTTTTCTTGAT
TCTCACATTGAATGCACAAAAGGATGGATTGAACCATTAATTGATAGAGTGATATTAAAC
GCAACAACGATTGCTGTTCCGGTTATTGAAATAATTAATGATCAAACATTCGCTTTAGAA
CCAAAAGATCATCCAACCTATGTGACTTTAGGTGGATTCAGTTGGAATCTTCAATTTCGA
TGGTTTCATAAGCAAACTTCTGAACTGAAAGAACCACAAGCACCTATCAAATCACCAACC
ATGGCTGGTGGACTTTTTGCAATTGCTTCAAAATTTTTCAAGCATCTTGGAATGTATGAT
CCAGATTTGGATCTTTGGGGAGGTGAAAATCTTGAGTTAAGTTTCAAAGCATGGATGTGT
GGTGGATCTATTGAAATTATACCGTGTTCTCATGTTGGTCACGTTTTTAGAAAAAAGTCA
CCGTATGTATGGCGAACGGGTGTCGATGTTTTGAGAAAAAATTCATTGCGTGTTGCAAAA
GTTTGGATGGACGATTATGCCGTGTTTTATAACTATGCAACAGGCTTTGAAAAAACAGAT
TATGGCAATATAAGTGCGCGTGTAAAACTTCGTCAAGATTTACAATGTAAAAGTTTTCGA
TGGTACTTAAAGAATGTTTATCCTGAGAAAAAAGTTCCATCAGAAGGTATTGCTTATGGG
CAAATTCAAAATCTTGGATATGGTGCTAAAATGTGTTTAGATGGAAAAGCAGCAAAAGAT
TCTCCATCACTTACTGTTAAGTATTGTCACGGACTTGGCGGAAACCAATTTTGGCAGTAT
GATGGTGAAATTGCAAGAGATAATTATTGTGTGACATATGATGCTCATAATTTAATGACA
CAACATTGTAAACGAAGCAGTAAACAGCTCTTCACATATGATCCTGATGAGAAGCAAATT
TATATTCATCATAGCGAAACAGTAGAAATAGATTCAGAAGAGAGAAAATTAAATGAAAAA
TTTAAAGCTAGCTTTATTGTGAATTTAGTAAAACCACCTTTAAATGTTTTTGGAAATAAT
GTAGTTGTTGGGGAATTAGGTGAACCAGTCCATTTACCAAGCAATTTATCAGATGAAATC
AAAAAATTAATTGATGATGGATGGAAAAAGCATGAATTTAATCAATATATTTCAGATTTG
ATTTCAGTAAAAAGAACTTTAAATGATTTTAGAATTGATGAATGTAAAAATCTTATTTAT
CTAAAAGAACTTCCACAAACATCAGTCATCATTACATTTTACAATGAAGCATGGTCAACG
CTTTTAAGGACTGTTCATTCAGTGCTAGATAGAGGTGGAGATCATGTGTTAGAAGTGATT
TTGGTCGATGACTTTTCTGACATGGTTCATTTAAAAGACCCACTTGACAGTTATTTCAAA
AATTTCCATAAAGTCAAAATTATTAGAATGTCATTTCGTGTTGGCCTTATTAAATCAAGA
ATTAATGGATTTTTAGAATCTAAAGGTGAAATTGTTACATTTCTTGATTCTCATTGTGAA
TGTGCTGATGGATGGCTTGAACCATTACTTGCAAAAGTTGCTGAAGATAATCAAATTGTG
GTAGTTCCTGTTATAGACACAATTGATGTAAATACATTCGAATATCAATGGATAAAAAGT
ATAAATTCAATTCCAATTGGTGGTTTTGACTGGACATTAAATTTTAAATGGGCTTTGAAT
TTTGTTGAAAATCAAATACCAACATTGCCAATTCGAACACCAGTAATGTCTGGTGGATTG
TTTGCAATCAATAGACATTTTTTCATAAGATTAGGATTATATGATCCAGAACTGGATATA
TGGGGTGGTGAAAATCTCGAATTAAGTTTTAAAACCTGGATGTGTGGTGGTACTTTAGAA
ATTATTCCTTGTTCCCATGTGGGACACGTTTTTCGTTCAAAATCTCCTTATAACAAAGAC
ATTGCGGGAAGAAATGCAATGAAGAGAAATTTAATTCGAGTAGCTCAAGTGTGGATGGAT
GACTATGCAAAATTCTTTCTCATGAGGATACATGCACGTGAGAATGATTATACTGATGTT
TCGATGAGATTGTTTAAGAAGGAAGAATGCAAGAGCTTCAAATGGTACCTTCAGAATATT
TATCCATCACAATTTGATGTAGCTGAAGCTATAGCACAAGGAAAAATGAGAAATTCTAAA
TTTAATGCAATCTGTCTTGAGATGAATGCACAGAGTTATAAATTAGAAAAATGTCGTGAC
AGAAGAGAACAATTTATTGTGTTGACTGATAAAAATGAGCTAAGACGAGACGATTATTGT
TTATCACTTCACAAAAATGGACAAATTAAGCTCGATTATTGTTTTGGGCTTGAGTCAGAG
CATGTTTGGCAGTATGATAAAAACTCACATTTTTTAAAGCACACTCTAGTCCAAAATTGT
CTCTCGATTGACATTGAAAGTAATAGTTTAATAATGGAAAATTGTAGCGATTCTATTGTA
CAACAGTGGACCTTCGATTTTATTAAAAATTTTCATTACTAA

>g2200.t1 Gene=g2200 Length=1213
MFEYQRYRNHHRRQIKTILKIIIFITLLWWLFIFGFRQNENDIDNGIVEQLDRNQIEKSN
LYAENEDQDKEALELEKNQDEIMKEFLHRQKENMKFEEEKIKKEISFGSIKSALNNMDLF
SLNDLLGDYAKEIRKEDFKEIFGDLSNINVFDVVGKANNFIDNAKEKIENESEKPKEKEK
AIAIKIPQNEIIIHKKEETTIKDEEQNLEDSRPDEIVDEEKEPFGHYGKAVTLPSDIPEN
IKKLVEEGWKNHEFNEYVSNLIPINRTLRDFRSDYCRDMQNVYSKNVPKVSIVMVFHNEA
WSTLMRSIQSILNRTPEEVIHEIILVDDCSTMEHLKVQLDEFVKTNIKIRLIRLPERKGL
IYARNIGASEAKSEILVFLDSHIECTKGWIEPLIDRVILNATTIAVPVIEIINDQTFALE
PKDHPTYVTLGGFSWNLQFRWFHKQTSELKEPQAPIKSPTMAGGLFAIASKFFKHLGMYD
PDLDLWGGENLELSFKAWMCGGSIEIIPCSHVGHVFRKKSPYVWRTGVDVLRKNSLRVAK
VWMDDYAVFYNYATGFEKTDYGNISARVKLRQDLQCKSFRWYLKNVYPEKKVPSEGIAYG
QIQNLGYGAKMCLDGKAAKDSPSLTVKYCHGLGGNQFWQYDGEIARDNYCVTYDAHNLMT
QHCKRSSKQLFTYDPDEKQIYIHHSETVEIDSEERKLNEKFKASFIVNLVKPPLNVFGNN
VVVGELGEPVHLPSNLSDEIKKLIDDGWKKHEFNQYISDLISVKRTLNDFRIDECKNLIY
LKELPQTSVIITFYNEAWSTLLRTVHSVLDRGGDHVLEVILVDDFSDMVHLKDPLDSYFK
NFHKVKIIRMSFRVGLIKSRINGFLESKGEIVTFLDSHCECADGWLEPLLAKVAEDNQIV
VVPVIDTIDVNTFEYQWIKSINSIPIGGFDWTLNFKWALNFVENQIPTLPIRTPVMSGGL
FAINRHFFIRLGLYDPELDIWGGENLELSFKTWMCGGTLEIIPCSHVGHVFRSKSPYNKD
IAGRNAMKRNLIRVAQVWMDDYAKFFLMRIHARENDYTDVSMRLFKKEECKSFKWYLQNI
YPSQFDVAEAIAQGKMRNSKFNAICLEMNAQSYKLEKCRDRREQFIVLTDKNELRRDDYC
LSLHKNGQIKLDYCFGLESEHVWQYDKNSHFLKHTLVQNCLSIDIESNSLIMENCSDSIV
QQWTFDFIKNFHY

Protein features from InterProScan

Transcript Database ID Name Start End E.value
23 g2200.t1 CDD cd02510 pp-GalNAc-T 291 587 2.72452E-144
21 g2200.t1 CDD cd00161 RICIN 600 676 1.12443E-7
22 g2200.t1 CDD cd02510 pp-GalNAc-T 788 1081 6.53565E-138
20 g2200.t1 CDD cd00161 RICIN 1094 1205 9.39828E-6
16 g2200.t1 Coils Coil Coil 58 85 -
15 g2200.t1 Coils Coil Coil 154 174 -
13 g2200.t1 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 205 594 2.7E-151
12 g2200.t1 Gene3D G3DSA:2.80.10.50 - 595 703 3.5E-16
14 g2200.t1 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 713 1089 1.7E-143
11 g2200.t1 Gene3D G3DSA:2.80.10.50 - 1090 1210 3.9E-16
5 g2200.t1 PANTHER PTHR11675 N-ACETYLGALACTOSAMINYLTRANSFERASE 79 683 0.0
6 g2200.t1 PANTHER PTHR11675 N-ACETYLGALACTOSAMINYLTRANSFERASE 733 1207 0.0
4 g2200.t1 Pfam PF00535 Glycosyl transferase family 2 291 471 1.5E-28
2 g2200.t1 Pfam PF00652 Ricin-type beta-trefoil lectin domain 599 687 5.0E-11
3 g2200.t1 Pfam PF00535 Glycosyl transferase family 2 788 970 7.2E-33
1 g2200.t1 Pfam PF00652 Ricin-type beta-trefoil lectin domain 1093 1203 5.8E-12
17 g2200.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 1 17 -
19 g2200.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 18 36 -
18 g2200.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 37 1213 -
28 g2200.t1 ProSiteProfiles PS50231 Lectin domain of ricin B chain profile. 598 663 10.545
27 g2200.t1 ProSiteProfiles PS50231 Lectin domain of ricin B chain profile. 1092 1206 15.441
26 g2200.t1 SMART SM00458 ricin_3 599 704 0.0079
25 g2200.t1 SMART SM00458 ricin_3 1093 1206 1.6E-4
10 g2200.t1 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 265 589 8.06E-57
7 g2200.t1 SUPERFAMILY SSF50370 Ricin B-like lectins 593 686 6.27E-15
9 g2200.t1 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 765 1082 1.41E-52
8 g2200.t1 SUPERFAMILY SSF50370 Ricin B-like lectins 1091 1206 1.87E-17
24 g2200.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 17 36 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

There are no GO annotations for this transcript.

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below. There were no conditions that were differentially expressed