Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative polypeptide N-acetylgalactosaminyltransferase 9.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g2201 g2201.t1 TSS g2201.t1 15947314 15947314
chr_3 g2201 g2201.t1 isoform g2201.t1 15947380 15950456
chr_3 g2201 g2201.t1 exon g2201.t1.exon1 15947380 15948113
chr_3 g2201 g2201.t1 cds g2201.t1.CDS1 15947380 15948113
chr_3 g2201 g2201.t1 exon g2201.t1.exon2 15948178 15948233
chr_3 g2201 g2201.t1 cds g2201.t1.CDS2 15948178 15948233
chr_3 g2201 g2201.t1 exon g2201.t1.exon3 15948293 15948606
chr_3 g2201 g2201.t1 cds g2201.t1.CDS3 15948293 15948606
chr_3 g2201 g2201.t1 exon g2201.t1.exon4 15949432 15949509
chr_3 g2201 g2201.t1 cds g2201.t1.CDS4 15949432 15949509
chr_3 g2201 g2201.t1 exon g2201.t1.exon5 15949569 15949644
chr_3 g2201 g2201.t1 cds g2201.t1.CDS5 15949569 15949644
chr_3 g2201 g2201.t1 exon g2201.t1.exon6 15949705 15950456
chr_3 g2201 g2201.t1 cds g2201.t1.CDS6 15949705 15950456
chr_3 g2201 g2201.t1 TTS g2201.t1 15950464 15950464

Sequences

>g2201.t1 Gene=g2201 Length=2010
ATGTATACGTCAAAATTTAGACTTTGTAGAATTTTATTTACTGCTGGCTTCATATGGCTT
ATGACTGTTTTAACTTTATTTCACTATTGGGAATTGATTATGGAACATCAAATAAAGTTA
ACAAAATTTTCAAAAAATCCTACTGATGATAATTTAGTGTCTCAAGATTTTCATACAGAA
AAGTTAATTCAAACAACCGCGAGCAATAAAGAAGCAGAGAAAATAAGCACGAGAGAGGCA
ACAACGTGGAAAAAATTTACAATAATGACAGATAAACGAACACGAAGACCTCACAAGAGT
AGAAAAACCACAACAGAATCTACCACGACCACTACAACAACCACATCATCCTCCACATCT
ACAACTTTATCAACTACACAAGCAACAACACGAAAAAAGAGAGCATATGAGAGAGTTATC
ACACCTGTTCCTGATGCAATCTACACTCTCCTAAATTTGCCTAAAAATGTTGGCGAAGGT
GGTGCTATCGTTCGAATTCAAGATCCACCACCTGAAATTAAGCAGAAAATTGATGCCGGC
TGGTCGAGACATGAATTTAATGAATTTCTTTCTGATTTAATCTCAGTTAATCGCTCAGTT
CCAGACCCAAGAAGTGAATACTGTAAACAAAAAGGATTATATCTTGAAAATCTGCCAATG
ACGTCAGTTATCATTATTTTTCACAATGAAGCATGGTCGACATTACTAAGGACAGTACAT
TCTGTTCTCAATAGATCTCCACCACATTTAATTAAGGAAATTATTTTAGTTGATGATTAT
TCAAATATGTTACATTTAAAAGAAGCTCTCGAAGATTACATGTCAGATTTTTCTAAAGTA
AAAATTGTTCGAATGCCAAAACGTGTAGGATTAATAAAGGCGAGGATTGCTGGAACTGAA
GCAGCAACAGCAGAAACTCTGACATTTTTAGATTCTCATGTTGAATGTGCAGAAGGTTGG
CTTGAACCACTTTTAGATAGAATAGCAAGAAATAAAACCAAAGTTGTTTGCCCTGTGATT
GATGTCATTGATGATGATTCATTAGCATTCAGTTATCAAGATGCAAGTGGATTGCAAGTT
GGCGGTTTTGATTGGGAAATGACTTTTGATTGGCATATGGTATCAGAAAGAGAAAGAAAA
CGTAAGCAAAATCCATGGGAACCAACTTTTTCACCAACTATGGCTGGTGGTTTATTTTCA
ATTGACAAAGAATTCTTCATAAAATTGGGACTTTATGATGATGGTTTTGATATTTGGGGT
GGTGAAAATCTCGAATTAAGTTTTAAAACTTGGATGTGCGGCGGAACTTTGGAAATAATT
CCATGCTCTCACGTTGGACACATCTTTAGAAAAAAATCTCCATATAAATGGCGACCAGGA
GTTGATGTGCTAAAAATAAATACAATAAGACTTGTTGAAGTGTGGCTTGATGATTATGCT
AAATTCTATTATGTTAGAAGAGGCGCAGACAAAGGAAATTTTGGTGATATTTCATCAAGA
GTGGAATTAAGGAAAAATTTAAATTGCAAAAGTTTTAAATGGTACTTGGAAAATGTCTAT
CCAGAACAGATTGTGCCAGATAATCTTGGTGAAGGTTATATTAAAAATATTGGACTTGAT
AATTCACCATGCCTTGATGCTCCAATAATAGACAGTCAAATGAAAACTCAAGTGGCATTT
TATGGATGTCATAATATGGGTGGAAATCAATTTTTTGAATTTTCTCGTGATCATGAAATC
AAAAGAAAAGAGCGATGTTTAGATTATTCAAAAGATACAAATGAATTAGCATTTGTGCCG
TGTCACAAAAGCAAAGGCAATCAATACTGGTCTTATATCATAGACACAAAACAATTTTAT
CATAAGCAGTCCAACAGATGTCTTTCATTAGAGCGACAAAGTTATACATATTTGCCATCA
ATGGTGCATTGTGATGAAAAGAGTAAAAATCAACAATGGGAATTTCAATATGTTTATCCA
GAAAAATTTAATAAAAAAACCACAAATTGA

>g2201.t1 Gene=g2201 Length=669
MYTSKFRLCRILFTAGFIWLMTVLTLFHYWELIMEHQIKLTKFSKNPTDDNLVSQDFHTE
KLIQTTASNKEAEKISTREATTWKKFTIMTDKRTRRPHKSRKTTTESTTTTTTTTSSSTS
TTLSTTQATTRKKRAYERVITPVPDAIYTLLNLPKNVGEGGAIVRIQDPPPEIKQKIDAG
WSRHEFNEFLSDLISVNRSVPDPRSEYCKQKGLYLENLPMTSVIIIFHNEAWSTLLRTVH
SVLNRSPPHLIKEIILVDDYSNMLHLKEALEDYMSDFSKVKIVRMPKRVGLIKARIAGTE
AATAETLTFLDSHVECAEGWLEPLLDRIARNKTKVVCPVIDVIDDDSLAFSYQDASGLQV
GGFDWEMTFDWHMVSERERKRKQNPWEPTFSPTMAGGLFSIDKEFFIKLGLYDDGFDIWG
GENLELSFKTWMCGGTLEIIPCSHVGHIFRKKSPYKWRPGVDVLKINTIRLVEVWLDDYA
KFYYVRRGADKGNFGDISSRVELRKNLNCKSFKWYLENVYPEQIVPDNLGEGYIKNIGLD
NSPCLDAPIIDSQMKTQVAFYGCHNMGGNQFFEFSRDHEIKRKERCLDYSKDTNELAFVP
CHKSKGNQYWSYIIDTKQFYHKQSNRCLSLERQSYTYLPSMVHCDEKSKNQQWEFQYVYP
EKFNKKTTN

Protein features from InterProScan

Transcript Database ID Name Start End E.value
13 g2201.t1 CDD cd02510 pp-GalNAc-T 222 520 2.66328E-172
12 g2201.t1 CDD cd00161 RICIN 532 655 1.38221E-20
8 g2201.t1 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 148 526 3.8E-162
7 g2201.t1 Gene3D G3DSA:2.80.10.50 - 528 662 4.3E-33
17 g2201.t1 MobiDBLite mobidb-lite consensus disorder prediction 93 130 -
16 g2201.t1 MobiDBLite mobidb-lite consensus disorder prediction 104 130 -
3 g2201.t1 PANTHER PTHR11675 N-ACETYLGALACTOSAMINYLTRANSFERASE 6 657 1.5E-206
4 g2201.t1 PANTHER PTHR11675:SF112 POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE 9-RELATED 6 657 1.5E-206
2 g2201.t1 Pfam PF00535 Glycosyl transferase family 2 222 405 1.6E-36
1 g2201.t1 Pfam PF00652 Ricin-type beta-trefoil lectin domain 532 653 6.9E-24
9 g2201.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 1 11 -
11 g2201.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 12 30 -
10 g2201.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 31 669 -
18 g2201.t1 ProSiteProfiles PS50231 Lectin domain of ricin B chain profile. 530 656 21.459
15 g2201.t1 SMART SM00458 ricin_3 531 656 2.4E-23
6 g2201.t1 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 197 522 5.16E-57
5 g2201.t1 SUPERFAMILY SSF50370 Ricin B-like lectins 526 656 6.19E-30
14 g2201.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 12 34 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

There are no GO annotations for this transcript.

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below. There were no conditions that were differentially expressed