Gene loci information

Transcript annotation

  • This transcript has been annotated as Polypeptide N-acetylgalactosaminyltransferase 5.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is shown below. CDS regions are colored green. TSS and TTS positions predicted from CTR-Seq data are indicated by solid circles and squares, respectively. Detailed coordinates are given in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g6549 g6549.t5 TTS g6549.t5 17606620 17606620
chr_2 g6549 g6549.t5 isoform g6549.t5 17607361 17611677
chr_2 g6549 g6549.t5 exon g6549.t5.exon1 17607361 17608688
chr_2 g6549 g6549.t5 cds g6549.t5.CDS1 17608515 17608688
chr_2 g6549 g6549.t5 exon g6549.t5.exon2 17608761 17608929
chr_2 g6549 g6549.t5 cds g6549.t5.CDS2 17608761 17608929
chr_2 g6549 g6549.t5 exon g6549.t5.exon3 17609123 17609187
chr_2 g6549 g6549.t5 cds g6549.t5.CDS3 17609123 17609187
chr_2 g6549 g6549.t5 exon g6549.t5.exon4 17609254 17609393
chr_2 g6549 g6549.t5 cds g6549.t5.CDS4 17609254 17609393
chr_2 g6549 g6549.t5 exon g6549.t5.exon5 17610088 17610268
chr_2 g6549 g6549.t5 cds g6549.t5.CDS5 17610088 17610268
chr_2 g6549 g6549.t5 exon g6549.t5.exon6 17610327 17610355
chr_2 g6549 g6549.t5 cds g6549.t5.CDS6 17610327 17610355
chr_2 g6549 g6549.t5 exon g6549.t5.exon7 17610418 17610493
chr_2 g6549 g6549.t5 cds g6549.t5.CDS7 17610418 17610493
chr_2 g6549 g6549.t5 exon g6549.t5.exon8 17610549 17610782
chr_2 g6549 g6549.t5 cds g6549.t5.CDS8 17610549 17610782
chr_2 g6549 g6549.t5 exon g6549.t5.exon9 17610849 17610949
chr_2 g6549 g6549.t5 cds g6549.t5.CDS9 17610849 17610949
chr_2 g6549 g6549.t5 exon g6549.t5.exon10 17611007 17611230
chr_2 g6549 g6549.t5 cds g6549.t5.CDS10 17611007 17611230
chr_2 g6549 g6549.t5 exon g6549.t5.exon11 17611334 17611677
chr_2 g6549 g6549.t5 cds g6549.t5.CDS11 17611334 17611467
chr_2 g6549 g6549.t5 TSS g6549.t5 NA NA
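
As a consistency check on the table above, the exon and CDS intervals can be summed and compared with the lengths given in the sequence headers (2891 nt and 508 aa). The sketch below assumes the coordinates are 1-based and inclusive, so a feature's length is End - Start + 1; the names EXONS, CDS and total_length are illustrative only.

```python
# Exon and CDS intervals copied from the table above.
# Assumption: coordinates are 1-based and inclusive.
EXONS = [
    (17607361, 17608688), (17608761, 17608929), (17609123, 17609187),
    (17609254, 17609393), (17610088, 17610268), (17610327, 17610355),
    (17610418, 17610493), (17610549, 17610782), (17610849, 17610949),
    (17611007, 17611230), (17611334, 17611677),
]
CDS = [
    (17608515, 17608688), (17608761, 17608929), (17609123, 17609187),
    (17609254, 17609393), (17610088, 17610268), (17610327, 17610355),
    (17610418, 17610493), (17610549, 17610782), (17610849, 17610949),
    (17611007, 17611230), (17611334, 17611467),
]


def total_length(intervals):
    """Sum the lengths of inclusive (start, end) intervals."""
    return sum(end - start + 1 for start, end in intervals)


exon_bp = total_length(EXONS)
cds_bp = total_length(CDS)
codons_without_stop = cds_bp // 3 - 1  # drop the terminal stop codon

print(exon_bp)              # 2891
print(cds_bp)               # 1527
print(codons_without_stop)  # 508
```

Under that assumption the exons sum to 2891 bp, matching the transcript length, and the CDS sums to 1527 bp, i.e. 508 codons plus a stop codon, matching the protein length.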

Sequences

>g6549.t5 Gene=g6549 Length=2891
GTTCATGAAGATAAATTTATCAGTCACCATCAACTTGACGAAGATCCAATTGTTAACGAT
GATGAAATAAATACAAATTTAAATAATGCTGATAATGATTTTAACAATGATGATGACGTG
ATACACATATCGAAAACATATCGATCGACAGATCTAAAGAAATGGCGGCCAGCGCCGGTG
GTGCGTGAAAATGTAGGAAAACCAGGTGAAATGGGCAAGCCGGTCAAAATGAAATCATAT
CAGCAGGAAGAAATGAAAGAGAAATTCAAAGAAAATCAATTCAATTTGTTAGCAAGTGAT
ATGATATGGTTGAATCGATCGCTCGCTGATGTGCGGCACAAAGACTGTCGAACTAAAACA
TACCCAAGTAGATTGCCAACAACAAGTATAGTTATAGTTTTTCATAATGAAGCATGGAGC
ACATTGCTCAGAACAATATGGAGTGTAATTAATCGTTCACCACGACCGTTATTAAAAGAA
ATCATTCTCGTTGATGATGCAAGCGAGCGTGAATATTTAGGTGAAAAATTAGAGGAATAT
GTAAAAACACTGCCTGTTCCAACATCAGTTTTACGCACCGGTAAAAGGTCTGGTCTCATT
AGAGCACGTTTATTGGGTGCTGCTGCTGTAAAAGGTCAAGTTATAACTTTTCTCGATGCT
CATTGTGAATGTACTGAAGGTTGGTTAGAACCTTTGTTGTCGAGAATTGCTTTGGACAGA
AAAACAGTTGTTTGTCCAATAATCGATGTTATTAGTGATGAAACGTTTGAATATGTTACT
GCTTCTGATCAAACTTGGGGTGGTTTTAATTGGAAGCTCAATTTTAGATGGTATCGAGTA
CCTGCTAGAGAAATGGCAAGAAGAAATAATGATAGGACTTCACCTCTTCGAACGCCAACT
ATGGCGGGTGGTTTGTTTTCAATAGATAAAGATTATTTCTATGAAATTGGTTCATATGAC
GAGGGAATGGATATTTGGGGTGGTGAAAATTTGGAAATGAGTTTTCGTATTTGGCAATGC
GGCGGAATTTTGGAAATTGCTCCCTGTAGCCACGTTGGTCATGTTTTCAGAGATAAAAGT
CCATATACTTTCCCTGGCGGTGTGGCAAATATTGTACTTAAAAATGCTGCTCGTGTTGCA
GCTGTTTGGCTTGACGAATGGAAAGAATTTTACTTCGCTATGAGTCCAGGTGCACGAAAA
ACATCAGCTGGAGATGTGTCTGCACGTCTTGCTCTAAGAGAAAAATTAAAGTGCAAGAGT
TTCAGATGGTATCTTGAAAATATTTATCCTGAGAGTCAAATGCCATTAGATTATTATTTC
CTCGGAGAAATTCGAAATGTCGAAAGTCAGAATTGTTTAGATACGATGGGTCATAAATCA
GGAGAGAAAGTTGGATCATCATATTGTCACGGACTGGGAGGTAATCAAGTTTTCGCCTAC
ACTAAACGACAACAAATTATGTCAGATGATAACTGTCTTGATGCCAGTAACGCTCATGGA
CCAGTTAACTTAGTTAGATGTCATGGAATGGGTGGGAATCAGGAATGGAGTTATGATGAG
ACTGACCTTACAATTAAGCATGTGAATTCAGGAAATTGTCTTACAAGAGCCACTCGAGAA
GATCCTTCAACGCCACAACTTCGTCCATGTAACTTTTCTAAAGGTCAACAGTGGTTAATG
CAATCAAAGTTTAAGTGGCAAACAAAGCAAGGTACAGAAGATGAAGAGAGAAGATAGGGG
TTGCTGAAATTGGCCAATATGGATTGATTTAATTTATTTTCCTACATCGTTCCTCTTATC
TTGTACAATATAAGACCCCATTTTTATATAATTTCTTTTTGCAAGAAACATCCGGAACAA
ACTTAAACTAAGTTCTTTAGATATACATAAACCAACAATGAACATGCGATTTCCATTTTA
GACAATCAGTTATTTTCGATAAAATACAAACCACTTAGATAGATTACAAAACAAAAGAAG
AACAAGTATGAAGTAAAAATTAAAATAATTAAGAAAGTTAGCAAAAGAATTAAATCTTTT
GCAAAAAAAATAATGTGATCAACCTATAATATGATGATTAAGTTGAAAAGAAAGGACCTT
TTTGCAACACATTGTTTAGTTCATTATTATTAATTGCGTTTAATACCTCGATGGATTTAG
AGCATTTTTATTAGAGAGTATATATAAAATTGAAATTTGCATATCACAATGCAAGGCACA
ACATAAATATTAGAATGGAACAATTTTTTCTAAGAAAAAAAATAGAGAGAGCCAGAATAA
TTGAGGATGTGTGTTTCAAAAGCCAGAAAATGGAGATAAAAAATTTATATTAAGTATATA
TGTTGAAATTCATCGGATTCATGTGTTTAGAAATTAAGTAGGAATCAATTTATAAAATAT
TTAAAATCCTTTTTATATTAACGTACTGCATTGTGTGCCACGAAGATTTTGGCAAAATAA
AAAAAAAATGAATTAGAAGAAAGAAATTAAAATCTCAATCTTCCGCAAGAACACACATTT
CAAAATGATTTATATTACAAAAAGAGATGTATGTAAATAGAACAAATTTTGAGTAATCAG
TTGATACAATTAAACACATGGTGGAGGTGCAATTTTCTTTACTCCCACAACTATCTTTAA
CAAAAATATCTCTCGTTTATAAAATATTATTTAATGAAGAAAATAGAGTAACGTTATCAA
AAATAATAGTATTTAGTCGAATAAGATGTTATTTTCAATTTTTTTCGCCATTAAATTATT
TTTTAATGAAAATCTTTTCCTCTTTATATTTTTTTTCTTTTGTCTAAATGAATTTTAAAT
AAAGAGTATTTAATTATGAAATTTAATTGTGTTCATTCATTCCTGAAAATTGATAATTTG
GCACTTTTAGA

>g6549.t5 Gene=g6549 Length=508
MGKPVKMKSYQQEEMKEKFKENQFNLLASDMIWLNRSLADVRHKDCRTKTYPSRLPTTSI
VIVFHNEAWSTLLRTIWSVINRSPRPLLKEIILVDDASEREYLGEKLEEYVKTLPVPTSV
LRTGKRSGLIRARLLGAAAVKGQVITFLDAHCECTEGWLEPLLSRIALDRKTVVCPIIDV
ISDETFEYVTASDQTWGGFNWKLNFRWYRVPAREMARRNNDRTSPLRTPTMAGGLFSIDK
DYFYEIGSYDEGMDIWGGENLEMSFRIWQCGGILEIAPCSHVGHVFRDKSPYTFPGGVAN
IVLKNAARVAAVWLDEWKEFYFAMSPGARKTSAGDVSARLALREKLKCKSFRWYLENIYP
ESQMPLDYYFLGEIRNVESQNCLDTMGHKSGEKVGSSYCHGLGGNQVFAYTKRQQIMSDD
NCLDASNAHGPVNLVRCHGMGGNQEWSYDETDLTIKHVNSGNCLTRATREDPSTPQLRPC
NFSKGQQWLMQSKFKWQTKQGTEDEERR
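
The protein record appears to be the conceptual translation of the coding region of the transcript record. A minimal sketch, assuming the standard genetic code and using a hypothetical helper named translate, checks this on two short fragments copied from the nucleotide sequence above: the 36 nt beginning ATGGGCAAGCCG and the 24 nt ending just after the stop codon.

```python
from itertools import product

# Standard genetic code in TCAG order (assumption: no alternative code applies).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(nt):
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the transcript sequence above.
start_fragment = "ATGGGCAAGCCGGTCAAAATGAAATCATATCAGCAG"
end_fragment = "GAAGATGAAGAGAGAAGATAGGGG"

print(translate(start_fragment))  # MGKPVKMKSYQQ
print(translate(end_fragment))    # EDEERR
```

The two outputs match the first twelve and last six residues of the protein sequence, respectively.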

Protein features from InterProScan

Row Transcript Database ID Name Start End E.value
12 g6549.t5 CDD cd02510 pp-GalNAc-T 59 359 0.000
11 g6549.t5 CDD cd00161 RICIN 372 490 0.000
9 g6549.t5 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 1 367 0.000
8 g6549.t5 Gene3D G3DSA:2.80.10.50 - 368 495 0.000
3 g6549.t5 PANTHER PTHR11675 N-ACETYLGALACTOSAMINYLTRANSFERASE 9 497 0.000
4 g6549.t5 PANTHER PTHR11675:SF101 POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE 1 9 497 0.000
2 g6549.t5 Pfam PF00535 Glycosyl transferase family 2 59 243 0.000
1 g6549.t5 Pfam PF00652 Ricin-type beta-trefoil lectin domain 371 488 0.000
10 g6549.t5 ProSiteProfiles PS50231 Lectin domain of ricin B chain profile. 382 491 21.076
7 g6549.t5 SMART SM00458 ricin_3 368 491 0.000
6 g6549.t5 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 35 361 0.000
5 g6549.t5 SUPERFAMILY SSF50370 Ricin B-like lectins 364 491 0.000

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
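
The 0.5 cutoff can be applied to a per-residue score profile to list candidate disordered segments. The sketch below is illustrative only: disordered_regions is a hypothetical helper, and the example scores are made up rather than taken from this transcript.

```python
def disordered_regions(scores, threshold=0.5):
    """Return 1-based inclusive (start, end) runs where score > threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # open a new run
        elif start is not None:
            regions.append((start, pos - 1))  # close the current run
            start = None
    if start is not None:
        regions.append((start, len(scores)))
    return regions


# Made-up scores for illustration; not data from this page.
example_scores = [0.2, 0.6, 0.7, 0.4, 0.5, 0.9, 0.8]
print(disordered_regions(example_scores))  # [(2, 3), (6, 7)]
```

A score of exactly 0.5 is treated as ordered here, following the "over 0.5" wording.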

GO terms from InterProScan

There are no GO annotations for this transcript.

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean ± standard deviation.
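
For reference, a summary of this kind can be computed from replicate TPM values as sketched below. The page does not say whether the sample or population standard deviation was used; the sketch assumes the sample standard deviation, and both the helper summarize_tpm and the replicate values are hypothetical.

```python
from statistics import mean, stdev


def summarize_tpm(replicates):
    """Return (mean, sample standard deviation) of replicate TPM values."""
    return mean(replicates), stdev(replicates)


# Made-up replicate values for illustration; not data from this page.
example_tpm = [10.0, 12.0, 14.0]
avg, sd = summarize_tpm(example_tpm)
print(f"{avg:.1f} +/- {sd:.1f}")  # 12.0 +/- 2.0
```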

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity, with TPM values calculated by RSEM. A transcript was called differentially expressed when it met both criteria: (1) FDR < 0.05 and (2) fold change > 2. DE status and fold change between conditions are shown in the plot below.
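
The two criteria can be expressed as a simple filter. This is a sketch rather than the pipeline's own code: it assumes the fold-change cutoff applies in either direction (up or down), that fold change is supplied on a log2 scale as DESeq2 reports it, and that both criteria must hold; the function name is hypothetical.

```python
import math


def is_differentially_expressed(fdr, log2_fold_change,
                                fdr_cutoff=0.05, fold_cutoff=2.0):
    """Apply both criteria: FDR below cutoff and fold change above cutoff.

    Assumption: the fold-change test is two-sided, so a change of more
    than 2-fold in either direction qualifies.
    """
    passes_fdr = fdr < fdr_cutoff
    passes_fold = abs(log2_fold_change) > math.log2(fold_cutoff)
    return passes_fdr and passes_fold


# Made-up values for illustration; not data from this page.
print(is_differentially_expressed(0.01, 1.5))   # True
print(is_differentially_expressed(0.01, 0.5))   # False (fold change too small)
print(is_differentially_expressed(0.20, 3.0))   # False (FDR too high)
```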

Raw TPM values