Gene loci information

Transcript annotation

  • This transcript has been annotated as Polypeptide N-acetylgalactosaminyltransferase 1.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g1140 g1140.t5 TTS g1140.t5 8382645 8382645
chr_3 g1140 g1140.t5 isoform g1140.t5 8383259 8384636
chr_3 g1140 g1140.t5 exon g1140.t5.exon1 8383259 8383362
chr_3 g1140 g1140.t5 cds g1140.t5.CDS1 8383259 8383362
chr_3 g1140 g1140.t5 exon g1140.t5.exon2 8383421 8383536
chr_3 g1140 g1140.t5 cds g1140.t5.CDS2 8383421 8383536
chr_3 g1140 g1140.t5 exon g1140.t5.exon3 8383613 8383698
chr_3 g1140 g1140.t5 cds g1140.t5.CDS3 8383613 8383698
chr_3 g1140 g1140.t5 exon g1140.t5.exon4 8383774 8384034
chr_3 g1140 g1140.t5 cds g1140.t5.CDS4 8383774 8384034
chr_3 g1140 g1140.t5 exon g1140.t5.exon5 8384114 8384303
chr_3 g1140 g1140.t5 cds g1140.t5.CDS5 8384114 8384303
chr_3 g1140 g1140.t5 exon g1140.t5.exon6 8384369 8384435
chr_3 g1140 g1140.t5 cds g1140.t5.CDS6 8384369 8384435
chr_3 g1140 g1140.t5 exon g1140.t5.exon7 8384510 8384636
chr_3 g1140 g1140.t5 cds g1140.t5.CDS7 8384510 8384516
chr_3 g1140 g1140.t5 TSS g1140.t5 NA NA

Sequences

>g1140.t5 Gene=g1140 Length=951
ATTGGTGGTTTCACATTAGATGGGCATTTTGATTGGCATGATGTGCCTGAAAGAGAAAGA
GAACGTCAAAGACGCGAGTGTAAAGATGAAATTGAAATTTGTCCAACATATTCACCAACA
ATGGCTGGTGGTCTGTTTGCAATCTCAAGGGACTATTTTTGGGAAATTGGTTCATATGAT
GAGCAGATGGATGGATGGGGTGGCGAAAATCTGGAAATGAGTTTTCGAATCTGGATGTGC
GGTGGAACATTAGAAACAATTCCTTGCAGTCGAATTGGACACATTTTTAGAGAGTTTCAT
CCATATAGCTTTCCAAATGACAAAGATACTCACGGAATCAATACAGTTCGAATGGCAAAA
GTTTGGATGGACGATTATCAAGAGCTTCTTTATATGAATAGACCTGATTTGAGAAATCAT
CCTGATGTAGGCGACGTGACACACAGAAAAGTATTGAGAGATAAACTTAAGTGCAAATCC
TTTGAATGGTATATGCAAAATATATACCCTGAAAAATTCATTCCAACAAGAAATGTTCAA
AATTATGGACGAATATCAGCTATAGAAGACGATCGATTTTGCTTTGATGATTTACAACAG
AATATTGATGAGCCATATAATTTAGGAGTTTATTCATGCTATAAGCATGATATTGCACCA
TCACAACTTTTCTCTTATACTTATAATAAAGTATTACGAACAGAAAGAAGTTGTGCAACT
ATCGATGATCGACGAAGCACTAAATATATTGTGATGATTCCTTGCAATAGTGACGATGAA
GTTACAGATACTTGGGTTCATACTAGTTTTAATCAATTTAAACATGAACAAACGGGTCTT
TGCATAGATCGAAAAAATTTGGATAAAAATTTACTTCATGCAGCTGTATGTGATTCATTA
TCAAAAACACAAAAATGGGAATTTCAAAAGACAAAAAAGCAGAGCATTTAG

>g1140.t5 Gene=g1140 Length=276
MAGGLFAISRDYFWEIGSYDEQMDGWGGENLEMSFRIWMCGGTLETIPCSRIGHIFREFH
PYSFPNDKDTHGINTVRMAKVWMDDYQELLYMNRPDLRNHPDVGDVTHRKVLRDKLKCKS
FEWYMQNIYPEKFIPTRNVQNYGRISAIEDDRFCFDDLQQNIDEPYNLGVYSCYKHDIAP
SQLFSYTYNKVLRTERSCATIDDRRSTKYIVMIPCNSDDEVTDTWVHTSFNQFKHEQTGL
CIDRKNLDKNLLHAAVCDSLSKTQKWEFQKTKKQSI

Protein features from InterProScan

Transcript Database ID Name Start End E.value
11 g1140.t5 CDD cd00161 RICIN 143 268 0.0000000
9 g1140.t5 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 1 136 0.0000000
8 g1140.t5 Gene3D G3DSA:2.80.10.50 - 141 274 0.0000000
3 g1140.t5 PANTHER PTHR11675 N-ACETYLGALACTOSAMINYLTRANSFERASE 1 274 0.0000000
4 g1140.t5 PANTHER PTHR11675:SF43 POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE 1 1 274 0.0000000
1 g1140.t5 Pfam PF02709 N-terminal domain of galactosyltransferase 2 56 0.0000018
2 g1140.t5 Pfam PF00652 Ricin-type beta-trefoil lectin domain 142 266 0.0000000
10 g1140.t5 ProSiteProfiles PS50231 Lectin domain of ricin B chain profile. 141 269 14.4720000
7 g1140.t5 SMART SM00458 ricin_3 140 269 0.0000150
6 g1140.t5 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 2 131 0.0000000
5 g1140.t5 SUPERFAMILY SSF50370 Ricin B-like lectins 126 269 0.0000000

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

There are no GO annotations for this transcript.

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values