Gene loci information

Transcript annotation

  • This transcript has been annotated as N-acetylgalactosaminyltransferase 6.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_4 g14846 g14846.t1 isoform g14846.t1 2648577 2651281
chr_4 g14846 g14846.t1 exon g14846.t1.exon1 2648577 2648627
chr_4 g14846 g14846.t1 cds g14846.t1.CDS1 2648577 2648627
chr_4 g14846 g14846.t1 exon g14846.t1.exon2 2648772 2649414
chr_4 g14846 g14846.t1 cds g14846.t1.CDS2 2648772 2649414
chr_4 g14846 g14846.t1 exon g14846.t1.exon3 2649569 2649702
chr_4 g14846 g14846.t1 cds g14846.t1.CDS3 2649569 2649702
chr_4 g14846 g14846.t1 exon g14846.t1.exon4 2649879 2649933
chr_4 g14846 g14846.t1 cds g14846.t1.CDS4 2649879 2649933
chr_4 g14846 g14846.t1 exon g14846.t1.exon5 2649996 2650153
chr_4 g14846 g14846.t1 cds g14846.t1.CDS5 2649996 2650153
chr_4 g14846 g14846.t1 exon g14846.t1.exon6 2650218 2650253
chr_4 g14846 g14846.t1 cds g14846.t1.CDS6 2650218 2650253
chr_4 g14846 g14846.t1 exon g14846.t1.exon7 2650356 2650465
chr_4 g14846 g14846.t1 cds g14846.t1.CDS7 2650356 2650465
chr_4 g14846 g14846.t1 exon g14846.t1.exon8 2650527 2650752
chr_4 g14846 g14846.t1 cds g14846.t1.CDS8 2650527 2650752
chr_4 g14846 g14846.t1 exon g14846.t1.exon9 2650802 2651281
chr_4 g14846 g14846.t1 cds g14846.t1.CDS9 2650802 2651281
chr_4 g14846 g14846.t1 TSS g14846.t1 NA NA
chr_4 g14846 g14846.t1 TTS g14846.t1 NA NA

Sequences

>g14846.t1 Gene=g14846 Length=1893
ATGGTTTCAATTAAAAGATTTTTTATAAAATTAAATTTTATCTTCCTTCAAGCACTCAAG
AGACAATATTACACAGTTCAACTTTTGATTATGATTTTTGCTTTGACATTTGCTGCTCTT
TATATTTTAATCACAATCAACAATGATCCATTACTAGTTGATCAGCCTTACATTTATATT
GAACCATTAGCTTCATATTATCGATTTAAACATTCACCATTAAAAAAAGATTGGCATAAT
TATGAATTTATGGATTTCGAAGCATCTCGAGTTGGTCCTGGTGAGAATGGAACTGGAGTT
TTTTTATCAGGTGACGAAGCTTCTCTTGCTCAACAAATTTTTGAAGAAAATAAACACAAT
GGACTTGTAAGTGACAAAATAGCACGTGATCGAAGTTTACCTGATACTCGACCACCAGAA
TGCATGACACGAACTTATTTGAGTGATTTGCCAAAAGTTTCAATTATTATTCCATTTCAT
AATGAAATTTTAAGCACTTTAACTCGAACTGTTCACAGTGTTTTTAATCGATCGCCACCT
GAATTGCTAAAGGAAGTGATTTTGGTCAATGATCATAGTGACAAAGAACATTGTTATGGT
GAACTTGAGGAATACATTGCAACACATTTTGATATCAATAAAGTAAGAATTTTAGTGCTG
ACAAAGAGATCGGGATTGATGTGGGCGAGATTAGCTGGTGCTCGTGCTGCGAGTGGTGAT
GTGTTGATTTTTATGGATTGTCACACTGAAGCTAATATCAATTGGTTACCACCACTTATA
GAACCAATTGCTTTGAATTATCGTACTTGTGTTTGTCCTTACATTGATGTCATAAATGCA
AAAGATTATCACTACACAGGTCTTCAACATGGAACTCGAGGAGTTTTCAACTGGCAATTG
ATTTATCAATTTTTACCACTTCGACCTGAAGATCAATCTGACCCAACTGAACCTTTCAAA
TCACCCGTCATGATGGGTTGTGTTTTTGCAATTTCTGCTAAATTTTTCTGGGAACTTGGT
GGGTACGATCCAGCACTTGAGATTTGGGGCGGTGAGCAGTATGAATTGAGTTTTAAGGTT
TGGCTTTGTAACGGACAGCAACTTGATGCGCCATGTTCACGAGTTGGTCATCTTTATCGG
CCTCGACCATTCACAAATGCTGGAAATCATACAAATTATGTTTCATACAACTTTAAGCGT
GTTGCAGAAGTTTGGATGGATGAATATGCTCAATATATTTATAAACGTGATGAAAAGAAA
TGGAATGAAATTGATGTTGGTGATATTTCGCATATGATGAATCTTAAGAAAAAACTAAAT
TGCAAACCATTTAAATATTTTTTGGATGAAGTTGCTCCTGATATGCTTGATCGATATCCT
TATATTGAACCACCTTCATTTGCTAGTGGTGCTATTCAATCAATAGCAAATCCACAATAT
TGTGTTGACACATTAGAAACTGAACGAGAAAAACAAGTTGGAATTTATAGATGTCGTTCT
AATCTTGTCAATCCAGGTTGGCATCAAGAATTTAGACTTCGAAATCATCGTGATATTTCA
ATTGAACATTCAAACAGTGACTGCCTTGATTTTAATAATAAAAGAATTCTTTATTACGCT
TGTAAATTTAATCAAGAAAATCAATACTTTCGATATGATTTAAAAACTCAGCAAATTTAT
TGTGGATCAAAATGGCAAAATCAATGCATGGATATTGATATGAGAACAAAATTACTAATT
TATGCACCATGTGATGAAACTAAATTGACACAGAAATGGAAATGGGGATTTTTAAATGAA
ACAATGTTAAATGATTGGACAAATTATGGTAAACCAATCAATGATGAAAAGGAAATTGAA
GATTTACTAAAAGAAGTTATAAATGATAAATAA

>g14846.t1 Gene=g14846 Length=630
MVSIKRFFIKLNFIFLQALKRQYYTVQLLIMIFALTFAALYILITINNDPLLVDQPYIYI
EPLASYYRFKHSPLKKDWHNYEFMDFEASRVGPGENGTGVFLSGDEASLAQQIFEENKHN
GLVSDKIARDRSLPDTRPPECMTRTYLSDLPKVSIIIPFHNEILSTLTRTVHSVFNRSPP
ELLKEVILVNDHSDKEHCYGELEEYIATHFDINKVRILVLTKRSGLMWARLAGARAASGD
VLIFMDCHTEANINWLPPLIEPIALNYRTCVCPYIDVINAKDYHYTGLQHGTRGVFNWQL
IYQFLPLRPEDQSDPTEPFKSPVMMGCVFAISAKFFWELGGYDPALEIWGGEQYELSFKV
WLCNGQQLDAPCSRVGHLYRPRPFTNAGNHTNYVSYNFKRVAEVWMDEYAQYIYKRDEKK
WNEIDVGDISHMMNLKKKLNCKPFKYFLDEVAPDMLDRYPYIEPPSFASGAIQSIANPQY
CVDTLETEREKQVGIYRCRSNLVNPGWHQEFRLRNHRDISIEHSNSDCLDFNNKRILYYA
CKFNQENQYFRYDLKTQQIYCGSKWQNQCMDIDMRTKLLIYAPCDETKLTQKWKWGFLNE
TMLNDWTNYGKPINDEKEIEDLLKEVINDK

Protein features from InterProScan

Transcript Database ID Name Start End E.value
13 g14846.t1 CDD cd02510 pp-GalNAc-T 154 452 2.52945E-153
12 g14846.t1 CDD cd00161 RICIN 470 595 6.73926E-11
8 g14846.t1 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 73 462 2.4E-151
7 g14846.t1 Gene3D G3DSA:2.80.10.50 - 463 609 3.1E-20
3 g14846.t1 PANTHER PTHR11675:SF41 POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE 10 71 607 3.0E-151
4 g14846.t1 PANTHER PTHR11675 N-ACETYLGALACTOSAMINYLTRANSFERASE 71 607 3.0E-151
2 g14846.t1 Pfam PF00535 Glycosyl transferase family 2 154 339 4.6E-29
1 g14846.t1 Pfam PF00652 Ricin-type beta-trefoil lectin domain 469 593 5.1E-17
9 g14846.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 1 22 -
11 g14846.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 23 44 -
10 g14846.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 45 630 -
16 g14846.t1 ProSiteProfiles PS50231 Lectin domain of ricin B chain profile. 468 596 15.212
15 g14846.t1 SMART SM00458 ricin_3 469 596 3.9E-5
6 g14846.t1 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 130 454 2.25E-56
5 g14846.t1 SUPERFAMILY SSF50370 Ricin B-like lectins 462 595 4.92E-17
14 g14846.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 23 45 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

There are no GO annotations for this transcript.

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values