Gene loci information

Transcript annotation

  • This transcript has been annotated as N-acetylgalactosaminyltransferase 6.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_4 g14846 g14846.t2 isoform g14846.t2 2648577 2651281
chr_4 g14846 g14846.t2 exon g14846.t2.exon1 2648577 2649414
chr_4 g14846 g14846.t2 cds g14846.t2.CDS1 2648811 2649414
chr_4 g14846 g14846.t2 exon g14846.t2.exon2 2649569 2649702
chr_4 g14846 g14846.t2 cds g14846.t2.CDS2 2649569 2649702
chr_4 g14846 g14846.t2 exon g14846.t2.exon3 2649879 2649933
chr_4 g14846 g14846.t2 cds g14846.t2.CDS3 2649879 2649933
chr_4 g14846 g14846.t2 exon g14846.t2.exon4 2649996 2650153
chr_4 g14846 g14846.t2 cds g14846.t2.CDS4 2649996 2650153
chr_4 g14846 g14846.t2 exon g14846.t2.exon5 2650218 2650253
chr_4 g14846 g14846.t2 cds g14846.t2.CDS5 2650218 2650253
chr_4 g14846 g14846.t2 exon g14846.t2.exon6 2650356 2650465
chr_4 g14846 g14846.t2 cds g14846.t2.CDS6 2650356 2650465
chr_4 g14846 g14846.t2 exon g14846.t2.exon7 2650527 2650752
chr_4 g14846 g14846.t2 cds g14846.t2.CDS7 2650527 2650752
chr_4 g14846 g14846.t2 exon g14846.t2.exon8 2650802 2651281
chr_4 g14846 g14846.t2 cds g14846.t2.CDS8 2650802 2651281
chr_4 g14846 g14846.t2 TSS g14846.t2 NA NA
chr_4 g14846 g14846.t2 TTS g14846.t2 NA NA

Sequences

>g14846.t2 Gene=g14846 Length=2037
ATGGTTTCAATTAAAAGATTTTTTATAAAATTAAATTTTATCTTCCTTCAAGTAAGTTAA
AAACAAAAAAAAATATTTTTTCTTTATTAATAAACTAGAAAAAGAAGCAAAAAAATTAAA
CTGTCATGTGGAACAACTCATCAATTTAAAGAAATTTTAAAATTTTCACAACACAAGTAA
AAAATTTATTTTTAGGCACTCAAGAGACAATATTACACAGTTCAACTTTTGATTATGATT
TTTGCTTTGACATTTGCTGCTCTTTATATTTTAATCACAATCAACAATGATCCATTACTA
GTTGATCAGCCTTACATTTATATTGAACCATTAGCTTCATATTATCGATTTAAACATTCA
CCATTAAAAAAAGATTGGCATAATTATGAATTTATGGATTTCGAAGCATCTCGAGTTGGT
CCTGGTGAGAATGGAACTGGAGTTTTTTTATCAGGTGACGAAGCTTCTCTTGCTCAACAA
ATTTTTGAAGAAAATAAACACAATGGACTTGTAAGTGACAAAATAGCACGTGATCGAAGT
TTACCTGATACTCGACCACCAGAATGCATGACACGAACTTATTTGAGTGATTTGCCAAAA
GTTTCAATTATTATTCCATTTCATAATGAAATTTTAAGCACTTTAACTCGAACTGTTCAC
AGTGTTTTTAATCGATCGCCACCTGAATTGCTAAAGGAAGTGATTTTGGTCAATGATCAT
AGTGACAAAGAACATTGTTATGGTGAACTTGAGGAATACATTGCAACACATTTTGATATC
AATAAAGTAAGAATTTTAGTGCTGACAAAGAGATCGGGATTGATGTGGGCGAGATTAGCT
GGTGCTCGTGCTGCGAGTGGTGATGTGTTGATTTTTATGGATTGTCACACTGAAGCTAAT
ATCAATTGGTTACCACCACTTATAGAACCAATTGCTTTGAATTATCGTACTTGTGTTTGT
CCTTACATTGATGTCATAAATGCAAAAGATTATCACTACACAGGTCTTCAACATGGAACT
CGAGGAGTTTTCAACTGGCAATTGATTTATCAATTTTTACCACTTCGACCTGAAGATCAA
TCTGACCCAACTGAACCTTTCAAATCACCCGTCATGATGGGTTGTGTTTTTGCAATTTCT
GCTAAATTTTTCTGGGAACTTGGTGGGTACGATCCAGCACTTGAGATTTGGGGCGGTGAG
CAGTATGAATTGAGTTTTAAGGTTTGGCTTTGTAACGGACAGCAACTTGATGCGCCATGT
TCACGAGTTGGTCATCTTTATCGGCCTCGACCATTCACAAATGCTGGAAATCATACAAAT
TATGTTTCATACAACTTTAAGCGTGTTGCAGAAGTTTGGATGGATGAATATGCTCAATAT
ATTTATAAACGTGATGAAAAGAAATGGAATGAAATTGATGTTGGTGATATTTCGCATATG
ATGAATCTTAAGAAAAAACTAAATTGCAAACCATTTAAATATTTTTTGGATGAAGTTGCT
CCTGATATGCTTGATCGATATCCTTATATTGAACCACCTTCATTTGCTAGTGGTGCTATT
CAATCAATAGCAAATCCACAATATTGTGTTGACACATTAGAAACTGAACGAGAAAAACAA
GTTGGAATTTATAGATGTCGTTCTAATCTTGTCAATCCAGGTTGGCATCAAGAATTTAGA
CTTCGAAATCATCGTGATATTTCAATTGAACATTCAAACAGTGACTGCCTTGATTTTAAT
AATAAAAGAATTCTTTATTACGCTTGTAAATTTAATCAAGAAAATCAATACTTTCGATAT
GATTTAAAAACTCAGCAAATTTATTGTGGATCAAAATGGCAAAATCAATGCATGGATATT
GATATGAGAACAAAATTACTAATTTATGCACCATGTGATGAAACTAAATTGACACAGAAA
TGGAAATGGGGATTTTTAAATGAAACAATGTTAAATGATTGGACAAATTATGGTAAACCA
ATCAATGATGAAAAGGAAATTGAAGATTTACTAAAAGAAGTTATAAATGATAAATAA

>g14846.t2 Gene=g14846 Length=600
MIFALTFAALYILITINNDPLLVDQPYIYIEPLASYYRFKHSPLKKDWHNYEFMDFEASR
VGPGENGTGVFLSGDEASLAQQIFEENKHNGLVSDKIARDRSLPDTRPPECMTRTYLSDL
PKVSIIIPFHNEILSTLTRTVHSVFNRSPPELLKEVILVNDHSDKEHCYGELEEYIATHF
DINKVRILVLTKRSGLMWARLAGARAASGDVLIFMDCHTEANINWLPPLIEPIALNYRTC
VCPYIDVINAKDYHYTGLQHGTRGVFNWQLIYQFLPLRPEDQSDPTEPFKSPVMMGCVFA
ISAKFFWELGGYDPALEIWGGEQYELSFKVWLCNGQQLDAPCSRVGHLYRPRPFTNAGNH
TNYVSYNFKRVAEVWMDEYAQYIYKRDEKKWNEIDVGDISHMMNLKKKLNCKPFKYFLDE
VAPDMLDRYPYIEPPSFASGAIQSIANPQYCVDTLETEREKQVGIYRCRSNLVNPGWHQE
FRLRNHRDISIEHSNSDCLDFNNKRILYYACKFNQENQYFRYDLKTQQIYCGSKWQNQCM
DIDMRTKLLIYAPCDETKLTQKWKWGFLNETMLNDWTNYGKPINDEKEIEDLLKEVINDK

Protein features from InterProScan

Transcript Database ID Name Start End E.value
16 g14846.t2 CDD cd02510 pp-GalNAc-T 124 422 8.25847E-154
15 g14846.t2 CDD cd00161 RICIN 440 565 7.82036E-11
9 g14846.t2 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 43 432 2.0E-151
8 g14846.t2 Gene3D G3DSA:2.80.10.50 - 433 579 2.8E-20
3 g14846.t2 PANTHER PTHR11675:SF41 POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE 10 41 577 3.0E-151
4 g14846.t2 PANTHER PTHR11675 N-ACETYLGALACTOSAMINYLTRANSFERASE 41 577 3.0E-151
2 g14846.t2 Pfam PF00535 Glycosyl transferase family 2 124 309 4.2E-29
1 g14846.t2 Pfam PF00652 Ricin-type beta-trefoil lectin domain 439 563 4.7E-17
11 g14846.t2 Phobius SIGNAL_PEPTIDE Signal peptide region 1 18 -
12 g14846.t2 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 2 -
13 g14846.t2 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 3 14 -
14 g14846.t2 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 15 18 -
10 g14846.t2 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 19 600 -
18 g14846.t2 ProSiteProfiles PS50231 Lectin domain of ricin B chain profile. 438 566 15.212
17 g14846.t2 SMART SM00458 ricin_3 439 566 3.9E-5
6 g14846.t2 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 100 424 1.97E-56
5 g14846.t2 SUPERFAMILY SSF50370 Ricin B-like lectins 432 565 4.78E-17
7 g14846.t2 SignalP_EUK SignalP-noTM SignalP-noTM 1 18 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

There are no GO annotations for this transcript.

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values