Gene loci information

Transcript annotation

  • This transcript has been annotated as Polypeptide N-acetylgalactosaminyltransferase 5.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g6549 g6549.t10 TTS g6549.t10 17607799 17607799
chr_2 g6549 g6549.t10 isoform g6549.t10 17608515 17613894
chr_2 g6549 g6549.t10 exon g6549.t10.exon1 17608515 17608688
chr_2 g6549 g6549.t10 exon g6549.t10.exon2 17608761 17608929
chr_2 g6549 g6549.t10 exon g6549.t10.exon3 17609123 17609187
chr_2 g6549 g6549.t10 exon g6549.t10.exon4 17609254 17609393
chr_2 g6549 g6549.t10 exon g6549.t10.exon5 17610088 17610268
chr_2 g6549 g6549.t10 exon g6549.t10.exon6 17610399 17610493
chr_2 g6549 g6549.t10 cds g6549.t10.CDS1 17610404 17610493
chr_2 g6549 g6549.t10 exon g6549.t10.exon7 17610549 17610782
chr_2 g6549 g6549.t10 cds g6549.t10.CDS2 17610549 17610782
chr_2 g6549 g6549.t10 exon g6549.t10.exon8 17610849 17610949
chr_2 g6549 g6549.t10 cds g6549.t10.CDS3 17610849 17610949
chr_2 g6549 g6549.t10 exon g6549.t10.exon9 17611007 17611230
chr_2 g6549 g6549.t10 cds g6549.t10.CDS4 17611007 17611230
chr_2 g6549 g6549.t10 exon g6549.t10.exon10 17611334 17611677
chr_2 g6549 g6549.t10 cds g6549.t10.CDS5 17611334 17611677
chr_2 g6549 g6549.t10 exon g6549.t10.exon11 17612615 17612835
chr_2 g6549 g6549.t10 cds g6549.t10.CDS6 17612615 17612671
chr_2 g6549 g6549.t10 exon g6549.t10.exon12 17613757 17613894
chr_2 g6549 g6549.t10 TSS g6549.t10 17613887 17613887

Sequences

>g6549.t10 Gene=g6549 Length=2086
AGAGTTCAGTTTTATTTTCGTACTAGAGAGAGAGAGAACAAAAATTTAGACAAAAGTAAA
AAGTGTCTGAAAATAAAGTTAAAACTCAGTTACTGATCGGAATTAAAAGAAATATGTGGT
GCAGTTAATAATTAAAAAAAGTTTGAATGAAAACACAAAAAATTTCGTACATCAATCTGC
ACCATCAACCAAACTCAATGTCTAATTATTCGAGAATGTTTCGTGGACGTATACGTACGA
GTACTTGTAGAATTATTCTAATAACTTCTTTAGCATGGCTTCTAATAGACGTGATCATAA
TTATGAAATACACTGATGGTCTCAATGGCGGATTATTTAAAAAATCTAGAGATAATGAGG
TTCATGAAGATAAATTTATCAGTCACCATCAACTTGACGAAGATCCAATTGTTAACGATG
ATGAAATAAATACAAATTTAAATAATGCTGATAATGATTTTAACAATGATGATGACGTGA
TACACATATCGAAAACATATCGATCGACAGATCTAAAGAAATGGCGGCCAGCGCCGGTGG
TGCGTGAAAATGTAGGAAAACCAGGTGAAATGGGCAAGCCGGTCAAAATGAAATCATATC
AGCAGGAAGAAATGAAAGAGAAATTCAAAGAAAATCAATTCAATTTGTTAGCAAGTGATA
TGATATGGTTGAATCGATCGCTCGCTGATGTGCGGCACAAAGACTGTCGAACTAAAACAT
ACCCAAGTAGATTGCCAACAACAAGTATAGTTATAGTTTTTCATAATGAAGCATGGAGCA
CATTGCTCAGAACAATATGGAGTGTAATTAATCGTTCACCACGACCGTTATTAAAAGAAA
TCATTCTCGTTGATGATGCAAGCGAGCGTGAATATTTAGGTGAAAAATTAGAGGAATATG
TAAAAACACTGCCTGTTCCAACATCAGTTTTACGCACCGGTAAAAGGTCTGGTCTCATTA
GAGCACGTTTATTGGGTGCTGCTGCTGTAAAAGGTCAAGTTATAACTTTTCTCGATGCTC
ATTGTGAATGTACTGAAGGTTGGTTAGAACCTTTGTTGTCGAGAATTGCTTTGGACAGAA
AAACAGTTGTTTGTCCAATAATCGATGTTATTAGTGATGAAACGTTTGAATATGTTACTG
CTTCTGATCAAACTTGGGGTGGTTTTAATTGGAAGCTCAATTTTAGATGGTATCGAGTAC
CTGCTAGAGAAATGGCAAGAAGAAATAATGATAGGACTTCACCTCTTCGAACGCCAACTA
TGGCGGGTGGTTTGTTTTCAATAGATAAAGATTATTTCTATGAAATTGGTTCATATGACG
AGGGAATGGATATTTGGGGTAAGTTTAAGTAACTTTTATTTGGCAATGCGGCGGAATTTT
GGAAATTGCTCCCTGTAGCCACGTTGGTCATGTTTTCAGAGATAAAAGTCCATATACTTT
CCCTGGCGGTGTGGCAAATATTGTACTTAAAAATGCTGCTCGTGTTGCAGCTGTTTGGCT
TGACGAATGGAAAGAATTTTACTTCGCTATGAGTCCAGGTGCACGAAAAACATCAGCTGG
AGATGTGTCTGCACGTCTTGCTCTAAGAGAAAAATTAAAGTGCAAGAGTTTCAGATGGTA
TCTTGAAAATATTTATCCTGAGAGTCAAATGCCATTAGATTATTATTTCCTCGGAGAAAT
TCGAAATGTCGAAAGTCAGAATTGTTTAGATACGATGGGTCATAAATCAGGAGAGAAAGT
TGGATCATCATATTGTCACGGACTGGGAGGTAATCAAGTTTTCGCCTACACTAAACGACA
ACAAATTATGTCAGATGATAACTGTCTTGATGCCAGTAACGCTCATGGACCAGTTAACTT
AGTTAGATGTCATGGAATGGGTGGGAATCAGGAATGGAGTTATGATGAGACTGACCTTAC
AATTAAGCATGTGAATTCAGGAAATTGTCTTACAAGAGCCACTCGAGAAGATCCTTCAAC
GCCACAACTTCGTCCATGTAACTTTTCTAAAGGTCAACAGTGGTTAATGCAATCAAAGTT
TAAGTGGCAAACAAAGCAAGGTACAGAAGATGAAGAGAGAAGATAG

>g6549.t10 Gene=g6549 Length=349
MKYTDGLNGGLFKKSRDNEVHEDKFISHHQLDEDPIVNDDEINTNLNNADNDFNNDDDVI
HISKTYRSTDLKKWRPAPVVRENVGKPGEMGKPVKMKSYQQEEMKEKFKENQFNLLASDM
IWLNRSLADVRHKDCRTKTYPSRLPTTSIVIVFHNEAWSTLLRTIWSVINRSPRPLLKEI
ILVDDASEREYLGEKLEEYVKTLPVPTSVLRTGKRSGLIRARLLGAAAVKGQVITFLDAH
CECTEGWLEPLLSRIALDRKTVVCPIIDVISDETFEYVTASDQTWGGFNWKLNFRWYRVP
AREMARRNNDRTSPLRTPTMAGGLFSIDKDYFYEIGSYDEGMDIWGKFK

Protein features from InterProScan

Transcript Database ID Name Start End E.value
5 g6549.t10 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 69 348 0
2 g6549.t10 PANTHER PTHR11675 N-ACETYLGALACTOSAMINYLTRANSFERASE 70 346 0
3 g6549.t10 PANTHER PTHR11675:SF101 POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE 1 70 346 0
1 g6549.t10 Pfam PF00535 Glycosyl transferase family 2 148 332 0
4 g6549.t10 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 124 345 0

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

There are no GO annotations for this transcript.

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values