Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative polypeptide N-acetylgalactosaminyltransferase 9.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_4 g16690 g16690.t1 isoform g16690.t1 10522986 10527895
chr_4 g16690 g16690.t1 exon g16690.t1.exon1 10522986 10523332
chr_4 g16690 g16690.t1 cds g16690.t1.CDS1 10522986 10523332
chr_4 g16690 g16690.t1 exon g16690.t1.exon2 10523445 10523679
chr_4 g16690 g16690.t1 cds g16690.t1.CDS2 10523445 10523679
chr_4 g16690 g16690.t1 exon g16690.t1.exon3 10523814 10524054
chr_4 g16690 g16690.t1 cds g16690.t1.CDS3 10523814 10524054
chr_4 g16690 g16690.t1 exon g16690.t1.exon4 10524211 10524643
chr_4 g16690 g16690.t1 cds g16690.t1.CDS4 10524211 10524643
chr_4 g16690 g16690.t1 exon g16690.t1.exon5 10525990 10526230
chr_4 g16690 g16690.t1 cds g16690.t1.CDS5 10525990 10526230
chr_4 g16690 g16690.t1 exon g16690.t1.exon6 10526319 10526391
chr_4 g16690 g16690.t1 cds g16690.t1.CDS6 10526319 10526391
chr_4 g16690 g16690.t1 exon g16690.t1.exon7 10526444 10526786
chr_4 g16690 g16690.t1 cds g16690.t1.CDS7 10526444 10526786
chr_4 g16690 g16690.t1 exon g16690.t1.exon8 10526834 10526998
chr_4 g16690 g16690.t1 cds g16690.t1.CDS8 10526834 10526998
chr_4 g16690 g16690.t1 exon g16690.t1.exon9 10527173 10527371
chr_4 g16690 g16690.t1 cds g16690.t1.CDS9 10527173 10527371
chr_4 g16690 g16690.t1 exon g16690.t1.exon10 10527427 10527545
chr_4 g16690 g16690.t1 cds g16690.t1.CDS10 10527427 10527545
chr_4 g16690 g16690.t1 exon g16690.t1.exon11 10527634 10527895
chr_4 g16690 g16690.t1 cds g16690.t1.CDS11 10527634 10527895
chr_4 g16690 g16690.t1 TSS g16690.t1 NA NA
chr_4 g16690 g16690.t1 TTS g16690.t1 NA NA

Sequences

>g16690.t1 Gene=g16690 Length=2658
ATGAATTTAGAAGAAAGTCCAAAAGCTGATATTTTCCAAATGTCACCTAACAGCTATCCA
AAATTTGATGAATTAGAAAATAATTTGAAGCATTTCTTAAAGAGTTCTATAAAGGAAGAA
GAAAATTTTCAATCTGTCATGAAAAAACCTATGAAAAAGTTTAGAAGTGAAAAAAATTAT
CGATTCAAAGAAACTTTTGTTTGGCCAAAAGTTCCTGATGAAATTCTAACATTGCATCAC
TTTTTGAATTTAACAAATCCAGGTCACATGGGAAAGCCAGTAATTTTACCATCAATTTTA
CCACCAGAAATGCAAACAAAAATAAACAAAAGTTGGGAAATTTATTCAATCAATGAATTT
GTCTCAAAATTAATTCCACTTGATCGAGATTTGCCAGATATAAGACCAGAATATTGTAGA
ACAGTTACTTATGCTGATGAACTGCCAATGACTTCAGTTATTATGGTCTTTCATAATGAA
CCACTTTCAATGATTTTGAGATCTGTTTTTGCTGTTTTTAAAAGAACTTCTGAAAGAATA
TTGAAAGAGATTGTTTTGGTTGATGATTGTAGCACACATGAAAACCTACAAGAAGAACTT
GAAAACTTCATAGCTGACTATAAAAAAATTAAACTTGTAAGACTTCCATTAAGAGTTGGA
TTGATTCGTGCAAGAATGTATGGAGCAATTAATGCCAATGGACCAGCATTAGTTTTTATG
GATGCTCATATTGAAGTGACACCAGGTTGGTTAGAACCACTACTTGATCCACTAGTAAAA
AATCCATTATCATCAACAGTTCCAGTGGTTGATGGTTTGGATGCTAATACACTTGAATAT
AAATATAATGACAATCCAGAAACTTATATGATTGGTGGTTTTAAATGGAATTTAATTTAT
GAATGGATTCCAATAAGAGAAGAAGAAAAGAAACGGATGGGACATTCATGTGCACCTATT
AGAACACCAACAATGCTTGGTGCTTTTTTTGTTATTATGAAAGATAATTTTATTCGTGTT
GGCATGTATGATGAACAATATGAAACTTGGGGTGCTGAAAATTTAGAATTGAGTTTTAAA
GTTTGGATGTGTGGTGGTGAATTATACCAAGTACCATGCAGTCATGCAGCTCATATGTTT
AGAAAACATCATCCTTATACTTTCCCTGGAGGAGGAAATGAAGTTCGTCGAAATACAGAT
CGATTAGCTCATGTTTGGTTGCAGGATTACAAACGATTTTATTATCGAGCAACAAATTTT
AAAAGCCGTGATTTTGGTGACATCAGCAAGCAAGTTAAATTGAGAGAAAGCATCGGTTGC
AAAAGTTTTAAATGGTATATTGAAAATGTTTATCCAGATGTTCAATTTGATGATCGTGTC
ACTGATCCACCACCTGCAGCAGAACTTTGGTCAGAATACAACCAAAATGTGAAGAAACAA
AATAAAATTTCTTATATGTGGGCTTACATATTTTGCATAATTGACATAACTGATCCAAAA
TACATCAAAACTCTTTTCAATTCACATCAACCTCATGCTAAACTTCTAACATTTCCATAT
GGACTTCTTTCACTTGATGGCAATGAAGGACTACGAAGAAGAAAAATTTTAAATCCTTCA
TTTGGAACTTTAAATGTGAGAAAATTAATTCCTGAAATTCAAGAAAAATGTGACAATTTA
GTGAAAGTTCTGGAAAAAAATTATATTGGAAGAATTTTTAGTATTGCAAAAATTTTGGCT
GCATTTTCACTTGATGTGTTTCTGAAAACTTCATTGAGAATTGAAAGAGATTTTACGAGT
GATCTGAACAATGAGATTTTTGATATTTTTAGAGAAGAAACTTATCGAATTAACCGTGTA
TTCTACAATATTCCTCGACTTTTGAATTCAAATTTCTGGATCCAAACAAAGTTAGACAAA
TATGTTGAAGATTTAATGCCAGAAATTTTTAAAAATGCTAAAATACATGAAAAACATCAT
GAAGGTCATTTCACATTCTTTGAATCTCTGCTTAAAAGTGAAGATAAAATAAAAATGCAA
GAAATACAAAAAGAAATTAAAGGTTTAATTATAGAAGGCACAGGACCGGTTGCTTTCACA
TCATCAACAATTTTTTTGATGTTAGCAATGTTTCCAGAAGTTGAAAGAAAAGTTTATGAG
GAAATAATGCAAAACTGCCCAAATAATGACATAGATATGAAATCTATTGGAAATTTGAAA
TATTTAGAATGTGTCATGAATGAAAGCATGAGATTATTTCCAATTAATTTAACTTATGTA
AGAAAAGTTGAAAAAGATTTGAAACTTGATGACAATATAATTTTACCACGAAATTCCTAT
GCTTTTATTCGTACAAAAGATGTTCATCGTGATCCAACAATATGGGGTGATGACGCCAAT
GAATTCAAACCTGAAAGATTCTTGCCAGAAAATATCAAAAAAGTTCATCCCTTTGCTTTT
CATCCGTTTGTTCTTGGACCTCGAATGTGTATTGGATTTTATTATGCAAAATTATTGATG
AAAATTTTGATTACAAATGTCGTGAGAAATTTTCATCTTAAAACAAAGTTGAAATTTAAT
GAACTTGATTTTAAGATAGCAATTTTGTTGAGATTAAAACAAAGTTATATAATTGAGTTG
GAAAAGAGAATGAAATAA

>g16690.t1 Gene=g16690 Length=885
MNLEESPKADIFQMSPNSYPKFDELENNLKHFLKSSIKEEENFQSVMKKPMKKFRSEKNY
RFKETFVWPKVPDEILTLHHFLNLTNPGHMGKPVILPSILPPEMQTKINKSWEIYSINEF
VSKLIPLDRDLPDIRPEYCRTVTYADELPMTSVIMVFHNEPLSMILRSVFAVFKRTSERI
LKEIVLVDDCSTHENLQEELENFIADYKKIKLVRLPLRVGLIRARMYGAINANGPALVFM
DAHIEVTPGWLEPLLDPLVKNPLSSTVPVVDGLDANTLEYKYNDNPETYMIGGFKWNLIY
EWIPIREEEKKRMGHSCAPIRTPTMLGAFFVIMKDNFIRVGMYDEQYETWGAENLELSFK
VWMCGGELYQVPCSHAAHMFRKHHPYTFPGGGNEVRRNTDRLAHVWLQDYKRFYYRATNF
KSRDFGDISKQVKLRESIGCKSFKWYIENVYPDVQFDDRVTDPPPAAELWSEYNQNVKKQ
NKISYMWAYIFCIIDITDPKYIKTLFNSHQPHAKLLTFPYGLLSLDGNEGLRRRKILNPS
FGTLNVRKLIPEIQEKCDNLVKVLEKNYIGRIFSIAKILAAFSLDVFLKTSLRIERDFTS
DLNNEIFDIFREETYRINRVFYNIPRLLNSNFWIQTKLDKYVEDLMPEIFKNAKIHEKHH
EGHFTFFESLLKSEDKIKMQEIQKEIKGLIIEGTGPVAFTSSTIFLMLAMFPEVERKVYE
EIMQNCPNNDIDMKSIGNLKYLECVMNESMRLFPINLTYVRKVEKDLKLDDNIILPRNSY
AFIRTKDVHRDPTIWGDDANEFKPERFLPENIKKVHPFAFHPFVLGPRMCIGFYYAKLLM
KILITNVVRNFHLKTKLKFNELDFKIAILLRLKQSYIIELEKRMK

Protein features from InterProScan

Transcript Database ID Name Start End E.value
23 g16690.t1 CDD cd02510 pp-GalNAc-T 152 451 1.48549E-130
17 g16690.t1 Coils Coil Coil 193 213 -
16 g16690.t1 Coils Coil Coil 877 885 -
14 g16690.t1 Gene3D G3DSA:3.90.550.10 Spore Coat Polysaccharide Biosynthesis Protein SpsA; Chain A 73 458 8.3E-141
15 g16690.t1 Gene3D G3DSA:1.10.630.10 Cytochrome p450 462 885 4.5E-77
3 g16690.t1 PANTHER PTHR11675 N-ACETYLGALACTOSAMINYLTRANSFERASE 96 459 3.5E-142
4 g16690.t1 PANTHER PTHR11675:SF43 POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE 1 96 459 3.5E-142
7 g16690.t1 PRINTS PR00463 E-class P450 group I signature 701 727 5.1E-5
10 g16690.t1 PRINTS PR00385 P450 superfamily signature 710 723 5.9E-6
8 g16690.t1 PRINTS PR00385 P450 superfamily signature 744 755 5.9E-6
6 g16690.t1 PRINTS PR00463 E-class P450 group I signature 820 830 5.1E-5
11 g16690.t1 PRINTS PR00385 P450 superfamily signature 821 830 5.9E-6
5 g16690.t1 PRINTS PR00463 E-class P450 group I signature 830 853 5.1E-5
9 g16690.t1 PRINTS PR00385 P450 superfamily signature 830 841 5.9E-6
2 g16690.t1 Pfam PF00535 Glycosyl transferase family 2 152 327 3.9E-26
1 g16690.t1 Pfam PF00067 Cytochrome P450 468 857 2.2E-55
20 g16690.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 1 567 -
22 g16690.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 568 588 -
18 g16690.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 589 688 -
21 g16690.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 689 711 -
19 g16690.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 712 885 -
12 g16690.t1 SUPERFAMILY SSF53448 Nucleotide-diphospho-sugar transferases 129 453 4.61E-53
13 g16690.t1 SUPERFAMILY SSF48264 Cytochrome P450 469 883 9.82E-76

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0020037 heme binding MF
GO:0016705 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen MF
GO:0005506 iron ion binding MF
GO:0055114 NA NA

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below. There were no conditions that were differentially expressed