Gene loci information

Transcript annotation

  • This transcript has been annotated as Peroxidase.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g6489 g6489.t1 TSS g6489.t1 17239431 17239431
chr_2 g6489 g6489.t1 isoform g6489.t1 17239481 17243932
chr_2 g6489 g6489.t1 exon g6489.t1.exon1 17239481 17240556
chr_2 g6489 g6489.t1 cds g6489.t1.CDS1 17239481 17240556
chr_2 g6489 g6489.t1 exon g6489.t1.exon2 17240624 17240951
chr_2 g6489 g6489.t1 cds g6489.t1.CDS2 17240624 17240951
chr_2 g6489 g6489.t1 exon g6489.t1.exon3 17241013 17241320
chr_2 g6489 g6489.t1 cds g6489.t1.CDS3 17241013 17241320
chr_2 g6489 g6489.t1 exon g6489.t1.exon4 17241379 17241436
chr_2 g6489 g6489.t1 cds g6489.t1.CDS4 17241379 17241436
chr_2 g6489 g6489.t1 TTS g6489.t1 17241574 17241574
chr_2 g6489 g6489.t1 exon g6489.t1.exon5 17241961 17242001
chr_2 g6489 g6489.t1 cds g6489.t1.CDS5 17241961 17242001
chr_2 g6489 g6489.t1 exon g6489.t1.exon6 17242383 17242767
chr_2 g6489 g6489.t1 cds g6489.t1.CDS6 17242383 17242767
chr_2 g6489 g6489.t1 exon g6489.t1.exon7 17242829 17243528
chr_2 g6489 g6489.t1 cds g6489.t1.CDS7 17242829 17243528
chr_2 g6489 g6489.t1 exon g6489.t1.exon8 17243625 17243932
chr_2 g6489 g6489.t1 cds g6489.t1.CDS8 17243625 17243932

Sequences

>g6489.t1 Gene=g6489 Length=3204
ATGAAAATTTTAATTCTTATCGCGGCGGCAGTAATAACTGCAAGTGTGGATGCAGATGAT
TGTGTTAGTCACACTTATCGAACATTAGATGGTTCATGCAACAATTTAAAACATCCTAAT
TGGGGTAAAGCTGGAACTCCATATGCTCGACTTTTGCCAGCACGTTATGGTGATGGAATT
TTCAGTCCACCAAAGTCAAAAACGGGTGCTGATTTGCCAAGCTCAAGACTTGTATCGACT
ACGATTTTTGATACTATTGATTCACCTGATCCCAATCATACAATTGTCACTATGCAATTT
GGTCAATTTGTTGCACATGATATGAGTTTTGGTGGGGCACCTATTCATCCATCATGCTGC
CAAGATGGAAAAATTGTCTCTCATGATCCATTATGTTATCCAATTATTGTTCCAAATGAC
GACCCAGTTCGCTCAGCCGATGGCATCGAATGTATGAATTTTCAACGCACATTAACCGAC
AGAGATAACGAACTTGATGAAAACCGAGCAAATCAACCTGCACAGCAGATAACAGTTGTG
ACAGGCTTTCTTGACCTTTCTCTCGTGTATGGAAACTCCGAGAAAGAATTAGCGCCCGTA
AGAGAATTTAATGCAGGACGTTTAAAGATGGATATCCGTAATGGAAAAGAATGGCCACCA
CATAACCCTGATGGTGATAAAATTTGTTTTGTTGAGACATCTGGAGAGACTTGCTACTTC
GGAGGCGACCCACGTCTAAATCAAAGTCCTGATTTATCAATTCTTCATATTTATTATATT
CGTGAGCATAATCGACTTGCAGGAATCTTACATGAAATGAATCCAAGTTGGTCAGATGAG
AAACTTTTCCAAGAAGCTCGTCGCATTAATATTGCTCAATATCAATATGTTGTTTATTAT
GAATGGTTGCCATTGTTATTGGGAGAACAAAATATGTTTAAAGCTAAATTGATTTATTAT
AAAGATGGTGGTGAATATGTTGATGACTATGATGAAAATGTTGATCCATCAGCACTCAAT
GATCATGCTGCTTCTGCCTTCCGTTATTTCCATTCTGAAATTGAAGGAAATTTAGAATTA
ATTTCAGAATCACGTGAATGCAAGAAATCAATGAAAATCAGTGATGTGTTCTTACGACCA
AAAATTTTGGAGCAAAATGATAACTTCGATTCTTTTGCACGTGGTATGGCAACTCAAAGA
TTCCAAAAACCTGACAAATACTTTGACATTGAAGTTAGAGAATTTCTTCTTAAGCATTTG
CGCAAATATGGTGATGACATTAGAGCAATTGATATTCAAAGAGGCAGAGATCATGGCATT
GCTAGTTATAACAGTTTCAGAGAATTCTGTGATCTTCCAAAAGCTACAAAATGGGAAGAT
TATTTGGATTTGATTTCTCAAGAAGATATTGATAAATTGAAATCAATTTACCCAAGCTAT
GATGACGTTGACTTGTCAGTTGGTGGAATTCTTGAAAAACGTGTTGATAAATCAACTTTA
ACTAATCCAACTTATTATTGCATTTATATGAAACAATTTTATAATACTCGTGTTGGTGAT
CGCTATTGGTTTGAAAGAAGTGATCCCGAATTTGCATTTACCACATCACAATTGGCTGAA
ATAAGAAAATCAAGCATGTCGAGAATTTTCTGTGACAATGGTAATAATATTTTGAGTATG
CAACCAAATGCCTTCGTTGTTCCATCTGAAAGCAACAAAGTCATTCCATGCACAGAAATT
CCAGCTATTGATTACACTTTATGGCGAGATTTGTTATTTGATCGTAAGAGTGTGAAGATC
AGATACTTGAGAATGTCAAATAATATCTACATTAAAAATGCATGTGTAGTAAATCACGAC
ACGATACAAGAAAATGTGTCAATTTATGTTGAGAATGGTGTCATAAAATTTATTGGTACT
GAATGTGATTTTCCGATTCCGACTAATATTGAAGTGATTGATGCAAGTGGAAAGTATGTA
ATACCCGGTGGAATTGATCCGCATACTCATTTTGAGTTAGAATTTGGTGGTACATTTGCT
GTTGATGATTTTTATCAAGGTACGTGTGCTGCAGTGGCAGGTGGAACAACCACAATCATT
GATTTTGTTATACCTAAGAAAGGACAATCGATTCTTGAAGCATATGAAATTTGGCGAAAA
AGAGCTGATTCAAAAGTTGTCTGTGACTATGGATTGCATTGTGCAATAACATGGTGGAGT
GTGGAAGTTAATAAAGATATGGAAATTTTGGCAAAAGAGAAAGGAATCAGTTCATTTAAA
ATGTTTATGGCTTATAAAGGATTATTTATGCTTGATGATAGTGAACTTTATGAAACATTT
GAACGTTGTCGAGATATAGGAGCATTAGCGCAAGTCCATGCTGAGAATGGAGACATTATT
GCGAAAAATACGAAAAAATTACTTGAAAATGGTGTTAAAGGACCAGAAGGCCATCAACTT
TCACGAACAGAAGACGTTGAAGCTGAAGCTACAAATCGAGCATGTGTCATTGCTCATCAA
ACAAATTGTCCACTTTATGTTGTTCATGTTATGTCAATTTCAGCAGCAGAAGAAGTCGCA
AGAGCGCGTGAACGATGGGGAAAGAATTTTATTTTTGGAGAAACTTTAGCAGCTGCTCTT
GGTGCAAGCGGTGAAGAATATTACGATAAATGTTGGCATCATGCAGCTGCTCATGTTTTA
TCACCTCCATTGCGACCGAGAAAAGAAACACGTGAAGTTTTAATGAAAATGCTAGCAAAT
GATGATCTTCAATCTACTGGTTCTGATAATTGTACTTTTAACAAGAAGCAGAAGGAACTT
GGACTTGATGATTTTTCAAAAATTCCTAATGGTGTAAATGGCGTGGAAGATCGAATGAGT
GTCATTTGGGAAAAAGGGGGGGGAACTGACATTTTTTGTGCAGCGAAAATATTCAATCTA
TATCCAAAGAAAGGAAGTTTGACAGTTGGAGCTGATGCTGATATTGTCATCTGGAATTAT
AAAGATACTAGAACTATTTCAGTTAAAACACATCATCATGCATGCGATTTTAATATTTTT
GAAGGAATGGTTTGTCATGGTGTACCTGAAATTGTCATTGTTGGCGGAAAAATTTGTGTC
CGTGATGGAAAATTGAGTGTTACTCCCGGATCAGGAAAATTTTTACCAAGAAATGCTTTT
AATACGTTTATTTTTAAATCATAA

>g6489.t1 Gene=g6489 Length=1067
MKILILIAAAVITASVDADDCVSHTYRTLDGSCNNLKHPNWGKAGTPYARLLPARYGDGI
FSPPKSKTGADLPSSRLVSTTIFDTIDSPDPNHTIVTMQFGQFVAHDMSFGGAPIHPSCC
QDGKIVSHDPLCYPIIVPNDDPVRSADGIECMNFQRTLTDRDNELDENRANQPAQQITVV
TGFLDLSLVYGNSEKELAPVREFNAGRLKMDIRNGKEWPPHNPDGDKICFVETSGETCYF
GGDPRLNQSPDLSILHIYYIREHNRLAGILHEMNPSWSDEKLFQEARRINIAQYQYVVYY
EWLPLLLGEQNMFKAKLIYYKDGGEYVDDYDENVDPSALNDHAASAFRYFHSEIEGNLEL
ISESRECKKSMKISDVFLRPKILEQNDNFDSFARGMATQRFQKPDKYFDIEVREFLLKHL
RKYGDDIRAIDIQRGRDHGIASYNSFREFCDLPKATKWEDYLDLISQEDIDKLKSIYPSY
DDVDLSVGGILEKRVDKSTLTNPTYYCIYMKQFYNTRVGDRYWFERSDPEFAFTTSQLAE
IRKSSMSRIFCDNGNNILSMQPNAFVVPSESNKVIPCTEIPAIDYTLWRDLLFDRKSVKI
RYLRMSNNIYIKNACVVNHDTIQENVSIYVENGVIKFIGTECDFPIPTNIEVIDASGKYV
IPGGIDPHTHFELEFGGTFAVDDFYQGTCAAVAGGTTTIIDFVIPKKGQSILEAYEIWRK
RADSKVVCDYGLHCAITWWSVEVNKDMEILAKEKGISSFKMFMAYKGLFMLDDSELYETF
ERCRDIGALAQVHAENGDIIAKNTKKLLENGVKGPEGHQLSRTEDVEAEATNRACVIAHQ
TNCPLYVVHVMSISAAEEVARARERWGKNFIFGETLAAALGASGEEYYDKCWHHAAAHVL
SPPLRPRKETREVLMKMLANDDLQSTGSDNCTFNKKQKELGLDDFSKIPNGVNGVEDRMS
VIWEKGGGTDIFCAAKIFNLYPKKGSLTVGADADIVIWNYKDTRTISVKTHHHACDFNIF
EGMVCHGVPEIVIVGGKICVRDGKLSVTPGSGKFLPRNAFNTFIFKS

Protein features from InterProScan

Transcript Database ID Name Start End E.value
27 g6489.t1 CDD cd09823 peroxinectin_like 174 553 1.39097E-171
26 g6489.t1 CDD cd01314 D-HYD 609 1052 0.0
19 g6489.t1 Gene3D G3DSA:1.10.640.10 Myeloperoxidase 9 591 1.9E-182
18 g6489.t1 Gene3D G3DSA:2.30.40.10 Urease 609 1066 2.4E-190
20 g6489.t1 Gene3D G3DSA:3.20.20.140 - 663 1026 2.4E-190
3 g6489.t1 PANTHER PTHR11475 OXIDASE/PEROXIDASE 6 589 7.3E-255
4 g6489.t1 PANTHER PTHR11475:SF86 PEROXIDASE 6 589 7.3E-255
6 g6489.t1 PRINTS PR00457 Animal haem peroxidase signature 50 61 5.4E-30
9 g6489.t1 PRINTS PR00457 Animal haem peroxidase signature 96 111 5.4E-30
10 g6489.t1 PRINTS PR00457 Animal haem peroxidase signature 239 257 5.4E-30
11 g6489.t1 PRINTS PR00457 Animal haem peroxidase signature 257 277 5.4E-30
12 g6489.t1 PRINTS PR00457 Animal haem peroxidase signature 282 308 5.4E-30
8 g6489.t1 PRINTS PR00457 Animal haem peroxidase signature 344 354 5.4E-30
5 g6489.t1 PRINTS PR00457 Animal haem peroxidase signature 466 486 5.4E-30
7 g6489.t1 PRINTS PR00457 Animal haem peroxidase signature 541 555 5.4E-30
1 g6489.t1 Pfam PF03098 Animal haem peroxidase 26 565 3.8E-170
2 g6489.t1 Pfam PF01979 Amidohydrolase family 659 1038 1.8E-22
22 g6489.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 18 -
23 g6489.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 2 -
24 g6489.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 3 13 -
25 g6489.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 14 18 -
21 g6489.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 19 1067 -
30 g6489.t1 ProSiteProfiles PS50292 Animal heme peroxidase superfamily profile. 18 592 137.452
13 g6489.t1 SUPERFAMILY SSF48113 Heme-dependent peroxidases 21 590 5.47E-167
15 g6489.t1 SUPERFAMILY SSF51338 Composite domain of metallo-dependent hydrolases 608 675 6.98E-12
14 g6489.t1 SUPERFAMILY SSF51556 Metallo-dependent hydrolases 663 985 1.43E-86
16 g6489.t1 SUPERFAMILY SSF51338 Composite domain of metallo-dependent hydrolases 973 1061 1.12E-33
17 g6489.t1 SignalP_EUK SignalP-noTM SignalP-noTM 1 18 -
28 g6489.t1 SignalP_GRAM_NEGATIVE SignalP-noTM SignalP-noTM 1 18 -
29 g6489.t1 TIGRFAM TIGR02033 D-hydantoinase: dihydropyrimidinase 610 1057 9.6E-151

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0020037 heme binding MF
GO:0042600 egg chorion CC
GO:0055114 NA NA
GO:0005737 cytoplasm CC
GO:0006979 response to oxidative stress BP
GO:0016787 hydrolase activity MF
GO:0004601 peroxidase activity MF
GO:0016810 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values