Gene loci information

Transcript annotation

  • This transcript has been annotated as Venom dipeptidyl peptidase 4.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g5015 g5015.t1 TTS g5015.t1 6467411 6467411
chr_2 g5015 g5015.t1 isoform g5015.t1 6467883 6470734
chr_2 g5015 g5015.t1 exon g5015.t1.exon1 6467883 6468280
chr_2 g5015 g5015.t1 cds g5015.t1.CDS1 6467883 6468280
chr_2 g5015 g5015.t1 exon g5015.t1.exon2 6468353 6468557
chr_2 g5015 g5015.t1 cds g5015.t1.CDS2 6468353 6468557
chr_2 g5015 g5015.t1 exon g5015.t1.exon3 6468618 6469086
chr_2 g5015 g5015.t1 cds g5015.t1.CDS3 6468618 6469086
chr_2 g5015 g5015.t1 exon g5015.t1.exon4 6469174 6469370
chr_2 g5015 g5015.t1 cds g5015.t1.CDS4 6469174 6469370
chr_2 g5015 g5015.t1 exon g5015.t1.exon5 6469432 6470154
chr_2 g5015 g5015.t1 cds g5015.t1.CDS5 6469432 6470154
chr_2 g5015 g5015.t1 exon g5015.t1.exon6 6470278 6470523
chr_2 g5015 g5015.t1 cds g5015.t1.CDS6 6470278 6470523
chr_2 g5015 g5015.t1 exon g5015.t1.exon7 6470732 6470734
chr_2 g5015 g5015.t1 cds g5015.t1.CDS7 6470732 6470734
chr_2 g5015 g5015.t1 TSS g5015.t1 6470866 6470866

Sequences

>g5015.t1 Gene=g5015 Length=2241
ATGCTAAAGCTTTTGATTGTTTTATTGGTTTATGTGACTGCGAGTGATGCACGAACAATA
AAAAATTTGGCAGATGGTGGTCGATCAGTTTGGTTTAATGAAACAATTCCAAGTAGTTCT
TTTGGAGCAAGTGGTTTTGGTGGCACGTGGATTTCGGGAAATGAATTTACATACAGTTCA
AATAGGAACATGATTAAGCATAATATTATTACGAGAGTGAATGAAACATTACTTGATAGC
ACTTTTTCGGCAAGATGGAGTGGAGCATCATTAACATTATCTCCAGGAAGAACAAAAGTT
TTAGTTCGATATGCACAACGATCAATTTTTAGACATAGCACAGTTGCAAAGTTTTCAGTG
GTTTTTATGGACAATTTAGAAGTTGAATATAAGATTAGTGATGGTGATGAGATTCAATTA
TGCAAATTTTCACCAAATGATGATGGACTTATTTATATTAAACAAAATACAATTTATCAT
GCTACTGATGAACAAATTAAGGCAGGAAGATATTCAGTACAGTCACCACCCGGAGAAGAT
GGAGTTTTATATTTTGGAATTCCAGATTGGGTTTATGAAGAAGAAGTTTTAGGAAGTGAT
GCAGCAGCTTGGTTTTCAAAAGATGGAAAATATCTCGCTTATGCATTTTTTGATGATACA
AATGTAAAAGATTTCATGTATGAGTTATATGGAGATGGTAAAAATGAATATCAATATCCA
AAAGAAGTTCATTTAAGATATCCAAAAGTTGGTGAAGCAAATCCGAGAGTATCACTTTTA
GTCAATGATCTTACTGATTCATTTGGAACACCATTTTCACTGAATTATCCAGAAAAAGTA
ACAACTGATCATATTCTTGGAACAGTCTTTTGGATTACTGATACAAAATTAGGAGCAATT
TGGCTCAATCGTCGCCAAAATTATGGAGTTTTCGTTTCATATTTAACTGATGATAATTGG
AGAATGGAAGAGATTATGGAAATAAACGAACCAAATGGATGGATTGAATTAAACACGCCA
AAGTGTGAAACAACAGGCATCTGTTACTTTATCAATAATGCTGATAATTGGCCATCATTA
ACACGAGTCGACACAAATCAAAAAACCAGCAACGTACTTACGAAAGATAAACATGTCTTA
TCTTATTATGGATCTCGTAATGGAAAGCTTTACTATTTATCTACACCTGGATTATTAAAA
TCATTCGATCGACACCTATATGTCTATGAAAATGATGAAGAGAAATGCTTAACATGTGAA
TTGAAGACACCAGAAGGCAATACTTGTACTTATGTCTCAGCTTCTTTCAGCACTGACCTA
TCTTACTATTCTCTAAGTTGTTCTGGCCCTGATCCATCATTCACAAAAATTTATCAAACT
GATGATCCCAATGCTGAAGCTATTACATGGCAAGAAAATACAGCTTTAAGAACAAAACTT
CGCGATTATGCATTACCGAGAACTGAAATTTTCCATGTTCCAGTTGGTGAATTTAAAGCT
GCCGTTAAAATGCTAATACCACCAGAGATTGATTTTGATAATCAAGACTCGGTAAAGGAA
ATGTATCCAATGCTTGTTCGTGTTTATGGAGGCCCCGGATCAGTTAGAGTTGCAAGCAGT
TTTAGTATTGGTTATCAAACATATCAAGTAACAACAAAGAAAATAATTTATGTTGAAATT
GATGGTCGTGGTACAGGTCAAAAAGGAGTGGATATGATGTTTTCAATCAACGATCGTCTT
GGTACATACGAAATGGAAGATCAAATATCTGTTACAAAACATCTGATTGCAACATACAAG
TTTATTGACCCAAAACGAGTTTCCATATGGGGTTGGAGTTATGGTGGATATGCAACAGCA
ATGACATTGGCAAAAGATACTGAAAATGTATTTAAATGTGGTGTTAGTGTTGCACCTGTT
ACATCATGGATATTTTACGATACAATCTATACAGAAAGATACATGGGAATTAACCCCTAT
TCAAAGCAATATAATGAGAGCGATGTTTCGAATGCAAATCATATTGAAATCATTGGAAAG
CACGATTTCTTGCTTGTTCATGGAAATGCTGATGACAATGTTCACTATCAGCAATCAATG
GTACTTGCTGGAGCTCTCGAGAAAGCAGATATAATGTTTGATCAAATGAGCTATCCTGAT
GAAGCACATTCTCTTTCTGGTGTTTCTCTTCATCTCTATCATACACTTGATAGATTTTGG
GATAAGTGTTTTAACTTATAG

>g5015.t1 Gene=g5015 Length=746
MLKLLIVLLVYVTASDARTIKNLADGGRSVWFNETIPSSSFGASGFGGTWISGNEFTYSS
NRNMIKHNIITRVNETLLDSTFSARWSGASLTLSPGRTKVLVRYAQRSIFRHSTVAKFSV
VFMDNLEVEYKISDGDEIQLCKFSPNDDGLIYIKQNTIYHATDEQIKAGRYSVQSPPGED
GVLYFGIPDWVYEEEVLGSDAAAWFSKDGKYLAYAFFDDTNVKDFMYELYGDGKNEYQYP
KEVHLRYPKVGEANPRVSLLVNDLTDSFGTPFSLNYPEKVTTDHILGTVFWITDTKLGAI
WLNRRQNYGVFVSYLTDDNWRMEEIMEINEPNGWIELNTPKCETTGICYFINNADNWPSL
TRVDTNQKTSNVLTKDKHVLSYYGSRNGKLYYLSTPGLLKSFDRHLYVYENDEEKCLTCE
LKTPEGNTCTYVSASFSTDLSYYSLSCSGPDPSFTKIYQTDDPNAEAITWQENTALRTKL
RDYALPRTEIFHVPVGEFKAAVKMLIPPEIDFDNQDSVKEMYPMLVRVYGGPGSVRVASS
FSIGYQTYQVTTKKIIYVEIDGRGTGQKGVDMMFSINDRLGTYEMEDQISVTKHLIATYK
FIDPKRVSIWGWSYGGYATAMTLAKDTENVFKCGVSVAPVTSWIFYDTIYTERYMGINPY
SKQYNESDVSNANHIEIIGKHDFLLVHGNADDNVHYQQSMVLAGALEKADIMFDQMSYPD
EAHSLSGVSLHLYHTLDRFWDKCFNL

Protein features from InterProScan

Transcript Database ID Name Start End E.value
8 g5015.t1 Gene3D G3DSA:2.140.10.30 - 75 483 2.0E-189
9 g5015.t1 Gene3D G3DSA:3.40.50.1820 - 484 741 2.0E-189
3 g5015.t1 PANTHER PTHR11731 PROTEASE FAMILY S9B,C DIPEPTIDYL-PEPTIDASE IV-RELATED 6 746 2.3E-237
4 g5015.t1 PANTHER PTHR11731:SF154 VENOM DIPEPTIDYL PEPTIDASE 4-LIKE PROTEIN 6 746 2.3E-237
1 g5015.t1 Pfam PF00930 Dipeptidyl peptidase IV (DPP IV) N-terminal region 94 451 4.5E-64
2 g5015.t1 Pfam PF00326 Prolyl oligopeptidase family 546 745 4.1E-48
11 g5015.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 17 -
12 g5015.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 3 -
13 g5015.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 4 12 -
14 g5015.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 13 17 -
10 g5015.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 18 746 -
6 g5015.t1 SUPERFAMILY SSF82171 DPP6 N-terminal domain-like 31 483 4.84E-32
5 g5015.t1 SUPERFAMILY SSF53474 alpha/beta-Hydrolases 497 745 1.07E-51
7 g5015.t1 SignalP_EUK SignalP-noTM SignalP-noTM 1 17 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

Data is missing for g5015/g5015.t1; file /home/yuki.yoshida/nias/analysis/reanalysis/18_revice/midgebase/iupred3/g5015.t1.fa.iupred3.txt does not exist

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0006508 proteolysis BP
GO:0008236 serine-type peptidase activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values