Gene loci information

Transcript annotation

  • This transcript has been annotated as Serine protease snake.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g305 g305.t2 TSS g305.t2 2635059 2635059
chr_3 g305 g305.t2 isoform g305.t2 2635100 2636456
chr_3 g305 g305.t2 exon g305.t2.exon1 2635100 2635431
chr_3 g305 g305.t2 cds g305.t2.CDS1 2635209 2635431
chr_3 g305 g305.t2 exon g305.t2.exon2 2635499 2635615
chr_3 g305 g305.t2 cds g305.t2.CDS2 2635499 2635615
chr_3 g305 g305.t2 exon g305.t2.exon3 2635683 2636240
chr_3 g305 g305.t2 cds g305.t2.CDS3 2635683 2636240
chr_3 g305 g305.t2 exon g305.t2.exon4 2636305 2636456
chr_3 g305 g305.t2 cds g305.t2.CDS4 2636305 2636456
chr_3 g305 g305.t2 TTS g305.t2 2636551 2636551

Sequences

>g305.t2 Gene=g305 Length=1159
ATGGCGATAAAATTATTGATAATATTCTTTCTTTTTCCGAACGTTATATATTCTGCTATT
AATGGTAAAAATTAAAGAAATTATTTTAATGTGAATGCTAAAATATTTAATGATTAATTT
AAAAGGAGATCAATGTGTTACAAACAAAGGTGTTAGGGGCATTTGTATGGGTTTTTACGA
ATGTCAGAGCGCAGTCAATGACTACAGATTGGGAATTTTTCCTCAATTGTGTCAAAATTC
AAACAAATATACAGTTTGCTGTCCAGGAAATAATAAAGTACCTTCAAATAATATTAATAG
AATTAATTCGCAAACTCGACCGAGCAAGCAAAAGTGTGATGAATATTCAAAGATAGCCAC
GAATGTCTCACTTTTTGGAGGTCTCTCATTAGGAACATTTCACACTGAAAAAGTTGAAAC
ATATACATGCGAATTGAGTATTTCATTAATTGTGGGTGGTGAACCTGCTAAAGCGAATGA
ATTTCCTCATATGGCTGCATTAGGATGGCGTGGATTAGATAGATATGAATTTAATTGTGG
AGCATCATTAATATCTGACAGATATGTTCTGACTGCCGCTCATTGTGAAAAATGGAATGG
TAAATTACCTGATATAGTTCGTTTAGGTGATTTAGATTTAAGATCAAATTCTGATGGTGT
GCAAGAACAAGATTTTCCTGTCGAAGCTTTCATTCGTCATGAAAATTATGATTCAAGACA
AAAGAAGAATGACATTGCTGTTGTTCGTATGAGATATAATGTAAGAGCATGGCAAAAAAA
TATAAGACCGGCATGTCTTTGGTCTCAACAAAATATAAATCAGAATCAAGTTATTGCTAC
AGGATATGGAAGTACTGGATATGCAGAAGGAAATAGCGATAAACTTTTAAAAGTTAATCT
ATTTGTCTCTAACACCTATAGATGTGGAGAAATAAATATTGATAACACTCAAATTTGTGC
AGGTGATGAAAGAAATAATAGAGATACATGCCAAGGTGACAGCGGAGGTCCACTTCAAAT
CACTCTTCCAAATAATAAGTGCACATTTTATATTGTTGGAATCACAAGTTTTGGTGAAAA
TGTTTGTGGTTATAGCAATGCAGTTTATACACGAGTGAGTGCTTATATAGATTGGATAGA
AAGAATAGTTTGGGGATAA

>g305.t2 Gene=g305 Length=349
MINLKGDQCVTNKGVRGICMGFYECQSAVNDYRLGIFPQLCQNSNKYTVCCPGNNKVPSN
NINRINSQTRPSKQKCDEYSKIATNVSLFGGLSLGTFHTEKVETYTCELSISLIVGGEPA
KANEFPHMAALGWRGLDRYEFNCGASLISDRYVLTAAHCEKWNGKLPDIVRLGDLDLRSN
SDGVQEQDFPVEAFIRHENYDSRQKKNDIAVVRMRYNVRAWQKNIRPACLWSQQNINQNQ
VIATGYGSTGYAEGNSDKLLKVNLFVSNTYRCGEINIDNTQICAGDERNNRDTCQGDSGG
PLQITLPNNKCTFYIVGITSFGENVCGYSNAVYTRVSAYIDWIERIVWG

Protein features from InterProScan

Transcript Database ID Name Start End E.value
10 g305.t2 CDD cd00190 Tryp_SPc 114 346 6.7065E-73
9 g305.t2 Gene3D G3DSA:2.40.10.10 - 114 336 4.3E-64
8 g305.t2 Gene3D G3DSA:2.40.10.10 - 126 344 4.3E-64
2 g305.t2 PANTHER PTHR24260 - 42 348 2.1E-74
3 g305.t2 PANTHER PTHR24260:SF91 FI18411P1-RELATED 42 348 2.1E-74
4 g305.t2 PRINTS PR00722 Chymotrypsin serine protease family (S1) signature 144 159 1.2E-12
6 g305.t2 PRINTS PR00722 Chymotrypsin serine protease family (S1) signature 204 218 1.2E-12
5 g305.t2 PRINTS PR00722 Chymotrypsin serine protease family (S1) signature 291 303 1.2E-12
1 g305.t2 Pfam PF00089 Trypsin 114 343 1.3E-48
11 g305.t2 ProSitePatterns PS00134 Serine proteases, trypsin family, histidine active site. 154 159 -
12 g305.t2 ProSitePatterns PS00135 Serine proteases, trypsin family, serine active site. 292 303 -
16 g305.t2 ProSiteProfiles PS51888 Clip domain profile. 8 51 7.698
15 g305.t2 ProSiteProfiles PS50240 Serine proteases, trypsin domain profile. 114 348 26.551
14 g305.t2 SMART SM00680 clip 9 52 0.0034
13 g305.t2 SMART SM00020 trypsin_2 113 343 4.7E-71
7 g305.t2 SUPERFAMILY SSF50494 Trypsin-like serine proteases 104 348 9.66E-69

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0004252 serine-type endopeptidase activity MF
GO:0006508 proteolysis BP

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values