Gene loci information

Transcript annotation

  • This transcript has been annotated as Serine protease inhibitor 77Ba.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g12641 g12641.t1 TSS g12641.t1 24968083 24968083
chr_1 g12641 g12641.t1 isoform g12641.t1 24968260 24971610
chr_1 g12641 g12641.t1 exon g12641.t1.exon1 24968260 24968500
chr_1 g12641 g12641.t1 cds g12641.t1.CDS1 24968260 24968500
chr_1 g12641 g12641.t1 exon g12641.t1.exon2 24968558 24969163
chr_1 g12641 g12641.t1 cds g12641.t1.CDS2 24968558 24969163
chr_1 g12641 g12641.t1 exon g12641.t1.exon3 24969225 24969274
chr_1 g12641 g12641.t1 cds g12641.t1.CDS3 24969225 24969274
chr_1 g12641 g12641.t1 exon g12641.t1.exon4 24970017 24970330
chr_1 g12641 g12641.t1 cds g12641.t1.CDS4 24970017 24970330
chr_1 g12641 g12641.t1 exon g12641.t1.exon5 24970398 24970626
chr_1 g12641 g12641.t1 cds g12641.t1.CDS5 24970398 24970626
chr_1 g12641 g12641.t1 exon g12641.t1.exon6 24970686 24971141
chr_1 g12641 g12641.t1 cds g12641.t1.CDS6 24970686 24971141
chr_1 g12641 g12641.t1 exon g12641.t1.exon7 24971198 24971356
chr_1 g12641 g12641.t1 cds g12641.t1.CDS7 24971198 24971356
chr_1 g12641 g12641.t1 exon g12641.t1.exon8 24971515 24971610
chr_1 g12641 g12641.t1 cds g12641.t1.CDS8 24971515 24971610
chr_1 g12641 g12641.t1 TTS g12641.t1 NA NA

Sequences

>g12641.t1 Gene=g12641 Length=2151
ATGAAGGGAAATTTTTACTTCATTTTCTTATTATTATTTTTTGTGAAGTCTTCAGTATTT
TCAATTCGACGTATAAATTCTTGTGGTGTTTCGAAATTTGAAAGAAATAAAAGTTTTATT
GGGCTTGTAGTTAATGGTGATGAAAGTAAACCAGGTGAATGGCCATGGATTGTTTCAATG
TTTAGAAAAGAAAAATATTTTTGTGGTTCATCTTTGATTTCTGATTTGCATTTGTTATCT
GCTGCTCATTGTTTCGAATATTTTGGAATCACTTTTCAACTTAACGATTATTTTGCTCTT
CTTGGTCGATTCAATTTGAAAGACAACAACGAGAAGTTTTCAGTCAATCGAAGTTTTTCT
TCAATTTTTCTTCATTCTGAGTTTAATAATACAGCTGAAGCTTATCGTAGCAATGCTGAC
ATTGCAGTTATTCAAATGTCAGAAAGAGTTCAATTTTCAGATTTTATACAACCAGTTTGT
CTTCCTGAGCCAAATTCTAATTTAGAAAATTTTGATGGAATTGTTGTTGGTTATGGCAAA
AGTGAGTCTGGTGAAATTCATGAAGTGACACCAAAGCAAGCTGAATTGCATTCAATTAAT
GTTTTTGAGTGTTTATTAAAAGATCCTCTGTATAGTTACATTGTGGCTGAAAGAAGTTTT
TGTGCAGGTGGCACTGATGCGATTCCATGTTTGGGAGATTCTGGTGGAGGATTTTATGTA
AAAAATAACAAAAGTGGAAAATACGAAACGAAAGGAATTGTGTCACAAGCACAACATAAT
GGTTGTGATCCAAAAGTTTATGTTGCTTTCGTTGATGTTACCAAGTTTATTGACTGGATT
ATTGAAAAAATGAATGAAATATCTAAACCAGAAGAAAAGAAACGCATTAAAACTGAAATG
AGCTCTGCTGATCGCAAAACAACAAAATTACCTCTTATTTTGCCTACTTCTGTAGATACT
AATATTCGAAAATTATTATGGGATCAAGAATTGTCAGAAAATACTGAACTATTTGGATTG
CACTTATTTTTGTATTTGACTCAATTTGAGTCTGAAAATTTCATGATTAATCCATATTTA
ATTCATTCACTTTTAGCAGTCCTTGCTGAAGGTGCAGTTGGTAATACATATAAAGAAATA
AACAATGCTCTTGGTTTAATAAATAGACAACGAACAAGAGATTTCCATCAATATACTAAC
TTAGCTTTAAGTAAAAGTGCGTCAGATGTGAACTTTCGTAAATTTGCTGCTATGATAGGT
GATCAAAATCGACCGATCACACGCGAATATGAAGACAATTTAGAAAAAATTTATGATGTA
GAATACATTCCTGTAAATTTCAAAAATGTAGACAGAACTTTACGTGAAGTAAATGGCAGA
GTTTCACAAAGTACAAGTGGGTTAATAACTGACGTCATCTCACGTGAAGATGTACTTAAG
ACTCAGTTAATACTGTTAGCATGTACTTATTTCAAAGGAAGCTGGAAAATTGCCTTCAAT
TCGACATTAACAAAGTATGAACCATTTTATGATATTAATGAGGAGAGAATAATTGGAAGA
GTTAATATGATGTCTCAAAGTGGTTCATTTGCTTATGCTTTTAATCAAAATCTTGGCTGT
TATTTTCTTGAATTACCATATGGAATCGACCGTGAATATGCAAAAAGAATTAATTTGCCA
GAAAGCGCTGAAGATCGAATTTCAATGATTGTTGTTTTACCAAAACGTGGTCTTTCATTG
ATTGATGTCATTAATAACATTAGCGTTTATGGAATAAAGACCTTGCTAATAGAATTGAAA
AAAAGTAAAGAAGAAAATAAAAACTTAGAAGTTGAAGTTCATCTTCCACGTTTTGAAATA
AACACATCACTCAATTTAAAGGAGACTTTAAAAGATTTGGAAATAAATGAGATTTTTGAG
AATACAGCAAATTTGTCAGGAATCAATTATAACTACTATGTTTCGTCGATTCTTCACAAG
ACAAAAATTAATGTAGATGAAAAAGGATCTGAAGCACCAACAGTGACAAATTTAATTTCT
GCTAACAAAGTCAAGTTCTATGCTAACAGACCTTTTATTTACTTTATTCTTGACAAAGCC
ACGAGACTCATTCTTTTTGCTGGCGTGTTTAAAAATCCTGCAGTTTTTTAA

>g12641.t1 Gene=g12641 Length=716
MKGNFYFIFLLLFFVKSSVFSIRRINSCGVSKFERNKSFIGLVVNGDESKPGEWPWIVSM
FRKEKYFCGSSLISDLHLLSAAHCFEYFGITFQLNDYFALLGRFNLKDNNEKFSVNRSFS
SIFLHSEFNNTAEAYRSNADIAVIQMSERVQFSDFIQPVCLPEPNSNLENFDGIVVGYGK
SESGEIHEVTPKQAELHSINVFECLLKDPLYSYIVAERSFCAGGTDAIPCLGDSGGGFYV
KNNKSGKYETKGIVSQAQHNGCDPKVYVAFVDVTKFIDWIIEKMNEISKPEEKKRIKTEM
SSADRKTTKLPLILPTSVDTNIRKLLWDQELSENTELFGLHLFLYLTQFESENFMINPYL
IHSLLAVLAEGAVGNTYKEINNALGLINRQRTRDFHQYTNLALSKSASDVNFRKFAAMIG
DQNRPITREYEDNLEKIYDVEYIPVNFKNVDRTLREVNGRVSQSTSGLITDVISREDVLK
TQLILLACTYFKGSWKIAFNSTLTKYEPFYDINEERIIGRVNMMSQSGSFAYAFNQNLGC
YFLELPYGIDREYAKRINLPESAEDRISMIVVLPKRGLSLIDVINNISVYGIKTLLIELK
KSKEENKNLEVEVHLPRFEINTSLNLKETLKDLEINEIFENTANLSGINYNYYVSSILHK
TKINVDEKGSEAPTVTNLISANKVKFYANRPFIYFILDKATRLILFAGVFKNPAVF

Protein features from InterProScan

Transcript Database ID Name Start End E.value
21 g12641.t1 CDD cd00190 Tryp_SPc 43 280 3.38118E-49
22 g12641.t1 CDD cd19598 serpin77Ba-like_insects 331 713 1.61739E-131
15 g12641.t1 Coils Coil Coil 592 612 -
13 g12641.t1 Gene3D G3DSA:2.40.10.10 - 45 274 8.1E-50
14 g12641.t1 Gene3D G3DSA:2.40.10.10 - 55 280 8.1E-50
11 g12641.t1 Gene3D G3DSA:2.30.39.10 - 351 713 1.6E-83
12 g12641.t1 Gene3D G3DSA:3.30.497.10 Antithrombin 358 669 1.6E-83
3 g12641.t1 PANTHER PTHR11461:SF162 SERINE PROTEASE INHIBITOR 77BA 331 714 6.9E-81
4 g12641.t1 PANTHER PTHR11461 SERINE PROTEASE INHIBITOR, SERPIN 331 714 6.9E-81
7 g12641.t1 PRINTS PR00722 Chymotrypsin serine protease family (S1) signature 69 84 1.8E-5
6 g12641.t1 PRINTS PR00722 Chymotrypsin serine protease family (S1) signature 136 150 1.8E-5
5 g12641.t1 PRINTS PR00722 Chymotrypsin serine protease family (S1) signature 227 239 1.8E-5
1 g12641.t1 Pfam PF00089 Trypsin 43 280 2.5E-38
2 g12641.t1 Pfam PF00079 Serpin (serine protease inhibitor) 337 713 2.5E-79
17 g12641.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 21 -
18 g12641.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 4 -
19 g12641.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 5 15 -
20 g12641.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 16 21 -
16 g12641.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 22 716 -
25 g12641.t1 ProSitePatterns PS00134 Serine proteases, trypsin family, histidine active site. 79 84 -
26 g12641.t1 ProSitePatterns PS00284 Serpins signature. 686 696 -
27 g12641.t1 ProSiteProfiles PS50240 Serine proteases, trypsin domain profile. 43 285 23.9
23 g12641.t1 SMART SM00020 trypsin_2 42 280 1.2E-45
24 g12641.t1 SMART SM00093 serpin2 340 713 4.2E-52
8 g12641.t1 SUPERFAMILY SSF50494 Trypsin-like serine proteases 7 285 1.67E-58
9 g12641.t1 SUPERFAMILY SSF56574 Serpins 320 713 3.14E-82
10 g12641.t1 SignalP_EUK SignalP-noTM SignalP-noTM 1 21 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0004252 serine-type endopeptidase activity MF
GO:0005615 extracellular space CC
GO:0006508 proteolysis BP

KEGG

Orthology

Pathway

This gene does not belong to any pathways.

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below. There were no conditions that were differentially expressed