Gene loci information

Transcript annotation

  • This transcript has been annotated as Membrane-bound transcription factor site-1 protease.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g10282 g10282.t1 TSS g10282.t1 8622477 8622477
chr_1 g10282 g10282.t1 isoform g10282.t1 8622660 8625937
chr_1 g10282 g10282.t1 exon g10282.t1.exon1 8622660 8622920
chr_1 g10282 g10282.t1 cds g10282.t1.CDS1 8622660 8622920
chr_1 g10282 g10282.t1 exon g10282.t1.exon2 8622983 8623502
chr_1 g10282 g10282.t1 cds g10282.t1.CDS2 8622983 8623502
chr_1 g10282 g10282.t1 exon g10282.t1.exon3 8623563 8625689
chr_1 g10282 g10282.t1 cds g10282.t1.CDS3 8623563 8625689
chr_1 g10282 g10282.t1 exon g10282.t1.exon4 8625750 8625937
chr_1 g10282 g10282.t1 cds g10282.t1.CDS4 8625750 8625937
chr_1 g10282 g10282.t1 TTS g10282.t1 8626218 8626218

Sequences

>g10282.t1 Gene=g10282 Length=3096
ATGATGAAAAAGAGAAAAGATTTCGTAATTTTCTTTATCTTCGTAATAATTAATTTATTC
AATTGTAATACGACATTGGAAAATGAGAAAGATATTCAAGATGAAGGATTAAAATGCTGC
GAGAATGCAACAGATCAACGTTTAATTGTTGAATTTTCATCAAGTATTGTAGAACATGAA
TACATAATACACTTTAATAATTATTATAAGAAAGAAACACGAAGGAAATACATCATTTTA
GCTCTTGATAATAATTCCGAGATCAAAAATTTTACAATTGTTGAAAGACAAAATCCAGCA
AGCGACTACCCAAGTGACTTTGACGTTATTCATCTCTATGAAAAAACTCCACTAAAAGGA
CTTGATTTGCTTATTAAACATCCTCTTATAAAAAGTGTTAGTCCACAACGAATAGTACAT
AGAACGCTAAAATTTCTAAATAAAGATGATGAACAAGAAGAAGAAATAGTAGCAAATAAT
GAAAATGATGGTGATACAAACAAGGTGAATGATAATGAAGATGATATTATCAATTATATT
TCAAAGCATCTTAAACTTGATACGAGAACTGCCAATTTAAAAAATAAGCAAGGAGAATGT
TGTGTGGCTGAATTTCAAAATTTTAGACGAGGTCTATCATCATCAACTTCAATTGATACC
AGTCAACAGCAGCAGCAACAGTTACCGATAGAAAATAATGTGAATTCTCACTCTAATCGA
CGTTTATTAAGAGCAATTCCTAGGCAAATAACATCAATTTTGAAAGCAGACGCATTATGG
AGCATCGGCATCACTGGCAAAGGAGTGAAAGTTGCTGTTTTCGACACAGGTTTAAGTAAA
TCACATCCACATTTTAAGAAAATCAAAGAGAGGACTAATTGGACAAATGAAAAATCATTG
AGTGATGGGATCTCTCATGGAACTTTTGTCACAGGCATTATCGGTTCTTCAAAAGAATGT
TTAGGATTTGCACCAGATGCTGACATTCATATATATCGTGTATTTACAAGTAATCAAGTG
AGCTACACTTCTTGGTTTCTCGATGCATTCAATTATGCAATACTTAAAAAAATCAATGTC
ATTAATTTGAGCATTGGTGGTCCTGATTTTTTGGATCGTCCATTTGTTGATAAAGTACTT
GAGTTATCAGCAAATAAAGTTATAATGGTATCAGCAATAGGCAATGATGGACCATTATAT
GGAACTCTAAATAATCCCGGTGATCAAATGGATGTTATTGGTGTTGGAGGTATGAATTTT
GAAGAAAAGATTGCTAAATTTTCCTCACGCGGTATGACAACATGGGAACTACCAGAAGGC
TATGGACGATTAAAACCTGACATTGTCACATACGGATCTCAAGTGAAAGGTTCAAATTTA
AAAGGTGGCTGTAGATCACTTTCGGGAACAAGCGTTGCATCACCAGTCGTAGCAGGAGCA
GTCACTCTTATCATATCAGGTGTATTGAAAAAAATGGATTATATTAATCCATCGTCGGTA
AAGCAGGCTCTCATTGAAGGTGCAACACGATTAAATGACAATAATATGTTTGAACAAGGC
CATGGCAAATTAAATGTTCTCAAATCAATTAAAATTTTATCAGAATATCAACCTAAGATA
ACTTTGTCTCCAACTAATTTAGATATGACCGACGATTACATGTTTCCCTACAGTTCGCAA
CCTATTTATTACACAATGCAACCAATAATTGTAAATGTTACTATTTTAAATGGCATATCA
GTTACTGGTCGTGTTGCTGAAGTTCAATGGTATCCATATCAAAATGAAAATGGAAATTTA
TTAAATGTGTCAATTGGATATTCAGAAATTTTATGGCCTTGGTCTGGATGGATGAGTTTG
CATATCGTTGTCAATGAATACGGACAATATTTTGAAGGTGATGCACAAGGGCATGTTGCA
ATTACAATTGAGACTCCTTCAGCTGATGATCCTAGTGAGAAATTAAATTCAACAGTTAAC
TTTTCCATTCGATGTCGAATTATTCCTAAACCACCGAGACAAAAGAGAATTTTATGGGAT
CAATATCATAATCTTCGATATCCGCCAGGTTATCTACCAAGAGATTCTTTAAAAGTAAAA
CAAGACCCACTTGATTGGCGTTCAGATCATATTCATACAAATTTCAAAGATATGTACACA
CATTTAAGATCATCTGGTTATTATGTTGAAGTTTTAGGTCAACCTTTTACTTGTTTTAAC
GCAAGTCACTATGGAACTCTTTTGCTTGTTGACCCAGAAGAAGAGTTTTTCGATGCAGAA
ATTGAAAAATTATATGATGATGTTACTAATAAAAATTTGAGTCTTATTGTTTTTGCTGAC
TGGTACAATACAACTGTAATGAAACATATGAAATTTTATGATCAAAATACACGTCAATGG
TGGGTTCCTGATACTGGTGGTTGTAATGTTCCTGCACTCAATGAATTGTTAAATTCATTC
GGAATAGAATTTGGTGATAAAGTTTTAGAAGGATATTTTACTATGGGCGATCATCATGAC
ATGTATTATGCAAGTGGTACAAGTATCGTTAAATTTCCAAAAGCAAATCAATCAGTTTTG
ATTGAAAGAGATTTAGTTGATCAAGGATCTGAAATTTTGAAACTTCCACAAAATGGTAGT
CATCAAAAAGACCTTCCAGAAATTGATTTAAAGAATGTTCGAGTGAAACACAAAGAGGTT
ATTTTGGGAATGCAACAAACAGTGTCAACTTTAAGTTCAAGAAAGGGAGGAAGAATAGCA
GCTTATGGAGATAGTAATTGTTTAGATTCTACTCATATGGAAAAAGCATGTTTCTGGCTT
TTAGATGCACTTTTGGAATATTCAATGACGAGTCATGTAAGTGGATTATTAAAGAGTATG
AATCGAATGCAAAATATTGAGTTTACGGAGAAAACAAAGCCGCAGAGATTACTCCATAAT
AATTTACACCTGTATTCTAAAATTCTTGATTCTAATGATCGTGATACCAAACGACCACTT
CAAAAATGCCGAAAATTCAATTGGGAGTTTCCATATTTTCTAAATATAACGAGATCATTT
ATTAATGAAGATCCTATCATAACTAACAACATCTAA

>g10282.t1 Gene=g10282 Length=1031
MMKKRKDFVIFFIFVIINLFNCNTTLENEKDIQDEGLKCCENATDQRLIVEFSSSIVEHE
YIIHFNNYYKKETRRKYIILALDNNSEIKNFTIVERQNPASDYPSDFDVIHLYEKTPLKG
LDLLIKHPLIKSVSPQRIVHRTLKFLNKDDEQEEEIVANNENDGDTNKVNDNEDDIINYI
SKHLKLDTRTANLKNKQGECCVAEFQNFRRGLSSSTSIDTSQQQQQQLPIENNVNSHSNR
RLLRAIPRQITSILKADALWSIGITGKGVKVAVFDTGLSKSHPHFKKIKERTNWTNEKSL
SDGISHGTFVTGIIGSSKECLGFAPDADIHIYRVFTSNQVSYTSWFLDAFNYAILKKINV
INLSIGGPDFLDRPFVDKVLELSANKVIMVSAIGNDGPLYGTLNNPGDQMDVIGVGGMNF
EEKIAKFSSRGMTTWELPEGYGRLKPDIVTYGSQVKGSNLKGGCRSLSGTSVASPVVAGA
VTLIISGVLKKMDYINPSSVKQALIEGATRLNDNNMFEQGHGKLNVLKSIKILSEYQPKI
TLSPTNLDMTDDYMFPYSSQPIYYTMQPIIVNVTILNGISVTGRVAEVQWYPYQNENGNL
LNVSIGYSEILWPWSGWMSLHIVVNEYGQYFEGDAQGHVAITIETPSADDPSEKLNSTVN
FSIRCRIIPKPPRQKRILWDQYHNLRYPPGYLPRDSLKVKQDPLDWRSDHIHTNFKDMYT
HLRSSGYYVEVLGQPFTCFNASHYGTLLLVDPEEEFFDAEIEKLYDDVTNKNLSLIVFAD
WYNTTVMKHMKFYDQNTRQWWVPDTGGCNVPALNELLNSFGIEFGDKVLEGYFTMGDHHD
MYYASGTSIVKFPKANQSVLIERDLVDQGSEILKLPQNGSHQKDLPEIDLKNVRVKHKEV
ILGMQQTVSTLSSRKGGRIAAYGDSNCLDSTHMEKACFWLLDALLEYSMTSHVSGLLKSM
NRMQNIEFTEKTKPQRLLHNNLHLYSKILDSNDRDTKRPLQKCRKFNWEFPYFLNITRSF
INEDPIITNNI

Protein features from InterProScan

Transcript Database ID Name Start End E.value
15 g10282.t1 CDD cd07479 Peptidases_S8_SKI-1_like 260 511 1.3588E-177
9 g10282.t1 Gene3D G3DSA:3.40.50.200 - 259 540 6.7E-67
2 g10282.t1 PANTHER PTHR43806:SF7 MEMBRANE-BOUND TRANSCRIPTION FACTOR SITE-1 PROTEASE 56 936 0.0
3 g10282.t1 PANTHER PTHR43806 PEPTIDASE S8 56 936 0.0
6 g10282.t1 PRINTS PR00723 Subtilisin serine protease family (S8) signature 266 285 4.2E-15
5 g10282.t1 PRINTS PR00723 Subtilisin serine protease family (S8) signature 302 315 4.2E-15
4 g10282.t1 PRINTS PR00723 Subtilisin serine protease family (S8) signature 468 484 4.2E-15
1 g10282.t1 Pfam PF00082 Subtilase family 266 522 3.2E-43
11 g10282.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 22 -
12 g10282.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 7 -
13 g10282.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 8 17 -
14 g10282.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 18 22 -
10 g10282.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 23 1031 -
16 g10282.t1 ProSitePatterns PS00137 Serine proteases, subtilase family, histidine active site. 306 316 -
17 g10282.t1 ProSitePatterns PS00138 Serine proteases, subtilase family, serine active site. 469 479 -
18 g10282.t1 ProSiteProfiles PS51892 Serine proteases, subtilase domain profile. 247 530 32.031
7 g10282.t1 SUPERFAMILY SSF52743 Subtilisin-like 48 629 6.33E-66
8 g10282.t1 SignalP_EUK SignalP-noTM SignalP-noTM 1 22 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0004252 serine-type endopeptidase activity MF
GO:0006508 proteolysis BP
GO:0008236 serine-type peptidase activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values