Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative Enteropeptidase.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_4 g17252 g17252.t3 TTS g17252.t3 12947986 12947986
chr_4 g17252 g17252.t3 isoform g17252.t3 12948180 12950963
chr_4 g17252 g17252.t3 exon g17252.t3.exon1 12948180 12949072
chr_4 g17252 g17252.t3 cds g17252.t3.CDS1 12948180 12949072
chr_4 g17252 g17252.t3 exon g17252.t3.exon2 12949128 12949430
chr_4 g17252 g17252.t3 cds g17252.t3.CDS2 12949128 12949430
chr_4 g17252 g17252.t3 exon g17252.t3.exon3 12949487 12950086
chr_4 g17252 g17252.t3 cds g17252.t3.CDS3 12949487 12950086
chr_4 g17252 g17252.t3 exon g17252.t3.exon4 12950196 12950431
chr_4 g17252 g17252.t3 cds g17252.t3.CDS4 12950196 12950418
chr_4 g17252 g17252.t3 exon g17252.t3.exon5 12950900 12950963
chr_4 g17252 g17252.t3 TSS g17252.t3 12950974 12950974

Sequences

>g17252.t3 Gene=g17252 Length=2096
AACTTTCATCAAAAGTGAACGTAATTTTTTCTCTCTTCAGTTTTTTACTTGACGGAATAA
TTTGGTTTCATTTCAACATGATTAAATTTTGGCTTTTTTGGATATTTTTAATTTTTACAA
TTCCTTTTGTAATCTCAAATGAAGTCATTGTTACAATTGCCTTTAATGATGACTATAATG
GTGTTAAAGAAAGTTCACCAGGTGAATTTCCATTTCATGCTGCAATATATAGAAGTGACC
ACAGAGGTCTTAATTATACATGCGGTGCATCATTGATAACTGAATACAATCTTTTAACGG
CTGCTCATTGTTTTGAATACAGAGGTTTTTTCATAGAAAGTTATTCTGCAGTTCTTGGAG
TTTATGATTTGACCGACTTAGCAGAAGAATCAAGAGTTTACAGAAAATTTTCATCAATTG
ATATTCATGATGAATTTCAACTTATATCAAATGATGCTCTTAGCAATGCTGACATAGCTG
TTGTGACAATGAACAAAAAAGTTAAATTTTCTGATTTCATAAAACCTGTCAAACTTCCAT
CACCATACATGCAAATAATTGGTCTTATTGGGACAATTTTCGGTTATGGTCAAACATCCT
CAGGTTCACAACATACAAACAAACCACAAAAAGGTGAACTAAAAATTATTAGCAGTCATG
AATGTCAGAAGAAAGGTCAAGATTATCCTGGGATTTTATCATCAAGAAGTTTCTGTGCTG
GTGATAGTTTTGCAACTCCTTGTCAAGGCGATTCTGGTGGTGGATTAGTTATAAAAAATG
ATGAAGGTGAATTTGAAATTATTGGAATTGTTTCACATAGACTTTCTAAAAATGAATGCA
GTCCAAATGATTATGCAGTTTTTGTTGATGTCTCAAGATTTACAGTTTGGATTCAATCAA
AACTTCATCCTTGCTTTTCATCACCTTGTGGACAAAATGGAAAATGCATAAATATTGATG
ACAGAGTCACTTGTGTTTGTCACGATGATTTTATTGGAATTTATCCAAATTGTGAAAAAA
TTGAATGTTGGAAAGTTACAGATTGTGCTGATGATCGAGCTTGTCTAAATAATAAATGTT
TTGATTTATGTGCTTTAGAAAATTGTGGACCAAATTCTGTTTGCCTCATGGAAGACCATA
AGAAAATTTGTGAATGTCAATTGAACTTTAAACTGGAAAAAAATCAATGCACATTAAAAT
CAAATCTTCTCGATGGCAGCTTGTTTAGTTTTAACATACCAAAAAATTGTAATTTTGGAG
AATGTGGATCTTTAAATTATCTCACTTTTTTCGTCTCTTCAAATACGAAAATTTTAATAA
CTGATGCAGAAGTAATGTCAAAACATCAAAATGATAAAATTGATGGTTTTCGAATTTATT
GTGATGATTGCGATAAAAAGGGAAAACAAGACATTTTTCATCTTCCAATCAATGTCATTA
AAATTTTTCCAAATTTACTTGCTTATTTTGCACATCATACAAAAGTTAAAAAATTAACTC
GTGAAAATTTCAAGGAAATGAAGAAATTAGATCAATTGAATCTTCATACTAATGAAATTG
AAGAAATTGATGATGATGCATTTGATGATTTGATAAAACTTGAATATCTTAATCTGTCTG
AAAATAATTTGAAAATTTTGAATCCAAGGATTTTTTTGGCTTTGAGAAATTTGAAATATT
TTCATGTGTCTAAAAATCAACTGACTTCATTAACTGCTCATGTTTTTGAAAATTTGGTTA
ATCTTAAACATTTTATACTCAACGAGAATTACATAAAATCTTTTCCATCTGGAATTTTTG
ATAGATTAACTAACCTTTCAATCATTGATACTCATAACAATCAATTTACAACTTTTGATG
AAAATATTTTCAAAAATAATTTCAAGCTTGAAGGAATTTATTTATGGAACAATAAAATAA
CAAAATTGAGCTACAAATTATTTGAAAATAAAGAATTTTTAAAACGAGTTGATTTAAGTA
GAAATATTTGCATTAATTTACTTTTGTATGTCAATAATGATAAAGGTGAACACATCAATT
TACCAGAAGAAAAAAGAAATGAATTGAAAAATTTACTTATCGGAAAATGTTCATAA

>g17252.t3 Gene=g17252 Length=672
MIKFWLFWIFLIFTIPFVISNEVIVTIAFNDDYNGVKESSPGEFPFHAAIYRSDHRGLNY
TCGASLITEYNLLTAAHCFEYRGFFIESYSAVLGVYDLTDLAEESRVYRKFSSIDIHDEF
QLISNDALSNADIAVVTMNKKVKFSDFIKPVKLPSPYMQIIGLIGTIFGYGQTSSGSQHT
NKPQKGELKIISSHECQKKGQDYPGILSSRSFCAGDSFATPCQGDSGGGLVIKNDEGEFE
IIGIVSHRLSKNECSPNDYAVFVDVSRFTVWIQSKLHPCFSSPCGQNGKCINIDDRVTCV
CHDDFIGIYPNCEKIECWKVTDCADDRACLNNKCFDLCALENCGPNSVCLMEDHKKICEC
QLNFKLEKNQCTLKSNLLDGSLFSFNIPKNCNFGECGSLNYLTFFVSSNTKILITDAEVM
SKHQNDKIDGFRIYCDDCDKKGKQDIFHLPINVIKIFPNLLAYFAHHTKVKKLTRENFKE
MKKLDQLNLHTNEIEEIDDDAFDDLIKLEYLNLSENNLKILNPRIFLALRNLKYFHVSKN
QLTSLTAHVFENLVNLKHFILNENYIKSFPSGIFDRLTNLSIIDTHNNQFTTFDENIFKN
NFKLEGIYLWNNKITKLSYKLFENKEFLKRVDLSRNICINLLLYVNNDKGEHINLPEEKR
NELKNLLIGKCS

Protein features from InterProScan

Transcript Database ID Name Start End E.value
22 g17252.t3 CDD cd00190 Tryp_SPc 35 275 9.56265E-54
23 g17252.t3 CDD cd00053 EGF 281 313 0.0014144
15 g17252.t3 Gene3D G3DSA:2.40.10.10 - 37 266 2.1E-51
16 g17252.t3 Gene3D G3DSA:2.40.10.10 - 45 271 2.1E-51
12 g17252.t3 Gene3D G3DSA:2.10.25.10 Laminin 272 314 1.8E-6
14 g17252.t3 Gene3D G3DSA:3.80.10.10 Ribonuclease Inhibitor 434 540 4.9E-17
13 g17252.t3 Gene3D G3DSA:3.80.10.10 Ribonuclease Inhibitor 541 662 1.0E-19
3 g17252.t3 PANTHER PTHR24260 - 37 276 9.9E-48
4 g17252.t3 PANTHER PTHR24260:SF129 MODULAR SERINE PROTEASE 37 276 9.9E-48
7 g17252.t3 PRINTS PR00722 Chymotrypsin serine protease family (S1) signature 63 78 4.6E-7
5 g17252.t3 PRINTS PR00722 Chymotrypsin serine protease family (S1) signature 128 142 4.6E-7
6 g17252.t3 PRINTS PR00722 Chymotrypsin serine protease family (S1) signature 219 231 4.6E-7
2 g17252.t3 Pfam PF00089 Trypsin 35 272 3.3E-39
1 g17252.t3 Pfam PF13855 Leucine rich repeat 483 542 1.7E-13
18 g17252.t3 Phobius SIGNAL_PEPTIDE Signal peptide region 1 20 -
19 g17252.t3 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 3 -
20 g17252.t3 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 4 15 -
21 g17252.t3 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 16 20 -
17 g17252.t3 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 21 672 -
34 g17252.t3 ProSitePatterns PS00134 Serine proteases, trypsin family, histidine active site. 73 78 -
41 g17252.t3 ProSiteProfiles PS50240 Serine proteases, trypsin domain profile. 18 277 23.491
42 g17252.t3 ProSiteProfiles PS50026 EGF-like domain profile. 275 313 13.245
36 g17252.t3 ProSiteProfiles PS51450 Leucine-rich repeat profile. 483 504 7.011
40 g17252.t3 ProSiteProfiles PS51450 Leucine-rich repeat profile. 507 528 7.412
39 g17252.t3 ProSiteProfiles PS51450 Leucine-rich repeat profile. 531 552 5.756
38 g17252.t3 ProSiteProfiles PS51450 Leucine-rich repeat profile. 555 576 5.879
37 g17252.t3 ProSiteProfiles PS51450 Leucine-rich repeat profile. 579 600 4.793
35 g17252.t3 ProSiteProfiles PS51450 Leucine-rich repeat profile. 603 624 5.987
25 g17252.t3 SMART SM00020 trypsin_2 33 272 2.3E-45
33 g17252.t3 SMART SM00181 egf_5 278 313 0.0014
32 g17252.t3 SMART SM00181 egf_5 337 372 22.0
29 g17252.t3 SMART SM00369 LRR_typ_2 481 504 0.75
28 g17252.t3 SMART SM00369 LRR_typ_2 505 528 0.36
27 g17252.t3 SMART SM00369 LRR_typ_2 529 552 0.14
31 g17252.t3 SMART SM00369 LRR_typ_2 553 576 0.42
30 g17252.t3 SMART SM00369 LRR_typ_2 577 600 0.67
26 g17252.t3 SMART SM00369 LRR_typ_2 603 624 210.0
9 g17252.t3 SUPERFAMILY SSF50494 Trypsin-like serine proteases 28 276 7.07E-55
10 g17252.t3 SUPERFAMILY SSF57196 EGF/Laminin 273 309 5.08E-6
8 g17252.t3 SUPERFAMILY SSF52058 L domain-like 445 641 5.35E-36
11 g17252.t3 SignalP_EUK SignalP-TM SignalP-TM 1 20 -
24 g17252.t3 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 7 29 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0004252 serine-type endopeptidase activity MF
GO:0006508 proteolysis BP
GO:0005515 protein binding MF

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below. There were no conditions that were differentially expressed