Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative cysteine proteinase CG12163.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is shown below. CDS regions are colored green. TSSs and TTSs predicted from CTR-Seq data are marked with solid circles and squares, respectively. Detailed coordinates are listed in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g4405 g4405.t23 TTS g4405.t23 2001061 2001061
chr_2 g4405 g4405.t23 isoform g4405.t23 2001441 2003271
chr_2 g4405 g4405.t23 exon g4405.t23.exon1 2001441 2001689
chr_2 g4405 g4405.t23 cds g4405.t23.CDS1 2001442 2001689
chr_2 g4405 g4405.t23 exon g4405.t23.exon2 2001757 2002007
chr_2 g4405 g4405.t23 cds g4405.t23.CDS2 2001757 2002007
chr_2 g4405 g4405.t23 exon g4405.t23.exon3 2002061 2003089
chr_2 g4405 g4405.t23 cds g4405.t23.CDS3 2002061 2003046
chr_2 g4405 g4405.t23 exon g4405.t23.exon4 2003160 2003271
chr_2 g4405 g4405.t23 TSS g4405.t23 NA NA
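
As a consistency check, the exon and CDS coordinates in the table can be summed and compared with the sequence lengths given in the FASTA headers under Sequences. The sketch below is illustrative only: the intervals are copied by hand from the table, the helper name `spliced_length` is made up for this example, and coordinates are assumed to be 1-based and inclusive.

```python
# Exon and CDS intervals for g4405.t23, copied from the table above.
# Assumption: coordinates are 1-based and inclusive.
exons = [
    (2001441, 2001689),
    (2001757, 2002007),
    (2002061, 2003089),
    (2003160, 2003271),
]
cds = [
    (2001442, 2001689),
    (2001757, 2002007),
    (2002061, 2003046),
]


def spliced_length(intervals):
    """Total number of bases covered by inclusive (start, end) intervals."""
    return sum(end - start + 1 for start, end in intervals)


transcript_len = spliced_length(exons)
cds_len = spliced_length(cds)

print(transcript_len)  # 1641
print(cds_len)         # 1485
print(cds_len // 3)    # 495
```

The exon total equals the transcript length in the nucleotide FASTA header (1641), and the CDS total is three times the protein length in the protein FASTA header (495).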

Sequences

>g4405.t23 Gene=g4405 Length=1641
GTCAAATCGGTGTTCCCGGTGGAATCAGTCCTGTAGAAAACTTTGAAGATGTAAAAATTT
ATGTTCAAGAAGCTATTGATGAAATTAATGATAATGAAGATCCTGATTACATTTTGAAAC
ATATCGTTGAAGCAACCCAACAAGTTGTTGCAGGCATGAGTTATAAAATTAAAGCAGTGT
TTTCCAGAGATGGAAGCGACATTGAATGTGATTTTGATGTATGGGAGCAAGCTTGGATTA
AAGATGGACGTAAAGTTTCAGTTTCTTGCAAAAATGATAAGAAATATAAGTTGACCCAAT
CACCATCTAATCAGCGTGTCAAACGTGATAACACGCTTGAAAGAGTTCTTGGTTTACCAT
CCAATACTGATGATCATGACGATTTGATAAAAATACTTTCTGAACATTTGAAGAGACTCG
ATACTGGAAGTGATGCACAATTTGAATTGGTAAAACTTGAAAAGGTAACTCAACAAGTAG
TAGCTGGAATAAAATATAAAGCAACAGGTATTTTTAAAATTGGCAATGAAGAGAAAAAAT
GTGTTATCGATGTATGGCATCGCTCATGGATTAAGGGAGATGAAGGCACTCAATTAAGCG
CTGATTGTGATAAAGGTGCAACAACTTTCAAGACAAAATCTTCTAGAAAAAGGAGATCAG
TTCATCACCACACACACAATCGTCACAATAGACAATCAGTAAGCGATCATTTTGATGACC
ATCATCATCATACTGATAGACATCATCATCAATACTCAGCTACTGAAGAAATGAAAGAAA
TAAAATCTGAAATTTTATTTAACAATTTCATAACTAAATATAATCGTAAATATGCCAATG
AACTTGAACATAAAATGAGAATGAGAATTTTCAAGAAGAATTTACATAAAATTGAAATGT
TGAATAAGCATGAACAAGGCACTGCAAAGTATGGAATTACAGAATTCGCTGATTTAACTG
AAAAGGAATACTTGCATAAAACTGGTTTGAGAGTGCGTGAAAGACATGAGAATGAATTAG
AAAATCCAATTGCACATATTCCAGAAGTTGAAGATTTACCAACCGAATTTGATTGGAGAG
ATAAATCAGCAGTTACAAGTGTAAAAAATCAAGGAAATTGTGGATCATGCTGGAGTTTTT
CTGTTACAGGAAATATTGAAGGCTTACATGCTATTAAAACTGGAAAACTTGAAGCTTATT
CTGAACAAGAACTTTTGGACTGTGATACAACTGATAATGCTTGCAATGGTGGTTATATGG
ATGATGCTTTTAAAGCAATTGAAAAAATTGGTGGTCTAGAATTAGAAGATGAATATCCTT
ATCAAGCAAGGAAACAAAAGAAATGCTTGTTTAATGCTACTATGAGTCATGTTAAAGTTA
AAGGTGTTGTAGATTTGCCTAAAGGTGATGAAATTGCAATGCAAAAGTTTTTAGTCTCAA
CTGGTCCGATTTCCATTGGCATAAATGCTAATGCTATGCAATTTTATCGTGGTGGTGTTT
CGCATCCATGGAAAGTTCTTTGCAGAAAATCTAATTTAGATCATGGTGTTTTGATTGTTG
GATATGGAATAAAAGAGTATCCCATGTTTAATAAAACTTTACCTTATTGGACTATTAAAA
ATTCATGGGGTCCAAAATGGG

>g4405.t23 Gene=g4405 Length=495
MSYKIKAVFSRDGSDIECDFDVWEQAWIKDGRKVSVSCKNDKKYKLTQSPSNQRVKRDNT
LERVLGLPSNTDDHDDLIKILSEHLKRLDTGSDAQFELVKLEKVTQQVVAGIKYKATGIF
KIGNEEKKCVIDVWHRSWIKGDEGTQLSADCDKGATTFKTKSSRKRRSVHHHTHNRHNRQ
SVSDHFDDHHHHTDRHHHQYSATEEMKEIKSEILFNNFITKYNRKYANELEHKMRMRIFK
KNLHKIEMLNKHEQGTAKYGITEFADLTEKEYLHKTGLRVRERHENELENPIAHIPEVED
LPTEFDWRDKSAVTSVKNQGNCGSCWSFSVTGNIEGLHAIKTGKLEAYSEQELLDCDTTD
NACNGGYMDDAFKAIEKIGGLELEDEYPYQARKQKKCLFNATMSHVKVKGVVDLPKGDEI
AMQKFLVSTGPISIGINANAMQFYRGGVSHPWKVLCRKSNLDHGVLIVGYGIKEYPMFNK
TLPYWTIKNSWGPKW

Protein features from InterProScan

Row Transcript Database ID Name Start End E-value
12 g4405.t23 CDD cd02248 Peptidase_C1A 302 495 1.75563E-87
10 g4405.t23 Gene3D G3DSA:3.10.450.10 - 60 150 6.4E-11
11 g4405.t23 Gene3D G3DSA:3.90.70.10 Cysteine proteinases 163 495 4.9E-91
18 g4405.t23 MobiDBLite mobidb-lite consensus disorder prediction 157 203 -
19 g4405.t23 MobiDBLite mobidb-lite consensus disorder prediction 160 178 -
17 g4405.t23 MobiDBLite mobidb-lite consensus disorder prediction 179 203 -
3 g4405.t23 PANTHER PTHR12411 CYSTEINE PROTEASE FAMILY C1-RELATED 169 495 9.9E-88
4 g4405.t23 PANTHER PTHR12411:SF444 CATHEPSIN F 169 495 9.9E-88
7 g4405.t23 PRINTS PR00705 Papain cysteine protease (C1) family signature 319 334 3.5E-9
6 g4405.t23 PRINTS PR00705 Papain cysteine protease (C1) family signature 463 473 3.5E-9
5 g4405.t23 PRINTS PR00705 Papain cysteine protease (C1) family signature 484 490 3.5E-9
1 g4405.t23 Pfam PF08246 Cathepsin propeptide inhibitor domain (I29) 215 272 2.7E-12
2 g4405.t23 Pfam PF00112 Papain family cysteine protease 301 495 1.3E-61
14 g4405.t23 ProSitePatterns PS00139 Eukaryotic thiol (cysteine) proteases cysteine active site. 319 330 -
13 g4405.t23 ProSitePatterns PS00639 Eukaryotic thiol (cysteine) proteases histidine active site. 461 471 -
16 g4405.t23 SMART SM00848 Inhibitor_I29_2 215 272 3.0E-18
15 g4405.t23 SMART SM00645 pept_c1 301 495 2.2E-75
8 g4405.t23 SUPERFAMILY SSF54403 Cystatin/monellin 73 140 4.25E-8
9 g4405.t23 SUPERFAMILY SSF54001 Cysteine proteinases 208 495 1.12E-89

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
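
The 0.5 threshold can be applied to a per-residue score list to extract contiguous disordered stretches. The following is a minimal sketch, not the method used by this page: the function name `disordered_regions` and the example scores are hypothetical, and the scores are not taken from this transcript.

```python
def disordered_regions(scores, threshold=0.5):
    """Return 1-based inclusive (start, end) runs where score > threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # a new disordered run begins here
        elif start is not None:
            regions.append((start, pos - 1))  # the run ended at the previous residue
            start = None
    if start is not None:
        regions.append((start, len(scores)))  # run extends to the last residue
    return regions


# Hypothetical per-residue scores, for illustration only.
example_scores = [0.2, 0.6, 0.7, 0.4, 0.3, 0.55, 0.9, 0.8]
print(disordered_regions(example_scores))  # [(2, 3), (6, 8)]
```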

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0008234 cysteine-type peptidase activity MF
GO:0006508 proteolysis BP

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean +/- standard deviation.
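
For reference, a summary of this form can be computed from replicate TPM values as sketched below. The replicate values and the function name `summarize_tpm` are hypothetical, and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the page does not state which estimator was used.

```python
import statistics


def summarize_tpm(replicates):
    """Format replicate TPM values as 'mean +/- sample standard deviation'."""
    mean = statistics.mean(replicates)
    # Assumption: sample standard deviation (n - 1 denominator).
    stdev = statistics.stdev(replicates)
    return f"{mean:.2f} +/- {stdev:.2f}"


# Hypothetical replicate TPM values for one condition.
summary = summarize_tpm([10.0, 12.0, 14.0])
print(summary)  # 12.00 +/- 2.00
```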

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was called differentially expressed when (1) FDR < 0.05 and (2) fold change > 2 (TPM calculated by RSEM). DE status and fold change between conditions are shown in the plot below.
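
The two criteria can be expressed as a simple filter. This is a sketch under stated assumptions rather than the pipeline's actual code: the function name is hypothetical, the fold change is treated as a linear ratio (not log2), and a change in either direction is assumed to count, which the page does not state.

```python
def is_differentially_expressed(fdr, fold_change, fdr_cutoff=0.05, fc_cutoff=2.0):
    """Apply the FDR < 0.05 and fold change > 2 criteria to one transcript."""
    if fold_change <= 0:
        return False  # a linear ratio must be positive
    # Assumption: up- and down-regulation both count, so 0.4 is a 2.5-fold change.
    magnitude = max(fold_change, 1.0 / fold_change)
    return fdr < fdr_cutoff and magnitude > fc_cutoff


# Hypothetical (FDR, fold change) pairs, for illustration only.
print(is_differentially_expressed(0.01, 3.0))  # True
print(is_differentially_expressed(0.01, 0.4))  # True
print(is_differentially_expressed(0.20, 5.0))  # False
print(is_differentially_expressed(0.01, 1.5))  # False
```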

Raw TPM values