Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative cysteine proteinase CG12163.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g4405 g4405.t16 TTS g4405.t16 2001061 2001061
chr_2 g4405 g4405.t16 isoform g4405.t16 2001294 2003744
chr_2 g4405 g4405.t16 exon g4405.t16.exon1 2001294 2001377
chr_2 g4405 g4405.t16 cds g4405.t16.CDS1 2001294 2001377
chr_2 g4405 g4405.t16 exon g4405.t16.exon2 2001442 2001689
chr_2 g4405 g4405.t16 cds g4405.t16.CDS2 2001442 2001689
chr_2 g4405 g4405.t16 exon g4405.t16.exon3 2001757 2002007
chr_2 g4405 g4405.t16 cds g4405.t16.CDS3 2001757 2002007
chr_2 g4405 g4405.t16 exon g4405.t16.exon4 2002061 2003089
chr_2 g4405 g4405.t16 cds g4405.t16.CDS4 2002061 2003089
chr_2 g4405 g4405.t16 exon g4405.t16.exon5 2003160 2003281
chr_2 g4405 g4405.t16 cds g4405.t16.CDS5 2003160 2003281
chr_2 g4405 g4405.t16 exon g4405.t16.exon6 2003373 2003744
chr_2 g4405 g4405.t16 cds g4405.t16.CDS6 2003373 2003597
chr_2 g4405 g4405.t16 TSS g4405.t16 NA NA

Sequences

>g4405.t16 Gene=g4405 Length=2106
GCCGATGAACTGCAACAAAGAACTCAAGAAAATGAAAAAAATTCCGATAATCAAGAAGAT
AGAGAAAATGACACGGCATTGAAGGGTCTCGAAATAGAAATAAAGAAAACTTTTAGTGAG
CTATTTCAAACTAATTCCGATTTTAGAATGAATATAATAGCGCTGATAAATAGAAAAGAT
GATTTGACTGCACAAAAGAATTACAACTATGTAGTCAATATATTAGCAAGTAAACTTAAA
GATAAAATTGAGTCATACAATGAGAGAAGATTTAAAGATGAGCAAACTCAAAATAATTAT
CAGCAAATAAATACTAATAGAACAAAGCGCTCTTACTTTTTTGATTCCCAAAATTCCTTT
CCATCTCACAAGCGTGCTGCTCGTCAAATCGGTGTTCCCGGTGGAATCAGTCCTGTAGAA
AACTTTGAAGATGTAAAAATTTATGTTCAAGAAGCTATTGATGAAATTAATGATAATGAA
GATCCTGATTACATTTTGAAACATATCGTTGAAGCAACCCAACAAGTTGTTGCAGGCATG
AGTTATAAAATTAAAGCAGTGTTTTCCAGAGATGGAAGCGACATTGAATGTGATTTTGAT
GTATGGGAGCAAGCTTGGATTAAAGATGGACGTAAAGTTTCAGTTTCTTGCAAAAATGAT
AAGAAATATAAGTTGACCCAATCACCATCTAATCAGCGTGTCAAACGTGATAACACGCTT
GAAAGAGTTCTTGGTTTACCATCCAATACTGATGATCATGACGATTTGATAAAAATACTT
TCTGAACATTTGAAGAGACTCGATACTGGAAGTGATGCACAATTTGAATTGGTAAAACTT
GAAAAGGTAACTCAACAAGTAGTAGCTGGAATAAAATATAAAGCAACAGGTATTTTTAAA
ATTGGCAATGAAGAGAAAAAATGTGTTATCGATGTATGGCATCGCTCATGGATTAAGGGA
GATGAAGGCACTCAATTAAGCGCTGATTGTGATAAAGGTGCAACAACTTTCAAGACAAAA
TCTTCTAGAAAAAGGAGATCAGTTCATCACCACACACACAATCGTCACAATAGACAATCA
GTAAGCGATCATTTTGATGACCATCATCATCATACTGATAGACATCATCATCAATACTCA
GCTACTGAAGAAATGAAAGAAATAAAATCTGAAATTTTATTTAACAATTTCATAACTAAA
TATAATCGTAAATATGCCAATGAACTTGAACATAAAATGAGAATGAGAATTTTCAAGAAG
AATTTACATAAAATTGAAATGTTGAATAAGCATGAACAAGGCACTGCAAAGTATGGAATT
ACAGAATTCGCTGATTTAACTGAAAAGGAATACTTGCATAAAACTGGTTTGAGAGTGCGT
GAAAGACATGAGAATGAATTAGAAAATCCAATTGCACATATTCCAGAAGTTGAAGATTTA
CCAACCGAATTTGATTGGAGAGATAAATCAGCAGTTACAAGTGTAAAAAATCAAGGAAAT
TGTGGATCATGCTGGAGTTTTTCTGTTACAGGAAATATTGAAGGCTTACATGCTATTAAA
ACTGGAAAACTTGAAGCTTATTCTGAACAAGAACTTTTGGACTGTGATACAACTGATAAT
GCTTGCAATGGTGGTTATATGGATGATGCTTTTAAAGCAATTGAAAAAATTGGTGGTCTA
GAATTAGAAGATGAATATCCTTATCAAGCAAGGAAACAAAAGAAATGCTTGTTTAATGCT
ACTATGAGTCATGTTAAAGTTAAAGGTGTTGTAGATTTGCCTAAAGGTGATGAAATTGCA
ATGCAAAAGTTTTTAGTCTCAACTGGTCCGATTTCCATTGGCATAAATGCTAATGCTATG
CAATTTTATCGTGGTGGTGTTTCGCATCCATGGAAAGTTCTTTGCAGAAAATCTAATTTA
GATCATGGTGTTTTGATTGTTGGATATGGAATAAAAGAGTATCCCATGTTTAATAAAACT
TTACCTTATTGGACTATTAAAAATTCATGGGGTCCAAAATGGGGTGAACAAGGATATTAT
CGAGTTTATCGTGGAGATAACAGTTGTGGAGTTGCAGAAATGGCAAGCAGCGCAGTACTT
GAATAA

>g4405.t16 Gene=g4405 Length=652
MNIIALINRKDDLTAQKNYNYVVNILASKLKDKIESYNERRFKDEQTQNNYQQINTNRTK
RSYFFDSQNSFPSHKRAARQIGVPGGISPVENFEDVKIYVQEAIDEINDNEDPDYILKHI
VEATQQVVAGMSYKIKAVFSRDGSDIECDFDVWEQAWIKDGRKVSVSCKNDKKYKLTQSP
SNQRVKRDNTLERVLGLPSNTDDHDDLIKILSEHLKRLDTGSDAQFELVKLEKVTQQVVA
GIKYKATGIFKIGNEEKKCVIDVWHRSWIKGDEGTQLSADCDKGATTFKTKSSRKRRSVH
HHTHNRHNRQSVSDHFDDHHHHTDRHHHQYSATEEMKEIKSEILFNNFITKYNRKYANEL
EHKMRMRIFKKNLHKIEMLNKHEQGTAKYGITEFADLTEKEYLHKTGLRVRERHENELEN
PIAHIPEVEDLPTEFDWRDKSAVTSVKNQGNCGSCWSFSVTGNIEGLHAIKTGKLEAYSE
QELLDCDTTDNACNGGYMDDAFKAIEKIGGLELEDEYPYQARKQKKCLFNATMSHVKVKG
VVDLPKGDEIAMQKFLVSTGPISIGINANAMQFYRGGVSHPWKVLCRKSNLDHGVLIVGY
GIKEYPMFNKTLPYWTIKNSWGPKWGEQGYYRVYRGDNSCGVAEMASSAVLE

Protein features from InterProScan

Transcript Database ID Name Start End E.value
16 g4405.t16 CDD cd00042 CY 85 168 5.10558E-9
15 g4405.t16 CDD cd02248 Peptidase_C1A 432 649 4.51221E-100
13 g4405.t16 Gene3D G3DSA:3.10.450.10 - 79 172 5.5E-16
12 g4405.t16 Gene3D G3DSA:3.10.450.10 - 190 280 9.7E-11
14 g4405.t16 Gene3D G3DSA:3.90.70.10 Cysteine proteinases 310 651 2.1E-101
26 g4405.t16 MobiDBLite mobidb-lite consensus disorder prediction 287 333 -
25 g4405.t16 MobiDBLite mobidb-lite consensus disorder prediction 290 308 -
24 g4405.t16 MobiDBLite mobidb-lite consensus disorder prediction 309 333 -
4 g4405.t16 PANTHER PTHR13814:SF16 CYSTATIN 344 650 9.4E-101
5 g4405.t16 PANTHER PTHR13814 FETUIN 344 650 9.4E-101
6 g4405.t16 PRINTS PR00705 Papain cysteine protease (C1) family signature 449 464 7.4E-9
8 g4405.t16 PRINTS PR00705 Papain cysteine protease (C1) family signature 593 603 7.4E-9
7 g4405.t16 PRINTS PR00705 Papain cysteine protease (C1) family signature 614 620 7.4E-9
2 g4405.t16 Pfam PF00031 Cystatin domain 85 141 4.3E-6
1 g4405.t16 Pfam PF08246 Cathepsin propeptide inhibitor domain (I29) 345 402 3.9E-12
3 g4405.t16 Pfam PF00112 Papain family cysteine protease 431 649 3.1E-71
19 g4405.t16 ProSitePatterns PS00139 Eukaryotic thiol (cysteine) proteases cysteine active site. 449 460 -
18 g4405.t16 ProSitePatterns PS00639 Eukaryotic thiol (cysteine) proteases histidine active site. 591 601 -
17 g4405.t16 ProSitePatterns PS00640 Eukaryotic thiol (cysteine) proteases asparagine active site. 614 633 -
22 g4405.t16 SMART SM00043 CY_4 82 169 7.0E-10
23 g4405.t16 SMART SM00043 CY_4 193 282 0.18
21 g4405.t16 SMART SM00848 Inhibitor_I29_2 345 402 3.0E-18
20 g4405.t16 SMART SM00645 pept_c1 431 650 2.0E-100
9 g4405.t16 SUPERFAMILY SSF54403 Cystatin/monellin 80 171 1.76E-13
10 g4405.t16 SUPERFAMILY SSF54403 Cystatin/monellin 203 270 4.68E-8
11 g4405.t16 SUPERFAMILY SSF54001 Cysteine proteinases 338 649 4.91E-100

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0008234 cysteine-type peptidase activity MF
GO:0006508 proteolysis BP
GO:0004869 cysteine-type endopeptidase inhibitor activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values