Gene loci information

Transcript annotation

  • This transcript has been annotated as Cathepsin B.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is shown below. CDS regions are colored green. TSSs and TTSs predicted from CTR-Seq data are indicated by solid circles and squares, respectively. Coordinates for each feature are listed in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g2993 g2993.t43 TTS g2993.t43 21867629 21867629
chr_3 g2993 g2993.t43 isoform g2993.t43 21867833 21869132
chr_3 g2993 g2993.t43 exon g2993.t43.exon1 21867833 21868355
chr_3 g2993 g2993.t43 cds g2993.t43.CDS1 21867912 21868355
chr_3 g2993 g2993.t43 exon g2993.t43.exon2 21868412 21868479
chr_3 g2993 g2993.t43 cds g2993.t43.CDS2 21868412 21868479
chr_3 g2993 g2993.t43 exon g2993.t43.exon3 21868542 21868992
chr_3 g2993 g2993.t43 cds g2993.t43.CDS3 21868542 21868992
chr_3 g2993 g2993.t43 exon g2993.t43.exon4 21869064 21869132
chr_3 g2993 g2993.t43 cds g2993.t43.CDS4 21869064 21869132
chr_3 g2993 g2993.t43 TSS g2993.t43 21869220 21869220
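
The coordinates in the table above can be cross-checked against the sequence records on this page. The following is a minimal sketch, assuming the Start and End columns are 1-based and inclusive (GFF-style); the function names are illustrative and not part of the annotation pipeline. It sums exon and CDS lengths and guesses the strand from the relative positions of the TSS and TTS.

```python
# Coordinates copied from the table above.
# Assumption: 1-based, inclusive (GFF-style) coordinates.
FEATURES = [
    ("TTS", 21867629, 21867629),
    ("exon", 21867833, 21868355),
    ("cds", 21867912, 21868355),
    ("exon", 21868412, 21868479),
    ("cds", 21868412, 21868479),
    ("exon", 21868542, 21868992),
    ("cds", 21868542, 21868992),
    ("exon", 21869064, 21869132),
    ("cds", 21869064, 21869132),
    ("TSS", 21869220, 21869220),
]


def total_length(features, category):
    """Sum the lengths of all features of one category (inclusive ends)."""
    return sum(end - start + 1 for cat, start, end in features if cat == category)


def infer_strand(features):
    """Guess the strand: a TSS downstream of the TTS implies the minus strand."""
    pos = {cat: start for cat, start, _ in features if cat in ("TSS", "TTS")}
    return "-" if pos["TSS"] > pos["TTS"] else "+"


exon_len = total_length(FEATURES, "exon")
cds_len = total_length(FEATURES, "cds")
strand = infer_strand(FEATURES)
print(exon_len, cds_len, strand)  # 1111 1032 -
```

Under these assumptions the exons sum to 1111 nt, matching the transcript record, and the CDS sums to 1032 nt, i.e. 343 codons plus a stop codon, matching the protein record.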

Sequences

>g2993.t43 Gene=g2993 Length=1111
ATGAAATATTTCATATTATTTGCTGCATTGACAGCAGTTTGCTTTGGAGCTGACATTTTT
TCTGATGAATTCATCAATGAAATTAATAAAAAAGCAACAACATGGAAGGCTGGCCATAAT
TTTCATCCAGATACTAATTTGAAATATATCAAGAATTTACTTGGTGTTCATTCCGATTCT
AAGCATTTTAAACTACCTGAGTTATTGCATGACTCACGGGACATAAAAGATTTACCTGAA
AATTTTGATGCTCGTGAACAATGGCCTGATTGTCCAACATTGAGAGAAATAAGAGATCAA
GGAAGTTGTGGTAAGTTCTACACTTTTTTATTCAAAAGAATTTTCGAAAATTTTCGATTT
TCTGAAGGATCTTGTTGGGCTTTTGGCGCTGTTGAGGCAATGAGTGACAGAGTATGCATT
CATAGCAACGCAACTGAACATTTCCACTTTTCTGCTGAACATTTGGTTTCTTGCTGTCAC
ACTTGTGGATTTGGTTGCAATGGTGGCTTTCCAGGATCAGCTTGGAGCTATTGGGTTAGA
AAAGGTATTGTTAGTGGTGGACCATACAATTCTTCAATTGGATGTCAGCCATATGAAATC
GCTCCTTGTGAACATCATGTCAATGGAACTCGTATGCCATGCTCTGGAGAAGGACATACA
CCAAAATGCATGAATAAATGCTCGAATCCCGCTTATAAAGTTGATTTCAAAACAGACAAA
CACTTTGGCAAATCAAGCTATTCAGTAAAGCGTAATGAAGATCAAATTCGTTTGGAAATT
TTCAAAAATGGTCCAGTTGAAGGTGCATTCACAGTCTATGAAGATTTCGTTCAATATAAA
TCTGGAGTTTATCAACATGTTACTGGTAAAGCACTTGGTGGCCATGCTATTAAAATTTTC
GGATGGGGAGTTGAAAATGGTGTCAAATACTGGTTGATTGCTAATAGCTGGAATTCAGGT
ATGTTTATTGAATTTCTGCCTAAATATAAACTAGAAGATTTATTTTTTTTAAAAAAAAAT
AGATTGGGGTGATAATGGAACCTTCAAAATATTGCGAGGAGAAGACCATGTCGGTATTGA
GAGTGAGATTAGCGCTGGTTTGCCAAAGTAA

>g2993.t43 Gene=g2993 Length=343
MKYFILFAALTAVCFGADIFSDEFINEINKKATTWKAGHNFHPDTNLKYIKNLLGVHSDS
KHFKLPELLHDSRDIKDLPENFDAREQWPDCPTLREIRDQGSCGKFYTFLFKRIFENFRF
SEGSCWAFGAVEAMSDRVCIHSNATEHFHFSAEHLVSCCHTCGFGCNGGFPGSAWSYWVR
KGIVSGGPYNSSIGCQPYEIAPCEHHVNGTRMPCSGEGHTPKCMNKCSNPAYKVDFKTDK
HFGKSSYSVKRNEDQIRLEIFKNGPVEGAFTVYEDFVQYKSGVYQHVTGKALGGHAIKIF
GWGVENGVKYWLIANSWNSGMFIEFLPKYKLEDLFFLKKNRLG
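
As a consistency check on the two records above, the sketch below translates the first 60 nucleotides of the transcript and compares the result with the start of the protein record. It assumes the standard genetic code (NCBI translation table 1) and that the coding sequence starts at the first base; the helper name translate is illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the annotation pipeline used the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides):
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(nucleotides) - 2, 3):
        aa = CODON_TABLE[nucleotides[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the transcript record and first 20 residues of the protein record.
first_60_nt = "ATGAAATATTTCATATTATTTGCTGCATTGACAGCAGTTTGCTTTGGAGCTGACATTTTT"
expected_prefix = "MKYFILFAALTAVCFGADIF"

print(translate(first_60_nt) == expected_prefix)  # True
```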

Protein features from InterProScan

Row Transcript Database ID Name Start End E-value
15 g2993.t43 CDD cd02620 Peptidase_C1A_CathepsinB 79 320 2.14551E-127
9 g2993.t43 Gene3D G3DSA:3.90.70.10 Cysteine proteinases 16 323 4.1E-99
3 g2993.t43 PANTHER PTHR12411:SF737 CATHEPSIN B 13 109 7.2E-86
5 g2993.t43 PANTHER PTHR12411 CYSTEINE PROTEASE FAMILY C1-RELATED 13 109 7.2E-86
4 g2993.t43 PANTHER PTHR12411:SF737 CATHEPSIN B 120 318 7.2E-86
6 g2993.t43 PANTHER PTHR12411 CYSTEINE PROTEASE FAMILY C1-RELATED 120 318 7.2E-86
1 g2993.t43 Pfam PF08127 Peptidase family C1 propeptide 20 59 9.7E-19
2 g2993.t43 Pfam PF00112 Papain family cysteine protease 123 319 1.5E-46
11 g2993.t43 Phobius SIGNAL_PEPTIDE Signal peptide region 1 16 -
12 g2993.t43 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 3 -
13 g2993.t43 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 4 11 -
14 g2993.t43 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 12 16 -
10 g2993.t43 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 17 343 -
16 g2993.t43 SMART SM00645 pept_c1 78 331 2.6E-66
7 g2993.t43 SUPERFAMILY SSF54001 Cysteine proteinases 21 319 3.22E-81
8 g2993.t43 SignalP_EUK SignalP-noTM SignalP-noTM 1 16 -

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
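
The 0.5 threshold can be applied to a per-residue score profile to list contiguous disordered segments. This is a minimal sketch: the function name and the example scores are hypothetical and are not taken from this transcript's profile.

```python
def disordered_regions(scores, threshold=0.5):
    """Return 1-based inclusive (start, end) runs where score > threshold."""
    regions = []
    start = None
    for position, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = position  # open a new run
        elif start is not None:
            regions.append((start, position - 1))  # close the current run
            start = None
    if start is not None:
        regions.append((start, len(scores)))  # run extends to the last residue
    return regions


# Hypothetical per-residue scores, for illustration only.
example_scores = [0.10, 0.62, 0.71, 0.40, 0.90]
print(disordered_regions(example_scores))  # [(2, 3), (5, 5)]
```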

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0008234 cysteine-type peptidase activity MF
GO:0006508 proteolysis BP
GO:0004197 cysteine-type endopeptidase activity MF
GO:0050790 regulation of catalytic activity BP

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean ± standard deviation.
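
A summary of this kind could be computed from replicate TPM values as sketched below. The page does not state whether the sample or the population standard deviation was used; the sample form (n - 1 denominator) is assumed here, and the replicate values are hypothetical.

```python
from statistics import mean, stdev


def summarize_tpm(replicates):
    """Format replicate TPM values as 'mean +/- sample standard deviation'."""
    m = mean(replicates)
    sd = stdev(replicates)  # assumption: sample standard deviation (n - 1)
    return f"{m:.2f} +/- {sd:.2f}"


# Hypothetical replicate TPM values, for illustration only.
example_tpm = [10.0, 12.0, 14.0]
print(summarize_tpm(example_tpm))  # 12.00 +/- 2.00
```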

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were considered differentially expressed when (1) FDR < 0.05 and (2) fold change > 2 (TPM values calculated by RSEM). DE status and fold changes between conditions are shown in the plot below.
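
The two criteria can be expressed as a simple filter. This is a sketch, not the pipeline's code: the function name is hypothetical, and it assumes the fold-change cutoff is applied in both directions (more than 2-fold up or more than 2-fold down), which the text above does not state explicitly.

```python
def is_differentially_expressed(fdr, fold_change, fdr_cutoff=0.05, fc_cutoff=2.0):
    """Apply the FDR and fold-change criteria to one transcript.

    fold_change is a ratio between two conditions (not log-transformed).
    Assumption: the cutoff is symmetric, so ratios below 1 / fc_cutoff also pass.
    """
    if fold_change <= 0:
        raise ValueError("fold_change must be a positive ratio")
    passes_fdr = fdr < fdr_cutoff
    passes_fc = fold_change > fc_cutoff or fold_change < 1.0 / fc_cutoff
    return passes_fdr and passes_fc


print(is_differentially_expressed(fdr=0.01, fold_change=3.0))  # True
print(is_differentially_expressed(fdr=0.20, fold_change=8.0))  # False
```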

Raw TPM values