Gene loci information

Transcript annotation

  • This transcript has been annotated as U2 snRNP-associated SURP motif-containing protein.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g8114 g8114.t7 isoform g8114.t7 28645774 28647001
chr_2 g8114 g8114.t7 exon g8114.t7.exon1 28645774 28646350
chr_2 g8114 g8114.t7 cds g8114.t7.CDS1 28645776 28646350
chr_2 g8114 g8114.t7 exon g8114.t7.exon2 28646417 28646572
chr_2 g8114 g8114.t7 cds g8114.t7.CDS2 28646417 28646572
chr_2 g8114 g8114.t7 exon g8114.t7.exon3 28646645 28646927
chr_2 g8114 g8114.t7 cds g8114.t7.CDS3 28646645 28646927
chr_2 g8114 g8114.t7 exon g8114.t7.exon4 28646996 28647001
chr_2 g8114 g8114.t7 cds g8114.t7.CDS4 28646996 28647001
chr_2 g8114 g8114.t7 TSS g8114.t7 28647098 28647098
chr_2 g8114 g8114.t7 TTS g8114.t7 NA NA

Sequences

>g8114.t7 Gene=g8114 Length=1022
ATGAGTAAATTATCAAAAAGAGAATTAGAGGAAAAGAGGAGAAGAGAAATTGAAGAAGAA
AATGCAAAAGCTTTTGAGGATTTTTATGAAACATTTCAAAACACATCAAACTCAGCTGCT
TCAAAGGTTTGGATAAAAGCAGGAACATACGATGCTGGTTCAAGAAAAGAAGATACAAAG
GAGAAAGGAAAACTTTATAAGCCGACTCCAAAAGCTGGAACTTCTCATGACGATAACTCT
GCTTCTTCTGCTCAAGAATATGCAAGAATGCTAGCAACGTCAGATTCTCAAGAACTCAAA
CAAATGCAAGAGGAACGTGAAGAGAGACATAAATATAAACATGCGGCGAAATCAATGATG
CAAATGCAAATTCACAGTGTTTTAGAACATCCTGAACCGATTTATCGTGAAGTTCAAGAA
TCACAAGGAAGTTTTGATACAGGGGATCCAACAACTACCAATTTATATCTTGGTAATTTA
AATCCTAAAATAACTGAGCAACAGTTAATGGAAATATTTGGAAAATACGGACCATTAGCA
AGTGTTAAAATTATGTGGCCGAGATCTGATGAGGAAAAAGCTCGCGGTAGAAATTGTGGA
TTTGTAGCATTTATGGCAAGAAAAGATGCTGAGCGTGCATTAAGGCGGTTAGCAGGTAAA
GATGTTATGGGTTATGAAATGAAACTTGGTTGGGGCAAAAGTGTCCCAATTATGACTCAT
CCGATTTATATTCCACCAAAATTACAAGAATATGCAATGCCACCACAAAAATCAGGATTA
CCTTTTAATGCTCAACCACTTGGTGATGTTACAAATATCAATGTAGATGAACTTGATTTC
AAATCATATTTAGTTGATGAAGAGAAAAAAAAGCAAATAGATGAGATTTTATCAAAGACT
ATTGTAAAAGTAGTAATACCCACAGAGCGACCATTGCTCATGTTAATACATAGAATGATT
GAATTTGTAATTCGAGAAGGACCAATTTTTGAATCTATGATAATGGCACGAGAACAAAAT
AA

>g8114.t7 Gene=g8114 Length=340
MSKLSKRELEEKRRREIEEENAKAFEDFYETFQNTSNSAASKVWIKAGTYDAGSRKEDTK
EKGKLYKPTPKAGTSHDDNSASSAQEYARMLATSDSQELKQMQEEREERHKYKHAAKSMM
QMQIHSVLEHPEPIYREVQESQGSFDTGDPTTTNLYLGNLNPKITEQQLMEIFGKYGPLA
SVKIMWPRSDEEKARGRNCGFVAFMARKDAERALRRLAGKDVMGYEMKLGWGKSVPIMTH
PIYIPPKLQEYAMPPQKSGLPFNAQPLGDVTNINVDELDFKSYLVDEEKKKQIDEILSKT
IVKVVIPTERPLLMLIHRMIEFVIREGPIFESMIMAREQN

Protein features from InterProScan

Transcript Database ID Name Start End E.value
8 g8114.t7 CDD cd12223 RRM_SR140 152 235 3.7348E-57
7 g8114.t7 Coils Coil Coil 7 27 -
10 g8114.t7 MobiDBLite mobidb-lite consensus disorder prediction 1 20 -
12 g8114.t7 MobiDBLite mobidb-lite consensus disorder prediction 49 84 -
11 g8114.t7 MobiDBLite mobidb-lite consensus disorder prediction 51 67 -
3 g8114.t7 PANTHER PTHR23140 RNA PROCESSING PROTEIN LD23810P 5 340 2.0E-93
4 g8114.t7 PANTHER PTHR23140:SF0 U2 SNRNP-ASSOCIATED SURP MOTIF-CONTAINING PROTEIN 5 340 2.0E-93
2 g8114.t7 Pfam PF00076 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) 155 227 1.3E-14
1 g8114.t7 Pfam PF01805 Surp module 314 340 1.5E-4
13 g8114.t7 ProSiteProfiles PS50102 Eukaryotic RNA Recognition Motif (RRM) profile. 153 234 16.654
14 g8114.t7 ProSiteProfiles PS50128 SURP motif repeat profile. 315 340 8.891
9 g8114.t7 SMART SM00360 rrm1_1 154 230 7.4E-17
5 g8114.t7 SUPERFAMILY SSF54928 RNA-binding domain, RBD 141 236 1.02E-21
6 g8114.t7 SUPERFAMILY SSF109905 Surp module (SWAP domain) 297 340 4.32E-7

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0006396 RNA processing BP
GO:0003723 RNA binding MF
GO:0003676 nucleic acid binding MF

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below. There were no conditions that were differentially expressed