Gene loci information

Transcript annotation

  • This transcript has been annotated as U2 snRNP-associated SURP motif-containing protein.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g8114 g8114.t8 isoform g8114.t8 28645774 28647001
chr_2 g8114 g8114.t8 exon g8114.t8.exon1 28645774 28646350
chr_2 g8114 g8114.t8 cds g8114.t8.CDS1 28645776 28646350
chr_2 g8114 g8114.t8 exon g8114.t8.exon2 28646417 28646927
chr_2 g8114 g8114.t8 cds g8114.t8.CDS2 28646417 28646927
chr_2 g8114 g8114.t8 exon g8114.t8.exon3 28646996 28647001
chr_2 g8114 g8114.t8 cds g8114.t8.CDS3 28646996 28647001
chr_2 g8114 g8114.t8 TSS g8114.t8 28647098 28647098
chr_2 g8114 g8114.t8 TTS g8114.t8 NA NA

Sequences

>g8114.t8 Gene=g8114 Length=1094
ATGAGTAAATTATCAAAAAGAGAATTAGAGGAAAAGAGGAGAAGAGAAATTGAAGAAGAA
AATGCAAAAGCTTTTGAGGATTTTTATGAAACATTTCAAAACACATCAAACTCAGCTGCT
TCAAAGGTTTGGATAAAAGCAGGAACATACGATGCTGGTTCAAGAAAAGAAGATACAAAG
GAGAAAGGAAAACTTTATAAGCCGACTCCAAAAGCTGGAACTTCTCATGACGATAACTCT
GCTTCTTCTGCTCAAGAATATGCAAGAATGCTAGCAACGTCAGATTCTCGTAAAGAAATA
ACACCTTTGGGCAAAAAGAAAGCTCAAGAGAAGAAAAAATCTAATTTAGAAATGTTTAAA
GAAGAACTCAAACAAATGCAAGAGGAACGTGAAGAGAGACATAAATATAAACATGCGGCG
AAATCAATGATGCAAATGCAAATTCACAGTGTTTTAGAACATCCTGAACCGATTTATCGT
GAAGTTCAAGAATCACAAGGAAGTTTTGATACAGGGGATCCAACAACTACCAATTTATAT
CTTGGTAATTTAAATCCTAAAATAACTGAGCAACAGTTAATGGAAATATTTGGAAAATAC
GGACCATTAGCAAGTGTTAAAATTATGTGGCCGAGATCTGATGAGGAAAAAGCTCGCGGT
AGAAATTGTGGATTTGTAGCATTTATGGCAAGAAAAGATGCTGAGCGTGCATTAAGGCGG
TTAGCAGGTAAAGATGTTATGGGTTATGAAATGAAACTTGGTTGGGGCAAAAGTGTCCCA
ATTATGACTCATCCGATTTATATTCCACCAAAATTACAAGAATATGCAATGCCACCACAA
AAATCAGGATTACCTTTTAATGCTCAACCACTTGGTGATGTTACAAATATCAATGTAGAT
GAACTTGATTTCAAATCATATTTAGTTGATGAAGAGAAAAAAAAGCAAATAGATGAGATT
TTATCAAAGACTATTGTAAAAGTAGTAATACCCACAGAGCGACCATTGCTCATGTTAATA
CATAGAATGATTGAATTTGTAATTCGAGAAGGACCAATTTTTGAATCTATGATAATGGCA
CGAGAACAAAATAA

>g8114.t8 Gene=g8114 Length=364
MSKLSKRELEEKRRREIEEENAKAFEDFYETFQNTSNSAASKVWIKAGTYDAGSRKEDTK
EKGKLYKPTPKAGTSHDDNSASSAQEYARMLATSDSRKEITPLGKKKAQEKKKSNLEMFK
EELKQMQEEREERHKYKHAAKSMMQMQIHSVLEHPEPIYREVQESQGSFDTGDPTTTNLY
LGNLNPKITEQQLMEIFGKYGPLASVKIMWPRSDEEKARGRNCGFVAFMARKDAERALRR
LAGKDVMGYEMKLGWGKSVPIMTHPIYIPPKLQEYAMPPQKSGLPFNAQPLGDVTNINVD
ELDFKSYLVDEEKKKQIDEILSKTIVKVVIPTERPLLMLIHRMIEFVIREGPIFESMIMA
REQN

Protein features from InterProScan

Transcript Database ID Name Start End E.value
9 g8114.t8 CDD cd12223 RRM_SR140 176 259 1.38552E-56
8 g8114.t8 Coils Coil Coil 7 27 -
7 g8114.t8 Coils Coil Coil 106 129 -
12 g8114.t8 MobiDBLite mobidb-lite consensus disorder prediction 49 84 -
11 g8114.t8 MobiDBLite mobidb-lite consensus disorder prediction 51 67 -
3 g8114.t8 PANTHER PTHR23140 RNA PROCESSING PROTEIN LD23810P 5 364 6.4E-103
4 g8114.t8 PANTHER PTHR23140:SF0 U2 SNRNP-ASSOCIATED SURP MOTIF-CONTAINING PROTEIN 5 364 6.4E-103
2 g8114.t8 Pfam PF00076 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) 179 251 1.4E-14
1 g8114.t8 Pfam PF01805 Surp module 338 364 1.6E-4
13 g8114.t8 ProSiteProfiles PS50102 Eukaryotic RNA Recognition Motif (RRM) profile. 177 258 16.654
14 g8114.t8 ProSiteProfiles PS50128 SURP motif repeat profile. 339 364 8.891
10 g8114.t8 SMART SM00360 rrm1_1 178 254 7.4E-17
5 g8114.t8 SUPERFAMILY SSF54928 RNA-binding domain, RBD 165 260 1.17E-21
6 g8114.t8 SUPERFAMILY SSF109905 Surp module (SWAP domain) 321 364 4.71E-7

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0006396 RNA processing BP
GO:0003723 RNA binding MF
GO:0003676 nucleic acid binding MF

KEGG

Orthology

This gene did not have any KEGG ortholog annotations (KAAS, GHOSTZ).

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values