Gene loci information

Transcript annotation

  • This transcript has been annotated as pre-mRNA 3’ end processing protein WDR33.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g10507 g10507.t1 TTS g10507.t1 10273461 10273461
chr_1 g10507 g10507.t1 isoform g10507.t1 10273519 10276695
chr_1 g10507 g10507.t1 exon g10507.t1.exon1 10273519 10273856
chr_1 g10507 g10507.t1 cds g10507.t1.CDS1 10273519 10273856
chr_1 g10507 g10507.t1 exon g10507.t1.exon2 10273910 10273975
chr_1 g10507 g10507.t1 cds g10507.t1.CDS2 10273910 10273975
chr_1 g10507 g10507.t1 exon g10507.t1.exon3 10274039 10274667
chr_1 g10507 g10507.t1 cds g10507.t1.CDS3 10274039 10274667
chr_1 g10507 g10507.t1 exon g10507.t1.exon4 10274733 10275191
chr_1 g10507 g10507.t1 cds g10507.t1.CDS4 10274733 10275191
chr_1 g10507 g10507.t1 exon g10507.t1.exon5 10275861 10276012
chr_1 g10507 g10507.t1 cds g10507.t1.CDS5 10275861 10276012
chr_1 g10507 g10507.t1 exon g10507.t1.exon6 10276090 10276223
chr_1 g10507 g10507.t1 cds g10507.t1.CDS6 10276090 10276223
chr_1 g10507 g10507.t1 exon g10507.t1.exon7 10276275 10276410
chr_1 g10507 g10507.t1 cds g10507.t1.CDS7 10276275 10276410
chr_1 g10507 g10507.t1 exon g10507.t1.exon8 10276477 10276695
chr_1 g10507 g10507.t1 cds g10507.t1.CDS8 10276477 10276695
chr_1 g10507 g10507.t1 TSS g10507.t1 10276960 10276960

Sequences

>g10507.t1 Gene=g10507 Length=2133
ATGGAATTTCCACCACCAGATTTACCACCACCATCTTTTCAACCGCCTCAATTTCAACCC
AGACCATACAATCCGAATTTTCATCAACAACATAGGCCCTATTACAATAAATATAAGCAT
GGTCCTCAAATTCAAGATGACTTTGACGGTAAGAGATTGCGTAAAAGTGTTATGCGAAAG
ACTGTCGATTATAATGCTTCAATTGTGAGAGCCTTACAGAATCGGGTTTGGCAGAGAGAT
CATATTGATCGAAGAGCGCTGCAACCTGAAAATAGTTACATTCCTGATCTCATGCCGCCT
ATGAGCTATCTTGACAATCCTAGTAATTCAATTACTACGCGTTTTGTTAAAACTGCTACA
AATAAAATGAGATGTCCTGTTTTCACACTTGCTTGGACGCCTGAAGGTAGACGCTTGATT
ACAGGAGCAAGTTCTGGTGAATTCACTCTTTGGAATGGATTAACTTTTAATTTTGAAACA
ATTCTTCAAGCACATAGTACATCAGTGAGGTCCATGGTGTGGTCTCATAATGACAATTGG
ATGGTAACAGGTGATCATAACGGTTATGTTAAATATTGGCAATCAAACATGAACAACGTA
AAGCAGTTTCAAGCACATAAGGAACCAATTAGAGGACTAAGCTTTTGTCCAACGGATGCA
AAATTTGCAACATGCAGTGATGATGGAACAGTCAGGATATTTGATTTTCTTCGTTGTCAC
GAAGAGCGAGTCATGAGAGGTCACGGTGCTGATGTTAAAACAATTCATTGGCATCCTCAA
AAATCAATTGTAGCATCAGGTAGTAAAGATAATCAACAGCCGATCAAATTATGGGATCCA
AAGAGCGGTACTGCTTTGTGCACTTTGCATGCACATAAATCCACTGTAATGGATTTGAAA
TGGAACGATAATGGTAACTGGCTTGTGACGGCATCGCGTGATCATCTTTTGAAACTGTTT
GATATAAGAAATTTAAAAGAGGAAATGCAAACATTTAGAGGACATAAAAAGGAAGCAAGT
TCTGTATGTTGGCATCCTATTCATGAGGGACTTTTTGTGTCAGGTGGATCTGATGGTCAA
ATTTTGTTTTGGAATGTTGGAACTGATAAGGAAGTTGGAGGAATTGAAGCAGCTCATGAA
AGTATTGTTTGGACACTTGCATGGCATCCAATCGGACACATTCTTTGTTCTGGCTCTAAC
GACCATACAGTTAAATTTTGGACAAGAAATAGACCAGGTGATCAAATGAGAGACAAATAT
AATTTGAATACATTACCAGCTAGTCTTGCTGGTCTCGAAGATTATGAAATGGATGAACAT
ATTGTTATTCCTGGAATGGGAATTGCACCAAGTGATGATCAAGATGAAGATGATGACGAT
GAAGATGTATATGCTCAATCTCAAATGCCTGGAGATTTGTTACAAAAGGATGAAAATTCA
GTTAATAGTAATTCAGAAGCTCTCGATAATGGAGTTATTCCTGGATTGGATCTTGATGGT
GGCAAAAATGATAAAAAATTACCTTACAGCAAGCCTATACCAAAAAATTTCCAAGCTCAA
TGGAATGTTAATGAAAAGAATGACAAACCATCGCATAATCCAACAAACATCATAGAATGT
ATTCAGCATGTGGTTACGAAAATTAACGAAAGATATCCAGGTCTCATTAAAATCGATAAT
CTACGATCTGATAGGATAATTGTTTCTGGAAAAGAATTGGACATTAAGCCTGGTTTTAAA
ATTTATCAAGCGATTATGGATGGTCCAGCTTTCTTTTTTAATTTTCTTCAATCAGAAAAC
ATTATATCATCAGTAGATGAAAATGATGTTATCGAACCTCAAGCGAAACGATTTCGTGCT
GATAATTTTGACAGCAACAGTAATAGCAATAGCAATGATGTAGATTTGAGGTTTTCTCAA
CCAGGAATTCCTTCACTTCTCAATATCAATGTAGGACTGCCGTTAGACAAAGATTTACAG
AAATCTCAGCAACAGCAGCAGCAGAATCAAAGTCCTTGGGAAAACAACGCCGTATTTAAT
AACGGTCAATTTAATAATAACAACGGTAATCAAAATCAAAAGACAAAAACCCGAGAAGGA
CGAAAGCAGGGAAGTCGATGGTCGAGGCGTTAG

>g10507.t1 Gene=g10507 Length=710
MEFPPPDLPPPSFQPPQFQPRPYNPNFHQQHRPYYNKYKHGPQIQDDFDGKRLRKSVMRK
TVDYNASIVRALQNRVWQRDHIDRRALQPENSYIPDLMPPMSYLDNPSNSITTRFVKTAT
NKMRCPVFTLAWTPEGRRLITGASSGEFTLWNGLTFNFETILQAHSTSVRSMVWSHNDNW
MVTGDHNGYVKYWQSNMNNVKQFQAHKEPIRGLSFCPTDAKFATCSDDGTVRIFDFLRCH
EERVMRGHGADVKTIHWHPQKSIVASGSKDNQQPIKLWDPKSGTALCTLHAHKSTVMDLK
WNDNGNWLVTASRDHLLKLFDIRNLKEEMQTFRGHKKEASSVCWHPIHEGLFVSGGSDGQ
ILFWNVGTDKEVGGIEAAHESIVWTLAWHPIGHILCSGSNDHTVKFWTRNRPGDQMRDKY
NLNTLPASLAGLEDYEMDEHIVIPGMGIAPSDDQDEDDDDEDVYAQSQMPGDLLQKDENS
VNSNSEALDNGVIPGLDLDGGKNDKKLPYSKPIPKNFQAQWNVNEKNDKPSHNPTNIIEC
IQHVVTKINERYPGLIKIDNLRSDRIIVSGKELDIKPGFKIYQAIMDGPAFFFNFLQSEN
IISSVDENDVIEPQAKRFRADNFDSNSNSNSNDVDLRFSQPGIPSLLNINVGLPLDKDLQ
KSQQQQQQNQSPWENNAVFNNGQFNNNNGNQNQKTKTREGRKQGSRWSRR

Protein features from InterProScan

Transcript Database ID Name Start End E.value
12 g10507.t1 CDD cd00200 WD40 124 407 1.26205E-64
11 g10507.t1 Gene3D G3DSA:2.130.10.10 - 102 198 1.6E-15
10 g10507.t1 Gene3D G3DSA:2.130.10.10 - 199 323 2.8E-34
9 g10507.t1 Gene3D G3DSA:2.130.10.10 - 324 425 1.1E-22
23 g10507.t1 MobiDBLite mobidb-lite consensus disorder prediction 1 24 -
24 g10507.t1 MobiDBLite mobidb-lite consensus disorder prediction 1 22 -
22 g10507.t1 MobiDBLite mobidb-lite consensus disorder prediction 470 492 -
21 g10507.t1 MobiDBLite mobidb-lite consensus disorder prediction 472 489 -
20 g10507.t1 MobiDBLite mobidb-lite consensus disorder prediction 684 710 -
7 g10507.t1 PANTHER PTHR22836 WD40 REPEAT PROTEIN 18 653 5.9E-250
6 g10507.t1 Pfam PF00400 WD domain, G-beta repeat 160 193 6.6E-4
4 g10507.t1 Pfam PF00400 WD domain, G-beta repeat 198 235 4.7E-6
3 g10507.t1 Pfam PF00400 WD domain, G-beta repeat 243 271 0.045
5 g10507.t1 Pfam PF00400 WD domain, G-beta repeat 286 321 1.2E-5
2 g10507.t1 Pfam PF00400 WD domain, G-beta repeat 328 365 0.0042
1 g10507.t1 Pfam PF00400 WD domain, G-beta repeat 378 407 1.6E-4
25 g10507.t1 ProSiteProfiles PS50294 Trp-Asp (WD) repeats circular profile. 120 417 47.457
30 g10507.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 120 152 9.038
26 g10507.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 162 194 11.411
29 g10507.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 203 235 11.01
28 g10507.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 289 330 12.413
31 g10507.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 332 374 11.544
27 g10507.t1 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 376 407 12.246
15 g10507.t1 SMART SM00320 WD40_4 112 152 0.41
19 g10507.t1 SMART SM00320 WD40_4 155 194 1.6E-5
18 g10507.t1 SMART SM00320 WD40_4 196 235 2.0E-8
17 g10507.t1 SMART SM00320 WD40_4 238 279 7.2E-6
16 g10507.t1 SMART SM00320 WD40_4 282 321 1.4E-8
14 g10507.t1 SMART SM00320 WD40_4 325 365 1.3E-7
13 g10507.t1 SMART SM00320 WD40_4 368 408 1.1E-6
8 g10507.t1 SUPERFAMILY SSF50978 WD40 repeat-like 121 410 3.36E-76

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005515 protein binding MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values