Gene loci information

Transcript annotation

  • This transcript has been annotated as Protein groucho.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g3549 g3549.t3 isoform g3549.t3 26316219 26389894
chr_3 g3549 g3549.t3 exon g3549.t3.exon1 26316219 26316326
chr_3 g3549 g3549.t3 cds g3549.t3.CDS1 26316219 26316326
chr_3 g3549 g3549.t3 exon g3549.t3.exon2 26316386 26316585
chr_3 g3549 g3549.t3 cds g3549.t3.CDS2 26316386 26316585
chr_3 g3549 g3549.t3 exon g3549.t3.exon3 26316642 26317065
chr_3 g3549 g3549.t3 cds g3549.t3.CDS3 26316642 26317065
chr_3 g3549 g3549.t3 exon g3549.t3.exon4 26317127 26317376
chr_3 g3549 g3549.t3 cds g3549.t3.CDS4 26317127 26317376
chr_3 g3549 g3549.t3 exon g3549.t3.exon5 26317450 26317703
chr_3 g3549 g3549.t3 cds g3549.t3.CDS5 26317450 26317703
chr_3 g3549 g3549.t3 exon g3549.t3.exon6 26317768 26317953
chr_3 g3549 g3549.t3 cds g3549.t3.CDS6 26317768 26317953
chr_3 g3549 g3549.t3 exon g3549.t3.exon7 26318501 26318557
chr_3 g3549 g3549.t3 cds g3549.t3.CDS7 26318501 26318557
chr_3 g3549 g3549.t3 exon g3549.t3.exon8 26319879 26319983
chr_3 g3549 g3549.t3 cds g3549.t3.CDS8 26319879 26319983
chr_3 g3549 g3549.t3 exon g3549.t3.exon9 26320038 26320241
chr_3 g3549 g3549.t3 cds g3549.t3.CDS9 26320038 26320241
chr_3 g3549 g3549.t3 exon g3549.t3.exon10 26327126 26327263
chr_3 g3549 g3549.t3 cds g3549.t3.CDS10 26327126 26327263
chr_3 g3549 g3549.t3 exon g3549.t3.exon11 26336341 26336385
chr_3 g3549 g3549.t3 cds g3549.t3.CDS11 26336341 26336385
chr_3 g3549 g3549.t3 exon g3549.t3.exon12 26344880 26344943
chr_3 g3549 g3549.t3 cds g3549.t3.CDS12 26344880 26344943
chr_3 g3549 g3549.t3 exon g3549.t3.exon13 26348308 26348402
chr_3 g3549 g3549.t3 cds g3549.t3.CDS13 26348308 26348402
chr_3 g3549 g3549.t3 exon g3549.t3.exon14 26389859 26389894
chr_3 g3549 g3549.t3 cds g3549.t3.CDS14 26389859 26389894
chr_3 g3549 g3549.t3 TSS g3549.t3 26390402 26390402
chr_3 g3549 g3549.t3 TTS g3549.t3 NA NA

Sequences

>g3549.t3 Gene=g3549 Length=2166
ATGTATCCGACACCTCAGCGTCATCCCTCGTCTGCGGGACCGCCGCCACAGGGTCCAATC
AAATTTACAATAGCAGACACATTAGAAAGGATTAAAGAAGAATTTAATTTCCTACAAGCA
CAATATCACTCACTTAAATTAGAATGTGAAAAATTAGCAAGTGAAAAAACAGAAATGCAA
AGACATTATGTAATGTATTATGAGATGTCATATGGATTAAATGTGGAAATGCACAAACAG
ACGGAGATAGCAAAAAGACTAAATGCTTTAATTGGACAGCTTTTGCCGTTTTTAGCAGCC
GAACATCAACAACAAGTGGCACAAGCTGTAGAGCGTGCGAAGCAAGTGACGATGCCGGAA
CTAAATGCAATCATTGGGCAACAAATTCATGCACAACAGATTCCAGGTGCTCCTCCACAA
CCTATGCCATCATTAGGTGCACTTGGACCATTGGGAGCATTTGCAGCAGCTGGACCAATG
GGTGGTTTGCCACATGGACCACAGAGTTTATTGAAAAATCCACAAGATCTTCATCGCGAG
GATATTAAGCCACTTGCAGGACCTTCAGCAGAAGAAAGATTGAGGAATTCAGTATCGCCA
GCTGATCGAGAAAAGTATCGTCCACGTTCTCCACTTGATATCGATTCAGAAGCAAAACGT
CGAAAAGAAGAAATTAAAGTGAATCATGAGAGCGATGGTGAAAAAAGTGATCAGGATTTA
GTTGTGGACATCGCAAATGAAACGGGTTCACCTTCACCTCGAGCAAATGGCGATCATTCA
GAATCACGTGATCGTGACAGCATTGGACTCAATGGTGATCGAAATGGAGAAAAAGCTGGC
ACAAGTAAATTAAATAATGTTAATGATCGTCCACCCTCACGTTCTGGTTCAAGTTCATCA
CGCTCAACACCAAATCTCAAAACAAAAGATCTCGATAAACCTGGAACTCCAGGAGCGAAA
GCACGTTCAACAACACCAAATTCAACGCCAGGCGTCCCACCGAAAGCTGTTATTCCATCA
TCACCTGCTGGCTATCCTTCATCTTCACCATATCAGCGACCAGCTGATCCATATCAACGA
CCACCGCCTGATCCTTATGGTCGTATACAACCACTCGGTTATGATCCACATTCACATGTT
CGAACAAATGGAATACCTAATTTACCTGGAGTTGGCGGCAAACCAGCATATTCATTTCAT
GTAAATGCTGAAAATCATTTAACACCTGTGCCCTTTCCACCTGATGCATTAAGTGGTCCT
GGAATCCCGCGACAAGCACGTCAAATCAATACATTATCTCATGGTGAAGTTGTGTGTGCA
GTGACGATATCAAATCCAACTAAATATGTTTATACTGGTGGTAAAGGATGTGTAAAAGTA
TGGGATATTTCACAACCAGGCAACAAAAGTCCCGTCAGTCAACTTGATTGTTTGCAACGA
GACAATTACATTCGTTCTGTGAAACTTCTGCCTGATGGACGTACATTAATTGTTGGTGGT
GAAGCATCAAATTTGAGTATTTGGGATTTAGCGAGTCCAACACCGCGAATTAAAGCCGAA
CTCACTTCAGCTGCACCAGCTTGTTACGCCTTAGCTATCTCACCCGATTCCAAAGTATGT
TTCTCATGCTGTAGTGATGGAAATATTGCAGTATGGGATCTTAACAATCAGACATTAGTG
CGACAATTCCAAGGGCATACAGATGGAGCATCATGCATTGACATTAGTCCTGATGGTTCA
AAATTGTGGACTGGTGGATTGGATAATACAGTAAGATCATGGGATTTAAGAGAAGGACGG
CAACTGCAACAACATGATTTTAGTTCACAAATATTTTCATTAGGCTATTGTCCTACAGGT
GATTGGTTAGCAGTTGGAATGGAAAATTCACATGTGGAAGTGCTGCATGCACAGAAAACA
GAAAAATATCAATTGCATTTGCATGAAAGTTGCGTCCTTTCTCTACGTTTTGCAACATGT
GGCAAGTGGTTTGTATCAACGGGTAAAGATAATTTATTGAATGCATGGCGAACACCTTAT
GGAGCCAGCATATTTCAGTCAAAAGAAACATCTTCGGTGCTCAGCTGTGACATTTCAACT
GACGATAAGTACATTGTTACTGGATCAGGAGACAAGAAAGCGACCGTATATGAGGTCATG
TATTAA

>g3549.t3 Gene=g3549 Length=721
MYPTPQRHPSSAGPPPQGPIKFTIADTLERIKEEFNFLQAQYHSLKLECEKLASEKTEMQ
RHYVMYYEMSYGLNVEMHKQTEIAKRLNALIGQLLPFLAAEHQQQVAQAVERAKQVTMPE
LNAIIGQQIHAQQIPGAPPQPMPSLGALGPLGAFAAAGPMGGLPHGPQSLLKNPQDLHRE
DIKPLAGPSAEERLRNSVSPADREKYRPRSPLDIDSEAKRRKEEIKVNHESDGEKSDQDL
VVDIANETGSPSPRANGDHSESRDRDSIGLNGDRNGEKAGTSKLNNVNDRPPSRSGSSSS
RSTPNLKTKDLDKPGTPGAKARSTTPNSTPGVPPKAVIPSSPAGYPSSSPYQRPADPYQR
PPPDPYGRIQPLGYDPHSHVRTNGIPNLPGVGGKPAYSFHVNAENHLTPVPFPPDALSGP
GIPRQARQINTLSHGEVVCAVTISNPTKYVYTGGKGCVKVWDISQPGNKSPVSQLDCLQR
DNYIRSVKLLPDGRTLIVGGEASNLSIWDLASPTPRIKAELTSAAPACYALAISPDSKVC
FSCCSDGNIAVWDLNNQTLVRQFQGHTDGASCIDISPDGSKLWTGGLDNTVRSWDLREGR
QLQQHDFSSQIFSLGYCPTGDWLAVGMENSHVEVLHAQKTEKYQLHLHESCVLSLRFATC
GKWFVSTGKDNLLNAWRTPYGASIFQSKETSSVLSCDISTDDKYIVTGSGDKKATVYEVM
Y

Protein features from InterProScan

Transcript Database ID Name Start End E.value
15 g3549.t3 CDD cd00200 WD40 433 718 6.85747E-44
14 g3549.t3 Coils Coil Coil 28 55 -
13 g3549.t3 Gene3D G3DSA:2.130.10.10 - 389 721 2.2E-145
26 g3549.t3 MobiDBLite mobidb-lite consensus disorder prediction 184 359 -
27 g3549.t3 MobiDBLite mobidb-lite consensus disorder prediction 196 241 -
28 g3549.t3 MobiDBLite mobidb-lite consensus disorder prediction 257 272 -
25 g3549.t3 MobiDBLite mobidb-lite consensus disorder prediction 278 305 -
5 g3549.t3 PANTHER PTHR10814:SF4 TRANSDUCIN-LIKE ENHANCER PROTEIN 2 7 721 1.8E-297
6 g3549.t3 PANTHER PTHR10814 TRANSDUCIN-LIKE ENHANCER PROTEIN 7 721 1.8E-297
9 g3549.t3 PRINTS PR01850 Groucho/transducin-like enhancer protein signature 620 642 1.6E-54
8 g3549.t3 PRINTS PR01850 Groucho/transducin-like enhancer protein signature 643 661 1.6E-54
11 g3549.t3 PRINTS PR01850 Groucho/transducin-like enhancer protein signature 662 681 1.6E-54
10 g3549.t3 PRINTS PR01850 Groucho/transducin-like enhancer protein signature 682 701 1.6E-54
7 g3549.t3 PRINTS PR01850 Groucho/transducin-like enhancer protein signature 702 721 1.6E-54
1 g3549.t3 Pfam PF03920 Groucho/TLE N-terminal Q-rich domain 7 133 1.2E-67
4 g3549.t3 Pfam PF00400 WD domain, G-beta repeat 431 462 0.095
3 g3549.t3 Pfam PF00400 WD domain, G-beta repeat 529 553 0.15
2 g3549.t3 Pfam PF00400 WD domain, G-beta repeat 559 595 0.0018
24 g3549.t3 ProSitePatterns PS00678 Trp-Asp (WD) repeats signature. 540 554 -
23 g3549.t3 ProSitePatterns PS00678 Trp-Asp (WD) repeats signature. 582 596 -
30 g3549.t3 ProSiteProfiles PS50294 Trp-Asp (WD) repeats circular profile. 431 604 26.92
31 g3549.t3 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 521 562 9.606
33 g3549.t3 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 563 604 15.588
29 g3549.t3 ProSiteProfiles PS50294 Trp-Asp (WD) repeats circular profile. 645 721 9.626
32 g3549.t3 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 645 676 8.503
21 g3549.t3 SMART SM00320 WD40_4 425 462 0.024
17 g3549.t3 SMART SM00320 WD40_4 471 509 0.14
20 g3549.t3 SMART SM00320 WD40_4 514 553 4.4E-4
16 g3549.t3 SMART SM00320 WD40_4 556 595 2.0E-9
19 g3549.t3 SMART SM00320 WD40_4 598 636 53.0
18 g3549.t3 SMART SM00320 WD40_4 638 677 0.0094
22 g3549.t3 SMART SM00320 WD40_4 678 718 0.28
12 g3549.t3 SUPERFAMILY SSF50978 WD40 repeat-like 423 718 5.04E-65

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005634 nucleus CC
GO:0005515 protein binding MF
GO:0006355 regulation of transcription, DNA-templated BP

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values