Gene loci information

Transcript annotation

  • This transcript has been annotated as Protein groucho.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g3549 g3549.t2 isoform g3549.t2 26316219 26389894
chr_3 g3549 g3549.t2 exon g3549.t2.exon1 26316219 26316326
chr_3 g3549 g3549.t2 cds g3549.t2.CDS1 26316219 26316326
chr_3 g3549 g3549.t2 exon g3549.t2.exon2 26316386 26316585
chr_3 g3549 g3549.t2 cds g3549.t2.CDS2 26316386 26316585
chr_3 g3549 g3549.t2 exon g3549.t2.exon3 26316642 26317065
chr_3 g3549 g3549.t2 cds g3549.t2.CDS3 26316642 26317065
chr_3 g3549 g3549.t2 exon g3549.t2.exon4 26317127 26317376
chr_3 g3549 g3549.t2 cds g3549.t2.CDS4 26317127 26317376
chr_3 g3549 g3549.t2 exon g3549.t2.exon5 26317450 26317703
chr_3 g3549 g3549.t2 cds g3549.t2.CDS5 26317450 26317703
chr_3 g3549 g3549.t2 exon g3549.t2.exon6 26317768 26317953
chr_3 g3549 g3549.t2 cds g3549.t2.CDS6 26317768 26317953
chr_3 g3549 g3549.t2 exon g3549.t2.exon7 26319879 26319983
chr_3 g3549 g3549.t2 cds g3549.t2.CDS7 26319879 26319983
chr_3 g3549 g3549.t2 exon g3549.t2.exon8 26320038 26320241
chr_3 g3549 g3549.t2 cds g3549.t2.CDS8 26320038 26320241
chr_3 g3549 g3549.t2 exon g3549.t2.exon9 26326793 26326825
chr_3 g3549 g3549.t2 cds g3549.t2.CDS9 26326793 26326825
chr_3 g3549 g3549.t2 exon g3549.t2.exon10 26327126 26327263
chr_3 g3549 g3549.t2 cds g3549.t2.CDS10 26327126 26327263
chr_3 g3549 g3549.t2 exon g3549.t2.exon11 26336341 26336385
chr_3 g3549 g3549.t2 cds g3549.t2.CDS11 26336341 26336385
chr_3 g3549 g3549.t2 exon g3549.t2.exon12 26344880 26344943
chr_3 g3549 g3549.t2 cds g3549.t2.CDS12 26344880 26344943
chr_3 g3549 g3549.t2 exon g3549.t2.exon13 26348308 26348402
chr_3 g3549 g3549.t2 cds g3549.t2.CDS13 26348308 26348402
chr_3 g3549 g3549.t2 exon g3549.t2.exon14 26389859 26389894
chr_3 g3549 g3549.t2 cds g3549.t2.CDS14 26389859 26389894
chr_3 g3549 g3549.t2 TSS g3549.t2 26390402 26390402
chr_3 g3549 g3549.t2 TTS g3549.t2 NA NA

Sequences

>g3549.t2 Gene=g3549 Length=2142
ATGTATCCGACACCTCAGCGTCATCCCTCGTCTGCGGGACCGCCGCCACAGGGTCCAATC
AAATTTACAATAGCAGACACATTAGAAAGGATTAAAGAAGAATTTAATTTCCTACAAGCA
CAATATCACTCACTTAAATTAGAATGTGAAAAATTAGCAAGTGAAAAAACAGAAATGCAA
AGACATTATGTAATGTATTATGAGATGTCATATGGATTAAATGTGGAAATGCACAAACAG
ACGGAGATAGCAAAAAGACTAAATGCTTTAATTGGACAGCTTTTGCCGTTTTTAGCAGCC
GAACATCAACAACAAGTGGCACAAGCTGTAGAGCGTGCGAAGCAAGTGACGATGCCGGAA
CTAAATGCAATCATTGGGCAGCATCAACAACAGGGAATTCAACAACTCCTTCAACAAATT
CATGCACAACAGATTCCAGGTGCTCCTCCACAACCTATGCCATCATTAGGTGCACTTGGA
CCATTGGGAGCATTTGCAGCAGCTGGACCAATGGGTGGTTTGCCACATGGACCACAGAGT
TTATTGAAAAATCCACAAGATCTTCATCGCGAGGATATTAAGCCACTTGCAGGACCTTCA
GCAGAAGAAAGATTGAGGAATTCAGTATCGCCAGCTGATCGAGAAAAGTATCGTCCACGT
TCTCCACTTGATATCGATTCAGAAGCAAAACGTCGAAAAGAAGAAATTAAAGTGAATCAT
GGTTCACCTTCACCTCGAGCAAATGGCGATCATTCAGAATCACGTGATCGTGACAGCATT
GGACTCAATGGTGATCGAAATGGAGAAAAAGCTGGCACAAGTAAATTAAATAATGTTAAT
GATCGTCCACCCTCACGTTCTGGTTCAAGTTCATCACGCTCAACACCAAATCTCAAAACA
AAAGATCTCGATAAACCTGGAACTCCAGGAGCGAAAGCACGTTCAACAACACCAAATTCA
ACGCCAGGCGTCCCACCGAAAGCTGTTATTCCATCATCACCTGCTGGCTATCCTTCATCT
TCACCATATCAGCGACCAGCTGATCCATATCAACGACCACCGCCTGATCCTTATGGTCGT
ATACAACCACTCGGTTATGATCCACATTCACATGTTCGAACAAATGGAATACCTAATTTA
CCTGGAGTTGGCGGCAAACCAGCATATTCATTTCATGTAAATGCTGAAAATCATTTAACA
CCTGTGCCCTTTCCACCTGATGCATTAAGTGGTCCTGGAATCCCGCGACAAGCACGTCAA
ATCAATACATTATCTCATGGTGAAGTTGTGTGTGCAGTGACGATATCAAATCCAACTAAA
TATGTTTATACTGGTGGTAAAGGATGTGTAAAAGTATGGGATATTTCACAACCAGGCAAC
AAAAGTCCCGTCAGTCAACTTGATTGTTTGCAACGAGACAATTACATTCGTTCTGTGAAA
CTTCTGCCTGATGGACGTACATTAATTGTTGGTGGTGAAGCATCAAATTTGAGTATTTGG
GATTTAGCGAGTCCAACACCGCGAATTAAAGCCGAACTCACTTCAGCTGCACCAGCTTGT
TACGCCTTAGCTATCTCACCCGATTCCAAAGTATGTTTCTCATGCTGTAGTGATGGAAAT
ATTGCAGTATGGGATCTTAACAATCAGACATTAGTGCGACAATTCCAAGGGCATACAGAT
GGAGCATCATGCATTGACATTAGTCCTGATGGTTCAAAATTGTGGACTGGTGGATTGGAT
AATACAGTAAGATCATGGGATTTAAGAGAAGGACGGCAACTGCAACAACATGATTTTAGT
TCACAAATATTTTCATTAGGCTATTGTCCTACAGGTGATTGGTTAGCAGTTGGAATGGAA
AATTCACATGTGGAAGTGCTGCATGCACAGAAAACAGAAAAATATCAATTGCATTTGCAT
GAAAGTTGCGTCCTTTCTCTACGTTTTGCAACATGTGGCAAGTGGTTTGTATCAACGGGT
AAAGATAATTTATTGAATGCATGGCGAACACCTTATGGAGCCAGCATATTTCAGTCAAAA
GAAACATCTTCGGTGCTCAGCTGTGACATTTCAACTGACGATAAGTACATTGTTACTGGA
TCAGGAGACAAGAAAGCGACCGTATATGAGGTCATGTATTAA

>g3549.t2 Gene=g3549 Length=713
MYPTPQRHPSSAGPPPQGPIKFTIADTLERIKEEFNFLQAQYHSLKLECEKLASEKTEMQ
RHYVMYYEMSYGLNVEMHKQTEIAKRLNALIGQLLPFLAAEHQQQVAQAVERAKQVTMPE
LNAIIGQHQQQGIQQLLQQIHAQQIPGAPPQPMPSLGALGPLGAFAAAGPMGGLPHGPQS
LLKNPQDLHREDIKPLAGPSAEERLRNSVSPADREKYRPRSPLDIDSEAKRRKEEIKVNH
GSPSPRANGDHSESRDRDSIGLNGDRNGEKAGTSKLNNVNDRPPSRSGSSSSRSTPNLKT
KDLDKPGTPGAKARSTTPNSTPGVPPKAVIPSSPAGYPSSSPYQRPADPYQRPPPDPYGR
IQPLGYDPHSHVRTNGIPNLPGVGGKPAYSFHVNAENHLTPVPFPPDALSGPGIPRQARQ
INTLSHGEVVCAVTISNPTKYVYTGGKGCVKVWDISQPGNKSPVSQLDCLQRDNYIRSVK
LLPDGRTLIVGGEASNLSIWDLASPTPRIKAELTSAAPACYALAISPDSKVCFSCCSDGN
IAVWDLNNQTLVRQFQGHTDGASCIDISPDGSKLWTGGLDNTVRSWDLREGRQLQQHDFS
SQIFSLGYCPTGDWLAVGMENSHVEVLHAQKTEKYQLHLHESCVLSLRFATCGKWFVSTG
KDNLLNAWRTPYGASIFQSKETSSVLSCDISTDDKYIVTGSGDKKATVYEVMY

Protein features from InterProScan

Transcript Database ID Name Start End E.value
17 g3549.t2 CDD cd00200 WD40 425 710 6.45891E-44
16 g3549.t2 Coils Coil Coil 28 55 -
15 g3549.t2 Gene3D G3DSA:2.130.10.10 - 381 713 2.1E-145
30 g3549.t2 MobiDBLite mobidb-lite consensus disorder prediction 195 351 -
29 g3549.t2 MobiDBLite mobidb-lite consensus disorder prediction 207 240 -
28 g3549.t2 MobiDBLite mobidb-lite consensus disorder prediction 249 264 -
27 g3549.t2 MobiDBLite mobidb-lite consensus disorder prediction 270 297 -
6 g3549.t2 PANTHER PTHR10814:SF4 TRANSDUCIN-LIKE ENHANCER PROTEIN 2 7 324 2.4E-291
8 g3549.t2 PANTHER PTHR10814 TRANSDUCIN-LIKE ENHANCER PROTEIN 7 324 2.4E-291
5 g3549.t2 PANTHER PTHR10814:SF4 TRANSDUCIN-LIKE ENHANCER PROTEIN 2 358 713 2.4E-291
7 g3549.t2 PANTHER PTHR10814 TRANSDUCIN-LIKE ENHANCER PROTEIN 358 713 2.4E-291
11 g3549.t2 PRINTS PR01850 Groucho/transducin-like enhancer protein signature 612 634 1.6E-54
10 g3549.t2 PRINTS PR01850 Groucho/transducin-like enhancer protein signature 635 653 1.6E-54
9 g3549.t2 PRINTS PR01850 Groucho/transducin-like enhancer protein signature 654 673 1.6E-54
12 g3549.t2 PRINTS PR01850 Groucho/transducin-like enhancer protein signature 674 693 1.6E-54
13 g3549.t2 PRINTS PR01850 Groucho/transducin-like enhancer protein signature 694 713 1.6E-54
1 g3549.t2 Pfam PF03920 Groucho/TLE N-terminal Q-rich domain 7 134 5.5E-68
3 g3549.t2 Pfam PF00400 WD domain, G-beta repeat 423 454 0.094
4 g3549.t2 Pfam PF00400 WD domain, G-beta repeat 521 545 0.15
2 g3549.t2 Pfam PF00400 WD domain, G-beta repeat 551 587 0.0018
26 g3549.t2 ProSitePatterns PS00678 Trp-Asp (WD) repeats signature. 532 546 -
25 g3549.t2 ProSitePatterns PS00678 Trp-Asp (WD) repeats signature. 574 588 -
31 g3549.t2 ProSiteProfiles PS50294 Trp-Asp (WD) repeats circular profile. 423 596 26.92
34 g3549.t2 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 513 554 9.606
35 g3549.t2 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 555 596 15.588
32 g3549.t2 ProSiteProfiles PS50294 Trp-Asp (WD) repeats circular profile. 637 713 9.626
33 g3549.t2 ProSiteProfiles PS50082 Trp-Asp (WD) repeats profile. 637 668 8.503
19 g3549.t2 SMART SM00320 WD40_4 417 454 0.024
18 g3549.t2 SMART SM00320 WD40_4 463 501 0.14
24 g3549.t2 SMART SM00320 WD40_4 506 545 4.4E-4
21 g3549.t2 SMART SM00320 WD40_4 548 587 2.0E-9
23 g3549.t2 SMART SM00320 WD40_4 590 628 53.0
20 g3549.t2 SMART SM00320 WD40_4 630 669 0.0094
22 g3549.t2 SMART SM00320 WD40_4 670 710 0.28
14 g3549.t2 SUPERFAMILY SSF50978 WD40 repeat-like 415 710 4.81E-65

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005634 nucleus CC
GO:0005515 protein binding MF
GO:0006355 regulation of transcription, DNA-templated BP

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values