Gene loci information

Transcript annotation

  • This transcript has been annotated as E3 ubiquitin-protein ligase Su(dx).

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g6094 g6094.t1 TTS g6094.t1 14087703 14087703
chr_2 g6094 g6094.t1 isoform g6094.t1 14088596 14091621
chr_2 g6094 g6094.t1 exon g6094.t1.exon1 14088596 14088681
chr_2 g6094 g6094.t1 cds g6094.t1.CDS1 14088596 14088681
chr_2 g6094 g6094.t1 exon g6094.t1.exon2 14088743 14088829
chr_2 g6094 g6094.t1 cds g6094.t1.CDS2 14088743 14088829
chr_2 g6094 g6094.t1 exon g6094.t1.exon3 14088902 14088998
chr_2 g6094 g6094.t1 cds g6094.t1.CDS3 14088902 14088998
chr_2 g6094 g6094.t1 exon g6094.t1.exon4 14089057 14089558
chr_2 g6094 g6094.t1 cds g6094.t1.CDS4 14089057 14089558
chr_2 g6094 g6094.t1 exon g6094.t1.exon5 14089634 14090171
chr_2 g6094 g6094.t1 cds g6094.t1.CDS5 14089634 14090171
chr_2 g6094 g6094.t1 exon g6094.t1.exon6 14090229 14091621
chr_2 g6094 g6094.t1 cds g6094.t1.CDS6 14090229 14091621
chr_2 g6094 g6094.t1 TSS g6094.t1 NA NA

Sequences

>g6094.t1 Gene=g6094 Length=2703
ATGGCAGATTCAAATTCGTCAAATAGAACTCCTGCAATTAATCAAGGCTTTCATCAATTA
CAAATAACAATAGAGCATGCATCATTACGTAATTCATCGTTTTTAAAGCCAAATCCATAT
GTAGAACTTTCGGTCGATTCAAATCCTGCTCGCAAAACAGATTATCAAAAGACGACATAT
CTGCCTAAATGGAATGACACATTTACTGTGTTGGTCACACCAACATCAATTTTGCAATTC
CGAGTATATGACCATTCAAATTTTCGCAAAGATTCATTAATTGGTGAACAAACAGTGCAT
TTAGTTCAAATATTACAACATTATAATGGTCGTTGTGAGAATTTAGAATTGGCAATGGAC
CTTCTTTATGAATCAAAATCAGAAGGAAGACAAATAAAAAATGGTGAGCTAGTTGCAGTA
CTAAATGGTTTAAAAATCGATATGGTACAGAATAATGGTAATGATACTCAAGCTCGATCT
GTAACAGATCTCTCGTTTCAAAATGGTGCAATGTTGAGAAGTTCGATACTGAATGGCGGT
ATACGATCACGCATGAGGCTAAGAAGTAGTAATAACAATGGTTCAATACCGACACAACAA
CAACAAAATGGAAATGTGATACCTGCTGGTCAGAATGGTGTAAATAATAGTAGTGGATTA
GTTCAAAGTCCATCAACAGGTGCTGTAAGAAGATCTAGTGGAATGAATCAATGGGATCAA
CCACCATCACAACCGATGAATAACTATCAACGACTTACACCAGCCTTTTCAAATCCTCCT
ATAATGAATCCACAACAAGTTCCTCAACCTCCACCTCAATTGCAAGCTTCTCCTTCGCCA
AATTCTAATAACGCTGTAGTTGCACCGACAGCAGCAGCAGCAGGTGTTGTAGTACCTGCA
CAAAATGGTGAAGTAATGCGAGCTGCTGATGATGATGATGAAGAATTACCTTCAGGTTGG
GAAAGACGAACTGATCCATTTGGAAGAAGATATTATGTAGATCATAATACACGATCAACT
TATTGGGAAAAACCATCGCCTTTACCACCTGGTTGGGAGATTCGTAGAGACCCGAGAGGT
CGAGTTTATTATGTTGATCATAATACAAGAACAACAACATGGCAACGTCCAAATAGTGAG
CGATTAATGCATTTTCAACATTGGCAGGGTCAACGAGCACATGTAATGACACAGGGAAAC
CAAAGATTTCTCTATCCACAACATGCTCAGCAATCAAATACAACTCAAGCTCCAGTGCAA
GATGATGAAGATGGCCTTGGTCCATTGCCTGATGGCTGGGAAAAACGAGTGCAACCTGAT
AATCGAGTTTATTTTGTTAATCATAAAAATAGAACGACACAATGGGAGGATCCAAGAACA
CAGGGACAAGAAGTAAGTTTAATGGGAGCAGAAGGACCTTTACCAAGCGGATGGGAAATA
CGATACACAGCAACTGGTGAAAGATTTTTCGTTGATCATAATACGAGAAGAACAACATTT
GAAGATCCTCGACCAGGAGCTCAAAAAGGTGCAAAAGGAATGTATGGTGTACCTCGTGCA
TATGAAAGATCCTTCCGTTGGAAATTAAGTCAGTTCCGTTATTTATGTCAATCAAATGCT
TTACCGTCACATATAAAGATTACTCTCACGAGACAAACACTTTTCGAAGATTCTTATCAT
CAAATTATGCGATTACCTGCTTATGAACTACGGCGAAGGCTTTACATTATATTTCGAGGA
GAAGAAGGGCTCGATTATGGTGGTGTATCACGTGAATGGTTTTTCTTGTTATCACATGAA
GTTCTTAATCCTATGTACTGCCTATTCGAGTACGCTAACAAATCAAATTATTCGTTACAA
ATTAATCCAGCCTCTTATGTCAATCCAGATCATCTGCAATATTTTAAATTTATAGGACGT
TTCTTAGCTATGGCATTATATCACGGAAGATTTATTTATAGTGGTTTTACGATGCCCTTT
TACAAGCGAATGCTTAATAAAAAGTTGACGACCAAAGATATAGAATCGATAGATCCAGAA
TTTTATAATTCGCTTATATGGGTGCGAGATAATAATATTGATGAATGCGGCTTAGAGCTT
TGGTTTTCAGTAGATTTCGAGGTACTAGGACAAATCATACATCATGAGCTAAAAGCGAAT
GGTGATAAAGAGCGTGTAACTGAAGAAAATAAGGAAGAATATATAACATTAATGACTGAA
TGGAGAATGACAAGAGGTATTGAGGAACAAACTAAGACATTCCTCGATGGTTTTAATGAA
GTCGTGCCGTTGGAATGGTTGAAATATTTTGATGAAAGAGAATTAGAATTAATGCTTTGT
GGAATGCAAGAAATTGATGTCGATGATTGGCAACGGAATTCAATATATCGTCATTACAAT
AGAAATAGTAAACAAGTTGTGTGGTTCTGGCAGTTTGTTCGAGAGACTGATAACGAAAAG
CGTGCACGCTTACTTCAATTTGTCTGTGGTACTTGTCGAGTTCCCGTTGGAGGGTTTGCA
GAATTAATGGGTTCGAATGGGCCTCAAAGATTTTGTATTGAAAAAGTTGGAAAAGACACA
TGGCTGCCTCGTTCTCATACTTGTTTTAATCGATTAGACTTACCACCATATAAAAGTTAT
GATCAACTTGTTGAAAAATTAAATTACGCAATAGAAGAGACAGAAGGATTTGGCCAAGAA
TGA

>g6094.t1 Gene=g6094 Length=900
MADSNSSNRTPAINQGFHQLQITIEHASLRNSSFLKPNPYVELSVDSNPARKTDYQKTTY
LPKWNDTFTVLVTPTSILQFRVYDHSNFRKDSLIGEQTVHLVQILQHYNGRCENLELAMD
LLYESKSEGRQIKNGELVAVLNGLKIDMVQNNGNDTQARSVTDLSFQNGAMLRSSILNGG
IRSRMRLRSSNNNGSIPTQQQQNGNVIPAGQNGVNNSSGLVQSPSTGAVRRSSGMNQWDQ
PPSQPMNNYQRLTPAFSNPPIMNPQQVPQPPPQLQASPSPNSNNAVVAPTAAAAGVVVPA
QNGEVMRAADDDDEELPSGWERRTDPFGRRYYVDHNTRSTYWEKPSPLPPGWEIRRDPRG
RVYYVDHNTRTTTWQRPNSERLMHFQHWQGQRAHVMTQGNQRFLYPQHAQQSNTTQAPVQ
DDEDGLGPLPDGWEKRVQPDNRVYFVNHKNRTTQWEDPRTQGQEVSLMGAEGPLPSGWEI
RYTATGERFFVDHNTRRTTFEDPRPGAQKGAKGMYGVPRAYERSFRWKLSQFRYLCQSNA
LPSHIKITLTRQTLFEDSYHQIMRLPAYELRRRLYIIFRGEEGLDYGGVSREWFFLLSHE
VLNPMYCLFEYANKSNYSLQINPASYVNPDHLQYFKFIGRFLAMALYHGRFIYSGFTMPF
YKRMLNKKLTTKDIESIDPEFYNSLIWVRDNNIDECGLELWFSVDFEVLGQIIHHELKAN
GDKERVTEENKEEYITLMTEWRMTRGIEEQTKTFLDGFNEVVPLEWLKYFDERELELMLC
GMQEIDVDDWQRNSIYRHYNRNSKQVVWFWQFVRETDNEKRARLLQFVCGTCRVPVGGFA
ELMGSNGPQRFCIEKVGKDTWLPRSHTCFNRLDLPPYKSYDQLVEKLNYAIEETEGFGQE

Protein features from InterProScan

Transcript Database ID Name Start End E.value
24 g6094.t1 CDD cd04021 C2_E3_ubiquitin_ligase 17 142 4.49136E-51
25 g6094.t1 CDD cd00201 WW 317 347 3.35148E-10
27 g6094.t1 CDD cd00201 WW 349 377 7.1322E-12
26 g6094.t1 CDD cd00201 WW 430 459 6.87965E-11
28 g6094.t1 CDD cd00201 WW 475 505 8.12794E-9
29 g6094.t1 CDD cd00078 HECTc 545 898 1.89595E-163
18 g6094.t1 Gene3D G3DSA:2.60.40.150 - 16 156 1.8E-27
20 g6094.t1 Gene3D G3DSA:2.20.70.10 - 308 346 2.2E-13
21 g6094.t1 Gene3D G3DSA:2.20.70.10 - 347 383 6.7E-17
19 g6094.t1 Gene3D G3DSA:2.20.70.10 - 396 464 1.1E-18
15 g6094.t1 Gene3D G3DSA:3.90.1750.10 Hect 473 779 2.0E-136
16 g6094.t1 Gene3D G3DSA:3.30.2160.10 Hect 668 746 2.0E-136
17 g6094.t1 Gene3D G3DSA:3.30.2410.10 Hect 781 900 1.4E-47
39 g6094.t1 MobiDBLite mobidb-lite consensus disorder prediction 221 243 -
7 g6094.t1 PANTHER PTHR11254:SF396 NEDD4-LIKE E3 UBIQUITIN-PROTEIN LIGASE WWP2 5 900 0.0
8 g6094.t1 PANTHER PTHR11254 HECT DOMAIN UBIQUITIN-PROTEIN LIGASE 5 900 0.0
22 g6094.t1 PIRSF PIRSF001569 E3_ub_ligase_SMURF1 3 347 1.5E-37
23 g6094.t1 PIRSF PIRSF001569 E3_ub_ligase_SMURF1 346 900 3.8E-263
1 g6094.t1 Pfam PF00168 C2 domain 19 108 2.4E-15
2 g6094.t1 Pfam PF00397 WW domain 316 345 4.3E-10
5 g6094.t1 Pfam PF00397 WW domain 348 377 4.2E-13
3 g6094.t1 Pfam PF00397 WW domain 429 458 3.6E-12
4 g6094.t1 Pfam PF00397 WW domain 474 503 6.4E-8
6 g6094.t1 Pfam PF00632 HECT-domain (ubiquitin-transferase) 596 898 9.5E-95
37 g6094.t1 ProSitePatterns PS01159 WW/rsp5/WWP domain signature. 352 377 -
36 g6094.t1 ProSitePatterns PS01159 WW/rsp5/WWP domain signature. 433 458 -
38 g6094.t1 ProSitePatterns PS01159 WW/rsp5/WWP domain signature. 478 503 -
40 g6094.t1 ProSiteProfiles PS50004 C2 domain profile. 19 98 11.996
42 g6094.t1 ProSiteProfiles PS50020 WW/rsp5/WWP domain profile. 314 347 17.574
41 g6094.t1 ProSiteProfiles PS50020 WW/rsp5/WWP domain profile. 346 379 18.776
43 g6094.t1 ProSiteProfiles PS50020 WW/rsp5/WWP domain profile. 427 460 18.283
44 g6094.t1 ProSiteProfiles PS50020 WW/rsp5/WWP domain profile. 472 505 15.447
45 g6094.t1 ProSiteProfiles PS50237 HECT domain profile. 566 900 103.969
30 g6094.t1 SMART SM00239 C2_3c 19 113 4.4E-9
32 g6094.t1 SMART SM00456 ww_5 315 347 8.8E-11
33 g6094.t1 SMART SM00456 ww_5 348 379 1.2E-11
35 g6094.t1 SMART SM00456 ww_5 428 460 3.3E-13
34 g6094.t1 SMART SM00456 ww_5 473 505 2.9E-11
31 g6094.t1 SMART SM00119 hect_3 564 900 8.9E-172
10 g6094.t1 SUPERFAMILY SSF49562 C2 domain (Calcium/lipid-binding domain, CaLB) 16 130 2.53E-17
13 g6094.t1 SUPERFAMILY SSF51045 WW domain 309 346 8.31E-12
11 g6094.t1 SUPERFAMILY SSF51045 WW domain 346 379 4.3E-13
14 g6094.t1 SUPERFAMILY SSF51045 WW domain 423 461 8.73E-14
12 g6094.t1 SUPERFAMILY SSF51045 WW domain 468 505 3.86E-11
9 g6094.t1 SUPERFAMILY SSF56204 Hect, E3 ligase catalytic domain 526 894 1.23E-131

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0061630 ubiquitin protein ligase activity MF
GO:0006511 ubiquitin-dependent protein catabolic process BP
GO:0005515 protein binding MF
GO:0004842 ubiquitin-protein transferase activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values