Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative Pre-mRNA cleavage complex 2 protein Pcf11.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is shown below. CDS regions are colored green. Transcription start sites (TSS) and transcription termination sites (TTS) predicted from CTR-Seq data are indicated by solid circles and squares, respectively. Exact coordinates are given in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g3854 g3854.t2 TSS g3854.t2 28371998 28371998
chr_3 g3854 g3854.t2 isoform g3854.t2 28372690 28379817
chr_3 g3854 g3854.t2 exon g3854.t2.exon1 28372690 28372854
chr_3 g3854 g3854.t2 cds g3854.t2.CDS1 28372690 28372854
chr_3 g3854 g3854.t2 exon g3854.t2.exon2 28374130 28374255
chr_3 g3854 g3854.t2 cds g3854.t2.CDS2 28374130 28374255
chr_3 g3854 g3854.t2 exon g3854.t2.exon3 28374324 28374656
chr_3 g3854 g3854.t2 cds g3854.t2.CDS3 28374324 28374656
chr_3 g3854 g3854.t2 exon g3854.t2.exon4 28374715 28374774
chr_3 g3854 g3854.t2 cds g3854.t2.CDS4 28374715 28374774
chr_3 g3854 g3854.t2 exon g3854.t2.exon5 28374843 28375524
chr_3 g3854 g3854.t2 cds g3854.t2.CDS5 28374843 28375524
chr_3 g3854 g3854.t2 exon g3854.t2.exon6 28376179 28376326
chr_3 g3854 g3854.t2 cds g3854.t2.CDS6 28376179 28376326
chr_3 g3854 g3854.t2 exon g3854.t2.exon7 28376394 28376466
chr_3 g3854 g3854.t2 cds g3854.t2.CDS7 28376394 28376466
chr_3 g3854 g3854.t2 exon g3854.t2.exon8 28376521 28376830
chr_3 g3854 g3854.t2 cds g3854.t2.CDS8 28376521 28376830
chr_3 g3854 g3854.t2 exon g3854.t2.exon9 28376886 28378409
chr_3 g3854 g3854.t2 cds g3854.t2.CDS9 28376886 28378409
chr_3 g3854 g3854.t2 exon g3854.t2.exon10 28378473 28378723
chr_3 g3854 g3854.t2 cds g3854.t2.CDS10 28378473 28378723
chr_3 g3854 g3854.t2 exon g3854.t2.exon11 28378783 28379275
chr_3 g3854 g3854.t2 cds g3854.t2.CDS11 28378783 28379275
chr_3 g3854 g3854.t2 exon g3854.t2.exon12 28379342 28379817
chr_3 g3854 g3854.t2 cds g3854.t2.CDS12 28379342 28379817
chr_3 g3854 g3854.t2 TTS g3854.t2 NA NA
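
The coordinates in the table can be cross-checked against the sequence length reported in the Sequences section. The sketch below assumes that Start and End are 1-based and inclusive (the table does not state its convention); under that assumption the twelve exons sum to 4641 nt, which matches the `Length=4641` in the nucleotide FASTA header and corresponds to 1547 codons, i.e. 1546 residues plus a stop codon. The names `EXONS`, `exon_length`, and `intron_lengths` are illustrative only.

```python
# Exon coordinates (start, end) copied from the gene structure table.
# Assumption: coordinates are 1-based and inclusive on both ends.
EXONS = [
    (28372690, 28372854),
    (28374130, 28374255),
    (28374324, 28374656),
    (28374715, 28374774),
    (28374843, 28375524),
    (28376179, 28376326),
    (28376394, 28376466),
    (28376521, 28376830),
    (28376886, 28378409),
    (28378473, 28378723),
    (28378783, 28379275),
    (28379342, 28379817),
]


def exon_length(start, end):
    """Length of an interval whose start and end are both inclusive."""
    return end - start + 1


def intron_lengths(exons):
    """Gap sizes between consecutive exons (exons must be sorted by start)."""
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(exons, exons[1:])]


total_cds = sum(exon_length(s, e) for s, e in EXONS)
codons = total_cds // 3

print(total_cds)                   # 4641
print(codons - 1)                  # 1546 residues once the stop codon is excluded
print(len(intron_lengths(EXONS)))  # 11
```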

Sequences

>g3854.t2 Gene=g3854 Length=4641
ATGGACTCTGTAAAGGAAAAGGAAATTGAAGAGGAGTATCTTTCTTCTTTAATGGATCTT
AATGTCAATAGTAAACCCCTAATCAATATGCTAACAATGTTAGCCGAGGACAATCTCGAA
AATGCAGCTATTATCGTGCGAGCGATTGAAAAACACATCTTACAGGTGTCTCCGGAAATT
AAACTCCCAATTCTCTATTTAATTGATTCCATCGTGAAAAATGTCGGTGATAAATACAAA
CAACTTTTTGCCCAAAATATTGTTAACATATTTTGTGGAGTTTTTGAGAAGGTGAATGAA
AAAGTTCGCGAAAAAATGTTCAATCTACGTCAAACTTGGAATGATGTTTTTCCTCAAACG
AAATTATATGCATTAGATGTGAAAGTTAATGTGATGGACAACAATTGGCCAATTACTGCT
AAAGTTTTGCCAAAATCAGTGCATGTGAATCCAAATTTTTTGAAGAAAACTAAAAATCCA
GAGAATCAAGCGAAAGAAGATTTGATGTTGCAGATGCAAGCAAAAGAAAGAGAACTTTTA
GAATTAAAGCAAAGAAAAATTGAGCTGGAACTTATGGTAACGAAAAAGAAAATTGCCGAA
CAGGAAAAAGAGATAAAGGGAATTGTTCCTTCTACATCAAACACAGTTAATTCGGCAATA
TCAATTGCACCAGTTTCTCATCCGATGACTATGACAACATCATCAATGATGATTCCTCAT
CAACAAGGTAGAATTCGAATTGCTCCAGTAAGTTCTACTATGTTATCATCAGCACGTCCT
CGTGATCCACGACTTTCAAAATTGCGTCCACAAATTGAAGTCCCTCAACCATCTTTGCCA
CCAACCTCAATTTTACCTGGTATTATGAAATTACCAAGGATACCTAAATATTCTAATGGA
AATAAATCATCTTCATCTACAACAACATCTTCATTTAGAGATATAGATGAACGTGATTCA
CGCAGAAGACGCGAGAAAGATCATGATGATTCGAGCTCGAAAGAAAAGTCGAGTAAATCG
AGTTCAAGTAGAAAGTCCAGTAAATCCGATTCACCACGAAAGAAGAGTGATGATGACAAG
AAATTATCATCAAAAAATAGTTCTTCACATCATCATAAATCTACGTCTTCGTCATCATCA
CATTCACGATCTCGCTCATCAACAAAAAGTCCATTAAAATCAGAGGTAAAATTAAAAGTT
GATGAGGTTGAATCACTTTTTCAATCGACTGATATGGATATGAGACCTGATAGTACTACA
GCAGCAGCAAGCAGTAAAATAAGCAAAAATCAGCTATTAGATGAACTTCTTCATGATGAA
GATATGAAGTCAAATCAAGAAATGATGATTACAACGAGTAATAACGAAGAATCCAATACA
ATTCAAGTAAATGGAAAATGTGATATAGAAGCAAAAACTGAAGAAGTTGGAAAGAAGAGA
TCAATTGGAGGAGAACAAAGTGAGAGCGAGTCAACTGAACCATCAAAAAAGAAGAATAAA
ACCGATAATGATCCATTATCATTGTTTGGTAATGAAGATGTTGATTTGAGATCAGTTCCA
CAACAACTTGAAAAGAGATCAAAATCAAAGGAGAAAGCAGATTTTGAGACTGTAAGAGCG
AAATTGAATGCATCAAAAATGAAAAGTAAAACTACAGGCGTAAAATTGTTAGAAGAAATT
CAACCAGCTCCTATCAATGAAAATGGTCCACCATCAGAATTGGTTAGGAAAATTTTACAA
AAGAAAAAGAATGATGTTAAAGAACTAGAAGAAAAAGCTTCTTCTCCACCTATTACCACT
GATCGCATTGTTAATGACATAGTCATGACTGAAAAGAAAGACGAAATAGTTGATTATGGT
AGTATGACAGCTGCAGAATTGAGAAATACTTCAGGAGTACCAGCAAGTTTACGAAAGGAA
AAAAGAAAAGAGACCAAATGGAGTCAACCAACTATGCCTTGGGGTATAGCAGCACCTGTA
TTGAGTACTAGACTGATGAATAATCCAAATATTCCTCTACCGCACCATTTTATAAACCCA
TGGGAAAATAATCCGATGATTGTTGCTATGCAAAAACAAAGTCAAGTTCCACTAGTCTTA
CCACAAAGTCAATCACAACCTATATTGAACAATAAAATGCGCACATTACGCTTAGATGGA
ACAAGGGATCATTTGCTTCGTTTTTATGGTGAAATTGCAATTATTTTCAATGAATCTGGC
GAAGCTCATGATATAAGATTTAGTTCAGGTCAGAGTAAAGTTGTTATTGATGATGTATAT
AGTCAAGTTTTGGATTTCAATGATTCTTATAAGCCTATAATAATTGATGGTATTATGCAT
AAAATCAAATTTGGTTCACCAACTCGTGAACTTTATATTGATGAAAATTTTTATGAATGT
TATTTCAATAATCAAATCACACAAATTGTTCTCGGCGATAAAGTTCGTAGAGTTCGAATT
GAAGGCAAAGCACCAGAAGTTAAAATTGGCAACAAACGTAAAGACGTTGTTTTAGGATTA
ATCAATATGATGATTGATGCTGAAATTATGGTTCCCGTCTTTCTTGATACAACCGTGCAA
TATTTTGAATATAAAGGACAAATTTTCACATTGCAATTTGCTGATTTCTTCCTAAGTGTC
ATTATAAATAATGAGCCATTCAAAGTTGAATTTGGTGGCTTACCAAAGAATTTTGTACTT
AATGGACAAAAACATTTTATTCGTTTTACTGTTTTGCCGGATGACATTGTTCCTGGTCAA
GTTAATTTGCATGGCATGAAAAGAACACATCTCTTTAGAAATTGTAAATCTCCACCGCTT
CCAATACAGTCTGATCCAATGATTAGAGATCGTGAAATTAATCAATTTGTTGATAATGAT
GCAATTAATAAGCATTTACCAATACACCACAATCCGCCAGTAACAACAGTTATTGAACCG
CCGCCACTACAACAGCAGCAGCCACAATCAAACATCACACTACCAGATTTGAATATAAGT
GAACTACTTCAACAATTAGTTGCTACTGGAATAATTGGAAGTAATACGAATGCACCTTCT
GCATCTACAGTTTCTAAAGAAGTAGAACAAGCACCAATAAAGCCAATTGAACCGCAAATT
AATCCATCAAAATCATCTGAACCAAAACTGAAAATTATTTCCGTCAATTTATCTCGACCA
GAATCGATTAAAATGAGACAACAAGCTATTATTGATACTTTATATTTGGGAATTCAATGC
AGTAGTTGTGGTCTTCGATTTCCAGTTGAGCAAACAATTAAATATAGTCAGCATTTAGAC
TGGCATTTTAGACAAAATCGTCGTGAACGAGATAGTAAGCGAAAAGCTCATTCCCGTAAA
TGGTACTACAATAGATCAGATTGGATTAAGTATGAAGAAATTGAAGATCTCGACGAAAGA
GAAAAGAACTTTTTTGAAACGCAACAAATGGAAGCGATGGATCAAGGAGAAGATTCAAAT
GGTCTTACACGTATGACAGGACAAGAAACTTATAGTTGTCCTGCAGGACCTGATGATGTT
AATAGATGTTGTGAAATGTGTAATGATCAATTCGATCAGTTCTTCAATGAAGAAACAGAA
GAATGGCATTTGCGATCAGCAATTAAAGTTGATAATAAATTTTTCCATCCAATTTGTTAT
GAAGATTATAAAGCATCTTTTACGCTTGATGAATCAGGTTTAAATGAAGCAGCAAATGAT
TCACAAACTGAAGCAAATGACAGTGAAGTAAAAGTTGAAGATGCAAATGCTGTTAAAGAT
GAGAAGTCTGTAATTAAAAATGAAACAGAAATTGTTAATACTAATACGGAAGATGACGAT
GATGTTATTGTTTTGCCGCCTGAAGAGCCAGTAATTACTGAAATTATTGATGAACAAGAT
CATGAAATGACCGAGCAAAATGAAAGTCCACAGATGCAGTCTACTATCATTGATGATGAC
ATTATGATTCAGGAACCGAAAATTGAAACACAAATTGTCAATGATGATGATGACAGCAAT
GATGCTCAATGTACATCAGAAGATACAAATACTTTACCATTTATCGTTAAAATTAAGGAA
GAACCTAAGGATGATGGATATGAAGATCAAACTGAGGAAGATCCCTTCATTGAAGTGACA
TCAATCAATGAAGAATTGATGCTTGATGATGGACATACACATTCACCTTTTCAACCGGCA
TTAGATGAGAATGCAGTTTTTGACGAATCATCGCTATTCTCGCAGAATTCTAATCATGAG
TCTTCTGTAATTAATGATGACAGTTCACGACATGAACCTTCGAGTGAACCGCTTATGGTT
GGTGGTAACAAAAATATTAAAATAGTTCTATCGTCTTTAGTCCAAAATAATCTATCAAAT
AAGGGCACAGGCTCAAATAATTTTGTAAATAATGATCAAAATGATAATAGCAATATAGAT
ACAAACAATAAATTAATAGATTCCAATAGCAATGACTTACAAAGAAGAATCAGCGATGAT
CGATCAACGCGTAATGAAGATTTACAAGAAAATACTGAATTGCCTTACATTGTTAAAGAG
TCTCTTCAGGGTTTTAATTTTGAAAAAACTGTGACAGTGAAACGTGGAATCGAGAACTCT
GGTTTATGCTCTATCATGTAG

>g3854.t2 Gene=g3854 Length=1546
MDSVKEKEIEEEYLSSLMDLNVNSKPLINMLTMLAEDNLENAAIIVRAIEKHILQVSPEI
KLPILYLIDSIVKNVGDKYKQLFAQNIVNIFCGVFEKVNEKVREKMFNLRQTWNDVFPQT
KLYALDVKVNVMDNNWPITAKVLPKSVHVNPNFLKKTKNPENQAKEDLMLQMQAKERELL
ELKQRKIELELMVTKKKIAEQEKEIKGIVPSTSNTVNSAISIAPVSHPMTMTTSSMMIPH
QQGRIRIAPVSSTMLSSARPRDPRLSKLRPQIEVPQPSLPPTSILPGIMKLPRIPKYSNG
NKSSSSTTTSSFRDIDERDSRRRREKDHDDSSSKEKSSKSSSSRKSSKSDSPRKKSDDDK
KLSSKNSSSHHHKSTSSSSSHSRSRSSTKSPLKSEVKLKVDEVESLFQSTDMDMRPDSTT
AAASSKISKNQLLDELLHDEDMKSNQEMMITTSNNEESNTIQVNGKCDIEAKTEEVGKKR
SIGGEQSESESTEPSKKKNKTDNDPLSLFGNEDVDLRSVPQQLEKRSKSKEKADFETVRA
KLNASKMKSKTTGVKLLEEIQPAPINENGPPSELVRKILQKKKNDVKELEEKASSPPITT
DRIVNDIVMTEKKDEIVDYGSMTAAELRNTSGVPASLRKEKRKETKWSQPTMPWGIAAPV
LSTRLMNNPNIPLPHHFINPWENNPMIVAMQKQSQVPLVLPQSQSQPILNNKMRTLRLDG
TRDHLLRFYGEIAIIFNESGEAHDIRFSSGQSKVVIDDVYSQVLDFNDSYKPIIIDGIMH
KIKFGSPTRELYIDENFYECYFNNQITQIVLGDKVRRVRIEGKAPEVKIGNKRKDVVLGL
INMMIDAEIMVPVFLDTTVQYFEYKGQIFTLQFADFFLSVIINNEPFKVEFGGLPKNFVL
NGQKHFIRFTVLPDDIVPGQVNLHGMKRTHLFRNCKSPPLPIQSDPMIRDREINQFVDND
AINKHLPIHHNPPVTTVIEPPPLQQQQPQSNITLPDLNISELLQQLVATGIIGSNTNAPS
ASTVSKEVEQAPIKPIEPQINPSKSSEPKLKIISVNLSRPESIKMRQQAIIDTLYLGIQC
SSCGLRFPVEQTIKYSQHLDWHFRQNRRERDSKRKAHSRKWYYNRSDWIKYEEIEDLDER
EKNFFETQQMEAMDQGEDSNGLTRMTGQETYSCPAGPDDVNRCCEMCNDQFDQFFNEETE
EWHLRSAIKVDNKFFHPICYEDYKASFTLDESGLNEAANDSQTEANDSEVKVEDANAVKD
EKSVIKNETEIVNTNTEDDDDVIVLPPEEPVITEIIDEQDHEMTEQNESPQMQSTIIDDD
IMIQEPKIETQIVNDDDDSNDAQCTSEDTNTLPFIVKIKEEPKDDGYEDQTEEDPFIEVT
SINEELMLDDGHTHSPFQPALDENAVFDESSLFSQNSNHESSVINDDSSRHEPSSEPLMV
GGNKNIKIVLSSLVQNNLSNKGTGSNNFVNNDQNDNSNIDTNNKLIDSNSNDLQRRISDD
RSTRNEDLQENTELPYIVKESLQGFNFEKTVTVKRGIENSGLCSIM

Protein features from InterProScan

Transcript Database ID Name Start End E.value
8 g3854.t2 CDD cd16982 CID_Pcf11 11 136 1.92543E-57
7 g3854.t2 Coils Coil Coil 165 204 -
6 g3854.t2 Coils Coil Coil 572 592 -
5 g3854.t2 Gene3D G3DSA:1.25.40.90 - 2 130 1.2E-43
17 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 253 432 -
11 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 297 311 -
18 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 312 365 -
15 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 369 387 -
20 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 388 408 -
21 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 411 431 -
16 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 452 534 -
19 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 452 466 -
14 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 468 503 -
10 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 518 534 -
13 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 1414 1437 -
22 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 1482 1508 -
12 g3854.t2 MobiDBLite mobidb-lite consensus disorder prediction 1493 1508 -
2 g3854.t2 PANTHER PTHR15921:SF3 PRE-MRNA CLEAVAGE COMPLEX 2 PROTEIN PCF11 8 1259 2.9E-148
3 g3854.t2 PANTHER PTHR15921 PRE-MRNA CLEAVAGE COMPLEX II 8 1259 2.9E-148
1 g3854.t2 Pfam PF04818 CID domain 20 121 1.2E-14
24 g3854.t2 ProSiteProfiles PS51391 CID domain profile. 5 133 37.607
23 g3854.t2 ProSiteProfiles PS50179 VHS domain profile. 28 117 12.103
9 g3854.t2 SMART SM00582 558neu5 8 130 1.1E-30
4 g3854.t2 SUPERFAMILY SSF48464 ENTH/VHS domain 8 139 6.98E-42

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
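
As a minimal sketch of how the 0.5 threshold can be applied, the function below scans a list of per-residue scores and reports contiguous runs above the cutoff. The function name, the `min_len` parameter, and the example scores are hypothetical; they are not taken from this page or from the IUPred3 software.

```python
def disordered_regions(scores, threshold=0.5, min_len=1):
    """Return 1-based inclusive (start, end) runs where score > threshold.

    `scores` is a list of per-residue disorder scores; runs shorter than
    `min_len` residues are dropped.
    """
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos          # a new run begins here
        elif start is not None:
            if pos - start >= min_len:
                regions.append((start, pos - 1))
            start = None             # the run ended at the previous residue
    # Close a run that extends to the last residue.
    if start is not None and len(scores) - start + 1 >= min_len:
        regions.append((start, len(scores)))
    return regions


# Made-up scores for illustration only.
example_scores = [0.2, 0.6, 0.7, 0.4, 0.55, 0.9, 0.8]
print(disordered_regions(example_scores))  # [(2, 3), (5, 7)]
```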

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0006886 intracellular protein transport BP

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean ± standard deviation.
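
For illustration, a summary of this kind can be computed from replicate TPM values as sketched below. The replicate values are invented, and the use of the sample standard deviation (n - 1 denominator) is an assumption; the page does not say which estimator was used.

```python
import statistics


def summarize_tpm(replicates):
    """Return (mean, standard deviation) for a list of replicate TPM values."""
    mean = statistics.mean(replicates)
    # Assumption: sample standard deviation (n - 1 in the denominator).
    sd = statistics.stdev(replicates)
    return mean, sd


def format_tpm(replicates, digits=2):
    """Format replicate TPM values as 'mean +/- SD'."""
    mean, sd = summarize_tpm(replicates)
    return f"{mean:.{digits}f} +/- {sd:.{digits}f}"


# Hypothetical replicate TPM values for one condition.
print(format_tpm([10.0, 12.0, 14.0]))  # 12.00 +/- 2.00
```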

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was considered differentially expressed when (1) FDR < 0.05 and (2) fold change > 2, with TPM values calculated by RSEM. DE status and fold change between conditions are shown in the plot below.
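
The two criteria can be expressed as a simple filter. This is a sketch rather than the pipeline's actual code: it assumes the fold-change criterion is symmetric (more than 2-fold up or down, i.e. |log2 fold change| > 1), which the text does not state explicitly, and the function and parameter names are hypothetical.

```python
import math


def is_differentially_expressed(fdr, fold_change, fdr_cutoff=0.05, fc_cutoff=2.0):
    """Apply the FDR and fold-change thresholds to one transcript.

    `fold_change` is a linear ratio between two conditions and must be > 0.
    Assumption: changes in either direction count, so the test is done on
    the absolute log2 fold change.
    """
    if fold_change <= 0:
        raise ValueError("fold_change must be a positive ratio")
    passes_fdr = fdr < fdr_cutoff
    passes_fc = abs(math.log2(fold_change)) > math.log2(fc_cutoff)
    return passes_fdr and passes_fc


print(is_differentially_expressed(0.01, 4.0))   # True: significant, 4-fold up
print(is_differentially_expressed(0.01, 0.25))  # True: significant, 4-fold down
print(is_differentially_expressed(0.01, 1.5))   # False: change too small
print(is_differentially_expressed(0.20, 8.0))   # False: FDR too high
```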

Raw TPM values