Gene loci information

Transcript annotation

  • This transcript has been annotated as WD repeat-containing protein 75.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is shown below. CDS regions are colored green. TSSs and TTSs predicted from CTR-Seq data are indicated by solid circles and squares, respectively. Coordinates are listed in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g5194 g5194.t1 isoform g5194.t1 7607958 7611055
chr_2 g5194 g5194.t1 exon g5194.t1.exon1 7607958 7609452
chr_2 g5194 g5194.t1 cds g5194.t1.CDS1 7607958 7609452
chr_2 g5194 g5194.t1 exon g5194.t1.exon2 7609511 7610511
chr_2 g5194 g5194.t1 cds g5194.t1.CDS2 7609511 7610511
chr_2 g5194 g5194.t1 exon g5194.t1.exon3 7610566 7610773
chr_2 g5194 g5194.t1 cds g5194.t1.CDS3 7610566 7610773
chr_2 g5194 g5194.t1 exon g5194.t1.exon4 7610862 7611055
chr_2 g5194 g5194.t1 cds g5194.t1.CDS4 7610862 7611055
chr_2 g5194 g5194.t1 TSS g5194.t1 NA NA
chr_2 g5194 g5194.t1 TTS g5194.t1 NA NA
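The coordinates in the table can be checked against the sequence length reported in the FASTA header. The sketch below assumes the Start and End columns are 1-based and inclusive (GFF-style); the page does not state the convention, and the function names are illustrative only.

```python
# Exon coordinates for g5194.t1, copied from the table above.
# ASSUMPTION: coordinates are 1-based and inclusive (GFF-style).
exons = [
    (7607958, 7609452),
    (7609511, 7610511),
    (7610566, 7610773),
    (7610862, 7611055),
]

def exon_lengths(exons):
    """Length of each exon under inclusive coordinates."""
    return [end - start + 1 for start, end in exons]

def intron_lengths(exons):
    """Length of the gap between each pair of consecutive exons."""
    return [
        next_start - prev_end - 1
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]

lengths = exon_lengths(exons)
introns = intron_lengths(exons)
print(lengths, sum(lengths))  # [1495, 1001, 208, 194] 2898
print(introns)                # [58, 54, 88]
```

Under this assumption the exon lengths sum to 2898, which matches the Length=2898 given for the nucleotide sequence and is consistent with every exon being entirely coding.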

Sequences

>g5194.t1 Gene=g5194 Length=2898
ATGTGTTTAGACATTGATTATCATTACAACAATCAAAATAGCATAATAAATACGAATTCT
TTGGAAAATCACAATCAAAATATTGACAACAATTCAATCACGATGGAATCAACAACAGAT
TTGAAATATAAAGTGAAAAGTATGTGCGGTGGAAGTGTCATTGAGCATAAACCATTATTT
GATTCAAGTGGAGAAAATATTTATGTAGTGCGAAAAGACAAATTAAGAATCTATAGTGTT
GAAACGGGTGAAATTGTTACAGAATTAGATGACAATAAAGATGGCCAAATAATTGGAATT
TATTTAGAAGATAATCAAACAAATCAATGTATAATAACTTGCACAACAAATGGAACGATT
GCCTTTCGAAAGTTGAAATCGAATGTTATTACAGAAAAGAAGAAATTGAATTTTAAGTTC
TCTATATTAAACAAATTTTTGGTGACAACAATCGATGGCAAATTTCATGGTCTTATTCAC
TATAATGATGAGAAAAATTTTTCACAACTTACACTGATTGAGCTGAGCACTAATAAAATC
GTTCATCGCTTTGATACACCGTTTTTAACTAATCAAGCAGATGTGAAAATGAAGTTTGCA
GATGGTAATGGCATAATTGCAATCATATCAAAAACACATCTCTTTATTATTAATAAAGAG
ACATTAGAAATAATTATGCATAGTGCTCCAAAACTATTGAGTGTTGTTGTCTGTCATCCA
GAACAAAAGATTATTGCAACTGGCGATGTTTATGGAAAAATATATTTATGGAGTAATGTT
TTTAATAAAATGCCAGTAAAAACTGATTTGCATTGGCATCACATGGTTGTGCTAAGTCTT
GCATTTTCTCAATCAGGAACAGTGCTTTATTCTGGTGGTGCTGAATGCGTTTTAGTGAAA
TGGCATATTAGAGAAACAACTTTAGGAAAGAACTTTTTACCTCGCGTTTCTGGTGGAATA
AAACAAATCAGTGTTGATACATTACATGATAAAATTGCAATTTGCATGGATGATAATTCT
ATTCAGATTATAAATTCAAATTTAACTCAATTGAAAACTATTCAAGATTTCACTCAAATT
TCACCATATGACTTGGGATTAAATCAACCATTTCCAGCTGGTATTTGCATCAATCCAAAA
AATAATCATTTAGTCATGAATGGTAAAATTGGTCATTTGCAATTCTTCTCAACAAAAACT
ATGAGACTTTTATTCAATATCGATATCACGCTGCAAAACGTCATTCCGCGTCAAAGAGAA
TTTAATCAATTTTCAACAGAAGTCACTAAAGTAGCTTTCTCAGCATGTGGAATGTGGATG
GCAACTTATGAATGCTGGAATGATAGGATTCATTCACTTGATTCTCGTATAAAGTTTTGG
AGCTTTGATAACATTAAACAAACATATTCACTTCACACTCAAATTGAATATCCTCATGAG
AAAAAAGTTGTAGCAATGCAATTTGCAAATATAGAGAAATCAATAATTTGTGCTAGTGCT
GGTTTAGATCGAGTTGTAAAAATTTGGTCACTTGAAGCATCAGAAGAAATTCAAAATCCA
AAAATGATTTGGATGTTAATTGAACAGATAAATTACAAAAACCTTCCTGTAAAGTGTCTA
AGTTTTTCACAAGATTCATCGTTATTATCAGCAGGATTTGGAAATAGCTTGTGTGTATGG
GACACAACAAATTTTAAATTAAAATGTGCTCTTAGTGCGCCTGCAATAATGGATGGATCT
GTTAATCGTGTTTTGATAACTTTACCAGAGGAAAATTCAAAGAAAAATCTAAACGGAAAT
AAAACAAATTTCATCGAGAAACGACATAAAATTTTGCAAATGATGAATGCAATTATCAAT
GATCCATCTCAAGTGCTTGTTAATAATCTTACACAAGAAAGGAGTAGAATTTTTAAGAGA
AAATTTGACGAAGGAGTAAAGTCACAAGAATTGAAATGCAACGAAAAAAAACTTATTTTT
GATAAAGTAATAGCAACGACTGATTTGAATTTCAATGAAAAACTTCAAATTCTTCACAAA
TTGAACATTTATTACCATATTAGTAATCGTGTAGAAAACGATTTTATAGATTTTATTTCA
AAGAATACATATGATGATATTCATCTTTATAAAGGTCTTCAACAAGGTGTCCTAGAAATC
AAAAACGATGACAAATATAAAATTCTTTGGCGTTTTAAAACTTGGAGAATGCGAGATGTT
AAAAGAAATAGAAAAATCATAACTGTTCGAAAATTACTCAAAAAGCCAATTCGTGAAGAA
GCTTTAAAATTAAAGAAAGCTGAGCCAGGACAAAAGGCGTTACCAATCAAAAATATAAGC
AACATTACGAGCACATTTTTCTGCACAAATGATTTGTCGCATCTATCAGTCGTTACCACA
TCGACTAGACTTTTAATATGGGATTTGCTTACACTTAAAATTCAATCATCTTTCAAAATT
CAGTGTTTGAAATTGGCTCATGATCCTTTGACAAACTTAATTGCAATATTTACAAAACAC
AATGAACTTTTTATCATTCATCCATTACCTGCAATAACTATTTTCCATCAAAAGAATTTG
CCAAATATTCTTGCATTAATTTGGGTACCTCGAGAAATTCCAAAAATGCAATCGTTGAGT
GTCAATTGGCAAGCAACTTCACAACTTTTGTTTTTAAATGAAAACCAAGAAATTTGCACA
TTAAGTTCTCCAACTGATGAAGATGAAACAATTGATTTAACTCCTTACATGAATGAAACA
AACGAAGTCACAGCATCTACAACACCTTTTGCAGCTCTTATATCGAAAAGGCAAAATTAT
AAAGATTCAAATCAAATCACTGTTAGACACATGCTTTCGAATGATTCTGGAAGCATCAAA
GAGGTAAGCAAATATTAA

>g5194.t1 Gene=g5194 Length=965
MCLDIDYHYNNQNSIINTNSLENHNQNIDNNSITMESTTDLKYKVKSMCGGSVIEHKPLF
DSSGENIYVVRKDKLRIYSVETGEIVTELDDNKDGQIIGIYLEDNQTNQCIITCTTNGTI
AFRKLKSNVITEKKKLNFKFSILNKFLVTTIDGKFHGLIHYNDEKNFSQLTLIELSTNKI
VHRFDTPFLTNQADVKMKFADGNGIIAIISKTHLFIINKETLEIIMHSAPKLLSVVVCHP
EQKIIATGDVYGKIYLWSNVFNKMPVKTDLHWHHMVVLSLAFSQSGTVLYSGGAECVLVK
WHIRETTLGKNFLPRVSGGIKQISVDTLHDKIAICMDDNSIQIINSNLTQLKTIQDFTQI
SPYDLGLNQPFPAGICINPKNNHLVMNGKIGHLQFFSTKTMRLLFNIDITLQNVIPRQRE
FNQFSTEVTKVAFSACGMWMATYECWNDRIHSLDSRIKFWSFDNIKQTYSLHTQIEYPHE
KKVVAMQFANIEKSIICASAGLDRVVKIWSLEASEEIQNPKMIWMLIEQINYKNLPVKCL
SFSQDSSLLSAGFGNSLCVWDTTNFKLKCALSAPAIMDGSVNRVLITLPEENSKKNLNGN
KTNFIEKRHKILQMMNAIINDPSQVLVNNLTQERSRIFKRKFDEGVKSQELKCNEKKLIF
DKVIATTDLNFNEKLQILHKLNIYYHISNRVENDFIDFISKNTYDDIHLYKGLQQGVLEI
KNDDKYKILWRFKTWRMRDVKRNRKIITVRKLLKKPIREEALKLKKAEPGQKALPIKNIS
NITSTFFCTNDLSHLSVVTTSTRLLIWDLLTLKIQSSFKIQCLKLAHDPLTNLIAIFTKH
NELFIIHPLPAITIFHQKNLPNILALIWVPREIPKMQSLSVNWQATSQLLFLNENQEICT
LSSPTDEDETIDLTPYMNETNEVTASTTPFAALISKRQNYKDSNQITVRHMLSNDSGSIK
EVSKY
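The two records above should be related by translation: 2898 nt corresponds to 965 codons plus one stop codon. A minimal sketch of that check follows, assuming the standard genetic code (the page does not say which table was used); only the first and last lines of the nucleotide record are translated here, and `translate` is an illustrative helper.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
# ASSUMPTION: the standard code applies to this transcript.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First and last lines of the nucleotide record above.
first_line = "ATGTGTTTAGACATTGATTATCATTACAACAATCAAAATAGCATAATAAATACGAATTCT"
last_line = "GAGGTAAGCAAATATTAA"

print(translate(first_line))  # MCLDIDYHYNNQNSIINTNS
print(translate(last_line))   # EVSKY*
print(2898 == 3 * (965 + 1))  # True
```

The translated fragments match the start and end of the protein record, and the terminal TAA accounts for the three nucleotides not represented in the 965-residue protein.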

Protein features from InterProScan

Row Transcript Database ID Name Start End E-value
11 g5194.t1 Gene3D G3DSA:2.130.10.10 - 422 598 0.000
2 g5194.t1 PANTHER PTHR44215 WD REPEAT-CONTAINING PROTEIN 75 48 942 0.000
1 g5194.t1 Pfam PF00400 WD domain, G-beta repeat 537 561 0.220
12 g5194.t1 ProSiteProfiles PS50294 Trp-Asp (WD) repeats circular profile. 226 311 9.230
13 g5194.t1 ProSiteProfiles PS50294 Trp-Asp (WD) repeats circular profile. 452 570 10.153
7 g5194.t1 SMART SM00320 WD40_4 218 258 12.000
6 g5194.t1 SMART SM00320 WD40_4 263 302 0.990
8 g5194.t1 SMART SM00320 WD40_4 306 345 390.000
10 g5194.t1 SMART SM00320 WD40_4 468 510 1.000
9 g5194.t1 SMART SM00320 WD40_4 523 561 1.100
5 g5194.t1 SMART SM00320 WD40_4 772 808 350.000
3 g5194.t1 SUPERFAMILY SSF50978 WD40 repeat-like 60 307 0.000
4 g5194.t1 SUPERFAMILY SSF50978 WD40 repeat-like 235 567 0.000

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.

Data is missing for g5194/g5194.t1; file /home/yuki.yoshida/nias/analysis/reanalysis/18_revice/midgebase/iupred3/g5194.t1.fa.iupred3.txt does not exist
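Since no scores are available for this transcript, the following sketch only illustrates how the 0.5 threshold could be applied to a per-residue score list. The scores are invented for illustration, and the function name and the `min_length` parameter are hypothetical, not part of IUPred3.

```python
def disordered_regions(scores, threshold=0.5, min_length=1):
    """Return 1-based inclusive (start, end) runs where score > threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # open a new run
        elif start is not None:
            if pos - start >= min_length:
                regions.append((start, pos - 1))  # close the current run
            start = None
    # Close a run that extends to the last residue.
    if start is not None and len(scores) - start + 1 >= min_length:
        regions.append((start, len(scores)))
    return regions

# HYPOTHETICAL per-residue scores, not real data for g5194.t1.
example_scores = [0.1, 0.6, 0.7, 0.4, 0.5, 0.9, 0.8, 0.95]
print(disordered_regions(example_scores))  # [(2, 3), (6, 8)]
```

A score of exactly 0.5 is treated as ordered here, following the "over 0.5" wording.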

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005515 protein binding MF

KEGG

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean ± standard deviation (STDEV).
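As an illustration of this summary, the sketch below computes the average and standard deviation across replicates for each condition. The condition names and TPM values are hypothetical, and the use of the sample standard deviation (n - 1 denominator) is an assumption; the page does not say which estimator was used.

```python
from statistics import mean, stdev

def summarize_tpm(replicates):
    """Return (average, sample standard deviation) for replicate TPM values."""
    return mean(replicates), stdev(replicates)

# HYPOTHETICAL replicate TPM values, not real data for g5194.t1.
hypothetical_tpm = {
    "control": [10.0, 12.0, 14.0],
    "treated": [30.0, 33.0, 36.0],
}

for condition, values in hypothetical_tpm.items():
    avg, sd = summarize_tpm(values)
    print(f"{condition}: {avg:.2f} +/- {sd:.2f}")
# control: 12.00 +/- 2.00
# treated: 33.00 +/- 3.00
```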

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were considered differentially expressed when (1) FDR < 0.05 and (2) fold change > 2 (TPM calculated by RSEM). DE status and fold changes between conditions are shown in the plot below.
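The two criteria can be written as a small filter. This is a sketch rather than the pipeline's own code: the function name is hypothetical, and it assumes the fold-change cutoff is applied in both directions (more than 2-fold up or down), which the text does not state explicitly.

```python
import math

def is_differentially_expressed(fdr, fold_change, fdr_cutoff=0.05, fc_cutoff=2.0):
    """Apply the FDR and fold-change criteria to one transcript.

    ASSUMPTION: the fold-change cutoff is symmetric, so a ratio below
    1 / fc_cutoff also counts as a change of more than fc_cutoff.
    """
    if fold_change <= 0:
        return False  # a ratio must be positive to take its log
    return fdr < fdr_cutoff and abs(math.log2(fold_change)) > math.log2(fc_cutoff)

print(is_differentially_expressed(0.01, 4.0))   # True  (4-fold up)
print(is_differentially_expressed(0.01, 0.25))  # True  (4-fold down)
print(is_differentially_expressed(0.20, 8.0))   # False (FDR too high)
print(is_differentially_expressed(0.01, 1.5))   # False (change too small)
```

Both comparisons are strict, so a transcript with a fold change of exactly 2 or an FDR of exactly 0.05 is not called differentially expressed.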

Raw TPM values