Gene loci information

Transcript annotation

  • This transcript has been annotated as Myosin heavy chain 95F.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g3978 g3978.t1 isoform g3978.t1 29348497 29355636
chr_3 g3978 g3978.t1 exon g3978.t1.exon1 29348497 29348683
chr_3 g3978 g3978.t1 cds g3978.t1.CDS1 29348497 29348683
chr_3 g3978 g3978.t1 exon g3978.t1.exon2 29348892 29348965
chr_3 g3978 g3978.t1 cds g3978.t1.CDS2 29348892 29348965
chr_3 g3978 g3978.t1 exon g3978.t1.exon3 29349033 29349225
chr_3 g3978 g3978.t1 cds g3978.t1.CDS3 29349033 29349225
chr_3 g3978 g3978.t1 exon g3978.t1.exon4 29351500 29352455
chr_3 g3978 g3978.t1 cds g3978.t1.CDS4 29351500 29352455
chr_3 g3978 g3978.t1 exon g3978.t1.exon5 29352534 29353524
chr_3 g3978 g3978.t1 cds g3978.t1.CDS5 29352534 29353524
chr_3 g3978 g3978.t1 exon g3978.t1.exon6 29353591 29353832
chr_3 g3978 g3978.t1 cds g3978.t1.CDS6 29353591 29353832
chr_3 g3978 g3978.t1 exon g3978.t1.exon7 29353891 29354115
chr_3 g3978 g3978.t1 cds g3978.t1.CDS7 29353891 29354115
chr_3 g3978 g3978.t1 exon g3978.t1.exon8 29354185 29354411
chr_3 g3978 g3978.t1 cds g3978.t1.CDS8 29354185 29354411
chr_3 g3978 g3978.t1 exon g3978.t1.exon9 29354603 29354647
chr_3 g3978 g3978.t1 cds g3978.t1.CDS9 29354603 29354647
chr_3 g3978 g3978.t1 exon g3978.t1.exon10 29354888 29355126
chr_3 g3978 g3978.t1 cds g3978.t1.CDS10 29354888 29355126
chr_3 g3978 g3978.t1 exon g3978.t1.exon11 29355192 29355518
chr_3 g3978 g3978.t1 cds g3978.t1.CDS11 29355192 29355518
chr_3 g3978 g3978.t1 exon g3978.t1.exon12 29355602 29355636
chr_3 g3978 g3978.t1 cds g3978.t1.CDS12 29355602 29355636
chr_3 g3978 g3978.t1 TSS g3978.t1 NA NA
chr_3 g3978 g3978.t1 TTS g3978.t1 NA NA

Sequences

>g3978.t1 Gene=g3978 Length=3741
ATGCTCGATCTTGCCGACTTAGTGTGGGCACGCGATCCGAACGAAGGATATATTCAAGGC
AAATTATCTGAACTTGGTGCTCATGAATATGAAATTATTCCTACGGAAAAAGGTTACCAA
AAGAGAAGCTGTAATATTGATGATATTTTTCCATCGTGTGAGAATAAATCTGATCATGAT
GACAATTGTGAACTTATGTTTTTAAATGAAGCAACTCTCTTAGACAATATTAAGAACAGA
TATTATAAAGATAAAATCTATACATATGTTGCAAACATTTTGATTGCTGTAAATCCATAT
AAAGAAATCAAAGATCTTTATTCAAAAGCAACTATTAAGAGCTACAATGGAAAATCAATT
GGAGAAATGCCACCCCATGTATACGCAATTGCAGATAAAGCAATTAGAGACATGAGAGTA
TTAAAAGTATCTCAAAGTATTATTGTATCTGGAGAGAGCGGCGCAGGGAAGACCGAGTCT
ACTAAATACTTGCTACGTTATTTGTGTGACTCGGTTGAAGCAGCAGGACAGATTGAACAA
AAAATTCTTGATTCAAATCCAATATTGGAAGCATTTGGAAATGCAAAAACTATAAGAAAC
AATAATTCTTCGAGATTTGGGAAATTCATAGAAGTTCATTATGATACAAAGTATCAAGTC
GTTGGCGGTTTTATTTCGCACTATTTGCTTGAGAAAAGTAGAATTTGTACACAAAGTTTA
GAGGAGAGAAATTATCATATTTTTTACCTTTTGTGTGCTGGAGCACCTCAAAGTTTGCGT
GATAAATTTCACATTGGTAAACCTGATGATTACCGTTACTTAGCTGGATGCACACAATAT
TTTGCATCATCAATAACTGACAAACAAATTCAAAACACTCAAAAGTCAAAGGATCATATC
CAAAAAGGCTCTTTGAAAGATCCTATTCTTGACGATTTTATTGATTTTAAAGAATTAGAT
CAAGCTTTATCAAGAATAGGCTTGAATGAATCTATGAGAACTGAAATTTATGGAATTGTG
GCTGCAGTGCTACATTTAGGAAATATTGCATTCGAAGAAAATCCAGAAGATACGCGTGGC
GGATGTCGAATAATGCAAGAGGCTGAAATATCTCTGGAGATTGCATCAAAACTTATTGGA
TGTGATTCTTTTGAGTTGCGTCAAGCTCTAACATCTCGTGTGATGCAATCAAAGGGCGGT
GGAGTTAAAGGAACAATCATTATGGTTCCCTTGAAAATTCAAGAAGCAAAAAATGCTCGT
GATGCACTCGCAAAAGCTCTTTATAGTCGTCTCTTTGACTTTGTTGTATCTCTCATAAAT
CAATCAATTCCATTTCAAAAGTCAAGTTATTATATTGGGGTACTTGATATAGCAGGGTTT
GAGTACTTTCCCCAAAATTCATTGGAACAATTTTGTATTAACTATTGTAATGAAAAGCTT
CAGAAATTTTTCAATGATAATATTCTCGCTAATGAGCAAGATTTGTATAAGCGTGAAGGG
CTCAATGTTCCCGAAATCAAGTTTACTGATAATCAAGATATTATTGAGCTCATTGAATCT
AAAGCAAACGGAATCTTCACATTACTCGATGAAGAATCAAAACTTCCTAAACCATCATTT
GTTCATTTCACTTCTGAAGTGCATGCAGCCTGGAATGGTCATTTCCGAATTTCTTTACCA
AGAGCATCGCGATTAAAAATTCATAGAAGTTTGAGAGATGATGAAGGTTTTCTTATTAGA
CATTTTGCTGGTGCTGTATGCTATTCTACAAATCAATTTATTGAGAAAAATAATGATGCT
CTTCATGCATCATTAGAGTGTCTCATTCAAGAGTCTGGAAATTCAATGATGAAAAAGTTG
TTTTCATCAAATAATAATTCTGTGACAAAAGGAAAATTGTCTTTTATTTCTGTCGGATCA
AAATTCAAAACTCAACTTCAAGAACTTATGGATAAACTTGAAAAAAATGGAACAAATTTC
ATTCGATGTATCAAACCAAACAGTAGAATGACTGACCATGAATTCGAAGGTGGTTTGGCA
CTGAATCAATTAATTTGCAGTGGAACAGCATCTGTTTTAGAGCTTATGTCATTCGGTTTT
CCATCAAGAGTGCCTTTTGCAGAATTATACAATATGTACAAGTCCTACTTGCCCGCAGAC
TTGATAAAATTAGCACCAAGAACTTTCTGTGAAGCTATGTTGCATTCACTTAATCTTAAT
AGCAAAGATTTTAAATTTGGCATCACTAAGGTATTCTTTAGACCTGGAACATATGTTCAA
TTTGATCGAATTATGAAAAGTGATCCGGAAAATCTCAAAGTTATTGTTCAAAATGTTAAA
AAATGGCTCGTTCGATCACGCTGGAAGAAGTCAGTTCATTGTGCAATATGCGTCATTAAA
ATTAAAAATCGTATGATATATAAAAACAAGTGTGTTGTGAAAGCACAAAAGATTATTCGT
GGTTGGCTTGCGCGTCGGCAGCATCGTCCACGATATAAAGGTATTATTAAAGTTAATGCA
TTGAGAAAAAATTTAAAAGAAATGGAGACAATTGTAAATCAATTGAAAAATGAGCGTGAT
ACAATGTTGAAACAGCTAAAAGATATTGATCAGCAAATCGATACTGCTATTAAAAAAATA
AAGTCAAATGAAAAAATTAAATCACAAGAAATTGATGGATACTACTTTATGCTAAATGGA
AAAGCTAATCAGCAAATGAACACGATTAAAATAAAACTTCAAGAGCAGAAAAATGCAGAG
GAGCAAGAGCGTCTTAGACAAATTCAATTGAAAATGGAAGCTGAGCGAAAAGCTAAAGAG
GAAGAAGAGAGAAGAATACGGGAAGAAGAAGAAAATCGTAGAAAGAAGGCCGAAATGGAA
TTACGTAGAAAACAAGAAGAAGCCGAACGATTACGTCTCGAGGAAGAAGACAGAAGAGCG
GCTTTGCTACTTCAAGCGCAATTGGAAAAAGAGGCACAAGAAGATAGCAAGTATCGCCAA
CAACTAGAGCAAGAGAGGCGTGACCATGAGTTAGCATTGCGTTTAGCGCAAGAATCAAAT
GGTCAAATTGATGAAAGTCCAACAATGAGTAGAAACGGAACTCCCGACATGGCACTAAGC
AATCATAATCGACTCATCAGATCTGAAGCTATAAGAGCTCAGCAACAAATAATTGGTAAG
CAAAAGTATGATTTGTCAAAATGGAAGTATTCAGAGCTTCGTGATGCTATCAACACAAGC
TGTGATATTGAGTTACTTGAGGCATGTCGTCATGAATTTCATCGTCGTCTTAAAGTGTAT
CATGCATGGAAAGCAAAGAATCGCAAACGCACTACAATGGATGAAAATGAGCGCGCTCCA
AGAAGTGTAATGGAAAATGCCGGAAAAATTCCATTGCGTACTCAACAAAAATCTTCTTCT
GATCCTAATTCCGTTTCAATTCATCGCTACTTTCGCATTCCATTTATGCGTCCTAACGGA
AATACAAATGATAATACTAATCGAGGTTGGTGGTACGCACATTTTGATGGTTCGTATGTT
GCACGGCAAATGGAATTACATGCAGAAAAACCTCCAATTTTATTGATTGCCGGTGTGGAT
GATATGCAAATGTGTGAATTAAGTTTGGAAGAGACAGGACTAACTCGAAAAAGAGGTGCA
GAAATATTAGAACATGAATTTAATCGTGAATGGGAACGTCATGGAGGTAAACCCTATAAG
GTGCAAAATGGAAAGTCTTAA

>g3978.t1 Gene=g3978 Length=1246
MLDLADLVWARDPNEGYIQGKLSELGAHEYEIIPTEKGYQKRSCNIDDIFPSCENKSDHD
DNCELMFLNEATLLDNIKNRYYKDKIYTYVANILIAVNPYKEIKDLYSKATIKSYNGKSI
GEMPPHVYAIADKAIRDMRVLKVSQSIIVSGESGAGKTESTKYLLRYLCDSVEAAGQIEQ
KILDSNPILEAFGNAKTIRNNNSSRFGKFIEVHYDTKYQVVGGFISHYLLEKSRICTQSL
EERNYHIFYLLCAGAPQSLRDKFHIGKPDDYRYLAGCTQYFASSITDKQIQNTQKSKDHI
QKGSLKDPILDDFIDFKELDQALSRIGLNESMRTEIYGIVAAVLHLGNIAFEENPEDTRG
GCRIMQEAEISLEIASKLIGCDSFELRQALTSRVMQSKGGGVKGTIIMVPLKIQEAKNAR
DALAKALYSRLFDFVVSLINQSIPFQKSSYYIGVLDIAGFEYFPQNSLEQFCINYCNEKL
QKFFNDNILANEQDLYKREGLNVPEIKFTDNQDIIELIESKANGIFTLLDEESKLPKPSF
VHFTSEVHAAWNGHFRISLPRASRLKIHRSLRDDEGFLIRHFAGAVCYSTNQFIEKNNDA
LHASLECLIQESGNSMMKKLFSSNNNSVTKGKLSFISVGSKFKTQLQELMDKLEKNGTNF
IRCIKPNSRMTDHEFEGGLALNQLICSGTASVLELMSFGFPSRVPFAELYNMYKSYLPAD
LIKLAPRTFCEAMLHSLNLNSKDFKFGITKVFFRPGTYVQFDRIMKSDPENLKVIVQNVK
KWLVRSRWKKSVHCAICVIKIKNRMIYKNKCVVKAQKIIRGWLARRQHRPRYKGIIKVNA
LRKNLKEMETIVNQLKNERDTMLKQLKDIDQQIDTAIKKIKSNEKIKSQEIDGYYFMLNG
KANQQMNTIKIKLQEQKNAEEQERLRQIQLKMEAERKAKEEEERRIREEEENRRKKAEME
LRRKQEEAERLRLEEEDRRAALLLQAQLEKEAQEDSKYRQQLEQERRDHELALRLAQESN
GQIDESPTMSRNGTPDMALSNHNRLIRSEAIRAQQQIIGKQKYDLSKWKYSELRDAINTS
CDIELLEACRHEFHRRLKVYHAWKAKNRKRTTMDENERAPRSVMENAGKIPLRTQQKSSS
DPNSVSIHRYFRIPFMRPNGNTNDNTNRGWWYAHFDGSYVARQMELHAEKPPILLIAGVD
DMQMCELSLEETGLTRKRGAEILEHEFNREWERHGGKPYKVQNGKS

Protein features from InterProScan

Transcript Database ID Name Start End E.value
20 g3978.t1 CDD cd01382 MYSc_Myo6 71 754 0.0
18 g3978.t1 Coils Coil Coil 838 872 -
17 g3978.t1 Coils Coil Coil 899 981 -
19 g3978.t1 Coils Coil Coil 985 1019 -
14 g3978.t1 Gene3D G3DSA:3.40.850.10 Kinesin 61 691 3.9E-248
13 g3978.t1 Gene3D G3DSA:1.10.10.820 - 243 326 3.9E-248
12 g3978.t1 Gene3D G3DSA:1.20.120.720 - 327 632 3.9E-248
16 g3978.t1 Gene3D G3DSA:1.20.58.530 - 463 656 3.9E-248
15 g3978.t1 Gene3D G3DSA:3.30.70.1590 - 700 765 3.0E-27
11 g3978.t1 Gene3D G3DSA:1.10.3060.20 - 766 913 5.1E-46
22 g3978.t1 MobiDBLite mobidb-lite consensus disorder prediction 934 970 -
3 g3978.t1 PANTHER PTHR13140:SF794 MYOSIN VIA 8 1239 0.0
4 g3978.t1 PANTHER PTHR13140 MYOSIN 8 1239 0.0
5 g3978.t1 PRINTS PR00193 Myosin heavy chain signature 87 106 2.9E-55
9 g3978.t1 PRINTS PR00193 Myosin heavy chain signature 144 169 2.9E-55
6 g3978.t1 PRINTS PR00193 Myosin heavy chain signature 187 214 2.9E-55
7 g3978.t1 PRINTS PR00193 Myosin heavy chain signature 451 479 2.9E-55
8 g3978.t1 PRINTS PR00193 Myosin heavy chain signature 504 532 2.9E-55
1 g3978.t1 Pfam PF00063 Myosin head (motor domain) 59 754 2.4E-242
2 g3978.t1 Pfam PF16521 Myosin VI cargo binding domain 1148 1236 7.1E-44
23 g3978.t1 ProSiteProfiles PS51456 Myosin motor domain profile. 57 766 225.515
24 g3978.t1 ProSiteProfiles PS50096 IQ motif profile. 808 837 8.133
21 g3978.t1 SMART SM00242 MYSc_2a 52 767 0.0
10 g3978.t1 SUPERFAMILY SSF52540 P-loop containing nucleoside triphosphate hydrolases 56 831 4.47E-233

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

## Warning: Removed 1 row(s) containing missing values (geom_path).

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005524 ATP binding MF
GO:0005515 protein binding MF
GO:0016459 myosin complex CC
GO:0003774 cytoskeletal motor activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values