Gene loci information

Transcript annotation

  • This transcript has been annotated as Unconventional myosin-Ib.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g1950 g1950.t2 TTS g1950.t2 14022700 14022700
chr_3 g1950 g1950.t2 isoform g1950.t2 14023516 14030145
chr_3 g1950 g1950.t2 exon g1950.t2.exon1 14023516 14023642
chr_3 g1950 g1950.t2 cds g1950.t2.CDS1 14023516 14023642
chr_3 g1950 g1950.t2 exon g1950.t2.exon2 14023706 14023833
chr_3 g1950 g1950.t2 cds g1950.t2.CDS2 14023706 14023833
chr_3 g1950 g1950.t2 exon g1950.t2.exon3 14024062 14024214
chr_3 g1950 g1950.t2 cds g1950.t2.CDS3 14024062 14024214
chr_3 g1950 g1950.t2 exon g1950.t2.exon4 14024270 14024405
chr_3 g1950 g1950.t2 cds g1950.t2.CDS4 14024270 14024405
chr_3 g1950 g1950.t2 exon g1950.t2.exon5 14024463 14024569
chr_3 g1950 g1950.t2 cds g1950.t2.CDS5 14024463 14024569
chr_3 g1950 g1950.t2 exon g1950.t2.exon6 14024647 14024871
chr_3 g1950 g1950.t2 cds g1950.t2.CDS6 14024647 14024871
chr_3 g1950 g1950.t2 exon g1950.t2.exon7 14024998 14025054
chr_3 g1950 g1950.t2 cds g1950.t2.CDS7 14024998 14025054
chr_3 g1950 g1950.t2 exon g1950.t2.exon8 14025148 14025517
chr_3 g1950 g1950.t2 cds g1950.t2.CDS8 14025148 14025517
chr_3 g1950 g1950.t2 exon g1950.t2.exon9 14025608 14025794
chr_3 g1950 g1950.t2 cds g1950.t2.CDS9 14025608 14025794
chr_3 g1950 g1950.t2 exon g1950.t2.exon10 14025861 14025997
chr_3 g1950 g1950.t2 cds g1950.t2.CDS10 14025861 14025997
chr_3 g1950 g1950.t2 exon g1950.t2.exon11 14026058 14026194
chr_3 g1950 g1950.t2 cds g1950.t2.CDS11 14026058 14026194
chr_3 g1950 g1950.t2 exon g1950.t2.exon12 14026264 14026500
chr_3 g1950 g1950.t2 cds g1950.t2.CDS12 14026264 14026500
chr_3 g1950 g1950.t2 exon g1950.t2.exon13 14026552 14026757
chr_3 g1950 g1950.t2 cds g1950.t2.CDS13 14026552 14026757
chr_3 g1950 g1950.t2 exon g1950.t2.exon14 14026814 14027056
chr_3 g1950 g1950.t2 cds g1950.t2.CDS14 14026814 14027056
chr_3 g1950 g1950.t2 exon g1950.t2.exon15 14027212 14027296
chr_3 g1950 g1950.t2 cds g1950.t2.CDS15 14027212 14027296
chr_3 g1950 g1950.t2 exon g1950.t2.exon16 14027523 14027536
chr_3 g1950 g1950.t2 cds g1950.t2.CDS16 14027523 14027536
chr_3 g1950 g1950.t2 exon g1950.t2.exon17 14027635 14027698
chr_3 g1950 g1950.t2 cds g1950.t2.CDS17 14027635 14027698
chr_3 g1950 g1950.t2 exon g1950.t2.exon18 14027807 14028872
chr_3 g1950 g1950.t2 cds g1950.t2.CDS18 14027807 14028872
chr_3 g1950 g1950.t2 exon g1950.t2.exon19 14029113 14029231
chr_3 g1950 g1950.t2 cds g1950.t2.CDS19 14029113 14029231
chr_3 g1950 g1950.t2 exon g1950.t2.exon20 14030041 14030145
chr_3 g1950 g1950.t2 cds g1950.t2.CDS20 14030041 14030145
chr_3 g1950 g1950.t2 TSS g1950.t2 14030432 14030432

Sequences

>g1950.t2 Gene=g1950 Length=3903
ATGGATCAAAATGTTGGCTGTTGGGATTCAGTTTTGCTTGAAAACGAGTCAGAAAATTGT
TTCATTTCAAATTTGCACCAGCGATATAAGCGAGATTTTATATATACTTTTTTAGGATCG
CATATAATCTTTTTAAATCCTTATTGTAAACCATCGACAATATTTTCTTCAGATTTAATT
AGTTCGTATGCGGAAAAAAGTTTGTTTCAATTACCGCCACACATATATTCGTTGACAAAT
AATGTTTATAAATCGTTACAAGATAATAATGAAGATCAATGTATAGTGATGCTTGGTGAA
TCTTCAAGTGGAAAAACTGAAAATGCCCGCATGGTCATAAGATTTCTATCAAAAATCTCG
GGAAGATTTATACCACTTCAACGTCAGAGAAGTTCAAACTCTATTGCTAGCTATAAGAGC
TCTCCTAAATCGACGTGTTCGACACCTAAGCATAAATCTCCAACATCAACGATTCAATCA
GTATTATCGCAAGAGAAAACGAGTTGTTTTAAAAGCGAAGGAAGTGGAAATGTTCCTGGT
GTCAAAAGTAAAAAACTTTCAAGAGTTGAATTTGATTTTTCTTATCAAAAGTGCAATGAT
ATAGATGCAAAGCATGATCTCATTAAGTATTGTCCAAAACACAATTGCTGTAACGTGTCA
TCTTCATCTTCAACCGCATCAAATCCAATTGATATTCCAATACGGCGTAAAAGCACTGGA
TATCAACTCCAACAACAACATCTACATCAATATCCAAATCTTCCAGAACTCCCAGGAATC
TCAAAAAGTTTTACCATTTATGAAACAATGAATCGTGTTCAAGTCAATAATAACAACAAA
AAGGTGCCAAATTGTCTTGATCATGTTCAACAGCAGCCGCAGCCAAGTGAATCATCGCGA
CAAACTCGATGTGAAAGCCTAGATCTCATTAAAATGAGTAATCCAAATTCCTTACAAAAA
TCAGATGATGCAATGGCAGCAACAAATTTCAGCAAATTATTTGATGAATGTATACAATTG
TCAAAAAATCAAACTAATAGCAGCTATAATAGAAGCGATGAAAAACGAAATCATTATACA
CAAATTCGTGATCTATCATACGATCGAAATAAAATTAATTTGGATAATTTTAAGAGTGCC
AAACGAAAAGTGCCAATTAAGAATTCTAGAAATGTTGAACTTAGCAATTTAGAAATTCAA
ACTATGAAAGAACGAATCGCACAAGCCGAAATATTTCTCGAAGCAATGGGTAATGCTTCA
ACTTCAAAGAATCGTGATTCAAGCCGATACGGAAAATATTTTGATCTAGAAATCGATTAT
CGTGGCGATCTAATCGGAGGTCATATAATGCATTTTTTATTGGAAAAGACTCGAGTTACG
AAGCAGTTGGAACGAGAAAGAAACTTTCATATATTTTATCAACTTTTGGCCGGAGCCGAT
ATACATTTTCTAAAATCCTTGAAACTACAAAGAAATATCAACAAGTATGATATACTCAAG
GACACAAGCTCAGATGAAGATGACAAATTTCAATTTGCCTTTACTAGAAAGAGCCTAGAC
ATTCTTGGCTTTACGACTGAGGAAACAACTTCAATCTTTAAAATCATTGCAGTGATTTTA
AAATTAGGCAATTTGAATTTTATACCAATTACTAACATCGATGGAACTGAAGGTTGCGAA
ATTTCAAATGATTATGAAATTCGTGATATATCACAACTGCTTGATATCGAAGAACAAATA
CTACTTAATTGTCTTACAAAATCAGGATCATCATGGATGCAATTAGAAAATGGGTCAGAA
CTTGATGCCATTAATGCAGCACTTATTAACAAAGCTTTATGTCGTACACTTTACGGTCGA
CTTTTTACATATGTGGTTAATCGAATTAATGAATCAATGAAGATCAAAAACTTAACAAAT
CGAGGTAGAAATTTGGGTATACTAGATTTTTTTGGTTTCGAATCGCTTGAAAAAAATTCA
TTCGAGGAATTTAACATAAATTATTGCAATGAACGTATCCATCAGAGCTATATTCAAATT
GTGTTAAAAAGTCAGCAAGATTTATATATTAAAGAGGGATTAGAATGGACCAAAATTGAT
TTCTATGATAATTTAGCAGTATGCGATATGATTGATAAGCTGCCACATGGAATATTTTTG
TTGATGGAGGAGCCAAAAGTCATAAACGATGAAATTTTATTACAAAGACTAGGACAGTGT
TGGTCAGGAAATGCTAGTTTTTCCACACAAGATCATATACCACCAAAGTGTTTTCAAATA
CGTCATTTTGCTGGAGCCTTAAATTACAGTATTGAGGGATTTGTAGAAAAAAATTCAGAT
AAAATCCCTAAACATCTAAGTTCTAGTTTATTTCAAAGCAAATTATCAATAGTACAAAAT
TTATTTCCTGAAGGAAATCCAAAACGAGCTTCAAAAAAACCAACAAATTCGAGTTCTATT
TTACGTTCATCACTGCAAAATTTATTATCTCAAATTGAACTGAGAAAATGCCATTACGTA
TTCTGCGTTAAATCGAATGATAAATGTATGCCAAAAGTATTTGAAGTACCAATTGTTCAA
CATCAAGTTCGATTTATGAGTCTTATGCCAGTTGTTGCTCTCTGGAGAAACGGATTCTAT
TTCAATTTTAGTCATTTGAAATTTTTGAGTCGCTATAAGATTTTAAGTCCATTCACATGG
CCTCATTTTCATTCAAGTATTATTGTAGAATCGATTGCTCAAATTATTCGAAGTGTACCT
CTTCCAGCTGCTGAATTTGCAATTGGACTTACCAAAGTTTTCATCAGAAGTCCTCGAACG
CTTTATGAGTTAAATGAATTTCGTAATCATCGATTGAATTCTTTAGCGACACTCATTCAA
AAAGCATTTCGCCGATATTCACAAAGAAAACTTTTTCTTAGAATGAAAAGAAGTCAAATT
ATAATATCAAGTGCATGGAGAACGTGGCGAGAATGTTGGGCAATTCCAGTATCGGAAAGA
AAACATTTATGGGGTTTATATAAAGTGGCTCGCGAAGAATATCGTTTCATAAAATACCGC
AAGCAAGTTGAGTGGGCTGTCAACACAATCCAACGAAATTACATTACATGGAAACGAAGA
CAATTTCTCATGACACTTCCGATGAGATTACACGCAAATAGTCTCAGTCCAATTTCGACA
GAATGGCCAACGGGTCCTAAGTTTCTTTCCGAGTGCTCGCAATTGTTAAAGATAATTTTT
CATAGATGGAGATGCTATAAATATCGTAAAATGTTTGACCAAACAGCTAGAAATCGCATG
CGAGAAAAAGTTACAGCAAGTATTCTATTCAAGGATCGGAAAGCGTCGTATGTTAAAAGT
GTGTCGCATCCGTTTCTTGGTGATTATGTTCGTTTACGACAAAATGTTCAATGGAAAAAG
ATTTGTGTCGAGAATAATGATCAATATGTAGTGTTTGCTGATATTATTAACAAAATTGCT
CGTTCTAGTGGAAAATATGTTCCGATTCTATTGGTATTATCAACCTCTTCAATGTTACTG
TTGGATCAGCGAACTCTTCAAATTAAATATCGTGTACCAGCTTCTGAAATTTATCGCATG
TCATTGAGTCCATATTTGGATGATATTGCTGTTTTTCATGTTAAAGCGTCTGAGCTTGGC
AAAAAGAAGGGAGATTTTGTGTTTCAAACGGGACATGTGATTGAAATTGTAACAAAAATG
TTTTTAGTAATACAAAATGCAACAAGTAAACCACCTGAGATTCAAATCAATCCTGAATTT
GAAGCAAATTTTGGCAATAATGTTGTAATAATGAGCTTTAAACAGCAAATGATGACAGAT
TTAAATAATCAACAATTAACTCGTGTTTCACGAAAAGGAAATCGAATGGAAGTTATTGTC
TAG

>g1950.t2 Gene=g1950 Length=1300
MDQNVGCWDSVLLENESENCFISNLHQRYKRDFIYTFLGSHIIFLNPYCKPSTIFSSDLI
SSYAEKSLFQLPPHIYSLTNNVYKSLQDNNEDQCIVMLGESSSGKTENARMVIRFLSKIS
GRFIPLQRQRSSNSIASYKSSPKSTCSTPKHKSPTSTIQSVLSQEKTSCFKSEGSGNVPG
VKSKKLSRVEFDFSYQKCNDIDAKHDLIKYCPKHNCCNVSSSSSTASNPIDIPIRRKSTG
YQLQQQHLHQYPNLPELPGISKSFTIYETMNRVQVNNNNKKVPNCLDHVQQQPQPSESSR
QTRCESLDLIKMSNPNSLQKSDDAMAATNFSKLFDECIQLSKNQTNSSYNRSDEKRNHYT
QIRDLSYDRNKINLDNFKSAKRKVPIKNSRNVELSNLEIQTMKERIAQAEIFLEAMGNAS
TSKNRDSSRYGKYFDLEIDYRGDLIGGHIMHFLLEKTRVTKQLERERNFHIFYQLLAGAD
IHFLKSLKLQRNINKYDILKDTSSDEDDKFQFAFTRKSLDILGFTTEETTSIFKIIAVIL
KLGNLNFIPITNIDGTEGCEISNDYEIRDISQLLDIEEQILLNCLTKSGSSWMQLENGSE
LDAINAALINKALCRTLYGRLFTYVVNRINESMKIKNLTNRGRNLGILDFFGFESLEKNS
FEEFNINYCNERIHQSYIQIVLKSQQDLYIKEGLEWTKIDFYDNLAVCDMIDKLPHGIFL
LMEEPKVINDEILLQRLGQCWSGNASFSTQDHIPPKCFQIRHFAGALNYSIEGFVEKNSD
KIPKHLSSSLFQSKLSIVQNLFPEGNPKRASKKPTNSSSILRSSLQNLLSQIELRKCHYV
FCVKSNDKCMPKVFEVPIVQHQVRFMSLMPVVALWRNGFYFNFSHLKFLSRYKILSPFTW
PHFHSSIIVESIAQIIRSVPLPAAEFAIGLTKVFIRSPRTLYELNEFRNHRLNSLATLIQ
KAFRRYSQRKLFLRMKRSQIIISSAWRTWRECWAIPVSERKHLWGLYKVAREEYRFIKYR
KQVEWAVNTIQRNYITWKRRQFLMTLPMRLHANSLSPISTEWPTGPKFLSECSQLLKIIF
HRWRCYKYRKMFDQTARNRMREKVTASILFKDRKASYVKSVSHPFLGDYVRLRQNVQWKK
ICVENNDQYVVFADIINKIARSSGKYVPILLVLSTSSMLLLDQRTLQIKYRVPASEIYRM
SLSPYLDDIAVFHVKASELGKKKGDFVFQTGHVIEIVTKMFLVIQNATSKPPEIQINPEF
EANFGNNVVIMSFKQQMMTDLNNQQLTRVSRKGNRMEVIV

Protein features from InterProScan

Transcript Database ID Name Start End E.value
13 g1950.t2 Gene3D G3DSA:3.40.850.10 Kinesin 2 142 1.9E-32
12 g1950.t2 Gene3D G3DSA:3.40.850.10 Kinesin 398 878 7.6E-129
11 g1950.t2 Gene3D G3DSA:1.10.10.820 - 467 520 7.6E-129
10 g1950.t2 Gene3D G3DSA:1.20.120.720 - 521 632 7.6E-129
14 g1950.t2 Gene3D G3DSA:1.20.58.530 - 656 835 7.6E-129
20 g1950.t2 MobiDBLite mobidb-lite consensus disorder prediction 134 158 -
4 g1950.t2 PANTHER PTHR13140:SF802 MYOSIN IB 7 1281 3.6E-281
5 g1950.t2 PANTHER PTHR13140 MYOSIN 7 1281 3.6E-281
6 g1950.t2 PRINTS PR00193 Myosin heavy chain signature 35 54 1.4E-13
7 g1950.t2 PRINTS PR00193 Myosin heavy chain signature 92 117 1.4E-13
8 g1950.t2 PRINTS PR00193 Myosin heavy chain signature 697 725 1.4E-13
1 g1950.t2 Pfam PF00063 Myosin head (motor domain) 9 122 3.7E-30
2 g1950.t2 Pfam PF00063 Myosin head (motor domain) 397 936 2.7E-130
3 g1950.t2 Pfam PF06017 Unconventional myosin tail, actin- and lipid-binding 1103 1276 1.3E-35
16 g1950.t2 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 1 1162 -
17 g1950.t2 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 1163 1181 -
15 g1950.t2 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 1182 1300 -
22 g1950.t2 ProSiteProfiles PS51456 Myosin motor domain profile. 5 949 141.841
23 g1950.t2 ProSiteProfiles PS50096 IQ motif profile. 952 979 6.705
21 g1950.t2 ProSiteProfiles PS51757 Class I myosin tail homology (TH1) domain profile. 1114 1300 19.651
19 g1950.t2 SMART SM00242 MYSc_2a 1 950 1.3E-151
18 g1950.t2 SMART SM00015 iq_5 951 973 0.0015
9 g1950.t2 SUPERFAMILY SSF52540 P-loop containing nucleoside triphosphate hydrolases 4 990 3.55E-189

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005524 ATP binding MF
GO:0005515 protein binding MF
GO:0016459 myosin complex CC
GO:0003774 cytoskeletal motor activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values