Gene loci information

Transcript annotation

  • This transcript has been annotated as Tyrosine-protein kinase hopscotch.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g4919 g4919.t1 isoform g4919.t1 5958110 5962852
chr_2 g4919 g4919.t1 exon g4919.t1.exon1 5958110 5958358
chr_2 g4919 g4919.t1 cds g4919.t1.CDS1 5958110 5958358
chr_2 g4919 g4919.t1 exon g4919.t1.exon2 5958458 5958810
chr_2 g4919 g4919.t1 cds g4919.t1.CDS2 5958458 5958810
chr_2 g4919 g4919.t1 exon g4919.t1.exon3 5958897 5959089
chr_2 g4919 g4919.t1 cds g4919.t1.CDS3 5958897 5959089
chr_2 g4919 g4919.t1 exon g4919.t1.exon4 5959362 5960384
chr_2 g4919 g4919.t1 cds g4919.t1.CDS4 5959362 5960384
chr_2 g4919 g4919.t1 exon g4919.t1.exon5 5960448 5960877
chr_2 g4919 g4919.t1 cds g4919.t1.CDS5 5960448 5960877
chr_2 g4919 g4919.t1 exon g4919.t1.exon6 5960936 5961321
chr_2 g4919 g4919.t1 cds g4919.t1.CDS6 5960936 5961321
chr_2 g4919 g4919.t1 exon g4919.t1.exon7 5961385 5961702
chr_2 g4919 g4919.t1 cds g4919.t1.CDS7 5961385 5961702
chr_2 g4919 g4919.t1 exon g4919.t1.exon8 5962343 5962852
chr_2 g4919 g4919.t1 cds g4919.t1.CDS8 5962343 5962852
chr_2 g4919 g4919.t1 TSS g4919.t1 NA NA
chr_2 g4919 g4919.t1 TTS g4919.t1 NA NA
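
The coordinates in the table above can be cross-checked against the transcript length given in the FASTA header (Length=3462). The following is a minimal sketch, assuming that Start and End are 1-based and inclusive; the page does not state this, and the variable and function names are illustrative only.

```python
# Exon coordinates copied from the gene structure table (chr_2, g4919.t1).
# Assumption: Start/End are 1-based and inclusive, so length = end - start + 1.
EXONS = [
    (5958110, 5958358),
    (5958458, 5958810),
    (5958897, 5959089),
    (5959362, 5960384),
    (5960448, 5960877),
    (5960936, 5961321),
    (5961385, 5961702),
    (5962343, 5962852),
]


def exon_lengths(exons):
    """Return the length of each exon under the inclusive-coordinate assumption."""
    return [end - start + 1 for start, end in exons]


def intron_lengths(exons):
    """Return the length of each gap between consecutive exons."""
    return [
        next_start - prev_end - 1
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


lengths = exon_lengths(EXONS)
total = sum(lengths)
print("exon lengths:", lengths)
print("intron lengths:", intron_lengths(EXONS))
print("spliced length:", total)
```

Under this assumption the exon lengths sum to 3462, which agrees with the nucleotide record. Every exon row coincides with a CDS row, so the annotated transcript carries no untranslated regions.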

Sequences

>g4919.t1 Gene=g4919 Length=3462
ATGTGGGCAATGGAAACTCGAGAAATCAAAAATTATATATTGAATTACAAAACTGGCAAA
TTTAAAGAGATAAAGCATGACAAGGACGAGACATGTGAAGATTTTTGTAAAGAGTTATGC
CGCAGATGGAATTTTCCACCTTTAGTGCAATTGTTGTTTGGCCTTCGTCTACATGGTACT
AAAATATGGCTTGGAAGTGCACGTCAACTTGTTGCTGATAAACATTATGAATTCAGAATT
CGAATCAAAATTCCTAAATTAGCTGATCTAAACAAGCATGATAAGAACACGTTTGACTAC
TTTTATCATCAAGTACGCTACGATGTACTCCATAATGATATACCTGGACTTATCAATGAG
AATACAAGAAACAAAATTCTTGGTCTTTGTGTCACAGACATGTATGTTGAGATGTTAGAA
AATGAAAATAAAGACTATCCGAATGAGACAAGCAGAAAAAAGGCACGTGAGGAGAAAATG
AAATATTTGAATGTAAACTACAAGAATTACATTCCAAGATTTTTATATGAAAAACATTGG
GTAATATTGCAAATGCGGATCAAAAAATCACTGAAAGGCGTTGATTATAAACATGATCCT
CTTTATGTTAAAACGTCATATATACAACAAGTCGATTCAATGGCACCAAACTATTTGATA
GAAGAATATTGGGGTAAAATTGCATACCCGCAAGAAGATCATATGAGACAAGGAACATGT
CGTGTAAGACTTCAAATTGCACCTTACGACAAAGAACAACCTGGATTGAGAATGCATTAT
GCATATAAAGATACTTGGCGACATATTTCAACATTTGCTGATTTTTATGCTATTCAAATT
GATGCAGATGCTAAGCAAGTGAGATTAGAAATTCAAAACTCACCTCAAGGTTTCCCTATA
ACGATGGATAGTATTGAAGAAATTGAATCATTTGTGAATTGTATGAATATCTATTATCGA
TTGACAGTAAAGTGGACTTGGTATCTCTGTGAGTTGCTTAAATCGCCATCACTTGATTTT
CTTAACAAATATAAAATTCACGGCCCAATTGGTGGAGGATATTCTTATAATAGAATAAAA
GAAATTGGAAAGGGAGTTGGCACATATATCATTCGACAATGTGAAAAAGAATTTGACATA
TATTACATTGATATTTTAACCAAAAAAAATCAAAGTTGTGAAACATTTAAAATAAATGGT
GCTGCAGAAAAATGGCAACTTTATGATAATGAAAATAACGAAATTTCAGCAGAATTTGAT
AATCTTGTGTCACTTGCAAAAAGCATACCCGTTGAAAGTGGTGTCTATAATCGATTACCA
CCGTCATGTTATGAAAAGCCACCTCTATTGCTTCTATGCCAAACTAATATTAAATCTGTA
CCTGCTTCAGCGACGAATACAACGATGATCACAACAACTCAAAATTCACCAGGTGGTTTG
AGAACACAACGACCAGTCGTATTCACTAACGAAGATTTTAGAATGTATATTCCGAGTACA
CGTGAAATTAATGATGAAGCATTTGAGCAGAGAAAAGCTGAATATCGTGATAGAAATACT
GAAGTTACATTGAAACTTTTAAAGACTACCGAGAAATTGACTGAATTTCATTTATTAGCT
GACAAGTGGTCAAAATTAGATATTTCTGAAATAGTAAAATTGAATGGAATTATTCTCAAT
CCAGTTGCTCTTGTTCTTGAACCATTAAAATATGGACCTTTAGATATATTTTTAAGAACA
CACGAATTTAGAAGACAAGTTGTTCCACTAAATCTTGTCGAGACTGCATATTCATTAGCA
AGAGCACTCCACTACTTGCAAGAAAAGCAAATCGTTCACGGTAGAATAAAATGTTCCAGT
TTAGAAGTGATTAAATTTGATCCAGGAAATTCATTTGAAGTAAAACTCGGTGATCCCGGT
TTACCGCGTGATTTGAAAGTTCGTGATGTGCCATGGATTCCGATTGAAGATTATGATGAT
CTCAATAATAGCAGAAACAATTTGAAAGCTGACATATGGGCATATGCCACAACATTATGG
GAGATATTTTCTAGAGGAGAATCTCCATTCACAGAACTTTCAAAAGTTCCAAATATTACT
GAATTTTTCCGAAGAGGAGATCGACTACCAAAGCCAAAAGAATGTGAATTACTGCCAAGA
ATTTATGAGATAATGAAATCAGGATGGGAAGTTGAACCTGAAAAAAGATTTGCACCACAA
ACAATTTTTTCACCATTGCTTGATATAAGTAGGAATTTGTCACGACATTATGAAAGTCCT
ATAAGTTCAAATCCTCGAAGTAATGGAACAATGCAAAGAATGAATGGAGGCATTCGCAAT
GGTCGTCCAAGTTCTTCAAATTCAATGTTTTATAATAATGGAAGTTTAGTTTCTAATGAA
ACTGATCAAACATATATAAGTAGTTTAATGCCAGGAACACTGCACACGAATGCTTATATT
GATGGTGGATCAAGTATAGACAATTCAAGTCAAATATCATTACTTAATGGACATTCTATT
GCCTCTAATGGTTCACATTCAACAACTAATGCATTTATTGAACATAATTTTGATGGTGAA
TGTATGGAGTTAGATGATAATAGAAAATTATCGTTTAGAGGATATATAGGAAGTGGCAAT
TTTGGAGTGGTTTATAAGGGAACTATTGGACCATTAATATTTAATCCTATGGAAGATGAA
GAAGAGGAAGTTGCCATTAAATGCTTTAAACCAATTGATTCATTAAACCAAGCGAAAGAT
TTTCTGCGTGAAGTCAGAATGATGAAAGCTTTAAATCATGAAAATATTGTTAAAATTTAC
GATTTTCATGAAGATTGGCTACTTATAATAATGGAGTATATGTCAGGTGGATCTCTTCAA
GAATTTGTTGCAATACATCGACATGAATTAACAGTTGATGATATACTACAATTTGCTTTA
CATATTGCAAAAGGAATGCACTACTTAGAACAAAATAAAATTGTTCATCGAGATTTGGCT
GCGCGCAATGTGTTAGTGACGAGAAATAGTTCACTTCTATTGTCTGATACTGTTTGTAAA
ATTGCCGATTTTGGTTTGGCTCAATTTACTAATCATTATGGATATTATGAGTCTACAAAT
AACAGAGATCTTCCTCTGCAATGGTATGCTCCAGAAACAATTAGTTGCTTAAAATTTAGC
TCAAAAAATGATGTATGGTCGTTTGGAATAACATTATGGGAAATGTTTTCATTTGGAGAT
ACCCCAAGACTTGTTCCTAAATCAGATTTTAAAGGTGAAGATTTGTTGCAGGCACTTGAA
AAAGGTGAACGTCTTAAATGTCCTAAACATTGTCCTCAAAATATTTATGAAGAACTGATG
CGTGATGTATGTTGGAGTTATAATTCTGATAAAAGGCCAAATTTTGCTGGCATTATTGAA
AAAATTCGAAATCTTTTAATTAGGAATGGTGAATTAGTTTAA

>g4919.t1 Gene=g4919 Length=1153
MWAMETREIKNYILNYKTGKFKEIKHDKDETCEDFCKELCRRWNFPPLVQLLFGLRLHGT
KIWLGSARQLVADKHYEFRIRIKIPKLADLNKHDKNTFDYFYHQVRYDVLHNDIPGLINE
NTRNKILGLCVTDMYVEMLENENKDYPNETSRKKAREEKMKYLNVNYKNYIPRFLYEKHW
VILQMRIKKSLKGVDYKHDPLYVKTSYIQQVDSMAPNYLIEEYWGKIAYPQEDHMRQGTC
RVRLQIAPYDKEQPGLRMHYAYKDTWRHISTFADFYAIQIDADAKQVRLEIQNSPQGFPI
TMDSIEEIESFVNCMNIYYRLTVKWTWYLCELLKSPSLDFLNKYKIHGPIGGGYSYNRIK
EIGKGVGTYIIRQCEKEFDIYYIDILTKKNQSCETFKINGAAEKWQLYDNENNEISAEFD
NLVSLAKSIPVESGVYNRLPPSCYEKPPLLLLCQTNIKSVPASATNTTMITTTQNSPGGL
RTQRPVVFTNEDFRMYIPSTREINDEAFEQRKAEYRDRNTEVTLKLLKTTEKLTEFHLLA
DKWSKLDISEIVKLNGIILNPVALVLEPLKYGPLDIFLRTHEFRRQVVPLNLVETAYSLA
RALHYLQEKQIVHGRIKCSSLEVIKFDPGNSFEVKLGDPGLPRDLKVRDVPWIPIEDYDD
LNNSRNNLKADIWAYATTLWEIFSRGESPFTELSKVPNITEFFRRGDRLPKPKECELLPR
IYEIMKSGWEVEPEKRFAPQTIFSPLLDISRNLSRHYESPISSNPRSNGTMQRMNGGIRN
GRPSSSNSMFYNNGSLVSNETDQTYISSLMPGTLHTNAYIDGGSSIDNSSQISLLNGHSI
ASNGSHSTTNAFIEHNFDGECMELDDNRKLSFRGYIGSGNFGVVYKGTIGPLIFNPMEDE
EEEVAIKCFKPIDSLNQAKDFLREVRMMKALNHENIVKIYDFHEDWLLIIMEYMSGGSLQ
EFVAIHRHELTVDDILQFALHIAKGMHYLEQNKIVHRDLAARNVLVTRNSSLLLSDTVCK
IADFGLAQFTNHYGYYESTNNRDLPLQWYAPETISCLKFSSKNDVWSFGITLWEMFSFGD
TPRLVPKSDFKGEDLLQALEKGERLKCPKHCPQNIYEELMRDVCWSYNSDKRPNFAGIIE
KIRNLLIRNGELV
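
The two records above can be checked against each other by translating the nucleotide sequence. The sketch below assumes the standard genetic code (the page does not name the translation table) and, for brevity, translates only the first and last lines of the CDS; `translate` and the constant names are illustrative.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
# Assumption: the annotation used the standard table (not stated on the page).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the g4919.t1 nucleotide record.
CDS_FIRST_LINE = "ATGTGGGCAATGGAAACTCGAGAAATCAAAAATTATATATTGAATTACAAAACTGGCAAA"
CDS_LAST_LINE = "AAAATTCGAAATCTTTTAATTAGGAATGGTGAATTAGTTTAA"

print(translate(CDS_FIRST_LINE))
print(translate(CDS_LAST_LINE))
```

The first call reproduces the first 20 residues of the protein record and the second reproduces its final line, ending at the TAA stop codon. The lengths also agree: 3462 nt is 1153 codons plus one stop codon.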

Protein features from InterProScan

Row Transcript Database ID Name Start End E-value
12 g4919.t1 CDD cd14473 FERM_B-lobe 97 211 5.40464E-6
13 g4919.t1 CDD cd00192 PTKc 875 1143 1.28556E-100
11 g4919.t1 Gene3D G3DSA:1.10.510.10 Transferase(Phosphotransferase) domain 1 509 759 2.2E-29
10 g4919.t1 Gene3D G3DSA:1.10.510.10 Transferase(Phosphotransferase) domain 1 864 1147 4.0E-74
3 g4919.t1 PANTHER PTHR45807:SF7 TYROSINE-PROTEIN KINASE HOPSCOTCH 16 1144 2.8E-206
4 g4919.t1 PANTHER PTHR45807 TYROSINE-PROTEIN KINASE HOPSCOTCH 16 1144 2.8E-206
5 g4919.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 951 964 6.6E-17
6 g4919.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 988 1006 6.6E-17
7 g4919.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 1063 1085 6.6E-17
2 g4919.t1 Pfam PF07714 Protein tyrosine and serine/threonine kinase 514 741 4.4E-32
1 g4919.t1 Pfam PF07714 Protein tyrosine and serine/threonine kinase 872 1141 1.4E-79
15 g4919.t1 ProSitePatterns PS00107 Protein kinases ATP-binding region signature. 876 907 -
14 g4919.t1 ProSitePatterns PS00109 Tyrosine protein kinases specific active-site signature. 994 1006 -
18 g4919.t1 ProSiteProfiles PS50057 FERM domain profile. 7 326 17.46
19 g4919.t1 ProSiteProfiles PS50011 Protein kinase domain profile. 497 747 14.662
20 g4919.t1 ProSiteProfiles PS50011 Protein kinase domain profile. 870 1146 35.269
17 g4919.t1 SMART SM00295 B41_5 3 223 0.0079
16 g4919.t1 SMART SM00219 tyrkin_6 870 1142 6.8E-100
9 g4919.t1 SUPERFAMILY SSF56112 Protein kinase-like (PK-like) 512 757 7.57E-26
8 g4919.t1 SUPERFAMILY SSF56112 Protein kinase-like (PK-like) 868 1143 6.01E-67

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts that the residue lies in a disordered region.
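
The 0.5 cutoff can be turned into region calls by grouping consecutive residues whose score exceeds it. This is a minimal sketch: the per-residue scores are hypothetical placeholders (no scores for g4919.t1 appear on this page), and `disordered_regions` is an illustrative helper, not part of IUPred3.

```python
def disordered_regions(scores, threshold=0.5):
    """Return 1-based inclusive (start, end) runs where score > threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # open a new disordered run
        elif start is not None:
            regions.append((start, pos - 1))  # close the current run
            start = None
    if start is not None:
        regions.append((start, len(scores)))  # run extends to the last residue
    return regions


# Hypothetical scores for a five-residue peptide, for illustration only.
example_scores = [0.1, 0.6, 0.7, 0.4, 0.9]
print(disordered_regions(example_scores))
```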

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005856 cytoskeleton CC
GO:0004713 protein tyrosine kinase activity MF
GO:0005524 ATP binding MF
GO:0004672 protein kinase activity MF
GO:0006468 protein phosphorylation BP

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean +/- standard deviation.
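
The summary statistic can be reproduced from replicate TPM values as sketched below. The replicate values are hypothetical, and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the page does not say which estimator was used.

```python
from statistics import mean, stdev


def summarize_tpm(replicates):
    """Return (mean, sample standard deviation) for one condition."""
    return mean(replicates), stdev(replicates)


# Hypothetical replicate TPM values for a single condition.
example_replicates = [10.0, 12.0, 14.0]
avg, sd = summarize_tpm(example_replicates)
print(f"{avg:.2f} +/- {sd:.2f}")
```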

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was called differentially expressed when it met both criteria: (1) FDR < 0.05 and (2) fold change > 2, with TPM values calculated by RSEM. DE information and the fold change between conditions are indicated in the plot below.
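
The two criteria amount to a simple filter, sketched below. This is not the Trinity script itself: the function name is illustrative, and treating the fold-change cutoff symmetrically (more than 2-fold up or down, i.e. |log2 FC| > 1) is an assumption, since the text only says "fold change > 2".

```python
import math


def is_differentially_expressed(fdr, fold_change, fdr_cutoff=0.05, fc_cutoff=2.0):
    """Apply both criteria: FDR below the cutoff and fold change beyond it.

    Assumption: the cutoff applies in both directions, so a fold change
    below 1/fc_cutoff also passes. fold_change must be positive.
    """
    passes_fdr = fdr < fdr_cutoff
    passes_fc = abs(math.log2(fold_change)) > math.log2(fc_cutoff)
    return passes_fdr and passes_fc


# Hypothetical (FDR, fold change) pairs for illustration.
print(is_differentially_expressed(0.01, 4.0))   # significant, 4-fold up
print(is_differentially_expressed(0.01, 1.5))   # significant, change too small
print(is_differentially_expressed(0.20, 8.0))   # large change, not significant
```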

Raw TPM values