Gene loci information

Transcript annotation

  • This transcript has been annotated as ALK tyrosine kinase receptor.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g5913 g5913.t1 isoform g5913.t1 12799363 12807984
chr_2 g5913 g5913.t1 exon g5913.t1.exon1 12799363 12799755
chr_2 g5913 g5913.t1 cds g5913.t1.CDS1 12799363 12799755
chr_2 g5913 g5913.t1 exon g5913.t1.exon2 12799838 12800280
chr_2 g5913 g5913.t1 cds g5913.t1.CDS2 12799838 12800280
chr_2 g5913 g5913.t1 exon g5913.t1.exon3 12800384 12800540
chr_2 g5913 g5913.t1 cds g5913.t1.CDS3 12800384 12800540
chr_2 g5913 g5913.t1 exon g5913.t1.exon4 12800604 12800683
chr_2 g5913 g5913.t1 cds g5913.t1.CDS4 12800604 12800683
chr_2 g5913 g5913.t1 exon g5913.t1.exon5 12800745 12801193
chr_2 g5913 g5913.t1 cds g5913.t1.CDS5 12800745 12801193
chr_2 g5913 g5913.t1 exon g5913.t1.exon6 12801251 12801458
chr_2 g5913 g5913.t1 cds g5913.t1.CDS6 12801251 12801458
chr_2 g5913 g5913.t1 exon g5913.t1.exon7 12801745 12801843
chr_2 g5913 g5913.t1 cds g5913.t1.CDS7 12801745 12801843
chr_2 g5913 g5913.t1 exon g5913.t1.exon8 12801905 12802240
chr_2 g5913 g5913.t1 cds g5913.t1.CDS8 12801905 12802240
chr_2 g5913 g5913.t1 exon g5913.t1.exon9 12802366 12802503
chr_2 g5913 g5913.t1 cds g5913.t1.CDS9 12802366 12802503
chr_2 g5913 g5913.t1 exon g5913.t1.exon10 12802568 12802745
chr_2 g5913 g5913.t1 cds g5913.t1.CDS10 12802568 12802745
chr_2 g5913 g5913.t1 exon g5913.t1.exon11 12802831 12803122
chr_2 g5913 g5913.t1 cds g5913.t1.CDS11 12802831 12803122
chr_2 g5913 g5913.t1 exon g5913.t1.exon12 12804766 12804931
chr_2 g5913 g5913.t1 cds g5913.t1.CDS12 12804766 12804931
chr_2 g5913 g5913.t1 exon g5913.t1.exon13 12805004 12805678
chr_2 g5913 g5913.t1 cds g5913.t1.CDS13 12805004 12805678
chr_2 g5913 g5913.t1 exon g5913.t1.exon14 12806583 12806830
chr_2 g5913 g5913.t1 cds g5913.t1.CDS14 12806583 12806830
chr_2 g5913 g5913.t1 exon g5913.t1.exon15 12806900 12807984
chr_2 g5913 g5913.t1 cds g5913.t1.CDS15 12806900 12807984
chr_2 g5913 g5913.t1 TSS g5913.t1 12808889 12808889
chr_2 g5913 g5913.t1 TTS g5913.t1 NA NA

Sequences

>g5913.t1 Gene=g5913 Length=4947
ATGAAACATCAAATCTTGTATTTGCTGTTAGTTGCAGCCGCTTTTACAAATGCACAACGT
CCATTGAATACAACAAACACATCAGGCGTGCTTACTGAGCCTTTTTTTATAAAGCCAACA
AGCATGTCAACATCAAAATACTATCCAGAACAACGCGGAGGCGGCAATCCCTATTTCAAT
TCTAATGATTCGCCTGGAAAGGGCAGAAGTAATAGCGGAGGCGGAAACGAACGACTTTTT
CAAGCATCACAACATCCAAATGTTGATATTTTAATTCCGCCATTGAATGGAATTGTTGGT
GATCCTGGAAGTGCACAATCACAAGTGCCAGCATCAAATCGACTGCATAAAGTTAGCAAA
TCACGACAAACAAAATTTAATCAACTTTATCAGCTCGCTAAAAATCCCAATTTTACACTA
AACAATCCTAATCGTGAAAAGAATCAAAATTTTTCATTTGGTCATCGTAATGATTATCAT
CATGGTAATATTATAACATCAAAGAAAGAAGCAAGACCTGAAAGAATTTATAGTAGTCCA
ACTGATACATTTGATCAACCAAAAGCAACATGGTCTGAAGGTCATTCATTTCAAGAAGTA
AAATATCAACTAAATGATAAAAATCATATGCATACACCATCACCACCATCATATTCAGTC
GATGACATTGATTTTGATGAAAAACTCGGTGTAAAATGCACATTTGAAAAGCCTTGTGCT
TGGACTTATGACACTGATGTTGTCGGTACAAATTTTGAAGTGACAACAGGTGCAAATTTA
ACAAGTGCTAATATAACTGGTGTCATGTCAGGACCATCAGCAGACGATTTAAAAGATGCC
AATGGTCATTTTTTGCATTTACCATTAACTCCAAATACAACAACAAGAATTCTTAGATCA
CCTGTATTTAGTTCAACACGTGAACGCTGCTATCTTGAAGTATTTATGCATCAAAGTTCA
ATGGCCTATGGTTCGATCAAAATTGTCATCGAACCAGCAGCATCACGTGAAAATTCTTGG
GTTCCTGCTGAGATCGTGGGCGATGATTTAAGAAAATGGAAACATCATGCATTTGAAATT
GACAGAATCACATTTGATTTTCGAATTCTATTGGAAATTGTGCCAAATGGATTGGGTGGT
GCACTTCGTGGTCATGTGAGCATTGATAATCTTAAAATGAGTAATTGCTTTGCTGATCCA
CCACGTAAAGATTCATGTAGTCTTTCACAAATTAAATGCAATCAAAATAAAATTGCCGTT
TGTATTGAAAATGCTCGCATCTGTGATTTAGTGCAAGATTGTGATGATGCAGAGGATGAA
AATTTAAATTGCGACAAAATACCATTTGGAGGATTATGTGATTTTGAAAATGGCTGGTGT
GGTTGGCAAAATTCGGGAAAAGCCATGTTAGATTGGATGCGTCATATAGGTCCAACACCT
ACAGAGAAAACTGGTCCCGAATTTGACCATACTTATCAGCATACAAATAAATCTGGTCAT
TATATGTTTGTAAATATGGACCAACATGCAAATGATCCAGAAAGAAAAAGTCTCGCAGGA
TTTGCGAGTAATGCTATCATAAATTCGGTGATTTTTAATCCACCACCACCATGTCATTCA
AATATAACTTCACCCTACCGTAATACATGCATGGCACGACTTTTTGTGCATCAATTTGGC
CTAAATCCTGGTAGTTTTAATGTATCGGTAGTTGAAATGAAGGCAAAGGAAAATATCACA
ACTACATTATGGTGGAGCTCACGGTCGATTGGCGACAAGTGGGAGCGTGTCGAAGTAGTG
CTTCCAAACATTACGTCCAAATACTATTTTCAAATTGAGGCACGAAAAGGCATGCGAATT
TACAGTGACGTGGCCATAGACGATTTTTCAATGAGCCCCGAGTGCTTTGGCTTCAACATC
CCACTCGAGCATCTAAATAATTATAACTATTGGGATCCAAGAATTGGAATCCACAAAAAA
CCCCATAGCGATTTTGTAGATAAGAAATTTTATGAACTATCAACATGTGGTGCACATGGC
ATTTATGGTCCCACACCTGAAGCTTGCTCAATTCACTATAATAACACTGAAATGATGAAT
CATGTACGTGTATCAGATAAAGTACCTTTTAAAGGTGTACAAATATGGAAAGTACCTAGC
GAAGGATATTACACAATAATAGTAAAAGGAGCGAGTGGTGGTTTAGGAAGTGGTGGTGTT
TCATCATCACGTGGAGCAATTGTTAGCTCAGTAATGGAACTACACAAGGGAGAAGAAATT
TACATTCTCGTTGGTCAAAAAGGCGAGAATGCATGTATAAAATCAATGGGAATTTCAGAT
GAAGCATGCTCACCTAAAATTTCCAAATATCCTGATCAAAATTCTGTCAACATAATCAAT
CAATTGCGAAATGGTTTACTCGAGAATGGCGCTGGTGGTGGTGGTGGAGCAAGCTTTGTA
TTTTTGTTGAATCAAGTGAACAAAGCTGTTCCATTGATCGTAGCAGGTGGAGGCGGCGGT
TTGGGAATTGGACGATATCTTGATGAAGATATTCAACAGGCAAAAGCATTTGTGCCAGAA
CGTGGTGATGTTACTGGCATTCAAGAGCTTGATATTGGTGAATCAATTGTTGCAGGCCCT
GGAGGTGGTTGGCGACCTCGACATGATTCAGCTCTAAGTATACATTCTGGTGCATCACTT
TTAGAAGGAGCAAGAGGTGGAATAGCATGCTATAATACTATTGGGCAAGACCATAAATAT
GGCAATGGAGCATTTGGTGGTGGCGGCGGCTCATGTCAGTCAGGTGGTGGAGGCGGTGGA
TATGCTGGAGGCAATTCAATGTTGAACTTTACAACAAACTCCTCAGCCAATCCTGATGCA
TTTTCAACTAATGGTGAAGGTGGAACTTCTTATTTGAGTCAAACACGAAGCATTAAGGAG
CTATCGTTTGTTTATCCTGGCTCAAATAGTGGTCATGGATCAGTTATATTGATTCCAGCC
ATTGAAGGTTGCGGTTGTGACTACCGGTGTGTTGCACTTGATGAATATCGATCACTTGTT
GCATGCATTTGTCCAGAAGGATGGCGATTAAAACCTGATAATTTAACATCATGCGAAATT
CTTGAAGAACCAAGCAATGATAAAGTATTAGTCGTAATACTCGGAATTGTATCATTCCTT
TTGGTTTCATTCTTAGTATTATTGATGTGCATGTTGTACACTCGTTTCCAAAGAAGACGA
CAAGCCGAGACGCGCCACAAAATGCTACTCGAACAAGAAGTTCAATTAAGTCGTTTGCGA
CACACCGACGATTCAGCTTTAACAAATTTTAACCCAAATTATGGTTGTGATGGAATTCTC
AATGGCGGAAATGTTGACGTTAAAAGTCTGCCACAAGTTGCACGTGAAAGTCTACGACTT
GTAAAAGCACTTGGACAAGGAGCATTTGGTGAAGTATATCAAGGACTTTATCGTCATCGT
GATGGTGACACAGTTGAGATGCCTGTTGCTGTTAAAACCCTGCCGGAACTGTCAACTGGA
CAGGCCGAATCTGATTTCCTTATGGAAGCTGCCATAATGGCAAAATTTAACCATCCAAAT
ATTGTCCATTTAATTGGCGTATGTTTCGATCGACATCCACGCTTTATTGTTCTTGAGTTG
CTAGCTGGCGGTGATTTAAAGAATTTCTTGCGTGAAGGTCGCAACAAGCCCGAACGTCCA
TCACCATTAACTATGAAAGATCTGATATTTTGCGCACTGGACGTTGCTAAGGGTTGCCGT
TATATGGAGAGCAAACGCTTTATTCATCGAGATATCGCTGCAAGAAATTGTCTACTTAGT
AGCAAAGGTCCAGGTCGAGTTGTCAAGATTGCAGATTTTGGCATGGCTCGAGATATTTAC
AGATCAGACTATTATCGTAAAGGAGGAAAAGCAATGCTTCCAATAAAATGGATGCCGCCT
GAAGCATTTTTGGATGGAATTTTCACGTCAAAAACAGACATCTGGAGTTATGGTGTACTG
TTGTGGGAAGTATTTTCACTTGGTCTCATGCCTTATACGGGGCTGCCAAATCGTGACGTC
ATGCAGTTAGTAACTGGAGGAGGACGACTTGATGCACCACCTGGCTGTCCAGCTGCAATT
TATCGAATAATGGCAGAATGTTGGAATCCAACGCCTGAAGCACGTCCCACGTTTTCAGTT
CTCTTGGAACGTTTGACTGCATGCACACAAGACCCAGAGATCATGAACGCTGCACTGCCA
AGTTTTTTCCGCCCACAATCAATGGAACGTGATGCTACAATAATGAGACCGTCGGGAAAT
GATGATTTCTGCTTGCAAGTTCCCAATTCTTCTGACTATTTAATACCATTACCTGATTCA
CGCGATTCTCGTGCCATAGCAGAGCGTCTGCTTAATGAAGCAACTTGCGTCACACTACCT
GATACTCTATCAACCTACACACTCACAAACTCGCTTAAAATGCCCGAAGCAAACTGCTGG
GAGACATCTTTTACCAATCCCAAGCTCAACAATGGCGTCGACAATGGAATGGATGATCGT
TTGATAAGTTTAGATACACCGACAACAATTCAACCACCACAAGCATTTTGTAATAGTCCA
GTACAAACACTTTCATCAAGTAACAATAACAACAACAACAGCAATGGCAACAAAGCAAAT
AACAACTCAGCTATCACACTTGATCCGGCCACATTACAACAAGGCTATGCAAATGTAAAA
ATGATGAATGGCACTGATAAGCTGGATAATGTCAATGACATGACAAAGTTTAATGGTGGT
GGTACAACAATGGTCATGATGAATAACAATAATAATGGTACAATTGCTAATGGCAATGGC
ATGATAACCAAAGATAAGAATTCAAATCAACCATTTTCGATACCAATGCAGGGCTATAAT
GAAAATCATGCTGAAATCAGCTGCTAA

>g5913.t1 Gene=g5913 Length=1648
MKHQILYLLLVAAAFTNAQRPLNTTNTSGVLTEPFFIKPTSMSTSKYYPEQRGGGNPYFN
SNDSPGKGRSNSGGGNERLFQASQHPNVDILIPPLNGIVGDPGSAQSQVPASNRLHKVSK
SRQTKFNQLYQLAKNPNFTLNNPNREKNQNFSFGHRNDYHHGNIITSKKEARPERIYSSP
TDTFDQPKATWSEGHSFQEVKYQLNDKNHMHTPSPPSYSVDDIDFDEKLGVKCTFEKPCA
WTYDTDVVGTNFEVTTGANLTSANITGVMSGPSADDLKDANGHFLHLPLTPNTTTRILRS
PVFSSTRERCYLEVFMHQSSMAYGSIKIVIEPAASRENSWVPAEIVGDDLRKWKHHAFEI
DRITFDFRILLEIVPNGLGGALRGHVSIDNLKMSNCFADPPRKDSCSLSQIKCNQNKIAV
CIENARICDLVQDCDDAEDENLNCDKIPFGGLCDFENGWCGWQNSGKAMLDWMRHIGPTP
TEKTGPEFDHTYQHTNKSGHYMFVNMDQHANDPERKSLAGFASNAIINSVIFNPPPPCHS
NITSPYRNTCMARLFVHQFGLNPGSFNVSVVEMKAKENITTTLWWSSRSIGDKWERVEVV
LPNITSKYYFQIEARKGMRIYSDVAIDDFSMSPECFGFNIPLEHLNNYNYWDPRIGIHKK
PHSDFVDKKFYELSTCGAHGIYGPTPEACSIHYNNTEMMNHVRVSDKVPFKGVQIWKVPS
EGYYTIIVKGASGGLGSGGVSSSRGAIVSSVMELHKGEEIYILVGQKGENACIKSMGISD
EACSPKISKYPDQNSVNIINQLRNGLLENGAGGGGGASFVFLLNQVNKAVPLIVAGGGGG
LGIGRYLDEDIQQAKAFVPERGDVTGIQELDIGESIVAGPGGGWRPRHDSALSIHSGASL
LEGARGGIACYNTIGQDHKYGNGAFGGGGGSCQSGGGGGGYAGGNSMLNFTTNSSANPDA
FSTNGEGGTSYLSQTRSIKELSFVYPGSNSGHGSVILIPAIEGCGCDYRCVALDEYRSLV
ACICPEGWRLKPDNLTSCEILEEPSNDKVLVVILGIVSFLLVSFLVLLMCMLYTRFQRRR
QAETRHKMLLEQEVQLSRLRHTDDSALTNFNPNYGCDGILNGGNVDVKSLPQVARESLRL
VKALGQGAFGEVYQGLYRHRDGDTVEMPVAVKTLPELSTGQAESDFLMEAAIMAKFNHPN
IVHLIGVCFDRHPRFIVLELLAGGDLKNFLREGRNKPERPSPLTMKDLIFCALDVAKGCR
YMESKRFIHRDIAARNCLLSSKGPGRVVKIADFGMARDIYRSDYYRKGGKAMLPIKWMPP
EAFLDGIFTSKTDIWSYGVLLWEVFSLGLMPYTGLPNRDVMQLVTGGGRLDAPPGCPAAI
YRIMAECWNPTPEARPTFSVLLERLTACTQDPEIMNAALPSFFRPQSMERDATIMRPSGN
DDFCLQVPNSSDYLIPLPDSRDSRAIAERLLNEATCVTLPDTLSTYTLTNSLKMPEANCW
ETSFTNPKLNNGVDNGMDDRLISLDTPTTIQPPQAFCNSPVQTLSSSNNNNNNSNGNKAN
NNSAITLDPATLQQGYANVKMMNGTDKLDNVNDMTKFNGGGTTMVMMNNNNNGTIANGNG
MITKDKNSNQPFSIPMQGYNENHAEISC

Protein features from InterProScan

Transcript Database ID Name Start End E.value
27 g5913.t1 CDD cd06263 MAM 453 635 1.51615E-33
28 g5913.t1 CDD cd05036 PTKc_ALK_LTK 1131 1407 0.0
18 g5913.t1 Gene3D G3DSA:2.60.120.200 - 217 407 1.0E-12
17 g5913.t1 Gene3D G3DSA:2.60.120.200 - 432 640 2.4E-31
16 g5913.t1 Gene3D G3DSA:3.30.200.20 Phosphorylase Kinase; domain 1 1103 1220 2.0E-36
19 g5913.t1 Gene3D G3DSA:1.10.510.10 Transferase(Phosphotransferase) domain 1 1221 1410 2.0E-56
5 g5913.t1 PANTHER PTHR24416:SF540 ANAPLASTIC LYMPHOMA KINASE, ISOFORM A 295 1417 5.3E-166
6 g5913.t1 PANTHER PTHR24416 TYROSINE-PROTEIN KINASE RECEPTOR 295 1417 5.3E-166
10 g5913.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 1218 1231 7.3E-32
7 g5913.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 1261 1279 7.3E-32
11 g5913.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 1313 1323 7.3E-32
8 g5913.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 1332 1354 7.3E-32
9 g5913.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 1376 1398 7.3E-32
4 g5913.t1 Pfam PF00629 MAM domain, meprin/A5/mu 233 396 2.5E-18
3 g5913.t1 Pfam PF00629 MAM domain, meprin/A5/mu 453 635 2.5E-35
1 g5913.t1 Pfam PF12810 Glycine rich protein 720 993 1.7E-16
2 g5913.t1 Pfam PF07714 Protein tyrosine and serine/threonine kinase 1139 1405 3.2E-91
22 g5913.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 18 -
23 g5913.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 4 -
24 g5913.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 5 13 -
26 g5913.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 14 18 -
21 g5913.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 19 1048 -
25 g5913.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 1049 1073 -
20 g5913.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 1074 1648 -
37 g5913.t1 ProSitePatterns PS01209 LDL-receptor class A (LDLRA) domain signature. 421 444 -
40 g5913.t1 ProSitePatterns PS00107 Protein kinases ATP-binding region signature. 1144 1172 -
38 g5913.t1 ProSitePatterns PS00109 Tyrosine protein kinases specific active-site signature. 1267 1279 -
39 g5913.t1 ProSitePatterns PS00239 Receptor tyrosine kinase class II signature. 1298 1306 -
42 g5913.t1 ProSiteProfiles PS50060 MAM domain profile. 231 398 14.266
43 g5913.t1 ProSiteProfiles PS50068 LDL-receptor class A (LDLRA) domain profile. 405 445 10.588
41 g5913.t1 ProSiteProfiles PS50060 MAM domain profile. 451 637 22.602
44 g5913.t1 ProSiteProfiles PS50011 Protein kinase domain profile. 1138 1414 39.17
35 g5913.t1 SMART SM00137 MAM_2 228 398 8.9E-5
33 g5913.t1 SMART SM00192 LDLa_2 405 446 9.7E-5
34 g5913.t1 SMART SM00137 MAM_2 448 637 1.5E-5
32 g5913.t1 SMART SM00219 tyrkin_6 1138 1405 2.7E-139
36 g5913.t1 SMART SM00220 serkin_6 1138 1409 6.6E-44
13 g5913.t1 SUPERFAMILY SSF49899 Concanavalin A-like lectins/glucanases 230 396 5.22E-11
12 g5913.t1 SUPERFAMILY SSF49899 Concanavalin A-like lectins/glucanases 449 635 2.22E-24
14 g5913.t1 SUPERFAMILY SSF56112 Protein kinase-like (PK-like) 1130 1433 4.49E-78
15 g5913.t1 SignalP_EUK SignalP-noTM SignalP-noTM 1 18 -
29 g5913.t1 SignalP_GRAM_NEGATIVE SignalP-noTM SignalP-noTM 1 18 -
30 g5913.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 1050 1072 -
31 g5913.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 1334 1356 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

Data is missing for g5913/g5913.t1; file /home/yuki.yoshida/nias/analysis/reanalysis/18_revice/midgebase/iupred3/g5913.t1.fa.iupred3.txt does not exist

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0016020 membrane CC
GO:0005524 ATP binding MF
GO:0006468 protein phosphorylation BP
GO:0004713 protein tyrosine kinase activity MF
GO:0007169 transmembrane receptor protein tyrosine kinase signaling pathway BP
GO:0005515 protein binding MF
GO:0004672 protein kinase activity MF
GO:0004714 transmembrane receptor protein tyrosine kinase activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values