Gene loci information

Transcript annotation

  • This transcript has been annotated as Protein sevenless.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g12149 g12149.t1 isoform g12149.t1 21385275 21399913
chr_1 g12149 g12149.t1 exon g12149.t1.exon1 21385275 21385989
chr_1 g12149 g12149.t1 cds g12149.t1.CDS1 21385275 21385989
chr_1 g12149 g12149.t1 exon g12149.t1.exon2 21386048 21386176
chr_1 g12149 g12149.t1 cds g12149.t1.CDS2 21386048 21386176
chr_1 g12149 g12149.t1 exon g12149.t1.exon3 21386254 21386395
chr_1 g12149 g12149.t1 cds g12149.t1.CDS3 21386254 21386395
chr_1 g12149 g12149.t1 exon g12149.t1.exon4 21386462 21386834
chr_1 g12149 g12149.t1 cds g12149.t1.CDS4 21386462 21386834
chr_1 g12149 g12149.t1 exon g12149.t1.exon5 21386896 21387194
chr_1 g12149 g12149.t1 cds g12149.t1.CDS5 21386896 21387194
chr_1 g12149 g12149.t1 exon g12149.t1.exon6 21387260 21391812
chr_1 g12149 g12149.t1 cds g12149.t1.CDS6 21387260 21391812
chr_1 g12149 g12149.t1 exon g12149.t1.exon7 21391888 21392261
chr_1 g12149 g12149.t1 cds g12149.t1.CDS7 21391888 21392261
chr_1 g12149 g12149.t1 exon g12149.t1.exon8 21392454 21392536
chr_1 g12149 g12149.t1 cds g12149.t1.CDS8 21392454 21392536
chr_1 g12149 g12149.t1 exon g12149.t1.exon9 21392609 21392753
chr_1 g12149 g12149.t1 cds g12149.t1.CDS9 21392609 21392753
chr_1 g12149 g12149.t1 exon g12149.t1.exon10 21393202 21393435
chr_1 g12149 g12149.t1 cds g12149.t1.CDS10 21393202 21393435
chr_1 g12149 g12149.t1 exon g12149.t1.exon11 21393503 21393859
chr_1 g12149 g12149.t1 cds g12149.t1.CDS11 21393503 21393859
chr_1 g12149 g12149.t1 exon g12149.t1.exon12 21393934 21393993
chr_1 g12149 g12149.t1 cds g12149.t1.CDS12 21393934 21393993
chr_1 g12149 g12149.t1 exon g12149.t1.exon13 21396116 21396193
chr_1 g12149 g12149.t1 cds g12149.t1.CDS13 21396116 21396193
chr_1 g12149 g12149.t1 exon g12149.t1.exon14 21399704 21399913
chr_1 g12149 g12149.t1 cds g12149.t1.CDS14 21399704 21399913
chr_1 g12149 g12149.t1 TSS g12149.t1 21400068 21400068
chr_1 g12149 g12149.t1 TTS g12149.t1 NA NA

Sequences

>g12149.t1 Gene=g12149 Length=7752
ATGAAATACAAATTAATTGGTCTTAATAAACCAACATGCAATAATAATTATAGAAATAAA
TTTGTGATAGCTTTAGTGATTTTTATATTATGTATAAATAAGCAAATTTCATGCATTGAT
AGTGTTGAAGATGAAAAAAGTTTGGAAATTGATATTCCCGAAGAAAAAAATGATTTTGTT
GAAAAGTGCGAGAAAAAATGCAAAGATCAGAATCGTACACAATTTGATACAAATGACAAT
GACGATGGTGTAGATACAGTTTGTAAAAGTAATTGTACAATCAAGCAGTGCGATATTGGT
TGTACACTTTGGGAAAGTGCCTTGGATTCATCATGCCAAAAAGTTTGCAATAAATCTGAA
GAGCAGCAGTATGAATTTGACAATCGACAAATTTATTGCATAAAAGGCTGCAACGATGCC
TCAAATTTTTACTTTCAGTGGATCAAACAAGAAGTGAAATCGCCAGTTGCACCCGCTCTT
ATTCCTGATTCACTTACATCAACTACTTTGTCTCTTGAATGGTCTGTACCATCAAAAATT
ACTGAACTTGCAAAAGGAAATCTTTTTAAAAAGACAAAAAATGAAAAATATTTTGTGCAA
TGCTATGAAGATTACGAAGATGATTGGAAACTTTGTGGAAAACAAACAATTTATGGAAAT
TCAACAATTCACATGGAAAACTTGCAACCATATACAAAATATAGATTCAGAGTGGCATTG
TTATTATCAGAAAATGAAGCAATTTATTCGGAACCAAGTGTAATAATCAGCACAAATGAA
GATGGCATTCCAACCTCGCAACCTCAAATTCTTCAAGTCGGTGCTGTTGATCAGACACGA
ATTATGGTGTCATGGAAATCAGGATTAAGAAATAATGGACCAATTCTATCGTTTAATCTA
CAGATTAGAGATTTATCTCATGGATACACAGCCATTAAGGAAGTTCCAGCATTAAACCCA
ACAAATTATTATATATTTGAGAAGCTATCACCCGAGAGAAACTATTCAATTCAAATAAGA
ACACGAAATGCTCGAGGATTGGGCCCTTATAGTGAAGCAGTTACAATGACAGCTCCTTAT
TTAAAAGACAGTGATGGAACGGAGATTCCTAAATTGCTACTTGCAACTGAACATAAAATT
GTATCACAAGGAAAAGACTTTTTAGCTGCGCAACCAGTAACATTCTATACGACCAATACG
TCTAAAATTACTGCCATTGCATTAAATGTTCGCAAGAAATTGATTTTTGTGGCAGAAGAG
AATGGATGCATTTATCAGGGGTCACTTGATCTAGCAAGAAAGAATGAAAAGAAAGAGGTC
ATTTGTCAGAAAAATGGATTAAATTTCAAGCCCTCACTGTTATCTGTTGATTGGCTAAAT
GATCATCTTTATATTATGGGTGAAATGACATCAAATTCACAACATCAAAGCATCAAGAGT
TGGTCTATATCAAGATGTGATTATGATGGAAAGAAGCTGATTGTTGCTGTTGGTGGACTC
AATGAAAAACCTGCATATATCGAAATTGATCCTTATAATGGTTATCTATTCTGGGTAATT
ACTGGTGAATCAATGGCAGCAGATGGTCTTTTCAAACTTGATTTGGGTGATATTTCGAAT
GGAATCAAACATGAAACAAAGCCAACAAAATTAATTGATCAAACAAAATTAGGTGCATTT
GTAGTTGAACCAACAAGATTTCGATTACTTGTGCCTTATCAAAATGATAATACAATTATG
GCAGTGTCATTGAGTGGAAATTCTGAAGATATTAGAAAGAATACACAAAGTCCATTGCTA
CATTCTGTGAAATCTTTAGTTCAACTTAATGGACTTTTCTATTGGACAAACGGAATGGAA
TATCGAGCAGAAGAATATCATGAAAAACACAATGTTTATTATCATAATTCATATCCTGAT
GCATCAAATACATCTATAGTGGCAATTCGTGTTAATTCATCTGTTTCTCAACCAATTCCT
ATTCCATTGAATCCTCCCAAAAGTGTACAAGCTCTTTTAAGTTCTGATCGTGCTAAAATT
TCATGGGAAACGCCAGAAGCATTTGGTGAACAAGGAAAAGGTGCTTTTAAAAATTGGTTC
TATCGACTTGAAGTATCAGATGGTGAACAGAGGCAGAAAATTGAGAATATCTCAGGAAAT
GCTTATATTGTTGATAGTTTAAAGCCTGATCAGCTTTATTCATTCAAAGTAGCAGCATAC
ACATCAGGTGGATCAGGTCAATGGTCTAAAGAATTTAAAGCAAAAACACTCAAGAGTTCA
GATGAACGTCATCTAATTTGGGCATCAAATGAAGGTCTCATGCAATCAGATGTCACTGGC
GAGAACATCATTACACTCATTTCAAAAGAAGAACTCGGAGATGTAACAATTACTGATGTT
ACATGGTATGATGACCTTCTTTATATTGTTGCCAATGCTACTCTTAGGATTTATAATCGA
ACAAGTGGTATGCTCAATAAATTGAACGAAATCGATTGCGTTGGTGGTGTGGCAATTGAT
TGGATAGGCAAACGTCTTTATTGGTCAAATCCATCGCAACAAGTGATACAGCATAGCAAT
TTGGTCGGTAGACAAGCTGAACCATTGCCATTTAGTGCGACCGTTCGTGAAATTAAAATT
GATGCATTGAGGGGAAATATTTACTATTCAACTGGTCTCACAATTGAAAGTTGTCGCTTA
AATGGTCGCAATGAGCGAAAATATTTTCGGGTTGAGCCATACAGCGGGAAGCAAGTGATA
GGATTGACGCTTGATATGGACAATCAGATGATTTATTGGATTGTTAAGAGCAATTCTGGC
ACGAGTTTGTTTTCTGCAAATTTCATGGACTCTTTGAGTGATAAAGATGAATATGGTGAA
GAAAAACTCACTGAGAAGAATCTTTATGGACCATTAATTCACTTTAGTGATCGACTTGTA
TGGCGACAAAATGATAAAACGATTGTTTTTAGTGATTTGAATGGAAAAAATTTGGCATTT
TTTGAGAATGAAAAGCTTAATGGTTTGACTTATATAGTTGTAATTGATAAAACACATCAT
CAATATCCTAAAACATTGCATAATGAAGTTGTTGTCATTCCAAATTCAGTCACAGCTTCT
TCCGTTCAAATTATCGGCACTTATAAATTTTTCAACATCACATGGGATGCAGTTAAGAAT
GTCAATTATGGCGAAGTGTATTATGATATCAGAGTAAAAGCACCAAAAATTCCTGATGTC
GTTGCAGAACAGAAAAGTAATGTTTTTCAATTTCCAACAAATACTCTTGAGCCTTATACA
TTACTTGAAATTTATATTCGTGCCTCAACTTCGTGGGGTTCATCTACACCAACAAAAATT
CAAATTACTTCACCTCCAGGATATCCAAGTGAACCAACAAGTCCAAGAGTTTTTACACGA
CATCTTTATAGACCGTTTGAAGGAAATGTTCAAATTAGTGCAATTTTTAGATGGTCATTG
CCGAAGCAACCAAATGGAAATATTCTTGGCTACAAAGTGCGTTGTAATGTTTATGAAAAT
ATGGAATTGAAATTACTGAAGAACAAAACAGTGACTGGCTTGGAGCAGATTTTTGAAGGT
CTTAAAAAAGACACTGACTATATTTTTGAAGTGCAAGCATATACAGAAGTTGGTGATGGA
AATTTTTCTGAAACTGTTAAAATAACAACCACATATGAAAGGCCAGTGCCCCGAGTTTTA
GTTTCAACGCAAGAGGATATAATCGAAGTCGATTTAGATAAGCAGCAATCAAGTATGGTA
CAAAGTACGAGAAATCCGATAGTCGTCTTCACACATATTGCGCATGAAAATAAGCTCTAT
TGGTTCAATGATAATAATGAACTTCAATCATACAACATGGAAACGCGAGAAAAAGTCAAA
TTACTGAGCACTAATTCGACTGTGCAAGCAATGACAATAGATTGGATCGGTAGAATCCTT
TATTGGTCACAAAACGATGAAAATCGTGGAGCAATTTATTCTTATAATCTAAACAGAGCT
GAAAACAATAATTATCGTCCATTCAATGACCAGAACTATGCATTTAAAATTGTTGATAGA
GAAGATGTCATTTCTGATTTAGTTGTTTCACCTTACGATAGAAAACTCTTTTGGATTGAA
AACCATGAAAAATTACCAGAAGAATCTGGAATTTATTATCTCGATCTTGATACAAATGAC
ATTAAAATGTTGTTTGATGAAAATGATGTTTGCCTTAATCAAACTACTCTAACAATGTCA
TTCAATCCTGGTTCTTTAATTTTTGCTACATCTCCGATTAATCCGTTAGAGAATATAAAT
ATCCGCAGACATGAATCTATTCTCATTTTTGAAATGCGTAATGGTTTCACAGCAACTGAT
ATAGCAACGAAAAAGTGCTTCGATTTTGGCTCAATTTTTAATGATAAGGGCACAAATTTA
GCAAAAGATAGTAATCGAGTTTATTGGATTAATGAAAATTTAATTTATGCAAGAGATGAT
TTTACTCAAAAGACAATAAATCTTGCAGTTCCACCAAAATCAAATCGTCTCTTGGCTTTT
TATCAACAATACTTTCCGAAAAAACGTTGCCTCATTCCTAATAATAAAAGCAAACGCGTC
AAACTGCTCGCAAGTACAGACACTACATTAAAGATCGAATTACCAAAGCCAGAATTGCCT
CCCGAATGTAAGTTAAAAAGCGTTCCTATAAAGTATACAATTTTATATACAGACACTGAA
CTAGGCAATTTAAGTGCGACTGACTTAAGTTGTTTAAATGGTAGTGATTATTGCCGCAAA
ATTGAAACTTATAATCGTATCGAAACTATTGAAAAACTTAATCCATTTACACATTATTCA
ATTCAAATTGCATTATCAAGTGTATTTGATAAATCGGAACATCTGCAATTTGGTGAAATT
TCTGATTTTCGTACTGATTCTGGCACGCCATCGCCTCCAAGAAATATTTCAGCAATCCCA
TTAAGTTTCAATGAAGTTCTCGTCAATTGGCAACGACCAGCAACATTTAATGCACCTAAA
ATTTACTATAAAATTCTCTGGGAAACAATTCAAACTGAAAGTAACATGAAAAACAATCAT
GAGATCACAGTAAATGACTCAAGTGATAGAAGCGACAATTCGCACAATAATGAATACCTT
TCGACAATTCTCAACAAGATAACACCAAATCAAAAATATAAAATCTCTGTGAGTGCTTGT
TCAAAGAATGAAACATGCAGCGAAAGTGAGAAAATTTACGTAATATCTTACCCAGAACCT
GAAACTATCAAACTTAATTCAATGACGCCAACAAGTATGATTATTGAATGGAACGCTCCT
GAGAATATTTCGCAATTTGAAATTCAGTACTCGAGACAAGATTCATATGATATTGTTAGT
AATGTTAGTAAAACTCACATGTCGGAACATCCAAATTACTTTTTGGTCGACAATTTGGAA
CCAAAAACAAAATATAACTTTTCAATTTCAATAAAATACATCAATAGTAATCATACATAT
CGTTGGATGCCACATAGTAAAATTGAATTTGAAACACTCGGTGATGTACCATCATCGCCA
GGTAAACCAAATGTTGAATTTTGTAAGGAAAAAGTTTGTAAGATTGTATGGAATGCATCA
AAAGAAAATGGTGCAACAATATTAGAATATGTTCTTGAATCAATGAAAATTCAAAATCCA
CTTGAATGGGAAAAATCAAGGTCGAAGAGAGCGATAGATGAGTCAAATGATATCGGTCAA
GAACATAACAATGAAATTGATGAAGTAACAGAAATGCCTACCGAACCTGATAGTGATGAA
AAAGAAAATTGGGTTATAAGATATAATGGAACCGACACTCATTTCATTGCCACTGAATTG
GAGCAAATAGACCAACATGTGTTTCGTGTCAAGGCACAAAATAATTACGGATGGAGTGCG
TATAGTCCTATCAGTGATCTAATCAATTCAACACTCATTAGTGGCAAATTTTTAAACAAC
AACAAAGCAGAGTCAAGAAATATTTTGATGGCAGCAGTGTCAATTATTTTTGCCTTCGTC
ATAATTTTTGTTTCTGTCACATGCCTAATTTTCATATTTTACCGAAATCGTCGTAATAAA
AAAATGAGAGATGGACTTCCAATTCCAGATTTTGAAATGATGGCATTGAGAGACATGCAG
GCTGGTGAAAATATCTTGATTCAAAATAGAAATATCTTGTACAATCACTTTTATGGACCA
GCATTTCTTAATCCGGAAATTAAACATTTACCACAAATTCATCAGAATCAAATAATCATC
ACTGATAGATGTCTTGGAAAAGGCGCATTTGGTGAAGTGTGGAGTGGAATTGTAAAAAAT
GATGATGGAAGTGAAGAACTTGTTGCTATAAAAACACTTCATAAAGGTGCCAATGATGTT
GAAAAACGTGAATTTCTACAAGAAGCTCAATTAATGAGTAATTTTAAACACGATCATATT
TTGCGCTTGATCGGTGTTTGCTTGAATCAAAATGATTGTTTGCTTTATATTGTTATGGAA
TTAATGGAAAGTGGAGATTTGCTAAGTTTTTTAAGAAACAATAGACCAACAATGAATAAG
CAATCTCCATTAAAACTTAATGATTTGATTTCAATGTGTGTTGATGTTGCTTCTGGTTGT
CGCTATTTAGAGGAAATGCATTTTGTCCATAGAGATATTGCAGCAAGAAATTGTTTAGTC
AAAACGACACCGGATATAACTGGAATGAATTTGGTGGTAAAAATCGGCGACTTTGGTCTT
GCCCGAGATATTTATAAAAATGACTATTATCGAAAAGAAGGTGAGGGATTATTGCCTGTG
AGATGGATGGCACCAGAATCATTGATTGATGGTGTTTTTACATCTCAATCAGATGTTTGG
GCTTTTGGTGTTTTAATGTGGGAAGTAATGACACTTGGACATCAACCATATCCTGCTAGA
ACAAATATTGAGGTATTACAATATGTGAGAAGTGGTGGAAGATTACACAGACCTCAACAA
AATTGTCCTGAAGAATTGTATCAATTAATGACAAAATGTTGGAATAGAGTAGACCAACGA
CCGACATTTAGATATTGTCTTGAAATTCTTACACAACTTCATGAAAATTACATGTATATT
TATTCTGATATTGAGCTACCTTTTCCTAATGATATGAACTGCAAATATGTATCGAGTGAT
GATTCTTCGATAATGAATGAGAAATTAAATTCTAATGAAACTAATGAGACATCAGCAGAG
CATCAACTACCAACTTCAATACCAAAATATCTTGAGTTAATGTATGATGAAAATGATGAA
AGAAACACTGAAAATATGTGTGAAGCTCCATTAAATTATAGTGATAAATACCAAAATAAT
AATAACAATATGGAAAATATTGAAAATAATCAAAGAACGACAAAAGACGAAGGCTATGAA
ATTCCAATATCGTTTGATAACGATTTAGATATAAATGCAAATAATGAATTAAATAGTAAT
CTCACTAAGTCACGTACTCTATCGAACTCATCAACTATTAGTCATAAGAGCGATCATAAT
AATCAGCAACAAATACAACAAATTTCACCACCGCCGCATAATCATCACCATCATCCCATT
TTTCCTATACCTATTGAAAATTGTAAGCGATCATCATTAATACTAGATAGAGATCATAAA
ATTTATCCACAAACAAAAATTTTGACGAATGGAGTTGTATCACAAATTAAACATCAAAGT
GGATGGGTTTGA

>g12149.t1 Gene=g12149 Length=2583
MKYKLIGLNKPTCNNNYRNKFVIALVIFILCINKQISCIDSVEDEKSLEIDIPEEKNDFV
EKCEKKCKDQNRTQFDTNDNDDGVDTVCKSNCTIKQCDIGCTLWESALDSSCQKVCNKSE
EQQYEFDNRQIYCIKGCNDASNFYFQWIKQEVKSPVAPALIPDSLTSTTLSLEWSVPSKI
TELAKGNLFKKTKNEKYFVQCYEDYEDDWKLCGKQTIYGNSTIHMENLQPYTKYRFRVAL
LLSENEAIYSEPSVIISTNEDGIPTSQPQILQVGAVDQTRIMVSWKSGLRNNGPILSFNL
QIRDLSHGYTAIKEVPALNPTNYYIFEKLSPERNYSIQIRTRNARGLGPYSEAVTMTAPY
LKDSDGTEIPKLLLATEHKIVSQGKDFLAAQPVTFYTTNTSKITAIALNVRKKLIFVAEE
NGCIYQGSLDLARKNEKKEVICQKNGLNFKPSLLSVDWLNDHLYIMGEMTSNSQHQSIKS
WSISRCDYDGKKLIVAVGGLNEKPAYIEIDPYNGYLFWVITGESMAADGLFKLDLGDISN
GIKHETKPTKLIDQTKLGAFVVEPTRFRLLVPYQNDNTIMAVSLSGNSEDIRKNTQSPLL
HSVKSLVQLNGLFYWTNGMEYRAEEYHEKHNVYYHNSYPDASNTSIVAIRVNSSVSQPIP
IPLNPPKSVQALLSSDRAKISWETPEAFGEQGKGAFKNWFYRLEVSDGEQRQKIENISGN
AYIVDSLKPDQLYSFKVAAYTSGGSGQWSKEFKAKTLKSSDERHLIWASNEGLMQSDVTG
ENIITLISKEELGDVTITDVTWYDDLLYIVANATLRIYNRTSGMLNKLNEIDCVGGVAID
WIGKRLYWSNPSQQVIQHSNLVGRQAEPLPFSATVREIKIDALRGNIYYSTGLTIESCRL
NGRNERKYFRVEPYSGKQVIGLTLDMDNQMIYWIVKSNSGTSLFSANFMDSLSDKDEYGE
EKLTEKNLYGPLIHFSDRLVWRQNDKTIVFSDLNGKNLAFFENEKLNGLTYIVVIDKTHH
QYPKTLHNEVVVIPNSVTASSVQIIGTYKFFNITWDAVKNVNYGEVYYDIRVKAPKIPDV
VAEQKSNVFQFPTNTLEPYTLLEIYIRASTSWGSSTPTKIQITSPPGYPSEPTSPRVFTR
HLYRPFEGNVQISAIFRWSLPKQPNGNILGYKVRCNVYENMELKLLKNKTVTGLEQIFEG
LKKDTDYIFEVQAYTEVGDGNFSETVKITTTYERPVPRVLVSTQEDIIEVDLDKQQSSMV
QSTRNPIVVFTHIAHENKLYWFNDNNELQSYNMETREKVKLLSTNSTVQAMTIDWIGRIL
YWSQNDENRGAIYSYNLNRAENNNYRPFNDQNYAFKIVDREDVISDLVVSPYDRKLFWIE
NHEKLPEESGIYYLDLDTNDIKMLFDENDVCLNQTTLTMSFNPGSLIFATSPINPLENIN
IRRHESILIFEMRNGFTATDIATKKCFDFGSIFNDKGTNLAKDSNRVYWINENLIYARDD
FTQKTINLAVPPKSNRLLAFYQQYFPKKRCLIPNNKSKRVKLLASTDTTLKIELPKPELP
PECKLKSVPIKYTILYTDTELGNLSATDLSCLNGSDYCRKIETYNRIETIEKLNPFTHYS
IQIALSSVFDKSEHLQFGEISDFRTDSGTPSPPRNISAIPLSFNEVLVNWQRPATFNAPK
IYYKILWETIQTESNMKNNHEITVNDSSDRSDNSHNNEYLSTILNKITPNQKYKISVSAC
SKNETCSESEKIYVISYPEPETIKLNSMTPTSMIIEWNAPENISQFEIQYSRQDSYDIVS
NVSKTHMSEHPNYFLVDNLEPKTKYNFSISIKYINSNHTYRWMPHSKIEFETLGDVPSSP
GKPNVEFCKEKVCKIVWNASKENGATILEYVLESMKIQNPLEWEKSRSKRAIDESNDIGQ
EHNNEIDEVTEMPTEPDSDEKENWVIRYNGTDTHFIATELEQIDQHVFRVKAQNNYGWSA
YSPISDLINSTLISGKFLNNNKAESRNILMAAVSIIFAFVIIFVSVTCLIFIFYRNRRNK
KMRDGLPIPDFEMMALRDMQAGENILIQNRNILYNHFYGPAFLNPEIKHLPQIHQNQIII
TDRCLGKGAFGEVWSGIVKNDDGSEELVAIKTLHKGANDVEKREFLQEAQLMSNFKHDHI
LRLIGVCLNQNDCLLYIVMELMESGDLLSFLRNNRPTMNKQSPLKLNDLISMCVDVASGC
RYLEEMHFVHRDIAARNCLVKTTPDITGMNLVVKIGDFGLARDIYKNDYYRKEGEGLLPV
RWMAPESLIDGVFTSQSDVWAFGVLMWEVMTLGHQPYPARTNIEVLQYVRSGGRLHRPQQ
NCPEELYQLMTKCWNRVDQRPTFRYCLEILTQLHENYMYIYSDIELPFPNDMNCKYVSSD
DSSIMNEKLNSNETNETSAEHQLPTSIPKYLELMYDENDERNTENMCEAPLNYSDKYQNN
NNNMENIENNQRTTKDEGYEIPISFDNDLDINANNELNSNLTKSRTLSNSSTISHKSDHN
NQQQIQQISPPPHNHHHHPIFPIPIENCKRSSLILDRDHKIYPQTKILTNGVVSQIKHQS
GWV

Protein features from InterProScan

Transcript Database ID Name Start End E.value
47 g12149.t1 CDD cd00063 FN3 155 257 2.35824E-9
46 g12149.t1 CDD cd00063 FN3 266 357 2.30968E-13
45 g12149.t1 CDD cd00063 FN3 664 756 3.07221E-10
44 g12149.t1 CDD cd00063 FN3 1153 1230 2.56338E-12
43 g12149.t1 CDD cd00063 FN3 1650 1743 3.62023E-8
42 g12149.t1 CDD cd00063 FN3 1760 1830 6.43756E-8
34 g12149.t1 Coils Coil Coil 2457 2477 -
25 g12149.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 149 259 8.5E-9
27 g12149.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 260 360 2.7E-16
33 g12149.t1 Gene3D G3DSA:2.120.10.30 TolB 369 663 1.6E-16
26 g12149.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 664 758 3.7E-13
32 g12149.t1 Gene3D G3DSA:2.120.10.30 TolB 759 1030 1.0E-18
28 g12149.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 1118 1235 7.0E-17
31 g12149.t1 Gene3D G3DSA:2.120.10.30 TolB 1236 1538 9.5E-15
24 g12149.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 1638 1758 1.3E-12
22 g12149.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 1759 1847 9.9E-10
23 g12149.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 1848 1990 9.5E-14
29 g12149.t1 Gene3D G3DSA:3.30.200.20 Phosphorylase Kinase; domain 1 2070 2181 3.9E-26
30 g12149.t1 Gene3D G3DSA:1.10.510.10 Transferase(Phosphotransferase) domain 1 2182 2384 1.0E-55
68 g12149.t1 MobiDBLite mobidb-lite consensus disorder prediction 2503 2531 -
69 g12149.t1 MobiDBLite mobidb-lite consensus disorder prediction 2503 2535 -
7 g12149.t1 PANTHER PTHR24416:SF527 PROTO-ONCOGENE TYROSINE-PROTEIN KINASE ROS 223 2518 2.0E-299
8 g12149.t1 PANTHER PTHR24416 TYROSINE-PROTEIN KINASE RECEPTOR 223 2518 2.0E-299
12 g12149.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 2179 2192 5.3E-25
9 g12149.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 2222 2240 5.3E-25
11 g12149.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 2297 2319 5.3E-25
10 g12149.t1 PRINTS PR00109 Tyrosine kinase catalytic domain signature 2342 2364 5.3E-25
6 g12149.t1 Pfam PF00041 Fibronectin type III domain 266 351 2.0E-11
2 g12149.t1 Pfam PF00041 Fibronectin type III domain 668 749 6.6E-7
4 g12149.t1 Pfam PF00041 Fibronectin type III domain 1153 1223 4.1E-8
3 g12149.t1 Pfam PF00041 Fibronectin type III domain 1651 1744 1.9E-8
5 g12149.t1 Pfam PF00041 Fibronectin type III domain 1758 1830 5.1E-9
1 g12149.t1 Pfam PF07714 Protein tyrosine and serine/threonine kinase 2102 2369 5.4E-93
37 g12149.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 38 -
38 g12149.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 20 -
39 g12149.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 21 32 -
41 g12149.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 33 38 -
36 g12149.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 39 2007 -
40 g12149.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 2008 2034 -
35 g12149.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 2035 2583 -
67 g12149.t1 ProSitePatterns PS00107 Protein kinases ATP-binding region signature. 2105 2131 -
65 g12149.t1 ProSitePatterns PS00109 Tyrosine protein kinases specific active-site signature. 2228 2240 -
66 g12149.t1 ProSitePatterns PS00239 Receptor tyrosine kinase class II signature. 2263 2271 -
73 g12149.t1 ProSiteProfiles PS50853 Fibronectin type-III domain profile. 154 261 12.986
75 g12149.t1 ProSiteProfiles PS50853 Fibronectin type-III domain profile. 267 361 19.061
76 g12149.t1 ProSiteProfiles PS50853 Fibronectin type-III domain profile. 662 759 14.904
72 g12149.t1 ProSiteProfiles PS50853 Fibronectin type-III domain profile. 1033 1126 9.413
70 g12149.t1 ProSiteProfiles PS50853 Fibronectin type-III domain profile. 1131 1234 17.12
77 g12149.t1 ProSiteProfiles PS50853 Fibronectin type-III domain profile. 1652 1758 10.28
71 g12149.t1 ProSiteProfiles PS50853 Fibronectin type-III domain profile. 1759 1855 13.815
74 g12149.t1 ProSiteProfiles PS50853 Fibronectin type-III domain profile. 1896 1994 7.551
78 g12149.t1 ProSiteProfiles PS50011 Protein kinase domain profile. 2099 2377 37.531
51 g12149.t1 SMART SM00060 FN3_2 155 247 0.053
55 g12149.t1 SMART SM00060 FN3_2 265 348 7.0E-6
60 g12149.t1 SMART SM00135 LY_2 439 493 100.0
57 g12149.t1 SMART SM00060 FN3_2 662 746 5.3E-7
64 g12149.t1 SMART SM00135 LY_2 824 866 0.29
63 g12149.t1 SMART SM00135 LY_2 867 905 98.0
62 g12149.t1 SMART SM00135 LY_2 906 954 95.0
54 g12149.t1 SMART SM00060 FN3_2 1034 1115 12.0
53 g12149.t1 SMART SM00060 FN3_2 1129 1220 0.18
59 g12149.t1 SMART SM00135 LY_2 1298 1342 0.62
61 g12149.t1 SMART SM00135 LY_2 1356 1401 58.0
56 g12149.t1 SMART SM00060 FN3_2 1650 1747 1.7E-6
50 g12149.t1 SMART SM00060 FN3_2 1757 1838 6.9E-4
52 g12149.t1 SMART SM00060 FN3_2 1857 1979 0.12
58 g12149.t1 SMART SM00219 tyrkin_6 2099 2370 3.2E-135
16 g12149.t1 SUPERFAMILY SSF49265 Fibronectin type III 160 357 4.84E-26
13 g12149.t1 SUPERFAMILY SSF63825 YWTD domain 391 621 2.49E-11
18 g12149.t1 SUPERFAMILY SSF49265 Fibronectin type III 657 758 1.0E-12
21 g12149.t1 SUPERFAMILY SSF82171 DPP6 N-terminal domain-like 759 1413 1.88E-15
19 g12149.t1 SUPERFAMILY SSF49265 Fibronectin type III 1050 1235 1.44E-21
14 g12149.t1 SUPERFAMILY SSF63825 YWTD domain 1237 1493 5.89E-17
15 g12149.t1 SUPERFAMILY SSF49265 Fibronectin type III 1566 1757 5.56E-18
17 g12149.t1 SUPERFAMILY SSF49265 Fibronectin type III 1760 1985 1.65E-20
20 g12149.t1 SUPERFAMILY SSF56112 Protein kinase-like (PK-like) 2102 2401 1.73E-73
49 g12149.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 2011 2033 -
48 g12149.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 2160 2182 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0016020 membrane CC
GO:0005524 ATP binding MF
GO:0006468 protein phosphorylation BP
GO:0004713 protein tyrosine kinase activity MF
GO:0007169 transmembrane receptor protein tyrosine kinase signaling pathway BP
GO:0005515 protein binding MF
GO:0004672 protein kinase activity MF
GO:0004714 transmembrane receptor protein tyrosine kinase activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values