Gene loci information

Transcript annotation

  • This transcript has been annotated as histone-arginine N-methyltransferase.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is indicated below. CDS regions are colored in green. TSS and TTS positions that were predicted with CTR-Seq data are indicated by solid circles and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g2655 g2655.t2 TSS g2655.t2 19363007 19363007
chr_3 g2655 g2655.t2 isoform g2655.t2 19363118 19383354
chr_3 g2655 g2655.t2 exon g2655.t2.exon1 19363118 19364228
chr_3 g2655 g2655.t2 cds g2655.t2.CDS1 19363118 19364228
chr_3 g2655 g2655.t2 exon g2655.t2.exon2 19364297 19367519
chr_3 g2655 g2655.t2 cds g2655.t2.CDS2 19364297 19367519
chr_3 g2655 g2655.t2 exon g2655.t2.exon3 19367822 19368000
chr_3 g2655 g2655.t2 cds g2655.t2.CDS3 19367822 19368000
chr_3 g2655 g2655.t2 exon g2655.t2.exon4 19368062 19368276
chr_3 g2655 g2655.t2 cds g2655.t2.CDS4 19368062 19368276
chr_3 g2655 g2655.t2 exon g2655.t2.exon5 19369220 19370884
chr_3 g2655 g2655.t2 cds g2655.t2.CDS5 19369220 19370884
chr_3 g2655 g2655.t2 exon g2655.t2.exon6 19371535 19371667
chr_3 g2655 g2655.t2 cds g2655.t2.CDS6 19371535 19371667
chr_3 g2655 g2655.t2 exon g2655.t2.exon7 19371729 19373548
chr_3 g2655 g2655.t2 cds g2655.t2.CDS7 19371729 19373548
chr_3 g2655 g2655.t2 exon g2655.t2.exon8 19373610 19373728
chr_3 g2655 g2655.t2 cds g2655.t2.CDS8 19373610 19373728
chr_3 g2655 g2655.t2 exon g2655.t2.exon9 19373792 19374065
chr_3 g2655 g2655.t2 cds g2655.t2.CDS9 19373792 19374065
chr_3 g2655 g2655.t2 exon g2655.t2.exon10 19374650 19374781
chr_3 g2655 g2655.t2 cds g2655.t2.CDS10 19374650 19374781
chr_3 g2655 g2655.t2 exon g2655.t2.exon11 19374837 19379666
chr_3 g2655 g2655.t2 cds g2655.t2.CDS11 19374837 19379666
chr_3 g2655 g2655.t2 exon g2655.t2.exon12 19379729 19382621
chr_3 g2655 g2655.t2 cds g2655.t2.CDS12 19379729 19382621
chr_3 g2655 g2655.t2 exon g2655.t2.exon13 19382684 19382949
chr_3 g2655 g2655.t2 cds g2655.t2.CDS13 19382684 19382949
chr_3 g2655 g2655.t2 exon g2655.t2.exon14 19383015 19383197
chr_3 g2655 g2655.t2 cds g2655.t2.CDS14 19383015 19383197
chr_3 g2655 g2655.t2 exon g2655.t2.exon15 19383262 19383354
chr_3 g2655 g2655.t2 cds g2655.t2.CDS15 19383262 19383354
chr_3 g2655 g2655.t2 TTS g2655.t2 19384274 19384274
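
The coordinates in the table can be cross-checked against the sequence length reported under Sequences. The sketch below is only an illustration: it assumes the Start and End columns are 1-based and inclusive (the page does not state the convention), and the names EXONS, exon_lengths and intron_lengths are made up for this example. Under that assumption the exon lengths add up to the 17136 nt given in the nucleotide FASTA header.

```python
# Exon (start, end) pairs copied from the table above.
# Assumption: coordinates are 1-based and inclusive.
EXONS = [
    (19363118, 19364228), (19364297, 19367519), (19367822, 19368000),
    (19368062, 19368276), (19369220, 19370884), (19371535, 19371667),
    (19371729, 19373548), (19373610, 19373728), (19373792, 19374065),
    (19374650, 19374781), (19374837, 19379666), (19379729, 19382621),
    (19382684, 19382949), (19383015, 19383197), (19383262, 19383354),
]


def exon_lengths(exons):
    """Length of each exon, counting both end points."""
    return [end - start + 1 for start, end in exons]


def intron_lengths(exons):
    """Length of each gap between consecutive exons."""
    return [
        next_start - prev_end - 1
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


total_exonic = sum(exon_lengths(EXONS))
print(total_exonic)               # 17136
print(len(intron_lengths(EXONS)))  # 14
```

Because every exon row has a CDS row with identical coordinates, the same total applies to the coding sequence.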

Sequences

>g2655.t2 Gene=g2655 Length=17136
ATGGATAGTGCAGCGAGTTTTGATGACGATAACTCGGATAGTGATTATATGAACGTTTCC
TCATCATCTGAAAGTTCAGAAGAAAATGAATTTCCAGCAAAACGTAAGAACGTACCAATC
ACTAAACGCAGAGATGGACTCGATATTAGTAAAAATCCATTATCGATTATTAAATCGAGG
CTTAATATTAGGCAATCGACAAGTCAGGCGGCAGTTAAAATGTGTTTAAAGCGAAAATTC
TCTTTTACAAACGCAACGACTACAACAACAACAAAAAAGGAAAATGATGGTTCAGGCTCA
TCATCGTCCACCATTAACTTTCAACTCATTTCGGCTTCTTCGTCTTCGTCACAACCACAA
TCTCAATCTCAATCGCCAAATGATGAACAATCGATGGAGCTTGAGTCACCAAGTGGTAGT
GGCATTGGAAACAACCTTTCTACACTTATGCCTGCACCAATGCAGGGTAGCGATGATCCA
CCATATTTTCCTGAAAAATTTCCAAGTAAACTATGTGCCTTTTGTAACTTGGGTGAACGC
AGTCAGCTCGGACAGGGTGAAATGTTACGATTAGAACTGAGTGAAGAAGATTCGAAAAAT
GCATTAAAATTAAAATCGCAACAATCACCACAGGAAGACGAATCGAAAAATGGTAGTGAT
GATTTAAGTAAATTGTTAAAAAATGCTTCATCGTCTACATTATTACAACAACAATTAAAT
CGTCGTCAAAAAGGCTTAAACAAGTGCAAGAATCCCGTTGCAACAAACGAATATGTTGAT
GAATTGGAAAAAATTGGCTATACGGAAGCCATGGATTTGAGTCTGATTGTTGAAAATGGC
TACTATTATGTTCATCGTGCGTGTGCAATGTGGTCATTTGGTGTTGATCGAGATCCAATT
AATGAGGCATTAAGTAATGTGTCAACTGTACTGAAACAAAGTTTGAATCGTAAATGTACA
CATTGCAATCATTATGGCGCGAGTGCAGTTTGCAAAATAAATTGTCAAAAGTTTTTCCAT
TTTCCATGCATCGCTGCATCAGGGAGTTTTGAAGATTTTCAATCGTGTTCAGTGTACTGT
TTCGATCATTTGTCACAAGTTGCTGTAAATTGTGGTGAAGAGGCCTACTGTCGAGCATGT
TGTAATTTGGGTGATATAAGCAATCTCATGGTGTGTTCAAAATGTGGTGATCATTATCAT
GGAGCATGTGTTGGATTAGCACAACAACCTGGTGTAAGAGCTGGCTGGCAGTGTAAGAAT
TGTCGTTCATGTCAGATTTGTCGCATTCCTGATAATAGCGATGGTCGAACATTAGCTTGT
GAAACATGCGATAAACTTTATCATCCTCAATGTTTAAGACCAATTATGACAACTGTACCA
AAGTACGGATGGAAGTGTCGTTGTTGTCGTATTTGCAGTGATTGCGGTGCACGAACGCCT
GGAGCAGGAGCTTCTTCGCGATGGCATAATCATTTTACAGTTTGCGATTCATGCTATCAG
CAACGTAATAAGGGATATTCTTGTCCAATTTGTCGTAAAGCATATCGTGCTGCTGCTTAT
AGAGAAATGGTGAAATGTTCAATTTGTCAAAAGTTTGTGCATAGTACATGTGACCCAGAG
GCAGATTTAAACGCCTACGAACGAAGAAAGGAAGTAAATCCGGACTATGAGTATACATGC
GGAATGTGTAAAATTGCAACGCAAAATGAACGAGTTAATTTAGCGATGAGAAGAAGCAAT
AGTGGAGATGATGAAAGTCTTTCCGCATCACAAGAGAGTTTAGATGATATTGATATGGAC
ATTGATGGAAGAATGGGAAGTAGAGAAGATTTAGCACTTGGTTTGGGCAAAGGAAAACCA
ATGCTGGCAAGTAAAATAGCAAAGAAAAAACTGGGACTTAATAATACTGGTGGAATAAAC
AGCAATAATAGCGGTTTTGGACAAAGACCAAAGGGTATTGGAAAATTGGGCTTCCAAAAA
CGTGCAAGAACTTTTGAGTTGGGACGAAAGCGAGGTCCGAAATCGAAAATGCGTGGAATT
TTTGGTGTTCCTGGAATTGGCTTACAACGACCTGTTGCTGCTGATTCAAAACAAAGTGAC
GAAGAACCAGGATCAGAAAATCGTCTTGTTCTTTGTTCAGCAAAAGACCGTTTTGTGCTT
ACTCAAGATATATGTGTAATGTGTGGTGCAATTGGAACAGATCAAGAAGGCTGTCTAATT
TCATGTGCTCAGTGTGGTCAATGCTATCATCCATATTGTGTAACAGTTAAAGTTACTAAA
GAAATTCTTCAAAAAGGCTGGCGTTGTCTTGATTGTACAGTTTGTGAAGGATGTGGCGAA
AAGAATGATGAAGCACGATTAATTTTATGTGACGATTGTGACATTTCATATCATATTTAT
TGTATGGAGCCGCCATTAACGACTGTTCCACAAGGTACATGGAAATGTAAATGGTGTGCA
ATTTGCTTTAAATGCGGCTCAAACGAAGCGGGTAATAATTGTACATGGCAGAATAGCTAT
ACAGAATGTGGACCGTGTGCAAGTCAATCGAATTGTTCAGTGTGCAGTGAAATCTATGCG
GATGGTGAACTAATAATTCAATGTACAAATTGCGATCGATGGCTTCATTGTTTATGTGAT
TCAATTAAGAATGATACAGAAGCAGAAAAATGTGCAGAAGAAGGCTATATTTGTTTATTA
TGTCGACCCAAAGACATTTTAGCACCACATTTACAAATAAAGAAAAAACCACCAACTCTA
TCAACACAATCATCTACGACAACCAAAGAAGAGTCAATTGATGAAAATGATAACGCTACT
ATTTTGGCTCTCGAAGGTAGTCACTTTGTTGATGGTGTTTATTTGAGTGAGCATGGTTTA
CAATTTATAAAAACACTTCAGACAGAACCAAAACGTGCAAAACGCAAACCCAAAGTTATA
CAAGAAGCAGAAAAAGATGCTGGTATTTTGGCAGCAATTGAATCTGTTGTTTCTGGAAGT
GCTGATAATTCAATGGAAGATATCAAAATGGAACCGATGGATCCAAATGAAGAAGCTCAA
ATTTATAAAGATGGTATGACATGGACTAATAATGAACCTGCACCCGAAGGCTTCTCTCTT
TTCACAAATGAAAGTGGTCAAGTTATATTGCGCAAAAAACGTCAAAGAAATCTTCAAAAA
CTTGGTATTGGAGGATTTGCAGTGCGAAATCGAGCTGTACGAACAACAAAGAATGATGAA
GAGCCCTCAAATCAATGTGATGATGCAAAGAAGAAAAAACCGAATCGTCGAAAGCAAAAG
AATAAGCTTATTGAAACTTACCCGACCTATTTACAGGAAGCCTTTTTCGGTAGGCCATTG
ATGGATTCAAGTCTTAAAGTTAAACTTGAATCAAGCAGCAGTGATGAAGAGACAAAGAGT
AATGTCTCAGATGATAAGACAATAAAATTATCATTAGATGAATTGAAAATGATTGAAGCA
ATGCGAGCCAAGCAGCAACAAAAGTTACAAGAAGAACAAAAATTGTTACAAGTAAATGTG
GTCAATCCATTAGAGCAGCAACAACTAATGTCAAATGGTGCACAAATTCAGTTGAATCAA
AAAATGCAATCAACGGACACGACAAGTGTTTTTAATCAAAATACCACTGGCAGCATAGCA
TCAACTTCATCATCAAATATTAATCAAACAGAGATTAAAACTGAATTGGAATTAATGGAT
GATGATGAAAATAATAGTGATACAGAAGCACTAAAAGATGTTCTCGGTTTGCCAGGCGAT
CTTTTAGATAATGATTTAGTCAATACAATAATGAATGAAGACGACGATGATCTCACTAAA
AATACAGCCGGATTAGACGATGTTGATGTAAAAGGTGTTAAAGATGATTTGGCTGATATT
TTGAGTCCTCATTTTAATATTGATATGGAAGATATGTTATTCAAAAGCGTCTTAACGGAT
GAATCACAAGAGTCTCAGGAATCACAATTGACCAATTCACTAACATCGTACTCAACGCAA
TCTACACCTTCTCATCAATCTGAGGTGCAACCTATTCATCTCAATCAACCAACAAATACT
TCTCTACAAAATAATATAAATCAAATGCAATCTCCACAGCAGCAACCAAGTACACCAACC
GCTACAATTATGAATCCTATGACATTGCAACCGCAATCGCCTCAAATAAATACTCAATTA
AATCCCATGAATGTTATGCTTCAGCAACAACAACAACCCAATCAAATGCATCAAATGCAA
GCAAACATGATGCAACGTCAAAATTCTCAGCCTGGCACACCAATCGGTTTAATGCCACAA
CAACAACAAAGAGGACCACAATTCAATACAGCCAATGTATTTCAAAATCAACCACAATGG
AATGCAAATGTTGATAATGATTTTGATTTAGTCAATATGAATGCCGCTGCTAGTCAAGCA
ACAGCAGTAGCAGGTCAGCCACAAAAAATTAATAATCATCAAAAGAATTCTGAACGAATG
TCAGAAGATGAAAAACTCGGTGATATGTCAACAATATCAGCTGTTCTCTATGCAAATCAA
AATCATCCTGAGCTAAAAATTGAATATCCACATTGGCCAGATCGTTATAAGCAAATTATG
AAAAAATGGCGAGCACTATCAGCTGAAAGAAAGCAACCATATTTACAGCGTGCTCGTGAT
AATCGATCGGCACAACAAGCACAGCAACGAACTAAAAAAGCACAACAGGTTGCTCCTGAC
CTATCACCTGATAATGGAACCACGAGCACAAATAGCACTTCTCCTGCACTGAGTTCAACC
CTCTCAGAGTCTTCATCATTGATGGATGTATCAATGATTCAGCAGCAGCATCAATCACAA
CAACAGCAACAAGTGTCGACCCCAACTACAGTCAATGTCGGTCCACCACAACAACTCCAA
GTGGTAAGAAACAATCAGCAGTCCCCAGGCCGCTCTTCTACTCCTTCTTCTTCTTCTAAT
CTACTGCCGGAACATGTACTTGAAACTAACGCTTTAGATAACAAAAACGATTCGACATCT
CCTAATAATATTAAAAATAATTTACTAATCATAGCAAATGAGCACAAAGAATTTAGTTGT
AGTATTGGTGATGAAAATATTAGTTCCCCTATTTCCAATAACAGTAATAATACTATAAAT
GATAAAATCTCAGAGTCTGATGAGGCGAATTTGCTGTTGCAAGTCAAATCCTCTTCTACT
TCATTCGACATGAGTAGTAAAAGTGATAATAATTGTAGTCTTTTGAATAAAAATGCCAAT
ATATCTGTTGCGAGTAATATTGCGAATGCATTTATAAATCTCACAACTTCACTGCCAAAA
AATAATAATAACGCCAACAATATCAGCAATAATAATGGCAGTGGAAGTAATAATAGTAAT
TGTTTACTAAATGCAAGTGGCTTATCACAAACAAAAAATGTAAATGTCATAGTTTCACAA
CAGCCACTTTCAGCACAGCATTTTATGAAGACACTTTCGTTTACGACATCGAGAGGAATT
TTTGTACCAAATGTTATTGCTACAAATATTGCACCTCAATTTACTGTACATCAATATGGG
TTACATGGTGCATCCAGTTCTGGACCAGCACCAAATGTAACTGTAAATCCGAATGGGAAT
TTCAATACAAACTCAAATCCTACGCTTCGACCTGCTATTCCCAATCATTTTCGATTGCTT
TTGCAACAACAGAATGCAAATATTAATTTAAATATCATCAATAATAATAATAATAATCAA
CAGCAAAGTAATGTAAATCCATCGCCCGCACCTTCTCCTATTTTGGAAGCTTCGTTGCAA
ACAATTACAAAAGAAGCCACGATGCAAGCAACGAAAACAACTGTCTTGCCCACTAAAACT
ATTTTTCCAAATCAAATTATTCATCCACCTCAACGATCAATTACTCCATTCACACCCATT
CAGCAAAAAATTGATCAAGATCATGAAATGGTACAGGTTGAAGAATCCTCATCATTAACA
AATAGCACTGCAGATCAACATATTCGCGTTTTAACACCTTCTGAAATTATGAAGACGCTT
CCTTCACTTTCGACACATGATAATGTTTGTTTTAATAGCAGCAGCATGTCAACTGCGAGT
ATAGAGAAGCAGTCAAATGAGACTGATATAAATGCCCTTAGTACTAACAATAATAAAAAG
CTTCAAAACGCATCACAGCACCATCACCATGATCCGAATGCATCAGGCTCTGTTATCTCT
ATTCTTTCCAATAATTCTGCAATTTCTTCTCCTTTGACCATCACTACCACCACCATCACC
TCAGAATGTAACACTTTTACTAGTAGCCATTTCACCACCATCACTAAAGCAACTTTGTCT
TCTACTACTTCTAATAATGCACAAATAATAATGGATCAAGATAAAATTGCAGCTCAACAG
AAGAGCATGAAAGAAGCTGAACAAGAACGTCAATGGAAAATGCTTCAAGCACAACGTGCT
CGTGAGCAACAACTCGGTATGCCTATGGATCATCAGAGAATGAATGATCCAAACATGCAA
CTTCAAATTTCGACAAATCTCAATGATGGATCAATGAGTCCAGTCGCAAGTCCTTCACCA
AATTCAAGAAATCAATTTATGGCTCCAAATAAGGGACGAATGATGATGAATCCTCAAAGT
CCAAGTTCTTCTAACTTCCAACACCCAGGACGTCCTGTTTCCGCTCAATTGCAACGTCAA
CAAAGTCAACGTATAAATCAAAGTCCATTTTCACCACAAACGGGCACACCTCAATCACCG
AACAACATTTTTCCAGGATCACCGTCAGCAGACGGTTTTCAAAGACAACAATCTCTTGAT
GAACAACAATTTTTGCATTCTCCACAAACACCAAAGTCTATTCAACAACAATCACCTGTT
CATACTCCAACAAGTGCAAATATGAGTCCAGTTTATTCTCAAGTAAATCAATCACAACAA
CAACAACAAGCTCCAAATATTCGACCACTCGATGGTGTTAATGCTTACGCACAAGCACCA
GGAACACCGAGACCTTCATTCAATCCTGGTCAAACGCGTACAACTGTTTATGCTAGACCA
GACATGTTTAACAAACCACCGTTTGTTCAATCTAATCAAAACAATCCAGAGCAAAGCAAT
AGACAATTACGTGACTTATTACAGCGTTCACAAGCACCGACAAATTTGCCTGGACCAACA
TCAAGTGCTTTTTCAATGGAAAATGATATGATGAAGAATCAACAAACTCTTCAAACTGGC
ACTGACAACACATTCCGACAACCTCTCCCACCAGGAATCAGACAGCAAAGAATGCAATCA
ATGGTTGGTGGTCAAATGATTCGTGCAGGCCAATTAACTGGAACTGGTCAAAGAATGATT
ATGACACCAGATAATCGTCCACGATTAAATATTAGACCTGGTATGAATATGAATGTACCA
CAAATGATGACTGATCAGCATCAGCAACAACAACAAATGAATCAAGCAGGTCTTAATCAA
ACTCAAAGAATGCCTATAAATCAAGGAACTAACTTTAATCCTCAGAATGAAATGATGCAG
CAATCTAATATGATTCAACAAAATACTCCTGGACAGAACATGATGATGCAACAGCGTCTT
GTTGCTTCTCAAAATCCAAGTATGCAACATGCTCTACAAGGAAATCAAATTAATCCAACT
CAACCAAATACAACTGATCAAAACGTTGTTCATCAACAAACAGGTACTGATGTCGAAGGA
ATTCCAGATTCCGTCACAGCTGAATTAGAGAAATTAGAACAAGATGAAAATGTTGGTATG
GACGGTGTTGGTGATATTCTTGGTGGACTTGGTGACGATGATGATGATTTGTTAGACTCA
TTGACAGCTGAAATGGGCGCAGATTTTAATATTCTTGAATATGCAGATCCTGAACTTGAC
ACAACTGACGAAAAATCAACACTTCTCGATAGCTTAGAGATGGATGAGGGTGAGAATGCA
AAAGAAGAAAAATTGAAAAATCTGGAAGCAGAGAAAATTGCAAAGACAAATTTCCCTCGA
ACTCAACAAGATATGAGTAATGTGCAAATGCAGCCTAATCAGCAAATTAATCGTATGCAA
ATGCAAATGACAAACATTAATCCTCAAATGCAACAGTCAATAGATGGACAAAATGTTAAT
CAAAATGCAGGTCCATCACAACAGCAGCCTGCTCAAATTCCTCAGCAGCAGCAACAACCG
GGACAAGTGCAAAATCAACAACAAATGTTCCAGCAAGCAAATGTGCAAAGACAAGTTAGA
TTTAAAACAATTCCACCTAATCAAATTCCAGAAATTCAACAAATTCATCAACAAATGATG
TTACAGCTTCAACAAGCTGCTGCTAATGGAAAACCAATGCCAATAGGTACACGATTAGTT
GCCAACAATAATATTACAGGCATTGTGACAGGACCGAACAACATATCTTTAACATTCCCT
ACTGGACAACAACGTCTTCAACAAGTACGAATGGTTAATCCAAATATGATGCAACAAAAT
CAGCAAAATAATCCACGAATGGCTTTACCACATATGGTTCAAAATAGACCTAACATGATG
CAAAATGCTCAACAACAGCAAATAAATCCAAACATACAAGGTCTGATAAATCAAACGCAA
CAGCCACAGCAACAGCAAGCGCAACAATCCCAACAAGTGCCACCACCACCCTATCCAGAG
CCGCCACCTCCATATCCTGGTCAAGCAAACCAAAATCAGGAACAACCGTTACTTTTGGAA
GATTTACTCGAACAAGAAAAGCGTGAACAAGCAAGAAACATAGGAACAAATCAAATGGAC
ATGAATATTCCTCAAAATCAACAACAGCAACAAAGTTTATTTAGTGATCAGGATTATGAA
AAATTGCGTGCTGATGTACTTACAACAACCACACAAAGTATTTCAATGCCACAACAGATT
CAGCAACCGCAGCAGGTTCAAGTTCAACCTCAGCAACCACAATTTGCACCAAGAGGTATT
ATTAATAAACAGTGGCGTCCACAAACACCTGGTGTTGTAAATTCACAAGCGCCATCAACC
AGTCCTGATATTGTGAGATCAGTACCTATATTTAATGCAAATCTGACGCCAATGCCTCCA
ATGCCACCTGAAATTATACAAACAGAGACAGATAAACAAATTCAAAGCAACTATGAAGCA
TGGTTGATTACACAAAATGATTCGTTGCAGAAGCAACTTCACTATTATGAAACAGAAATT
GCTGAATTAAGAAAATTGAAGAAATCTCTCAATACAAAACAACGTCAATTAAAGAAAAAT
GGAGGTGATTTGAATGAAATTGATGCTCAGACACTTCTCAAAGTAACTCATGAACAAGCA
GCAGTACAGAAGCATTTAGAAAGCTCTAGAAAGCAAGCAAGAAATCACTTGCAAATGAAA
CAAGATTATGATAATAAACAAAAATCTAAACAAATGGCAAATATTTCTCATATGGCTCAA
TCACCAGTCGGAGTGCAAATGAATGATCAATCACCAATGATGTCTCCTTCGCCAAATATT
ATTCAACAACCCGTACAATCGCCACTTGGTAATCCAATTATGGCACCTTCACAAAGTCCT
CTTCACTCTCCTAGTCCCATGATGAGTTCTCAAAGTCCTGGACCAAACTCAATTATGCAA
AGTCCAGGAGGTCACATTAATAATGCAATGTCTCCTTACAACACTATGCAACAATCGCCA
CGAATCGGTACTCCACATTCACAAATTGACGAAAGTCCATTCAGTCCAAATTCGATTGAA
TCACCTTCAATTAATAGTCGATTGACTTCACCAATTCCACGCATGACATCGCCACAACAT
AGACCAAATACACCGATGCAAATTCAAATGATGAATAGAATGCCTGTTCAATTTAATCAA
CAACAAAATATGAATATGAATCAACAAAATCGTTTTATCAGACCACAAATGATACCAAAT
GATTCAAATAGTCGTATGGCAGGTATGAGAATGCCTGTACAACAATTTCAACAGCAGATG
CAACACGGAGGAGGCAATGTGATTAGGCAAGTGCAATATGATGCAAATGGAAATCCACAG
AATGTGCAAACAATTTCACAAGCAAATCAGCAAATGGATCCACAAAGAGCTATGCAAATA
AGACAAATGCAAATGAGACAACAACAAATGATGAAACAACAGCAACAGCAGCAGCAGCAG
CAAGGTCAAATGCAAATGCAACAAATGTCACAAGTCCAACCGAATCAAAGTCCTATACAT
CAGCAGCCACAATCTCCATTAATCAATCAAAATGCAACTTCTCCTATGCCGAGAAGTCCA
ATGGTTCATTATCAACAACAACAGCAAAATCCAAACAGTCCGATGATGCAAATGGATAGC
TCTCCTCGTCCTATGTATGCACAACAACAGCAGCAAAATATGCAAATAGAACATAGCAAC
AATAATTCGAATTTGATGCAAGGAGGAGGAGGCGGTGGAAATCATATGATGAATCCAGGT
AATCCAACTCAAAAGCGAGCACCAATTAAATTGGGATTAAGAGGAGGTATGCCAATGTAT
GGAAAAGATGGCAATAAGAAGCAAACTGGAACTTCAGATATGCTTCAATTGGTTCAAAAA
GCTCAGCAAAAACATCAGGCACAACAGCAATCTCAACAGCAAGGATTTACGATTGAAGCT
CAAAGAAATATTTCAGGAAAAGTTATTCCAACAGCTGTTTCATCAATGGAATTACAACAA
GCTTCAAGTAGTAAACAAGTGAAAGCAAAAACTTCACTTTTAAAGAATCCACTTGGACCT
AAGGTAAAATCATTGGTTGATTATGATGATAACGATTCATCTAATGGAACACCACCAATA
AGTCCAATATCACAAAAAATCAGACAAAGATTACAAGATGGAAAGATGCATAATGAAGAC
GTTGTGATTGTTGATAGCAGTCCCGATGAGAAACAGAGGCTGACAGATTATGATGATGAT
AATGATAAAATAACATTAACAGAAGTTTCTTTAAATTCTACTGCTCAAGATGCAGGTGAT
GCTGAAATTGTTGAAGCATTCGACGGAAGCGAACTTGTGTCAAGTCCATTAGTGACTGAA
CAGGAAGCTACTGATTACACACTTTTTGATTCACATGTTGTTCATCTGGATGATAGCAAT
GAATCATTAAAAGAAATTATCAATACTGATTTGTTATTTGAAGATGCAATTCCAGTAACT
AAACCTTCAACTTCAAAAATTGAAACGTCAAGCAGCATAACTGCAACAGCAAAAAATCCT
AAGAATGTTGGAACACGAGAAGATTTTGAGGCAATGATTGACTCAGGCAAAGATGAAGAT
GAAAATGAAAGTGAATCTGATGCCGTTACAATAGAAAAGAATGATCCTATTGAACAAGAA
ACCTTAGAAAAACCACAAGGAACTCCTCATAGAATTTTAACTACTACAACTATTCCGACA
GAGGGAAAACGTGAAATTAAATTTCCAGTTGTTCTCTCATCTTCCTCAGGTGGTCAATTA
ATTACACTTCCTTCAAACCTTATGAATCGAACAAATATTCAAGCATCAGCTCTTGCTGCA
ACTGCAGGCATGACAGGTGGACAGAAAAAAATTGTCAAAGGTTCAACTCAAACATTAGCA
AAAGTCTCTATTGGAAATACTACAATTTCTGTTCCTGTTGTACTTAAAAATATGCCAATA
ACAAATTCTGATCCCTCACAAGGTGCGAAAAAGATGATCACCACAAATGCTCAAACACTT
TCAAGTTTGAAAAAAAATCCAAATATGTTTAGTGTGTCGGGACAAAAAATTAACACATCA
ACAATTGTGACACTTTCACTAAACAAAGGCAACACGAGACCAATTCAAACAGTAGATTTA
AAGCAAAGATTACAACAAAATCAAAAATCAATTATTGTTTCAGCTGCACCGTCAGGAATA
ATACAAGCAAGTTCACATAATGTAGAAAATATTGTATCTAGCGCAACAGTAATAAATAAA
CTATCAGCAAGTTGTGCATCACCTATTTTGACATTCTCAAAAATTCCAACATATACTATG
ACACAAGACACGGCTTTACCAACAAAAATATTAGAAGATGATGACGTTTCACCAGAATCA
AGTGAAAGTGATAAGAATAAAGTTAGAGATGTAAGCAAATCATCTGAAAGTACATCGTTA
TCTATTCCTGAAAGTGGCACAGTAGAAGGAAATGAAGAGGTCAAAGAGACAATTATAGAG
AATAAACCGAAATCATTGATACCAATGCATGTCATTGTCAAATCACGTGAATCTTCTCAA
AGTCCGATAGCATCATCATCATCATCAACAACAACAAATCCGACACACGCACAAGCACAA
CGAATAGTGTCTGGTAATATGTCACAACTTTCTCCATTATCGCAGCCAATTGAAATCAAT
ACAAATACACATAATGCTACACAACAAATCCGTTCAATAATGTCTTCAATTGACGCGAAT
GAAGAGTCGAAAAATAAGGCGGAAGTTGAACAAGTTGCAACATCTACGCAATCAAGTGTT
CCAAAGACAATAATACGAAGCACATCTACACCAACATCATCTGGCGATACAAAAATAATT
ATTTCGAGTCAGCCAAGCATTTCTGTACCTTCATCGCCAAATACTCAATCAACGAATAAT
GTAGTTTTCGTGAAGCAAATCAAAGCCATTCCATCGTCTTCGTCTGCTGGTTCAACTATT
GCTATTAAAAATCAATCAACAACAACACTTGCTCAACAACAACAGCAGCCCCTTATTATA
ACAAAAACAACCAATTTATTGAATATTTTAAGTAATCCTCCAAGTGGTCAACATCAACAA
ACTTTCAAAGCGACAGAAATTAAAATGGAACCTCCCATGAGTGGTGATAGCATTCCCAAA
ACAGTAACAATTCTCAAATCGAATCCGACAATTACAAACTTACTCAATTCAAATTCATTT
AAACGATCAAAATCAAGCGATGATGTCATTACAAAAGAAACCTCAGAAGCGGCCGTTAAT
AAACGTTTAAGTTTTGAAGTTTCTAATGAAATTAAAGCTGAACCAATTGAACCAGTTGTC
ATTAAAGAAACAGTTTCAACACCAACATCAGTAACAACAGTTGTCAAAACAGAACAAAGT
ATTACAGTTACACAAGCATCTACTCCTATATCTCAGACTCAAAAAATTATTCCACCAAAT
AAACCTGAAGATTCTCAGAATGTTCTTCTCAAGCAACTTTTGCAAAATTCTGGATCAGGA
GGACCTGTTGGAGGCTCACCGCTAACTCGAACAGTACCTGGATTAATAACAACACAAAGG
GCACCTAGTCTTGGTATGTTTAGTTCACTTGAAGCACAATTAGCTCGACCAGTGATTCCA
CCTGCACCTGCTAAGCAACTAATTGTGACTACACAACCACTTGTGACATCAATCCCCATA
CCGGTTTCATCAATTTCATCGACCCCAGAAGCAACTGTAAAAACAAGTCAATCAGTAAGT
GTCAGCAAATCTATTTCTATCCATGAGACTTCCTTTGTATCACAGCCATCTGCTTCTAAT
ATATTAACGACAACAACTACAACACCACCTGTTGCGTCACAAACGTTAAATGTGTCAAAC
ATTGGAGAAAAAAAACCAATTGTTATTCTAAACAGAAATGATATTCCTGCTAGTTTATTG
ACATCAAACAGCAGTAATATGACAAAAACAATTATTTCACCACTCACAACTACGGAAGGA
ATGGCACCTCCAAATATGACTATAAAGAAGGAAATGATTTCACCTGGAGGTACCATAAAA
CCAAATCAATCACCTACACATATCGCACCAACATTAGTTAAGAGCATATCACAGTCATCA
CTCAATTCAGGCACAACCACTCCCGTACCACTTGAATCTCCATCAGTTGATATAAAAAAA
GAAATGGATGATAGTTCACAATCGGAATCTGTTGCTTCTGATGTGAGCATGGTAAAGAAT
GAAAATATTATGCTTACACCTTCGCGTGATGGAATGAATGAGCCACTTGATGAAAGTCCT
GCTAAAACAGCTGCGGAAATTGCAAATGAATTGAAGAAGAAAAAACGACGTGAATATCAG
AAAAATCGTCGACAAATGCAAATGTCGAAAGAAAAAGGCGTTAAAAAGCCAAGGAAATTG
CAAAAATCTGAGGAAGATTATGATTCATTCATTGATAATCTTATGCTTCAAATAAAGTCT
CTCCCACAAATGCCAATTTTAGAGCCATCATTACCAAAGAACTATGGTGTTTGCCAAATC
TATGGCTCATGTGAGTTGAATAAAGTCAACAAAAAATATGACACAACGAGTGGTGAATTG
ACAGGAGTTTTTGGCAAAGGCGAACTTTCTAATATCTCAGACTTCTACAATACAAAACCA
TTTGGAGTTCTTGAACCAAAAGTCGAAAAGACGCCTGCTTCTACACAAAGAGGTTTTTAT
GATCAAGAATTTCCACCAATCAAGTTTGATGAGGAAGAAAAAGTTACGCATCATCGATCC
AAATATGAATTATTGGCTAAAGATCGTGATATCGATACACCAGATACTGTTGTAAGTTCA
TCAAGTCCAGAATGTGTAACTATGAAGCCACCAAATCGTTTCCCTGGCCTGAGACTGATA
AGAGAAGAGGAAGATGCTGAAACAGAAGATGAAATGACCAGTTTCATTGACAACAGAATG
TCTCCATCAATTCCAACTATCATTGCACCAATTCCAATACGATTGAAGAGTGGTATTTCG
CTAACTTCTGATAATAAAGTTCATAATGAGAAAGAATTTGAATTATCAAAACAATTGGGA
CTAAAATCTTGTTTCGAACCACCAACACCTGCTAAAGATAATAATAATAATAATGTTACT
GTGACATTAACTCTTACTTCATCAGCAGCAGAAGATATAATGGGTGTACTCAAATCATTA
GCAAATATTCTCAATATTCCAGCACCGACTGCATATCAAATTGTAGAGAGAACTACTACA
CCTCCTAGTCAAAAATTGGGACTTTATAGAATAAAAGGAAAAGATGGGAAAGAAGGGCAA
CCTGTTGATATTCAAACAATTCTCAATGGAACAGCAAAATTCTGTAGACATTGTGACGTT
GTCATACTTAATAATGCAATTAAGGCAAAAGCAAACGAATTTCCATTACTTGTCAATACA
GAACTAGAATCAAACGAGCTTTATTTCTGTGGTCAAACGTGCTACAAACAATTCCAATGG
CGACCAATAAATATGCTTGATGATAAGAGTCTCAATTCAACAACAGATGATAAGGCACTC
GAAACAATGTCAGAAAACATATCAAAAGTTGAAATGCAAGGTGATTTAAGTCAATTAAGA
GATCAAAATAGAGATTTGAAGAGAAAGCATGAAGAAATCGAGGACATGAATGAAAGCAAA
GAAGACATTTTACAAGCTGAAAAACGTCAAAAATTAATGAGAATCAAAACTTTCAGTGCA
AATAGTTTTCCAAATATTCATAAACAAAAGAAACTCTCTGAACGCGAGATTACAGAAATG
CTCTTTAGAATGAATATAACAGTCAATTCGGCACCAAAGATTTTAGAAGATACAAGAAAA
TGCATCTTATGTCATCAAATAGGAGATGGTGTGGCTGATGGTCCATCACGTTTACTTAAT
TATGATGTTGATAAGTGGGTACATCTTAATTGTGCATTATGGTCAGATGGTGTATATGAA
ACAGTAAATGGTGCATTAATGAATCTTGAAGCAGCACTTCAACAAAGCTTAAATTCTCAA
TGTACACACTGCAATCATTTGGGCGCAACAATCAAATGTTTTAAACCGCGTTGTGGTACA
CTCTATCATCTCAATTGTGCTATGAAAGATAATTGTGTGTTTTATAAGAACAAGACAACA
ATGTGTAATATTCATGCACCGAAATCGGAAAAAGATAATGAACTAACAACATTAAGTGTG
CAACGGCGTGTATATATAGAGCGTGATGAAAATCGACAAGTTGCATCTATTATGCATCAT
TCTGATCTCACGAATCTTATGCGAGTTGGCAGTTTAATACTTTTGAATGTCGGCCAATTA
TTACCTCATCAGCTACATACATTCCATACAGCAAATTATATTTATCCAATTGGCTTTAAA
ATTATAAGATTTTTCTGGTCAATGAGGTATCCAAATAAAAGATGTCATTACATTTGCTCG
ATTGCTGATTCTGCAGGAAAACCTGAATTTAGAGTTCTTGTAAAAGAACAAAATGAAGAA
GATACTGAATTTAAGGACGAGTCACCTAAAAAAGTATGGCAAAAAATTCTTGAGACGATT
GTCAAATTACGAAGAGAAAATCAATTGGTTCGAGTTTTTCCTAAATATATCAGTGGTGAA
GATCTCTTTGGTTTAACTGAACCAGCCGTTGTGAGGATTTTGGAGAGCTTGCCAGGTGTT
GAAACTTTGAATGATTATCGATTTAAATATGGGAGAAATCCGCTTCTTGAGTTACCACTT
GCGATAAATCCATCAGGTGCTGCTCGTACAGAACCTCGTCTAAAACATTCTGTACCGATG
AAAAAACCACATACTCAAAGAACTGGATCGACCAGTCAACGTCCTGCATTTGTTCCATCA
ACGTCAGCAGGTGAAATAGCTTGTCCATATAGCAAACAATTTGTACATTCAAAGAGCTCC
CAATATAAAAAAATGAAATTAGAATGGAGAAATAATGTATTTCTTGCACGTTCAAAAATT
CAAGGTTTAGGGCTTTATGCAGCTCGTGATTTAGAGAAGCACACTATGGTTATTGAATAC
ATTGGTGAAGTCATTCGAGGAGAATTGTCTGAATTACGTGAGAAGCAATATGAAGCTAGA
AATCGAGGAATTTATATGTTTAGATTAGATGAAGATCGAGTCATTGATGCCACATTATGT
GGTGGACTAGCTCGATATATCAATCACAGTTGTAATCCAAATTGTGTCACTGAGATAGTA
GAAGTAGATCGTGAATATAGAATTATCATATTCGCAAAACGCAGAATAAATCGTGGAGAA
GAATTATCATACGATTACAAATTTGATATAGAAGACGAGTCGAGAAAAATTGCATGTCAT
TGTGGAGCAGCATATTGTAAAAAATACATGAATTAA

>g2655.t2 Gene=g2655 Length=5711
MDSAASFDDDNSDSDYMNVSSSSESSEENEFPAKRKNVPITKRRDGLDISKNPLSIIKSR
LNIRQSTSQAAVKMCLKRKFSFTNATTTTTTKKENDGSGSSSSTINFQLISASSSSSQPQ
SQSQSPNDEQSMELESPSGSGIGNNLSTLMPAPMQGSDDPPYFPEKFPSKLCAFCNLGER
SQLGQGEMLRLELSEEDSKNALKLKSQQSPQEDESKNGSDDLSKLLKNASSSTLLQQQLN
RRQKGLNKCKNPVATNEYVDELEKIGYTEAMDLSLIVENGYYYVHRACAMWSFGVDRDPI
NEALSNVSTVLKQSLNRKCTHCNHYGASAVCKINCQKFFHFPCIAASGSFEDFQSCSVYC
FDHLSQVAVNCGEEAYCRACCNLGDISNLMVCSKCGDHYHGACVGLAQQPGVRAGWQCKN
CRSCQICRIPDNSDGRTLACETCDKLYHPQCLRPIMTTVPKYGWKCRCCRICSDCGARTP
GAGASSRWHNHFTVCDSCYQQRNKGYSCPICRKAYRAAAYREMVKCSICQKFVHSTCDPE
ADLNAYERRKEVNPDYEYTCGMCKIATQNERVNLAMRRSNSGDDESLSASQESLDDIDMD
IDGRMGSREDLALGLGKGKPMLASKIAKKKLGLNNTGGINSNNSGFGQRPKGIGKLGFQK
RARTFELGRKRGPKSKMRGIFGVPGIGLQRPVAADSKQSDEEPGSENRLVLCSAKDRFVL
TQDICVMCGAIGTDQEGCLISCAQCGQCYHPYCVTVKVTKEILQKGWRCLDCTVCEGCGE
KNDEARLILCDDCDISYHIYCMEPPLTTVPQGTWKCKWCAICFKCGSNEAGNNCTWQNSY
TECGPCASQSNCSVCSEIYADGELIIQCTNCDRWLHCLCDSIKNDTEAEKCAEEGYICLL
CRPKDILAPHLQIKKKPPTLSTQSSTTTKEESIDENDNATILALEGSHFVDGVYLSEHGL
QFIKTLQTEPKRAKRKPKVIQEAEKDAGILAAIESVVSGSADNSMEDIKMEPMDPNEEAQ
IYKDGMTWTNNEPAPEGFSLFTNESGQVILRKKRQRNLQKLGIGGFAVRNRAVRTTKNDE
EPSNQCDDAKKKKPNRRKQKNKLIETYPTYLQEAFFGRPLMDSSLKVKLESSSSDEETKS
NVSDDKTIKLSLDELKMIEAMRAKQQQKLQEEQKLLQVNVVNPLEQQQLMSNGAQIQLNQ
KMQSTDTTSVFNQNTTGSIASTSSSNINQTEIKTELELMDDDENNSDTEALKDVLGLPGD
LLDNDLVNTIMNEDDDDLTKNTAGLDDVDVKGVKDDLADILSPHFNIDMEDMLFKSVLTD
ESQESQESQLTNSLTSYSTQSTPSHQSEVQPIHLNQPTNTSLQNNINQMQSPQQQPSTPT
ATIMNPMTLQPQSPQINTQLNPMNVMLQQQQQPNQMHQMQANMMQRQNSQPGTPIGLMPQ
QQQRGPQFNTANVFQNQPQWNANVDNDFDLVNMNAAASQATAVAGQPQKINNHQKNSERM
SEDEKLGDMSTISAVLYANQNHPELKIEYPHWPDRYKQIMKKWRALSAERKQPYLQRARD
NRSAQQAQQRTKKAQQVAPDLSPDNGTTSTNSTSPALSSTLSESSSLMDVSMIQQQHQSQ
QQQQVSTPTTVNVGPPQQLQVVRNNQQSPGRSSTPSSSSNLLPEHVLETNALDNKNDSTS
PNNIKNNLLIIANEHKEFSCSIGDENISSPISNNSNNTINDKISESDEANLLLQVKSSST
SFDMSSKSDNNCSLLNKNANISVASNIANAFINLTTSLPKNNNNANNISNNNGSGSNNSN
CLLNASGLSQTKNVNVIVSQQPLSAQHFMKTLSFTTSRGIFVPNVIATNIAPQFTVHQYG
LHGASSSGPAPNVTVNPNGNFNTNSNPTLRPAIPNHFRLLLQQQNANINLNIINNNNNNQ
QQSNVNPSPAPSPILEASLQTITKEATMQATKTTVLPTKTIFPNQIIHPPQRSITPFTPI
QQKIDQDHEMVQVEESSSLTNSTADQHIRVLTPSEIMKTLPSLSTHDNVCFNSSSMSTAS
IEKQSNETDINALSTNNNKKLQNASQHHHHDPNASGSVISILSNNSAISSPLTITTTTIT
SECNTFTSSHFTTITKATLSSTTSNNAQIIMDQDKIAAQQKSMKEAEQERQWKMLQAQRA
REQQLGMPMDHQRMNDPNMQLQISTNLNDGSMSPVASPSPNSRNQFMAPNKGRMMMNPQS
PSSSNFQHPGRPVSAQLQRQQSQRINQSPFSPQTGTPQSPNNIFPGSPSADGFQRQQSLD
EQQFLHSPQTPKSIQQQSPVHTPTSANMSPVYSQVNQSQQQQQAPNIRPLDGVNAYAQAP
GTPRPSFNPGQTRTTVYARPDMFNKPPFVQSNQNNPEQSNRQLRDLLQRSQAPTNLPGPT
SSAFSMENDMMKNQQTLQTGTDNTFRQPLPPGIRQQRMQSMVGGQMIRAGQLTGTGQRMI
MTPDNRPRLNIRPGMNMNVPQMMTDQHQQQQQMNQAGLNQTQRMPINQGTNFNPQNEMMQ
QSNMIQQNTPGQNMMMQQRLVASQNPSMQHALQGNQINPTQPNTTDQNVVHQQTGTDVEG
IPDSVTAELEKLEQDENVGMDGVGDILGGLGDDDDDLLDSLTAEMGADFNILEYADPELD
TTDEKSTLLDSLEMDEGENAKEEKLKNLEAEKIAKTNFPRTQQDMSNVQMQPNQQINRMQ
MQMTNINPQMQQSIDGQNVNQNAGPSQQQPAQIPQQQQQPGQVQNQQQMFQQANVQRQVR
FKTIPPNQIPEIQQIHQQMMLQLQQAAANGKPMPIGTRLVANNNITGIVTGPNNISLTFP
TGQQRLQQVRMVNPNMMQQNQQNNPRMALPHMVQNRPNMMQNAQQQQINPNIQGLINQTQ
QPQQQQAQQSQQVPPPPYPEPPPPYPGQANQNQEQPLLLEDLLEQEKREQARNIGTNQMD
MNIPQNQQQQQSLFSDQDYEKLRADVLTTTTQSISMPQQIQQPQQVQVQPQQPQFAPRGI
INKQWRPQTPGVVNSQAPSTSPDIVRSVPIFNANLTPMPPMPPEIIQTETDKQIQSNYEA
WLITQNDSLQKQLHYYETEIAELRKLKKSLNTKQRQLKKNGGDLNEIDAQTLLKVTHEQA
AVQKHLESSRKQARNHLQMKQDYDNKQKSKQMANISHMAQSPVGVQMNDQSPMMSPSPNI
IQQPVQSPLGNPIMAPSQSPLHSPSPMMSSQSPGPNSIMQSPGGHINNAMSPYNTMQQSP
RIGTPHSQIDESPFSPNSIESPSINSRLTSPIPRMTSPQHRPNTPMQIQMMNRMPVQFNQ
QQNMNMNQQNRFIRPQMIPNDSNSRMAGMRMPVQQFQQQMQHGGGNVIRQVQYDANGNPQ
NVQTISQANQQMDPQRAMQIRQMQMRQQQMMKQQQQQQQQQGQMQMQQMSQVQPNQSPIH
QQPQSPLINQNATSPMPRSPMVHYQQQQQNPNSPMMQMDSSPRPMYAQQQQQNMQIEHSN
NNSNLMQGGGGGGNHMMNPGNPTQKRAPIKLGLRGGMPMYGKDGNKKQTGTSDMLQLVQK
AQQKHQAQQQSQQQGFTIEAQRNISGKVIPTAVSSMELQQASSSKQVKAKTSLLKNPLGP
KVKSLVDYDDNDSSNGTPPISPISQKIRQRLQDGKMHNEDVVIVDSSPDEKQRLTDYDDD
NDKITLTEVSLNSTAQDAGDAEIVEAFDGSELVSSPLVTEQEATDYTLFDSHVVHLDDSN
ESLKEIINTDLLFEDAIPVTKPSTSKIETSSSITATAKNPKNVGTREDFEAMIDSGKDED
ENESESDAVTIEKNDPIEQETLEKPQGTPHRILTTTTIPTEGKREIKFPVVLSSSSGGQL
ITLPSNLMNRTNIQASALAATAGMTGGQKKIVKGSTQTLAKVSIGNTTISVPVVLKNMPI
TNSDPSQGAKKMITTNAQTLSSLKKNPNMFSVSGQKINTSTIVTLSLNKGNTRPIQTVDL
KQRLQQNQKSIIVSAAPSGIIQASSHNVENIVSSATVINKLSASCASPILTFSKIPTYTM
TQDTALPTKILEDDDVSPESSESDKNKVRDVSKSSESTSLSIPESGTVEGNEEVKETIIE
NKPKSLIPMHVIVKSRESSQSPIASSSSSTTTNPTHAQAQRIVSGNMSQLSPLSQPIEIN
TNTHNATQQIRSIMSSIDANEESKNKAEVEQVATSTQSSVPKTIIRSTSTPTSSGDTKII
ISSQPSISVPSSPNTQSTNNVVFVKQIKAIPSSSSAGSTIAIKNQSTTTLAQQQQQPLII
TKTTNLLNILSNPPSGQHQQTFKATEIKMEPPMSGDSIPKTVTILKSNPTITNLLNSNSF
KRSKSSDDVITKETSEAAVNKRLSFEVSNEIKAEPIEPVVIKETVSTPTSVTTVVKTEQS
ITVTQASTPISQTQKIIPPNKPEDSQNVLLKQLLQNSGSGGPVGGSPLTRTVPGLITTQR
APSLGMFSSLEAQLARPVIPPAPAKQLIVTTQPLVTSIPIPVSSISSTPEATVKTSQSVS
VSKSISIHETSFVSQPSASNILTTTTTTPPVASQTLNVSNIGEKKPIVILNRNDIPASLL
TSNSSNMTKTIISPLTTTEGMAPPNMTIKKEMISPGGTIKPNQSPTHIAPTLVKSISQSS
LNSGTTTPVPLESPSVDIKKEMDDSSQSESVASDVSMVKNENIMLTPSRDGMNEPLDESP
AKTAAEIANELKKKKRREYQKNRRQMQMSKEKGVKKPRKLQKSEEDYDSFIDNLMLQIKS
LPQMPILEPSLPKNYGVCQIYGSCELNKVNKKYDTTSGELTGVFGKGELSNISDFYNTKP
FGVLEPKVEKTPASTQRGFYDQEFPPIKFDEEEKVTHHRSKYELLAKDRDIDTPDTVVSS
SSPECVTMKPPNRFPGLRLIREEEDAETEDEMTSFIDNRMSPSIPTIIAPIPIRLKSGIS
LTSDNKVHNEKEFELSKQLGLKSCFEPPTPAKDNNNNNVTVTLTLTSSAAEDIMGVLKSL
ANILNIPAPTAYQIVERTTTPPSQKLGLYRIKGKDGKEGQPVDIQTILNGTAKFCRHCDV
VILNNAIKAKANEFPLLVNTELESNELYFCGQTCYKQFQWRPINMLDDKSLNSTTDDKAL
ETMSENISKVEMQGDLSQLRDQNRDLKRKHEEIEDMNESKEDILQAEKRQKLMRIKTFSA
NSFPNIHKQKKLSEREITEMLFRMNITVNSAPKILEDTRKCILCHQIGDGVADGPSRLLN
YDVDKWVHLNCALWSDGVYETVNGALMNLEAALQQSLNSQCTHCNHLGATIKCFKPRCGT
LYHLNCAMKDNCVFYKNKTTMCNIHAPKSEKDNELTTLSVQRRVYIERDENRQVASIMHH
SDLTNLMRVGSLILLNVGQLLPHQLHTFHTANYIYPIGFKIIRFFWSMRYPNKRCHYICS
IADSAGKPEFRVLVKEQNEEDTEFKDESPKKVWQKILETIVKLRRENQLVRVFPKYISGE
DLFGLTEPAVVRILESLPGVETLNDYRFKYGRNPLLELPLAINPSGAARTEPRLKHSVPM
KKPHTQRTGSTSQRPAFVPSTSAGEIACPYSKQFVHSKSSQYKKMKLEWRNNVFLARSKI
QGLGLYAARDLEKHTMVIEYIGEVIRGELSELREKQYEARNRGIYMFRLDEDRVIDATLC
GGLARYINHSCNPNCVTEIVEVDREYRIIIFAKRRINRGEELSYDYKFDIEDESRKIACH
CGAAYCKKYMN
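
The two records above are consistent with each other: 17136 nt corresponds to 5711 codons plus one stop codon. A minimal sketch of that check follows, assuming the standard genetic code applies to this transcript; the helper name translate and the table construction are illustrative and not part of the database. It translates the first and last lines of the nucleotide record and compares them with the start and end of the protein record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# Assumption: the standard code is the right table for this organism.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon; '*' marks a stop."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


first_line = "ATGGATAGTGCAGCGAGTTTTGATGACGATAACTCGGATAGTGATTATATGAACGTTTCC"
last_line = "TGTGGAGCAGCATATTGTAAAAAATACATGAATTAA"

print(translate(first_line))  # MDSAASFDDDNSDSDYMNVS
print(translate(last_line))   # CGAAYCKKYMN*
print(17136 == 3 * (5711 + 1))  # True
```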

Protein features from InterProScan

Row Transcript Database ID Name Start End E-value
38 g2655.t2 CDD cd15509 PHD1_KMT2C_like 377 421 1.9465E-18
36 g2655.t2 CDD cd15510 PHD2_KMT2C_like 424 469 5.40054E-20
33 g2655.t2 CDD cd15489 PHD_SF 507 563 6.05498E-7
40 g2655.t2 CDD cd15512 PHD4_KMT2C_like 724 772 1.574E-27
41 g2655.t2 CDD cd15513 PHD5_KMT2C_like 774 820 3.61765E-29
35 g2655.t2 CDD cd15514 PHD6_KMT2C_like 852 901 9.38039E-25
34 g2655.t2 CDD cd00084 HMG-box 1515 1568 8.75126E-6
37 g2655.t2 CDD cd15666 ePHD2_KMT2C_like 5201 5305 1.05828E-61
39 g2655.t2 CDD cd19171 SET_KMT2C_2D 5558 5710 1.55574E-102
28 g2655.t2 Coils Coil Coil 1155 1175 -
29 g2655.t2 Coils Coil Coil 2129 2163 -
31 g2655.t2 Coils Coil Coil 3066 3100 -
27 g2655.t2 Coils Coil Coil 3380 3400 -
30 g2655.t2 Coils Coil Coil 4685 4712 -
32 g2655.t2 Coils Coil Coil 5109 5150 -
25 g2655.t2 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 263 367 2.5E-11
26 g2655.t2 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 368 423 1.5E-6
24 g2655.t2 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 424 473 7.7E-10
21 g2655.t2 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 501 566 3.6E-10
22 g2655.t2 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 719 829 8.4E-26
19 g2655.t2 Gene3D G3DSA:2.60.120.650 Cupin 846 1018 1.1E-6
20 g2655.t2 Gene3D G3DSA:1.10.30.10 DNA Binding (I) 1485 1576 2.4E-26
23 g2655.t2 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 5195 5308 3.7E-22
17 g2655.t2 Gene3D G3DSA:3.30.160.360 - 5346 5483 5.1E-45
18 g2655.t2 Gene3D G3DSA:2.170.270.10 SET domain 5515 5711 1.7E-51
70 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 1 52 -
69 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 10 27 -
67 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 28 49 -
77 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 111 138 -
74 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 1073 1102 -
79 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 1073 1092 -
59 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 1324 1349 -
56 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 1482 1503 -
58 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 1552 1605 -
75 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 1558 1605 -
66 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 1611 1630 -
76 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 1643 1663 -
62 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 2220 2259 -
61 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 2284 2305 -
71 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 2719 2751 -
63 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 2877 2898 -
72 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 2877 2891 -
57 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 3123 3147 -
60 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 3123 3153 -
73 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 3237 3280 -
54 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 3368 3458 -
55 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 4029 4074 -
78 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 4029 4052 -
80 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 4095 4118 -
68 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 4619 4659 -
65 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 4691 4723 -
64 g2655.t2 MobiDBLite mobidb-lite consensus disorder prediction 4702 4723 -
6 g2655.t2 PANTHER PTHR45888:SF3 HL01030P-RELATED 174 1834 0.0
8 g2655.t2 PANTHER PTHR45888 - 174 1834 0.0
5 g2655.t2 PANTHER PTHR45888:SF3 HL01030P-RELATED 4303 5711 0.0
7 g2655.t2 PANTHER PTHR45888 - 4303 5711 0.0
3 g2655.t2 Pfam PF00628 PHD-finger 775 821 1.0E-8
2 g2655.t2 Pfam PF05964 F/Y-rich N-terminus 5350 5401 8.2E-20
1 g2655.t2 Pfam PF05965 F/Y rich C-terminus 5407 5491 3.6E-27
4 g2655.t2 Pfam PF00856 SET domain 5582 5686 3.8E-18
90 g2655.t2 ProSiteProfiles PS50016 Zinc finger PHD-type profile. 374 424 8.84
87 g2655.t2 ProSiteProfiles PS50016 Zinc finger PHD-type profile. 421 472 9.543
89 g2655.t2 ProSiteProfiles PS50016 Zinc finger PHD-type profile. 722 775 8.992
86 g2655.t2 ProSiteProfiles PS50016 Zinc finger PHD-type profile. 772 822 9.712
88 g2655.t2 ProSiteProfiles PS50016 Zinc finger PHD-type profile. 849 904 9.098
83 g2655.t2 ProSiteProfiles PS50118 HMG boxes A and B DNA-binding domains profile. 1507 1573 8.863
81 g2655.t2 ProSiteProfiles PS51805 Extended PHD (ePHD) domain profile. 5198 5306 24.662
85 g2655.t2 ProSiteProfiles PS51542 FYR domain FYRN motif profile. 5344 5404 28.789
82 g2655.t2 ProSiteProfiles PS51543 FYR domain FYRC motif profile. 5405 5490 31.364
84 g2655.t2 ProSiteProfiles PS50280 SET domain profile. 5571 5687 17.634
91 g2655.t2 ProSiteProfiles PS50868 Post-SET domain profile. 5695 5711 9.151
47 g2655.t2 SMART SM00249 PHD_3 318 360 11.0
43 g2655.t2 SMART SM00249 PHD_3 376 422 5.9E-5
45 g2655.t2 SMART SM00249 PHD_3 423 470 1.5E-7
48 g2655.t2 SMART SM00249 PHD_3 507 564 2.4E-5
42 g2655.t2 SMART SM00249 PHD_3 724 773 1.6E-6
49 g2655.t2 SMART SM00249 PHD_3 774 820 1.6E-10
46 g2655.t2 SMART SM00249 PHD_3 851 902 0.013
50 g2655.t2 SMART SM00398 hmgende2 1506 1574 0.0017
44 g2655.t2 SMART SM00249 PHD_3 5260 5306 0.59
52 g2655.t2 SMART SM00541 fyrn_3 5360 5403 2.7E-16
51 g2655.t2 SMART SM00542 fyrc_3 5409 5496 1.9E-32
53 g2655.t2 SMART SM00317 set_7 5571 5693 1.4E-40
14 g2655.t2 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 371 429 8.2E-7
13 g2655.t2 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 414 477 4.65E-12
16 g2655.t2 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 503 566 1.23E-5
11 g2655.t2 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 720 774 5.98E-7
15 g2655.t2 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 766 825 2.44E-11
12 g2655.t2 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 840 907 1.04E-8
9 g2655.t2 SUPERFAMILY SSF47095 HMG-box 1516 1562 5.59E-6
10 g2655.t2 SUPERFAMILY SSF82199 SET domain 5553 5711 1.7E-46

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
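
As an illustration of how the 0.5 cutoff turns per-residue scores into regions, here is a small sketch. The function name disordered_regions and the example scores are hypothetical; they are not taken from this transcript's IUPred3 output, which is not reproduced on this page.

```python
def disordered_regions(scores, threshold=0.5):
    """Return 1-based inclusive (start, end) runs where score > threshold."""
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # a new run begins here
        elif start is not None:
            regions.append((start, pos - 1))  # the run ended at the previous residue
            start = None
    if start is not None:
        regions.append((start, len(scores)))  # run extends to the last residue
    return regions


# Hypothetical per-residue scores, for demonstration only.
example_scores = [0.7, 0.8, 0.4, 0.3, 0.6, 0.9, 0.55]
print(disordered_regions(example_scores))  # [(1, 2), (5, 7)]
```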

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005634 nucleus CC
GO:0005515 protein binding MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean ± standard deviation.
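
A minimal sketch of this summary for one condition is given below. It assumes the sample standard deviation (n - 1 denominator) across replicates, which the page does not specify, and the replicate values are invented for the example.

```python
from statistics import mean, stdev


def summarize_tpm(replicates):
    """Return (mean, sample standard deviation) of replicate TPM values."""
    return mean(replicates), stdev(replicates)


def format_tpm(replicates):
    """Format replicate TPM values as 'mean +/- SD' with two decimals."""
    m, s = summarize_tpm(replicates)
    return f"{m:.2f} +/- {s:.2f}"


# Hypothetical replicate TPM values for a single condition.
print(format_tpm([10.0, 12.0, 14.0]))  # 12.00 +/- 2.00
```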

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was called differentially expressed when (1) FDR < 0.05 and (2) fold change > 2, with TPM calculated by RSEM. DE information and fold changes between conditions are indicated in the plot below.
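
The two criteria can be written as a simple filter. This is a sketch under stated assumptions, not the pipeline used for this database: it assumes the fold-change criterion applies in both directions (up or down), expressed as |log2 fold change| > 1, and the function name and input values are hypothetical.

```python
import math


def is_differentially_expressed(fdr, log2_fold_change,
                                fdr_cutoff=0.05, min_fold=2.0):
    """Apply both criteria: FDR below cutoff and fold change above min_fold.

    Assumption: fold change is tested symmetrically, so a transcript that is
    more than 2-fold down also passes.
    """
    return fdr < fdr_cutoff and abs(log2_fold_change) > math.log2(min_fold)


# Hypothetical (FDR, log2 fold change) pairs.
print(is_differentially_expressed(0.01, 1.5))   # True
print(is_differentially_expressed(0.01, -1.5))  # True
print(is_differentially_expressed(0.20, 3.0))   # False
print(is_differentially_expressed(0.01, 0.5))   # False
```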

Raw TPM values