Gene loci information

Transcript annotation

  • This transcript has been annotated as histone-arginine N-methyltransferase.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g2655 g2655.t1 TSS g2655.t1 19363007 19363007
chr_3 g2655 g2655.t1 isoform g2655.t1 19363118 19383354
chr_3 g2655 g2655.t1 exon g2655.t1.exon1 19363118 19364228
chr_3 g2655 g2655.t1 cds g2655.t1.CDS1 19363118 19364228
chr_3 g2655 g2655.t1 exon g2655.t1.exon2 19364297 19367552
chr_3 g2655 g2655.t1 cds g2655.t1.CDS2 19364297 19367552
chr_3 g2655 g2655.t1 exon g2655.t1.exon3 19367822 19368000
chr_3 g2655 g2655.t1 cds g2655.t1.CDS3 19367822 19368000
chr_3 g2655 g2655.t1 exon g2655.t1.exon4 19368062 19368276
chr_3 g2655 g2655.t1 cds g2655.t1.CDS4 19368062 19368276
chr_3 g2655 g2655.t1 exon g2655.t1.exon5 19369220 19370884
chr_3 g2655 g2655.t1 cds g2655.t1.CDS5 19369220 19370884
chr_3 g2655 g2655.t1 exon g2655.t1.exon6 19371535 19371667
chr_3 g2655 g2655.t1 cds g2655.t1.CDS6 19371535 19371667
chr_3 g2655 g2655.t1 exon g2655.t1.exon7 19371729 19373548
chr_3 g2655 g2655.t1 cds g2655.t1.CDS7 19371729 19373548
chr_3 g2655 g2655.t1 exon g2655.t1.exon8 19373610 19373728
chr_3 g2655 g2655.t1 cds g2655.t1.CDS8 19373610 19373728
chr_3 g2655 g2655.t1 exon g2655.t1.exon9 19373792 19374065
chr_3 g2655 g2655.t1 cds g2655.t1.CDS9 19373792 19374065
chr_3 g2655 g2655.t1 exon g2655.t1.exon10 19374650 19374781
chr_3 g2655 g2655.t1 cds g2655.t1.CDS10 19374650 19374781
chr_3 g2655 g2655.t1 exon g2655.t1.exon11 19374837 19379666
chr_3 g2655 g2655.t1 cds g2655.t1.CDS11 19374837 19379666
chr_3 g2655 g2655.t1 exon g2655.t1.exon12 19379729 19382621
chr_3 g2655 g2655.t1 cds g2655.t1.CDS12 19379729 19382621
chr_3 g2655 g2655.t1 exon g2655.t1.exon13 19382684 19382949
chr_3 g2655 g2655.t1 cds g2655.t1.CDS13 19382684 19382949
chr_3 g2655 g2655.t1 exon g2655.t1.exon14 19383015 19383197
chr_3 g2655 g2655.t1 cds g2655.t1.CDS14 19383015 19383197
chr_3 g2655 g2655.t1 exon g2655.t1.exon15 19383262 19383354
chr_3 g2655 g2655.t1 cds g2655.t1.CDS15 19383262 19383354
chr_3 g2655 g2655.t1 TTS g2655.t1 19384274 19384274

Sequences

>g2655.t1 Gene=g2655 Length=17169
ATGGATAGTGCAGCGAGTTTTGATGACGATAACTCGGATAGTGATTATATGAACGTTTCC
TCATCATCTGAAAGTTCAGAAGAAAATGAATTTCCAGCAAAACGTAAGAACGTACCAATC
ACTAAACGCAGAGATGGACTCGATATTAGTAAAAATCCATTATCGATTATTAAATCGAGG
CTTAATATTAGGCAATCGACAAGTCAGGCGGCAGTTAAAATGTGTTTAAAGCGAAAATTC
TCTTTTACAAACGCAACGACTACAACAACAACAAAAAAGGAAAATGATGGTTCAGGCTCA
TCATCGTCCACCATTAACTTTCAACTCATTTCGGCTTCTTCGTCTTCGTCACAACCACAA
TCTCAATCTCAATCGCCAAATGATGAACAATCGATGGAGCTTGAGTCACCAAGTGGTAGT
GGCATTGGAAACAACCTTTCTACACTTATGCCTGCACCAATGCAGGGTAGCGATGATCCA
CCATATTTTCCTGAAAAATTTCCAAGTAAACTATGTGCCTTTTGTAACTTGGGTGAACGC
AGTCAGCTCGGACAGGGTGAAATGTTACGATTAGAACTGAGTGAAGAAGATTCGAAAAAT
GCATTAAAATTAAAATCGCAACAATCACCACAGGAAGACGAATCGAAAAATGGTAGTGAT
GATTTAAGTAAATTGTTAAAAAATGCTTCATCGTCTACATTATTACAACAACAATTAAAT
CGTCGTCAAAAAGGCTTAAACAAGTGCAAGAATCCCGTTGCAACAAACGAATATGTTGAT
GAATTGGAAAAAATTGGCTATACGGAAGCCATGGATTTGAGTCTGATTGTTGAAAATGGC
TACTATTATGTTCATCGTGCGTGTGCAATGTGGTCATTTGGTGTTGATCGAGATCCAATT
AATGAGGCATTAAGTAATGTGTCAACTGTACTGAAACAAAGTTTGAATCGTAAATGTACA
CATTGCAATCATTATGGCGCGAGTGCAGTTTGCAAAATAAATTGTCAAAAGTTTTTCCAT
TTTCCATGCATCGCTGCATCAGGGAGTTTTGAAGATTTTCAATCGTGTTCAGTGTACTGT
TTCGATCATTTGTCACAAGTTGCTGTAAATTGTGGTGAAGAGGCCTACTGTCGAGCATGT
TGTAATTTGGGTGATATAAGCAATCTCATGGTGTGTTCAAAATGTGGTGATCATTATCAT
GGAGCATGTGTTGGATTAGCACAACAACCTGGTGTAAGAGCTGGCTGGCAGTGTAAGAAT
TGTCGTTCATGTCAGATTTGTCGCATTCCTGATAATAGCGATGGTCGAACATTAGCTTGT
GAAACATGCGATAAACTTTATCATCCTCAATGTTTAAGACCAATTATGACAACTGTACCA
AAGTACGGATGGAAGTGTCGTTGTTGTCGTATTTGCAGTGATTGCGGTGCACGAACGCCT
GGAGCAGGAGCTTCTTCGCGATGGCATAATCATTTTACAGTTTGCGATTCATGCTATCAG
CAACGTAATAAGGGATATTCTTGTCCAATTTGTCGTAAAGCATATCGTGCTGCTGCTTAT
AGAGAAATGGTGAAATGTTCAATTTGTCAAAAGTTTGTGCATAGTACATGTGACCCAGAG
GCAGATTTAAACGCCTACGAACGAAGAAAGGAAGTAAATCCGGACTATGAGTATACATGC
GGAATGTGTAAAATTGCAACGCAAAATGAACGAGTTAATTTAGCGATGAGAAGAAGCAAT
AGTGGAGATGATGAAAGTCTTTCCGCATCACAAGAGAGTTTAGATGATATTGATATGGAC
ATTGATGGAAGAATGGGAAGTAGAGAAGATTTAGCACTTGGTTTGGGCAAAGGAAAACCA
ATGCTGGCAAGTAAAATAGCAAAGAAAAAACTGGGACTTAATAATACTGGTGGAATAAAC
AGCAATAATAGCGGTTTTGGACAAAGACCAAAGGGTATTGGAAAATTGGGCTTCCAAAAA
CGTGCAAGAACTTTTGAGTTGGGACGAAAGCGAGGTCCGAAATCGAAAATGCGTGGAATT
TTTGGTGTTCCTGGAATTGGCTTACAACGACCTGTTGCTGCTGATTCAAAACAAAGTGAC
GAAGAACCAGGATCAGAAAATCGTCTTGTTCTTTGTTCAGCAAAAGACCGTTTTGTGCTT
ACTCAAGATATATGTGTAATGTGTGGTGCAATTGGAACAGATCAAGAAGGCTGTCTAATT
TCATGTGCTCAGTGTGGTCAATGCTATCATCCATATTGTGTAACAGTTAAAGTTACTAAA
GAAATTCTTCAAAAAGGCTGGCGTTGTCTTGATTGTACAGTTTGTGAAGGATGTGGCGAA
AAGAATGATGAAGCACGATTAATTTTATGTGACGATTGTGACATTTCATATCATATTTAT
TGTATGGAGCCGCCATTAACGACTGTTCCACAAGGTACATGGAAATGTAAATGGTGTGCA
ATTTGCTTTAAATGCGGCTCAAACGAAGCGGGTAATAATTGTACATGGCAGAATAGCTAT
ACAGAATGTGGACCGTGTGCAAGTCAATCGAATTGTTCAGTGTGCAGTGAAATCTATGCG
GATGGTGAACTAATAATTCAATGTACAAATTGCGATCGATGGCTTCATTGTTTATGTGAT
TCAATTAAGAATGATACAGAAGCAGAAAAATGTGCAGAAGAAGGCTATATTTGTTTATTA
TGTCGACCCAAAGACATTTTAGCACCACATTTACAAATAAAGAAAAAACCACCAACTCTA
TCAACACAATCATCTACGACAACCAAAGAAGAGTCAATTGATGAAAATGATAACGCTACT
ATTTTGGCTCTCGAAGGTAGTCACTTTGTTGATGGTGTTTATTTGAGTGAGCATGGTTTA
CAATTTATAAAAACACTTCAGACAGAACCAAAACGTGCAAAACGCAAACCCAAAGTTATA
CAAGAAGCAGAAAAAGATGCTGGTATTTTGGCAGCAATTGAATCTGTTGTTTCTGGAAGT
GCTGATAATTCAATGGAAGATATCAAAATGGAACCGATGGATCCAAATGAAGAAGCTCAA
ATTTATAAAGATGGTATGACATGGACTAATAATGAACCTGCACCCGAAGGCTTCTCTCTT
TTCACAAATGAAAGTGGTCAAGTTATATTGCGCAAAAAACGTCAAAGAAATCTTCAAAAA
CTTGGTATTGGAGGATTTGCAGTGCGAAATCGAGCTGTACGAACAACAAAGAATGATGAA
GAGCCCTCAAATCAATGTGATGATGCAAAGAAGAAAAAACCGAATCGTCGAAAGCAAAAG
AATAAGCTTATTGAAACTTACCCGACCTATTTACAGGAAGCCTTTTTCGGTAGGCCATTG
ATGGATTCAAGTCTTAAAGTTAAACTTGAATCAAGCAGCAGTGATGAAGAGACAAAGAGT
AATGTCTCAGATGATAAGACAATAAAATTATCATTAGATGAATTGAAAATGATTGAAGCA
ATGCGAGCCAAGCAGCAACAAAAGTTACAAGAAGAACAAAAATTGTTACAAGTAAATGTG
GTCAATCCATTAGAGCAGCAACAACTAATGTCAAATGGTGCACAAATTCAGTTGAATCAA
AAAATGCAATCAACGGACACGACAAGTGTTTTTAATCAAAATACCACTGGCAGCATAGCA
TCAACTTCATCATCAAATATTAATCAAACAGAGATTAAAACTGAATTGGAATTAATGGAT
GATGATGAAAATAATAGTGATACAGAAGCACTAAAAGATGTTCTCGGTTTGCCAGGCGAT
CTTTTAGATAATGATTTAGTCAATACAATAATGAATGAAGACGACGATGATCTCACTAAA
AATACAGCCGGATTAGACGATGTTGATGTAAAAGGTGTTAAAGATGATTTGGCTGATATT
TTGAGTCCTCATTTTAATATTGATATGGAAGATATGTTATTCAAAAGCGTCTTAACGGAT
GAATCACAAGAGTCTCAGGAATCACAATTGACCAATTCACTAACATCGTACTCAACGCAA
TCTACACCTTCTCATCAATCTGAGGTGCAACCTATTCATCTCAATCAACCAACAAATACT
TCTCTACAAAATAATATAAATCAAATGCAATCTCCACAGCAGCAACCAAGTACACCAACC
GCTACAATTATGAATCCTATGACATTGCAACCGCAATCGCCTCAAATAAATACTCAATTA
AATCCCATGAATGTTATGCTTCAGCAACAACAACAACCCAATCAAATGCATCAAATGCAA
GCAAACATGATGCAACGTCAAAATTCTCAGCCTGGCACACCAATCGGTTTAATGCCACAA
CAACAACAAAGAGGGTACAATTTTAATTCAGATTTTTGCAGCTATAGACCACAATTCAAT
ACAGCCAATGTATTTCAAAATCAACCACAATGGAATGCAAATGTTGATAATGATTTTGAT
TTAGTCAATATGAATGCCGCTGCTAGTCAAGCAACAGCAGTAGCAGGTCAGCCACAAAAA
ATTAATAATCATCAAAAGAATTCTGAACGAATGTCAGAAGATGAAAAACTCGGTGATATG
TCAACAATATCAGCTGTTCTCTATGCAAATCAAAATCATCCTGAGCTAAAAATTGAATAT
CCACATTGGCCAGATCGTTATAAGCAAATTATGAAAAAATGGCGAGCACTATCAGCTGAA
AGAAAGCAACCATATTTACAGCGTGCTCGTGATAATCGATCGGCACAACAAGCACAGCAA
CGAACTAAAAAAGCACAACAGGTTGCTCCTGACCTATCACCTGATAATGGAACCACGAGC
ACAAATAGCACTTCTCCTGCACTGAGTTCAACCCTCTCAGAGTCTTCATCATTGATGGAT
GTATCAATGATTCAGCAGCAGCATCAATCACAACAACAGCAACAAGTGTCGACCCCAACT
ACAGTCAATGTCGGTCCACCACAACAACTCCAAGTGGTAAGAAACAATCAGCAGTCCCCA
GGCCGCTCTTCTACTCCTTCTTCTTCTTCTAATCTACTGCCGGAACATGTACTTGAAACT
AACGCTTTAGATAACAAAAACGATTCGACATCTCCTAATAATATTAAAAATAATTTACTA
ATCATAGCAAATGAGCACAAAGAATTTAGTTGTAGTATTGGTGATGAAAATATTAGTTCC
CCTATTTCCAATAACAGTAATAATACTATAAATGATAAAATCTCAGAGTCTGATGAGGCG
AATTTGCTGTTGCAAGTCAAATCCTCTTCTACTTCATTCGACATGAGTAGTAAAAGTGAT
AATAATTGTAGTCTTTTGAATAAAAATGCCAATATATCTGTTGCGAGTAATATTGCGAAT
GCATTTATAAATCTCACAACTTCACTGCCAAAAAATAATAATAACGCCAACAATATCAGC
AATAATAATGGCAGTGGAAGTAATAATAGTAATTGTTTACTAAATGCAAGTGGCTTATCA
CAAACAAAAAATGTAAATGTCATAGTTTCACAACAGCCACTTTCAGCACAGCATTTTATG
AAGACACTTTCGTTTACGACATCGAGAGGAATTTTTGTACCAAATGTTATTGCTACAAAT
ATTGCACCTCAATTTACTGTACATCAATATGGGTTACATGGTGCATCCAGTTCTGGACCA
GCACCAAATGTAACTGTAAATCCGAATGGGAATTTCAATACAAACTCAAATCCTACGCTT
CGACCTGCTATTCCCAATCATTTTCGATTGCTTTTGCAACAACAGAATGCAAATATTAAT
TTAAATATCATCAATAATAATAATAATAATCAACAGCAAAGTAATGTAAATCCATCGCCC
GCACCTTCTCCTATTTTGGAAGCTTCGTTGCAAACAATTACAAAAGAAGCCACGATGCAA
GCAACGAAAACAACTGTCTTGCCCACTAAAACTATTTTTCCAAATCAAATTATTCATCCA
CCTCAACGATCAATTACTCCATTCACACCCATTCAGCAAAAAATTGATCAAGATCATGAA
ATGGTACAGGTTGAAGAATCCTCATCATTAACAAATAGCACTGCAGATCAACATATTCGC
GTTTTAACACCTTCTGAAATTATGAAGACGCTTCCTTCACTTTCGACACATGATAATGTT
TGTTTTAATAGCAGCAGCATGTCAACTGCGAGTATAGAGAAGCAGTCAAATGAGACTGAT
ATAAATGCCCTTAGTACTAACAATAATAAAAAGCTTCAAAACGCATCACAGCACCATCAC
CATGATCCGAATGCATCAGGCTCTGTTATCTCTATTCTTTCCAATAATTCTGCAATTTCT
TCTCCTTTGACCATCACTACCACCACCATCACCTCAGAATGTAACACTTTTACTAGTAGC
CATTTCACCACCATCACTAAAGCAACTTTGTCTTCTACTACTTCTAATAATGCACAAATA
ATAATGGATCAAGATAAAATTGCAGCTCAACAGAAGAGCATGAAAGAAGCTGAACAAGAA
CGTCAATGGAAAATGCTTCAAGCACAACGTGCTCGTGAGCAACAACTCGGTATGCCTATG
GATCATCAGAGAATGAATGATCCAAACATGCAACTTCAAATTTCGACAAATCTCAATGAT
GGATCAATGAGTCCAGTCGCAAGTCCTTCACCAAATTCAAGAAATCAATTTATGGCTCCA
AATAAGGGACGAATGATGATGAATCCTCAAAGTCCAAGTTCTTCTAACTTCCAACACCCA
GGACGTCCTGTTTCCGCTCAATTGCAACGTCAACAAAGTCAACGTATAAATCAAAGTCCA
TTTTCACCACAAACGGGCACACCTCAATCACCGAACAACATTTTTCCAGGATCACCGTCA
GCAGACGGTTTTCAAAGACAACAATCTCTTGATGAACAACAATTTTTGCATTCTCCACAA
ACACCAAAGTCTATTCAACAACAATCACCTGTTCATACTCCAACAAGTGCAAATATGAGT
CCAGTTTATTCTCAAGTAAATCAATCACAACAACAACAACAAGCTCCAAATATTCGACCA
CTCGATGGTGTTAATGCTTACGCACAAGCACCAGGAACACCGAGACCTTCATTCAATCCT
GGTCAAACGCGTACAACTGTTTATGCTAGACCAGACATGTTTAACAAACCACCGTTTGTT
CAATCTAATCAAAACAATCCAGAGCAAAGCAATAGACAATTACGTGACTTATTACAGCGT
TCACAAGCACCGACAAATTTGCCTGGACCAACATCAAGTGCTTTTTCAATGGAAAATGAT
ATGATGAAGAATCAACAAACTCTTCAAACTGGCACTGACAACACATTCCGACAACCTCTC
CCACCAGGAATCAGACAGCAAAGAATGCAATCAATGGTTGGTGGTCAAATGATTCGTGCA
GGCCAATTAACTGGAACTGGTCAAAGAATGATTATGACACCAGATAATCGTCCACGATTA
AATATTAGACCTGGTATGAATATGAATGTACCACAAATGATGACTGATCAGCATCAGCAA
CAACAACAAATGAATCAAGCAGGTCTTAATCAAACTCAAAGAATGCCTATAAATCAAGGA
ACTAACTTTAATCCTCAGAATGAAATGATGCAGCAATCTAATATGATTCAACAAAATACT
CCTGGACAGAACATGATGATGCAACAGCGTCTTGTTGCTTCTCAAAATCCAAGTATGCAA
CATGCTCTACAAGGAAATCAAATTAATCCAACTCAACCAAATACAACTGATCAAAACGTT
GTTCATCAACAAACAGGTACTGATGTCGAAGGAATTCCAGATTCCGTCACAGCTGAATTA
GAGAAATTAGAACAAGATGAAAATGTTGGTATGGACGGTGTTGGTGATATTCTTGGTGGA
CTTGGTGACGATGATGATGATTTGTTAGACTCATTGACAGCTGAAATGGGCGCAGATTTT
AATATTCTTGAATATGCAGATCCTGAACTTGACACAACTGACGAAAAATCAACACTTCTC
GATAGCTTAGAGATGGATGAGGGTGAGAATGCAAAAGAAGAAAAATTGAAAAATCTGGAA
GCAGAGAAAATTGCAAAGACAAATTTCCCTCGAACTCAACAAGATATGAGTAATGTGCAA
ATGCAGCCTAATCAGCAAATTAATCGTATGCAAATGCAAATGACAAACATTAATCCTCAA
ATGCAACAGTCAATAGATGGACAAAATGTTAATCAAAATGCAGGTCCATCACAACAGCAG
CCTGCTCAAATTCCTCAGCAGCAGCAACAACCGGGACAAGTGCAAAATCAACAACAAATG
TTCCAGCAAGCAAATGTGCAAAGACAAGTTAGATTTAAAACAATTCCACCTAATCAAATT
CCAGAAATTCAACAAATTCATCAACAAATGATGTTACAGCTTCAACAAGCTGCTGCTAAT
GGAAAACCAATGCCAATAGGTACACGATTAGTTGCCAACAATAATATTACAGGCATTGTG
ACAGGACCGAACAACATATCTTTAACATTCCCTACTGGACAACAACGTCTTCAACAAGTA
CGAATGGTTAATCCAAATATGATGCAACAAAATCAGCAAAATAATCCACGAATGGCTTTA
CCACATATGGTTCAAAATAGACCTAACATGATGCAAAATGCTCAACAACAGCAAATAAAT
CCAAACATACAAGGTCTGATAAATCAAACGCAACAGCCACAGCAACAGCAAGCGCAACAA
TCCCAACAAGTGCCACCACCACCCTATCCAGAGCCGCCACCTCCATATCCTGGTCAAGCA
AACCAAAATCAGGAACAACCGTTACTTTTGGAAGATTTACTCGAACAAGAAAAGCGTGAA
CAAGCAAGAAACATAGGAACAAATCAAATGGACATGAATATTCCTCAAAATCAACAACAG
CAACAAAGTTTATTTAGTGATCAGGATTATGAAAAATTGCGTGCTGATGTACTTACAACA
ACCACACAAAGTATTTCAATGCCACAACAGATTCAGCAACCGCAGCAGGTTCAAGTTCAA
CCTCAGCAACCACAATTTGCACCAAGAGGTATTATTAATAAACAGTGGCGTCCACAAACA
CCTGGTGTTGTAAATTCACAAGCGCCATCAACCAGTCCTGATATTGTGAGATCAGTACCT
ATATTTAATGCAAATCTGACGCCAATGCCTCCAATGCCACCTGAAATTATACAAACAGAG
ACAGATAAACAAATTCAAAGCAACTATGAAGCATGGTTGATTACACAAAATGATTCGTTG
CAGAAGCAACTTCACTATTATGAAACAGAAATTGCTGAATTAAGAAAATTGAAGAAATCT
CTCAATACAAAACAACGTCAATTAAAGAAAAATGGAGGTGATTTGAATGAAATTGATGCT
CAGACACTTCTCAAAGTAACTCATGAACAAGCAGCAGTACAGAAGCATTTAGAAAGCTCT
AGAAAGCAAGCAAGAAATCACTTGCAAATGAAACAAGATTATGATAATAAACAAAAATCT
AAACAAATGGCAAATATTTCTCATATGGCTCAATCACCAGTCGGAGTGCAAATGAATGAT
CAATCACCAATGATGTCTCCTTCGCCAAATATTATTCAACAACCCGTACAATCGCCACTT
GGTAATCCAATTATGGCACCTTCACAAAGTCCTCTTCACTCTCCTAGTCCCATGATGAGT
TCTCAAAGTCCTGGACCAAACTCAATTATGCAAAGTCCAGGAGGTCACATTAATAATGCA
ATGTCTCCTTACAACACTATGCAACAATCGCCACGAATCGGTACTCCACATTCACAAATT
GACGAAAGTCCATTCAGTCCAAATTCGATTGAATCACCTTCAATTAATAGTCGATTGACT
TCACCAATTCCACGCATGACATCGCCACAACATAGACCAAATACACCGATGCAAATTCAA
ATGATGAATAGAATGCCTGTTCAATTTAATCAACAACAAAATATGAATATGAATCAACAA
AATCGTTTTATCAGACCACAAATGATACCAAATGATTCAAATAGTCGTATGGCAGGTATG
AGAATGCCTGTACAACAATTTCAACAGCAGATGCAACACGGAGGAGGCAATGTGATTAGG
CAAGTGCAATATGATGCAAATGGAAATCCACAGAATGTGCAAACAATTTCACAAGCAAAT
CAGCAAATGGATCCACAAAGAGCTATGCAAATAAGACAAATGCAAATGAGACAACAACAA
ATGATGAAACAACAGCAACAGCAGCAGCAGCAGCAAGGTCAAATGCAAATGCAACAAATG
TCACAAGTCCAACCGAATCAAAGTCCTATACATCAGCAGCCACAATCTCCATTAATCAAT
CAAAATGCAACTTCTCCTATGCCGAGAAGTCCAATGGTTCATTATCAACAACAACAGCAA
AATCCAAACAGTCCGATGATGCAAATGGATAGCTCTCCTCGTCCTATGTATGCACAACAA
CAGCAGCAAAATATGCAAATAGAACATAGCAACAATAATTCGAATTTGATGCAAGGAGGA
GGAGGCGGTGGAAATCATATGATGAATCCAGGTAATCCAACTCAAAAGCGAGCACCAATT
AAATTGGGATTAAGAGGAGGTATGCCAATGTATGGAAAAGATGGCAATAAGAAGCAAACT
GGAACTTCAGATATGCTTCAATTGGTTCAAAAAGCTCAGCAAAAACATCAGGCACAACAG
CAATCTCAACAGCAAGGATTTACGATTGAAGCTCAAAGAAATATTTCAGGAAAAGTTATT
CCAACAGCTGTTTCATCAATGGAATTACAACAAGCTTCAAGTAGTAAACAAGTGAAAGCA
AAAACTTCACTTTTAAAGAATCCACTTGGACCTAAGGTAAAATCATTGGTTGATTATGAT
GATAACGATTCATCTAATGGAACACCACCAATAAGTCCAATATCACAAAAAATCAGACAA
AGATTACAAGATGGAAAGATGCATAATGAAGACGTTGTGATTGTTGATAGCAGTCCCGAT
GAGAAACAGAGGCTGACAGATTATGATGATGATAATGATAAAATAACATTAACAGAAGTT
TCTTTAAATTCTACTGCTCAAGATGCAGGTGATGCTGAAATTGTTGAAGCATTCGACGGA
AGCGAACTTGTGTCAAGTCCATTAGTGACTGAACAGGAAGCTACTGATTACACACTTTTT
GATTCACATGTTGTTCATCTGGATGATAGCAATGAATCATTAAAAGAAATTATCAATACT
GATTTGTTATTTGAAGATGCAATTCCAGTAACTAAACCTTCAACTTCAAAAATTGAAACG
TCAAGCAGCATAACTGCAACAGCAAAAAATCCTAAGAATGTTGGAACACGAGAAGATTTT
GAGGCAATGATTGACTCAGGCAAAGATGAAGATGAAAATGAAAGTGAATCTGATGCCGTT
ACAATAGAAAAGAATGATCCTATTGAACAAGAAACCTTAGAAAAACCACAAGGAACTCCT
CATAGAATTTTAACTACTACAACTATTCCGACAGAGGGAAAACGTGAAATTAAATTTCCA
GTTGTTCTCTCATCTTCCTCAGGTGGTCAATTAATTACACTTCCTTCAAACCTTATGAAT
CGAACAAATATTCAAGCATCAGCTCTTGCTGCAACTGCAGGCATGACAGGTGGACAGAAA
AAAATTGTCAAAGGTTCAACTCAAACATTAGCAAAAGTCTCTATTGGAAATACTACAATT
TCTGTTCCTGTTGTACTTAAAAATATGCCAATAACAAATTCTGATCCCTCACAAGGTGCG
AAAAAGATGATCACCACAAATGCTCAAACACTTTCAAGTTTGAAAAAAAATCCAAATATG
TTTAGTGTGTCGGGACAAAAAATTAACACATCAACAATTGTGACACTTTCACTAAACAAA
GGCAACACGAGACCAATTCAAACAGTAGATTTAAAGCAAAGATTACAACAAAATCAAAAA
TCAATTATTGTTTCAGCTGCACCGTCAGGAATAATACAAGCAAGTTCACATAATGTAGAA
AATATTGTATCTAGCGCAACAGTAATAAATAAACTATCAGCAAGTTGTGCATCACCTATT
TTGACATTCTCAAAAATTCCAACATATACTATGACACAAGACACGGCTTTACCAACAAAA
ATATTAGAAGATGATGACGTTTCACCAGAATCAAGTGAAAGTGATAAGAATAAAGTTAGA
GATGTAAGCAAATCATCTGAAAGTACATCGTTATCTATTCCTGAAAGTGGCACAGTAGAA
GGAAATGAAGAGGTCAAAGAGACAATTATAGAGAATAAACCGAAATCATTGATACCAATG
CATGTCATTGTCAAATCACGTGAATCTTCTCAAAGTCCGATAGCATCATCATCATCATCA
ACAACAACAAATCCGACACACGCACAAGCACAACGAATAGTGTCTGGTAATATGTCACAA
CTTTCTCCATTATCGCAGCCAATTGAAATCAATACAAATACACATAATGCTACACAACAA
ATCCGTTCAATAATGTCTTCAATTGACGCGAATGAAGAGTCGAAAAATAAGGCGGAAGTT
GAACAAGTTGCAACATCTACGCAATCAAGTGTTCCAAAGACAATAATACGAAGCACATCT
ACACCAACATCATCTGGCGATACAAAAATAATTATTTCGAGTCAGCCAAGCATTTCTGTA
CCTTCATCGCCAAATACTCAATCAACGAATAATGTAGTTTTCGTGAAGCAAATCAAAGCC
ATTCCATCGTCTTCGTCTGCTGGTTCAACTATTGCTATTAAAAATCAATCAACAACAACA
CTTGCTCAACAACAACAGCAGCCCCTTATTATAACAAAAACAACCAATTTATTGAATATT
TTAAGTAATCCTCCAAGTGGTCAACATCAACAAACTTTCAAAGCGACAGAAATTAAAATG
GAACCTCCCATGAGTGGTGATAGCATTCCCAAAACAGTAACAATTCTCAAATCGAATCCG
ACAATTACAAACTTACTCAATTCAAATTCATTTAAACGATCAAAATCAAGCGATGATGTC
ATTACAAAAGAAACCTCAGAAGCGGCCGTTAATAAACGTTTAAGTTTTGAAGTTTCTAAT
GAAATTAAAGCTGAACCAATTGAACCAGTTGTCATTAAAGAAACAGTTTCAACACCAACA
TCAGTAACAACAGTTGTCAAAACAGAACAAAGTATTACAGTTACACAAGCATCTACTCCT
ATATCTCAGACTCAAAAAATTATTCCACCAAATAAACCTGAAGATTCTCAGAATGTTCTT
CTCAAGCAACTTTTGCAAAATTCTGGATCAGGAGGACCTGTTGGAGGCTCACCGCTAACT
CGAACAGTACCTGGATTAATAACAACACAAAGGGCACCTAGTCTTGGTATGTTTAGTTCA
CTTGAAGCACAATTAGCTCGACCAGTGATTCCACCTGCACCTGCTAAGCAACTAATTGTG
ACTACACAACCACTTGTGACATCAATCCCCATACCGGTTTCATCAATTTCATCGACCCCA
GAAGCAACTGTAAAAACAAGTCAATCAGTAAGTGTCAGCAAATCTATTTCTATCCATGAG
ACTTCCTTTGTATCACAGCCATCTGCTTCTAATATATTAACGACAACAACTACAACACCA
CCTGTTGCGTCACAAACGTTAAATGTGTCAAACATTGGAGAAAAAAAACCAATTGTTATT
CTAAACAGAAATGATATTCCTGCTAGTTTATTGACATCAAACAGCAGTAATATGACAAAA
ACAATTATTTCACCACTCACAACTACGGAAGGAATGGCACCTCCAAATATGACTATAAAG
AAGGAAATGATTTCACCTGGAGGTACCATAAAACCAAATCAATCACCTACACATATCGCA
CCAACATTAGTTAAGAGCATATCACAGTCATCACTCAATTCAGGCACAACCACTCCCGTA
CCACTTGAATCTCCATCAGTTGATATAAAAAAAGAAATGGATGATAGTTCACAATCGGAA
TCTGTTGCTTCTGATGTGAGCATGGTAAAGAATGAAAATATTATGCTTACACCTTCGCGT
GATGGAATGAATGAGCCACTTGATGAAAGTCCTGCTAAAACAGCTGCGGAAATTGCAAAT
GAATTGAAGAAGAAAAAACGACGTGAATATCAGAAAAATCGTCGACAAATGCAAATGTCG
AAAGAAAAAGGCGTTAAAAAGCCAAGGAAATTGCAAAAATCTGAGGAAGATTATGATTCA
TTCATTGATAATCTTATGCTTCAAATAAAGTCTCTCCCACAAATGCCAATTTTAGAGCCA
TCATTACCAAAGAACTATGGTGTTTGCCAAATCTATGGCTCATGTGAGTTGAATAAAGTC
AACAAAAAATATGACACAACGAGTGGTGAATTGACAGGAGTTTTTGGCAAAGGCGAACTT
TCTAATATCTCAGACTTCTACAATACAAAACCATTTGGAGTTCTTGAACCAAAAGTCGAA
AAGACGCCTGCTTCTACACAAAGAGGTTTTTATGATCAAGAATTTCCACCAATCAAGTTT
GATGAGGAAGAAAAAGTTACGCATCATCGATCCAAATATGAATTATTGGCTAAAGATCGT
GATATCGATACACCAGATACTGTTGTAAGTTCATCAAGTCCAGAATGTGTAACTATGAAG
CCACCAAATCGTTTCCCTGGCCTGAGACTGATAAGAGAAGAGGAAGATGCTGAAACAGAA
GATGAAATGACCAGTTTCATTGACAACAGAATGTCTCCATCAATTCCAACTATCATTGCA
CCAATTCCAATACGATTGAAGAGTGGTATTTCGCTAACTTCTGATAATAAAGTTCATAAT
GAGAAAGAATTTGAATTATCAAAACAATTGGGACTAAAATCTTGTTTCGAACCACCAACA
CCTGCTAAAGATAATAATAATAATAATGTTACTGTGACATTAACTCTTACTTCATCAGCA
GCAGAAGATATAATGGGTGTACTCAAATCATTAGCAAATATTCTCAATATTCCAGCACCG
ACTGCATATCAAATTGTAGAGAGAACTACTACACCTCCTAGTCAAAAATTGGGACTTTAT
AGAATAAAAGGAAAAGATGGGAAAGAAGGGCAACCTGTTGATATTCAAACAATTCTCAAT
GGAACAGCAAAATTCTGTAGACATTGTGACGTTGTCATACTTAATAATGCAATTAAGGCA
AAAGCAAACGAATTTCCATTACTTGTCAATACAGAACTAGAATCAAACGAGCTTTATTTC
TGTGGTCAAACGTGCTACAAACAATTCCAATGGCGACCAATAAATATGCTTGATGATAAG
AGTCTCAATTCAACAACAGATGATAAGGCACTCGAAACAATGTCAGAAAACATATCAAAA
GTTGAAATGCAAGGTGATTTAAGTCAATTAAGAGATCAAAATAGAGATTTGAAGAGAAAG
CATGAAGAAATCGAGGACATGAATGAAAGCAAAGAAGACATTTTACAAGCTGAAAAACGT
CAAAAATTAATGAGAATCAAAACTTTCAGTGCAAATAGTTTTCCAAATATTCATAAACAA
AAGAAACTCTCTGAACGCGAGATTACAGAAATGCTCTTTAGAATGAATATAACAGTCAAT
TCGGCACCAAAGATTTTAGAAGATACAAGAAAATGCATCTTATGTCATCAAATAGGAGAT
GGTGTGGCTGATGGTCCATCACGTTTACTTAATTATGATGTTGATAAGTGGGTACATCTT
AATTGTGCATTATGGTCAGATGGTGTATATGAAACAGTAAATGGTGCATTAATGAATCTT
GAAGCAGCACTTCAACAAAGCTTAAATTCTCAATGTACACACTGCAATCATTTGGGCGCA
ACAATCAAATGTTTTAAACCGCGTTGTGGTACACTCTATCATCTCAATTGTGCTATGAAA
GATAATTGTGTGTTTTATAAGAACAAGACAACAATGTGTAATATTCATGCACCGAAATCG
GAAAAAGATAATGAACTAACAACATTAAGTGTGCAACGGCGTGTATATATAGAGCGTGAT
GAAAATCGACAAGTTGCATCTATTATGCATCATTCTGATCTCACGAATCTTATGCGAGTT
GGCAGTTTAATACTTTTGAATGTCGGCCAATTATTACCTCATCAGCTACATACATTCCAT
ACAGCAAATTATATTTATCCAATTGGCTTTAAAATTATAAGATTTTTCTGGTCAATGAGG
TATCCAAATAAAAGATGTCATTACATTTGCTCGATTGCTGATTCTGCAGGAAAACCTGAA
TTTAGAGTTCTTGTAAAAGAACAAAATGAAGAAGATACTGAATTTAAGGACGAGTCACCT
AAAAAAGTATGGCAAAAAATTCTTGAGACGATTGTCAAATTACGAAGAGAAAATCAATTG
GTTCGAGTTTTTCCTAAATATATCAGTGGTGAAGATCTCTTTGGTTTAACTGAACCAGCC
GTTGTGAGGATTTTGGAGAGCTTGCCAGGTGTTGAAACTTTGAATGATTATCGATTTAAA
TATGGGAGAAATCCGCTTCTTGAGTTACCACTTGCGATAAATCCATCAGGTGCTGCTCGT
ACAGAACCTCGTCTAAAACATTCTGTACCGATGAAAAAACCACATACTCAAAGAACTGGA
TCGACCAGTCAACGTCCTGCATTTGTTCCATCAACGTCAGCAGGTGAAATAGCTTGTCCA
TATAGCAAACAATTTGTACATTCAAAGAGCTCCCAATATAAAAAAATGAAATTAGAATGG
AGAAATAATGTATTTCTTGCACGTTCAAAAATTCAAGGTTTAGGGCTTTATGCAGCTCGT
GATTTAGAGAAGCACACTATGGTTATTGAATACATTGGTGAAGTCATTCGAGGAGAATTG
TCTGAATTACGTGAGAAGCAATATGAAGCTAGAAATCGAGGAATTTATATGTTTAGATTA
GATGAAGATCGAGTCATTGATGCCACATTATGTGGTGGACTAGCTCGATATATCAATCAC
AGTTGTAATCCAAATTGTGTCACTGAGATAGTAGAAGTAGATCGTGAATATAGAATTATC
ATATTCGCAAAACGCAGAATAAATCGTGGAGAAGAATTATCATACGATTACAAATTTGAT
ATAGAAGACGAGTCGAGAAAAATTGCATGTCATTGTGGAGCAGCATATTGTAAAAAATAC
ATGAATTAA

>g2655.t1 Gene=g2655 Length=5722
MDSAASFDDDNSDSDYMNVSSSSESSEENEFPAKRKNVPITKRRDGLDISKNPLSIIKSR
LNIRQSTSQAAVKMCLKRKFSFTNATTTTTTKKENDGSGSSSSTINFQLISASSSSSQPQ
SQSQSPNDEQSMELESPSGSGIGNNLSTLMPAPMQGSDDPPYFPEKFPSKLCAFCNLGER
SQLGQGEMLRLELSEEDSKNALKLKSQQSPQEDESKNGSDDLSKLLKNASSSTLLQQQLN
RRQKGLNKCKNPVATNEYVDELEKIGYTEAMDLSLIVENGYYYVHRACAMWSFGVDRDPI
NEALSNVSTVLKQSLNRKCTHCNHYGASAVCKINCQKFFHFPCIAASGSFEDFQSCSVYC
FDHLSQVAVNCGEEAYCRACCNLGDISNLMVCSKCGDHYHGACVGLAQQPGVRAGWQCKN
CRSCQICRIPDNSDGRTLACETCDKLYHPQCLRPIMTTVPKYGWKCRCCRICSDCGARTP
GAGASSRWHNHFTVCDSCYQQRNKGYSCPICRKAYRAAAYREMVKCSICQKFVHSTCDPE
ADLNAYERRKEVNPDYEYTCGMCKIATQNERVNLAMRRSNSGDDESLSASQESLDDIDMD
IDGRMGSREDLALGLGKGKPMLASKIAKKKLGLNNTGGINSNNSGFGQRPKGIGKLGFQK
RARTFELGRKRGPKSKMRGIFGVPGIGLQRPVAADSKQSDEEPGSENRLVLCSAKDRFVL
TQDICVMCGAIGTDQEGCLISCAQCGQCYHPYCVTVKVTKEILQKGWRCLDCTVCEGCGE
KNDEARLILCDDCDISYHIYCMEPPLTTVPQGTWKCKWCAICFKCGSNEAGNNCTWQNSY
TECGPCASQSNCSVCSEIYADGELIIQCTNCDRWLHCLCDSIKNDTEAEKCAEEGYICLL
CRPKDILAPHLQIKKKPPTLSTQSSTTTKEESIDENDNATILALEGSHFVDGVYLSEHGL
QFIKTLQTEPKRAKRKPKVIQEAEKDAGILAAIESVVSGSADNSMEDIKMEPMDPNEEAQ
IYKDGMTWTNNEPAPEGFSLFTNESGQVILRKKRQRNLQKLGIGGFAVRNRAVRTTKNDE
EPSNQCDDAKKKKPNRRKQKNKLIETYPTYLQEAFFGRPLMDSSLKVKLESSSSDEETKS
NVSDDKTIKLSLDELKMIEAMRAKQQQKLQEEQKLLQVNVVNPLEQQQLMSNGAQIQLNQ
KMQSTDTTSVFNQNTTGSIASTSSSNINQTEIKTELELMDDDENNSDTEALKDVLGLPGD
LLDNDLVNTIMNEDDDDLTKNTAGLDDVDVKGVKDDLADILSPHFNIDMEDMLFKSVLTD
ESQESQESQLTNSLTSYSTQSTPSHQSEVQPIHLNQPTNTSLQNNINQMQSPQQQPSTPT
ATIMNPMTLQPQSPQINTQLNPMNVMLQQQQQPNQMHQMQANMMQRQNSQPGTPIGLMPQ
QQQRGYNFNSDFCSYRPQFNTANVFQNQPQWNANVDNDFDLVNMNAAASQATAVAGQPQK
INNHQKNSERMSEDEKLGDMSTISAVLYANQNHPELKIEYPHWPDRYKQIMKKWRALSAE
RKQPYLQRARDNRSAQQAQQRTKKAQQVAPDLSPDNGTTSTNSTSPALSSTLSESSSLMD
VSMIQQQHQSQQQQQVSTPTTVNVGPPQQLQVVRNNQQSPGRSSTPSSSSNLLPEHVLET
NALDNKNDSTSPNNIKNNLLIIANEHKEFSCSIGDENISSPISNNSNNTINDKISESDEA
NLLLQVKSSSTSFDMSSKSDNNCSLLNKNANISVASNIANAFINLTTSLPKNNNNANNIS
NNNGSGSNNSNCLLNASGLSQTKNVNVIVSQQPLSAQHFMKTLSFTTSRGIFVPNVIATN
IAPQFTVHQYGLHGASSSGPAPNVTVNPNGNFNTNSNPTLRPAIPNHFRLLLQQQNANIN
LNIINNNNNNQQQSNVNPSPAPSPILEASLQTITKEATMQATKTTVLPTKTIFPNQIIHP
PQRSITPFTPIQQKIDQDHEMVQVEESSSLTNSTADQHIRVLTPSEIMKTLPSLSTHDNV
CFNSSSMSTASIEKQSNETDINALSTNNNKKLQNASQHHHHDPNASGSVISILSNNSAIS
SPLTITTTTITSECNTFTSSHFTTITKATLSSTTSNNAQIIMDQDKIAAQQKSMKEAEQE
RQWKMLQAQRAREQQLGMPMDHQRMNDPNMQLQISTNLNDGSMSPVASPSPNSRNQFMAP
NKGRMMMNPQSPSSSNFQHPGRPVSAQLQRQQSQRINQSPFSPQTGTPQSPNNIFPGSPS
ADGFQRQQSLDEQQFLHSPQTPKSIQQQSPVHTPTSANMSPVYSQVNQSQQQQQAPNIRP
LDGVNAYAQAPGTPRPSFNPGQTRTTVYARPDMFNKPPFVQSNQNNPEQSNRQLRDLLQR
SQAPTNLPGPTSSAFSMENDMMKNQQTLQTGTDNTFRQPLPPGIRQQRMQSMVGGQMIRA
GQLTGTGQRMIMTPDNRPRLNIRPGMNMNVPQMMTDQHQQQQQMNQAGLNQTQRMPINQG
TNFNPQNEMMQQSNMIQQNTPGQNMMMQQRLVASQNPSMQHALQGNQINPTQPNTTDQNV
VHQQTGTDVEGIPDSVTAELEKLEQDENVGMDGVGDILGGLGDDDDDLLDSLTAEMGADF
NILEYADPELDTTDEKSTLLDSLEMDEGENAKEEKLKNLEAEKIAKTNFPRTQQDMSNVQ
MQPNQQINRMQMQMTNINPQMQQSIDGQNVNQNAGPSQQQPAQIPQQQQQPGQVQNQQQM
FQQANVQRQVRFKTIPPNQIPEIQQIHQQMMLQLQQAAANGKPMPIGTRLVANNNITGIV
TGPNNISLTFPTGQQRLQQVRMVNPNMMQQNQQNNPRMALPHMVQNRPNMMQNAQQQQIN
PNIQGLINQTQQPQQQQAQQSQQVPPPPYPEPPPPYPGQANQNQEQPLLLEDLLEQEKRE
QARNIGTNQMDMNIPQNQQQQQSLFSDQDYEKLRADVLTTTTQSISMPQQIQQPQQVQVQ
PQQPQFAPRGIINKQWRPQTPGVVNSQAPSTSPDIVRSVPIFNANLTPMPPMPPEIIQTE
TDKQIQSNYEAWLITQNDSLQKQLHYYETEIAELRKLKKSLNTKQRQLKKNGGDLNEIDA
QTLLKVTHEQAAVQKHLESSRKQARNHLQMKQDYDNKQKSKQMANISHMAQSPVGVQMND
QSPMMSPSPNIIQQPVQSPLGNPIMAPSQSPLHSPSPMMSSQSPGPNSIMQSPGGHINNA
MSPYNTMQQSPRIGTPHSQIDESPFSPNSIESPSINSRLTSPIPRMTSPQHRPNTPMQIQ
MMNRMPVQFNQQQNMNMNQQNRFIRPQMIPNDSNSRMAGMRMPVQQFQQQMQHGGGNVIR
QVQYDANGNPQNVQTISQANQQMDPQRAMQIRQMQMRQQQMMKQQQQQQQQQGQMQMQQM
SQVQPNQSPIHQQPQSPLINQNATSPMPRSPMVHYQQQQQNPNSPMMQMDSSPRPMYAQQ
QQQNMQIEHSNNNSNLMQGGGGGGNHMMNPGNPTQKRAPIKLGLRGGMPMYGKDGNKKQT
GTSDMLQLVQKAQQKHQAQQQSQQQGFTIEAQRNISGKVIPTAVSSMELQQASSSKQVKA
KTSLLKNPLGPKVKSLVDYDDNDSSNGTPPISPISQKIRQRLQDGKMHNEDVVIVDSSPD
EKQRLTDYDDDNDKITLTEVSLNSTAQDAGDAEIVEAFDGSELVSSPLVTEQEATDYTLF
DSHVVHLDDSNESLKEIINTDLLFEDAIPVTKPSTSKIETSSSITATAKNPKNVGTREDF
EAMIDSGKDEDENESESDAVTIEKNDPIEQETLEKPQGTPHRILTTTTIPTEGKREIKFP
VVLSSSSGGQLITLPSNLMNRTNIQASALAATAGMTGGQKKIVKGSTQTLAKVSIGNTTI
SVPVVLKNMPITNSDPSQGAKKMITTNAQTLSSLKKNPNMFSVSGQKINTSTIVTLSLNK
GNTRPIQTVDLKQRLQQNQKSIIVSAAPSGIIQASSHNVENIVSSATVINKLSASCASPI
LTFSKIPTYTMTQDTALPTKILEDDDVSPESSESDKNKVRDVSKSSESTSLSIPESGTVE
GNEEVKETIIENKPKSLIPMHVIVKSRESSQSPIASSSSSTTTNPTHAQAQRIVSGNMSQ
LSPLSQPIEINTNTHNATQQIRSIMSSIDANEESKNKAEVEQVATSTQSSVPKTIIRSTS
TPTSSGDTKIIISSQPSISVPSSPNTQSTNNVVFVKQIKAIPSSSSAGSTIAIKNQSTTT
LAQQQQQPLIITKTTNLLNILSNPPSGQHQQTFKATEIKMEPPMSGDSIPKTVTILKSNP
TITNLLNSNSFKRSKSSDDVITKETSEAAVNKRLSFEVSNEIKAEPIEPVVIKETVSTPT
SVTTVVKTEQSITVTQASTPISQTQKIIPPNKPEDSQNVLLKQLLQNSGSGGPVGGSPLT
RTVPGLITTQRAPSLGMFSSLEAQLARPVIPPAPAKQLIVTTQPLVTSIPIPVSSISSTP
EATVKTSQSVSVSKSISIHETSFVSQPSASNILTTTTTTPPVASQTLNVSNIGEKKPIVI
LNRNDIPASLLTSNSSNMTKTIISPLTTTEGMAPPNMTIKKEMISPGGTIKPNQSPTHIA
PTLVKSISQSSLNSGTTTPVPLESPSVDIKKEMDDSSQSESVASDVSMVKNENIMLTPSR
DGMNEPLDESPAKTAAEIANELKKKKRREYQKNRRQMQMSKEKGVKKPRKLQKSEEDYDS
FIDNLMLQIKSLPQMPILEPSLPKNYGVCQIYGSCELNKVNKKYDTTSGELTGVFGKGEL
SNISDFYNTKPFGVLEPKVEKTPASTQRGFYDQEFPPIKFDEEEKVTHHRSKYELLAKDR
DIDTPDTVVSSSSPECVTMKPPNRFPGLRLIREEEDAETEDEMTSFIDNRMSPSIPTIIA
PIPIRLKSGISLTSDNKVHNEKEFELSKQLGLKSCFEPPTPAKDNNNNNVTVTLTLTSSA
AEDIMGVLKSLANILNIPAPTAYQIVERTTTPPSQKLGLYRIKGKDGKEGQPVDIQTILN
GTAKFCRHCDVVILNNAIKAKANEFPLLVNTELESNELYFCGQTCYKQFQWRPINMLDDK
SLNSTTDDKALETMSENISKVEMQGDLSQLRDQNRDLKRKHEEIEDMNESKEDILQAEKR
QKLMRIKTFSANSFPNIHKQKKLSEREITEMLFRMNITVNSAPKILEDTRKCILCHQIGD
GVADGPSRLLNYDVDKWVHLNCALWSDGVYETVNGALMNLEAALQQSLNSQCTHCNHLGA
TIKCFKPRCGTLYHLNCAMKDNCVFYKNKTTMCNIHAPKSEKDNELTTLSVQRRVYIERD
ENRQVASIMHHSDLTNLMRVGSLILLNVGQLLPHQLHTFHTANYIYPIGFKIIRFFWSMR
YPNKRCHYICSIADSAGKPEFRVLVKEQNEEDTEFKDESPKKVWQKILETIVKLRRENQL
VRVFPKYISGEDLFGLTEPAVVRILESLPGVETLNDYRFKYGRNPLLELPLAINPSGAAR
TEPRLKHSVPMKKPHTQRTGSTSQRPAFVPSTSAGEIACPYSKQFVHSKSSQYKKMKLEW
RNNVFLARSKIQGLGLYAARDLEKHTMVIEYIGEVIRGELSELREKQYEARNRGIYMFRL
DEDRVIDATLCGGLARYINHSCNPNCVTEIVEVDREYRIIIFAKRRINRGEELSYDYKFD
IEDESRKIACHCGAAYCKKYMN

Protein features from InterProScan

Transcript Database ID Name Start End E.value
39 g2655.t1 CDD cd15509 PHD1_KMT2C_like 377 421 2.00811E-18
37 g2655.t1 CDD cd15510 PHD2_KMT2C_like 424 469 5.46391E-20
34 g2655.t1 CDD cd15489 PHD_SF 507 563 6.49789E-7
41 g2655.t1 CDD cd15512 PHD4_KMT2C_like 724 772 1.65562E-27
42 g2655.t1 CDD cd15513 PHD5_KMT2C_like 774 820 3.62461E-29
36 g2655.t1 CDD cd15514 PHD6_KMT2C_like 852 901 1.0359E-24
35 g2655.t1 CDD cd00084 HMG-box 1526 1579 8.76817E-6
38 g2655.t1 CDD cd15666 ePHD2_KMT2C_like 5212 5316 1.06034E-61
40 g2655.t1 CDD cd19171 SET_KMT2C_2D 5569 5721 1.55877E-102
32 g2655.t1 Coils Coil Coil 1155 1175 -
31 g2655.t1 Coils Coil Coil 2140 2174 -
33 g2655.t1 Coils Coil Coil 3077 3111 -
28 g2655.t1 Coils Coil Coil 3391 3411 -
27 g2655.t1 Coils Coil Coil 4696 4723 -
30 g2655.t1 Coils Coil Coil 5120 5161 -
29 g2655.t1 Coils Coil Coil 5721 5722 -
25 g2655.t1 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 263 367 2.5E-11
26 g2655.t1 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 368 423 1.5E-6
24 g2655.t1 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 424 473 7.7E-10
21 g2655.t1 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 501 566 3.6E-10
23 g2655.t1 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 719 829 8.4E-26
19 g2655.t1 Gene3D G3DSA:2.60.120.650 Cupin 846 1018 1.1E-6
20 g2655.t1 Gene3D G3DSA:1.10.30.10 DNA Binding (I) 1496 1587 2.4E-26
22 g2655.t1 Gene3D G3DSA:3.30.40.10 Zinc/RING finger domain 5206 5319 3.7E-22
17 g2655.t1 Gene3D G3DSA:3.30.160.360 - 5357 5494 5.1E-45
18 g2655.t1 Gene3D G3DSA:2.170.270.10 SET domain 5526 5722 1.7E-51
72 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 1 52 -
71 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 10 27 -
70 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 28 49 -
78 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 111 138 -
55 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 1074 1102 -
81 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 1074 1092 -
63 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 1324 1349 -
59 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 1493 1515 -
76 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 1563 1616 -
58 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 1569 1616 -
74 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 1622 1641 -
66 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 1654 1674 -
60 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 2231 2270 -
67 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 2295 2316 -
62 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 2730 2762 -
56 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 2888 2902 -
75 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 2888 2909 -
73 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 3134 3158 -
80 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 3134 3164 -
65 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 3248 3291 -
77 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 3379 3469 -
61 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 4040 4063 -
68 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 4040 4085 -
79 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 4106 4129 -
69 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 4630 4670 -
57 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 4702 4734 -
64 g2655.t1 MobiDBLite mobidb-lite consensus disorder prediction 4713 4734 -
5 g2655.t1 PANTHER PTHR45888:SF3 HL01030P-RELATED 174 1845 0.0
7 g2655.t1 PANTHER PTHR45888 - 174 1845 0.0
6 g2655.t1 PANTHER PTHR45888:SF3 HL01030P-RELATED 4311 5722 0.0
8 g2655.t1 PANTHER PTHR45888 - 4311 5722 0.0
3 g2655.t1 Pfam PF00628 PHD-finger 775 821 1.0E-8
2 g2655.t1 Pfam PF05964 F/Y-rich N-terminus 5361 5412 8.2E-20
1 g2655.t1 Pfam PF05965 F/Y rich C-terminus 5418 5502 3.6E-27
4 g2655.t1 Pfam PF00856 SET domain 5593 5697 3.8E-18
91 g2655.t1 ProSiteProfiles PS50016 Zinc finger PHD-type profile. 374 424 8.84
88 g2655.t1 ProSiteProfiles PS50016 Zinc finger PHD-type profile. 421 472 9.543
90 g2655.t1 ProSiteProfiles PS50016 Zinc finger PHD-type profile. 722 775 8.992
87 g2655.t1 ProSiteProfiles PS50016 Zinc finger PHD-type profile. 772 822 9.712
89 g2655.t1 ProSiteProfiles PS50016 Zinc finger PHD-type profile. 849 904 9.098
84 g2655.t1 ProSiteProfiles PS50118 HMG boxes A and B DNA-binding domains profile. 1518 1584 8.863
82 g2655.t1 ProSiteProfiles PS51805 Extended PHD (ePHD) domain profile. 5209 5317 24.662
86 g2655.t1 ProSiteProfiles PS51542 FYR domain FYRN motif profile. 5355 5415 28.789
83 g2655.t1 ProSiteProfiles PS51543 FYR domain FYRC motif profile. 5416 5501 31.364
85 g2655.t1 ProSiteProfiles PS50280 SET domain profile. 5582 5698 17.634
92 g2655.t1 ProSiteProfiles PS50868 Post-SET domain profile. 5706 5722 9.151
48 g2655.t1 SMART SM00249 PHD_3 318 360 11.0
44 g2655.t1 SMART SM00249 PHD_3 376 422 5.9E-5
45 g2655.t1 SMART SM00249 PHD_3 423 470 1.5E-7
49 g2655.t1 SMART SM00249 PHD_3 507 564 2.4E-5
43 g2655.t1 SMART SM00249 PHD_3 724 773 1.6E-6
50 g2655.t1 SMART SM00249 PHD_3 774 820 1.6E-10
46 g2655.t1 SMART SM00249 PHD_3 851 902 0.013
51 g2655.t1 SMART SM00398 hmgende2 1517 1585 0.0017
47 g2655.t1 SMART SM00249 PHD_3 5271 5317 0.59
53 g2655.t1 SMART SM00541 fyrn_3 5371 5414 2.7E-16
52 g2655.t1 SMART SM00542 fyrc_3 5420 5507 1.9E-32
54 g2655.t1 SMART SM00317 set_7 5582 5704 1.4E-40
14 g2655.t1 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 371 429 8.2E-7
13 g2655.t1 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 414 477 4.65E-12
16 g2655.t1 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 503 566 1.23E-5
11 g2655.t1 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 720 774 6.2E-7
15 g2655.t1 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 766 825 2.66E-11
12 g2655.t1 SUPERFAMILY SSF57903 FYVE/PHD zinc finger 840 907 1.04E-8
9 g2655.t1 SUPERFAMILY SSF47095 HMG-box 1527 1573 5.59E-6
10 g2655.t1 SUPERFAMILY SSF82199 SET domain 5564 5722 1.7E-46

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005634 nucleus CC
GO:0005515 protein binding MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values