Gene loci information

Transcript annotation

  • This transcript has been annotated as Hemocytin.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g5301 g5301.t1 isoform g5301.t1 8587960 8606608
chr_2 g5301 g5301.t1 exon g5301.t1.exon1 8587960 8588085
chr_2 g5301 g5301.t1 cds g5301.t1.CDS1 8587960 8588085
chr_2 g5301 g5301.t1 exon g5301.t1.exon2 8590067 8590183
chr_2 g5301 g5301.t1 cds g5301.t1.CDS2 8590067 8590183
chr_2 g5301 g5301.t1 exon g5301.t1.exon3 8590256 8590601
chr_2 g5301 g5301.t1 cds g5301.t1.CDS3 8590256 8590601
chr_2 g5301 g5301.t1 exon g5301.t1.exon4 8590665 8591303
chr_2 g5301 g5301.t1 cds g5301.t1.CDS4 8590665 8591303
chr_2 g5301 g5301.t1 exon g5301.t1.exon5 8591469 8591612
chr_2 g5301 g5301.t1 cds g5301.t1.CDS5 8591469 8591612
chr_2 g5301 g5301.t1 exon g5301.t1.exon6 8592330 8592668
chr_2 g5301 g5301.t1 cds g5301.t1.CDS6 8592330 8592668
chr_2 g5301 g5301.t1 exon g5301.t1.exon7 8592730 8592820
chr_2 g5301 g5301.t1 cds g5301.t1.CDS7 8592730 8592820
chr_2 g5301 g5301.t1 exon g5301.t1.exon8 8592896 8593384
chr_2 g5301 g5301.t1 cds g5301.t1.CDS8 8592896 8593384
chr_2 g5301 g5301.t1 exon g5301.t1.exon9 8593740 8594050
chr_2 g5301 g5301.t1 cds g5301.t1.CDS9 8593740 8594050
chr_2 g5301 g5301.t1 exon g5301.t1.exon10 8594183 8594446
chr_2 g5301 g5301.t1 cds g5301.t1.CDS10 8594183 8594446
chr_2 g5301 g5301.t1 exon g5301.t1.exon11 8594590 8594736
chr_2 g5301 g5301.t1 cds g5301.t1.CDS11 8594590 8594736
chr_2 g5301 g5301.t1 exon g5301.t1.exon12 8594960 8595079
chr_2 g5301 g5301.t1 cds g5301.t1.CDS12 8594960 8595079
chr_2 g5301 g5301.t1 exon g5301.t1.exon13 8595165 8595658
chr_2 g5301 g5301.t1 cds g5301.t1.CDS13 8595165 8595658
chr_2 g5301 g5301.t1 exon g5301.t1.exon14 8595713 8596480
chr_2 g5301 g5301.t1 cds g5301.t1.CDS14 8595713 8596480
chr_2 g5301 g5301.t1 exon g5301.t1.exon15 8596631 8598942
chr_2 g5301 g5301.t1 cds g5301.t1.CDS15 8596631 8598942
chr_2 g5301 g5301.t1 exon g5301.t1.exon16 8599219 8599339
chr_2 g5301 g5301.t1 cds g5301.t1.CDS16 8599219 8599339
chr_2 g5301 g5301.t1 exon g5301.t1.exon17 8599398 8599686
chr_2 g5301 g5301.t1 cds g5301.t1.CDS17 8599398 8599686
chr_2 g5301 g5301.t1 exon g5301.t1.exon18 8599883 8600392
chr_2 g5301 g5301.t1 cds g5301.t1.CDS18 8599883 8600392
chr_2 g5301 g5301.t1 exon g5301.t1.exon19 8600448 8600832
chr_2 g5301 g5301.t1 cds g5301.t1.CDS19 8600448 8600832
chr_2 g5301 g5301.t1 exon g5301.t1.exon20 8600967 8601008
chr_2 g5301 g5301.t1 cds g5301.t1.CDS20 8600967 8601008
chr_2 g5301 g5301.t1 exon g5301.t1.exon21 8601159 8601386
chr_2 g5301 g5301.t1 cds g5301.t1.CDS21 8601159 8601386
chr_2 g5301 g5301.t1 exon g5301.t1.exon22 8601451 8602636
chr_2 g5301 g5301.t1 cds g5301.t1.CDS22 8601451 8602636
chr_2 g5301 g5301.t1 exon g5301.t1.exon23 8602706 8603395
chr_2 g5301 g5301.t1 cds g5301.t1.CDS23 8602706 8603395
chr_2 g5301 g5301.t1 exon g5301.t1.exon24 8603469 8603763
chr_2 g5301 g5301.t1 cds g5301.t1.CDS24 8603469 8603763
chr_2 g5301 g5301.t1 exon g5301.t1.exon25 8603937 8604158
chr_2 g5301 g5301.t1 cds g5301.t1.CDS25 8603937 8604158
chr_2 g5301 g5301.t1 exon g5301.t1.exon26 8605694 8605866
chr_2 g5301 g5301.t1 cds g5301.t1.CDS26 8605694 8605866
chr_2 g5301 g5301.t1 exon g5301.t1.exon27 8605942 8606356
chr_2 g5301 g5301.t1 cds g5301.t1.CDS27 8605942 8606356
chr_2 g5301 g5301.t1 exon g5301.t1.exon28 8606427 8606608
chr_2 g5301 g5301.t1 cds g5301.t1.CDS28 8606427 8606608
chr_2 g5301 g5301.t1 TSS g5301.t1 NA NA
chr_2 g5301 g5301.t1 TTS g5301.t1 NA NA

Sequences

>g5301.t1 Gene=g5301 Length=11445
ATGTTCACCACAAAGAAGGAAATTCCCATTCCTGTTCAAATTGCAGGATTAAAAGTGACC
AGATCAGGACTTGATGTTAGAATTTTACTTGAATCTATACCATTGACTATCACATGGGAT
ACAAAAAAATTTGTGCAGATTGATGCATCAGCTTCGATTTTCAATAGAACAGCTGGCATT
TGTGGTACAATGGATGGAATCATGAGTAACGATTTTATGTCGAAAGACGGAGAATTTCAT
AAAATGCCAGCAACATTTGTTGATTCATGGCGTGCATTGAATATGGATAAGAGTAAAGAA
AAATGCATTCCACGTTCACAAAAACTAGCAGCAAAAGGTTGTGAGACGGCTGTCGAGGAA
AAGGCAAACCTTGTATGCACAGATCTATTAAAGAATCCCAAAATGGACAATTGCTTTAAA
ATGTTCAATATGGAAGTTTTAATGAAAAATTGCATCTCAGATTATTGCGATTGCAAGAAC
TTATATGAACGTACTGAATGCATTTGCAATGGCATTGCTGTACTTGCTAAAGACTGTCGT
TTTAGAGGCGTTATGTTAGAAGATGGATGGCGTGATTGGCAGATTTGTCCTTTAAATTGC
ACAAATGGTCGAGTTTATAAATCATGTGGACCAATAAATGAAGCATCTTGTGGTTCTGCT
ATTGAAACTGATTCATTTAATTGTCGTGAAGGATGCTTCTGTCCCGATGGTATGCTACTC
AATGATGGCATTTGTGTTAAAAAAGAAATGTGTCCATGCAAGTTAAAAGGCAAAATTTTT
CAACCAAATAGCAAAATCAAGCGTGATTGCAATACATGCGTTTGTGAGGAAGGCAATTGG
AAATGTAGCTCGTTGACATGTGGTTCACGTTGTTCCTCAGTTGGCGATCCTCATTACAGC
ACATTTGACGGGAAACGATATGATTTTATGGGCAAATGCTCATACTATTTAATGAAAACA
GAAAATATCATAGTAGAAGCAGAAAACGTTGCATGTAGTGGAGCAATATCAGAGAGCATG
AATTTCTTGCCATCTTTCACTACAGAATTACCATCATGCACAAAGTCACTCACTATTAAA
TATAATGACAACACTGGACAGCAGCGGATAATTAAATTGAAACAGGGTGGTTTTGTGCTT
GTCGATGGTTTTGAAGTTTCAAAATTACCAAGGGATTTATCTGATGGTGCAGTTGTAATA
AGACAAGCAAGCTCTAGTTTTATTCTAGTTGATTTCAATGATGGTGTTCAAGTATGGTGG
GATGGTTTAACACGAGCTTATATTGATGCACCAGCATCTTATCGTGGCAAGACTCAAGGG
CTTTGTGGAACATTTAATTCAAATTTACAAGACGACTTTTTAACACCGGAAGGTGATGTT
GAAACAACAGTCACACCATTTGCAAATAAATGGCAAACAAAAGAATCGTGTGATCAAATT
AGTGATAAAATTGTTCCACATCCATGTCAAACAAATCCAGAAAATAAACCAAAAGCTGAG
AAAATTTGTCAGAAACTTCTTAAAGATAAAATATTTGAAGAATGTCATCTTTTTGTTGAT
CCTGAGCCATTTTATGAAGATTGTCTTTATGATATGTGTGCATGCAAGGGTGAGGTTAGT
CAATGTGCATGTCCAATTTTAGCTTCATATGCGACAGAATGCGCTCGTCAGGGAACAATT
CTCAATTGGCGTTATCAAGTTAATGAATGTGTCATCAACTGCCCCAACGGACAAGTCTTT
GAGCAATGTGGCGAAGCTTGCTCACGAACATGTTCAGATTTGCAAAGAGAAGCACCATGC
AAATTGCAGTGTGTTGAAGGTTGTCGTTGTCCACTTGGCCAAGTGCTTGATGAAAATAAT
GAATGTGTGGCTGTTTCTATGTGCAAATGCATTTATAAAGGCTTAGAATTTAAGCCAAAC
TATAAGGAAGTGAGACCTGGACAAAAATTTTTGCAATTATGCACATGTATGAATGGCAAA
TGGAATTGTGTTGAAGCAAAAGAAGGGGACTCTGTCAAATATCCACCAGCTGAAGATATT
TCAAAGAAATGTTCGTCAGTAAATAATGAAGTTTTCACAACATGCGAACCGGCAGAGCCT
TTAACATGCAAGAACATGCATATGAATGTGACATCGACTACAGCAGTATGCAGAGCAGGA
TGCAAATGCAAAGAAGGCTTTGTGCTTGATACACTTTTGAAAAAGTGTGTTTTACCTGAA
AGCTGCTCATGTCATCATGGTGGACGAAGTTATAATGATGGCGAGAAAATTAAAGAGGAT
TGCAACACATGCTTATGTAAAGGAGGGAAATGGTCGTGTACAACAAAAGATTGTGCTGCA
ACTTGTAGTTTATGGGGTGATAATCATTTTACAACATTTGATGGTCGTGAATTTGATTTC
CAAGGAGTCTGTTCATATGTTCTTGCAAAGGGAAAAATTCTCAATAATGATGGATTTTCG
GTTTCAATTCAAAATGTTCTCTGTGGTTCAAATGGTGTAACATGTTCGAAAACATTGACA
ATAAATCTTGTCGGAAATGAACCAGAATCAATTACACTCTCATCAGATTCACCCGTGGCA
GGCATCAAGGGCTTATCAAATGAAGCAAAGCCAATGAAAAAGATGCTCACGTATCGCTCG
GGTGTGTTTGTCGTTGTTGAAATGCCACACATCGGTGTGCTGCTTAAATGGGATCGAGGC
ACACGAGTGTATTTGAGATTGGAGAATCGTTGGAAGGGTAAAGTTCAGGGATTATGTGGA
AATTACAATTATGATGCTCTAGATGACTTTATCAATCCAAGCGGCGGCATTGAGAGTAAT
CCAATTATTTTTGGCCATTCATGGAAGCTCGATGACTCTTGTGCTAAACCAAGCGAACAA
GTCGACTCATGTTCATTAAATCCACAAAGGAAAACATGGGCACAATTGAAATGTGGACTA
CTGAAATCAAGTAATTTTGCACAATGTCACTCGGAAGTCTCTGTTGAAAGCTTTTATAAA
CGATGTATTCATGACACTTGTAGCTGCGATCAAGGTGGTGACTGTGAATGTCTCTGTACA
GCTTTAGCAGCATATGCTTACGCTTGCACAGCAAGTGGTGTGAATATACGCTGGAGAACA
CCTGATTTGTGTCCAATGCAGTGCGATCCAGAATGCTCAAATTACAATCCATGCATTCAA
TCATGTCCAGTTGAAACATGTGACAACATGATGCATCCGTTGAAAAATGAAAGACTTTGT
AGTTCAGATAATTGTGTTGAAGGTTGCAAGCTAAAAGAATGTCCAATTGGTAGCGTTTAT
AAGAATGACTCGTTTTTGGAATGTGTACCAAAGTCAATTTGTAAGCCAATATGTTTACAA
CAGGATGGCATTACATATTATGAAGGTGATGTAATGGCTTCCGATGCTTGTCACACGTGT
AAATGCACACGAGGTAGCAAAGTTTGTGTCGGTATTCCATGCGCAACTGAATCGCCACCA
CTAATTGGTGATTTGTTTGGATGTGAGACAGGATGGACAGCATGGATTAATCAAGATTCA
CTCGATAATCTTCAAAGCAAAATTCCATCTAAGACAAATTACTTCAAAGAAGATGATATT
GAGCCGTTGCCTAGTACACTACATTTTAAAAATATTGGTGGAAAAATTCTTTGTAAGCCT
GAATTTTATGCTGCTATTGAATGTCGAACTGTTTTTGGTCATAATCATCCTAAATCAACT
GGACAAAATGTAGAATGTTCCCTTGAGCGTGGCCTTGTCTGTAAAGGTCAATGTTTTGAT
TATGAAATTAGAGTTTTTTGTGATTGTGGTGAAGCCACAACAAAATTGACAGGAATTTTG
GGACCAACAATGCCACCAGTAAAAATAATTTCTCCTTTACCAACAACTTTACCACCAGTG
AAAATTGTTCCAACAAAATCTCCAATAATTACTCCAAGTATCTGCGATCCAACCATTCCA
AATGTTGAACATCCACTCGATTGTACTAAATATTTACAGTGTGTGATGGCAAACAACGGA
AGTTATGTTTATGTGGAAAAAACTTGTGGTGCTTCGATGATGTTCAATCCAACCTCAATG
ACATGCGATTGGCCGGCAAATGTTATTGCGAAAAAACCAAAATGTGGAAAACCCGAAGAG
CCAATGCAAGCTGAATGTCCGATGGGTTATGTATGGAGTGAATGTGCTGTTCCGTGTCGC
CGCGCTTGTAATTATTACGGAAAACTACTGACTATGAACGGAAATTGTACAACTGCATCG
AATGATTGCATTCCTGGCTGCATGCCGCAAGAAAGCGCAGTTAAATGTGAATATCCACGA
TTATGGAGAGATTGGAAGAGTTGCGTTGAAATTACTGATTGTACATGTATGGGACCTAAC
AATGAGCAACTCAAGCCTGGTCAAGTAGTCAAAGTCAGCGATTGCAAAACATGCCAATGC
ATCAGAAATGAATATATTTGCACAGATACTCCATGTGCAACAACAGCTAAACCAATAAAA
ATAATTTCAACAGTCAAGCCTATTCCAGAAGTAGAAGTTTTAATAACTCAACAAAAAACA
ACGCCTGGACCTCAAATACCACAAACAACAAAGAAATCTGTCATAATTATTGAGCAAGAA
GTTATAGAAAGACCTTATTATTTGACAATTTGTGATCCATCAATTCCTCATGTTGAACAT
CCAAACAGTTGCTATAAATTCCTGCATTGCATGCCAGCATCAAATGGAAGTTATGTTTAT
GCTGTTAAAACTTGCTATCCTGACATGATGTTCAATCCAATTTCAATGATTTGTGATTGG
CCAATAAATGTTAGACATGTAAAGCCACAATGTGCAGATAATCCAGGTGAAATTGAAATT
TGGGAAGAAACAGAAATTAAGACAATAATAAAGCAAACAACACCAAGTACAAAGAGAAGA
TTGACTTTCCCACCAATTGATGAAACAGATTATGCAGAAGGAAAATTACCAACTTATATT
AGAATTTGTGATCCAACAATTCCAATGATTCCACATCCAAAAAGTTGTTATAAATATTTG
CATTGTGTTCAAGCTCCAAATGGAAGTTTTGTGTATACTGAAAAGACTTGCTTCCCTGAT
ATGATGTACAATCCAAATGCTTATAATTGTGATTGGCTTGATAATGTTTTTAAAATTAAA
CCTGAATGCAATGAAAATCCTGGTGAAACTGAACTTTGGACTGAAATTATTAAGAAATAT
ACAAGAATTATTTCACCCACAACTCCTGCTTTTGGAACTTCTCCTAAACCTTCATTTATT
ACAACTATAGCTTCAATTCAAACATCACCAAAACCTGTAAAAATTATTGAAGAAATTCCT
TATTATATTCGTATTTGTGATCCATCAGTGCCTCATGTTGAACATCCGAACAGTTGCTAT
AAATTCTTACATTGTGTTCCTTCATCAAATGGAAGTTATATTTATGCAGTTAAAACTTGC
TATCCAGCTACAATGTTTAATCCAGTGACAATGATTTGTGATTGGCCAGATAGTGTGAAA
AAAGTCAAACCACAATGTGGAACTGATCCAGGTGAAATTGAAATTTGGGAAGAAGAAGTG
ATTAAGAAAAAGACAATTATTAAACCTGTGACAACACAAAAACCATCAATGTCAATAGCA
CCAACTATTCCACTTTATATTGAACAACATGTGACTGAATGTAATCCAGCCAATCCATTA
ATTGAACATCCATCGGATTGTCATAAATATCTTGAATGTCTCTTAAGTCCAGCTGATGGT
GTTTATAAATACGTTGAAAAATCATGTGGACTTGAACTCATGTATAATCCTTCAAAACAT
GATTGTGATTGGATTGAAGATGTCATAGCTGCTAAACCACAGTGTCAAAAGATTGAAATT
CTAGGTCAAATAACAATGCCACCTGTAATTACAACAACAACACCAAAATCAGGATTAGTT
AAGGATAGAAAACCAGGAAAATTAACACCAAAACCAAAAGTAACAACTTTAGTACCATTG
GTTGTTTTGCCATCAACAGTTACACCACCTATGCAATGTGATGAAAGTAAATTTGTTCCA
CTTTTGCCTCTTGTGCCAGACACAGCTTTTTCATCAAGCAGCAATTTGAATGAAGCATTT
AAACCTGAAGCATCGAGATTAGAATCAAGACCGACTGAAAAAGCTATGGGAAGTTGGTCA
CCAGCAACTAATGATAAAAATCAATTCTTACAAGTTACATTCCCTAAAGCAATTCCAATT
TATGGAATAATTGTTAGAGGAAGTCCATTATTAAGTCAATATGTGACAAGTTTCAAAATT
CTCTACAGTCTTAATGATGAAGTTTTTCATGTCATTGATGATCATCATGGAAATCCACAA
ATTTTCAGTGGTTCAGTTGATCAAAATACGCCAGTTAAAAATATTTTTTCAACCCCAGTA
GAAGCTAAATTTATAAGAATTTATCCATTATCATGGTATCAAGGAATTGCACTTCGAATT
GAACTTTTAGGATGTCAACGAGATCCATCGAAGCCAATTTCATTTGAATATCCTGTGACA
ACACCAAAATCTGTCGAAACAGTAACACCAGCTATAAAAATTGTCCCAACTGTTGTCGTT
CCTGTAACACCACCATCTGTTTATACAATTGCTCCAGTGATTCCACTTTGTGATGACCCG
TTAGGTGTTGAAAATAGCAAATTGAGTCCGCAACAGATCAAGTTTAGTTCTATCAAGGAT
GCAGGTTCAGTAAAAACTAAAATACGACGAAATCCACTTGAAATTATTAAATTGTCATCT
GCACGTGGATGGATGCCTTTAGCTGATAATGTTAATGAATATGTCATGTTTGATTTCATC
GAAAATCGAAATCTGACTGGTGTTATTACAAAAGGTGGTGAATATGGTTGGGTTAAAAGT
TTCATCATACTCTATTCAAAAGATGATGTCATATGGAATAAATTACTCGATCAAAATGGA
CAACCAAAGCAATTTCTCGCTAACATTGATGCTGAAAGTGTCAAGAAAAATTACTTCCCA
AATCCAATTAATGCGAGATATCTCAAAATTCAGCCTGTAAAATGGCATGGAGCGATTGAG
TTGAAACTTGAACCGATAGGATGTTATTTGCCTTATCCACCGCCCGATCCATCAATGACA
ACAGTGAAATATCCTGAAGAAATTGAAATTGTCACACCGAGCAACTGTGGTTTATGTAAA
GGCGCTTTATTGCGAACGCCCTCAAATAATCGTGAAACATGTCTATGCTATCCGCCACTT
TACTATAATGGCGATGAATGTGTTCCGCAAAGTGACTGTCCATGCTTCGAAGGACACATT
AGTTATCCTGTTGGCGAGACTTATCTTACGGAAACATGTGCTGAGTGCATATGTCAAATT
GGCGGCATTCCGCAATGCACACAGAAAACATGCAAACCTTGTGCAAAAGGATTAAGAAGA
ATTGCGCCTGGAACATGCGATTGCAAATGTGAAAAATGTCCACCTGAAAGTGTAATTTGT
CAGACAAGTGGTGAATGCATTCCTGAATCATCATGGTGTGATGGTGTTCAAGATTGTCCA
GATGATGAAATTAATTGCTTAATCACTGAAGCTCCTCATATTAGCATTAATAAAACGGAA
ATAATAACAATTACAAAGAAATGTCCAGATCCATCTTGTCCACCTGGATTTATGATCAAA
CCAAAGCAATCACGCAAATCAAAAATGATGAGTTCACGCTTCTCTGATGATGATGATGAA
AATGATAAGCCACAACATCAATATTACTACATCAAAGCAAAATATATGGCAAGTTATGAA
GCAATTTTGCCAATTTCAACAACAAAAGATAAACAGGATATTAATTCTGAAGAATGCTTT
GAATTTATGTGCATTCCAATAGTAGAAACGCCTGAAGGACCAGACTATGCAGCTGGAAAT
CAAACTAAACTTATTAGATGTCCGGAACCGAAATGTCCTCGAGGCTACATGTTGAGACTT
CAAATTCAAAAGTCACCTAATGAATGTGCAAAATTCGCATGTGATCCAGTAGTAGAAAAT
GATGCTGTATGCAACGTAACCGGCCGCACATTCAATACATTTGATTCGACAGAATTTAAA
TATGACATTTGTAGTCATCTTCTTGCGCGCGACGTGTCAGATGCGAAGTGGAACGTGATA
ATGAGAAAGAATTGCTCAACTGGAAATAATGTTTGCACCAAAGAGATTGAAATTAAGGAT
AAGGTTGCAAAATATACTCTTATTCTCTATCCTACACTAACAGTGAATCTCGATGGCTAT
TACTTTACAATTCAGCAATTGCAAAATTCAAACCGTAAAATGTCATTTGTGGTATCAAAG
AATGGTGACAGTCTTCTTTATGTCTCACATACACACGGCATTTGGGTGACACTTGATCAA
TATGGTGATATAAAACTTGGTATTTCTTCACAGTATGTCAATCAAGTCGATGGTCTTTGT
GGTTTCTATAATAATGATAAAAAAGATGACAAACGTACACCGAGTGGCCAACTTGCAACA
TCGACTGTTGATTTTGGTGATAGCTGGTCAGTTAGTAATGGCACTGCTGAAGATTGCATT
CCACATTCATGTCCAAAAGCAGTTCAAGAAGTTGCAATGAAAATGTGCGAACTTGTCAAT
CATGAAATTTTTACGCCATGCAAAAAAACAATCAATCCAACTCATTTTATATCAACATGT
TTAGAGACAGCCTGTGATTGTTTATTAACAGCCACAAATGGCACAACTGCGACTAAAGAT
TTAGAAAAACATGCAAAACAGTGCAAATGTTCAATGCTAAAGAATTACGTGGTCGAATGC
ATGGCAGCTGATGAAAACGTGCATCTAGAAACGTGGCGCTCTGTTCATTCATGTGAAGCA
ACTTGTGTTGCCCCACTTGTACACCAAGACTGCTATCGACGAAGTTGTGAGACAACATGC
AATAATATACAATCAAATGAGTGTGCAAACGTGCCTGGAACTTGTTTCTCTGGTTGCTTT
TGTCCACCAGGAACTGTTAAGAAAGACCAAACATGTGTGCCAATAAGTGAATGCCGTGAT
TGTGTTTGTGATGGTTTTGGAAAGTCTCAATATATCACATATGACCGTAAAAACTTTACA
TTCGATGGAAATTGCACATACTTGCTATCACGTGATATTTCATTGAAAAATGTTTACACT
TTCCAAGTTTATGCAACGATTGGTCCATGTGATACAAAAGCAAATGCAATTGCTCAGAAA
AAAGTAACATGTACTCAAGCACTTCATATTGCTTACAATAGTCACATTGTTCATATTCAA
AAAGATGCAATGAAAAACTTAGAAGTTATTATTGATGGATTCAAAATTGCAAACAAGCCA
TACATTCAAAATTGGATCAAAATCACAGAACAAGGAAAAACATTGAATATTATTTTGCCA
GAATCACAGATTGAACTTGTTACGATGTTTGAAGCAATGTCATTCAGTATAAAAGTACCA
AGTGTAAAATATGGAAATAAAATGGAAGGACTTTGTGGCAATTGCAATGGTAATCCTGAG
GATGATTTGACTAAAAATCCAGCTGCCACAAATCTCCCGCTCACAAACACTCCATTGCAA
AACTTTGCACTCAGTTGGCTTGCTAATGAACCAAAACTTTTGCTCAATGAACAAAAAGAT
AAATGTTATGTTGATGAGGAAAGTGAATGTCTGCCACTAGCACCTGAAACTGATCCTTGC
TTTAAGATTCTCGATGAGGAAATTTTTGGAAAATGTCACTTTGTGGTTGAACCTTTGATG
TATGTAACAGCATGCCAGCAAGATCTTTGTAGAACGGGACCAACACAAAAAGGCTCATGT
GATTCTATTGCTGCTTATGCTCGAGAATGTGCTCGTAATGGTGTCTGTGTTGATTGGCGT
AAAAATGGTTTCTGTCAAATGGATTGTGCTGCTCCTTATGTTTATACACCTTGTGGTTGT
GCTGAAACATGTCAATCGATTGAAGCAAGAGATAAATTGATAAAATCATCGTCCGCAATA
AAAGATCCAAAAACAATTGAAAAGATGAAATCAATTTGTAATGCTGGTTTGAGTGAGGGC
TGTTTCTGTCCAAAAGGAACCGTTTTGCATAATGGCAAATGTCTACGTGAAATTGAATGT
CGTGCATGCGATGATAAAGGTCATTTACCAGGTGACGTATGGCATTTGGATGTGTGTACT
AAATGTACATGCCGCAATGATTCAACAATTCAATGCCAAAAGATTCAATGTCCAACACAA
GCAATATGTGGCATTGGTTTTACAGCAGTCGAGTCATCACCGACAGGAGAATGTTGCAAG
AAATATGCTTGCATACCAGAAGAAAAGCCTATACAACGAACATTATGTCCTCAATTAGTT
TTACCAAAATGTGGTCTCGATCAGACGAATAAAATAATAAATGGTACCGATGGATGCTCG
AAATATATTTGCGACTGCATACCAAAGGCTCAATGTAAGCCGGTACAAAACTTTACACAA
TTCGAGCCGGGTTTTAAGGCAATAACTGACACAACAGGATGCTGTCCCATATCGAAGCTT
GTTTGTGATAAATCACTGTGCCCGCCCAAACCAACTCAATGTACGGAAGCGTTTTACGTT
CTCGAGAAGACAAAGAGTGCTGTTGATAAAATTTGTTGCGATATCTACGAGTGCCGTCCA
CCGCAAAATAATTGCATTGCAACAATCAATGGCAAGAAGATTTTGAAAAAGATGGACGAG
TTGTGGCAGACAGAGGATGTTTGTGAGAAGGCACAATGTGCCTTTGACACAAATGGAAAA
CCGATAATTAAAACACAGCGTGAAGTGTGTAATACAATATGCCAAGCGGGCTACGAGCTT
AAACCAATCGTTGGCAAATGCTGTGGTGAATGCGTCAAAACGAAATGCATAGCGGATAAC
AAACTTTATGAGATAGGCCAAACGTGGTTCAGTGACGATAATTGCACGACATTCGAGTGC
AATATCAAAGACGGACAGATGCTAATCACATCAATGATGCCAACATGCCCTGATATTTCA
TCATGTCCACCACTACTTCGATACAAGGACGGCTGCTGTGAAAAATGCAAAATGGAATCG
CTGTCACAGAAAAATTGTTTACCCGAGTCGCTTTCTGAAAGTCAAACTGTTGGTTTGATT
CAATTGCAACTGCCACCTCACGGAAATTGCAAAAATGTTAACGGCATCAGAGGGATCACT
CAATGTACTGGTACATGCAAGAGTGGTACCAAATTTGATCCATTGACGTTTGATCAATTG
AAGACTTGTGAATGTTGCTCAGTAAGTGGAGTTCGTGAACTTCCAGTTGAACTTCTTTGT
GATGATAAATACAAATTTACAAAGAACTTTAATGTTCCCTCATCATGTTCATGCACGAAA
TGCGGCACTGATGAAAAAATACAAAAACTAAGAGCTGCTTTTTAA

>g5301.t1 Gene=g5301 Length=3814
MFTTKKEIPIPVQIAGLKVTRSGLDVRILLESIPLTITWDTKKFVQIDASASIFNRTAGI
CGTMDGIMSNDFMSKDGEFHKMPATFVDSWRALNMDKSKEKCIPRSQKLAAKGCETAVEE
KANLVCTDLLKNPKMDNCFKMFNMEVLMKNCISDYCDCKNLYERTECICNGIAVLAKDCR
FRGVMLEDGWRDWQICPLNCTNGRVYKSCGPINEASCGSAIETDSFNCREGCFCPDGMLL
NDGICVKKEMCPCKLKGKIFQPNSKIKRDCNTCVCEEGNWKCSSLTCGSRCSSVGDPHYS
TFDGKRYDFMGKCSYYLMKTENIIVEAENVACSGAISESMNFLPSFTTELPSCTKSLTIK
YNDNTGQQRIIKLKQGGFVLVDGFEVSKLPRDLSDGAVVIRQASSSFILVDFNDGVQVWW
DGLTRAYIDAPASYRGKTQGLCGTFNSNLQDDFLTPEGDVETTVTPFANKWQTKESCDQI
SDKIVPHPCQTNPENKPKAEKICQKLLKDKIFEECHLFVDPEPFYEDCLYDMCACKGEVS
QCACPILASYATECARQGTILNWRYQVNECVINCPNGQVFEQCGEACSRTCSDLQREAPC
KLQCVEGCRCPLGQVLDENNECVAVSMCKCIYKGLEFKPNYKEVRPGQKFLQLCTCMNGK
WNCVEAKEGDSVKYPPAEDISKKCSSVNNEVFTTCEPAEPLTCKNMHMNVTSTTAVCRAG
CKCKEGFVLDTLLKKCVLPESCSCHHGGRSYNDGEKIKEDCNTCLCKGGKWSCTTKDCAA
TCSLWGDNHFTTFDGREFDFQGVCSYVLAKGKILNNDGFSVSIQNVLCGSNGVTCSKTLT
INLVGNEPESITLSSDSPVAGIKGLSNEAKPMKKMLTYRSGVFVVVEMPHIGVLLKWDRG
TRVYLRLENRWKGKVQGLCGNYNYDALDDFINPSGGIESNPIIFGHSWKLDDSCAKPSEQ
VDSCSLNPQRKTWAQLKCGLLKSSNFAQCHSEVSVESFYKRCIHDTCSCDQGGDCECLCT
ALAAYAYACTASGVNIRWRTPDLCPMQCDPECSNYNPCIQSCPVETCDNMMHPLKNERLC
SSDNCVEGCKLKECPIGSVYKNDSFLECVPKSICKPICLQQDGITYYEGDVMASDACHTC
KCTRGSKVCVGIPCATESPPLIGDLFGCETGWTAWINQDSLDNLQSKIPSKTNYFKEDDI
EPLPSTLHFKNIGGKILCKPEFYAAIECRTVFGHNHPKSTGQNVECSLERGLVCKGQCFD
YEIRVFCDCGEATTKLTGILGPTMPPVKIISPLPTTLPPVKIVPTKSPIITPSICDPTIP
NVEHPLDCTKYLQCVMANNGSYVYVEKTCGASMMFNPTSMTCDWPANVIAKKPKCGKPEE
PMQAECPMGYVWSECAVPCRRACNYYGKLLTMNGNCTTASNDCIPGCMPQESAVKCEYPR
LWRDWKSCVEITDCTCMGPNNEQLKPGQVVKVSDCKTCQCIRNEYICTDTPCATTAKPIK
IISTVKPIPEVEVLITQQKTTPGPQIPQTTKKSVIIIEQEVIERPYYLTICDPSIPHVEH
PNSCYKFLHCMPASNGSYVYAVKTCYPDMMFNPISMICDWPINVRHVKPQCADNPGEIEI
WEETEIKTIIKQTTPSTKRRLTFPPIDETDYAEGKLPTYIRICDPTIPMIPHPKSCYKYL
HCVQAPNGSFVYTEKTCFPDMMYNPNAYNCDWLDNVFKIKPECNENPGETELWTEIIKKY
TRIISPTTPAFGTSPKPSFITTIASIQTSPKPVKIIEEIPYYIRICDPSVPHVEHPNSCY
KFLHCVPSSNGSYIYAVKTCYPATMFNPVTMICDWPDSVKKVKPQCGTDPGEIEIWEEEV
IKKKTIIKPVTTQKPSMSIAPTIPLYIEQHVTECNPANPLIEHPSDCHKYLECLLSPADG
VYKYVEKSCGLELMYNPSKHDCDWIEDVIAAKPQCQKIEILGQITMPPVITTTTPKSGLV
KDRKPGKLTPKPKVTTLVPLVVLPSTVTPPMQCDESKFVPLLPLVPDTAFSSSSNLNEAF
KPEASRLESRPTEKAMGSWSPATNDKNQFLQVTFPKAIPIYGIIVRGSPLLSQYVTSFKI
LYSLNDEVFHVIDDHHGNPQIFSGSVDQNTPVKNIFSTPVEAKFIRIYPLSWYQGIALRI
ELLGCQRDPSKPISFEYPVTTPKSVETVTPAIKIVPTVVVPVTPPSVYTIAPVIPLCDDP
LGVENSKLSPQQIKFSSIKDAGSVKTKIRRNPLEIIKLSSARGWMPLADNVNEYVMFDFI
ENRNLTGVITKGGEYGWVKSFIILYSKDDVIWNKLLDQNGQPKQFLANIDAESVKKNYFP
NPINARYLKIQPVKWHGAIELKLEPIGCYLPYPPPDPSMTTVKYPEEIEIVTPSNCGLCK
GALLRTPSNNRETCLCYPPLYYNGDECVPQSDCPCFEGHISYPVGETYLTETCAECICQI
GGIPQCTQKTCKPCAKGLRRIAPGTCDCKCEKCPPESVICQTSGECIPESSWCDGVQDCP
DDEINCLITEAPHISINKTEIITITKKCPDPSCPPGFMIKPKQSRKSKMMSSRFSDDDDE
NDKPQHQYYYIKAKYMASYEAILPISTTKDKQDINSEECFEFMCIPIVETPEGPDYAAGN
QTKLIRCPEPKCPRGYMLRLQIQKSPNECAKFACDPVVENDAVCNVTGRTFNTFDSTEFK
YDICSHLLARDVSDAKWNVIMRKNCSTGNNVCTKEIEIKDKVAKYTLILYPTLTVNLDGY
YFTIQQLQNSNRKMSFVVSKNGDSLLYVSHTHGIWVTLDQYGDIKLGISSQYVNQVDGLC
GFYNNDKKDDKRTPSGQLATSTVDFGDSWSVSNGTAEDCIPHSCPKAVQEVAMKMCELVN
HEIFTPCKKTINPTHFISTCLETACDCLLTATNGTTATKDLEKHAKQCKCSMLKNYVVEC
MAADENVHLETWRSVHSCEATCVAPLVHQDCYRRSCETTCNNIQSNECANVPGTCFSGCF
CPPGTVKKDQTCVPISECRDCVCDGFGKSQYITYDRKNFTFDGNCTYLLSRDISLKNVYT
FQVYATIGPCDTKANAIAQKKVTCTQALHIAYNSHIVHIQKDAMKNLEVIIDGFKIANKP
YIQNWIKITEQGKTLNIILPESQIELVTMFEAMSFSIKVPSVKYGNKMEGLCGNCNGNPE
DDLTKNPAATNLPLTNTPLQNFALSWLANEPKLLLNEQKDKCYVDEESECLPLAPETDPC
FKILDEEIFGKCHFVVEPLMYVTACQQDLCRTGPTQKGSCDSIAAYARECARNGVCVDWR
KNGFCQMDCAAPYVYTPCGCAETCQSIEARDKLIKSSSAIKDPKTIEKMKSICNAGLSEG
CFCPKGTVLHNGKCLREIECRACDDKGHLPGDVWHLDVCTKCTCRNDSTIQCQKIQCPTQ
AICGIGFTAVESSPTGECCKKYACIPEEKPIQRTLCPQLVLPKCGLDQTNKIINGTDGCS
KYICDCIPKAQCKPVQNFTQFEPGFKAITDTTGCCPISKLVCDKSLCPPKPTQCTEAFYV
LEKTKSAVDKICCDIYECRPPQNNCIATINGKKILKKMDELWQTEDVCEKAQCAFDTNGK
PIIKTQREVCNTICQAGYELKPIVGKCCGECVKTKCIADNKLYEIGQTWFSDDNCTTFEC
NIKDGQMLITSMMPTCPDISSCPPLLRYKDGCCEKCKMESLSQKNCLPESLSESQTVGLI
QLQLPPHGNCKNVNGIRGITQCTGTCKSGTKFDPLTFDQLKTCECCSVSGVRELPVELLC
DDKYKFTKNFNVPSSCSCTKCGTDEKIQKLRAAF

Protein features from InterProScan

Transcript Database ID Name Start End E.value
63 g5301.t1 CDD cd00057 FA58C 2028 2164 2.89553E-29
62 g5301.t1 CDD cd00057 FA58C 2220 2367 1.00269E-24
61 g5301.t1 CDD cd00112 LDLa 2493 2526 6.49295E-5
54 g5301.t1 Gene3D G3DSA:2.10.25.10 Laminin 191 288 6.6E-19
53 g5301.t1 Gene3D G3DSA:2.10.25.10 Laminin 563 667 1.8E-16
55 g5301.t1 Gene3D G3DSA:2.10.25.10 Laminin 676 779 2.2E-17
49 g5301.t1 Gene3D G3DSA:2.10.25.10 Laminin 1036 1155 1.4E-8
57 g5301.t1 Gene3D G3DSA:2.170.140.10 Antimicrobial Protein 1310 1374 4.1E-7
51 g5301.t1 Gene3D G3DSA:2.10.25.10 Laminin 1375 1493 2.0E-7
56 g5301.t1 Gene3D G3DSA:2.170.140.10 Antimicrobial Protein 1782 1847 1.6E-5
58 g5301.t1 Gene3D G3DSA:2.60.120.260 - 2022 2166 2.8E-38
59 g5301.t1 Gene3D G3DSA:2.60.120.260 - 2215 2371 2.9E-33
48 g5301.t1 Gene3D G3DSA:2.10.25.10 Laminin 2383 2472 1.1E-6
60 g5301.t1 Gene3D G3DSA:4.10.400.10 - 2492 2523 1.4E-5
50 g5301.t1 Gene3D G3DSA:2.10.25.10 Laminin 2951 3060 1.9E-8
52 g5301.t1 Gene3D G3DSA:2.10.25.10 Laminin 3295 3417 2.6E-8
94 g5301.t1 MobiDBLite mobidb-lite consensus disorder prediction 2561 2581 -
26 g5301.t1 PANTHER PTHR11339:SF383 HEMOLECTIN, ISOFORM A 35 184 0.0
33 g5301.t1 PANTHER PTHR11339 EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED 35 184 0.0
25 g5301.t1 PANTHER PTHR11339:SF383 HEMOLECTIN, ISOFORM A 164 666 0.0
32 g5301.t1 PANTHER PTHR11339 EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED 164 666 0.0
24 g5301.t1 PANTHER PTHR11339:SF383 HEMOLECTIN, ISOFORM A 683 1007 0.0
31 g5301.t1 PANTHER PTHR11339 EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED 683 1007 0.0
21 g5301.t1 PANTHER PTHR11339:SF383 HEMOLECTIN, ISOFORM A 995 1277 0.0
28 g5301.t1 PANTHER PTHR11339 EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED 995 1277 0.0
22 g5301.t1 PANTHER PTHR11339:SF383 HEMOLECTIN, ISOFORM A 2681 3029 0.0
29 g5301.t1 PANTHER PTHR11339 EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED 2681 3029 0.0
23 g5301.t1 PANTHER PTHR11339:SF383 HEMOLECTIN, ISOFORM A 3022 3423 0.0
30 g5301.t1 PANTHER PTHR11339 EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED 3022 3423 0.0
20 g5301.t1 PANTHER PTHR11339:SF383 HEMOLECTIN, ISOFORM A 3591 3799 0.0
27 g5301.t1 PANTHER PTHR11339 EXTRACELLULAR MATRIX GLYCOPROTEIN RELATED 3591 3799 0.0
17 g5301.t1 Pfam PF00094 von Willebrand factor type D domain 5 77 1.3E-8
2 g5301.t1 Pfam PF08742 C8 domain 126 196 2.0E-6
14 g5301.t1 Pfam PF01826 Trypsin Inhibitor like cysteine rich domain 200 251 1.0E-5
18 g5301.t1 Pfam PF00094 von Willebrand factor type D domain 291 458 5.4E-26
1 g5301.t1 Pfam PF08742 C8 domain 503 568 5.3E-18
13 g5301.t1 Pfam PF01826 Trypsin Inhibitor like cysteine rich domain 574 628 2.4E-11
12 g5301.t1 Pfam PF01826 Trypsin Inhibitor like cysteine rich domain 688 742 5.8E-7
15 g5301.t1 Pfam PF00094 von Willebrand factor type D domain 782 935 1.4E-27
4 g5301.t1 Pfam PF08742 C8 domain 977 1044 2.2E-18
8 g5301.t1 Pfam PF01607 Chitin binding Peritrophin-A domain 1322 1367 4.0E-8
6 g5301.t1 Pfam PF01607 Chitin binding Peritrophin-A domain 1793 1839 3.3E-5
7 g5301.t1 Pfam PF01607 Chitin binding Peritrophin-A domain 1897 1952 1.2E-5
9 g5301.t1 Pfam PF00754 F5/8 type C domain 2031 2162 7.7E-19
10 g5301.t1 Pfam PF00754 F5/8 type C domain 2261 2359 3.6E-15
16 g5301.t1 Pfam PF00094 von Willebrand factor type D domain 2684 2837 2.1E-17
5 g5301.t1 Pfam PF08742 C8 domain 2875 2958 8.5E-9
11 g5301.t1 Pfam PF01826 Trypsin Inhibitor like cysteine rich domain 2962 3018 2.6E-6
19 g5301.t1 Pfam PF00094 von Willebrand factor type D domain 3023 3185 1.4E-23
3 g5301.t1 Pfam PF08742 C8 domain 3240 3305 1.0E-15
64 g5301.t1 ProSitePatterns PS01286 Coagulation factors 5/8 type C domain (FA58C) signature 2. 2149 2165 -
65 g5301.t1 ProSitePatterns PS01285 Coagulation factors 5/8 type C domain (FA58C) signature 1. 2263 2292 -
66 g5301.t1 ProSitePatterns PS01208 VWFC domain signature. 3399 3444 -
97 g5301.t1 ProSiteProfiles PS51233 VWFD domain profile. 1 129 17.688
99 g5301.t1 ProSiteProfiles PS51233 VWFD domain profile. 290 518 44.807
96 g5301.t1 ProSiteProfiles PS51233 VWFD domain profile. 781 992 42.379
104 g5301.t1 ProSiteProfiles PS50940 Chitin-binding type-2 domain profile. 1312 1377 10.97
101 g5301.t1 ProSiteProfiles PS50940 Chitin-binding type-2 domain profile. 1548 1613 9.803
102 g5301.t1 ProSiteProfiles PS50940 Chitin-binding type-2 domain profile. 1660 1725 9.917
103 g5301.t1 ProSiteProfiles PS50940 Chitin-binding type-2 domain profile. 1783 1848 9.968
100 g5301.t1 ProSiteProfiles PS50940 Chitin-binding type-2 domain profile. 1891 1957 9.917
106 g5301.t1 ProSiteProfiles PS50022 Coagulation factors 5/8 type C domain (FA58C) profile. 2013 2165 31.031
105 g5301.t1 ProSiteProfiles PS50022 Coagulation factors 5/8 type C domain (FA58C) profile. 2217 2368 27.945
109 g5301.t1 ProSiteProfiles PS50068 LDL-receptor class A (LDLRA) domain profile. 2492 2523 8.913
98 g5301.t1 ProSiteProfiles PS51233 VWFD domain profile. 2682 2890 28.04
95 g5301.t1 ProSiteProfiles PS51233 VWFD domain profile. 3022 3255 28.364
107 g5301.t1 ProSiteProfiles PS50184 VWFC domain profile. 3372 3445 10.754
108 g5301.t1 ProSiteProfiles PS01225 C-terminal cystine knot domain profile. 3706 3802 10.226
74 g5301.t1 SMART SM00832 c8_a 119 197 0.013
92 g5301.t1 SMART SM00215 vwc2 253 321 1.6E-6
70 g5301.t1 SMART SM00216 VWD_2 280 458 6.6E-36
73 g5301.t1 SMART SM00832 c8_a 496 571 2.3E-23
79 g5301.t1 SMART SM00214 vwc 744 804 0.065
91 g5301.t1 SMART SM00215 vwc2 744 812 0.0037
68 g5301.t1 SMART SM00216 VWD_2 771 935 4.1E-39
76 g5301.t1 SMART SM00832 c8_a 971 1045 8.5E-34
83 g5301.t1 SMART SM00214 vwc 1118 1168 0.97
88 g5301.t1 SMART SM00494 chi_10 1313 1377 1.2E-10
81 g5301.t1 SMART SM00214 vwc 1456 1510 6.1
87 g5301.t1 SMART SM00494 chi_10 1549 1613 1.6E-4
90 g5301.t1 SMART SM00494 chi_10 1661 1725 5.9E-6
86 g5301.t1 SMART SM00494 chi_10 1784 1848 9.4E-5
89 g5301.t1 SMART SM00494 chi_10 1892 1957 2.8E-5
85 g5301.t1 SMART SM00231 disc_4 2012 2165 8.4E-9
84 g5301.t1 SMART SM00231 disc_4 2216 2368 8.2E-5
82 g5301.t1 SMART SM00214 vwc 2435 2493 3.3
71 g5301.t1 SMART SM00192 LDLa_2 2492 2528 2.6E-4
67 g5301.t1 SMART SM00216 VWD_2 2672 2836 2.7E-24
77 g5301.t1 SMART SM00832 c8_a 2869 2959 0.011
69 g5301.t1 SMART SM00216 VWD_2 3011 3188 2.2E-23
75 g5301.t1 SMART SM00832 c8_a 3233 3306 7.7E-18
78 g5301.t1 SMART SM00214 vwc 3380 3444 1.7E-8
93 g5301.t1 SMART SM00215 vwc2 3383 3439 0.59
80 g5301.t1 SMART SM00214 vwc 3636 3696 2.5
72 g5301.t1 SMART SM00041 CT_3 3718 3802 9.5E-5
39 g5301.t1 SUPERFAMILY SSF57567 Serine protease inhibitors 195 253 3.76E-9
38 g5301.t1 SUPERFAMILY SSF57567 Serine protease inhibitors 570 630 7.36E-10
37 g5301.t1 SUPERFAMILY SSF57567 Serine protease inhibitors 688 744 1.18E-7
42 g5301.t1 SUPERFAMILY SSF57603 FnI-like domain 737 782 9.0E-7
47 g5301.t1 SUPERFAMILY SSF57625 Invertebrate chitin-binding proteins 1317 1368 1.46E-8
46 g5301.t1 SUPERFAMILY SSF57625 Invertebrate chitin-binding proteins 1557 1611 9.15E-6
45 g5301.t1 SUPERFAMILY SSF57625 Invertebrate chitin-binding proteins 1666 1716 4.71E-6
43 g5301.t1 SUPERFAMILY SSF57625 Invertebrate chitin-binding proteins 1790 1840 5.75E-6
44 g5301.t1 SUPERFAMILY SSF57625 Invertebrate chitin-binding proteins 1893 1948 7.06E-7
35 g5301.t1 SUPERFAMILY SSF49785 Galactose-binding domain-like 2024 2166 6.8E-33
34 g5301.t1 SUPERFAMILY SSF49785 Galactose-binding domain-like 2216 2370 1.19E-29
40 g5301.t1 SUPERFAMILY SSF57424 LDL receptor-like module 2492 2529 1.68E-5
36 g5301.t1 SUPERFAMILY SSF57567 Serine protease inhibitors 2958 3018 6.38E-7
41 g5301.t1 SUPERFAMILY SSF57603 FnI-like domain 3625 3697 1.46E-6

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

Data is missing for g5301/g5301.t1; file /home/yuki.yoshida/nias/analysis/reanalysis/18_revice/midgebase/iupred3/g5301.t1.fa.iupred3.txt does not exist

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0008061 chitin binding MF
GO:0005515 protein binding MF
GO:0005576 extracellular region CC

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values