Gene loci information

Transcript annotation

  • This transcript has been annotated as histone-arginine N-methyltransferase.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g9601 g9601.t1 TTS g9601.t1 4233779 4233779
chr_1 g9601 g9601.t1 isoform g9601.t1 4234369 4240562
chr_1 g9601 g9601.t1 exon g9601.t1.exon1 4234369 4234667
chr_1 g9601 g9601.t1 cds g9601.t1.CDS1 4234369 4234667
chr_1 g9601 g9601.t1 exon g9601.t1.exon2 4236065 4236238
chr_1 g9601 g9601.t1 cds g9601.t1.CDS2 4236065 4236238
chr_1 g9601 g9601.t1 exon g9601.t1.exon3 4236300 4239389
chr_1 g9601 g9601.t1 cds g9601.t1.CDS3 4236300 4239389
chr_1 g9601 g9601.t1 exon g9601.t1.exon4 4239462 4239556
chr_1 g9601 g9601.t1 cds g9601.t1.CDS4 4239462 4239556
chr_1 g9601 g9601.t1 exon g9601.t1.exon5 4239622 4240562
chr_1 g9601 g9601.t1 cds g9601.t1.CDS5 4239622 4240562
chr_1 g9601 g9601.t1 TSS g9601.t1 4241138 4241138

Sequences

>g9601.t1 Gene=g9601 Length=4599
ATGGCTTTTCACAATCCGGAAATATTTAAAGATTTAAAATTACACTCTCCATCAGACGCT
GAGCCATTTGTATACAAATGGCCTCTATCAGTCGGAATAGGACCAGATAAGCATGACAGT
GGTCTTGAAATAATTGAAACGGTTCGATTGGTATGTGAAGACATTCCTGAAATAAAATCA
TCATTAGAGGACATTTCATTAACCGAACTGGATACAAATGACTATGATACAATGAAAAAT
TTCTGTGACCGTTTTAATAAGGCTATTGATAGTATTAAATCTCTTGAGAAAGGCACTTCA
CTGCCCGTTAGACGACATACACTGCCCTCGAGAAATATGCTAAGACATATTGTTCAACAA
GTCTATAATTACGCTGTTTCAGAGCCAGAGAAGCTTAATCAATACGAGCCATTCTCACCA
GAAGTCTATGGTGAAACCAGTTTCGATTTCATTAGTCAAATGGTCGACGAAATAAATATA
ACGGAAGATGATGTTTTTATTGATCTTGGCAGTGGCGTGGGACAAGTAGTTTTGCAAGTT
GCCGCGTCTACTCGATGCAAAATTTGCATTGGCATTGAAAAAGCAGATGTTCCATCTAAA
TATGCTGAAAATATGAATGGAATGTTTAAAACTTGGATGCGATGGTTTGGAAAAAAATAT
GGTGACTATCAGTTGATTAAAGGTGATTTTTTAGCTGATGAACATCGCGAAAAAATAATG
TCTGCCTCAGTGGTATTTGTAAATAATTTCGCTTTTGGCCCAAACGTTGATCATCAATTG
AAAGAAAGATTTGCAGACTTGAAAGATGGCGCAAAAATTGTTTCATCCAAAAGTTTTTGT
CCATTGAATTTTCGTATCACTGATCGAAATCTTAGTGATATTGGAACGATAATGCATGTA
AAAGAAATGGCACCACTTAAGGGATCTGTTTCATGGACTGGTAAACCCGTTTCATATTAT
CTTCACACCATCGATCGAACGAAACTTGAGAATTACTTTCAAGGTCTTAAAACAAAGGGC
AATGGAAACGGAGACGGTAATGGCAAAAATCGTCGAAGTCGTGATTACAATAAGCAAACG
AGTGGTTCAACGTCAGAAAGTGATGATGATGATGCTGATTATACGGGAACGACAACGCGA
AAAGCTTGGTCTGACTTTGTGAAAGCAAGAAGCAGTCAATCTGAAGATGAGTCTATTAAT
GATAAAAAGAAGAAGCCACAACTAGCAAAAGCAAGAAGAGTGCCTGCACAGAAGAAAGTA
GCAAAGGCGAATGCAGCTGGTGGTAGAGTGAAAAAGCCAAAACATAAACGTCAACTGAAA
ATTGCTGGACTTGATTTATTACATTCTCAAACAATTTTCAGTACTTCAGATCAAGCTATT
GGAATTCGAATGCCACCAGCCGCTGGTTGCGTTGATGAACGTTTAACAAATTATGCTGGC
ACAATGATTCATGAGGAAGTCGATGAGCCATTAGTTCATAATTCGTCTGAAGTTCCATAT
GGACTTAAAATTCTTTTGGATGTTTATAAGACTCAATTTATGAATTTCTTAGAATCAATG
AAATCGTCAGCATTTAAAGAAAATCTACATCGACAAATTGAAAGCGAAAAAGAGAAAAAT
AAACGTTTATTGAACCGTACTGGACAGCTTGAAAAGCAAATAAAAGTTCTTGTTGAAGAC
AGTGTTGTGTTATTGAAGGCGAGAATGCAAGAACTTGGTATCAATACGAGCAGTCAAAAT
GATTTGCTTTGCAAGGCCAAAGAAATTGTTGGAAAACACAAAGAATTACAAATCATGGCC
AATAAAATTCAAGCACAAGTGAATACATTAGAAGAAGAGCATAATAATATTCTGGCAGGT
CACGTGAAGAAAATTGCAGAGAAACATTCAAAACAGTCAATGGATTTTGAATTGGCATCG
AAAGACTCACATGACTTGGTGCTTAAAGAAATCGAAAATACTTTTATTCAACGTAAAAAT
TTACGAAATAAAATTTCATCATTAGAGGCTGAATTGGCATTGATTGAGAAAGCGAGTGAA
GAGCGAAAGCAAACGTCAAATACTGGCAATAGTGCTCAGTCTAATATTTATGTTTCTGGA
AATATTTATAAGCAATCTAATAATCCTCCATCTAATGTTACAAATGCTACTGCACAGCAA
TCATCAACGTCTAAAACTTCAAAGAAAAATCGTGAGCATCGACCTAAAACTCATGAATGG
CCAGAAATACCAGACATTGGAAAAATTGAAGAAAAGAATCCAGAAATTCTAGCACAGAAA
ATACTCGAAACTGGTCGTCAGATTGAAGCAGGAAAGTTCCAAACTACAACAGTAAGTGCA
TCAAATCTTCCACCGAGCAAAATTGCAAAATCAGAAAGTGTGCAGCAGCCTTATACCAAA
AAACATCCTACAAGCGTTCCTCAGCAACTACCGCAAACACAACCGCAGCATCATTTACCA
CAGCATGCGATTATGTCAAATAAAAATGAGATAATGCAAATGACAGTCCATAAAGTGGGT
GGTAAGCCGCAGGCACATCAGGGTCATATTTCTTTACTACCAAAAGCCGGTGGCTCATCC
AAAAAAGTCGCAGAATCACATAAAGTTGTGAATTTCGAAGATAGATTAAAGAGTATTATC
ACTTCAGCATTGCAAGGTCAAGATCAAGCTGGTCAAAATCAAAAACAAAATTTACCACCA
CAACAATCATCTCAACAATCTCTTCCATCTCATATTACTCCGCAAACAAATGTCAGTCCG
AAGAAAATTCCACAAAGTCAAGCTGGACCTTACCATCATTCAGCTTATATTAGCTCAGCA
CATCCACAAGTTTCTCAACAAACAACATCACCTCAAATGTTGACAATAAGTACAAGTAGT
GGTCAACCGCAATCTCATCATCATTTGTCAACAATTTCACCTACACAATCATCGCCCATA
AAGAATCAGAGACAAATGGTTCCACTTCCACAAATGGCACCACATTTACCACCCGAGCAT
TCACGTGAATTGCAAATGCAAGAATCGACATATGCTCAACTTAAAAGAAATGACCATATG
CCAAATTTTGCTAAAGTCATGAATATTATGCATCATGATACTAGTAAAATGTATCAACGA
CATCCTCAAATGTCAATGATGCAGCAACATCAGCAAGGACAATTTGCAATGCAACGAGAG
CGAGAGCGTGAAGGTGCAATGATGTATCAACAGCGACAAATTGTTGAAGAGAAGCGTGAG
TTCAAAACTCCCGATAATGTGAGATGCCAAGATATGGGACGAAGCTCAGTGGGATCGATT
GAGAACGAGTACATAAATAGTGGTAGTAATAGTAGTAGCAATAAATCTCAAACACAACGA
TCAAGTTCTTCTTTATCACAACCAGATTATACTCAAGTATCACCAGCAAAACTTGCACTA
AGACGACATTTGTCACAAGAGAAGCTGGCACAACATCAAGTGGGACCATCACAATTGACC
ACAAAAACAATTGGTGATTTTATCAATAGTGAAATTGAAAAGACACTTGAAATAACACCG
CAATCGATTATAAATGCAGTCATTCAACATAATATGCCAAGTAGTTCACGAATAAATAAT
GAATCAATAATTGACTGTAGTATAAATCAGGATCGTGAACTTGATGACCATCATCATCCA
TATGCTACTCTAAAACCAACGCCTAGTAAGCAGCAGCAGCAACAACCAGGACCATATATG
CATGATCCTTATATAGATCATCATCGTACAAAAATGTTACCAGTTCAATCGAAATATGTT
TCACCAACATCTGCCAATGCGAGAAAAAGTGCAATATCTACGCCACCATTGCGACATGAT
GAAGCTATGAATTATATGAATGCTTCAAAATCACCAGAACGTTATCAAGAGTCGAGTTAC
AAGATAGACACAAAAATGTTCACATCATCTAATATTGGAATGTCAATGCGAAAGGAAGAA
GTTAAAGAGAAAATGTCAGTTGAACATGAACCTCCACTCGAAGGACTTGCAGCATCTTTG
CGACAACATGTTATTGCTTCGATGAAAATTAAGGAGGAATCCGAGAATGAACCAAAATAT
TCAAATTATCATCCAACAATACCATACCCTCACATCAAAAAAGAATCCACAAACCGTATG
AAACGAGCATCGCCAGTTATTCTTCAACGTCAACAGGGAGGAAATAGTGGTCCAATGTAC
AATGAAATGAATGAGGAAATGATGTATATGGGACCATCATCTTCAAGTAATAACGAAAGA
AATTTGACTCCTTTAAGTCATCTTCGTCGCACCGATGAAGATGATATGGGCGATGATGAT
GACATACAATGGCCAGAGGACTTTAAAGTACGAGTATCATCAGGCTTTGATCGTTTAGTT
GCATTTGCTGCTACTGAGCTGAATCGTCGTTCTGATGAAAATTGTGTCAGTCCACCGAAA
CGTGAAATGTCCTACGTTGATTATAGATCTTCCCATATGAAGAAAGAAGATACAAGAAAA
ATGAAAAACGAACGCGATTATCCTGCCGATCTTAGTGAAAGACGTATGCGAATGATGATT
GAAGAGAATCGAAAAAGTAGCAAACATCATTTGGATTAG

>g9601.t1 Gene=g9601 Length=1532
MAFHNPEIFKDLKLHSPSDAEPFVYKWPLSVGIGPDKHDSGLEIIETVRLVCEDIPEIKS
SLEDISLTELDTNDYDTMKNFCDRFNKAIDSIKSLEKGTSLPVRRHTLPSRNMLRHIVQQ
VYNYAVSEPEKLNQYEPFSPEVYGETSFDFISQMVDEINITEDDVFIDLGSGVGQVVLQV
AASTRCKICIGIEKADVPSKYAENMNGMFKTWMRWFGKKYGDYQLIKGDFLADEHREKIM
SASVVFVNNFAFGPNVDHQLKERFADLKDGAKIVSSKSFCPLNFRITDRNLSDIGTIMHV
KEMAPLKGSVSWTGKPVSYYLHTIDRTKLENYFQGLKTKGNGNGDGNGKNRRSRDYNKQT
SGSTSESDDDDADYTGTTTRKAWSDFVKARSSQSEDESINDKKKKPQLAKARRVPAQKKV
AKANAAGGRVKKPKHKRQLKIAGLDLLHSQTIFSTSDQAIGIRMPPAAGCVDERLTNYAG
TMIHEEVDEPLVHNSSEVPYGLKILLDVYKTQFMNFLESMKSSAFKENLHRQIESEKEKN
KRLLNRTGQLEKQIKVLVEDSVVLLKARMQELGINTSSQNDLLCKAKEIVGKHKELQIMA
NKIQAQVNTLEEEHNNILAGHVKKIAEKHSKQSMDFELASKDSHDLVLKEIENTFIQRKN
LRNKISSLEAELALIEKASEERKQTSNTGNSAQSNIYVSGNIYKQSNNPPSNVTNATAQQ
SSTSKTSKKNREHRPKTHEWPEIPDIGKIEEKNPEILAQKILETGRQIEAGKFQTTTVSA
SNLPPSKIAKSESVQQPYTKKHPTSVPQQLPQTQPQHHLPQHAIMSNKNEIMQMTVHKVG
GKPQAHQGHISLLPKAGGSSKKVAESHKVVNFEDRLKSIITSALQGQDQAGQNQKQNLPP
QQSSQQSLPSHITPQTNVSPKKIPQSQAGPYHHSAYISSAHPQVSQQTTSPQMLTISTSS
GQPQSHHHLSTISPTQSSPIKNQRQMVPLPQMAPHLPPEHSRELQMQESTYAQLKRNDHM
PNFAKVMNIMHHDTSKMYQRHPQMSMMQQHQQGQFAMQREREREGAMMYQQRQIVEEKRE
FKTPDNVRCQDMGRSSVGSIENEYINSGSNSSSNKSQTQRSSSSLSQPDYTQVSPAKLAL
RRHLSQEKLAQHQVGPSQLTTKTIGDFINSEIEKTLEITPQSIINAVIQHNMPSSSRINN
ESIIDCSINQDRELDDHHHPYATLKPTPSKQQQQQPGPYMHDPYIDHHRTKMLPVQSKYV
SPTSANARKSAISTPPLRHDEAMNYMNASKSPERYQESSYKIDTKMFTSSNIGMSMRKEE
VKEKMSVEHEPPLEGLAASLRQHVIASMKIKEESENEPKYSNYHPTIPYPHIKKESTNRM
KRASPVILQRQQGGNSGPMYNEMNEEMMYMGPSSSSNNERNLTPLSHLRRTDEDDMGDDD
DIQWPEDFKVRVSSGFDRLVAFAATELNRRSDENCVSPPKREMSYVDYRSSHMKKEDTRK
MKNERDYPADLSERRMRMMIEENRKSSKHHLD

Protein features from InterProScan

Transcript Database ID Name Start End E.value
10 g9601.t1 CDD cd02440 AdoMet_MTases 166 274 0.00102162
8 g9601.t1 Coils Coil Coil 526 553 -
7 g9601.t1 Coils Coil Coil 593 613 -
9 g9601.t1 Coils Coil Coil 651 688 -
5 g9601.t1 Gene3D G3DSA:1.10.260.60 - 10 131 1.5E-44
6 g9601.t1 Gene3D G3DSA:3.40.50.150 Vaccinia Virus protein VP39 132 350 4.9E-82
11 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 338 375 -
18 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 389 433 -
13 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 390 405 -
14 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 710 739 -
21 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 710 725 -
17 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 776 803 -
16 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 885 980 -
20 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 885 924 -
19 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 938 980 -
22 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 1075 1134 -
12 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 1089 1133 -
15 g9601.t1 MobiDBLite mobidb-lite consensus disorder prediction 1491 1512 -
2 g9601.t1 PANTHER PTHR21451 HISTONE H3 METHYLTRANSFERASE 33 600 2.3E-190
3 g9601.t1 PANTHER PTHR21451:SF0 HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-79 SPECIFIC 33 600 2.3E-190
1 g9601.t1 Pfam PF08123 Histone methylation protein DOT1 122 324 9.8E-82
23 g9601.t1 ProSiteProfiles PS51569 Histone-lysine N-methyltransferase DOT1 (EC 2.1.1.43) domain profile. 21 337 93.297
4 g9601.t1 SUPERFAMILY SSF53335 S-adenosyl-L-methionine-dependent methyltransferases 14 334 6.4E-75

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0051726 regulation of cell cycle BP
GO:0034729 histone H3-K79 methylation BP
GO:0031151 histone methyltransferase activity (H3-K79 specific) MF
GO:0018024 histone-lysine N-methyltransferase activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values