Gene loci information

Transcript annotation

  • This transcript has been annotated as histone-arginine N-methyltransferase.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_4 g16349 g16349.t1 TTS g16349.t1 9160697 9160697
chr_4 g16349 g16349.t1 isoform g16349.t1 9160758 9162876
chr_4 g16349 g16349.t1 exon g16349.t1.exon1 9160758 9161081
chr_4 g16349 g16349.t1 cds g16349.t1.CDS1 9160758 9161081
chr_4 g16349 g16349.t1 exon g16349.t1.exon2 9161177 9162686
chr_4 g16349 g16349.t1 cds g16349.t1.CDS2 9161177 9162686
chr_4 g16349 g16349.t1 exon g16349.t1.exon3 9162746 9162876
chr_4 g16349 g16349.t1 cds g16349.t1.CDS3 9162746 9162876
chr_4 g16349 g16349.t1 TSS g16349.t1 9162941 9162941

Sequences

>g16349.t1 Gene=g16349 Length=1965
ATGAGCACACCAAAAACTCCAAAAGTAAGTCTTGAATTTAAGAAAAAATTACGACAAGAA
TATGAAAGATTAAAGGCACTTCAAGAATCTCAAGAAAAGGAAATAATTGCAGTAACTTGG
GAAAATAATCGAAAGCAATGCAAGGCCAATGTTGATGAGATCAATGAACGCTTGGCAAAT
ATCAATTGTGTTTTTCAAGATGATGCCTACAACAACATTGAGCACAAAGATCATTTAAAA
AAGGCTTCAATTACAAGTCATGATGGTGTAAAGCAATCAGTACCAACATATGTCATGTCT
AATGTCATTCCAATACCAAATATGTGTGCATGGGCACCAGTACAGCAAAATTTCATGGTC
GAAGATGAGACAGTACTCTATAACATCCCCTATATGGGTGATGAAGTTCTAGACAAAGAT
TCAGACTTTATTGATGAATTAATTAAAAATTATGATGGAAAAGTTCATGGTGATAATGAT
GCAGAATTTATTGATGATAAATTATTTGTAGATTTAGTTAAAGCAATGATTCCACATCAA
AGTTCTGAACCATCAACAAGTGAAGCTTTTCCAAGTCAAATGACTTTCCAAGCTGTCAGT
GAAATGTTCCCGAGTAAAGGCACACCTGAAGAACTTCGACAAAATTATATAGAACTCACA
AAAAGTGTTGATCCAACTCGACCACCAGAATTTTGCACACCAAACATAGATGGACCAAAT
GCTGAATCAGTCAGTCATGATCAAACATTGCATTCTTTTCATGCACTTTTTTGTCGTCGA
TGTTTCAAATATGATTGTTTTTTGCATCGACTTTCAACACATCCTGGACCAAATTTACAA
AAACGTAAAGGACCAGAAGTTGAATCAAGTGGTGAGCCATGTGGTCCAGATTGCTATTTA
CATATTGTCCCTAAACAAACTGAACCAAGTCCTAAGCAACAAGATCAACGTCAAAAAGCA
ATTGATAGATCAAAATTTACTCTTGAATCAAGTCCAAGTCCAATGGAAGCTGATGATGTC
TGGACTGGCTCAGATCAGTCATTTTTCCGTGTTCTTCATCGTTGTTTTTTGGACAATTAT
TGTGCAATTGCTAAAGGAATGCAAACAAAAACATGTCGACAAGTTTATGAATTTGCACAA
GAAGAACTTCAACATACTGAGCCATTAGAATTAGGAAAACATGCATCACCACCAAGAAAA
CAACAAAAGAAGAAAATTAATACAAAATTATGGACAATGCATATTCGTAAAAATGCCAGA
AAAGAAGCAAATGCTAACAATGTTTATAAATACACACCATGTGATCATAGTGGACCATGT
GGTGCAGGCTGTTGCTGCTTTGACAGTTCAAATTATTGTGAAAAATTCTGCAATTGTTCA
TCTGATTGTGCTAATCGCTTTCCTGGATGTCGATGCAAAGCTCAATGTAATACAAAACAA
TGCAGTTGTTATTTGGCTGTTAGGGAATGCGATCCTGATTTATGTCAATGTGGTGCACAT
CAATTCCATTTGAATAAAATGACTTGTAAGAATGTTAGTGTACAGAGAGGTCTAAGAAAG
CATTTATTGATGGCACCGAGCGATATTGCTGGTTGGGGAATTTTCTTAAAAGATGGAGCT
CAGAAAAATGAATTTATTAGTGAATACTGCGGTGAAATAATCAGTCAAGATGAAGCTGAT
CGTCGTGGAAAAGTTTATGATAAATATGCATGCAGTTTTCTTTTCAATCTTAATAATGAC
TTTGTCGTTGATGCCACAAGAAAAGGAAATAAAATTCGCTTTGCAAATCATTCAATAAAT
CCAAATTGTTATGCAAAAGTCATCATGGTTAATGGCGATCATAGAATTGGAATTTTTGCT
AAAAGAATTATTCAACCTGGTGAAGAATTGTTCTTTGATTATCGTTATGGACCAACTGAA
CAGTTGAAATTTGTTGGAATTGAAAGAGAAATGGAAATTTTATAA

>g16349.t1 Gene=g16349 Length=654
MSTPKTPKVSLEFKKKLRQEYERLKALQESQEKEIIAVTWENNRKQCKANVDEINERLAN
INCVFQDDAYNNIEHKDHLKKASITSHDGVKQSVPTYVMSNVIPIPNMCAWAPVQQNFMV
EDETVLYNIPYMGDEVLDKDSDFIDELIKNYDGKVHGDNDAEFIDDKLFVDLVKAMIPHQ
SSEPSTSEAFPSQMTFQAVSEMFPSKGTPEELRQNYIELTKSVDPTRPPEFCTPNIDGPN
AESVSHDQTLHSFHALFCRRCFKYDCFLHRLSTHPGPNLQKRKGPEVESSGEPCGPDCYL
HIVPKQTEPSPKQQDQRQKAIDRSKFTLESSPSPMEADDVWTGSDQSFFRVLHRCFLDNY
CAIAKGMQTKTCRQVYEFAQEELQHTEPLELGKHASPPRKQQKKKINTKLWTMHIRKNAR
KEANANNVYKYTPCDHSGPCGAGCCCFDSSNYCEKFCNCSSDCANRFPGCRCKAQCNTKQ
CSCYLAVRECDPDLCQCGAHQFHLNKMTCKNVSVQRGLRKHLLMAPSDIAGWGIFLKDGA
QKNEFISEYCGEIISQDEADRRGKVYDKYACSFLFNLNNDFVVDATRKGNKIRFANHSIN
PNCYAKVIMVNGDHRIGIFAKRIIQPGEELFFDYRYGPTEQLKFVGIEREMEIL

Protein features from InterProScan

Transcript Database ID Name Start End E.value
14 g16349.t1 CDD cd00167 SANT 341 383 0.00383162
13 g16349.t1 Coils Coil Coil 14 34 -
12 g16349.t1 Coils Coil Coil 37 57 -
11 g16349.t1 Gene3D G3DSA:2.170.270.10 SET domain 431 637 6.4E-75
16 g16349.t1 MobiDBLite mobidb-lite consensus disorder prediction 306 333 -
5 g16349.t1 PANTHER PTHR45747 HISTONE-LYSINE N-METHYLTRANSFERASE E(Z) 4 166 3.6E-229
8 g16349.t1 PANTHER PTHR45747:SF1 HISTONE-LYSINE N-METHYLTRANSFERASE EZH1 4 166 3.6E-229
4 g16349.t1 PANTHER PTHR45747 HISTONE-LYSINE N-METHYLTRANSFERASE E(Z) 177 303 3.6E-229
7 g16349.t1 PANTHER PTHR45747:SF1 HISTONE-LYSINE N-METHYLTRANSFERASE EZH1 177 303 3.6E-229
6 g16349.t1 PANTHER PTHR45747 HISTONE-LYSINE N-METHYLTRANSFERASE E(Z) 335 654 3.6E-229
9 g16349.t1 PANTHER PTHR45747:SF1 HISTONE-LYSINE N-METHYLTRANSFERASE EZH1 335 654 3.6E-229
3 g16349.t1 Pfam PF18118 Polycomb repressive complex 2 tri-helical domain 183 220 2.4E-10
1 g16349.t1 Pfam PF18264 CXC domain 469 499 1.4E-7
2 g16349.t1 Pfam PF00856 SET domain 531 634 1.8E-17
17 g16349.t1 ProSiteProfiles PS51633 CXC domain profile. 414 513 18.902
18 g16349.t1 ProSiteProfiles PS50280 SET domain profile. 520 635 17.3
15 g16349.t1 SMART SM00317 set_7 520 641 1.9E-38
10 g16349.t1 SUPERFAMILY SSF82199 SET domain 423 643 1.31E-60

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0070734 histone H3-K27 methylation BP
GO:0005515 protein binding MF
GO:0018024 histone-lysine N-methyltransferase activity MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values