Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative GATA-binding factor A.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g10393 g10393.t2 TTS g10393.t2 9520358 9520358
chr_1 g10393 g10393.t2 isoform g10393.t2 9521169 9532824
chr_1 g10393 g10393.t2 exon g10393.t2.exon1 9521169 9521488
chr_1 g10393 g10393.t2 cds g10393.t2.CDS1 9521169 9521488
chr_1 g10393 g10393.t2 exon g10393.t2.exon2 9521548 9521760
chr_1 g10393 g10393.t2 cds g10393.t2.CDS2 9521548 9521760
chr_1 g10393 g10393.t2 exon g10393.t2.exon3 9521823 9521898
chr_1 g10393 g10393.t2 cds g10393.t2.CDS3 9521823 9521898
chr_1 g10393 g10393.t2 exon g10393.t2.exon4 9521961 9522125
chr_1 g10393 g10393.t2 cds g10393.t2.CDS4 9521961 9522125
chr_1 g10393 g10393.t2 exon g10393.t2.exon5 9522333 9522424
chr_1 g10393 g10393.t2 cds g10393.t2.CDS5 9522333 9522424
chr_1 g10393 g10393.t2 exon g10393.t2.exon6 9522622 9522873
chr_1 g10393 g10393.t2 cds g10393.t2.CDS6 9522622 9522873
chr_1 g10393 g10393.t2 exon g10393.t2.exon7 9523820 9523967
chr_1 g10393 g10393.t2 cds g10393.t2.CDS7 9523820 9523967
chr_1 g10393 g10393.t2 exon g10393.t2.exon8 9524327 9524552
chr_1 g10393 g10393.t2 cds g10393.t2.CDS8 9524327 9524552
chr_1 g10393 g10393.t2 exon g10393.t2.exon9 9526822 9527806
chr_1 g10393 g10393.t2 cds g10393.t2.CDS9 9526822 9527806
chr_1 g10393 g10393.t2 exon g10393.t2.exon10 9532803 9532824
chr_1 g10393 g10393.t2 cds g10393.t2.CDS10 9532803 9532824
chr_1 g10393 g10393.t2 TSS g10393.t2 9532968 9532968

Sequences

>g10393.t2 Gene=g10393 Length=2499
ATGTCTTCGACAAGAAATGACGATACATCATATGGCACAAATTCTTCAACAACTACGATC
GATCGCACAACATCAGTTGTTCTGAAGACGTCAAGTGATAAGGAGCGACTTGCAGATAGA
GACGAGCAAAAAAATAGCAATACTAACAATTTTGAAAGTGAAGAGACAAAAGTGACACGT
GAAATTCTTACACCATCGCCGGTAAATTTGAATGTCGATAATAAAAGTTGTTTAGAAATC
GATAGTGCTCTCTATACAAGTACATTACCATTAGTTGATAATTTTGTAAACAACACACAT
AGTGGTGGAAAATATTACGAATTCTCATCGACTGGAAATTGTTGGCCTTACGATCTTTCG
CGTACATCTAGCAACACACTCGATTATTCGCTTGTAACGCGTGGAAGTGATCAATTAGTG
ACAAACAGTGCAAATATTTATAGTGCAAATTTTCGTGGTCATCCATACGGCACAACATTC
TACAACACGACGACCGCCATGAAAGGTAATATGATGACATTATCGCAATTTGACGTTATG
TCGCCACCATCATCATCATCTTCCACCACCTCTGATTCATCGTCGTATTACGGACAAGAC
AATAACAATTCATCATCATTCATTAACATTAACGATTATGCAATTCATCATCAGTCGAGT
TATATTGATGATTTTTGTGAGATTTTAAAAGATGAAAATTATTTGTTAGTCGAAGATGCC
GGACATTATACAACCCTTACGAACGCTACAACATCGACTAGTGGCAGTGGACAATTTGAT
GCCTACTTTCAAGAACATATCCCGAGAAATTATGGTCATCAGCATTCAACATCGAGTGGT
GGTGATTCAAGAAGTCCTGATGGTTTCGCAGCCACTGATGACTACGATAATGGCATGCAA
AATTTTACTCAGCTAACGAGTCTAACTACACGAAATGGACTTTACACATCATCGCCAGTA
AACGTCACTGATAGCATGCTTTCCAACTATGATTCAACCGCAAATAGTTACGCGCGAACT
TTTTCGCCTGTTTCATCACAAGCAGCAACCATGTTGCCTTATCTTTCAAGTATCAGCCCC
ACAATCGAACAGCAAGTAATGTGGCCAGTGAATGGACACAATGATGATTTTATGGATACC
AATAACAGAATCGGCAAGCTTCCAGAATTTCAACGACTTACAAATTTTACAACGCAACCG
AAAAATACTCATTATACATCAAATTATAATCAGAATGATTGGTCATCATCATATTTGTCA
TCTACAATTTCAACATCAAATGGTCATGATACAAAATTGACAATTCCATCGCTTCCAGTT
GTAACACAGCGTGGTCGACCAGGAGTAACTCATTTATCCGCCTCACAATCACTATCAGCC
ATGCCGCAGACATTTATCTACGATGATTCGTGTGATATTGGTCGTGAATGTGTCAACTGC
GGTGCAATAAGTACACCATTATGGCGTCGTGATGGAACTGGTCACTATTTGTGCAATGCT
TGTGGACTTTATCATAAAATGAATGGCATGAATCGCCCTCTTGTTAAGCAGCCCAGAAGA
TTAGTAAGCAAAGTAGATTCAACAATTTTAATTTCATTTTTATTTTTTATTACATTTCTA
GGAAAGATAAAAATGGCTGGATCGAGTGAAATTTATAAAACGCCATTGCTCTATCCACCA
CCTGGTAATAGTGCACCACGCGAAGAAAAACAATCGCGACGTACGCCTGCTGCACGTCGT
ACAGGATTAAAATGTTCAAATTGCAATACCGTGAATACTTCACTATGGCGTAGAAATGCA
CAAGGTGAACCTGTTTGTAATGCATGTGGATTGTACTATAAACTACATAATGTCAATCGA
CCAATCACCATGAAAAAAGAACAAATTCAAACAAGAAAACGTAAACCAAAAGGCATGAAA
AATCCAGATGGAACTTCAAAAAAAGGAAAAATGAATGTTAGTTGTGATTACATGGTAAAT
GGCAAAATTTCAAATGCCCGTAAAACAAAAAACACCAAAGGACTGACAATAGAAACACCA
ATGCCAACAACTGAACAAAACCAAAGCAGCTCATCATCAGGCTCATCTTCAACGTCATCA
TCGTCATCAAATTCATCACCTAATTCACAAAATTCACCCCAAGTTGCATCACCACTTTCG
CATCATTCACCAATGACTTTACCAAGTTTAACACCACGTCCTGTAGTATCAGGAATGGGC
ACTTGTTCGAGTGTAATCACAACATCTCCAAATATGACGTCGTATCCGCCATTGACACCA
CTCCCACAACTCAACAGCCACTTGTACAATTCATCACTCTTGAGTCCGAATGCCGACAGC
AATCAATCATCTTTTCAATCAAACTTTGCTAACTTAAATCATCTCATAAAAACAGAAGAT
GGTGCACATATCGCACTAAATAACAATGATATTAATAGCAAGGCAATTGATTTGGAACTT
AAACATCGTCAGCTGCTGGCAAATATTGAATCTCGATAG

>g10393.t2 Gene=g10393 Length=832
MSSTRNDDTSYGTNSSTTTIDRTTSVVLKTSSDKERLADRDEQKNSNTNNFESEETKVTR
EILTPSPVNLNVDNKSCLEIDSALYTSTLPLVDNFVNNTHSGGKYYEFSSTGNCWPYDLS
RTSSNTLDYSLVTRGSDQLVTNSANIYSANFRGHPYGTTFYNTTTAMKGNMMTLSQFDVM
SPPSSSSSTTSDSSSYYGQDNNNSSSFININDYAIHHQSSYIDDFCEILKDENYLLVEDA
GHYTTLTNATTSTSGSGQFDAYFQEHIPRNYGHQHSTSSGGDSRSPDGFAATDDYDNGMQ
NFTQLTSLTTRNGLYTSSPVNVTDSMLSNYDSTANSYARTFSPVSSQAATMLPYLSSISP
TIEQQVMWPVNGHNDDFMDTNNRIGKLPEFQRLTNFTTQPKNTHYTSNYNQNDWSSSYLS
STISTSNGHDTKLTIPSLPVVTQRGRPGVTHLSASQSLSAMPQTFIYDDSCDIGRECVNC
GAISTPLWRRDGTGHYLCNACGLYHKMNGMNRPLVKQPRRLVSKVDSTILISFLFFITFL
GKIKMAGSSEIYKTPLLYPPPGNSAPREEKQSRRTPAARRTGLKCSNCNTVNTSLWRRNA
QGEPVCNACGLYYKLHNVNRPITMKKEQIQTRKRKPKGMKNPDGTSKKGKMNVSCDYMVN
GKISNARKTKNTKGLTIETPMPTTEQNQSSSSSGSSSTSSSSSNSSPNSQNSPQVASPLS
HHSPMTLPSLTPRPVVSGMGTCSSVITTSPNMTSYPPLTPLPQLNSHLYNSSLLSPNADS
NQSSFQSNFANLNHLIKTEDGAHIALNNNDINSKAIDLELKHRQLLANIESR

Protein features from InterProScan

Transcript Database ID Name Start End E.value
16 g10393.t2 CDD cd00202 ZnF_GATA 476 520 8.82564E-18
17 g10393.t2 CDD cd00202 ZnF_GATA 584 635 4.54418E-21
12 g10393.t2 Gene3D G3DSA:3.30.50.10 - 471 521 1.4E-25
11 g10393.t2 Gene3D G3DSA:3.30.50.10 - 574 635 2.7E-29
24 g10393.t2 MobiDBLite mobidb-lite consensus disorder prediction 1 27 -
27 g10393.t2 MobiDBLite mobidb-lite consensus disorder prediction 1 57 -
25 g10393.t2 MobiDBLite mobidb-lite consensus disorder prediction 28 57 -
23 g10393.t2 MobiDBLite mobidb-lite consensus disorder prediction 623 721 -
26 g10393.t2 MobiDBLite mobidb-lite consensus disorder prediction 626 644 -
22 g10393.t2 MobiDBLite mobidb-lite consensus disorder prediction 664 721 -
4 g10393.t2 PANTHER PTHR10071:SF154 TRANSCRIPTION FACTOR GATA-4 401 522 1.1E-75
6 g10393.t2 PANTHER PTHR10071 TRANSCRIPTION FACTOR GATA FAMILY MEMBER 401 522 1.1E-75
3 g10393.t2 PANTHER PTHR10071:SF154 TRANSCRIPTION FACTOR GATA-4 570 736 1.1E-75
5 g10393.t2 PANTHER PTHR10071 TRANSCRIPTION FACTOR GATA FAMILY MEMBER 570 736 1.1E-75
8 g10393.t2 PRINTS PR00619 Transcription factor GATA zinc finger signature 473 490 3.4E-14
7 g10393.t2 PRINTS PR00619 Transcription factor GATA zinc finger signature 491 508 3.4E-14
2 g10393.t2 Pfam PF00320 GATA zinc finger 477 509 7.1E-14
1 g10393.t2 Pfam PF00320 GATA zinc finger 585 618 3.6E-14
14 g10393.t2 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 1 520 -
15 g10393.t2 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 521 541 -
13 g10393.t2 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 542 832 -
20 g10393.t2 ProSitePatterns PS00344 GATA-type zinc finger domain. 477 501 -
21 g10393.t2 ProSitePatterns PS00344 GATA-type zinc finger domain. 585 609 -
29 g10393.t2 ProSiteProfiles PS50114 GATA-type zinc finger domain profile. 471 524 26.548
28 g10393.t2 ProSiteProfiles PS50114 GATA-type zinc finger domain profile. 579 632 27.866
18 g10393.t2 SMART SM00401 GATA_3 471 521 5.8E-16
19 g10393.t2 SMART SM00401 GATA_3 579 629 2.0E-21
9 g10393.t2 SUPERFAMILY SSF57716 Glucocorticoid receptor-like (DNA-binding domain) 471 517 4.75E-15
10 g10393.t2 SUPERFAMILY SSF57716 Glucocorticoid receptor-like (DNA-binding domain) 578 636 1.62E-17

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0006357 regulation of transcription by RNA polymerase II BP
GO:0006355 regulation of transcription, DNA-templated BP
GO:0043565 sequence-specific DNA binding MF
GO:0008270 zinc ion binding MF
GO:0003700 DNA-binding transcription factor activity MF

KEGG

Orthology

Pathway

This gene does not belong to any pathways.

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values