Gene loci information

Transcript annotation

  • This transcript has been annotated as Golgi apparatus protein 1.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_3 g2437 g2437.t1 TSS g2437.t1 17998728 17998728
chr_3 g2437 g2437.t1 isoform g2437.t1 17998857 18002433
chr_3 g2437 g2437.t1 exon g2437.t1.exon1 17998857 17998915
chr_3 g2437 g2437.t1 cds g2437.t1.CDS1 17998857 17998915
chr_3 g2437 g2437.t1 exon g2437.t1.exon2 17998975 18000933
chr_3 g2437 g2437.t1 cds g2437.t1.CDS2 17998975 18000933
chr_3 g2437 g2437.t1 exon g2437.t1.exon3 18000992 18001223
chr_3 g2437 g2437.t1 cds g2437.t1.CDS3 18000992 18001223
chr_3 g2437 g2437.t1 exon g2437.t1.exon4 18001289 18002027
chr_3 g2437 g2437.t1 cds g2437.t1.CDS4 18001289 18002027
chr_3 g2437 g2437.t1 exon g2437.t1.exon5 18002084 18002196
chr_3 g2437 g2437.t1 cds g2437.t1.CDS5 18002084 18002196
chr_3 g2437 g2437.t1 exon g2437.t1.exon6 18002263 18002433
chr_3 g2437 g2437.t1 cds g2437.t1.CDS6 18002263 18002433
chr_3 g2437 g2437.t1 TTS g2437.t1 NA NA

Sequences

>g2437.t1 Gene=g2437 Length=3273
ATGTACAAATTGCTGGTTTTATTATTGAATTTTTCATTTTTATCTTACGCCGATGACAAA
ATAAAAAGTCAAAATTTGCTTGAAGATCCTGCATGTCCACAACTTAAAAATCTTTGTAGC
AATTTATCGAATAATAGTGAAAATTTATTGGTGCTTGAATGTATTGACACGTTTCAGACA
TCAGAATATGACCTGAACATCGATTCACAGTGTCAACATGCAATTTATTCAAAGAAGCTT
GAACTAATGAATGATAAAAATGTTCACAGTTTACTCGAGCGAAGCTGCAAAGATATTGAG
GTGTTGAATACTTATTGCCAGCCTGATAGTAAAGAGTACTCTGGAAAGTACTTATCATGT
GTACTGGACAATCGAGACATAATAAAAGACTTAATTTGTAAGGGACAAATTCAGCGCATT
GAAACAGTTGCATTCAGTGACTTTCGACTTATTTCTAAATTTCTCACAAGTTGTTCAAAT
GATATCGAAAAAAGCAACTGTGGAAGACTGACGAAAAACTCAAAATCAACTCAAGGCGAT
ACACTTTCTTGTTTACAAATGCAAATTTCACAACTTTCAGATAATTGCAAGAAAGAAATT
TATCATATTTCTGAATTACAAGCTGACAATATAAAATTCGATCGACAAACGTATATGGCA
TGCAATAATGATGTGAAAAAATTCTGTTCAGACATGCGAGGCTACGAAATTTACAAATGT
CTCATGAAGAATAAAAATTCACAGCAAATGTCTAAAAAGTGTGAAAGTCAATTAAACAGA
CGATCTTCATTGATGGGACAAGACTATCGTATAAGTCGCGGACTTGCAAAAGCATGTAAA
GAAGATATAAAACTTAATCATTGTAGAAAAGGCACAAGCGATGATAAAGACATTAGATTA
GCTCAGATTTTACTGTGTTTAGAAGCTGCTCATAAAAATAACTCAAAATTATTGCCTGAT
TGTCTTGCTGAGATTTTTGATCATCGAAAAATGTTGATGGAAGATTACAAACTTTCACCT
GAAATTATAATTGACTGTTCTGAAGAATTGACGAGATTTTGCACAAATACTGAAGGTTCA
CAAACAATTCATTGTCTAATGGATCATGCAAAACCAAGAAAAAAGAAGGAATTGAGAGTT
TCAGCACAATGTTTGAGAGCAGTTGAAAATTTAATAAAGGTCACAGATGTCTCTGAAGAT
TGGAGAGTTGATCCAGTTTTGAGAAAAGCATGCAAACCAGTTGTCGATCAAGTATGTGGT
AATGATGTGGAAGGAAATTCAAGAGTTATGAGTTGTCTTATGGAGAAATTAGGAACAAAA
TATATGGCAGGTCCGTGTGAAGCTGCACTTTTGCAAATTCAGTACTTTACGGCTAGAGAC
TTTAAACTTGATGCAATGCTTTATTCGCAATGTAAAGAGGATGCAATTAAATTTTGCCAT
GCAAAAAAGACATGGGCAGATGTGAATGATAATCAAATGGATCCCGAGCGAGGACCAATT
ATTTTGCCTTGCTTATATCGCTATGCTTATCATCCTGATCCAAAAATGCAATTAACACAA
AATTGCTTCACAGAAATCAAAAGAGTTATGAGAGAAAGAGCAGTTTCAGTTGATCTCATT
CCAGAAGTTGAAGATGTCTGTTTAGATGATTTAGCAGAATTTTGTTTTGACAAGACACAA
AAGGGAGAAGAAATGGAATGTTTGCAGCAACATTTAGAAGAATTAACAAAAGAATGCAAA
AAAGCAGTAACAAGTTATACTGAGGAAGAGTCAGCACATATAGAACTCAATCCACTTATT
CGATCAGCGTGTAGTGATGCATTAGAAAAATATTGTGGAAATATTATGGCGGGTCGAGAG
GATGGCGACGTAATGGAATGTTTGATTTCACTTAAAAATGACGTTTTAAAACAAAACATC
AAGTGTCGTGCAGCAATTGAACATTTTCAATTGATATCACTAAAAAACTATGTGTTTACT
CACAAATTTAAAGAAGCGTGTAAGCCATACGTTAATAGATATTGTCCAAATAGTAGCACT
AAATACGATGTTATTGCATGTCTTAGTGAAATTATGACAAAAGATACCATAAAAGATCAC
AAACATTCACTACCTAAAGATTGTCGAATGCAAGTTCGCTCTCAACTTCTTCAGCAACGC
GAAAATATTGATTTTGATCCAAAATTAAAGAACACATGCAAAAAAGACATTGAAACTTTT
TGTTATAAAGTCGATAAAGACTTTGGTCAAGTTCTCGAATGTTTAACAAGCAATCAAAAT
AAATTAAGCACTAACTGTAAGCATGCTGTTTTTGCGATAAAAAAATCGGAATTGACTGAC
AGCAAAACTGATTTCGCACTAATGACTACGTGCAAAGGCATGTTGAAGCAATATTGCGAG
AATATCGATGATGCAAATGTACTGAAATGCTTAAAACTTCATAAAGATGAAAATATGTTT
GATAAGAAATGCCACATGATTGTCGTCAACCGACTAATTTTACAGAATCAAGATTATAGA
TTTAATCCCGATTTGCAACAAGCATGTTCAAAAGATATTGCAGATTATTGCACAAGAATT
ATTGTTGAAGCCAAAGAAAATGAAGAAATGAACGGAAAAGTAATAAATTGCCTCAAACAA
AAATTCAGAGAAGGGAAATTACAAACGAAATGTGAAAAACAAATGACAGTGATTCTACAC
GATCAAGCACTTAATTATAAACTGAATCCACTTTTAGCAGCAGTATGCAAATCAGAAATT
GATATTTTATGTCGTATGGATGATGAAAATGATGAAGAGGGACAAGTAGAAGAATGTCTA
AAACGTCAATTTATTGAGAAGAAAATTATTACGAAAGAATGTAAAGTTGAGATTGCAACA
TTGATTCAAGAAGCCAAAGCTGATATTCATGTTGATCCAATTCTTCTAAAAGCATGCACA
GTTGATTTATTAAAATATTGTTCAAATGTTGAAAGTGGAAATGGAAGACAACTTAAATGT
CTGCAAATAATTTTAAACGATGAAACGAAATCAAAAGCACTTGAAGATGATTGTAGAGAA
AAATTACAACAAAGAGTTGAAATGTTTAATAATGCGGCAGCAGTAATGCCACAACCTGAA
AGTTTTCAAGATCTTTATGAAACAGTTTCAAATTCGCCCGCAAGAAAGTATTTCATGATT
GTGATGTTCACATTTGTAGGATTTATTTTCATCATTGGACTATTCTTCGGCCGTGTTACT
AATAAAAGATATACAGCACTGAAAAATAAGTGA

>g2437.t1 Gene=g2437 Length=1090
MYKLLVLLLNFSFLSYADDKIKSQNLLEDPACPQLKNLCSNLSNNSENLLVLECIDTFQT
SEYDLNIDSQCQHAIYSKKLELMNDKNVHSLLERSCKDIEVLNTYCQPDSKEYSGKYLSC
VLDNRDIIKDLICKGQIQRIETVAFSDFRLISKFLTSCSNDIEKSNCGRLTKNSKSTQGD
TLSCLQMQISQLSDNCKKEIYHISELQADNIKFDRQTYMACNNDVKKFCSDMRGYEIYKC
LMKNKNSQQMSKKCESQLNRRSSLMGQDYRISRGLAKACKEDIKLNHCRKGTSDDKDIRL
AQILLCLEAAHKNNSKLLPDCLAEIFDHRKMLMEDYKLSPEIIIDCSEELTRFCTNTEGS
QTIHCLMDHAKPRKKKELRVSAQCLRAVENLIKVTDVSEDWRVDPVLRKACKPVVDQVCG
NDVEGNSRVMSCLMEKLGTKYMAGPCEAALLQIQYFTARDFKLDAMLYSQCKEDAIKFCH
AKKTWADVNDNQMDPERGPIILPCLYRYAYHPDPKMQLTQNCFTEIKRVMRERAVSVDLI
PEVEDVCLDDLAEFCFDKTQKGEEMECLQQHLEELTKECKKAVTSYTEEESAHIELNPLI
RSACSDALEKYCGNIMAGREDGDVMECLISLKNDVLKQNIKCRAAIEHFQLISLKNYVFT
HKFKEACKPYVNRYCPNSSTKYDVIACLSEIMTKDTIKDHKHSLPKDCRMQVRSQLLQQR
ENIDFDPKLKNTCKKDIETFCYKVDKDFGQVLECLTSNQNKLSTNCKHAVFAIKKSELTD
SKTDFALMTTCKGMLKQYCENIDDANVLKCLKLHKDENMFDKKCHMIVVNRLILQNQDYR
FNPDLQQACSKDIADYCTRIIVEAKENEEMNGKVINCLKQKFREGKLQTKCEKQMTVILH
DQALNYKLNPLLAAVCKSEIDILCRMDDENDEEGQVEECLKRQFIEKKIITKECKVEIAT
LIQEAKADIHVDPILLKACTVDLLKYCSNVESGNGRQLKCLQIILNDETKSKALEDDCRE
KLQQRVEMFNNAAAVMPQPESFQDLYETVSNSPARKYFMIVMFTFVGFIFIIGLFFGRVT
NKRYTALKNK

Protein features from InterProScan

Transcript Database ID Name Start End E.value
17 g2437.t1 Coils Coil Coil 558 592 -
15 g2437.t1 PANTHER PTHR11884 SELECTIN LIGAND RELATED 20 1090 0.0
14 g2437.t1 Pfam PF00839 Cysteine rich repeat 133 191 2.7E-10
11 g2437.t1 Pfam PF00839 Cysteine rich repeat 194 248 1.7E-10
5 g2437.t1 Pfam PF00839 Cysteine rich repeat 253 313 4.6E-10
4 g2437.t1 Pfam PF00839 Cysteine rich repeat 319 371 8.6E-12
12 g2437.t1 Pfam PF00839 Cysteine rich repeat 383 438 2.4E-11
7 g2437.t1 Pfam PF00839 Cysteine rich repeat 445 511 8.5E-11
2 g2437.t1 Pfam PF00839 Cysteine rich repeat 521 575 9.1E-9
6 g2437.t1 Pfam PF00839 Cysteine rich repeat 577 634 1.8E-11
3 g2437.t1 Pfam PF00839 Cysteine rich repeat 641 692 4.1E-8
9 g2437.t1 Pfam PF00839 Cysteine rich repeat 706 761 9.0E-12
1 g2437.t1 Pfam PF00839 Cysteine rich repeat 765 817 5.2E-9
13 g2437.t1 Pfam PF00839 Cysteine rich repeat 823 885 1.6E-14
8 g2437.t1 Pfam PF00839 Cysteine rich repeat 890 946 6.0E-10
10 g2437.t1 Pfam PF00839 Cysteine rich repeat 952 1008 1.2E-11
20 g2437.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 17 -
21 g2437.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 3 -
22 g2437.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 4 13 -
24 g2437.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 14 17 -
19 g2437.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 18 1056 -
23 g2437.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 1057 1080 -
18 g2437.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 1081 1090 -
28 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 66 129 9.843
37 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 133 186 6.581
38 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 191 249 15.482
29 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 250 315 10.322
33 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 316 374 15.018
40 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 379 441 14.61
30 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 446 513 9.197
41 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 517 573 9.59
39 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 574 636 13.471
35 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 637 696 12.318
34 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 703 763 13.752
27 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 765 817 7.566
31 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 819 886 13.541
36 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 887 948 10.223
32 g2437.t1 ProSiteProfiles PS51289 Cysteine-rich GLG1 repeat profile. 949 1009 14.132
16 g2437.t1 SignalP_EUK SignalP-noTM SignalP-noTM 1 17 -
25 g2437.t1 SignalP_GRAM_NEGATIVE SignalP-noTM SignalP-noTM 1 17 -
26 g2437.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 1057 1079 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0016020 membrane CC
GO:0000139 Golgi membrane CC

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values