Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative Hemicentin-1.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_2 g6191 g6191.t1 TTS g6191.t1 15054032 15054032
chr_2 g6191 g6191.t1 isoform g6191.t1 15054089 15123342
chr_2 g6191 g6191.t1 exon g6191.t1.exon1 15054089 15054288
chr_2 g6191 g6191.t1 cds g6191.t1.CDS1 15054089 15054288
chr_2 g6191 g6191.t1 exon g6191.t1.exon2 15054361 15054744
chr_2 g6191 g6191.t1 cds g6191.t1.CDS2 15054361 15054744
chr_2 g6191 g6191.t1 exon g6191.t1.exon3 15057044 15057418
chr_2 g6191 g6191.t1 cds g6191.t1.CDS3 15057044 15057418
chr_2 g6191 g6191.t1 exon g6191.t1.exon4 15057875 15060028
chr_2 g6191 g6191.t1 cds g6191.t1.CDS4 15057875 15060028
chr_2 g6191 g6191.t1 exon g6191.t1.exon5 15097301 15097497
chr_2 g6191 g6191.t1 cds g6191.t1.CDS5 15097301 15097497
chr_2 g6191 g6191.t1 exon g6191.t1.exon6 15098247 15098507
chr_2 g6191 g6191.t1 cds g6191.t1.CDS6 15098247 15098507
chr_2 g6191 g6191.t1 exon g6191.t1.exon7 15098887 15099079
chr_2 g6191 g6191.t1 cds g6191.t1.CDS7 15098887 15099079
chr_2 g6191 g6191.t1 exon g6191.t1.exon8 15123297 15123342
chr_2 g6191 g6191.t1 cds g6191.t1.CDS8 15123297 15123342
chr_2 g6191 g6191.t1 TSS g6191.t1 15123974 15123974

Sequences

>g6191.t1 Gene=g6191 Length=3810
ATGGATTTACTCAAACGGATTAAATTTTTCTTTTTACTATCAATTGTGCTAATAGTGCTG
ACAAATGCAAATGGTGAAGATATTTCAATGGATACTCGAGACGGCGAAGATCTGATGCTT
CAATGTCGATTTAGTCCAGACTATTCAAGTAAAGATTTCATTTACTATTGGGCACGCAGT
TCACAACAGAAATTCGAAAATGTTGCTGTTGATCGCAATGCATTGAGTTCAACATACAAA
GTGGATTTCCGACCAGAAAAGGGTATTTATGATTTACGCATCACAAATGCATCATATAAT
CGAGATAATGGTCGATTTGAATGTCGCATCAAAGCACAAGGCACTGGTGATGATGTTCAT
CAGCAATATTACAATTTGACAGTTCTCACACCGCCACAGCCACCTCTTGTTGCACCAGGC
AATATTGTCGTTGCTACAGAAGAAAAGAAACAGGAATTGACATGCAGTAGTATTGGTGGC
TCTCCTGATCCAACTATCACATGGTATCGTGAAGGCACAAATGCGCCATTGCAGGCGATC
GTAAATCGTGGCGGTTCAAAGGATCATCAAACGACTAGCACATTGACCATTACACCCACA
AAGGATGATGATGGCGCACGCTATCGCTGTGTTGTATATAATCGTGCCATGAAGGAAGGC
GAAAAGTATGAGACGACCGTTACACTAAGTGTTAACTATTATCCACGTGTTGAAGTAGGT
CCCGAGAATCCATTACGTGTTGAACGTGATGGCACAGCAAAATTGGATTGTAATGTCGAT
GCGAAACCAAAAGTGACAAATGTTAGATGGACGAGAGGCGGACGTTTCATTAGCTCATCA
TTCTCGCATACAATTCATCGCGTGTCTGTACAGGATGCGGGCGAATATGAATGTACAGCT
GACAATGGTTTAGGACAAGTTGGTACTGCCAAAATTCAACTCGATGTACTTTTTGCACCA
GTTGTTGTTATCGAAACAAAAACACGCGAAGCTGAAGAACGCGAAAGCATTAACATTAAA
TGCAATGTCACTTCAAATCCACCACCTGTTGAGATTGAATGGGTTAAAGAAGGTAATCCT
GATTTTCGTCAATCAAGTGAAGTACTGTCAATCAATGATATTCGTGCGGAACATGCTGGC
ACTTATATTTGTCGCGGTATAAACAATATGCGTCCATATGGCGCGAAAGCAACTGAACGT
GTTGGAAATTCAACTGTCGCAGTACTTGTTCGTCATAGACCAGGAAAAGCTGTTATAACA
CCTAACAAACCAGTCGTTCATGTTGGCAATAGTGTGACTTTTACATGTTCCGCTAGCCCT
CCAGGCTGGCCAGTTCCACAATATAGATGGTTTAGAGATATTGAAGGTGAAATTTCTACA
CAAAAAGTTTTATCACAAGGATCGCAATATGTCATACCGCGTGCACATCTCGGTAGCGAA
GGCCGTTATCATTGTCATGCAGCAAATGAATTAGGACATGGAGAAATGGCAACAATAACG
CTCGAAGTGCATCAACCGCCACAATTTCTTGCTAAATTGCAGCAACATTTGACTCGACGT
GTAGGCGATAGTGATTTTAGTGTTTCATGCAGTGCAAAAGGAAAACCAAAACCAGCAGTC
AAATGGTTTAAAGATGGCCGTGAATTAACTGGTGAAGCAAATTTCTATGAAATAACAACA
TCACCAAATGAAGGTCCAAATGGAATGGTCACTGTTCAAAGCAATCTCAAATTTTTCGGT
AAAGGACGACCAAATTCAAATCAACTTTTGCCCAATGATCGTGGAACTTACTCATGCGTA
TATCAAAATGAAGTCAACTCTGCTAATTCAACAATGCATTTGAAAATTGAACATGAACCA
ATCATTCTACATCAATACAATAAAGTTGCTGCTGATTTGCGAGAAACTGCTGTTGTTGTT
TGCAAAGTTCAAGCTTATCCAAAGCCAGAATTTCAATGGACTTTTGGCACAAATACAGCT
CCAATTTCAATGAGCTCTGATGCACATTATGAAATCAACACCCAAAGTGATAACAATGAT
ATCTACACGAGTACACTTAAAATTCACAATGTTCGTCACAATGATTATGGCGAGTATAAT
TGCCGCGCTTCAAATTCACTTGAGACAATTAGAGCACCAATTAGATTACAACCTAAAGGT
GCTCCCGAGAAACCTACAAACCTTGATGCTGCCGACATTGGATCAAATTACATTTCACTC
AATTGGACGCCCGGCTTTGACGGTGGTCTTAGCAACACAAAGTTCTTTGTTCTATATCGA
AAAGTGTCAATTCCAAGTAATGAAGTGAATGGCGATTGTGGGTCATATTATACATCGTCG
CCCGACTGGTCAGAATTTGATTGTCATAGAAATTATCCCTGCAACATCACGCCCTTGGAT
CAACATTCAAGTTATGTATTTAAAGTTAAGGCAATGAACACGAAAGGCAATTCAGAATAT
TCGAGCGAGATTACGAAAACGACTAAAGTAAGCAAAATCCCATCCGCATTACGTGTCTCA
TACGATCCTTCGACGAAAATGTTGGGCATAAATGTCGGTCAGACATGCTTAGCACTTACG
GCAGTTGTAGAGACTATGTTACATGCTGAGACACATGTTCCAGAATGGCATATTGTTGGG
CAAATTCCATTACAAGCTTCAGGCAATGCACCGACATTCAAGGAATCAATTGTTGAGAAT
ATTATAATTCCAAGACGAAGTGCAGCACGCTCATTAGAGGATGAAGAAGATTTTACAATG
GCACTCGAGGATGAAAGCATGCCACGCGTAAGAGTCAAATTGTGCCTGCGAGGTAACCCA
GAACATTGTGGTGAATACATTGGAGCTGAAATCGGTCCATCATATATTCAGGACGTGGCT
TCAATTAACATGCCAACATTGGTGGCAATTGTCGTATCATGTATAGTTTTTGCTCTTTTC
ATTGGACTATTGCTTATGTTCTGTCGATGCAAAAAGAAGCAAACGAAAAAGAGTGTGCAG
GCAAAGGACTATGAAATGGATTCCGTTCGTCCATCAATTGTCGCACAACAAAATCAAGCA
CCGCCTCCTTACTATCCCGCAAGTGGTCTCGAGAACAAGGCACTCGAGCATTCAATGGAT
CTCGCTCTAGCTATGGAAGATCAAAAGACTGCAATTTATGCCACACAAAATGGATATGGT
TACAATAGCGGTTTGCAAGTGCCATCACATAACATTCCTCCTGATTGGGTCAACATGGGC
TATATGGAAAATAGTTATTCAAATTCAAACAATGGCGGAAGCGTAAATTCACAAGATTCA
CTTTATCAAATGAAAATGTCAGCAGCTGCAAGCAATTCTGGTAATTTAATTTCACAGCAT
CATCAATATGTTGAACGACAGCAGCCATCATATGGCTATGATCCTTTAACTCATGGTGGC
TATGGCACTGTAGACGACGATTATGCAACTTATCCACATTTGGCAACTACTCCATCACAA
CAAATGATAGCAGATGATTATCACAATCAAATGATGATGGCACAGCAGCCACCATCACGA
CAAGACTATTGTTCTGATCCATATGCAGCTGTACATAAACCAAATAAAAAGAGATTAGAC
TCACATATGGAATCGCCCTATCATGATGTCAGTGGCCTACCCGATCCATATATGGAACAA
CAAATGAATATGGGCGAAGATCAGCTAGATCATCCTAAGCCGCCGCAGCATTTGCCACAG
CATCTTGGTGGCTATGATGAAAACCTCGAAAGCGGCTATTCGACACCCAATTCACGAAAT
CGACGCGTTATACGCGAAATAATTGTCTGA

>g6191.t1 Gene=g6191 Length=1269
MDLLKRIKFFFLLSIVLIVLTNANGEDISMDTRDGEDLMLQCRFSPDYSSKDFIYYWARS
SQQKFENVAVDRNALSSTYKVDFRPEKGIYDLRITNASYNRDNGRFECRIKAQGTGDDVH
QQYYNLTVLTPPQPPLVAPGNIVVATEEKKQELTCSSIGGSPDPTITWYREGTNAPLQAI
VNRGGSKDHQTTSTLTITPTKDDDGARYRCVVYNRAMKEGEKYETTVTLSVNYYPRVEVG
PENPLRVERDGTAKLDCNVDAKPKVTNVRWTRGGRFISSSFSHTIHRVSVQDAGEYECTA
DNGLGQVGTAKIQLDVLFAPVVVIETKTREAEERESINIKCNVTSNPPPVEIEWVKEGNP
DFRQSSEVLSINDIRAEHAGTYICRGINNMRPYGAKATERVGNSTVAVLVRHRPGKAVIT
PNKPVVHVGNSVTFTCSASPPGWPVPQYRWFRDIEGEISTQKVLSQGSQYVIPRAHLGSE
GRYHCHAANELGHGEMATITLEVHQPPQFLAKLQQHLTRRVGDSDFSVSCSAKGKPKPAV
KWFKDGRELTGEANFYEITTSPNEGPNGMVTVQSNLKFFGKGRPNSNQLLPNDRGTYSCV
YQNEVNSANSTMHLKIEHEPIILHQYNKVAADLRETAVVVCKVQAYPKPEFQWTFGTNTA
PISMSSDAHYEINTQSDNNDIYTSTLKIHNVRHNDYGEYNCRASNSLETIRAPIRLQPKG
APEKPTNLDAADIGSNYISLNWTPGFDGGLSNTKFFVLYRKVSIPSNEVNGDCGSYYTSS
PDWSEFDCHRNYPCNITPLDQHSSYVFKVKAMNTKGNSEYSSEITKTTKVSKIPSALRVS
YDPSTKMLGINVGQTCLALTAVVETMLHAETHVPEWHIVGQIPLQASGNAPTFKESIVEN
IIIPRRSAARSLEDEEDFTMALEDESMPRVRVKLCLRGNPEHCGEYIGAEIGPSYIQDVA
SINMPTLVAIVVSCIVFALFIGLLLMFCRCKKKQTKKSVQAKDYEMDSVRPSIVAQQNQA
PPPYYPASGLENKALEHSMDLALAMEDQKTAIYATQNGYGYNSGLQVPSHNIPPDWVNMG
YMENSYSNSNNGGSVNSQDSLYQMKMSAAASNSGNLISQHHQYVERQQPSYGYDPLTHGG
YGTVDDDYATYPHLATTPSQQMIADDYHNQMMMAQQPPSRQDYCSDPYAAVHKPNKKRLD
SHMESPYHDVSGLPDPYMEQQMNMGEDQLDHPKPPQHLPQHLGGYDENLESGYSTPNSRN
RRVIREIIV

Protein features from InterProScan

Transcript Database ID Name Start End E.value
36 g6191.t1 CDD cd00096 Ig 253 307 1.11979E-8
38 g6191.t1 CDD cd00096 Ig 337 388 8.92703E-8
37 g6191.t1 CDD cd00096 Ig 432 495 1.2961E-6
39 g6191.t1 CDD cd00096 Ig 526 613 3.01012E-8
35 g6191.t1 CDD cd00096 Ig 637 715 6.5894E-11
40 g6191.t1 CDD cd00063 FN3 722 828 2.196E-11
26 g6191.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 26 133 1.4E-9
22 g6191.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 134 236 3.6E-22
23 g6191.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 237 318 1.3E-14
25 g6191.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 319 393 1.2E-13
20 g6191.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 413 511 3.0E-14
27 g6191.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 512 625 1.5E-22
24 g6191.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 626 714 1.6E-19
21 g6191.t1 Gene3D G3DSA:2.60.40.10 Immunoglobulins 715 835 8.8E-16
8 g6191.t1 PANTHER PTHR11640:SF134 ECHINOID, ISOFORM A-RELATED 7 1269 0.0
9 g6191.t1 PANTHER PTHR11640 NEPHRIN 7 1269 0.0
4 g6191.t1 Pfam PF13927 Immunoglobulin domain 134 214 2.0E-11
3 g6191.t1 Pfam PF13927 Immunoglobulin domain 235 302 4.0E-9
2 g6191.t1 Pfam PF13927 Immunoglobulin domain 320 388 1.1E-11
7 g6191.t1 Pfam PF13895 Immunoglobulin domain 418 493 3.7E-11
5 g6191.t1 Pfam PF13927 Immunoglobulin domain 506 603 1.6E-7
1 g6191.t1 Pfam PF13927 Immunoglobulin domain 620 705 3.2E-13
6 g6191.t1 Pfam PF00041 Fibronectin type III domain 724 821 1.5E-9
30 g6191.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 25 -
31 g6191.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 8 -
32 g6191.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 9 20 -
34 g6191.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 21 25 -
29 g6191.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 26 966 -
33 g6191.t1 Phobius TRANSMEMBRANE Region of a membrane-bound protein predicted to be embedded in the membrane. 967 988 -
28 g6191.t1 Phobius CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the cytoplasm. 989 1269 -
58 g6191.t1 ProSiteProfiles PS50835 Ig-like domain profile. 34 127 7.976
59 g6191.t1 ProSiteProfiles PS50835 Ig-like domain profile. 134 228 13.947
63 g6191.t1 ProSiteProfiles PS50835 Ig-like domain profile. 235 313 11.098
61 g6191.t1 ProSiteProfiles PS50835 Ig-like domain profile. 320 402 12.913
62 g6191.t1 ProSiteProfiles PS50835 Ig-like domain profile. 414 500 10.825
60 g6191.t1 ProSiteProfiles PS50835 Ig-like domain profile. 507 617 11.624
57 g6191.t1 ProSiteProfiles PS50835 Ig-like domain profile. 620 717 10.154
56 g6191.t1 ProSiteProfiles PS50853 Fibronectin type-III domain profile. 724 831 15.408
46 g6191.t1 SMART SM00409 IG_3c 27 129 0.11
45 g6191.t1 SMART SM00409 IG_3c 140 232 1.3E-6
51 g6191.t1 SMART SM00408 igc2_5 146 217 1.5E-6
44 g6191.t1 SMART SM00409 IG_3c 242 317 1.8E-5
54 g6191.t1 SMART SM00408 igc2_5 248 305 0.0017
42 g6191.t1 SMART SM00409 IG_3c 326 411 2.6E-6
53 g6191.t1 SMART SM00408 igc2_5 332 391 0.0019
43 g6191.t1 SMART SM00409 IG_3c 421 504 1.8E-9
50 g6191.t1 SMART SM00408 igc2_5 427 492 2.6E-6
48 g6191.t1 SMART SM00409 IG_3c 514 617 5.6
52 g6191.t1 SMART SM00408 igc2_5 521 606 0.0011
47 g6191.t1 SMART SM00409 IG_3c 626 719 3.5E-5
55 g6191.t1 SMART SM00408 igc2_5 632 708 1.1E-7
49 g6191.t1 SMART SM00060 FN3_2 722 818 1.0E-8
16 g6191.t1 SUPERFAMILY SSF48726 Immunoglobulin 34 136 8.68E-8
13 g6191.t1 SUPERFAMILY SSF48726 Immunoglobulin 131 235 1.67E-17
12 g6191.t1 SUPERFAMILY SSF48726 Immunoglobulin 231 321 8.32E-14
11 g6191.t1 SUPERFAMILY SSF48726 Immunoglobulin 316 398 1.67E-13
15 g6191.t1 SUPERFAMILY SSF48726 Immunoglobulin 417 510 1.82E-13
17 g6191.t1 SUPERFAMILY SSF48726 Immunoglobulin 502 620 2.53E-12
14 g6191.t1 SUPERFAMILY SSF48726 Immunoglobulin 620 726 2.65E-17
10 g6191.t1 SUPERFAMILY SSF49265 Fibronectin type III 711 833 7.76E-16
19 g6191.t1 SignalP_EUK SignalP-noTM SignalP-noTM 1 25 -
18 g6191.t1 SignalP_GRAM_POSITIVE SignalP-TM SignalP-TM 1 25 -
41 g6191.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 966 988 -

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005515 protein binding MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values