Gene loci information

Transcript annotation

  • This transcript has been annotated as Putative Collagen alpha-1(XVIII) chain.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is shown below. CDS regions are colored green. TSS and TTS positions predicted from CTR-Seq data are marked with solid circles and squares, respectively. Detailed coordinates are listed in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g11445 g11445.t1 TTS g11445.t1 16300976 16300976
chr_1 g11445 g11445.t1 isoform g11445.t1 16301514 16305802
chr_1 g11445 g11445.t1 exon g11445.t1.exon1 16301514 16301650
chr_1 g11445 g11445.t1 cds g11445.t1.CDS1 16301514 16301650
chr_1 g11445 g11445.t1 exon g11445.t1.exon2 16301716 16301787
chr_1 g11445 g11445.t1 cds g11445.t1.CDS2 16301716 16301787
chr_1 g11445 g11445.t1 exon g11445.t1.exon3 16301869 16301927
chr_1 g11445 g11445.t1 cds g11445.t1.CDS3 16301869 16301927
chr_1 g11445 g11445.t1 exon g11445.t1.exon4 16301993 16302181
chr_1 g11445 g11445.t1 cds g11445.t1.CDS4 16301993 16302181
chr_1 g11445 g11445.t1 exon g11445.t1.exon5 16302243 16302373
chr_1 g11445 g11445.t1 cds g11445.t1.CDS5 16302243 16302373
chr_1 g11445 g11445.t1 exon g11445.t1.exon6 16302432 16302473
chr_1 g11445 g11445.t1 cds g11445.t1.CDS6 16302432 16302473
chr_1 g11445 g11445.t1 exon g11445.t1.exon7 16302530 16302661
chr_1 g11445 g11445.t1 cds g11445.t1.CDS7 16302530 16302661
chr_1 g11445 g11445.t1 exon g11445.t1.exon8 16302729 16302821
chr_1 g11445 g11445.t1 cds g11445.t1.CDS8 16302729 16302821
chr_1 g11445 g11445.t1 exon g11445.t1.exon9 16302875 16302948
chr_1 g11445 g11445.t1 cds g11445.t1.CDS9 16302875 16302948
chr_1 g11445 g11445.t1 exon g11445.t1.exon10 16303009 16303038
chr_1 g11445 g11445.t1 cds g11445.t1.CDS10 16303009 16303038
chr_1 g11445 g11445.t1 exon g11445.t1.exon11 16303096 16303129
chr_1 g11445 g11445.t1 cds g11445.t1.CDS11 16303096 16303129
chr_1 g11445 g11445.t1 exon g11445.t1.exon12 16303223 16303309
chr_1 g11445 g11445.t1 cds g11445.t1.CDS12 16303223 16303309
chr_1 g11445 g11445.t1 exon g11445.t1.exon13 16303377 16303453
chr_1 g11445 g11445.t1 cds g11445.t1.CDS13 16303377 16303453
chr_1 g11445 g11445.t1 exon g11445.t1.exon14 16303596 16303623
chr_1 g11445 g11445.t1 cds g11445.t1.CDS14 16303596 16303623
chr_1 g11445 g11445.t1 exon g11445.t1.exon15 16303702 16303727
chr_1 g11445 g11445.t1 cds g11445.t1.CDS15 16303702 16303727
chr_1 g11445 g11445.t1 exon g11445.t1.exon16 16303783 16303840
chr_1 g11445 g11445.t1 cds g11445.t1.CDS16 16303783 16303840
chr_1 g11445 g11445.t1 exon g11445.t1.exon17 16304018 16304053
chr_1 g11445 g11445.t1 cds g11445.t1.CDS17 16304018 16304053
chr_1 g11445 g11445.t1 exon g11445.t1.exon18 16304241 16304263
chr_1 g11445 g11445.t1 cds g11445.t1.CDS18 16304241 16304263
chr_1 g11445 g11445.t1 exon g11445.t1.exon19 16304324 16304432
chr_1 g11445 g11445.t1 cds g11445.t1.CDS19 16304324 16304432
chr_1 g11445 g11445.t1 exon g11445.t1.exon20 16304509 16304583
chr_1 g11445 g11445.t1 cds g11445.t1.CDS20 16304509 16304583
chr_1 g11445 g11445.t1 exon g11445.t1.exon21 16304680 16304756
chr_1 g11445 g11445.t1 cds g11445.t1.CDS21 16304680 16304756
chr_1 g11445 g11445.t1 exon g11445.t1.exon22 16304823 16304833
chr_1 g11445 g11445.t1 cds g11445.t1.CDS22 16304823 16304833
chr_1 g11445 g11445.t1 exon g11445.t1.exon23 16304995 16305034
chr_1 g11445 g11445.t1 cds g11445.t1.CDS23 16304995 16305034
chr_1 g11445 g11445.t1 exon g11445.t1.exon24 16305114 16305143
chr_1 g11445 g11445.t1 cds g11445.t1.CDS24 16305114 16305143
chr_1 g11445 g11445.t1 exon g11445.t1.exon25 16305210 16305288
chr_1 g11445 g11445.t1 cds g11445.t1.CDS25 16305210 16305288
chr_1 g11445 g11445.t1 exon g11445.t1.exon26 16305584 16305727
chr_1 g11445 g11445.t1 cds g11445.t1.CDS26 16305584 16305727
chr_1 g11445 g11445.t1 exon g11445.t1.exon27 16305794 16305802
chr_1 g11445 g11445.t1 cds g11445.t1.CDS27 16305794 16305802
chr_1 g11445 g11445.t1 TSS g11445.t1 16306518 16306518
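
The coordinates in the table can be cross-checked against the sequence length reported under Sequences. The following is a minimal sketch, assuming the Start and End columns are 1-based and inclusive (the page does not state this). The names `EXONS` and `spliced_length` are illustrative and not part of the database.

```python
# Exon (Start, End) pairs copied from the table above for g11445.t1.
EXONS = [
    (16301514, 16301650), (16301716, 16301787), (16301869, 16301927),
    (16301993, 16302181), (16302243, 16302373), (16302432, 16302473),
    (16302530, 16302661), (16302729, 16302821), (16302875, 16302948),
    (16303009, 16303038), (16303096, 16303129), (16303223, 16303309),
    (16303377, 16303453), (16303596, 16303623), (16303702, 16303727),
    (16303783, 16303840), (16304018, 16304053), (16304241, 16304263),
    (16304324, 16304432), (16304509, 16304583), (16304680, 16304756),
    (16304823, 16304833), (16304995, 16305034), (16305114, 16305143),
    (16305210, 16305288), (16305584, 16305727), (16305794, 16305802),
]


def spliced_length(exons):
    """Sum exon lengths, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in exons)


total = spliced_length(EXONS)
print(len(EXONS), total)  # prints "27 1902"
```

Under that assumption the 27 exons sum to 1902 nt, which agrees with Length=1902 in the nucleotide FASTA header, and 1902 = 633 × 3 + 3 is consistent with a 633-residue protein plus a stop codon. The TSS lies at a higher coordinate than the TTS, which suggests the transcript is on the minus strand, although the table has no strand column to confirm this.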

Sequences

>g11445.t1 Gene=g11445 Length=1902
ATGGCTATGGTTATATCAAGTAGAGCAAAAGGGATGATTGGCGCATTTATATCATTGATA
CTTTTATCAACTGTTCTTGTAACTGCATCGACAAAAGGCTGGTGGTTTGGTTTAAATAAA
AATAATGGTGAACATGTTGCTGCTCGAATACAGGCAAACAGTGACTCAGATTTTAATGAT
TATAGTAACGAAAATGAAAGTCCAGTTTTACAGGCTCCACCAGATTTTAAGAATTATGGT
GGATACAGAAAGGGTGAAAAAGGTGAGAAGGGAGCTAGAGGAATTCCAGGCGATTCAATC
AGAGGGCCACCTGGACCTCCAGGTCCAAAAGGAGAATGTCAAATAGTTAATTTTAATAAT
AATACCAACAGTTACAATAATAATTTCAAACAGACGGAACAAAAATTAGCACCAGTTTGT
GCATGTAATTACGACAATATTATTGATATTCTTCACAATGAATCGGTAATTCAAATTCTA
CGAGGCCCTCAAGGCCCTCCGGGATTAACGGGAGCTCCCGGTCAAAAAGGGGAAATGGGC
GAAAGAGGAGCAGATGGTATTGATGGAATTCCAGGATTGCCAGGGACACCAGGAGAGAGT
TCATCTAATTGGGACTCATCTAGGATGTACAAGGAATCAATGATGGGAAGCATTCGTGGC
AAGGATTCACGAGGAGAAAAAGGCGATAAAGGCGATATGGGTATGAAAGGAATGAAGGGA
GAAGGTGGAGCAAAAGGAGAAAAAGGAGCATGTATAACAGTTCCAGAAATACAGACTAAC
AATTGCGGTTGTCCATTCAATGATACATACAAAGGAATAAAGGGAGATAAAGGGCTTAGA
GGAAAACGTGGAAAAACTGGCAGTCAAGGAGAAAAAGGACAGAAAGGAGATAGTGGGTCA
TCAGTGGGGCCAAAAGGTGACAAAGGAGAGCGAGGTCAACCAGGTCTGCCAGGACCACCT
TTTAGTGGCTTTGATGACTCAATGAATTATCAACGATCAGGAATCGGCACAATAATCACA
TTTCAAAATACTGACACAATGATAAAACAATCATCTACATATCCTGTAGGTTCAATTTGT
TATGTTATAGATGAGGAAGCTCTGTTAGTGAAAGTTTCAAAAGGATGGCAATACATTGCT
CTTGGCACATTATTACCATTCACAACTCCTTATGTAACCACTTCACCAATGTCTCCAACT
TCCTACATGGACCTTCAAGCTTCAAATTTGCTCAACAGTAACAGTATTTTAAAGTCTCCT
GAGAGCTATACATTTACAACACCTCCAGAATATGAAACATGGAATCCAAAAATGTTAAGA
TTGATTGCATTGAATGAACCATACTCTGGTAATTTACAAGGTTTACGAAACGCTGATTTA
AATTGTCATCGACAAGCAAGACGATCTGGATTGATGGGTAACTTTAGAGCTTTCTTATCA
ACTAGAATTCAGAACTTGGATTCTCTAATAAAACCCGAAGACAGAGAATTGCCAATAACA
AACTTGCGTGGGGATGTGCTTTTTAATTCATTCAACGCTATTTTCAATAATAATGCTCAA
GGAATCTTTCTGTCATCCAATTCACCGCGAATTATTAGCTTCAGTGGCAAAAATGTGATG
AATGACAATACTTGGCCTCATAAAATTGTTTGGCATGGCGCACGTGCGGATTCAATAGAC
ACAAATTGTGAAGGTTGGCATAGCAATTTTCAAGATAAGGTTGGTTTAGGGAGCAGTCTG
TTAGGAAATAAGTTACTTGCTCAAGAAATGTATAGTTGTCAGCAAAAGAATATTGTTCTA
TGCATTGAAGTGTTATCGCATAGTAGCAGTGGTGATATTGCAAATCGTCGAAAGCGTGAG
ATGATGCAGAGTAATGACGATACATACGACAACGAAAAATGA

>g11445.t1 Gene=g11445 Length=633
MAMVISSRAKGMIGAFISLILLSTVLVTASTKGWWFGLNKNNGEHVAARIQANSDSDFND
YSNENESPVLQAPPDFKNYGGYRKGEKGEKGARGIPGDSIRGPPGPPGPKGECQIVNFNN
NTNSYNNNFKQTEQKLAPVCACNYDNIIDILHNESVIQILRGPQGPPGLTGAPGQKGEMG
ERGADGIDGIPGLPGTPGESSSNWDSSRMYKESMMGSIRGKDSRGEKGDKGDMGMKGMKG
EGGAKGEKGACITVPEIQTNNCGCPFNDTYKGIKGDKGLRGKRGKTGSQGEKGQKGDSGS
SVGPKGDKGERGQPGLPGPPFSGFDDSMNYQRSGIGTIITFQNTDTMIKQSSTYPVGSIC
YVIDEEALLVKVSKGWQYIALGTLLPFTTPYVTTSPMSPTSYMDLQASNLLNSNSILKSP
ESYTFTTPPEYETWNPKMLRLIALNEPYSGNLQGLRNADLNCHRQARRSGLMGNFRAFLS
TRIQNLDSLIKPEDRELPITNLRGDVLFNSFNAIFNNNAQGIFLSSNSPRIISFSGKNVM
NDNTWPHKIVWHGARADSIDTNCEGWHSNFQDKVGLGSSLLGNKLLAQEMYSCQQKNIVL
CIEVLSHSSSGDIANRRKREMMQSNDDTYDNEK

Protein features from InterProScan

No. Transcript Database ID Name Start End E.value
12 g11445.t1 Gene3D G3DSA:1.20.5.320 - 83 135 2.5E-5
11 g11445.t1 Gene3D G3DSA:2.10.10.50 - 337 377 1.6E-6
13 g11445.t1 Gene3D G3DSA:3.10.100.10 - 432 605 7.7E-61
22 g11445.t1 MobiDBLite mobidb-lite consensus disorder prediction 86 107 -
21 g11445.t1 MobiDBLite mobidb-lite consensus disorder prediction 215 236 -
24 g11445.t1 MobiDBLite mobidb-lite consensus disorder prediction 215 247 -
20 g11445.t1 MobiDBLite mobidb-lite consensus disorder prediction 275 290 -
23 g11445.t1 MobiDBLite mobidb-lite consensus disorder prediction 275 313 -
6 g11445.t1 PANTHER PTHR24023:SF965 COLLAGEN TYPE XVIII ALPHA 1 CHAIN B 75 117 2.0E-56
8 g11445.t1 PANTHER PTHR24023 COLLAGEN ALPHA 75 117 2.0E-56
5 g11445.t1 PANTHER PTHR24023:SF965 COLLAGEN TYPE XVIII ALPHA 1 CHAIN B 162 597 2.0E-56
7 g11445.t1 PANTHER PTHR24023 COLLAGEN ALPHA 162 597 2.0E-56
3 g11445.t1 Pfam PF01391 Collagen triple helix repeat (20 copies) 162 199 2.2E-7
4 g11445.t1 Pfam PF01391 Collagen triple helix repeat (20 copies) 272 319 6.5E-7
2 g11445.t1 Pfam PF06482 Collagenase NC10 and Endostatin 339 393 3.7E-7
1 g11445.t1 Pfam PF06482 Collagenase NC10 and Endostatin 427 604 4.7E-58
15 g11445.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 29 -
16 g11445.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 12 -
17 g11445.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 13 24 -
18 g11445.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 25 29 -
14 g11445.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 30 633 -
9 g11445.t1 SUPERFAMILY SSF56436 C-type lectin-like 437 603 5.02E-51
10 g11445.t1 SignalP_GRAM_POSITIVE SignalP-TM SignalP-TM 1 29 -
19 g11445.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 13 35 -

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 predicts a disordered region.
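
The 0.5 cutoff can be applied to a per-residue score list to recover disordered segments. This is a minimal sketch, assuming one score per residue and a strict "greater than" comparison. The function name `disordered_segments` and the example scores are made up for illustration and are not IUPred3 output for g11445.t1.

```python
def disordered_segments(scores, threshold=0.5):
    """Return 1-based inclusive (start, end) runs where score > threshold."""
    segments = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # open a new disordered run
        elif start is not None:
            segments.append((start, pos - 1))  # close the current run
            start = None
    if start is not None:
        segments.append((start, len(scores)))  # run extends to the last residue
    return segments


# Hypothetical scores, for illustration only.
example_scores = [0.2, 0.6, 0.7, 0.4, 0.55, 0.9]
print(disordered_segments(example_scores))  # prints [(2, 3), (5, 6)]
```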

GO terms from InterProScan

There are no GO annotations for this transcript.

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are shown as mean ± standard deviation.
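
As a minimal sketch of how such a summary could be computed from replicate TPM values: the page does not say whether the sample or the population standard deviation is used, so the sample form (`statistics.stdev`) is an assumption here. The function name `summarize_tpm` and the replicate values are hypothetical.

```python
from statistics import mean, stdev


def summarize_tpm(replicates):
    """Format replicate TPM values as 'mean +/- sample standard deviation'."""
    m = mean(replicates)
    # stdev() needs at least two values; report 0.0 for a single replicate.
    s = stdev(replicates) if len(replicates) > 1 else 0.0
    return f"{m:.2f} +/- {s:.2f}"


# Hypothetical replicate TPM values, not data for g11445.t1.
print(summarize_tpm([10.0, 12.0, 14.0]))  # prints "12.00 +/- 2.00"
```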

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. A transcript was called differentially expressed when both (1) FDR < 0.05 and (2) fold change > 2 held (TPM calculated by RSEM). DE status and fold change between conditions are shown in the plot below.
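
The two criteria can be written as a simple filter. This is a sketch under stated assumptions, not the pipeline used for this page: it assumes the fold change is supplied on a log2 scale (as DESeq2 reports it) and that "fold change > 2" applies in either direction, that is, to both up- and down-regulation. The function name and parameter names are hypothetical.

```python
import math


def is_differentially_expressed(fdr, log2_fold_change,
                                fdr_cutoff=0.05, min_fold_change=2.0):
    """Apply both criteria: FDR below cutoff and |fold change| above minimum.

    Assumption: the fold-change threshold is symmetric, so a log2 fold
    change of -1.5 passes just as +1.5 does.
    """
    passes_fdr = fdr < fdr_cutoff
    passes_fc = abs(log2_fold_change) > math.log2(min_fold_change)
    return passes_fdr and passes_fc


# Hypothetical values, for illustration only.
print(is_differentially_expressed(0.01, 1.5))   # prints True
print(is_differentially_expressed(0.01, 0.5))   # prints False (fold change too small)
print(is_differentially_expressed(0.20, 3.0))   # prints False (FDR too high)
```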

Raw TPM values