Gene loci information

Transcript annotation

  • This transcript has been annotated as Dorsal-ventral patterning protein Sog.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms is indicated below. CDS regions are colored in green. TSSs and TTSs that were predicted with CTR-Seq data are indicated by solid circles and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g11382 g11382.t1 isoform g11382.t1 15851255 15872037
chr_1 g11382 g11382.t1 exon g11382.t1.exon1 15851255 15851434
chr_1 g11382 g11382.t1 cds g11382.t1.CDS1 15851255 15851434
chr_1 g11382 g11382.t1 exon g11382.t1.exon2 15851494 15851694
chr_1 g11382 g11382.t1 cds g11382.t1.CDS2 15851494 15851694
chr_1 g11382 g11382.t1 exon g11382.t1.exon3 15851766 15851941
chr_1 g11382 g11382.t1 cds g11382.t1.CDS3 15851766 15851941
chr_1 g11382 g11382.t1 exon g11382.t1.exon4 15852015 15852095
chr_1 g11382 g11382.t1 cds g11382.t1.CDS4 15852015 15852095
chr_1 g11382 g11382.t1 exon g11382.t1.exon5 15852165 15853973
chr_1 g11382 g11382.t1 cds g11382.t1.CDS5 15852165 15853973
chr_1 g11382 g11382.t1 exon g11382.t1.exon6 15854035 15854094
chr_1 g11382 g11382.t1 cds g11382.t1.CDS6 15854035 15854094
chr_1 g11382 g11382.t1 exon g11382.t1.exon7 15860295 15860427
chr_1 g11382 g11382.t1 cds g11382.t1.CDS7 15860295 15860427
chr_1 g11382 g11382.t1 exon g11382.t1.exon8 15862776 15862822
chr_1 g11382 g11382.t1 cds g11382.t1.CDS8 15862776 15862822
chr_1 g11382 g11382.t1 exon g11382.t1.exon9 15862882 15862938
chr_1 g11382 g11382.t1 cds g11382.t1.CDS9 15862882 15862938
chr_1 g11382 g11382.t1 exon g11382.t1.exon10 15871908 15872037
chr_1 g11382 g11382.t1 cds g11382.t1.CDS10 15871908 15872037
chr_1 g11382 g11382.t1 TSS g11382.t1 NA NA
chr_1 g11382 g11382.t1 TTS g11382.t1 NA NA
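
The coordinates in the table above can be cross-checked against the reported transcript length. The following is a minimal sketch, assuming the Start and End columns are 1-based and inclusive (the page does not state this explicitly); the variable names are illustrative only. Under that assumption the ten exons sum to 2874 nt, which agrees with the Length value in the nucleotide FASTA header.

```python
# Exon coordinates copied from the gene structure table for g11382.t1.
# Assumption: coordinates are 1-based and inclusive.
EXONS = [
    (15851255, 15851434),
    (15851494, 15851694),
    (15851766, 15851941),
    (15852015, 15852095),
    (15852165, 15853973),
    (15854035, 15854094),
    (15860295, 15860427),
    (15862776, 15862822),
    (15862882, 15862938),
    (15871908, 15872037),
]

# Length of each exon (inclusive coordinates, hence the +1).
exon_lengths = [end - start + 1 for start, end in EXONS]

# Length of each intron: the gap between consecutive exons.
intron_lengths = [nxt[0] - cur[1] - 1 for cur, nxt in zip(EXONS, EXONS[1:])]

cds_length = sum(exon_lengths)
transcript_span = EXONS[-1][1] - EXONS[0][0] + 1

print("exon lengths:", exon_lengths)
print("intron lengths:", intron_lengths)
print("total exonic length:", cds_length)
print("genomic span:", transcript_span)
```

Every exon row has an identical CDS row, so in this annotation the exonic length and the CDS length are the same.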

Sequences

>g11382.t1 Gene=g11382 Length=2874
ATGGCAAGTTTCACGAGATATTATATCAAAATACTCCTTATGCTACTCATCATAGGATTA
GCAACGTGTGCACGATCAAAATCACCACTGCTTATCGAAGATGAAGGCAATCGACGAAAT
AGACCAGCAGAATGTCATTTTGGAAAAGAATTAAAAGAACTCGGATCAACATGGTTTCCT
GATTTAGGTGCACCATTCGGAAAAATGTATTGCATTAAATGCCAGTGCGTGCCAGTGCAA
AAGAAGCGAAGAATTGTTGCTCGAGTTCAATGTAGAAACATCAAAAACGAATGCCCAAAG
CCGACGTGTGAAGAGCCAATTCTGTTACCGGGTCGTTGTTGCAAAATCTGTCCAGGAGAT
TCAAGCCATCCTGACCTTTTACAAGATACACCGGTGACTCAATTAAATGAACCAGAAGAC
ATGAAACATTTTGGAGCTCTTCTTACGGGCAGAACATCAACGATGCTGAAACGAGAAGAA
ATGCTAAATAATTATCCTTCGAAAAATCCTCAGAATCTTGTTGCAACTGGTCGTTTTTCT
TTCCACAAGAAGAATTTATATTACTCATTTTATGTATCCGAGAAAGCAACCTCACGTCCA
AGTGCCATTCAGTTCATTGATAATAGTGGAAATATTTTAGAAGAGCATAGTCTAGTCACG
CTCAATTATGGTACAATTAGCAGCTATCAGAATGCCACTGGAAAGATTTGCGGCGTTTGG
AAACGTGTTCCTCGTGATTATCGACGGATTTTACGGGAAGATCAAATGAATGTTGTATTG
TTATGGGGAGATAAGCAGCAAGCTGAATTAGCATTAGCGGGTCCAATTAGCAAATATCCA
GCACTTTCTACTGAGTTATTTAGTTCTCTCTTAGAACCAGGTCCAAATACTCGACCTGAG
ATTATGGCTGGTGCTGGTGGTACAGCTATTGTGAGCACAAATAGCGGTGTTGCACCCTCC
ATCCATCTTACTTTGGTTCTCAATGGTCTCTTTGGTCCAGACGAAGTTGCTGATGTACCA
CTCAACATTCGTCTTGAAAGTGTTGAAAAGAAGCAAATTATAATTGAAGAAATTCAACGT
GTAAAAAAACCAAATCATGACATTAATGTTATTGAATTTTCTTCGCCTGTCTCTGTCTAT
GAATTGCGCATGTTGACACGAGGAAAATTACAGATTGTTGTTGAATCTAGAAAGAAACCT
GAAGCATTAAGGATTCAAGGAAGTGTTGTCACTCGTGTTGCATGTGAACTCTTTCAAACA
ATTCTATCATCTCATAATGCTGAATCAAAAACCAAATCAAGTGGTATGGCATGGCTTTAT
CTCAATAAAGAAGGCTCCCTTGTTTACAACATTCAGACACACAATCTTAATTTAGCCGAT
AATCCGCTTATAACTCTCACAAATGACAATGGTGGCAAAAAAAATACCGAGTTAGAAGAC
TTAACTTCATCATTGGATATGGATCGTGCAAATGGAATTGTTGATAAATTAGGGCCACGA
GTACTTGAACCACTGTATGCTGGAGATCTTGGTCTTAATGTTGCCACTAAAACTGAGACA
AGTCTCATTCGTGGAAAATTTTCTGTTCGACCCGTTACTGATGCTCGTGATGCTGAGGAA
CCTTATTTGATGAAAAGATTCAGCGATCAAGCACCATCGCAATCTGTTGGTATGGCATGG
ATTGCTGTAGATAATGATTGCAATTTACATTATGATGTTAGTTTGGCTGGAGTATCAAAT
CAATATCATCCCTTGCAATTATATTTAGTCGATATGCCAATGGAAGTTTATGGAGCACCT
GTTCATCGACGTTTACTTGAAGAATTCGCATCTAATCATTTAGAAGGATTTGTGCTTAGT
ATGTCAGCAGCTGATCTTGCCAAGCTTGAATCAAGTGTCAATTTCCTCGAAGTTGTATCA
AAAGAGCAAAATAATATTCTTAAAACAAAACTCAAATCAGTCAAAATTCCAAAGCAATGT
TATCCACCATCTGCAGAAAATGAAGTGGGAAACGTAGGCGATGATAATAGTAAACATCAA
TCATTTGATACAAAATGTTTCCATTCAAATCGTTTTTATGAAGATGGTGAACAATGGACA
AGTGCTATCGAGTCATGCACAGTTTGTGCATGTAATAATCGACGAGTAAAATGCGATCCC
ATAAAATGTCCACCATTAAAATGTAAAAAGGAAGAACAACAGCATAGAAAAGGCGATTGC
TGTCCTATATGTGTGGGAAAATATGTAGAAGAAACAAGCACCAATAACAACGCATCACCA
CGTGGATGTAAACTTGGTGATCAATTTTATAAAGCAGGTTCATCATGGCATCCATATTTG
CCTCCTAATGGATTCGATACGTGTGCTGTATGTACATGTGATGCGAATAACTTAGAAATC
TCTTGTCCAAGAGTTCAATGTCCTCCACTCAATTGCAGTGAAAAAGTTGCATATCGTCCC
GATAAAAAAGCATGCTGTAAGAGATGTCCAGATGTTAAACCTCCAAAAGATTTCAAAACT
AATACTGAGGAAATGAAAGATCAAGGTAGTAAAAGTGGAACACTCAATTCACCAACAGTT
ATTATGGCAAATGGAGGATGTAAATCACATATGGGATATCATACAAACGGTCAAGAATGG
CATCCTGTAATAGCATCACATGGAGAACAAAAATGTGTCAAATGTCGATGCAAGGATGGT
AACATTAATTGCGAAAAAAAGCGTTGTTCGCGTGCGTCCTGCCAGAATCAAAAGGGTGCG
AAAAAAGGACAAAGTGATGATGATTGCTGTCAATGTCGAGCAAGACGTCATCAGGCAGGA
CATCGAAAAAAACAAAAGCAGCAGCAACAACAGCAGGGAGGATCAAAGAGTTGA

>g11382.t1 Gene=g11382 Length=957
MASFTRYYIKILLMLLIIGLATCARSKSPLLIEDEGNRRNRPAECHFGKELKELGSTWFP
DLGAPFGKMYCIKCQCVPVQKKRRIVARVQCRNIKNECPKPTCEEPILLPGRCCKICPGD
SSHPDLLQDTPVTQLNEPEDMKHFGALLTGRTSTMLKREEMLNNYPSKNPQNLVATGRFS
FHKKNLYYSFYVSEKATSRPSAIQFIDNSGNILEEHSLVTLNYGTISSYQNATGKICGVW
KRVPRDYRRILREDQMNVVLLWGDKQQAELALAGPISKYPALSTELFSSLLEPGPNTRPE
IMAGAGGTAIVSTNSGVAPSIHLTLVLNGLFGPDEVADVPLNIRLESVEKKQIIIEEIQR
VKKPNHDINVIEFSSPVSVYELRMLTRGKLQIVVESRKKPEALRIQGSVVTRVACELFQT
ILSSHNAESKTKSSGMAWLYLNKEGSLVYNIQTHNLNLADNPLITLTNDNGGKKNTELED
LTSSLDMDRANGIVDKLGPRVLEPLYAGDLGLNVATKTETSLIRGKFSVRPVTDARDAEE
PYLMKRFSDQAPSQSVGMAWIAVDNDCNLHYDVSLAGVSNQYHPLQLYLVDMPMEVYGAP
VHRRLLEEFASNHLEGFVLSMSAADLAKLESSVNFLEVVSKEQNNILKTKLKSVKIPKQC
YPPSAENEVGNVGDDNSKHQSFDTKCFHSNRFYEDGEQWTSAIESCTVCACNNRRVKCDP
IKCPPLKCKKEEQQHRKGDCCPICVGKYVEETSTNNNASPRGCKLGDQFYKAGSSWHPYL
PPNGFDTCAVCTCDANNLEISCPRVQCPPLNCSEKVAYRPDKKACCKRCPDVKPPKDFKT
NTEEMKDQGSKSGTLNSPTVIMANGGCKSHMGYHTNGQEWHPVIASHGEQKCVKCRCKDG
NINCEKKRCSRASCQNQKGAKKGQSDDDCCQCRARRHQAGHRKKQKQQQQQQGGSKS
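
The two records above are related by translation: 2874 nt correspond to 957 codons plus one stop codon. The sketch below illustrates this with the standard genetic code, which is assumed here because the page does not name the translation table. Only the first 27 and the last 15 nucleotides of the CDS are used to keep the example short; the helper name `translate` is hypothetical.

```python
# Standard genetic code, in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    for pos in range(0, usable, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 27 nt and last 15 nt of the g11382.t1 CDS shown above.
cds_start = "ATGGCAAGTTTCACGAGATATTATATC"
cds_end = "GGATCAAAGAGTTGA"

print(translate(cds_start))  # N-terminal residues of the protein
print(translate(cds_end))    # C-terminal residues, stop codon excluded
```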

Protein features from InterProScan

Row Transcript Database ID Name Start End E.value
12 g11382.t1 Gene3D G3DSA:2.10.70.10 Complement Module 685 721 1.1E-5
29 g11382.t1 MobiDBLite mobidb-lite consensus disorder prediction 935 957 -
6 g11382.t1 PANTHER PTHR46526 CHORDIN 14 950 1.7E-274
18 g11382.t1 PIRSF PIRSF002496 Chordin 5 955 7.6E-275
3 g11382.t1 Pfam PF00093 von Willebrand factor type C domain 45 117 6.1E-8
1 g11382.t1 Pfam PF07452 CHRD domain 418 525 1.9E-5
4 g11382.t1 Pfam PF00093 von Willebrand factor type C domain 686 744 4.1E-12
2 g11382.t1 Pfam PF00093 von Willebrand factor type C domain 763 829 2.1E-10
5 g11382.t1 Pfam PF00093 von Willebrand factor type C domain 867 931 8.7E-8
14 g11382.t1 Phobius SIGNAL_PEPTIDE Signal peptide region 1 26 -
15 g11382.t1 Phobius SIGNAL_PEPTIDE_N_REGION N-terminal region of a signal peptide. 1 10 -
16 g11382.t1 Phobius SIGNAL_PEPTIDE_H_REGION Hydrophobic region of a signal peptide. 11 21 -
17 g11382.t1 Phobius SIGNAL_PEPTIDE_C_REGION C-terminal region of a signal peptide. 22 26 -
13 g11382.t1 Phobius NON_CYTOPLASMIC_DOMAIN Region of a membrane-bound protein predicted to be outside the membrane, in the extracellular region. 27 957 -
28 g11382.t1 ProSitePatterns PS01208 VWFC domain signature. 706 744 -
35 g11382.t1 ProSiteProfiles PS50933 CHRD domain profile. 140 281 18.003
32 g11382.t1 ProSiteProfiles PS50933 CHRD domain profile. 283 414 17.581
33 g11382.t1 ProSiteProfiles PS50933 CHRD domain profile. 417 532 10.682
34 g11382.t1 ProSiteProfiles PS50933 CHRD domain profile. 535 658 6.548
31 g11382.t1 ProSiteProfiles PS50184 VWFC domain profile. 684 745 11.564
30 g11382.t1 ProSiteProfiles PS50184 VWFC domain profile. 761 830 8.743
25 g11382.t1 SMART SM00214 vwc 45 117 0.11
23 g11382.t1 SMART SM00754 chrd_5 142 278 0.081
21 g11382.t1 SMART SM00754 chrd_5 285 411 3.7E-7
22 g11382.t1 SMART SM00754 chrd_5 416 529 3.2E-9
20 g11382.t1 SMART SM00754 chrd_5 538 648 7.3E-4
27 g11382.t1 SMART SM00214 vwc 686 744 1.1E-12
24 g11382.t1 SMART SM00214 vwc 763 829 7.3E-4
26 g11382.t1 SMART SM00214 vwc 867 932 0.17
9 g11382.t1 SUPERFAMILY SSF57603 FnI-like domain 42 120 3.14E-6
7 g11382.t1 SUPERFAMILY SSF57603 FnI-like domain 684 747 1.46E-12
8 g11382.t1 SUPERFAMILY SSF57603 FnI-like domain 761 831 1.78E-8
10 g11382.t1 SUPERFAMILY SSF57603 FnI-like domain 865 931 4.39E-5
11 g11382.t1 SignalP_EUK SignalP-noTM SignalP-noTM 1 23 -
19 g11382.t1 TMHMM TMhelix Region of a membrane-bound protein predicted to be embedded in the membrane. 7 24 -

Transmembrane regions from TMHMM

Disordered region

An IUPred3 score above 0.5 is predictive of a disordered region.
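
As an illustration of the 0.5 threshold, the sketch below turns a list of per-residue scores into contiguous predicted disordered regions. It is not part of the IUPred3 software; the function name `disordered_regions` and the example scores are hypothetical, and positions are reported as 1-based inclusive to match the feature tables on this page.

```python
def disordered_regions(scores, threshold=0.5):
    """Return (start, end) runs of residues whose score exceeds threshold.

    Positions are 1-based and inclusive.
    """
    regions = []
    start = None
    for pos, score in enumerate(scores, start=1):
        if score > threshold:
            if start is None:
                start = pos  # open a new region
        elif start is not None:
            regions.append((start, pos - 1))  # close the current region
            start = None
    if start is not None:
        regions.append((start, len(scores)))  # region runs to the end
    return regions


# Hypothetical scores, not taken from this transcript.
example_scores = [0.1, 0.6, 0.7, 0.4, 0.9]
print(disordered_regions(example_scores))
```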

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0005515 protein binding MF
GO:0007275 multicellular organism development BP

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as mean +/- standard deviation.
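
A minimal sketch of how such a summary could be computed from replicate TPM values. The replicate values are hypothetical, and the use of the sample standard deviation (n - 1 denominator) is an assumption, since the page does not say which estimator was used.

```python
import statistics


def summarize_tpm(replicates):
    """Format replicate TPM values as 'mean +/- standard deviation'."""
    mean = statistics.mean(replicates)
    # Assumption: sample standard deviation; needs at least two replicates.
    sd = statistics.stdev(replicates)
    return f"{mean:.2f} +/- {sd:.2f}"


# Hypothetical replicate TPM values for one condition.
print(summarize_tpm([10.0, 12.0, 14.0]))
```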

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were considered differentially expressed when (1) FDR < 0.05 and (2) fold change > 2 (TPM calculated by RSEM). DE information and fold changes between conditions are indicated in the plot below.
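
The two criteria can be expressed as a simple filter. This is a sketch rather than the pipeline's actual code: the function name is hypothetical, and it assumes that "fold change > 2" applies in either direction (up- or down-regulation), which the text does not specify.

```python
import math


def is_differentially_expressed(fdr, fold_change,
                                fdr_cutoff=0.05, fc_cutoff=2.0):
    """Apply the FDR and fold-change thresholds described above.

    fold_change is a linear ratio between two conditions.
    Assumption: a change counts in either direction, so a ratio of
    0.25 is treated like a ratio of 4.
    """
    if fold_change <= 0:
        return False  # a ratio must be positive to take its logarithm
    log2_fc = abs(math.log2(fold_change))
    return fdr < fdr_cutoff and log2_fc > math.log2(fc_cutoff)


# Hypothetical (FDR, fold change) pairs.
for fdr, fc in [(0.01, 4.0), (0.01, 0.25), (0.20, 4.0), (0.01, 1.5)]:
    print(fdr, fc, is_differentially_expressed(fdr, fc))
```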

Raw TPM values