Gene loci information

Transcript annotation

  • This transcript has been annotated as Armadillo segment polarity protein.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g13374 g13374.t1 TSS g13374.t1 29585796 29585796
chr_1 g13374 g13374.t1 isoform g13374.t1 29586607 29590734
chr_1 g13374 g13374.t1 exon g13374.t1.exon1 29586607 29586625
chr_1 g13374 g13374.t1 cds g13374.t1.CDS1 29586607 29586625
chr_1 g13374 g13374.t1 exon g13374.t1.exon2 29587360 29589002
chr_1 g13374 g13374.t1 cds g13374.t1.CDS2 29587360 29589002
chr_1 g13374 g13374.t1 exon g13374.t1.exon3 29589061 29589264
chr_1 g13374 g13374.t1 cds g13374.t1.CDS3 29589061 29589264
chr_1 g13374 g13374.t1 exon g13374.t1.exon4 29589326 29589431
chr_1 g13374 g13374.t1 cds g13374.t1.CDS4 29589326 29589431
chr_1 g13374 g13374.t1 exon g13374.t1.exon5 29589498 29589664
chr_1 g13374 g13374.t1 cds g13374.t1.CDS5 29589498 29589664
chr_1 g13374 g13374.t1 exon g13374.t1.exon6 29589857 29589935
chr_1 g13374 g13374.t1 cds g13374.t1.CDS6 29589857 29589935
chr_1 g13374 g13374.t1 exon g13374.t1.exon7 29590481 29590734
chr_1 g13374 g13374.t1 cds g13374.t1.CDS7 29590481 29590734
chr_1 g13374 g13374.t1 TTS g13374.t1 29590871 29590871

Sequences

>g13374.t1 Gene=g13374 Length=2472
ATGTCACAAAATCGCACGATGTCTCACAATCCTTATCAAAATTCCGAAATGGCAATGGGA
AAGGATCAACAAACGTTGATGTGGCAACAAAATTCATACTTGGCTGGCGGTGATTCTGGC
ATTCAATCGGGTGCAATCACTCAAGTACCATCATTGAGCGGTAAAGATGATGATGAAATG
GGAGGTGGTGATGATACACTTATGTTTGATCTTGATCAAGGTTTCAATCAAAATTATACA
CAAGATCAAGTTGATGATATGAATCAGCAGTTAAATCAAACTCGTAGTCAACGAGTTCGT
GCTGCAATGTTTCCAGAAACTTTGGAAGAAGGCATAGAAATTCCATCGACACAATTTGAT
CCACAACAACCAACTGCTGTTCAGCGGTTGTCTGAACCTTCACAAATGTTAAAACACGCT
GTTGTTAATTTGATTAATTATCAAGATGATGCAGATTTGGCCACACGTGCCATTCCTGAA
TTGATTAAATTACTTAATGATGAAGATCAAGTTGTTGTCTCACAAGCTGCAATGATGGTA
CATCAATTATCTAAGAAAGAAGCTTCACGTCATGCCATTATGAACAGTCCACAAATGGTT
GCTGCTTTGGTTCGTGCATTGTCCACCAGCAATGACTTGGAAACAACAAAGGGAGCTGTC
GGAACCCTTCACAATTTATCACACCATCGTCAAGGTCTTTTGGCTATTTTCAAAAGTGGT
GGCATTCCTGCATTGGTCAAGTTGTTGTCATCACCAGTTGAAAGTGTTTTGTTCTATGCA
ATCACAACATTACACAATCTTTTGCTTCATCAAGATGGTTCAAAAATGGCTGTTCGTTTA
GCTGGAGGCTTGCAAAAGATGGTCGCATTATTACAGCGCAATAACGTAAAGTTCCTCGCC
ATCGTTACAGACTGCTTGCAAATTCTAGCTTATGGCAATCAAGAAAGCAAGTTGATTATT
TTAGCTTCCACTGGACCAATTGAGTTGGTTCGTATTATGCGTTCATATGACTATGAGAAG
TTATTGTGGACAACATCACGTGTATTGAAAGTTCTATCAGTCTGCTCTAGTAACAAACCA
GCAATTGTTGAAGCTGGTGGCATGCAAGCATTAGCTATGCATTTGGGAAATCCATCACAA
CGTTTGGTTCAAAATTGCTTATGGACATTGAGAAATTTATCAGATGCAGCAACAAAGGTC
GATGGTTTGGAGAATTTATTGCAAGGACTTGTTCATGTACTCGCCAGTTCAGATGTCAAT
GTTGTCACATGCGCTGCTGGAATTTTGTCAAATTTGACATGCAATAATCAGCGTAATAAA
GTGACAGTGTGCCAAGTTGGTGGCGTTGAAGCGCTCGTTCGAACAATTATTAATGCTGGT
GATCGTGAAGAAATTACTGAACCAGCAGTTTGTGCATTGCGTCATTTGACATCACGTCAT
CAAGAATCGGATGCAGCTCAGAATTTGGTTCGACAAAACTATGGATTACCAGTGATTGTA
AAGCTTTTGCATCCACCATCACGCTGGCCGTTAGTTAAGGCCGTTATTGGTCTCATTCGT
AATTTGGCAATATGTCCGGCAAATTCAACACCATTACGTGAACATGGTGCAATTCATCAT
CTTGTTCGATTGCTTATTCGCGCTTTCCAAGATACACAACGTCAACGATCATCAGTGGCA
ACAAGTGGTTCACAACAACCAGGTGCATATGCTGATGGTGTAAGAATGGAAGAAATTGTA
GAAGGCACAGTCGGTGCTTTGCATATTCTCTCAAAGGATGAATACAATCGTCAATTGATT
CGTCAACAAAATGTCATTCCTGTATTCGTTCAATTGCTCTTCTATAATGACATTGAAAAC
ATTCAGCGAGTCGCTGCAGGAGTATTGTGTGAATTAGCAGTTGATAAAGAAGTGGCAGAA
TTGATAGAACAAGAAGGAGCAACAGCTCCATTAACTGAATTATTAAATTCAGCTAATGAG
GGTGTTGCGACATACGCAGCTGCCGTGCTCTTCAAAATGAGCGAAGATAAATCTTTGGAT
TATAAGAAACGATTCTCAAGCGAGCTTACGACTTTACCAGTATTTCGTGATGATCAAATG
TGGAATAATGGCGACTTGACAATTGGACCAGATCTACAGGATATTTTGGCACCAGATCAA
GCCTATGAGGGTCTATATGGACAAAATCCCAACGCAAATGGTCGAGCATATCAACAAGGA
TATGATACATTACCAATCGATTCAATGCAAGGTCTAGAAATTGGTGGTGGCCAGCAGCAG
CAAAATTTAATGGGTGGCATGAATACCGGACCGCCTACATCGCCTAATATGATGGATATG
GAATGTGTTATTGGCGAAATGGATGCCAGTGAATTGACATTCCAACATCATTTGGGAAAT
GAATCAATGATGCCATCACCGCCTGCCAATGACAACTCGCAGGTTGCTGCCTGGTATGAT
ACCGACTTGTAA

>g13374.t1 Gene=g13374 Length=823
MSQNRTMSHNPYQNSEMAMGKDQQTLMWQQNSYLAGGDSGIQSGAITQVPSLSGKDDDEM
GGGDDTLMFDLDQGFNQNYTQDQVDDMNQQLNQTRSQRVRAAMFPETLEEGIEIPSTQFD
PQQPTAVQRLSEPSQMLKHAVVNLINYQDDADLATRAIPELIKLLNDEDQVVVSQAAMMV
HQLSKKEASRHAIMNSPQMVAALVRALSTSNDLETTKGAVGTLHNLSHHRQGLLAIFKSG
GIPALVKLLSSPVESVLFYAITTLHNLLLHQDGSKMAVRLAGGLQKMVALLQRNNVKFLA
IVTDCLQILAYGNQESKLIILASTGPIELVRIMRSYDYEKLLWTTSRVLKVLSVCSSNKP
AIVEAGGMQALAMHLGNPSQRLVQNCLWTLRNLSDAATKVDGLENLLQGLVHVLASSDVN
VVTCAAGILSNLTCNNQRNKVTVCQVGGVEALVRTIINAGDREEITEPAVCALRHLTSRH
QESDAAQNLVRQNYGLPVIVKLLHPPSRWPLVKAVIGLIRNLAICPANSTPLREHGAIHH
LVRLLIRAFQDTQRQRSSVATSGSQQPGAYADGVRMEEIVEGTVGALHILSKDEYNRQLI
RQQNVIPVFVQLLFYNDIENIQRVAAGVLCELAVDKEVAELIEQEGATAPLTELLNSANE
GVATYAAAVLFKMSEDKSLDYKKRFSSELTTLPVFRDDQMWNNGDLTIGPDLQDILAPDQ
AYEGLYGQNPNANGRAYQQGYDTLPIDSMQGLEIGGGQQQQNLMGGMNTGPPTSPNMMDM
ECVIGEMDASELTFQHHLGNESMMPSPPANDNSQVAAWYDTDL

Protein features from InterProScan

Transcript Database ID Name Start End E.value
29 g13374.t1 Gene3D G3DSA:1.25.10.10 - 138 698 0.0000e+00
5 g13374.t1 PANTHER PTHR45976 ARMADILLO SEGMENT POLARITY PROTEIN 22 823 0.0000e+00
6 g13374.t1 PANTHER PTHR45976:SF1 ARMADILLO SEGMENT POLARITY PROTEIN 22 823 0.0000e+00
14 g13374.t1 PRINTS PR01869 Beta-catenin family signature 92 112 0.0000e+00
10 g13374.t1 PRINTS PR01869 Beta-catenin family signature 125 139 0.0000e+00
9 g13374.t1 PRINTS PR01869 Beta-catenin family signature 176 197 0.0000e+00
13 g13374.t1 PRINTS PR01869 Beta-catenin family signature 240 259 0.0000e+00
8 g13374.t1 PRINTS PR01869 Beta-catenin family signature 296 318 0.0000e+00
12 g13374.t1 PRINTS PR01869 Beta-catenin family signature 326 350 0.0000e+00
11 g13374.t1 PRINTS PR01869 Beta-catenin family signature 450 475 0.0000e+00
15 g13374.t1 PRINTS PR01869 Beta-catenin family signature 494 515 0.0000e+00
7 g13374.t1 PRINTS PR01869 Beta-catenin family signature 616 638 0.0000e+00
2 g13374.t1 Pfam PF00514 Armadillo/beta-catenin-like repeat 235 267 4.0000e-07
1 g13374.t1 Pfam PF00514 Armadillo/beta-catenin-like repeat 356 395 0.0000e+00
3 g13374.t1 Pfam PF00514 Armadillo/beta-catenin-like repeat 436 478 8.3000e-06
4 g13374.t1 Pfam PF00514 Armadillo/beta-catenin-like repeat 595 633 3.0000e-07
34 g13374.t1 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 156 196 1.1287e+01
33 g13374.t1 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 198 241 1.3877e+01
30 g13374.t1 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 240 282 1.4717e+01
36 g13374.t1 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 282 324 9.1520e+00
37 g13374.t1 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 324 367 8.8370e+00
35 g13374.t1 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 405 447 1.2547e+01
32 g13374.t1 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 447 485 1.3212e+01
38 g13374.t1 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 494 537 1.0237e+01
31 g13374.t1 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 604 647 1.1567e+01
24 g13374.t1 SMART SM00185 arm_5 146 185 8.4000e+00
22 g13374.t1 SMART SM00185 arm_5 186 228 3.1000e-01
28 g13374.t1 SMART SM00185 arm_5 229 269 7.9000e-05
21 g13374.t1 SMART SM00185 arm_5 270 311 4.6000e-02
23 g13374.t1 SMART SM00185 arm_5 313 354 8.9000e+01
26 g13374.t1 SMART SM00185 arm_5 355 395 1.0000e-07
25 g13374.t1 SMART SM00185 arm_5 396 434 1.6000e+02
18 g13374.t1 SMART SM00185 arm_5 435 478 8.3000e-05
27 g13374.t1 SMART SM00185 arm_5 482 524 2.8000e-03
17 g13374.t1 SMART SM00185 arm_5 525 592 3.6000e-03
19 g13374.t1 SMART SM00185 arm_5 593 634 2.1000e-06
20 g13374.t1 SMART SM00185 arm_5 635 675 1.6000e+00
16 g13374.t1 SUPERFAMILY SSF48371 ARM repeat 140 675 0.0000e+00

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0007155 cell adhesion BP
GO:0005515 protein binding MF
GO:0045296 cadherin binding MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values