Gene loci information

Transcript annotation

  • This transcript has been annotated as Armadillo segment polarity protein.

Parent gene

Gene structure

  • The exon-intron structure of all isoforms are indicated below. CDS regions are colored in green. TSS and TTs that were predicted with CTR-Seq data are indicated in solid circle and squares, respectively. More specific data are shown in the table below.

Chromosome Gene Transcript Category ID Start End
chr_1 g13374 g13374.t3 TSS g13374.t3 29585796 29585796
chr_1 g13374 g13374.t3 isoform g13374.t3 29585811 29590734
chr_1 g13374 g13374.t3 exon g13374.t3.exon1 29585811 29585898
chr_1 g13374 g13374.t3 exon g13374.t3.exon2 29586542 29586625
chr_1 g13374 g13374.t3 cds g13374.t3.CDS1 29586595 29586625
chr_1 g13374 g13374.t3 exon g13374.t3.exon3 29587360 29589002
chr_1 g13374 g13374.t3 cds g13374.t3.CDS2 29587360 29589002
chr_1 g13374 g13374.t3 exon g13374.t3.exon4 29589061 29589264
chr_1 g13374 g13374.t3 cds g13374.t3.CDS3 29589061 29589264
chr_1 g13374 g13374.t3 exon g13374.t3.exon5 29589326 29589431
chr_1 g13374 g13374.t3 cds g13374.t3.CDS4 29589326 29589431
chr_1 g13374 g13374.t3 exon g13374.t3.exon6 29589498 29589664
chr_1 g13374 g13374.t3 cds g13374.t3.CDS5 29589498 29589664
chr_1 g13374 g13374.t3 exon g13374.t3.exon7 29589857 29589935
chr_1 g13374 g13374.t3 cds g13374.t3.CDS6 29589857 29589935
chr_1 g13374 g13374.t3 exon g13374.t3.exon8 29590481 29590734
chr_1 g13374 g13374.t3 cds g13374.t3.CDS7 29590481 29590734
chr_1 g13374 g13374.t3 TTS g13374.t3 29590871 29590871

Sequences

>g13374.t3 Gene=g13374 Length=2625
ATTTTGTAGTGAGCAGCCATTACTCAAAAAAACATTTTTGATTATAGACGAATAAAATAA
TCTTTAAGGTGACAAATATATAAAAACGGTGTGAAAGGAGTAATTTGATCTCTGTCGCCC
GTTAAATTTTCGCAACACACAATGAGCTATCAAATGTCACAAAATCGCACGATGTCTCAC
AATCCTTATCAAAATTCCGAAATGGCAATGGGAAAGGATCAACAAACGTTGATGTGGCAA
CAAAATTCATACTTGGCTGGCGGTGATTCTGGCATTCAATCGGGTGCAATCACTCAAGTA
CCATCATTGAGCGGTAAAGATGATGATGAAATGGGAGGTGGTGATGATACACTTATGTTT
GATCTTGATCAAGGTTTCAATCAAAATTATACACAAGATCAAGTTGATGATATGAATCAG
CAGTTAAATCAAACTCGTAGTCAACGAGTTCGTGCTGCAATGTTTCCAGAAACTTTGGAA
GAAGGCATAGAAATTCCATCGACACAATTTGATCCACAACAACCAACTGCTGTTCAGCGG
TTGTCTGAACCTTCACAAATGTTAAAACACGCTGTTGTTAATTTGATTAATTATCAAGAT
GATGCAGATTTGGCCACACGTGCCATTCCTGAATTGATTAAATTACTTAATGATGAAGAT
CAAGTTGTTGTCTCACAAGCTGCAATGATGGTACATCAATTATCTAAGAAAGAAGCTTCA
CGTCATGCCATTATGAACAGTCCACAAATGGTTGCTGCTTTGGTTCGTGCATTGTCCACC
AGCAATGACTTGGAAACAACAAAGGGAGCTGTCGGAACCCTTCACAATTTATCACACCAT
CGTCAAGGTCTTTTGGCTATTTTCAAAAGTGGTGGCATTCCTGCATTGGTCAAGTTGTTG
TCATCACCAGTTGAAAGTGTTTTGTTCTATGCAATCACAACATTACACAATCTTTTGCTT
CATCAAGATGGTTCAAAAATGGCTGTTCGTTTAGCTGGAGGCTTGCAAAAGATGGTCGCA
TTATTACAGCGCAATAACGTAAAGTTCCTCGCCATCGTTACAGACTGCTTGCAAATTCTA
GCTTATGGCAATCAAGAAAGCAAGTTGATTATTTTAGCTTCCACTGGACCAATTGAGTTG
GTTCGTATTATGCGTTCATATGACTATGAGAAGTTATTGTGGACAACATCACGTGTATTG
AAAGTTCTATCAGTCTGCTCTAGTAACAAACCAGCAATTGTTGAAGCTGGTGGCATGCAA
GCATTAGCTATGCATTTGGGAAATCCATCACAACGTTTGGTTCAAAATTGCTTATGGACA
TTGAGAAATTTATCAGATGCAGCAACAAAGGTCGATGGTTTGGAGAATTTATTGCAAGGA
CTTGTTCATGTACTCGCCAGTTCAGATGTCAATGTTGTCACATGCGCTGCTGGAATTTTG
TCAAATTTGACATGCAATAATCAGCGTAATAAAGTGACAGTGTGCCAAGTTGGTGGCGTT
GAAGCGCTCGTTCGAACAATTATTAATGCTGGTGATCGTGAAGAAATTACTGAACCAGCA
GTTTGTGCATTGCGTCATTTGACATCACGTCATCAAGAATCGGATGCAGCTCAGAATTTG
GTTCGACAAAACTATGGATTACCAGTGATTGTAAAGCTTTTGCATCCACCATCACGCTGG
CCGTTAGTTAAGGCCGTTATTGGTCTCATTCGTAATTTGGCAATATGTCCGGCAAATTCA
ACACCATTACGTGAACATGGTGCAATTCATCATCTTGTTCGATTGCTTATTCGCGCTTTC
CAAGATACACAACGTCAACGATCATCAGTGGCAACAAGTGGTTCACAACAACCAGGTGCA
TATGCTGATGGTGTAAGAATGGAAGAAATTGTAGAAGGCACAGTCGGTGCTTTGCATATT
CTCTCAAAGGATGAATACAATCGTCAATTGATTCGTCAACAAAATGTCATTCCTGTATTC
GTTCAATTGCTCTTCTATAATGACATTGAAAACATTCAGCGAGTCGCTGCAGGAGTATTG
TGTGAATTAGCAGTTGATAAAGAAGTGGCAGAATTGATAGAACAAGAAGGAGCAACAGCT
CCATTAACTGAATTATTAAATTCAGCTAATGAGGGTGTTGCGACATACGCAGCTGCCGTG
CTCTTCAAAATGAGCGAAGATAAATCTTTGGATTATAAGAAACGATTCTCAAGCGAGCTT
ACGACTTTACCAGTATTTCGTGATGATCAAATGTGGAATAATGGCGACTTGACAATTGGA
CCAGATCTACAGGATATTTTGGCACCAGATCAAGCCTATGAGGGTCTATATGGACAAAAT
CCCAACGCAAATGGTCGAGCATATCAACAAGGATATGATACATTACCAATCGATTCAATG
CAAGGTCTAGAAATTGGTGGTGGCCAGCAGCAGCAAAATTTAATGGGTGGCATGAATACC
GGACCGCCTACATCGCCTAATATGATGGATATGGAATGTGTTATTGGCGAAATGGATGCC
AGTGAATTGACATTCCAACATCATTTGGGAAATGAATCAATGATGCCATCACCGCCTGCC
AATGACAACTCGCAGGTTGCTGCCTGGTATGATACCGACTTGTAA

>g13374.t3 Gene=g13374 Length=827
MSYQMSQNRTMSHNPYQNSEMAMGKDQQTLMWQQNSYLAGGDSGIQSGAITQVPSLSGKD
DDEMGGGDDTLMFDLDQGFNQNYTQDQVDDMNQQLNQTRSQRVRAAMFPETLEEGIEIPS
TQFDPQQPTAVQRLSEPSQMLKHAVVNLINYQDDADLATRAIPELIKLLNDEDQVVVSQA
AMMVHQLSKKEASRHAIMNSPQMVAALVRALSTSNDLETTKGAVGTLHNLSHHRQGLLAI
FKSGGIPALVKLLSSPVESVLFYAITTLHNLLLHQDGSKMAVRLAGGLQKMVALLQRNNV
KFLAIVTDCLQILAYGNQESKLIILASTGPIELVRIMRSYDYEKLLWTTSRVLKVLSVCS
SNKPAIVEAGGMQALAMHLGNPSQRLVQNCLWTLRNLSDAATKVDGLENLLQGLVHVLAS
SDVNVVTCAAGILSNLTCNNQRNKVTVCQVGGVEALVRTIINAGDREEITEPAVCALRHL
TSRHQESDAAQNLVRQNYGLPVIVKLLHPPSRWPLVKAVIGLIRNLAICPANSTPLREHG
AIHHLVRLLIRAFQDTQRQRSSVATSGSQQPGAYADGVRMEEIVEGTVGALHILSKDEYN
RQLIRQQNVIPVFVQLLFYNDIENIQRVAAGVLCELAVDKEVAELIEQEGATAPLTELLN
SANEGVATYAAAVLFKMSEDKSLDYKKRFSSELTTLPVFRDDQMWNNGDLTIGPDLQDIL
APDQAYEGLYGQNPNANGRAYQQGYDTLPIDSMQGLEIGGGQQQQNLMGGMNTGPPTSPN
MMDMECVIGEMDASELTFQHHLGNESMMPSPPANDNSQVAAWYDTDL

Protein features from InterProScan

Transcript Database ID Name Start End E.value
29 g13374.t3 Gene3D G3DSA:1.25.10.10 - 142 702 0.0000e+00
5 g13374.t3 PANTHER PTHR45976 ARMADILLO SEGMENT POLARITY PROTEIN 26 827 0.0000e+00
6 g13374.t3 PANTHER PTHR45976:SF1 ARMADILLO SEGMENT POLARITY PROTEIN 26 827 0.0000e+00
8 g13374.t3 PRINTS PR01869 Beta-catenin family signature 96 116 0.0000e+00
11 g13374.t3 PRINTS PR01869 Beta-catenin family signature 129 143 0.0000e+00
7 g13374.t3 PRINTS PR01869 Beta-catenin family signature 180 201 0.0000e+00
13 g13374.t3 PRINTS PR01869 Beta-catenin family signature 244 263 0.0000e+00
10 g13374.t3 PRINTS PR01869 Beta-catenin family signature 300 322 0.0000e+00
15 g13374.t3 PRINTS PR01869 Beta-catenin family signature 330 354 0.0000e+00
12 g13374.t3 PRINTS PR01869 Beta-catenin family signature 454 479 0.0000e+00
9 g13374.t3 PRINTS PR01869 Beta-catenin family signature 498 519 0.0000e+00
14 g13374.t3 PRINTS PR01869 Beta-catenin family signature 620 642 0.0000e+00
2 g13374.t3 Pfam PF00514 Armadillo/beta-catenin-like repeat 239 271 4.0000e-07
4 g13374.t3 Pfam PF00514 Armadillo/beta-catenin-like repeat 360 399 0.0000e+00
3 g13374.t3 Pfam PF00514 Armadillo/beta-catenin-like repeat 440 482 8.4000e-06
1 g13374.t3 Pfam PF00514 Armadillo/beta-catenin-like repeat 599 637 3.0000e-07
36 g13374.t3 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 160 200 1.1287e+01
37 g13374.t3 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 202 245 1.3877e+01
33 g13374.t3 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 244 286 1.4717e+01
32 g13374.t3 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 286 328 9.1520e+00
31 g13374.t3 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 328 371 8.8370e+00
35 g13374.t3 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 409 451 1.2547e+01
38 g13374.t3 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 451 489 1.3212e+01
30 g13374.t3 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 498 541 1.0237e+01
34 g13374.t3 ProSiteProfiles PS50176 Armadillo/plakoglobin ARM repeat profile. 608 651 1.1567e+01
17 g13374.t3 SMART SM00185 arm_5 150 189 8.4000e+00
22 g13374.t3 SMART SM00185 arm_5 190 232 3.1000e-01
28 g13374.t3 SMART SM00185 arm_5 233 273 7.9000e-05
21 g13374.t3 SMART SM00185 arm_5 274 315 4.6000e-02
23 g13374.t3 SMART SM00185 arm_5 317 358 8.9000e+01
25 g13374.t3 SMART SM00185 arm_5 359 399 1.0000e-07
26 g13374.t3 SMART SM00185 arm_5 400 438 1.6000e+02
27 g13374.t3 SMART SM00185 arm_5 439 482 8.3000e-05
20 g13374.t3 SMART SM00185 arm_5 486 528 2.8000e-03
19 g13374.t3 SMART SM00185 arm_5 529 596 3.6000e-03
24 g13374.t3 SMART SM00185 arm_5 597 638 2.1000e-06
18 g13374.t3 SMART SM00185 arm_5 639 679 1.6000e+00
16 g13374.t3 SUPERFAMILY SSF48371 ARM repeat 144 679 0.0000e+00

Transmembrane regions from TMHMM

Disordered region

IUPRED3 score over 0.5 is predictive of a disordered region.

GO terms from InterProScan

GOID TERM ONTOLOGY
GO:0007155 cell adhesion BP
GO:0005515 protein binding MF
GO:0045296 cadherin binding MF

KEGG

Orthology

Pathway

  • This transcript belongs to the following pathways

Expression

Transcript expression in Pv11 cells

TPM values are indicated as average +/- STDEV.

Differential expression

Differentially expressed genes were identified with DESeq2 using the ‘run_DE_analysis.pl’ script from Trinity. Transcripts were determined as differentially expressed when (1) FDR < 0.05 (2) fold change > 2 (TPM calculated by RSEM). DE information and fold change between conditions are indicated in the plot below.

Raw TPM values