RaacLogo

A new sequence logo generator by using reduced amino acid clusters

Case 1

Lysozyme C (enzyme)

With a bacteriolytic function, lysozyme C (LYC) is ubiquitous from chickens to humans. we collected 21 lysozyme C (LYC) aligned sequences from chickens to humans. Then, after RaacLogo treatment (polar/Neutrall/hydrophobicity, RKEDQN, GASTPHY, CLVIMFW), the reduced letters make the logo very neat and simple, reflecting the high sequence homology. Although the amino acid coding of LYC changed greatly in the evolution from chicken to human, we can see that the overall amino acid properties of LYC remained unchanged after reduction, and the homologous function of LYC was maintained.

Parameter RaacLogo Type 54
Cluster RKEDQNGASTPHYCLVIMFW
PolarNeutralHydrophobicity
Edit RaacLogo Analysis
01234bits5101520253035404550556065707580859095100105110115120125

Cluster: RKEDQNGASTPHYCLVIMFW

01234bits5101520253035404550556065707580859095100105110115120125

01234bits5101520253035404550556065707580859095100105110115120125

Case 2

CXXC domain (DNA-binding protein)

The typical CXXC domain is rich in basic amino acids and highly conserved among species and combines with a CpG sequence. It has seven amino acids that bind to DNA and stabilizes the spatial conformation through two Zn2+ ions that coordinate with four cysteines (C4-type zinc finger). In the natural logo, the eight cysteines are extremely conserved, while the other amino acids are clearly differentiated. In the reduced logo, the overall effect is good but some jumbled amino acids are always divided into two groups: K and A. This means that the properties of these amino acid sites have changed dramatically. This may be the cause of functional differentiation of paralogs.

Parameter RaacLogo Type 54
Cluster KRANCQGHILMFPSTWYVDE
PositiveNeutralNegative (Charge)
Edit RaacLogo Analysis
01234bits5101520253035

Cluster: KRANCQGHILMFPSTWYVDE

01234bits5101520253035

01234bits5101520253035

Case 3

The double-stranded β-helix (DSBH) domain (enzyme)

Double-stranded b-helix (DSBH) domain is the catalytic core of demethylase, depending on Fe-2OG. It is the vital executor of the dynamic changes between methylation and demethylation in the plants and animals. Homologous peptides of 35 amino acids were selected to demonstrate the reduction of enzyme proteins by RaacLogo. The results showed that RAAC could effectively reduce the chaotic motif into a simple logo to reflect the conservatism of the paralogs.

Parameter RaacLogo Type 7
Cluster WCGPHNDERQKASTFYVMIL
HeterocyclicSulfydrylNo chiralOthersAromaticAliphatic
Edit RaacLogo Analysis
01234bits5101520253035

Cluster: WCGPHNDERQKASTFYVMIL

01234bits5101520253035

01234bits5101520253035

Case 4

POU protein (transcription factor)

The POU (Pit-Oct-Unc) protein family is an evolutionary ancient group of transcription factors (TFs) that bind specific DNA sequences to direct gene expression programs. We collected 30 POU aligned sequences and obtained the natural logo and Raaclogo. The results showed that various amino acid site differences could be effectively reduced by rational strategies. For example, 7-E/R/D/K is reduced to 7-P, and 65-L/V/I is reduced to 65-V.

Parameter RaacLogo Type 7
Cluster WCGPHNDERQKASTFYVMIL
HeterocyclicSulfydrylNo chiralOthersAromaticAliphatic
Edit RaacLogo Analysis
01234bits5101520253035404550556065707580859095100105110115120125130135140

Cluster: WCGPHNDERQKASTFYVMIL

01234bits5101520253035404550556065707580859095100105110115120125130135140

01234bits5101520253035404550556065707580859095100105110115120125130135140

Case 5

G protein coupled receptor (membrane protein)

G-protein-coupled receptors (GPCRs), one of the largest protein superfamilies, are key mediators linking extracellular ligands to downstream signals and are the most common targets for pharmaceutical drug development. The G protein family has a large number of members, and its sites and functions are extremely diverse. Therefore, the logo of its protein sequence appears disordered. After RaacLogo reduction, most of the sites became very conservative, which verified the law of homologous differentiation.

Parameter RaacLogo Type 7
Cluster WCGPHNDERQKASTFYVMIL
HeterocyclicSulfydrylNo chiralOthersAromaticAliphatic
Edit RaacLogo Analysis
01234bits5101520253035404550556065707580859095100105110115120125130135140145150155160165170175180185190195200205210215220225230235240245250255260265270275280285290295300305310315320325330335340345350355360365370375380385390395400405410415420425430435440445450455460465470475480485490495500505510515520525530535540545550555560565570575580585590595600605610615620625630635640645650655660665670675680685690695700705710715720725730735740745750755760765770775780785790795800

Cluster: WCGPHNDERQKASTFYVMIL

01234bits5101520253035404550556065707580859095100105110115120125130135140145150155160165170175180185190195200205210215220225230235240245250255260265270275280285290295300305310315320325330335340345350355360365370375380385390395400405410415420425430435440445450455460465470475480485490495500505510515520525530535540545550555560565570575580585590595600605610615620625630635640645650655660665670675680685690695700705710715720725730735740745750755760765770775780785790795800

01234bits5101520253035404550556065707580859095100105110115120125130135140145150155160165170175180185190195200205210215220225230235240245250255260265270275280285290295300305310315320325330335340345350355360365370375380385390395400405410415420425430435440445450455460465470475480485490495500505510515520525530535540545550555560565570575580585590595600605610615620625630635640645650655660665670675680685690695700705710715720725730735740745750755760765770775780785790795800