Analysis of p.Gly27Arg variant, CFTR gene, CF Transmembrane conductance Regulator protein (1480 residues)
Data provided and calculated by CYSMA must be considered as predictions.
They are meant for educational purposes only and are provided with NO WARRANTY with respect to their biological reliability.
The alignment does not show any divergent sequences.
The mutant residue cannot be found in the alignment.
Following species have a gap at the position:
Caenorhabditis elegans
The wild-type residue G27 is conserved at 98% among the CFTR orthologs
*AAPI: Alignment Average Percentage Identity
**AAPIR: Alignment Average Percentage Identity of the Region (20 residues surrounding position 27). AAPIR appears in green if it is more than 10% compared to AAPI, in red if less than 10%. Click here for more details on the alignment.
Divergencies show the amino acids which have been selected in the evolution.
If you find your variant among them with a high occurrence, there are good chances that your variant will most likely either have a small impact or no impact at all on the CFTR function.
Please note that CYSMA does not consider splicing alterations.
Refer to the Help page for more details.
CYSMA's visualizing modules for Ortholog conservation:
⬇ Download the region alignment (50 residues, Fasta format)
⬇ Download the CFTR phylogenic tree
|
|
27
|
|
|
Homo sapiens
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
V
D
S
A
D
N
L
S
E
K
Pan troglodytes
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
V
D
S
A
D
N
L
S
E
K
Pongo pygmaeus
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Gorilla gorilla
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
V
D
S
A
D
N
L
S
E
K
Nomascus leucogenys
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Macaca mulatta
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Macaca nemestrina
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Macaca fascicularis
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Papio anubis
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Callithrix jacchus
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Chlorocebus aethiops
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Colobus guereza
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
S
S
D
I
Y
Q
I
P
S
A
D
S
A
D
Y
L
S
E
K
Ateles geoffroyi
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
N
S
A
D
N
L
S
E
K
Plecturocebus moloch
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Saimiri boliviensis
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
V
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Aotus nancymaae
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Otolemur garnettii
R
S
P
L
E
K
A
S
V
F
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Microcebus murinus
R
S
P
L
E
K
A
S
V
L
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Vicugna pacos
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
Sus scrofa
R
S
P
L
E
K
A
S
I
F
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
Bos taurus
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
Muntiacus reevesi
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
Muntiacus muntjak
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
Ovis aries
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
Equus caballus
R
S
P
L
E
K
A
S
V
I
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Canis familiaris
R
S
P
L
E
K
A
S
V
L
S
K
L
F
F
S
W
T
R
P
I
L
I
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
V
P
S
T
D
S
A
D
H
L
S
E
K
Loxodonta africana
K
S
P
L
E
R
A
S
V
I
S
K
L
F
F
S
W
P
G
P
I
L
R
K
G
Y
R
Q
H
L
K
L
S
D
I
Y
Q
I
P
S
V
D
S
A
D
N
L
S
E
K
Mustela furo
R
S
P
L
E
K
A
S
V
L
S
K
L
F
F
S
W
T
R
P
I
L
T
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Oryctolagus cuniculus
K
S
P
L
E
K
A
G
V
L
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Atelerix albiventris
R
S
P
L
E
K
A
S
V
I
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
T
D
S
A
D
N
L
S
E
K
Dasypus novemcinctus
R
S
P
L
E
K
A
S
V
I
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Rhinolophus ferrumequinum
R
S
P
L
E
K
A
S
V
I
S
K
L
F
F
S
W
T
I
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
S
S
A
D
S
A
D
N
L
S
E
K
Cavia porcellus
K
K
P
Y
T
Q
F
F
I
S
M
K
I
P
F
S
W
T
R
P
I
L
K
K
G
Y
R
K
R
L
E
V
S
D
I
Y
Q
V
P
S
A
D
S
A
D
N
L
S
E
E
Monodelphis domestica
R
S
P
L
E
K
A
N
L
L
S
K
L
F
F
S
W
T
R
P
I
L
S
K
G
F
R
K
R
L
E
L
S
D
I
Y
Q
I
P
S
S
N
S
A
D
N
L
S
E
K
Ornithorhynchus anatinus
R
S
P
L
E
R
A
N
L
F
S
K
L
F
F
S
W
T
K
P
I
L
K
K
G
Y
R
Q
H
L
E
L
S
D
I
Y
Q
I
P
T
A
D
S
A
D
N
L
S
E
K
Didelphis virginiana
R
S
P
L
E
K
A
N
F
L
S
K
L
F
F
S
W
T
R
P
I
L
R
K
G
F
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
L
S
N
S
A
D
Y
L
S
E
N
Trichosurus vulpecula
R
S
P
L
E
K
A
N
V
F
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
F
R
R
R
L
E
L
S
D
I
Y
Q
I
P
S
C
N
S
A
D
H
L
S
E
K
Carollia perspicillata
R
S
P
L
E
K
A
S
V
V
S
K
L
F
F
S
W
T
R
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
I
Mus musculus
K
S
P
L
E
K
A
S
F
I
S
K
L
F
F
S
W
T
T
P
I
L
R
K
G
Y
R
H
H
L
E
L
S
D
I
Y
Q
A
P
S
A
D
S
A
D
H
L
S
E
K
Rattus norvegicus
K
S
P
L
E
K
A
S
F
I
S
K
L
F
F
S
W
T
T
P
I
L
R
K
G
Y
R
H
H
L
E
L
S
D
I
Y
Q
A
P
S
S
D
S
A
D
H
L
S
E
K
Gallus gallus
I
I
N
P
R
N
Q
S
S
L
F
Y
F
F
F
R
W
T
K
P
I
L
K
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Taeniopygia guttata
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
W
T
N
P
I
L
K
K
G
Y
R
R
R
L
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
Xenopus tropicalis
R
S
P
L
E
K
A
S
I
F
S
Q
I
F
F
S
W
T
K
P
I
L
W
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
H
P
G
D
S
A
D
N
L
S
E
R
Xenopus laevis
K
T
P
L
E
K
A
S
I
F
S
Q
I
F
F
S
W
T
K
P
I
L
W
K
G
Y
R
Q
R
L
E
L
S
D
I
Y
Q
I
H
P
G
D
S
A
D
N
L
S
E
R
Squalus acanthias
R
S
P
I
E
K
A
N
A
F
S
K
L
F
F
R
W
P
R
P
I
L
K
K
G
Y
R
Q
K
L
E
L
S
D
I
Y
Q
I
P
S
S
D
S
A
D
E
L
S
E
M
Danio rerio
R
S
P
V
E
D
A
N
C
L
S
R
Y
F
F
W
W
T
N
P
I
M
R
K
G
F
K
E
K
L
R
P
S
D
V
Y
Q
A
P
S
Q
D
A
A
D
I
L
A
E
R
Oryzias latipes
K
S
P
V
E
D
A
N
F
V
S
R
F
L
F
W
W
I
T
P
L
L
R
R
G
L
N
K
K
L
E
L
T
D
V
Y
K
A
P
S
F
D
L
A
D
T
L
S
E
R
Takifugu rubripes
K
S
P
V
E
D
A
N
F
L
S
K
F
F
F
W
W
T
S
P
L
L
R
K
G
F
K
K
K
L
E
L
S
D
V
Y
K
A
P
S
F
D
L
A
D
N
L
S
E
R
Tetraodon nigroviridis
K
S
P
V
E
D
A
N
F
L
S
K
F
F
F
W
W
T
S
P
L
L
R
K
G
F
R
K
K
L
E
L
S
D
V
Y
K
A
P
S
F
D
L
A
D
N
L
S
E
R
Caenorhabditis elegans
P
S
S
E
D
S
A
G
L
I
S
S
I
L
F
S
F
V
G
F
Y
W
W
R
-
T
R
N
A
Q
T
D
T
D
L
L
E
K
P
S
K
G
I
S
A
K
Y
A
A
Q
Species color legend (basic classification):
Great apes | Other monkeys | Prosimians | Other mammals | Lizards | Birds | Amphibians | Fishes | Insects | Nematods | Tunicates | Echinoderms
Ortholog sequences have been selected from the Ensembl(1) and
NCBI websites. Alignment has been performed with
ClustalW(2), version 1.83 or 2.0.7.
Trees have been built using Phylogeny.fr(3), based on the alignments.
Software used is PhyML 3.0 aLRT with default parameters. Pictures of trees have been made using Phylip at Mobyle.
AAPI and AAPIR have been calculated thanks to Bioperl.
Domain conservation:
The domain N-ter of CF Transmembrane conductance Regulator has been shown to interact with:
CF Transmembrane conductance Regulator - MSD1
CF Transmembrane conductance Regulator - MSD2
The residue p.Gly27 belongs to the domain N-ter.
1
68
N-terminal region is a cytosolic region, also called the "lasso motif" because of its shape. The first 40 residues are partially inserted into the membrane, while the end form the "lasso" helix (Zhang et al., 2016).
N-ter: N-terminal region is a cytosolic region, also called the "lasso motif" because of its shape. The first 40 residues are partially inserted into the membrane, while the end form the "lasso" helix (Zhang et al., 2016).
N-ter of CF Transmembrane conductance Regulator domain alignment including p.Gly27 residue.
***AAPID: Alignment Average Percentage Identity of the Domain (positions are indicated). !AAPIR: Alignment Average Percentage Identity of the Region (20 residues surrounding position 27). AAPIR appears in green if it is more than 10% compared to AAPID, in red if less than 10%.
Divergencies
Residues present in more than 10% of the sequences are highlighted in blue.
A - 12.20%
C - 3.25%
I - 0.81%
S - 3.25%
V - 1.63%
The wild-type residue G27 belongs to the N-ter domain and is conserved at 78.86% among the N-ter homologs (97 / 123 N-ter homologs)
The variant G27R has never been found among the N-ter homologs
Divergencies show the amino acids which have been selected in the evolution. Residues present in more than 10% of the sequences are highlighted in blue.
Please note that CYSMA does not consider splicing alterations.
Refer to the Help page for more details.
CYSMA's visualizing modules for N-ter domain conservation:
Sequence alignments for NBDs have all been extracted from Prosite(4). Sequence alignments for MSDs have been extracted using the PSI-BLAST web server.
Sequences alignments have been manually re-aligned using a structural alignment including the human CFTR and bacterian ABC transporters with know 3D structures (for MSDs and NBDs domains).
Predictions of secondary structures have been made with PsiPred(9)
, version 2.5, using Protein Multiple Sequences Alignments as input, in order to increase the accuracy of the prediction.
Amino acid frequencies have been calculated from a non redondant set defined by the RCSB.
The Help page will tell you more about it.
3D analysis:
Models provided and analysed by CYSMA must be considered as predictions, therefore be careful when interpreting the results. All efforts have been made to build structures of quality, however, they are provided with NO WARRANTY as to their accuracy with the real biological molecules studied.
Wild type and predicted mutant structures have been compared. You will find the results below.
Click on the MolProbity logo for complete details on the structure quality
This model is made of 37 α helices and of 20 β strands (and is mainly composed of helices (739 residues in helices against 102 in strands, for a total of 1480 amino acids)).
3D structures predicts G27 to be located in an α helice (which differs from PsiPred prediction) and R27 in an α helice.
Observed frequencies in α helices: G: 0.76 R: 1.11
Moreover, the residue is located in the last turn (C3) of this α helice (which contains 11 residues). C3 propensities of wild-type and mutant residues are 0.33 and 1.25. A potential Side chain interaction of the type i,i-4 has been detected between mutant residue and ILE23. This
interaction presents an attracting energy of -0.22 kcal/mol.
WARNING! The experimental 3D structure used for our predictions is the complete human CFTR structure which have been solved at a 3.7 Å resolution using cryo-electron microscopy (PDB: 5UAK; Liu et al. 2017). The overall resolution is fairly low so the CYSMA's 3D Automatic Annotation pipeline might have missed some important structural effects.
Replacement of a glycine is likely to rigidify the local structure
Solvent accessibility: the wild-type G27 and the mutant G27R are predicted to be buried
The two residues have a different polarity, which could interfere with hydrogen-bonding capabilities
The two residues have different charge properties, which could interfere with ionic bonds or potential inter- or intra-molecular interactions
Hydrogen bond network:
G27
R27
none
distance: 4.04 Å / angle: 2.02 rad between ARG 27 NE and THR 1036 O
The mutant residue is not predicted to introduce steric clashes
G27
R27
none
none
CYSMA's 3D visualizing module:
If you want to investigate further the structures, you can use
the JSmol applets of the wild-type (left) and mutant (right) structures.
Click on the JSmol applets' link to hide it.
You have a full access to Jmol commands with a simple right click on one applet.
JSmol Legends:
The residue at the position 27 is located in the center, labelled in yellow and surrounded by its neighboring residues (distance < 5 Å).
Van der Waals contacts with the residue 27 are represented by dotted lines.
Amino acids involved in H-bonds with the residue 27 are labelled in blue.
Amino acids involved in steric clashes with the residue 27 are labelled in red.
The overall structure of the complete human CFTR is represented in ribbon diagrams (click on the Reset button to visualize the overall CFTR structure). The membrane-spanning domain MSD1 is represented in blue and MSD2 in light blue.The nucleotide-binding domain NBD1 is represented in orange, NBD2 in light salmon.The lasso domain is shown in red and the R domain in green.
The 3D structures used in CYSMA are models based on the CFTR experimental 3D structure in the channel-closed conformation (PDB: 5UAK; resolution: 3.9 Å). In the wild-type model, the (missing) loops and the (missing) R domain were built de novo using the software Modeller. For the variant models, the point mutation (homology modelling) are made on the fly with Modeller (more).
Each structure has been assessed with MolProbity(19).
Msms(20) is used to calculate solvent accessibility, and STRIDE(21) (plus stride2pdb)
for secondary structure assignment.
Secondary structure analyses in 3D models uses side chain interaction energies reviewed in (23), as well as amino-acids propensities for N-caps, N1-N3, helix middle, C3-C1 and C-caps extracted from (24)(PDB values).
Structural properties are calculated using an in-house developped program based for the USMA's 3D Automatic Annotation pipeline.
Click on the LOVD picture to check if a variant is described at position 27
Graphical display of the region at NCBI (including SNPs)
CYSMA Report:
Report for p.Gly27Arg variant
CFTR orthologs conservation
The wild-type residue G27 is conserved at 98% among the CFTR orthologs
N-ter homologs conservation
The wild-type residue G27 belongs to the N-ter domain and is conserved at 78.86% among the N-ter homologs (97 / 123 N-ter homologs)
The variant G27R has never been found among the N-ter homologs
Structural effects
Replacement of a glycine is likely to rigidify the local structure
Solvent accessibility: the wild-type G27 and the mutant G27R are predicted to be buried
The two residues have a different polarity, which could interfere with hydrogen-bonding capabilities
The two residues have different charge properties, which could interfere with ionic bonds or potential inter- or intra-molecular interactions
The mutant residue is not predicted to introduce steric clashes