Analysis of p.Trp57Gly variant, CFTR gene, CF Transmembrane conductance Regulator protein (1480 residues)
Data provided and calculated by CYSMA must be considered as predictions.
They are meant for educational purposes only and are provided with NO WARRANTY with respect to their biological reliability.
The mutant residue cannot be found in the alignment.
There is no gap in the alignment.
The wild-type residue W57 is highly conserved among the CFTR orthologs: 98% (49 / 50 CFTR orthologs)The variant W57G has never been found among the CFTR orthologs
*AAPI: Alignment Average Percentage Identity
**AAPIR: Alignment Average Percentage Identity of the Region (20 residues surrounding position 57). AAPIR appears in green if it is more than 10% compared to AAPI, in red if less than 10%. Click here for more details on the alignment.
Divergencies show the amino acids which have been selected in the evolution.
If you find your variant among them with a high occurrence, there are good chances that your variant will most likely either have a small impact or no impact at all on the CFTR function.
Please note that CYSMA does not consider splicing alterations.
Refer to the Help page for more details.
CYSMA's visualizing modules for Ortholog conservation:
⬇ Download the region alignment (50 residues, Fasta format)
⬇ Download the CFTR phylogenic tree
|
|
57
|
|
|
Homo sapiens
E
L
S
D
I
Y
Q
I
P
S
V
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Pan troglodytes
E
L
S
D
I
Y
Q
I
P
S
V
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Pongo pygmaeus
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Gorilla gorilla
E
L
S
D
I
Y
Q
I
P
S
V
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Nomascus leucogenys
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Macaca mulatta
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Macaca nemestrina
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Macaca fascicularis
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Papio anubis
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Callithrix jacchus
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
T
Chlorocebus aethiops
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Colobus guereza
E
S
S
D
I
Y
Q
I
P
S
A
D
S
A
D
Y
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Ateles geoffroyi
E
L
S
D
I
Y
Q
I
P
S
A
N
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Plecturocebus moloch
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Saimiri boliviensis
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
T
Aotus nancymaae
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
T
Otolemur garnettii
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Microcebus murinus
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Vicugna pacos
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Sus scrofa
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Bos taurus
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Muntiacus reevesi
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Muntiacus muntjak
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Ovis aries
E
L
S
D
I
Y
H
I
S
S
S
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Equus caballus
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
V
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Canis familiaris
E
L
S
D
I
Y
Q
V
P
S
T
D
S
A
D
H
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
A
Loxodonta africana
K
L
S
D
I
Y
Q
I
P
S
V
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
V
Mustela furo
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Oryctolagus cuniculus
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
K
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Atelerix albiventris
E
L
S
D
I
Y
Q
I
P
S
T
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
K
F
M
Dasypus novemcinctus
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
T
Rhinolophus ferrumequinum
E
L
S
D
I
Y
Q
I
S
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
Q
R
C
F
F
W
R
F
T
Cavia porcellus
E
V
S
D
I
Y
Q
V
P
S
A
D
S
A
D
N
L
S
E
E
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
L
W
R
F
I
Monodelphis domestica
E
L
S
D
I
Y
Q
I
P
S
S
N
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
K
P
K
L
I
N
A
I
R
R
C
F
F
W
R
F
V
Ornithorhynchus anatinus
E
L
S
D
I
Y
Q
I
P
T
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Didelphis virginiana
E
L
S
D
I
Y
Q
I
P
L
S
N
S
A
D
Y
L
S
E
N
L
E
R
E
W
D
R
E
L
A
S
K
K
K
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
V
Trichosurus vulpecula
E
L
S
D
I
Y
Q
I
P
S
C
N
S
A
D
H
L
S
E
K
L
E
R
E
W
D
R
E
L
A
S
K
K
K
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
V
Carollia perspicillata
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
I
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
I
N
A
L
R
R
C
F
F
W
R
F
M
Mus musculus
E
L
S
D
I
Y
Q
A
P
S
A
D
S
A
D
H
L
S
E
K
L
E
R
E
W
D
R
E
Q
A
S
K
K
N
P
Q
L
I
H
A
L
R
R
C
F
F
W
R
F
L
Rattus norvegicus
E
L
S
D
I
Y
Q
A
P
S
S
D
S
A
D
H
L
S
E
K
L
E
R
E
W
D
R
E
Q
A
S
K
K
K
P
Q
L
I
H
A
L
R
R
C
F
V
W
R
F
V
Gallus gallus
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
T
K
K
K
P
K
L
I
N
A
L
R
R
C
F
F
W
K
F
M
Taeniopygia guttata
E
L
S
D
I
Y
Q
I
P
S
A
D
S
A
D
N
L
S
E
K
L
E
R
E
W
D
R
E
L
A
T
E
K
K
P
K
L
I
N
A
L
R
R
C
F
F
W
K
F
M
Xenopus tropicalis
E
L
S
D
I
Y
Q
I
H
P
G
D
S
A
D
N
L
S
E
R
L
E
R
E
W
D
R
E
V
A
T
K
K
N
P
K
L
I
N
A
L
K
R
C
F
F
W
K
F
L
Xenopus laevis
E
L
S
D
I
Y
Q
I
H
P
G
D
S
A
D
N
L
S
E
R
L
E
R
E
W
D
R
E
V
A
S
K
K
N
P
K
L
I
N
A
L
K
R
C
F
F
W
K
F
L
Squalus acanthias
E
L
S
D
I
Y
Q
I
P
S
S
D
S
A
D
E
L
S
E
M
L
E
R
E
W
D
R
E
L
A
S
K
K
N
P
K
L
V
N
A
L
R
R
C
F
F
W
R
F
L
Danio rerio
R
P
S
D
V
Y
Q
A
P
S
Q
D
A
A
D
I
L
A
E
R
L
E
K
E
W
D
R
E
V
A
S
K
K
K
P
S
L
L
R
A
M
A
R
C
Y
I
K
P
F
L
Oryzias latipes
E
L
T
D
V
Y
K
A
P
S
F
D
L
A
D
T
L
S
E
R
L
E
R
E
W
D
R
E
V
V
S
K
-
R
P
R
L
L
K
A
L
A
R
C
F
F
L
P
F
A
Takifugu rubripes
E
L
S
D
V
Y
K
A
P
S
F
D
L
A
D
N
L
S
E
R
L
E
R
E
W
D
R
E
I
V
S
K
K
R
P
K
L
M
R
A
L
A
R
C
F
L
G
P
F
L
Tetraodon nigroviridis
E
L
S
D
V
Y
K
A
P
S
F
D
L
A
D
N
L
S
E
R
L
A
E
E
N
G
T
E
I
V
S
K
K
R
P
K
L
M
R
A
L
S
R
C
F
L
G
P
F
V
Caenorhabditis elegans
T
D
T
D
L
L
E
K
P
S
K
G
I
S
A
K
Y
A
A
Q
K
L
S
K
W
I
S
-
-
-
S
P
K
R
T
H
F
L
V
G
V
W
H
C
T
K
V
S
V
M
Species color legend (basic classification):
Great apes | Other monkeys | Prosimians | Other mammals | Lizards | Birds | Amphibians | Fishes | Insects | Nematods | Tunicates | Echinoderms
Ortholog sequences have been selected from the Ensembl(1) and
NCBI websites. Alignment has been performed with
ClustalW(2), version 1.83 or 2.0.7.
Trees have been built using Phylogeny.fr(3), based on the alignments.
Software used is PhyML 3.0 aLRT with default parameters. Pictures of trees have been made using Phylip at Mobyle.
AAPI and AAPIR have been calculated thanks to Bioperl.
Domain conservation:
The domain N-ter of CF Transmembrane conductance Regulator has been shown to interact with:
CF Transmembrane conductance Regulator - MSD1
Wild-type residue p.Trp57 is directly involved in this interaction.
CF Transmembrane conductance Regulator - MSD2
The residue p.Trp57 (N-ter) seems to play a key role in the CFTR function:
p.Trp57 belongs to the lasso motif and is involved in the CFTR folding.
The residue p.Trp57 belongs to the domain N-ter.
1
68
N-terminal region is a cytosolic region, also called the "lasso motif" because of its shape. The first 40 residues are partially inserted into the membrane, while the end form the "lasso" helix (Zhang et al., 2016).
N-ter: N-terminal region is a cytosolic region, also called the "lasso motif" because of its shape. The first 40 residues are partially inserted into the membrane, while the end form the "lasso" helix (Zhang et al., 2016).
N-ter of CF Transmembrane conductance Regulator domain alignment including p.Trp57 residue.
***AAPID: Alignment Average Percentage Identity of the Domain (positions are indicated). !AAPIR: Alignment Average Percentage Identity of the Region (20 residues surrounding position 57). AAPIR appears in green if it is more than 10% compared to AAPID, in red if less than 10%.
Divergencies
Residues present in more than 10% of the sequences are highlighted in blue.
A - 1.63%
G - 0.81%
K - 4.07%
L - 4.07%
M - 4.07%
V - 0.81%
Y - 1.63%
The wild-type residue W57 belongs to the N-ter domain and is conserved at 72.36% among the N-ter homologs (89 / 123 N-ter homologs)
The variant W57G has been found among the N-ter homologs with a non significant frequency: 0.81% (1 / 123 N-ter homologs)
Divergencies show the amino acids which have been selected in the evolution. Residues present in more than 10% of the sequences are highlighted in blue.
Please note that CYSMA does not consider splicing alterations.
Refer to the Help page for more details.
CYSMA's visualizing modules for N-ter domain conservation:
Sequence alignments for NBDs have all been extracted from Prosite(4). Sequence alignments for MSDs have been extracted using the PSI-BLAST web server.
Sequences alignments have been manually re-aligned using a structural alignment including the human CFTR and bacterian ABC transporters with know 3D structures (for MSDs and NBDs domains).
Predictions of secondary structures have been made with PsiPred(9)
, version 2.5, using Protein Multiple Sequences Alignments as input, in order to increase the accuracy of the prediction.
Amino acid frequencies have been calculated from a non redondant set defined by the RCSB.
The Help page will tell you more about it.
3D analysis:
Models provided and analysed by CYSMA must be considered as predictions, therefore be careful when interpreting the results. All efforts have been made to build structures of quality, however, they are provided with NO WARRANTY as to their accuracy with the real biological molecules studied.
Wild type and predicted mutant structures have been compared. You will find the results below.
Click on the MolProbity logo for complete details on the structure quality
This model is made of 37 α helices and of 20 β strands (and is mainly composed of helices (739 residues in helices against 102 in strands, for a total of 1480 amino acids)).
3D structures predicts W57 to be located in an α helice (which confirms PsiPred prediction) and G57 in an α helice. Moreover, the residue is located in the core of this α helice (which contains 18 residues). Helix interior propensities of wild-type and mutant residues are 1.03 and 0.42. Mutant residue has less affinity for the middle of helices.
WARNING! The experimental 3D structure used for our predictions is the complete human CFTR structure which have been solved at a 3.7 Å resolution using cryo-electron microscopy (PDB: 5UAK; Liu et al. 2017). The overall resolution is fairly low so the CYSMA's 3D Automatic Annotation pipeline might have missed some important structural effects.
Introduction of a glycine is likely to increase the flexibility of the region
Solvent accessibility: the wild-type residue W57 is predicted to be buried, while the mutant residue W57G is predicted to be exposed, what could destabilize the domain
The two residues have a different polarity, which could interfere with hydrogen-bonding capabilities
Hydrogen bond network:
W57
G57
none
none
The mutant residue is predicted to form less hydrophobic interactions than the wild-type The wild-type residue TRP is buried and is likely to belong to a hydrophobic pocket or core. This hydrophobic core is missing in the mutant residue GLY, which could destabilize the region
W57
G57
4.42 Å between TRP 57 CE3 and ALA 72 CB
4.89 Å between TRP 57 CZ3 and ALA 72 CB
4.67 Å between TRP 57 CD2 and LEU 375 CD2
4.49 Å between TRP 57 CD1 and LEU 375 CD2
4.94 Å between TRP 57 CE2 and LEU 375 CG
4.86 Å between TRP 57 CZ2 and LEU 375 CG
4.59 Å between TRP 57 CH2 and LEU 375 CD2
4.66 Å between TRP 57 CE3 and LEU 69 CB
4.89 Å between TRP 57 CZ2 and LEU 69 CD1
3.76 Å between TRP 57 CZ3 and LEU 69 CB
4.22 Å between TRP 57 CH2 and LEU 69 CB
4.80 Å between TRP 57 CH2 and PRO 67 CB
4.75 Å between TRP 57 CD1 and LEU 61 CB
4.79 Å between TRP 57 CE2 and LEU 61 CB
4.42 Å between TRP 57 CZ2 and LEU 61 CD1
none
For hydrophobic effects, the important point is the number of residues involved more than the number of interactions.
The mutant residue is not predicted to introduce steric clashes
W57
G57
none
none
CYSMA's 3D visualizing module:
If you want to investigate further the structures, you can use
the JSmol applets of the wild-type (left) and mutant (right) structures.
Click on the JSmol applets' link to hide it.
You have a full access to Jmol commands with a simple right click on one applet.
JSmol Legends:
The residue at the position 57 is located in the center, labelled in yellow and surrounded by its neighboring residues (distance < 5 Å).
Van der Waals contacts with the residue 57 are represented by dotted lines.
Amino acids involved in H-bonds with the residue 57 are labelled in blue.
Amino acids involved in steric clashes with the residue 57 are labelled in red.
The overall structure of the complete human CFTR is represented in ribbon diagrams (click on the Reset button to visualize the overall CFTR structure). The membrane-spanning domain MSD1 is represented in blue and MSD2 in light blue.The nucleotide-binding domain NBD1 is represented in orange, NBD2 in light salmon.The lasso domain is shown in red and the R domain in green.
The 3D structures used in CYSMA are models based on the CFTR experimental 3D structure in the channel-closed conformation (PDB: 5UAK; resolution: 3.9 Å). In the wild-type model, the (missing) loops and the (missing) R domain were built de novo using the software Modeller. For the variant models, the point mutation (homology modelling) are made on the fly with Modeller (more).
Each structure has been assessed with MolProbity(19).
Msms(20) is used to calculate solvent accessibility, and STRIDE(21) (plus stride2pdb)
for secondary structure assignment.
Secondary structure analyses in 3D models uses side chain interaction energies reviewed in (23), as well as amino-acids propensities for N-caps, N1-N3, helix middle, C3-C1 and C-caps extracted from (24)(PDB values).
Structural properties are calculated using an in-house developped program based for the USMA's 3D Automatic Annotation pipeline.
Click on the LOVD picture to check if a variant is described at position 57
Graphical display of the region at NCBI (including SNPs)
CYSMA Report:
Report for p.Trp57Gly variant
CFTR orthologs conservation
The wild-type residue W57 is highly conserved among the CFTR orthologs: 98% (49 / 50 CFTR orthologs)The variant W57G has never been found among the CFTR orthologs
N-ter homologs conservation
The wild-type residue W57 belongs to the N-ter domain and is conserved at 72.36% among the N-ter homologs (89 / 123 N-ter homologs)
The variant W57G has been found among the N-ter homologs with a non significant frequency: 0.81% (1 / 123 N-ter homologs)
Structural effects
The wild-type residue W57 is directly involved in the interaction between N-ter and CF Transmembrane conductance Regulator (MSD1). The wild-type residue W57 seems to play a key role in the CFTR function. W57 belongs to the lasso motif and is involved in the CFTR folding
Introduction of a glycine is likely to increase the flexibility of the region
Solvent accessibility: the wild-type residue W57 is predicted to be buried, while the mutant residue W57G is predicted to be exposed, which could destabilize the domain
The two residues have a different polarity, which could interfere with hydrogen-bonding capabilities
The mutant residue is predicted to form less hydrophobic interactions than the wild-type The wild-type residue TRP is buried and is likely to belong to a hydrophobic pocket or core. This hydrophobic core is missing in the mutant residue GLY, which could destabilize the region
The mutant residue is not predicted to introduce steric clashes