Analysis of p.Arg31Leu variant, CFTR gene, CF Transmembrane conductance Regulator protein (1480 residues)
Data provided and calculated by CYSMA must be considered as predictions.
They are meant for educational purposes only and are provided with NO WARRANTY with respect to their biological reliability.
The mutant residue cannot be found in the alignment.
There is no gap in the alignment.
The wild-type residue R31 is highly conserved among the CFTR orthologs: 80% (40 / 50 CFTR orthologs).
The variant R31L has never been found among the CFTR orthologs.
*AAPI: Alignment Average Percentage Identity
**AAPIR: Alignment Average Percentage Identity of the Region (20 residues surrounding position 31). AAPIR appears in green if it is more than 10% compared to AAPI, in red if less than 10%.
Divergencies show the amino acids that have been selected during evolution.
If your variant appears among them at a high frequency, it is likely to have little or no impact on CFTR function.
Please note that CYSMA does not consider splicing alterations.
Refer to the Help page for more details.
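To illustrate how AAPI and AAPIR relate, the sketch below computes an average pairwise percentage identity over a whole alignment and over a window around one column. It is only a sketch under stated assumptions: CYSMA computes these values with Bioperl, whose exact formula may differ, and the function names, the window of 10 columns on each side, and the choice of three example rows (human, mouse, zebrafish, taken from the region alignment) are made for this example only.

```python
from itertools import combinations
from typing import List, Optional


def pairwise_identity(a: str, b: str) -> float:
    """Percent identity over columns where neither sequence has a gap."""
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        return 0.0
    matches = sum(1 for x, y in compared if x == y)
    return 100.0 * matches / len(compared)


def average_identity(seqs: List[str], start: int = 0, end: Optional[int] = None) -> float:
    """Mean pairwise percent identity over alignment columns [start, end)."""
    sliced = [s[start:end] for s in seqs]
    scores = [pairwise_identity(a, b) for a, b in combinations(sliced, 2)]
    return sum(scores) / len(scores)


# Three rows of the region alignment: Homo sapiens, Mus musculus, Danio rerio.
seqs = [
    "EKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSVDSADNLSEKLERE",
    "EKASFISKLFFSWTTPILRKGYRHHLELSDIYQAPSADSADHLSEKLERE",
    "EDANCLSRYFFWWTNPIMRKGFKEKLRPSDVYQAPSQDAADILAERLEKE",
]

center = 24  # 0-based column of position 31 (assumption: 25th column of the region)
flank = 10   # assumed reading of "20 residues surrounding position 31"

aapi = average_identity(seqs)
aapir = average_identity(seqs, max(0, center - flank), center + flank + 1)
print(f"AAPI-like value:  {aapi:.2f}%")
print(f"AAPIR-like value: {aapir:.2f}%")
```

Comparing the two numbers mirrors the green/red flag described above: a region value well above the whole-alignment value suggests the neighbourhood of position 31 is better conserved than the alignment on average.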
CYSMA's visualizing modules for Ortholog conservation:
Region alignment (50 columns, one row per species). Position 31 is the 25th column of each row.
Homo sapiens               EKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSVDSADNLSEKLERE
Pan troglodytes            EKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSVDSADNLSEKLERE
Pongo pygmaeus             EKASVVSKLFFSWTRPILKKGYRQRLELSDIYQIPSADSADNLSEKLERE
Gorilla gorilla            EKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSVDSADNLSEKLERE
Nomascus leucogenys        EKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Macaca mulatta             EKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Macaca nemestrina          EKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Macaca fascicularis        EKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Papio anubis               EKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Callithrix jacchus         EKASVVSKLFFSWTRPILKKGYRQRLELSDIYQIPSADSADNLSEKLERE
Chlorocebus aethiops       EKASVVSKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Colobus guereza            EKASVVSKLFFSWTRPILRKGYRQRLESSDIYQIPSADSADYLSEKLERE
Ateles geoffroyi           EKASVVSKLFFSWTRPILKKGYRQRLELSDIYQIPSANSADNLSEKLERE
Plecturocebus moloch       EKASVVSKLFFSWTRPILKKGYRQRLELSDIYQIPSADSADNLSEKLERE
Saimiri boliviensis        EKASVVSKLFFSWTRPVLKKGYRQRLELSDIYQIPSADSADNLSEKLERE
Aotus nancymaae            EKASVVSKLFFSWTRPILKKGYRQRLELSDIYQIPSADSADNLSEKLERE
Otolemur garnettii         EKASVFSKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Microcebus murinus         EKASVLSKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Vicugna pacos              ------------WTRPILRKGYRQRLELSDIYHISSSDSADNLSEKLERE
Sus scrofa                 EKASIFSKLFFSWTRPILRKGYRQRLELSDIYHISSSDSADNLSEKLERE
Bos taurus                 EKASVVSKLFFSWTRPILKKGYRQRLELSDIYHISSSDSADNLSEKLERE
Muntiacus reevesi          EKASVVSKLFFSWTRPILKKGYRQRLELSDIYHISSSDSADNLSEKLERE
Muntiacus muntjak          EKASVVSKLFFSWTRPILKKGYRQRLELSDIYHISSSDSADNLSEKLERE
Ovis aries                 EKASVVSKLFFSWTRPILKKGYRQRLELSDIYHISSSDSADNLSEKLERE
Equus caballus             EKASVISKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Canis familiaris           EKASVLSKLFFSWTRPILIKGYRQRLELSDIYQVPSTDSADHLSEKLERE
Loxodonta africana         ERASVISKLFFSWPGPILRKGYRQHLKLSDIYQIPSVDSADNLSEKLERE
Mustela furo               EKASVLSKLFFSWTRPILTKGYRQRLELSDIYQIPSADSADNLSEKLERE
Oryctolagus cuniculus      EKAGVLSKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Atelerix albiventris       EKASVISKLFFSWTRPILRKGYRQRLELSDIYQIPSTDSADNLSEKLERE
Dasypus novemcinctus       EKASVISKLFFSWTRPILRKGYRQRLELSDIYQIPSADSADNLSEKLERE
Rhinolophus ferrumequinum  EKASVISKLFFSWTIPILKKGYRQRLELSDIYQISSADSADNLSEKLERE
Cavia porcellus            TQFFISMKIPFSWTRPILKKGYRKRLEVSDIYQVPSADSADNLSEELERE
Monodelphis domestica      EKANLLSKLFFSWTRPILSKGFRKRLELSDIYQIPSSNSADNLSEKLERE
Ornithorhynchus anatinus   ERANLFSKLFFSWTKPILKKGYRQHLELSDIYQIPTADSADNLSEKLERE
Didelphis virginiana       EKANFLSKLFFSWTRPILRKGFRQRLELSDIYQIPLSNSADYLSENLERE
Trichosurus vulpecula      EKANVFSKLFFSWTRPILKKGFRRRLELSDIYQIPSCNSADHLSEKLERE
Carollia perspicillata     EKASVVSKLFFSWTRPILKKGYRQRLELSDIYQIPSADSADNLSEILERE
Mus musculus               EKASFISKLFFSWTTPILRKGYRHHLELSDIYQAPSADSADHLSEKLERE
Rattus norvegicus          EKASFISKLFFSWTTPILRKGYRHHLELSDIYQAPSSDSADHLSEKLERE
Gallus gallus              RNQSSLFYFFFRWTKPILKKGYRQRLELSDIYQIPSADSADNLSEKLERE
Taeniopygia guttata        ------------WTNPILKKGYRRRLELSDIYQIPSADSADNLSEKLERE
Xenopus tropicalis         EKASIFSQIFFSWTKPILWKGYRQRLELSDIYQIHPGDSADNLSERLERE
Xenopus laevis             EKASIFSQIFFSWTKPILWKGYRQRLELSDIYQIHPGDSADNLSERLERE
Squalus acanthias          EKANAFSKLFFRWPRPILKKGYRQKLELSDIYQIPSSDSADELSEMLERE
Danio rerio                EDANCLSRYFFWWTNPIMRKGFKEKLRPSDVYQAPSQDAADILAERLEKE
Oryzias latipes            EDANFVSRFLFWWITPLLRRGLNKKLELTDVYKAPSFDLADTLSERLERE
Takifugu rubripes          EDANFLSKFFFWWTSPLLRKGFKKKLELSDVYKAPSFDLADNLSERLERE
Tetraodon nigroviridis     EDANFLSKFFFWWTSPLLRKGFRKKLELSDVYKAPSFDLADNLSERLAEE
Caenorhabditis elegans     DSAGLISSILFSFVGFYWWR-TRNAQTDTDLLEKPSKGISAKYAAQKLSK
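The conservation figure quoted for the orthologs (R31 in 40 of 50 sequences, R31L never observed) can be re-derived from the alignment. The following is a minimal Python sketch, assuming that position 31 corresponds to the 25th alignment column; the `column_31` string is transcribed from the 50 species rows in their listed order, and the helper name `residue_frequencies` is invented for this example.

```python
from collections import Counter
from typing import Dict

# Residue found at the position-31 column in each of the 50 orthologs,
# in the order the species are listed (Homo sapiens first, C. elegans last).
column_31 = (
    "RRRRRRRRRR"  # Homo sapiens .. Callithrix jacchus
    "RRRRRRRRRR"  # Chlorocebus aethiops .. Sus scrofa
    "RRRRRRHRRR"  # Bos taurus .. Atelerix albiventris (H: Loxodonta africana)
    "RRRRHRRRHH"  # Dasypus novemcinctus .. Rattus norvegicus
    "RRRRKKKKKA"  # Gallus gallus .. Caenorhabditis elegans
)


def residue_frequencies(column: str) -> Dict[str, float]:
    """Percentage of sequences carrying each residue at one alignment column."""
    counts = Counter(column)
    return {res: 100.0 * n / len(column) for res, n in counts.items()}


freqs = residue_frequencies(column_31)
for residue, pct in sorted(freqs.items(), key=lambda kv: -kv[1]):
    print(f"{residue}: {pct:.0f}%")

# The variant residue (L) is absent from this column.
print("L observed:", "L" in freqs)
```

The non-arginine residues at this column are histidine (elephant, platypus, mouse, rat), lysine (spiny dogfish and the four bony fishes) and alanine (C. elegans).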
Ortholog sequences have been selected from the Ensembl(1) and NCBI websites. Alignment has been performed with ClustalW(2), version 1.83 or 2.0.7.
Trees have been built using Phylogeny.fr(3), based on the alignments. The software used is PhyML 3.0 aLRT with default parameters. Pictures of trees have been made using Phylip at Mobyle.
AAPI and AAPIR have been calculated with Bioperl.
Domain conservation:
The domain N-ter of CF Transmembrane conductance Regulator has been shown to interact with:
CF Transmembrane conductance Regulator - MSD1
CF Transmembrane conductance Regulator - MSD2
The residue p.Arg31 belongs to the domain N-ter.
N-ter: residues 1 to 68. The N-terminal region is a cytosolic region, also called the "lasso motif" because of its shape. The first 40 residues are partially inserted into the membrane, while the rest forms the "lasso" helix (Zhang et al., 2016).
N-ter of CF Transmembrane conductance Regulator domain alignment including p.Arg31 residue.
***AAPID: Alignment Average Percentage Identity of the Domain (positions are indicated). !AAPIR: Alignment Average Percentage Identity of the Region (20 residues surrounding position 31). AAPIR appears in green if it is more than 10% compared to AAPID, in red if less than 10%.
Divergencies
Residues present in more than 10% of the sequences are highlighted in blue.
A - 1.63%
C - 3.25%
D - 1.63%
E - 3.25%
G - 4.88%
H - 3.25%
I - 2.44%
K - 4.88%
L - 1.63%
P - 28.46%
S - 0.81%
T - 4.88%
V - 3.25%
Y - 2.44%
The wild-type residue R31 belongs to the N-ter domain and is conserved at 33.33% among the N-ter homologs (41 / 123 N-ter homologs)
The variant R31L has been found among the N-ter homologs with a non significant frequency: 1.63% (2 / 123 N-ter homologs)
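The divergence percentages can be converted back into sequence counts to check that they agree with the statements above. This is a small sketch in Python; the variable names are invented for the example, and the only inputs are the percentages listed under Divergencies, the total of 123 N-ter homologs, and the 41 homologs carrying the wild-type arginine.

```python
TOTAL_HOMOLOGS = 123
WILD_TYPE_COUNT = 41  # homologs carrying R at this position (33.33%)

# Percentages reported for the non-wild-type residues at position 31.
divergence_percent = {
    "A": 1.63, "C": 3.25, "D": 1.63, "E": 3.25, "G": 4.88,
    "H": 3.25, "I": 2.44, "K": 4.88, "L": 1.63, "P": 28.46,
    "S": 0.81, "T": 4.88, "V": 3.25, "Y": 2.44,
}

# Convert each percentage to the nearest whole number of sequences.
counts = {res: round(pct * TOTAL_HOMOLOGS / 100) for res, pct in divergence_percent.items()}

# Residues that pass the 10% highlighting threshold.
highlighted = [res for res, pct in divergence_percent.items() if pct > 10.0]

print("Leucine count:", counts["L"])
print("Residues above 10%:", highlighted)
print("Sequences accounted for:", sum(counts.values()) + WILD_TYPE_COUNT)
```

The counts add up to the full set of 123 homologs once the 41 arginines are included, and proline is the only divergent residue above the 10% threshold.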
Sequence alignments for NBDs have all been extracted from Prosite(4). Sequence alignments for MSDs have been extracted using the PSI-BLAST web server.
Sequence alignments have been manually re-aligned using a structural alignment including the human CFTR and bacterial ABC transporters with known 3D structures (for MSD and NBD domains).
Predictions of secondary structures have been made with PsiPred(9), version 2.5, using protein multiple sequence alignments as input in order to increase the accuracy of the prediction.
Amino acid frequencies have been calculated from a non-redundant set defined by the RCSB.
The Help page will tell you more about it.
3D analysis:
Models provided and analysed by CYSMA must be considered as predictions, therefore be careful when interpreting the results. All efforts have been made to build structures of quality, however, they are provided with NO WARRANTY as to their accuracy with the real biological molecules studied.
Wild type and predicted mutant structures have been compared. You will find the results below.
This model is made of 37 α helices and 20 β strands and is mainly composed of helices (739 residues in helices against 102 in strands, for a total of 1480 amino acids).
The 3D structures predict R31 to be located in a loop (which is consistent with the PsiPred prediction) and L31 to be located in a loop as well.
WARNING! The experimental 3D structure used for our predictions is the complete human CFTR structure, which was solved at 3.9 Å resolution using cryo-electron microscopy (PDB: 5UAK; Liu et al. 2017). The overall resolution is fairly low, so CYSMA's 3D Automatic Annotation pipeline might have missed some important structural effects.
Solvent accessibility: the wild-type R31 and the mutant R31L are predicted to be exposed
The two residues have a different polarity, which could interfere with hydrogen-bonding capabilities
The two residues have different charge properties, which could interfere with ionic bonds or potential inter- or intra-molecular interactions
The hydrogen bond(s) present in the wild-type is(are) missing in the mutant
R31: distance: 3.37 Å / angle: 2.94 rad between ARG 31 NH1 and LEU 32 O
R31: distance: 4.01 Å / angle: 1.95 rad between ARG 31 NH1 and GLU 33 OE1
L31: none
The salt bridge interaction present in the wild-type is missing in the mutant
R31: 4.71 Å between NH1 and GLU 33 OE2
L31: none
The mutant residue is not predicted to introduce steric clashes
R31: none
L31: none
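The hydrogen-bond angles are reported in radians, which are harder to read than degrees. The short Python sketch below converts the two wild-type values; the tuple layout and the name `converted` are made up for this example, and no geometric cutoff is applied because the criteria used by the pipeline are not stated here.

```python
import math

# (donor atom, acceptor atom, distance in Å, angle in radians), as reported for R31.
h_bonds = [
    ("ARG 31 NH1", "LEU 32 O", 3.37, 2.94),
    ("ARG 31 NH1", "GLU 33 OE1", 4.01, 1.95),
]

# Same records with the angle expressed in degrees, rounded to one decimal.
converted = [
    (donor, acceptor, dist, round(math.degrees(angle), 1))
    for donor, acceptor, dist, angle in h_bonds
]

for donor, acceptor, dist, angle_deg in converted:
    print(f"{donor} -> {acceptor}: {dist:.2f} Å, {angle_deg:.1f} degrees")
```

The first contact is close to linear (about 168 degrees), while the second is longer and more bent (about 112 degrees).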
JSmol Legends:
The residue at the position 31 is located in the center, labelled in yellow and surrounded by its neighboring residues (distance < 5 Å).
Van der Waals contacts with the residue 31 are represented by dotted lines.
Amino acids involved in H-bonds with the residue 31 are labelled in blue.
Amino acids involved in steric clashes with the residue 31 are labelled in red.
The overall structure of the complete human CFTR is represented in ribbon diagrams. The membrane-spanning domain MSD1 is represented in blue and MSD2 in light blue. The nucleotide-binding domain NBD1 is represented in orange and NBD2 in light salmon. The lasso domain is shown in red and the R domain in green.
The 3D structures used in CYSMA are models based on the CFTR experimental 3D structure in the channel-closed conformation (PDB: 5UAK; resolution: 3.9 Å). In the wild-type model, the missing loops and the missing R domain were built de novo using the software Modeller. For the variant models, the point mutations are introduced on the fly by homology modelling with Modeller.
Each structure has been assessed with MolProbity(19).
Msms(20) is used to calculate solvent accessibility, and STRIDE(21) (plus stride2pdb) for secondary structure assignment.
Secondary structure analyses in 3D models use side chain interaction energies reviewed in (23), as well as amino acid propensities for N-caps, N1-N3, helix middle, C3-C1 and C-caps extracted from (24) (PDB values).
Structural properties are calculated using an in-house developed program based on the USMA 3D Automatic Annotation pipeline.
CYSMA Report:
Report for p.Arg31Leu variant
CFTR orthologs conservation
The wild-type residue R31 is highly conserved among the CFTR orthologs: 80% (40 / 50 CFTR orthologs).
The variant R31L has never been found among the CFTR orthologs.
N-ter homologs conservation
The wild-type residue R31 belongs to the N-ter domain and is conserved at 33.33% among the N-ter homologs (41 / 123 N-ter homologs)
The variant R31L has been found among the N-ter homologs with a non significant frequency: 1.63% (2 / 123 N-ter homologs)
Structural effects
Solvent accessibility: the wild-type R31 and the mutant R31L are predicted to be exposed
The two residues have a different polarity, which could interfere with hydrogen-bonding capabilities
The two residues have different charge properties, which could interfere with ionic bonds or potential inter- or intra-molecular interactions
The hydrogen bond(s) present in the wild-type is(are) missing in the mutant
The salt bridge interaction present in the wild-type is missing in the mutant
The mutant residue is not predicted to introduce steric clashes
Allele frequency
The variant R31C in gnomAD (123,136 exomes): 1.68e-03; in gnomAD (15,496 genomes): 1.34e-03
The variant R31H in gnomAD (123,136 exomes): 5.58e-05
The variant R31L in gnomAD (123,136 exomes): 3.19e-05; in gnomAD (15,496 genomes): 9.56e-05
Clinical significance
The variant R31L has been described as Uncertain significance - criteria provided, multiple submitters, no conflicts (see ClinVar for more details)
CFTR-France
The variant R31L has not been described in CFTR-France
Additional resources
SIFT prediction: variant R31L is predicted to be tolerated (score: 0.07)
PPH2 prediction: variant R31L is predicted to be benign (score: 0.325)
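For context on the SIFT score, the sketch below applies the conventional SIFT cutoff of 0.05, below which a substitution is usually called deleterious. It is an illustration only: the constant and function names are invented for this example, and the PolyPhen-2 score is left out because its class boundaries depend on the model used, which is not stated here.

```python
SIFT_CUTOFF = 0.05  # conventional threshold; scores below it are called deleterious


def sift_call(score: float) -> str:
    """Return the usual SIFT label for a score between 0 and 1."""
    return "deleterious" if score < SIFT_CUTOFF else "tolerated"


r31l_score = 0.07
margin = r31l_score - SIFT_CUTOFF

print(f"R31L: {sift_call(r31l_score)} (score {r31l_score}, {margin:.2f} above the cutoff)")
```

A score of 0.07 falls on the tolerated side but sits close to the cutoff, so the call is best read together with the conservation and structural evidence above.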