Analysis of p.Phe200Ile variant, CFTR gene, CF Transmembrane conductance Regulator protein (1480 residues)
Data provided and calculated by CYSMA must be considered as predictions.
They are meant for educational purposes only and are provided with NO WARRANTY with respect to their biological reliability.
The mutant residue cannot be found in the alignment.
There is no gap in the alignment.
The wild-type residue F200 is highly conserved among the CFTR orthologs: 98% (49 / 50 CFTR orthologs)The variant F200I has never been found among the CFTR orthologs
*AAPI: Alignment Average Percentage Identity
**AAPIR: Alignment Average Percentage Identity of the Region (20 residues surrounding position 200). AAPIR appears in green if it is more than 10% compared to AAPI, in red if less than 10%. Click here for more details on the alignment.
Divergencies show the amino acids which have been selected in the evolution.
If you find your variant among them with a high occurrence, there are good chances that your variant will most likely either have a small impact or no impact at all on the CFTR function.
Please note that CYSMA does not consider splicing alterations.
Refer to the Help page for more details.
CYSMA's visualizing modules for Ortholog conservation:
⬇ Download the region alignment (50 residues, Fasta format)
⬇ Download the CFTR phylogenic tree
|
|
200
|
|
Homo sapiens
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Pan troglodytes
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Pongo pygmaeus
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Gorilla gorilla
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Nomascus leucogenys
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Macaca mulatta
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
V
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Macaca nemestrina
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
V
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Macaca fascicularis
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
V
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Papio anubis
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
V
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Callithrix jacchus
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Chlorocebus aethiops
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
V
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Colobus guereza
S
I
G
Q
L
V
S
F
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
V
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Ateles geoffroyi
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Plecturocebus moloch
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
V
F
C
Saimiri boliviensis
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Aotus nancymaae
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
I
W
E
L
L
Q
A
S
A
F
C
Otolemur garnettii
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
E
L
L
Q
A
S
A
F
C
Microcebus murinus
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
M
L
L
M
G
L
L
W
E
L
L
Q
A
S
A
F
C
Vicugna pacos
S
I
G
Q
L
I
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
E
L
L
Q
A
S
T
F
C
Sus scrofa
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
E
L
L
Q
A
S
A
F
C
Bos taurus
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
E
L
L
Q
A
F
T
F
C
Muntiacus reevesi
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
E
L
L
Q
A
F
T
F
C
Muntiacus muntjak
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
E
L
L
Q
A
F
T
F
C
Ovis aries
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
D
L
L
Q
A
F
T
F
C
Equus caballus
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
C
Canis familiaris
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
C
Loxodonta africana
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
I
W
D
L
L
Q
A
S
A
F
C
Mustela furo
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
C
Oryctolagus cuniculus
S
I
G
Q
L
I
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
S
P
L
Q
V
T
L
L
M
G
L
L
W
E
L
L
Q
A
S
A
F
C
Atelerix albiventris
S
I
G
Q
L
I
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
V
P
L
Q
V
T
L
L
M
G
L
L
W
E
L
L
Q
A
S
A
F
C
Dasypus novemcinctus
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
S
Rhinolophus ferrumequinum
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
M
L
L
M
G
L
L
W
E
L
L
Q
A
S
A
F
C
Cavia porcellus
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
C
Monodelphis domestica
S
T
G
Q
L
I
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
V
P
L
Q
V
V
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
C
Ornithorhynchus anatinus
S
T
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
M
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
C
Didelphis virginiana
S
T
G
Q
L
I
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
V
P
L
Q
V
V
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
C
Trichosurus vulpecula
S
T
G
Q
L
I
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
V
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
C
Carollia perspicillata
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
E
L
L
Q
A
S
A
F
Y
Mus musculus
S
I
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
I
W
I
A
P
L
Q
V
T
L
L
M
G
L
L
W
D
L
L
Q
F
S
A
F
C
Rattus norvegicus
S
I
G
Q
L
I
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
I
W
I
A
P
L
Q
V
V
L
L
M
G
L
L
W
D
L
L
Q
F
S
A
F
C
Gallus gallus
S
T
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
L
W
D
M
L
Q
A
S
A
F
A
Taeniopygia guttata
S
T
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
A
L
L
M
G
L
L
W
D
M
L
E
A
S
A
F
S
Xenopus tropicalis
S
T
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
L
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
C
Xenopus laevis
S
T
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
L
A
L
A
H
F
V
W
I
A
P
L
Q
V
L
L
L
M
G
L
L
W
D
L
L
Q
A
S
A
F
C
Squalus acanthias
D
T
G
Q
L
V
S
L
L
S
N
N
L
N
K
F
D
E
G
V
A
V
A
H
F
V
W
I
A
P
V
Q
V
V
L
L
M
G
L
I
W
N
E
L
T
E
F
V
F
C
Danio rerio
S
T
G
Q
L
V
S
L
M
S
A
N
L
G
K
F
D
Q
S
L
G
M
A
H
F
I
W
I
S
P
L
Q
C
I
L
C
T
G
L
I
W
E
L
I
D
V
N
S
F
C
Oryzias latipes
T
T
G
Q
L
V
S
L
M
S
A
H
L
N
K
L
D
E
S
L
G
L
A
H
F
V
W
I
T
P
L
Q
C
M
L
C
A
G
L
V
W
E
L
I
E
V
N
S
L
W
Takifugu rubripes
S
T
D
Q
L
V
S
L
M
S
A
H
L
N
K
L
D
E
S
L
G
L
A
H
F
I
W
I
T
P
L
Q
C
I
L
C
V
G
L
I
W
E
L
I
E
V
N
G
F
C
Tetraodon nigroviridis
S
T
D
Q
L
V
S
L
M
S
A
H
L
N
K
L
D
E
S
L
G
L
A
H
F
I
W
I
T
P
L
Q
C
I
L
C
V
G
L
I
W
E
L
I
E
V
N
G
F
C
Caenorhabditis elegans
N
S
G
Q
I
I
Q
L
L
N
T
D
A
A
K
L
E
Q
A
F
L
F
A
H
Y
V
W
L
C
P
L
L
M
F
F
Y
A
Y
I
L
W
S
M
F
G
F
C
C
L
L
Species color legend (basic classification):
Great apes | Other monkeys | Prosimians | Other mammals | Lizards | Birds | Amphibians | Fishes | Insects | Nematods | Tunicates | Echinoderms
Ortholog sequences have been selected from the Ensembl(1) and
NCBI websites. Alignment has been performed with
ClustalW(2), version 1.83 or 2.0.7.
Trees have been built using Phylogeny.fr(3), based on the alignments.
Software used is PhyML 3.0 aLRT with default parameters. Pictures of trees have been made using Phylip at Mobyle.
AAPI and AAPIR have been calculated thanks to Bioperl.
Domain conservation:
The domain MSD1 of CF Transmembrane conductance Regulator has been shown to interact with:
The residue p.Phe200 (MSD1) seems to play a key role in the CFTR function:
p.Phe200 is involved in the pore construction.
The residue p.Phe200 belongs to the domain MSD1.
69
422
is the membrane-spanning domain 1, composed of six transmembrane helices (TM1-TM6). Four of the six TMs protrude into the cytosol to form the intracellular loops ICL1 (between TM2 and TM3) and ICL2 (between TM4 and TM5). ICL1 contacts NBD1 at the level of the ATP-binding site, while ICL2 binds in a groove located at the surface of NBD2.
MSD1: is the membrane-spanning domain 1, composed of six transmembrane helices (TM1-TM6). Four of the six TMs protrude into the cytosol to form the intracellular loops ICL1 (between TM2 and TM3) and ICL2 (between TM4 and TM5). ICL1 contacts NBD1 at the level of the ATP-binding site, while ICL2 binds in a groove located at the surface of NBD2.
MSD1 of CF Transmembrane conductance Regulator domain alignment including p.Phe200 residue.
***AAPID: Alignment Average Percentage Identity of the Domain (positions are indicated). !AAPIR: Alignment Average Percentage Identity of the Region (20 residues surrounding position 200). AAPIR appears in green if it is more than 10% compared to AAPID, in red if less than 10%.
Divergencies
Residues present in more than 10% of the sequences are highlighted in blue.
A - 2.80%
D - 2.10%
E - 1.86%
G - 9.09%
I - 9.56%
K - 4.43%
L - 14.69%
M - 3.26%
N - 3.96%
P - 5.36%
Q - 0.23%
R - 2.80%
S - 6.99%
T - 5.36%
V - 5.36%
Y - 0.70%
The wild-type residue F200 belongs to the MSD1 domain and is conserved at 18.65% among the MSD1 homologs (80 / 429 MSD1 homologs)
The variant F200I has been found among the MSD1 homologs with a notable frequency: 9.56% (41 / 429 MSD1 homologs)
Divergencies show the amino acids which have been selected in the evolution. Residues present in more than 10% of the sequences are highlighted in blue.
Please note that CYSMA does not consider splicing alterations.
Refer to the Help page for more details.
CYSMA's visualizing modules for MSD1 domain conservation:
Sequence alignments for NBDs have all been extracted from Prosite(4). Sequence alignments for MSDs have been extracted using the PSI-BLAST web server.
Sequences alignments have been manually re-aligned using a structural alignment including the human CFTR and bacterian ABC transporters with know 3D structures (for MSDs and NBDs domains).
Predictions of secondary structures have been made with PsiPred(9)
, version 2.5, using Protein Multiple Sequences Alignments as input, in order to increase the accuracy of the prediction.
Amino acid frequencies have been calculated from a non redondant set defined by the RCSB.
The Help page will tell you more about it.
3D analysis:
Models provided and analysed by CYSMA must be considered as predictions, therefore be careful when interpreting the results. All efforts have been made to build structures of quality, however, they are provided with NO WARRANTY as to their accuracy with the real biological molecules studied.
Wild type and predicted mutant structures have been compared. You will find the results below.
Click on the MolProbity logo for complete details on the structure quality
This model is made of 37 α helices and of 20 β strands (and is mainly composed of helices (739 residues in helices against 102 in strands, for a total of 1480 amino acids)).
3D structures predicts F200 to be located in an α helice (which confirms PsiPred prediction) and I200 in an α helice. Moreover, the residue is located in the core of this α helice (which contains 39 residues). Helix interior propensities of wild-type and mutant residues are 1.06 and 1.3.
WARNING! The experimental 3D structure used for our predictions is the complete human CFTR structure which have been solved at a 3.7 Å resolution using cryo-electron microscopy (PDB: 5UAK; Liu et al. 2017). The overall resolution is fairly low so the CYSMA's 3D Automatic Annotation pipeline might have missed some important structural effects.
Solvent accessibility: the wild-type F200 and the mutant F200I are predicted to be buried
Hydrogen bond network:
F200
I200
none
none
The mutant residue is predicted to form less hydrophobic interactions than the wild-type The wild-type residue PHE is buried and is likely to belong to a hydrophobic pocket or core. This hydrophobic core is maintained in the mutant residue ILE
F200
I200
4.81 Å between PHE 200 CE1 and LEU 88 CB
4.44 Å between PHE 200 CZ and LEU 88 CD1
4.65 Å between PHE 200 CD2 and LEU 197 CB
4.71 Å between PHE 200 CE2 and LEU 197 CB
4.16 Å between PHE 200 CB and ALA 357 CB
4.52 Å between PHE 200 CB and ILE 203 CB
4.84 Å between PHE 200 CG and ILE 203 CB
4.13 Å between PHE 200 CD1 and ILE 203 CB
4.87 Å between PHE 200 CE1 and ILE 203 CG1
4.78 Å between ILE 200 CD1 and LEU 88 CD1
5.00 Å between ILE 200 CG2 and ALA 204 CB
4.18 Å between ILE 200 CB and ALA 357 CB
4.23 Å between ILE 200 CG2 and ALA 357 CB
4.96 Å between ILE 200 CG2 and ILE 203 CB
4.84 Å between ILE 200 CD1 and ILE 203 CB
For hydrophobic effects, the important point is the number of residues involved more than the number of interactions.
The mutant residue is not predicted to introduce steric clashes
F200
I200
none
none
CYSMA's 3D visualizing module:
If you want to investigate further the structures, you can use
the JSmol applets of the wild-type (left) and mutant (right) structures.
Click on the JSmol applets' link to hide it.
You have a full access to Jmol commands with a simple right click on one applet.
JSmol Legends:
The residue at the position 200 is located in the center, labelled in yellow and surrounded by its neighboring residues (distance < 5 Å).
Van der Waals contacts with the residue 200 are represented by dotted lines.
Amino acids involved in H-bonds with the residue 200 are labelled in blue.
Amino acids involved in steric clashes with the residue 200 are labelled in red.
The overall structure of the complete human CFTR is represented in ribbon diagrams (click on the Reset button to visualize the overall CFTR structure). The membrane-spanning domain MSD1 is represented in blue and MSD2 in light blue.The nucleotide-binding domain NBD1 is represented in orange, NBD2 in light salmon.The lasso domain is shown in red and the R domain in green.
The 3D structures used in CYSMA are models based on the CFTR experimental 3D structure in the channel-closed conformation (PDB: 5UAK; resolution: 3.9 Å). In the wild-type model, the (missing) loops and the (missing) R domain were built de novo using the software Modeller. For the variant models, the point mutation (homology modelling) are made on the fly with Modeller (more).
Each structure has been assessed with MolProbity(19).
Msms(20) is used to calculate solvent accessibility, and STRIDE(21) (plus stride2pdb)
for secondary structure assignment.
Secondary structure analyses in 3D models uses side chain interaction energies reviewed in (23), as well as amino-acids propensities for N-caps, N1-N3, helix middle, C3-C1 and C-caps extracted from (24)(PDB values).
Structural properties are calculated using an in-house developped program based for the USMA's 3D Automatic Annotation pipeline.
Click on the LOVD picture to check if a variant is described at position 200
Graphical display of the region at NCBI (including SNPs)
CYSMA Report:
Report for p.Phe200Ile variant
CFTR orthologs conservation
The wild-type residue F200 is highly conserved among the CFTR orthologs: 98% (49 / 50 CFTR orthologs)The variant F200I has never been found among the CFTR orthologs
MSD1 homologs conservation
The wild-type residue F200 belongs to the MSD1 domain and is conserved at 18.65% among the MSD1 homologs (80 / 429 MSD1 homologs)
The variant F200I has been found among the MSD1 homologs with a notable frequency: 9.56% (41 / 429 MSD1 homologs)
Structural effects
The wild-type residue F200 seems to play a key role in the CFTR function. F200 is involved in the pore construction
Solvent accessibility: the wild-type F200 and the mutant F200I are predicted to be buried
The mutant residue is predicted to form less hydrophobic interactions than the wild-type The wild-type residue PHE is buried and is likely to belong to a hydrophobic pocket or core. This hydrophobic core is maintained in the mutant residue ILE
The mutant residue is not predicted to introduce steric clashes
Allele frequency
The variant F200I in gnomAD (123,136 exomes): 3.98e-06 The variant F200L in gnomAD (123,136 exomes): 3.98e-06 ; variant F200L in gnomAD (15,496 genomes): 3.19e-05
Clinical significance
The variant F200I has been been described as Uncertain significance - criteria provided, single submitter - (ClinVar for more details)
CFTR-France
The variant F200I might correspond to: NM_000492.3:c.598T>A, which is reported to be disease-causing (CFTR-France for more details)
Additional resources
SIFT prediction: variant F200I is predicted to be damaging (score: 0.01)
PPH2 prediction: variant F200I is predicted to be damaging (score: 1)