The major goal of the project was to understand how variation in the exome affects heart, lung and blood related traits and diseases.
The study participants were selected from a sample of over 220,000 individuals who participated in another National Institute of Health (NIH) supported study that had collected extensive medical dataon the participants. “Individuals were selected to have a disease endpoint of interest or an extreme trait value of public health importance,” said Dr. Leal.
By sequencing the exomes of 91 cystic fibrosis patients, Dr. Leal and her research colleagues discovered and replicated an association between variants in the DCTN4 gene and when a patient first develops a Pseudomonas aeruginosa airway infection.*
The researchers were also able to replicate many known associations between individual DNA variants and traits, such as high blood levels of low-density lipoprotein, known as the ‘bad’ cholesterol, and C-reactive protein, which increases the body’s response to inflammation.
The majority of these findings are for variants that are common in the population, said Dr. Leal.
To detect associations with rare variants, analyses were performed by aggregating information from individual variants within a gene. This approach successfully detected an association with rare variants in the APOC3 gene that lowers triglyceride levels, an unhealthy type of fat in the blood, said Dr. Leal.
“In order to detect associations with rare variants, due to their modest effects, very large samples sizes are required. In many cases the data from the Exome Sequencing Project gave us leads that had to be evaluated using more study subjects. One mechanism for doing this was by genotyping additional samples using the exome chip, which contains approximately 240,000 coding variants. The Exome Sequencing Project played a very important role in the development of the exome chip, by being the largest contributor of data,” she added.
According to the NHLBI, exome sequencing is an efficient way to search for rare variants associated with complex traits. In contrast to previous genome wide association studies (GWAS), which concentrated on common variants scattered throughout the genome, exome sequencing has the potential to accelerate the search for unambiguous genetic links to disease by focusing attention on the protein coding portion of the genome
In the journal Science**, Dr. Leal and her colleagues wrote that GWAS have substantially improved knowledge about common genetic variation, but have been generally uninformative about the patterns of rare variation within the protein coding regions of the genome.
“This is a very new field for which new methodology had to be developed. We learned many lessons in the quality control and analysis of exome data, as well as what types of results one would expect to see when analyzing rare variants. Additionally, the Exome Sequencing Project has been extremely valuable in obtaining a better understanding of population genomics and the history of man,” Dr. Leal said.
More information: evs.gs.washington.edu/EVS