RANDOMIZZAZIONE MENDELIANA: STUDI RANDOMIZZATI NELL'ERA POST GENOMICA
   
Author Thanassoulis G and O'Donnell CJ.
Title Mendelian randomization. Nature's randomized trial in the post-genome era
Full source JAMA 2009;301:2386-2388
Text

Despite several observational studies showing that lipoprotein(a) is associated with myocardial infarction (MI),1-2 only circumstantial evidence exists regarding the causal nature of this association. Observational epidemiological studies, even with a sound prospective design, can provide hints to disease pathogenesis when the effect size is modest but cannot provide definitive evidence for causal relationships. Much of the current understanding of the causal factors in cardiovascular disease, such as the role of low-density lipoprotein, has been confirmed by randomized controlled trials (RCTs).3-4 However, RCTs are not always feasible. In the case of lipoprotein(a), the modest effect size and the lack of specific lipoprotein(a)-lowering therapy are major obstacles to obtaining causal evidence for its role in cardiovascular disease. In this issue of JAMA, Kamstrup and colleagues5 provide insights using a mendelian randomization approach and provide evidence for the causal role of lipoprotein(a) in MI. This study elegantly demonstrates how mendelian randomization can be used to improve the evidence for causality from observational studies and highlights the advantages and limitations of such an approach.

The study by Kamstrup et al5 can be viewed as a form of natural "randomized trial" of plasma lipoprotein(a) level on MI with a few important differences from traditional clinical trials. Instead of using drug therapy to decrease plasma lipoprotein(a) levels, genetic variation in the lipoprotein apo(a) gene, LPA, that controls plasma levels of lipoprotein(a) are used. Randomization, which minimizes the effect of confounding variables, is achieved through the random assortment of these LPA gene variants from parents to offspring that occurs during gamete formation and conception. In this way, because LPA variants are randomly assigned, they are unlikely to associate with other nongenetic factors, such as lifestyle factors, that could act as confounders, thereby providing an unbiased assessment of the role of lipoprotein(a) in MI.

Lipoprotein(a) represents an excellent example of a genetic exposure that can be evaluated using the mendelian randomization approach. Such an approach requires highly accurate quantitation of the triangulation of 3 associations: LPA gene variants and serum lipoprotein(a) levels, serum lipoprotein(a) levels and MI events, and LPA gene variants and MI events. Much of the population variation (30%-60%) in lipoprotein(a) levels is explained by the number of copy number variants, known as kringle IV type 2 (KIV-2) repeats, a form of structural variation, in the LPA gene.5-6 With increasing number of KIV-2 repeats in the LPA gene, plasma levels of lipoprotein(a) are reduced. In the Copenhagen City Heart Study (CCHS), the authors demonstrate that this important relationship holds from early adulthood to old age, confirming that individuals with increased KIV-2 repeats have lifelong lower plasma levels of lipoprotein(a). Furthermore, the previously established association between plasma lipoprotein(a) levels and MI is also confirmed in the same population.1-2 The authors also demonstrate that LPA gene variants are unequivocally associated with MI, an association that has been reported by other groups.7-12

The major strengths of this study include the estimation of the 3 key associations required for mendelian randomization within the same population, the replication of these associations in several independent cohorts, and the use of instrumental variable analysis, frequently used in econometrics, to demonstrate that a genetically determined doubling of lipoprotein(a) plasma levels leads to a 22% increase in the risk of MI. Because lipoprotein(a) levels are stable with increasing age, this represents the best estimate for the largely unconfounded effect of lifelong lipoprotein(a) plasma levels on MI risk.

Despite the many advantages of this approach, a number of considerations can invalidate the results of such studies.13 For example, as in an RCT, the randomization process can fail, leading to biased estimates of effect. In this case, genetic variation did not appear to be related to the major cardiovascular covariates given the well-balanced distribution of covariates across genotypes. In addition, population stratification, a form of confounding by ethnicity, can also lead to biased estimates. Simply put, if the frequency of LPA alleles was higher in certain ethnic subgroups and the incidence of MI was also higher in these same subgroups, then this would lead to spurious associations between the LPA gene variants and MI. However, as the authors point out, this type of confounding is unlikely in the relatively homogeneous populations studied. Also, if the LPA gene has pleiotropic effects on MI not mediated by lipoprotein(a) plasma levels, the analysis for the effect estimates of plasma lipoprotein(a) lowering on MI risk will be biased. In fact, such pleiotropic effects may exist for the LPA gene because the KIV-2 variants affect both lipoprotein(a) plasma levels and lipoprotein(a) isoform size. If smaller lipoprotein(a) isoforms, independent of plasma lipoprotein(a) levels, are true risk factors for MI, as has been suggested,9, 14 then the increase in MI risk attributed to increasing plasma lipoprotein(a) reported by Kamstrup et al may be overestimated, because of the combined effects of smaller lipoprotein(a) isoforms and higher plasma lipoprotein(a) concentrations in individuals with fewer KIV-2 repeats.

Taken in the context of previous observational studies of lipoprotein(a), the study by Kamstrup et al adds the strongest evidence to date that lifelong increased lipoprotein(a) plasma levels are causally related to MI. This is an important biological finding that elevates the status of lipoprotein(a), as a biomarker for MI, from putative risk marker to confirmed causal factor and that stimulates renewed interest in the biology of lipoprotein(a) and its role in cardiovascular disease. Although this study certainly provides interesting mechanistic insights into the biology of lipoprotein(a) in the context of MI and suggestive evidence regarding the potential benefit of decreasing lipoprotein(a) early in life, clinicians may ask: "How will these results affect current approaches for prediction, prevention, and treatment of my patients?" At present, the clinical implications remain quite limited. These results do not provide the necessary evidence that genetic testing of the LPA locus or measurements of plasma lipoprotein(a) have a role in cardiovascular risk stratification or decisions regarding lipid-lowering therapy. Ultimately, despite nature's best efforts to provide causal evidence for lipoprotein(a), only a true RCT demonstrating reductions in MI with targeted lipoprotein(a)-lowering therapy can provide the evidence for benefits and risks of a lipoprotein(a)-lowering strategy.

There is currently no well-tolerated medication that can specifically reduce lipoprotein(a) levels. Even if such an agent were to be developed, the modest 22% increase in risk with a lifelong doubling of plasma lipoprotein(a) raises the question whether a shorter duration of lipoprotein(a) lowering achieved by pharmacotherapy in adulthood would be effective in decreasing the risk of MI. By comparison, lifelong LDL reductions of only 15% have been reported to produce a 47% reduction in risk of cardiovascular disease,15 illustrating that, as a lifelong cardiovascular risk factor and as a pharmacologic target, LDL is likely much more potent than lipoprotein(a). These results only provide evidence that lower levels of lipoprotein(a) throughout life reduce the risk of MI. It is not known whether reductions later in life when MI risk is highest, and when most individuals would presumably be treated, would lead to similar reductions in the risk of MI.

In the short number of years following the complete elucidation of the sequence of all 3 billion base pairs in the human genome, a remarkable explosion of genome-wide association studies has provided evidence for common genetic variants numbering in the thousands—single-nucleotide polymorphisms (SNPs) as well as copy number polymorphisms (CNPs)—that underlie hundreds of diseases, as well as quantitative disease traits16 that are diagnosed and treated daily by readers of JAMA. To this catalog of robust and well-replicated genetic associations, a huge number of lower-frequency and very rare genetic variants identified by whole-genome sequencing will soon be added.

As the number of genetic associations continues to increase, clinicians will be increasingly confronted with mendelian randomization designs and should be aware that successful mendelian randomization studies, such as the lipoprotein(a) association with MI reported by Kamstrup et al,5 may be particularly uncommon, and many such studies may fail to identify true causal relationships, even when they exist, for a number of reasons. First, because the magnitude of effect is small to modest for most associations between SNPs and diseases, the sample size required for adequately powered mendelian randomization studies can be prohibitively large (>10 000 individuals) and many studies will be grossly underpowered. Second, the specific causal SNP and the mechanism of most SNP associations are often not obvious at the discovery stage and require further experimental work to identify the causal SNPs and the intermediate phenotypes with which to conduct these studies. Weak correlations between SNPs and intermediate phenotype, or between discovery SNPs and truly causal variants, will significantly attenuate estimated effect sizes. Third, most genome-wide association studies to date have been conducted in middle-aged and older adults, and the absence of an association between a SNP and an intermediate phenotype or a disease outcome may result from the influence of multiple cumulative environmental effects in older age and other gene-environment or gene-gene interactions that dilute a modest but real genetic effect that may be more apparent in early life.

Despite these important limitations, large-scale mendelian randomization will be increasingly used given the success of genome-wide association studies and the ease of genotyping in large, prospective cohorts such as the CCHS and many other cohorts around the world. Thus far, the mendelian randomization approach has successfully been used in cardiovascular disease to raise questions about any causal role for C-reactive protein17; to confirm the causal role of low-density lipoprotein15; and now, with the study by Kamstrup et al,5 to demonstrate the causal role of lipoprotein(a). Given the unique contributions of mendelian randomization to the understanding of biology, this approach will continue to provide one avenue for the evaluation of causal associations. Such studies will continue to demand careful interpretation, particularly when findings are negative, and any positive results will need to be placed in the appropriate biological and clinical context. Although nature's randomized trials may provide a window to evaluate causality, confirmatory evidence from human-made RCTs will continue to be required to inform clinical practice.


References

1. Danesh J, Collins R, Peto R. Lipoprotein(a) and coronary heart disease: meta-analysis of prospective studies. Circulation. 2000;102(10):1082-1085.
2. Bennet A, Di Angelantonio E, Erqou S; et al. Lipoprotein(a) levels and risk of future coronary heart disease: large-scale prospective data. Arch Intern Med. 2008;168(6):598-608.
3. Downs JR, Clearfield M, Weis S; et al. Primary prevention of acute coronary events with lovastatin in men and women with average cholesterol levels: results of AFCAPS/TexCAPS: Air Force/Texas Coronary Atherosclerosis Prevention Study. JAMA. 1998;279(20):1615-1622.
4. Shepherd J, Cobbe SM, Ford I; et al, West of Scotland Coronary Prevention Study Group. Prevention of coronary heart disease with pravastatin in men with hypercholesterolemia. N Engl J Med. 1995;333(20):1301-1307.
5. Kamstrup PR, Tybjærg-Hansen A, Steffensen R, Nordestgaard BG. Genetically elevated lipoprotein(a) and increased risk of myocardial infarction. JAMA. 2009;301(22):2331-2339.
6. Boerwinkle E, Leffert CC, Lin J, Lackner C, Chiesa G, Hobbs HH. Apolipoprotein(a) gene accounts for greater than 90% of the variation in plasma lipoprotein(a) concentrations. J Clin Invest. 1992;90(1):52-60.
7. Chasman DI, Shiffman D, Zee RYL; et al. Polymorphism in the apolipoprotein(a) gene, plasma lipoprotein(a), cardiovascular disease, and low-dose aspirin therapy. Atherosclerosis. 2009;203(2):371-376.
8. Kathiresan S, Willer CJ, Peloso GM; et al. Common variants at 30 loci contribute to polygenic dyslipidemia. Nat Genet. 2009;41(1):56-65.
9. Kronenberg F, Kronenberg MF, Kiechl S; et al. Role of lipoprotein(a) and apolipoprotein(a) phenotype in atherogenesis: prospective results from the Bruneck study. Circulation. 1999;100(11):1154-1160.
10. Luke MM, Kane JP, Liu DM; et al. A polymorphism in the protease-like domain of apolipoprotein(a) is associated with severe coronary artery disease. Arterioscler Thromb Vasc Biol. 2007;27(9):2030-2036.
11. Shiffman D, O’Meara ES, Bare LA; et al. Association of gene variants with incident myocardial infarction in the Cardiovascular Health Study. Arterioscler Thromb Vasc Biol. 2008;28(1):173-179.
12. Tregouet DA, Konig IR, Erdmann J; et al, Wellcome Trust Case Control Consortium; Cardiogenics Consortium. Genome-wide haplotype association study identifies the SLC22A3-LPAL2-LPA gene cluster as a risk locus for coronary artery disease. Nat Genet. 2009;41(3):283-285.
13. Ebrahim S, Davey Smith G. Mendelian randomization: can genetic epidemiology help redress the failures of observational epidemiology? Hum Genet. 2008;123(1):15-33.
14. Wu HD, Berglund L, Dimayuga C; et al. High lipoprotein(a) levels and small apolipoprotein(a) sizes are associated with endothelial dysfunction in a multiethnic cohort. J Am Coll Cardiol. 2004;43(10):1828-1833.
15. Cohen JC, Boerwinkle E, Mosley TH Jr, Hobbs HH. Sequence variations in PCSK9, low LDL, and protection against coronary heart disease. N Engl J Med. 2006;354(12):1264-1272.
16. Hindorff L, Junkins H, Mehta J, Manolio T. A catalog of published genome-wide association studies. http://www.genome.gov/26525384. Accessed May 5, 2009.
17. Zacho J, Tybjærg-Hansen A, Jensen JS, Grande P, Sillesen H, Nordestgaard BG. Genetically elevated C-reactive protein and ischemic vascular disease. N Engl J Med. 2008;359(18):1897-1908.