| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
(Circulation. 2001;103:e101.)
© 2001 American Heart Association, Inc.
Clinical Cardiology: Physician Update |
From the Cardiovascular Division, Brigham and Womens Hospital, Boston, Mass.
Correspondence to Elliott M. Antman, MD, Cardiovascular Division, Brigham and Womens Hospital, 75 Francis Street, Boston, MA 02115. E-mail eantman{at}rics.bwh.harvard.edu
Key Words: drugs statistics cost-benefit analysis trials myocardial infarction
| Introduction |
|---|
|
|
|---|
| Design of Clinical Trials |
|---|
|
|
|---|
Randomized controlled trials typically involve the
randomization of patients to either the control or test treatment (ie,
randomized concurrent control). These trials are the gold standard for
evaluating new therapies, and they form the foundation for the highest
level of recommendations in practice guideline documents that stress
evidence-based medicine5 (Figure 1
). Randomization has 3 important influences that
explain why it is considered the standard for trial design: (1) it
reduces the likelihood of patient selection bias that may occur either
consciously or unconsciously; (2) it enhances the likelihood that
comparable groups of subjects are compared, especially if the sample
size is sufficiently large; and (3) it validates the use of common
statistical tests such as the
2 test for
comparison of proportions and Students t test for comparison of
means.6
|
A stratified randomization scheme is typically used when it is considered important to achieve a balance of key baseline characteristics (eg, location of infarction) among the treatment groups. The ratio of randomization to the control and test therapies may be equal or unbalanced, allowing investigators to acquire more information about the test therapy (eg, risk of intracranial hemorrhage with a new fibrinolytic) while still maintaining a comparison with a control group. Because many clinical trials in cardiology are multicenter in nature, an attempt is made to achieve balance between the treatment assignment at each enrolling center by constraining the randomization to the treatment groups so that the desired ratio occurs in blocks of patients (eg, every 6 or 8 patients) at each center.
When the investigator selects the subjects to be allocated to the control or treatment groups, the study is referred to as a nonrandomized, concurrent control trial.7 Unlike randomized controlled trials, such trials are subject to bias because it may be difficult for investigators to match the test and control groups adequately. Clinical trials using historical controls compare a test intervention to data obtained earlier in a nonconcurrent, nonrandomized control group.7 The use of historical controls allows clinicians to offer potentially beneficial therapies to all subjects, thereby reducing the sample size for the study. The major drawbacks are bias in the selection of the control population and potential failure of historical controls to reflect contemporary diagnostic criteria and current treatment regimens for the disease under study.
Other trial designs useful for evaluating cardiovascular treatments are the crossover design and withdrawal study. The crossover design is actually a special case of the randomized controlled trial in which each subject serves as his or her own control.7 Comparisons are made between the treatment response seen during the first treatment to which the patient is allocated and during subsequent treatments. In withdrawal studies, patients with a chronic cardiovascular condition are taken off therapy and a comparison is made between the treatment effect observed on therapy with that observed off therapy. Potential limitations of withdrawal studies are that only patients who have tolerated the test intervention for a period of time are eligible for enrollment and changes in the natural history of the disease may influence the response to withdrawal of therapy.
Given the multitude of drugs administered simultaneously to patients with
cardiovascular diseases, an increasingly important area
of investigation is drug interactions. Trials that test
2 therapies
simultaneously typically use a factorial design. Such
trials are most appropriate when there is thought to be no interaction
between the test treatments. If such interactions are known to exist or
are discovered to exist, it is important to evaluate each test
treatment against a control treatment.
To minimize the possibility of bias, blinding (sometimes referred to as masking) of treatment assignment is used. When only the patient is unaware of the treatment assignment, the trial is single blind. If the investigator is also blinded, the trial is double blind.
The responsibility for monitoring evolving efficacy and safety on an interim basis during the conduct of the trial rests with a Data Safety Monitoring Board or Committee.8 It is usual practice to prespecify the level of statistical evidence the Data Safety Monitoring Board might observe at an interim analysis that might lead to a recommendation of premature discontinuation of the trial because of overwhelming evidence of benefit or harm from the test therapy (eg, stopping boundaries). Investigators and sponsors sometimes also prespecify a calculation of the conditional probability of the trial achieving its objective(s) given the observations at a particular interim look (eg, when half the expected number of patients have been enrolled or half the expected number of events have occurred). Potential recommendations that the Board might make include increasing the sample size if the event rate is lower than expected to increase the power of the study or discontinuation of the study for futility.9
| Statistical Considerations |
|---|
|
|
|---|
=false-positive statement that erroneously declares there is a
difference between the treatments) is typically 2-sided and set at the
5% level. The type II error (ß=false-negative statement that
erroneously declares there is no difference between the treatments) is
typically set between 10% and 20%, such that the power of the trial
(1-ß) is between 90% and 80%, respectively. The combination of
and ß errors determines the sample size of the trial. Fortunately, the mortality rate from cardiovascular illnesses continues to decline. The implication of a low event rate in the control group is that tens of thousands of patients must be randomized to show a difference between treatments.10 Many trials in cardiology therefore use a composite end point, such as death plus nonfatal myocardial infarction, to maintain a practical size for the trial. When interpreting the results of trials with composite end points, it is important for clinicians to note whether the direction and magnitude of the treatment effect is similar for each of the elements of the end point.
In cardiovascular therapeutics, several
efficacious treatments may coexist for a given condition. Potential
differences in clinically important areas such as tolerability, ease of
administration, and cost may lead investigators to perform a trial
demonstrating therapeutic equivalence of 2 treatments. Because it is
not possible to show that 2 active therapies are completely equivalent
without a trial of infinite sample size, investigators specify a value
(
) and consider the test therapy equivalent to the standard therapy
if, with a high degree of confidence, the true difference in treatment
effects is less than
.11
In this case, the null hypothesis states that the rate of events in
patients receiving the test therapy exceeds the rate in patients
receiving control therapy by
. The alternative hypothesis states
that the rate of events in patients receiving the test therapy is less
than the rate in patients receiving control therapy plus
. In a
classical equivalence trial, if the effects of the 2 treatments differ
by more than the equivalence margin (ie,
) in either direction, then
equivalence is said not to be present. In practical terms, the
usual objective in clinical trials of 2 active therapies is to
establish that the new therapy is not worse than the standard therapy
by more than
.3 Such one-sided comparisons are referred to as noninferiority trials
(Figure 2
). The new therapy may satisfy the definition of
noninferiority but, depending on the results, may or may
not actually show superiority compared with the standard therapy. The
and ß errors of the noninferiority trial determine
the sample size just as for superiority trials.
|
It is important that investigators prespecify the
noninferiority margin before learning the trial results to
avoid the bias that might be introduced by retrofitting a
noninferiority margin such that the test therapy satisfies
the definition of noninferiority. Specification of the
appropriate margin or
is a challenging area involving the desire of
regulatory authorities to be assured that the test therapy is at least
superior to placebo and the desire of clinicians to set
at a
clinically meaningful difference between treatments.
| Critical Readings of Clinical Trials |
|---|
|
|
|---|
|
Because many clinical trials in cardiology
involve a comparison of the event rates in 2 groups of patients, those
allocated to control and test therapies, it is convenient to summarize
the data in a 2x2 table, such as that in
Figure 3
. The event rates in the groups are compared using a
2 test or Fishers exact test to
determine the statistical significance of the difference in event
rates.14
|
Several different statements can be constructed to describe the treatment effect. The degree of imprecision of the estimate of the treatment effect is typically presented in the form of 95% confidence intervals. Convenient terms used in reporting clinical trial results are the relative risk and odds ratio. As the rate increases in the group allocated to control, the odds ratio deviates farther from the relative risk, and clinicians should rely more on the latter.
It is important to scrutinize all statements describing the treatment effect to gain a comprehensive picture of the magnitude of the observation and its implications for clinical practice. If practitioners are given clinical trial results only in the form of relative risk reduction, they tend to perceive a greater effectiveness of the test intervention than if a more comprehensive statement is provided that includes the absolute risk difference and the number of patients who need to be treated to prevent one event.15
Against the benefits associated with a test therapy, clinicians must weigh the risks associated with its use. Terms that describe the harmful effects of test therapies include the absolute risk increase (the absolute increase in events with the test therapy compared with control therapy) and the number needed to harm (1/absolute risk increase). The composite of benefit and harm has sometimes been expressed as net clinical benefit. An example is the composite of lives saved with fibrinolytic therapy for ST elevation myocardial infarction plus the number of patients who survive but suffer a severe disabling stroke from intracranial hemorrhage.16
When weighing the evidence from a single clinical trial for a treatment decision in an individual patient, physicians must consider more than the level of significance of the findings.17 A judgment must be made about whether the patient is representative of the type of patients enrolled in the trial. Additional data from other related trials should be incorporated in the decision-making process. Pooling of trial data and synthesis of the information in the form of a meta-analysis, if available, may be helpful.18 The complex interplay of benefit, harm, and cost are best analyzed with the techniques of decision analysis and cost-effectiveness analysis, and clinicians should familiarize themselves with such analyses for the therapy of interest if they are available.
| References |
|---|
|
|
|---|
2. Meinert C. Clinical trials: design, conduct, and analysis. In: Lilienfeld A, ed. Monographs in Epidemiology and Biostatistics. Vol. 8. New York: Oxford University Press; 1986:6570.
3.
Temple R, Ellenberg SS. Placebo-controlled trials and active-control trials in the evaluation of new treatments, part 1: ethical and scientific issues. Ann Intern Med. 2000;133:455463.
4.
Ellenberg SS, Temple R. Placebo-controlled trials and active-control trials in the evaluation of new treatments, part 2: practical issues and specific cases. Ann Intern Med. 2000;133:464470.
5.
Braunwald E, Antman EM, Beasley JW, et al. ACC/AHA guidelines for the management of patients with unstable angina and non-ST-segment elevation myocardial infarction: a report of the American College of
Cardiology/American Heart Association Task Force on
Practice Guidelines (Committee on the Management of Patients With
Unstable Angina). J Am Coll Cardiol. 2000;36:9701062.
6. Friedman L, Furberg C, DeMets D. Fundamentals of Clinical Trials. 3rd ed. Littleton: PSG Publishing; 1998:4160.
7. Food and Drug Administration. International conference on harmonization: choice of control group in clinical trials. Federal Register. 1999;64:5176751780.
8. DeMets DL, Pocock SJ, Julian DG. The agonizing negative trend in monitoring of clinical trials. Lancet. 1999;354:19831988.[Medline] [Order article via Infotrieve]
9. Ware J, Muller J, Braunwald E. The futility index: an approach to the cost-effective termination of randomized clinical trials. Am J Med. 1985;78:635643.[Medline] [Order article via Infotrieve]
10. Collins R, MacMahon S. Reliable assessment of the effects of treatment on mortality and major morbidity, I: clinical trials. Lancet. 2001;357:373380.[Medline] [Order article via Infotrieve]
11.
Ware JH, Antman
EM. Equivalence trials. N Engl J
Med. 1997;337:11591161.
12.
Guyatt GH,
Sackett DL, Cook DJ. The medical literature. users guides to the
medical literature, II: how to use an article about therapy or
prevention, A: are the results of the study valid?
JAMA. 1993;270:25982601.
13.
Guyatt GH,
Sackett DL, Cook DJ. the medical literature. users guides to the
medical literature, II: how to use an article about therapy or
prevention, B: what were the results and will they help me in caring
for my patients? JAMA. 1994;271:5963.
14. Glantz S. Primer of Biostatistics. 3rd ed. New York: McGraw-Hill; 1992:110154.
15.
Bucher H,
Weinbacher M, Gyr K. Influence of method of reporting study results on
decision of physicians to prescribe drugs to lower
cholesterol concentration.
BMJ. 1994;309:761764.
16.
The Global Use of
Strategies to Open Occluded Coronary Arteries (GUSTO III)
Investigators. A comparison of reteplase with alteplase for acute
myocardial infarction. N Engl J
Med. 1997;337:11181123.
17.
Myerburg RJ,
Mitrani R, Interian A, et al. Interpretation of outcomes of
antiarrhythmic clinical trials: design features and population impact.
Circulation. 1998;97:15141521.
18. Lau J, Ioannidis JP, Schmid CH. Summing up evidence: one answer is not always enough. Lancet. 1998;351:123127.[Medline] [Order article via Infotrieve]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Circulation Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 2001 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |