(Circulation. 1996;93:403-406.)
© 1996 American Heart Association, Inc.
Articles |
From the Department of Medicine, Duke University Medical Center, Durham, NC.
Correspondence to Robert M. Califf, MD, Professor of Medicine, Duke University Medical Center, 2024 W Main St, Durham, NC 27705.
Key Words: Editorials clinical outcome quality control
| Introduction |
|---|
|
|
|---|
The recent release of operator-specific outcomes data has ushered in a new terminology known as "scorecard medicine."2 3 The use of the term "scorecard," although having some negative connotations, provides symbolic insight into the issues that must be addressed by the profession. For decades in this country, thousands of fans have carefully followed the statistics of their favorite baseball players (without the benefits of a postgraduate mathematics degree). Intuitive familiarity through years of observation has led the public to widespread sophistication about interpreting these statistics. Small differences in batting averages early in the year are not regarded as important. Hitters with few "at bats" are not considered to have stable estimates of their hitting capabilities; a hitter with 3 hits in his first 10 at bats is not considered to be a ".300 hitter" but rather a batter who needs more exposure to determine his capabilities. Likewise, the fan understands that the runs-batted-in statistic is dependent on the ability of preceding hitters to get on base so that the hitter has an opportunity to drive in runs. Batting averages are often stratified as a function of batting against right-handed and left-handed pitching because of the recognition that expected outcomes differ depending on the pitching. In many ways, the medical scorecard has similar characteristics; the problem is that the important modifying factors are complex and poorly understood and the methods of adjusting for differences are even less well understood. As this field of quantitative outcomes evolves from its current early stage of development, we believe that constant exposure to medical outcome data will eventually lead to intuitive understanding similar to that of the interested baseball fan.
Just as managers in other industries depend on information systems to provide support for policy decisions, managers of organized medical systems (managed care organizations) will do the same. Assessment of the value of strategies of medical careincluding the value provided by individual practitionershas become a reality; clinicians and researchers must accept this fact and move forward quickly to guide the development of efficient, fair, and effective methods of measurement.
Outcomes can be defined in terms of cost, patient satisfaction with the delivery of medical care, freedom from adverse medical events, and quality of life in its many dimensions. The ease of measuring and understanding outcomes is directly related to the order in which they are listed above. Financial systems now allow measurement of cost at a detailed level, and the methods of analyzing cost data are common to all businesses. Similarly, patient satisfaction surveys are fundamentally similar to customer satisfaction surveys in other industries. Adverse outcomes can range from highly objective (mortality end points) to highly subjective (infarction rates, procedural success), as pointed out by Ellis and colleagues.1 Finally, the measurement of quality of life is extremely complicated because of its multidimensional nature and difficulty with subjective interpretation.
Clinical outcomes, previously the sole interest of the medical profession as an intellectual research endeavor, have become an area critical to the understanding of the value and quality of medical care. A central tenet of medical practice is that cost or satisfaction cannot be properly interpreted unless the corresponding medical outcomes are taken into consideration. The complexity of outcome measurement and interpretation must be demonstrated to consumers and managers of health care without self-serving attempts to derail the process, and methods to make outcome measures practical must be developed.
Because of its substantial expense and critical impact on life and death, procedure-based cardiovascular medicine has become a pivotal testing ground for medical outcome measurement. Ellis and colleagues provided important insight into many of the possibilities and pitfalls of using risk-adjusted outcome measures to assess the performance of individual practitioners. By developing a regression analysis to understand the major factors responsible for outcomes in percutaneous intervention and applying the analysis to individual operator outcomes, they provided data that can be used to evaluate the methods and applications of "cardiovascular scorecards."
The basis for using statistical models to assess practitioner performance is straightforward. The observed results of procedures are dependent on a number of factors, most of which are unrelated to the skill of the individual practitioner. Most importantly, the severity of illness of the patient far outweighs the effect of treatment in almost all quantitative studies. A practitioner performing procedures on higher-risk patients will obviously have worse outcomes than practitioners selecting "easy" cases. Risk adjustment with regression models is a statistical method of "leveling the playing field" to account for baseline differences so that the influence of care can be more precisely quantified.
The simplified risk-adjustment model developed by Ellis and colleagues provides an important first step, although considerably more work needs to be done to provide the best statistical model to level the field for interventional cardiology. For example, given the low event rates in their database, considerably more than 5000 patients would be required to accurately identify more than four or five major independent predictors of outcome without risking "overfitting" the model4 and to ensure that risk factors increasing the odds of adverse outcomes by 1.5-fold will be detected.5 Continuous characteristics such as ejection fraction and age can be modeled much more effectively as continuous functions; for example, although patients with an ejection fraction of 15% are obviously quite different in terms of risk than are patients with an ejection fraction of 39%, both would be assigned the same weight in the model developed by Ellis and colleagues. As large clinical databases are accumulated, without doubt the common risk factors for poor outcomes with medical and surgical treatment of coronary disease will emerge as prognostic indicators for percutaneous intervention. Rather than three-vessel disease or proximal left anterior descending coronary artery disease being critical, a continuum of risk almost certainly exists with more complex and diffuse lesions.6 7 Comorbid illnesses such as diabetes and peripheral vascular disease will also likely be identified as important predictors of outcomes after coronary intervention.8 9
Beyond developing regression models, Ellis and colleagues point out many of the potential pitfalls facing scorecarding; these problems include considerable uncertainty remaining after the factors in the models are considered, difficulty in making inferences about individual low-volume operators, subjectivity of outcome measures and baseline characteristics with corresponding ability to "game" the system, and lack of reliability in identifying outliers. To this list we add the lack of ability to define the success of an operator not only by the acute outcomes observed but also based on an understanding of the overall effect on the long-term outcomes of the population served.10
| Model Accuracy |
|---|
|
|
|---|
| Problems With Low-Volume Operators |
|---|
|
|
|---|
| Gaming the Results |
|---|
|
|
|---|
| Consistency of Results |
|---|
|
|
|---|
values are low; individuals on the border
of statistical significance with one outcome would understandably fall
within the 95% CI with another measure. These perturbations based on
marginal statistical differences should prompt caution on the part of
those who advise public disclosure of outcome information based solely
on "outlier" status. A more sensible method of displaying the
information would be to provide a rank-ordering of
operator-adjusted outcomes, identifying a range of providers with
results within the bounds of good practice and results deserving
commendation. Public disclosure of true-negative outliers should
require both definitive statistical evidence and a peer review. For
example, previous studies have found that although providers designated
as "outliers" can vary significantly depending on the
risk-adjustment methods used, overall hospital performance
rankings relative to their peers were remarkably
consistent.23 The fixation on "bad apple" identification is a component of scorecarding that is destructive to the practice of continuous quality improvement, as enunciated by Deming.24 This approach provides sensational headlines but drives the profession into a defensive posture rather than a positive engagement in the use of data to improve practice. Although we agree that operators with persistently poor results should undergo further scrutiny, we urge a shift in the fundamental focus of scorecarding. Variation in risk-adjusted outcome should serve as a starting point for which more detailed examination of care processes can be initiated. In this manner, the play of chance can be considered and the focus becomes a genuine search for the practices that lead to the best outcomes. After experiences in the northeast United States, physicians are wary of the potential impact of public release of scorecard data without sufficiently balanced presentation of the data. Because of the uncertainties outlined above, we strongly believe that public dissemination of poor outcome data should follow a peer review of practice. To the extent that such data collection systems can be supported and maintained by healthcare providers rather than government entities, they will not be subject to premature public release.
| Patient Selection and Long-term Benefits |
|---|
|
|
|---|
Returning to the scorecard analogy, we believe that public and professional interests will be best served if physicians "play ball" rather than "strike" against systems designed to assess quality and outcome. Despite all of the difficulties and uncertainties, the quest for quantitative evidence of quality in medical care will continue to gain momentum. We believe that systematic measurement of baseline risk and clinical outcome can provide a powerful tool for well-meaning operators to develop an understanding of how to practice better medicine. Patients and their families entrust their future with us when they choose to undergo percutaneous revascularization procedures. A voluntary engagement of the cardiovascular community could rapidly lead to a concise, well-defined set of data reflecting severity of illness, procedural methodology, and outcomes. We have a responsibility to improve our practice to produce the best outcomes possible. If cardiovascular practitioners do not heed the call to gather outcome data and improve practice through its use, nonpractitioners will impose policies and select providers without adequately considering the quality of care provided; our patients deserve our involvement.
| Footnotes |
|---|
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
D. M. Shahian, S.-L. Normand, D. F. Torchiana, S. M. Lewis, J. O. Pastore, R. E. Kuntz, and P. I. Dreyer Cardiac surgery report cards: comprehensive review and statistical critique Ann. Thorac. Surg., December 1, 2001; 72(6): 2155 - 2168. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. P. Brown, M. J. Mack, A. W. Simon, S. L. Battaglia, L. G. Tarkington, S. D. Culler, and E. R. Becker Comparing clinical outcomes in high-volume and low-volume off-pump coronary bypass operation programs Ann. Thorac. Surg., September 1, 2001; 72(3): S1009 - 1015. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. D. McGrath, D. E. Wennberg, J. D. Dickens Jr, A. E. Siewers, F. L. Lucas, D. J. Malenka, M. A. Kellett Jr, and T. J. Ryan Jr Relation Between Operator and Hospital Volume and Outcomes Following Percutaneous Coronary Interventions in the Era of the Coronary Stent JAMA, December 27, 2000; 284(24): 3139 - 3144. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. D. Peterson, E. R. DeLong, L. H. Muhlbaier, A. B. Rosen, H. E. Buell, C. I. Kiefe, and T. F. Kresowik Challenges in comparing risk-adjusted bypass surgery mortality results: Results from the Cooperative Cardiovascular Project J. Am. Coll. Cardiol., December 1, 2000; 36(7): 2174 - 2184. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Kereiakes, S. C. Smith, A. K. Jacobs, M. J. Kern, and D. P. Faxon Angioplasty for acute myocardial infarction in community hospital without surgical back-up: Response to Wharton and Angelini publications "should guidelines be changed?: Not whether but when" J. Am. Coll. Cardiol., July 1, 2000; 36(1): 299 - 300. [Full Text] [PDF] |
||||
![]() |
S. G. Ellis, D. Miller, T. F. Keys, K. Brown, R. Ellert, G. Howell, A. M. Lincoff, and E. J. Topol Comparing physician-specific two-year patient outcomes after coronary angiography: Methodologic issues and results J. Am. Coll. Cardiol., April 1, 1999; 33(5): 1278 - 1285. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Kastrati, F.-J. Neumann, and A. Schomig Operator volume and outcome of patients undergoing coronary stent placement J. Am. Coll. Cardiol., October 1, 1998; 32(4): 970 - 976. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Jollis, E. D. Peterson, C. L. Nelson, J. A. Stafford, E. R. DeLong, L. H. Muhlbaier, and D. B. Mark Relationship Between Physician and Hospital Coronary Angioplasty Volume and Outcome in Elderly Patients Circulation, June 3, 1997; 95(11): 2485 - 2491. [Abstract] [Full Text] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Circulation Home | Subscriptions | Archives | Feedback | Authors | Help | AHA Journals Home | Search Copyright © 1996 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. |