Antipsychotic (neuroleptic) medications are an important therapeutic option for many individuals with schizophrenia and other psychoses. For these medications to be maximally beneficial, they must have an acceptable side effect profile and be taken as prescribed. One untoward effect of many antipsychotic drugs is weight gain (1). The extent of weight gain apparently varies by drug, which may be because of the drugs’ differing degrees of action on the serotonergic (2), dopaminergic (3), cholinergic (2), histaminergic (4), and other neurotransmitter systems.
Obesity is a threat to health and longevity (5). Given that over one-third of the adults in the United States are obese (6), practices causing major weight gain deserve careful consideration. Obesity and weight gain have been associated with hypertension, type II diabetes, coronary heart disease, stroke, gallbladder disease, osteoarthritis, sleep apnea and respiratory problems, and some types of cancer (endometrial, breast, prostate, and colon) (7). Moreover, obesity is a common concomitant of schizophrenia (8), and schizophrenic individuals appear to be at increased risk for certain obesity-related conditions such as type II diabetes and cardiovascular disease (9–12).
Weight gain may also cause patients taking antipsychotic medications to discontinue their medications, which may predispose them to relapse (1). Historically, the extrapyramidal side effects of antipsychotics outweighed any nonextrapyramidal side effects. With the advent of newer "atypical" antipsychotics, extrapyramidal side effects are becoming less of a problem. These recent developments in antipsychotics have made it imperative to revisit the topic of antipsychotic-induced weight gain. Therefore, we conducted a comprehensive, quantitative review of the research literature regarding the amount of weight gain associated with each antipsychotic drug available or undergoing clinical trials in the United States.
Antipsychotics eligible for inclusion were those that are approved for use as antipsychotic agents in the United States or that were not currently approved but were under investigation in humans for use as antipsychotics. A list (t1) was compiled from Hyman et al. (13), the 1997 edition of the Physicians’ Desk Reference, and expert colleagues.
To avoid publication bias (14, 15) we retrieved both published and unpublished studies and conducted the most comprehensive search possible according to White’s guidelines (16). The search consisted of the following. 1) References were searched for with the use of the computerized databases MEDLINE (1966 to November 1996), PsychINFO (1967 to October 1996), CINAHL (1982 to September 1996), HealthSTAR (1975 to October 1996), and Dissertation Abstracts International (1861 to January 1997). (Contact the first author for the search terms used.) 2) In an "ancestry analysis" (17), references were obtained from bibliographies of articles retrieved through computerized literature searches. 3) Several types of consultation were used to retrieve further information: informal consultation with expert colleagues in the field; contacts with authors of primary studies obtained through other search procedures, requesting more information and asking whether they knew of additional data of which we should be aware; and registered letters sent to the manufacturer of each compound under study, requesting a list of published and unpublished studies with respect to that compound and weight gain. To companies that provided data and/or expressed an interest (Janssen, Eli Lilly, Pfizer, Zeneca), we offered the opportunity to check our raw data files on their compounds for accuracy.
The literature search yielded over 350 reports, which were then screened for eligibility. To be eligible for this review, a study had to include human subjects, have a sample size greater than one, not be a review article, investigate at least one compound listed in t1, and measure weight change after initiating use of the drug.
English- and non-English-language articles were considered. Four non-English articles were located and read by individuals fluent in the articles’ languages. Only an article by Aberg (18) contained sufficient information and met the eligibility criteria. Six studies met the criteria but were rejected because they investigated prenatal exposure to neuroleptic drugs (one study) or studied patients suffering from anorexia nervosa or Huntington’s chorea (five studies). In one case, only part of a study was used; specifically, from a study by Heimberg et al. (19) that compared individuals who were on a weight-reducing diet and taking clozapine with those who were not on such a diet but taking clozapine, only the data on the group not in the diet condition were used, because the diet condition did not represent usual conditions of use.
+
Coding and Data Extraction
Studies were coded by one investigator (J.L.M.) and spot-checked by one of two other investigators (M.H. or D.B.A). When a discrepancy was found (a fairly rare event), the coders met to discuss and resolve the discrepancy.
The mean and standard deviation of weight change and the size of each group were the three essential pieces of information needed from the studies. In many cases, these data were reported directly in the article and simply recorded. However, in other cases, they were not. In this latter situation, one of several approaches was taken in the following order of preference.
1. Missing means, standard deviations, or sample sizes were directly calculated by using other information available in the article (for example, t, F, or p values) and standard statistical formulas (20).
2. If the article was published in 1990 or later, we attempted to contact the authors for more information.
3. Two other procedures were used to estimate (rather than directly calculate) the necessary statistics. One method was used when data were presented in "binned" categories (e.g., "Ten percent of the patients gained no weight, 30% gained 0–5 pounds, 40% gained 5–15 pounds, and 20% gained more than 15 pounds"). In these situations, by using the categories and the proportions of subjects in each category, the missing mean and/or standard deviation was estimated by maximum likelihood methods; that is, we simply found the estimates of the means and the standard deviations that maximized the likelihood of the observed data by using the normal distribution likelihood function (21). The second method was used when the standard deviation was not reported but the range was (e.g., "Weight change ranged from –4 kg to +15 kg"). In this case we adapted the approach of Tippett (22), who published tables that, given the sample size, provide the expected ratio between the sample range and the standard deviation. Using Tippett’s method, we estimated the standard deviation.
4. If only the standard deviation was missing, it was estimated as the square root of the weighted average variance across all other studies where the weights used were the sample sizes in each study. It was necessary for a standard deviation to be available in order to estimate the variance of the mean for each study, so that the inverse of this variance could be used as a weighting factor in subsequent analyses.
Finally, if none of these methods could be used to estimate the mean, standard deviation, and size of a study sample or a subgroup within a study, that study or subgroup was excluded from further consideration in the formal statistical meta-analysis. The total number of studies yielding usable data was 81. These studies yielded a total of 418 estimates of weight change in some antipsychotic drug condition or nondrug control condition. Of these 418 data points, 96.7% of the means, 69.6% of the standard deviations, and 100% of the numbers of study subjects were obtained by transcription or calculation, and the remainder by some form of estimation or imputation. t2 shows the mean and range of time on medication (in weeks) for the observed data points on each drug.
Before the statistical meta-analysis was conducted, a verbal overview was done, because several articles provided descriptive data on weight change that could not be included in the quantitative analysis but nevertheless offered some information. Key quotations that characterized the effect of the drugs in question were extracted from such articles.
Statistical analyses were conducted with SPSS, version 7.5 (23). The effects of antipsychotic drugs were analyzed separately for each drug, since preliminary analyses indicated marked differences among the specific compounds in terms of their effects. Because most studies did not include a placebo comparison group, the effect size we used was the raw weight change from baseline to posttreatment. Only 18 studies included placebo comparisons. By using the pretreatment-to-posttreatment weight change in all studies, we were able to make full use of all of the available data.
Since there were 19 different drugs/conditions (including placebo; nonpharmacologic, nonplacebo control; and polypharmacy), 19 separate analyses were conducted (one for each condition). For each condition we attempted to calculate the weighted mean weight change and standard error based on both a fixed effects model (24) and a random effects model (24). Although both the fixed and random effects estimates are presented in the tables, only the random effects estimates are discussed in the text, given the significant heterogeneity present for most compounds (see the Results section).
For each drug, when sufficient data (i.e., six or more data points) were available, we regressed mean weight change on standardized drug dosage and length of treatment. One older, poorly controlled study (25) was eliminated because it was an outlier, and its exceptionally long follow-up of 11 years caused it to act as a leverage point (26); all of the other follow-ups were less than 200 weeks long. These regressions were conducted as weighted least squares multiple regressions, where the weights were equal to the inverse of the variances of the dependent observations. To more reasonably compare drugs by controlling for different dosage levels, we calculated standardized doses by dividing the actual doses used in the studies by the midpoint of the recommended dose range and taking the natural log of the resulting ratio. (Although we adhered to this procedure for all drugs in the interest of consistency, it is possible that in some cases, the midpoint of the recommended dose range may not have been the best estimate of the standard dose. Therefore, for the atypical antipsychotics, haloperidol, and thioridazine [the most commonly used drugs], we conducted a sensitivity analysis by recomputing the results. We replaced the standardized dose first with the typical dose in chlorpromazine equivalents according to APA’s Practice Guideline for the Treatment of Patients With Schizophrenia[27] and second with the average dose used in clinical settings as reported in the peer-reviewed literature.) Recommended dose ranges were obtained from the appendix of a consensus report (28), the Physicians’ Desk Reference, or the drug manufacturer. The regression equation we used was Δkg=β0 + β1(weeks – 10) + β2(weeks – 10)2 + β3(D) + β4(D)2 + e, where Δkg is weight change in kilograms, the βs are parameters to be estimated, weeks is number of weeks of treatment, D is the standardized dose calculated as described above, and e is an error term. In this equation, β0 is a direct estimator of weight change at 10 weeks at the standard dose. For placebo, nonpharmacologic control, and polypharmacy, dosage information was not included in the regression.
Using the aforementioned equation, we estimated the weight-promoting effects of each drug at the midpoint of its recommended dose at 10 weeks with the use of both fixed effects (29) and random effects (30) models. Ten weeks was chosen as the time point because this value required no extrapolation beyond the observed data for any drug.
Finally, we used pairwise comparisons for the estimated weight changes at 10 weeks at the standard dose of each compound. The significance of differences was tested with a z statistic. The quantity (θi – θj)/(SE2[θi] + SE2[θj]) is asymptotically (in the number of subjects not the number of means) distributed as a standard normal deviate, where θi and θj are the estimates of weight change for the ith and jth compounds, respectively (29). To account for multiple comparisons, we used Monte Carlo simulation with 100,000 simulated data sets to determine the z value that, given the number of tests being conducted, would hold the overall alpha rate to the two-tailed 0.05 level. The simulated data were generated from a model with normal distribution based on the sample sizes we had. (For the concept behind this approach, see reference 31.) The critical z value obtained was 3.31. Therefore, any pairwise comparison yielding a z statistic greater in absolute value than 3.31 is statistically significant even after accounting for conducting multiple comparisons. This is slightly less conservative than the 3.41 required for the ordinary Bonferroni correction.
t3 displays the results from the verbal overview. The statements regarding specific drugs may be useful to clinicians and patients considering use of these drugs. On a very general level, two conclusions can be drawn from this tabulation. First, many drugs do seem to induce clinically meaningful weight gain. Second, many authors report their weight gain data in an incomplete, idiosyncratic, and poorly defined manner. This is clearly an area that would benefit from guidelines and standardization.
t4 displays the results from the quantitative meta-analysis in detail. (Because of space limitations, studies used in the meta-analysis but not cited are not listed in the reference list. A complete reference list can be obtained from the first author.) The second column in table 4 indicates the estimated mean weight change across all studies with the use of a fixed effects model (29) and the 95% confidence interval for that mean. These means, though interesting, are probably not maximally informative, because the studies varied greatly in terms of length of treatment and dosage. This heterogeneity among studies is indicated by the chi-square test for heterogeneity in the third column. In almost all cases, the values are highly significant, indicating that different studies with different durations and different dosages gave different answers. Therefore, we used a random effects estimate in the fourth column. This takes between-study variation into account but does not specifically attribute this variation to sources such as study duration and dosage. In the fifth column there is an estimate of the 10-week weight change based on a fixed effects regression (29) as described earlier. The chi-square values to the right of this column are calculated according to Raudenbush (30). As can be seen, for many drugs the heterogeneity tests were no longer significant, indicating that accounting for dosage and duration adequately explained the variation among studies. However, in some cases, the chi-square statistics were still significant, suggesting the importance of using a random effects model. The final column contains the point estimates for each drug for patients on standard doses for 10 weeks. These were calculated by means of the regression described earlier. We believe that these estimates in the final column of t1 are the most reasonable estimates. F1 summarizes these results graphically. Several points are noteworthy.
First, subjects in placebo conditions typically lost about 0.74 kg. This may be because in many of the placebo-controlled studies, subjects were taking some other neuroleptic drug before the trial. Therefore, when this drug was removed, some of the weight gain it previously induced may have been lost. Another possibility is that studies including placebo usually have acutely psychotic subjects, and food intake may be lower in individuals whose acute psychotic symptoms are not improved.
Two drugs, molindone and pimozide, were also associated with weight loss. In the case of molindone, this has been reported previously in the literature (50–52). Although the estimated weight loss with molindone (–1.06 kg) was significant overall, the estimated loss at 10 weeks (–0.39 kg) was not significant. For pimozide, the estimated weight loss was 2.69 kg (in the random effects model), but the standard error was quite large and the estimate was not significantly different from zero.
For other drugs, the degree of weight gain, estimated by the random effects regression at 10 weeks, ranged from 0.04 kg for ziprasidone (not significantly different from zero) to 4.45 kg for clozapine. Among the five new atypical antipsychotics in the study (ziprasidone, risperidone, sertindole, olanzapine, and clozapine), ziprasidone had the lowest weight gain (0.04 kg) and clozapine had the highest (4.45 kg). t5 contains z statistics and p values for pairwise significance tests comparing the estimated 10-week weight changes for patients taking the specific compounds. Although reported data were somewhat limited, there was little apparent difference across drugs in the average age of subjects in the studies and in the percentage of male subjects.
Finally, t6 shows results of a sensitivity analysis of estimated 10-week weight gains (random effects model) based on different definitions of standard dose. The results are quite robust to the choice of standard dose except for clozapine, which does not show sizably different weight gains (between 2.96 and 4.45 kg) across the different standard doses.
Most neuroleptic drugs were associated with weight gain. It does not appear as though any of this weight gain can be attributed to a placebo effect, since patients on placebo appear to have lost weight. The degree of weight gain clearly increased with time for the drugs considered. Weight gain was estimated at 10 weeks because there were many data for this time interval. However, estimated weight gain while patients are taking a drug for longer periods would be expected to be substantially higher. This expectation is based on both the physics and the physiology of weight gain (59) and empirical observations from studies of selected compounds for which longer-term data were available (60).
+
Limitations of the Study
This study has several limitations. First, standard errors were calculated under the assumption that all observations were independent, which was not true in every case because some studies assessed subjects at repeated time points. When such data were available, we included all data points in the interest of using all available information. Our estimates of weight change with use of the weighted least squares method remain accurate (i.e., unbiased), but their standard errors may be too small. Ordinarily, one would take this dependency into account through the use of generalized least squares estimation (26). Unfortunately, generalized least squares implementation requires knowledge of the covariance structure among the observations, and this information was not available. Therefore, the standard errors presented here and the significance levels based on them may, in some cases, be biased. To estimate the plausible degree of this bias, we assumed that all dependent observations had a correlation as high as 0.90 and conducted the fixed effects regression analyses through generalized least squares. The largest putative change in standard error for any drug occurred with chlorpromazine and was 59%. For no other drug did the increase in estimated standard error exceed 4%. Thus, this sensitivity analysis suggests that our standard errors are unlikely to have been underestimated to any substantial degree.
A second limitation concerns our inability to examine the extent to which weight change with antipsychotic drugs varied as a function of patients’ characteristics, such as age, sex, and initial body mass index. Unfortunately, the limited information presented in each study on the distributions of age, sex, and starting body mass index and the limited number of studies available for each drug precluded inclusion of terms for such patient characteristics in the metaregressions.
A third limitation is that for most drugs, insufficient information was available to provide precise estimates of weight change when patients were on the drug for extended periods of time, such as 6 months or more. Although we initially attempted to calculate such estimates, this frequently required extrapolations outside the observed range of data, and the resulting estimates were often extremely imprecise (i.e., had very large standard errors).
To our knowledge, this is the most comprehensive literature synthesis on antipsychotic-induced weight gain to date. Although it is plausible that some studies assessing the effect of antipsychotic medications on body weight were not discovered by our literature search, our procedures kept this to a minimum. We conducted a thorough search of the electronic literature and made efforts to access undetected literature through the "Invisible College" and formal contacts with pharmaceutical companies. Moreover, we conducted electronic searches of databases that also include unpublished literature, such as Dissertation Abstracts International and PsychINFO.
A common problem in meta-analysis is inadequate and incomplete reporting of key information in the primary articles. We attempted to minimize the impact of such incomplete reporting by contacting authors when feasible. Moreover, for studies that did not yield sufficient information to be included in the formal quantitative synthesis, we still attempted to extract whatever information was available in these reports and provide that in a structured verbal overview (t2).
Our literature retrieval procedures also maximized the chances of obtaining relevant unpublished data. Publication bias is a commonly acknowledged problem in applied research. This problem confronts all literature reviewers, whether they conduct formal meta-analyses or not. We believe our efforts were quite strong in this regard and therefore should serve to minimize publication bias.
In the early days of chlorpromazine pharmacotherapy, Planansky and Heilizer (61) reported that weight gain was associated with symptom improvement, and weight loss was associated with symptom deterioration. The subsequent availability of multiple antipsychotic medications has led to the observation that weight gain is a common side effect of antipsychotic treatment. There is some conjecture that the drugs which cause the most weight gain are the most effective. However, results are inconsistent and equivocal (58, 62–65), and further research is needed on this question.
How clinically meaningful are these degrees of weight gain? Some compounds were estimated to produce close to a 5-kg weight gain at 10 weeks. Furthermore, the shorter-term controlled studies usually included data on all subjects, whereas long-term use was usually restricted to individuals showing a positive therapeutic response to the drug. If therapeutic response and weight gain are correlated (which may or may not be true), then this would imply that the 10-week weight gain may be higher than we have estimated. On the basis of the compounds for which longer-term data were available (chlorpromazine, clozapine, and olanzapine), it seems clear that although weight gain might reach a plateau after a certain period (e.g., for olanzapine, after 4–5 months), total weight gain will still be much larger after periods longer than 10 weeks. Thus, weight gains far in excess of 5 kg may be seen in patients on long-term therapy. However, even a weight gain of 5 kg will represent a weight gain of more than 5% of initial body weight for the majority of individuals. To place this in perspective, it is useful to consider that a number of authoritative bodies, such as the Institute of Medicine (66), have suggested that weight losses of as little as 5% in obese individuals can result in clinically meaningful reductions in morbidity and risk of early mortality. It might be plausible, then, to expect that increases in body weight of as much as 5% would result in corresponding increases in morbidity and risk of early mortality.
Although the literature assessing the effects of weight gain on "hard" end points reveals a complex set of relations and modifiers, certain general conclusions can be drawn. For end points such as mortality (67, 68), incidence of cancer (69, 70), cardiovascular disease (71, 72), and diabetes (73), when factors such as smoking are accounted for, it appears that weight gains of 5% or greater during the adult lifespan are associated with important increases in risks. This is especially true for individuals who are overweight to begin with. Finally, although results are clearly preliminary, emerging data suggest that the drugs causing weight gain (i.e., clozapine and olanzapine) may, perhaps as a result, also be causing type II diabetes (74–77). Clearly, then, antipsychotics can induce medically meaningful degrees of weight gain.
Weight gain induced by antipsychotic drugs may also discourage patients from reliably taking their medication. This would, in turn, increase the likelihood of relapse. Although we are aware of no data that would allow precise quantification of the impact of weight gain on compliance with medication, we have observed that a number of patients complain of weight gain and occasionally report it as a reason for noncompliance. On the other hand, in studies conducted with olanzapine, Tollefson et al. (78) and Beasley et al. (60) found that for acute trials and studies lasting a year, drug discontinuation attributed to weight gain was quite rare. For example, in a 6-week study, Tollefson et al. (78) found that "none of 1,336 olanzapine-treated patients discontinued early because of weight gain."
Given this background, it is important to consider methods for minimizing the impact of weight gain induced by antipsychotic drugs. One approach might be to implement weight control procedures with schizophrenic individuals who are taking antipsychotic medications. Several such efforts have been made in the past, including pharmacologic (79, 80) and nonpharmacologic (81–85) approaches. In some cases, particularly with subjects in inpatient settings, results have been good. However, results with outpatients are less clear, and more research on this topic is needed. Some investigators have even begun to explore the potential of amantadine in pharmacologic treatment specifically for neuroleptic-induced weight gain (86–88), but this is not a generally accepted treatment at this time.
The use of pharmacologic treatments of obesity with this population may present challenges. With one exception (89), all pharmacologic agents for the treatment of obesity that are currently available or likely to be released in the very near future achieve their effects by increasing noradrenergic, dopaminergic, and/or serotonergic activity (90). In contrast, antipsychotic medications typically achieve much of their effect by decreasing dopaminergic and serotonergic activity. Therefore, the use of pharmacologic agents to treat obesity in individuals with schizophrenia may exacerbate their psychotic symptoms (91–95). Indeed, weight loss itself has even been reported to provoke psychotic symptoms in rare cases (96). Therefore, any use of centrally acting pharmacologic agents to treat obesity in this population should be undertaken with the utmost caution and, in our opinion, be preceded by well-controlled clinical trials to establish efficacy and safety.
Finally, the selection of the right compound for the right patient might minimize the impact of weight gain with antipsychotic medications. There are schizophrenic individuals who are sufficiently thin that weight gain would likely be harmless and perhaps even beneficial (8). In such cases, not all weight gain will necessarily represent fat. Studies also indicate that weight gain is highest in individuals with a low baseline body mass index (60). Although these patients are rare, for such patients there would be little reason to avoid the use of drugs that produce greater degrees of weight gain. However, for patients with an average body mass index or higher or patients with a history of obesity, clinicians may wish to consider using compounds associated with less weight gain. The estimates provided in table 4 may help clinicians make such choices. The preceding notwithstanding, we wish to emphasize that weight gain should never be a sole reason for choosing one antipsychotic drug over another. Both therapeutic efficiency and other factors such as dose-related extrapyramidal syndromes should also be considered in drug selection. For many individuals the degree of risk imposed by the weight gain from a drug will not outweigh the degree of benefit achieved by alleviation of schizophrenic symptoms. In the end, clinical choices must be made on a case-by-case basis, with careful consideration of issues of weight, therapeutic efficacy, and other relevant factors.
Presented in part at the 151st annual meeting of the American Psychiatric Association, Toronto, May 30–June 4, 1998, and the 1998 meetings of the New Clinical Drug Evaluation Unit, the Association of European Psychiatrists, and the Collegium Internationale Neuro-Psychopharmacologicum. Received Aug. 4, 1998; revision received March 8, 1999; accepted March 17, 1999. From the Obesity Research Center, St. Luke’s-Roosevelt Hospital, Columbia University College of Physicians and Surgeons; the Graduate School of Education, Fordham University, New York; and Pfizer Central Research, Groton, Conn. Address reprint requests to Dr. Allison, Obesity Research Center, 1090 Amsterdam Ave., Suite 14B, New York, NY 10025; dba8@columbia.edu (e-mail). Supported by a grant from Pfizer Central Research and grants DK-26687, DK-51716, and DK-47526 from the National Institute of Diabetes and Digestive and Kidney Diseases. The authors thank the following for their help: Charles M. Beasley, Jr., Alan Breier, Ann Marie K. Crawford, Martin Brecher, Rolando Gutierrez, Andrew Chanlam, Rakhee Vasant, Mani Lakshminarayanan, Robert Monty, Muriel Young, Sandra Wiejowski, Christine A. Ney, Jaime Mullen, Albert S. Stunkard, Petra Platte, Christine Peterson, Donna Wirshing, Danielle Goldstein, and Michael C. Neale.