Article Text

Download PDFPDF

Original research
Linking the Mini-Mental State Examination, the Alzheimer’s Disease Assessment Scale–Cognitive Subscale and the Severe Impairment Battery: evidence from individual participant data from five randomised clinical trials of donepezil
  1. Stephen Z Levine1,
  2. Kazufumi Yoshida2,
  3. Yair Goldberg3,
  4. Myrto Samara4,5,
  5. Andrea Cipriani6,
  6. Orestis Efthimiou7,
  7. Takeshi Iwatsubo8,
  8. Stefan Leucht4,
  9. Toshi A Furukawa2
  1. 1 Department of Community Mental Health, University of Haifa, Haifa, Israel
  2. 2 Department of Health Promotion and Human Behavior, Kyoto University Graduate School of Medicine/ School of Public Health, Kyoto, Japan
  3. 3 Faculty of Industrial Engineering and Management, Haifa Technion, Haifa, Israel
  4. 4 Department of Psychiatry and Psychotherapy, School of Medicine, Technical University of Munich, Munich, Germany
  5. 5 Department of Psychiatry, School of Medicine, Aristotle University of Thessaloniki, Thessaloniki, Greece
  6. 6 Department of Psychiatry, University of Oxford, Oxford, UK
  7. 7 Institute of Social and Preventive Medicine (ISPM), University of Bern, Bern, Switzerland
  8. 8 Department of Neuropathology, Graduate School of Medicine, University of Tokyo, Bunkyo-ku, Tokyo, Japan
  1. Correspondence to Professor Stephen Z Levine, University of Haifa, Haifa 31905, Israel; slevine{at}


Background The Mini-Mental State Examination (MMSE), the Alzheimer’s Disease Assessment Scale–Cognitive Subscale (ADAS-Cog) and the Severe Impairment Battery (SIB) are widely used rating scales to assess cognition in Alzheimer’s disease.

Objective To understand the correspondence between these rating scales, we aimed to examine the linkage of MMSE with the ADAS-Cog and SIB total and change scores.

Methods We used individual-level data on participants with Alzheimer’s disease (n=2925) from five pivotal clinical trials of donepezil. Data were collected at baseline and scheduled visits for up to 6 months. We used equipercentile linking to identify the correspondence between simultaneous measurements of MMSE with ADAS-Cog, and SIB total and change ratings.

Findings Spearman’s correlation coefficients were of strong magnitude between the MMSE total score and the ADAS-Cog (rs from −0.82 to −0.87; p<0.05) and SIB total scores (rs from 0.70 to 0.75; p<0.05). Weaker correlations between the change scores were observed between the MMSE change score and the ADAS-Cog (week 1: r=−0.11, p=0.18; rs thereafter: −0.28 to −0.45; p<0.05) and SIB change scores (rs from 0.31 to 0.44; p<0.05). Linking suggested that the MMSE total scores were sensitive to moderate and severe cognitive impairment levels. Despite weak to moderate correlations for the change scores, moderate change levels linked well, indicating ceiling and floor effects.

Conclusions The current results can be used in meta-analyses, data harmonisation and may contribute to increasing statistical power when pooling data from multiple sources.

Clinical implications The current study results help clinicians to understand these cognitive rating scale scores.

  • adult psychiatry
  • delirium & cognitive disorders

Data availability statement

Data are available upon reasonable request. Data requests can be submitted via

Statistics from

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.


Alzheimer’s disease is a chronic neurodegenerative disease characterised by the progressive deterioration of neurons and atrophy of brain tissue.1 Onset is insidious and gradual, with initial cognitive impairment in short-term memory and with disease progression spreads to multiple cognitive domains (e.g., executive functions, attention, language) become impaired.2 To assess the extent of cognitive impairment, various rating scales have been developed to evaluate cognitive functioning in research and clinical settings at different stages of the disease.

The most widely used cognitive rating scale is the Mini-Mental State Examination (MMSE), which was designed to screen for cognitive impairment in under 10 minutes. However, the MMSE lacks sensitivity to distinguish severe levels of impairment. Subsequently, the Alzheimer’s Disease Assessment Scale–Cognitive Subscale (ADAS-Cog) was developed as a comprehensive assessment of the extent of cognitive dysfunction in Alzheimer’s disease.3 Despite an administration time of approximately 40 minutes, it has become the gold standard to assess the efficacy of antidementia treatments in clinical trials.4 For instance, a 4-point difference between the treatment and the placebo groups is considered clinically relevant,5 at least in most Alzheimer’s disease trials where symptomatology is moderate.6 For populations in the advanced stages of the illness, the Severe Impairment Battery (SIB) was developed to address the cognitive and behavioural characteristics of severe dementia.7 It has an administration time of around 30 minutes. There are design trade-offs between these scales, including administration time, and ceiling and floor effects (i.e., scales that measure the most severe cases lack sensitivity to differentiate moderate cases and vice versa).

Nonetheless, being able to convert between scale scores is often desirable, even a requirement. Research has demonstrated conversions between rating scales in schizophrenia,8 depression,9 panic disorder10 and Alzheimer’s disease.11 12 Prior studies linking rating scales in Alzheimer’s disease have been informative but are restricted to cross-sectional information, do not link change scores, do not link MMSE and SIB scores and do not consider learning effects from the first measure administered to the second. Nonetheless, there are benefits to converting measures.12 First, most clinicians are probably used to the MMSE, therefore an explanation as to what scores the other rating scales (e.g., the SIB mean will be useful. Second, in meta-analyses two different rating scales often need to be transformed into a single common metric. Third, healthcare maintenance organisations frequently have information on different rating scales that could be harmonised for a single statistical analysis. Fourth, datasets with different scale ratings can be pooled to increase statistical power.


In the current study, we aim to link the MMSE to the SIB and the ADAS-Cog based on five clinical trials of Alzheimer’s disease.

Study selection and analysis

We obtained access to the individual-level participant data of all randomised controlled double-blinded trials of donepezil conducted by Eisai Co in which two or more of the cognitive scales of interest were administered simultaneously. Data access was provided following the submission of an a priori analytic plan and analysed via a secure internet cloud-based platform ( We included trials in which patients with Alzheimer’s disease were assessed with at least two of MMSE,13 the SIB14 15 and/or the ADAS-Cog.3


The following three measures were administered in the trials.

Mini-Mental State Examination

The MMSE is comprised of 20 items and takes between 5 and 10 min to administer. It consists of components of cognitive functioning (e.g., time-space orientation, short-term memory, attention, language and construction) that constitute a single underlying entity of cognitive functioning.13 MMSE total scores range from 0 to 30, with lower scores representing a worse cognitive deficit.

Alzheimer’s Disease Assessment Scale–Cognitive Subscale

The ADAS-Cog is a neuropsychological index of the severity of the cognitive symptoms of dementia.4 It is comprised of 11 tasks (word recall, word recognition, constructional praxis, orientation, naming objects and fingers, commands, ideational praxis, remembering test instruction, spoken language, word-finding, comprehension) that include both participant-completed and observer-based assessments. ADAS-Cog total scores range from 0 to 70, with higher scores representing a greater cognitive deficit.

The Severe Impairment Battery

The SIB consists of 40 one-step questions and commands, with a scale score of 100 points and nine aspects of cognitive functioning (social interaction, memory, orientation, language, attention, praxis, visuospatial ability, construction, orienting to name). SIB total scores may range from 0 to 100, with higher scores representing a lesser cognitive deficit.14

The MMSE was administered in every trial. In contrast, the ADAS-Cog and SIB assessments did not coincide in the same trial or week (table 1). Across the trials, the MMSE was administered before the ADAS-Cog and SIB, except at week 24 in one trial.16

Table 1

Trial characteristics of the analytic dataset

Statistical analysis

At step one of the analysis, we presented the trial characteristics and demographics. At step two, we conducted linking of each available pair of ratings assessed at the same week (between the MMSE with the ADAS-Cog and then the MMSE with the SIB) with one exception. One trial16 had screening rather than baseline SIB assessments (conducted within the 28±7 days before drug administration, median 22 days). Therefore, it so was used as a baseline appraisal to maximise the sample size and because there was no alternative treatment for Alzheimer’s disease when the trials were conducted.

We computed Spearman’s correlation coefficients and equipercentile linking. Although there are no agreed thresholds to interpret Spearman’s correlation coefficient values, we follow prior guidelines and interpret the magnitude as weak (0.10–0.39), moderate (0.40–0.69) and strong (0.70–0.89).17 We computed equipercentile linking to examine the extent to which it was possible to convert between the study measures at each time point. Prior studies of schizophrenia18 and depression9 19 have used equipercentile linking. Equipercentile linking is a statistical method to equate scores between two test scales. In short, the method ranks percentile scores between tests, and there is no independent or dependent variable. In this way, transformations between test scores are ascertained.20 We analysed all trials collectively as a unique population rather than by trial to maximise the sample size and so attain robust linkage estimates. Finally, we took the median value from the equipercentile linking values across different measurement points to define the corresponding scores between the scales. We report cut-offs that link to mild (21-25), moderate (11-20), and severe (0-10) dementia on the MMSE.21 In this way, different scores become interchangeable and comparable. Equipercentile linking was computed using the equate library20 in R

Sensitivity analysis

We conducted sensitivity analysis precisely as above, but without the trial by Rogers et al 23 owing to 13 participants at one site, which violated the trial protocol.


Trial characteristics

We identified four randomised placebo-controlled clinical trials and one trial24 comparing random dose levels (of 10mg or 23mg of donepezil) cumulating in 2925 trial participants. Four participants were removed owing to missing information following the screening phase in one trial,24 leaving 2921 trial participants with information in the dataset. Of the 2921 participants, 1110 participants had MMSE and ADAS-Cog scores and 1786 had MMSE and SIB scores. Hence, the analytic dataset consisted of 2896 trial participants (table 1). In addition, eight participants who contributed to the linkage between ADAS-Cog and MMSE total and one participant who contributed to the linkage between MMSE and SIB total did not contribute to change scores due to missing baseline assessments. The trial characteristics of the analytic dataset are summarised in table 1. There were 1059 (36.6%) males and 1837 (63.4%) females, with a mean age (SD) of 74.4 (7.8) years. Total and change rating scale score descriptive statistics are shown in table 2.

Table 2

Sample characteristics

Equipercentile linking

Spearman’s correlation coefficients between the MMSE and the ADAS-Cog total were statistically significant (p<0.05) and of large magnitude at all time points (r=−0.82 to −0.87; table 2). Equipercentile linking of the MMSE total scores with ADAS-Cog total scores was calculated. Across different time points, each 10 point increase on the MMSE total score corresponded to a decrease of approximately 20 on the ADAS-Cog total score (see figure 1A and the conversion table: online supplemental eTable 1). In general, one MMSE point was equivalent to two ADAS-Cog points. Spearman’s correlation coefficient between the MMSE and the ADAS-Cog change scores was null at baseline (r=−0.11), but was statistically significant (p<0.05) for the remaining weeks with weak to moderate magnitude (range from r=−0.28 to r=−0.45; p<0.05; table 2). These lower correlations may partly reflect the inability of the MMSE change to capture severe cases of ADAS-Cog change. Equipercentile linking between the MMSE and the ADAS-Cog change scores showed slight fluctuations over time from −2 to 5 on the MMSE with floor and ceiling effects where change was under −3 and over 5 (see figure 1B and the conversion table: online supplemental eTable 2).

Supplemental material

Figure 1

Linking the MMSE and ADAS-Cog scores. Fig. 1A shows the MMSE total score linked to the ADAS-Cog total score from baseline to week 24. Figure 1B shows the MMSE change score linked to the ADAS-Cog change score from week one to 24. ADAS-Cog, Alzheimer’s Disease Assessment Scale–Cognitive Subscale; MMSE, Mini-Mental State Examination.

Spearman’s correlation coefficients between the MMSE and the SIB scores were of strong magnitude for the total scores (range r=0.70–0.75; p values <0.05; table 2) and a moderate magnitude for the change scores (range r=0.31–0.45; p values <0.05; table 2). Each point increase in the range 0–6 on the MMSE total score corresponded to an increment of approximately 8 on the SIB total score; for MMSE total scores ranging from 7 to 11 each point increase was linked to 4–5 point increases on the SIB score; and for MMSE total scores ranging from 12 to 30, each point increase linked to 1–2 point increments on the SIB total (figure 2A, and conversion online supplemental eTable 3). Examination of the linkage between the MMSE and SIB total change scores showed consistent linkage over time for MMSE change scores ranging from −2 to 8 (and vice versa). MMSE total change scores under −3 to −9 generally linked to a point reduction of −4 on the SIB change score, with lower values linking at −11 MMSE scores (see figure 2B, and conversion online supplemental eTable 4).

Figure 2

Linking the MMSE and SIB scores. Figure 2A shows the MMSE total score linked to the SIB total score from baseline to week 24. Figure 2B shows the MMSE change score linked to the SIB change score to week six to 24. MMSE, Mini-Mental State Examination; SIB, Severe Impairment Battery.

Table 3 summarises the corresponding scores across MMSE, ADAS-Cog and SIB. To interpret table 3, we used established guidelines to classify the MMSE scores into mild (21-25), moderate (11-20), and severe (0-10) dementia on the MMSE.21 MMSE scores of mild dementia (i.e., 21–25) linked to ADAS-Cog scores from 15 to 22 and SIB scores from 95 to 98; moderate dementia (i.e., 11–20) linked to ADAS-Cog scores from 24 to 46 and SIB scores from 75 to 93 and severe dementia (i.e., 0–10) linked to ADAS-Cog scores over 48 and SIB scores under 70.

Table 3

Conversion table between the study measures

Sensitivity analysis by removing the trial by Rogers et al 23 owing to 13 participants at one site that violated the trial protocol did not manifestly impact the results (conversion online supplemental eTables 5 and 6).

Conclusions and clinical implications

The current study is the first linkage between the MMSE with ADAS-Cog and SIB total and change from baseline scores. The results show a generally consistent linking pattern over time and are easy to translate into practice. For instance, at baseline, an MMSE score of 10 was linked to an ADAS-Cog score of 48 and a SIB score of 70, and across time each 10 point increase on the MMSE total score linked to a decrease of approximately 20 on the ADAS-Cog total score.

There are several potential reasons that the correlations between the total scores increased over time (see table 2). One explanation is that as the trial progressed, the variability of the scale scores increased and so did the correlation coefficients between them. Another possibility is that the natural progression of the disease induced an increase in generalised cognitive impairment. Furthermore, possibly repeated testing induced learning effects. Notably, the magnitudes of the change score correlations were less than that of the total scores, suggesting that conversion of change scores may be less reliable.25

Interestingly, the plots between the MMSE and the ADAS-Cog and SIB totals differ in form. The graph of the MMSE-SIB total linkage resembles an asymptotic function (i.e., SIB scores seem to level off at high MMSE scores). This likely reflects ceiling effects, namely, that the SIB is sensitive to severe stages but is insensitive to more normal range scores represented by MMSE scores of 25–30. The presence of ceiling effects was less evident on the linkage between the MMSE and the ADAS-Cog total scales.

There are several limitations to our study. First, as the results are based on clinical trial data with inclusion criteria, they may have restricted generalisability. Second, in one trial16 ratings of the MMSE at screening were used because there were no baseline scores. Considering there were approximately 3 weeks between screening and baseline and the rate of deterioration in Alzheimer’s disease, this artefact is unlikely to impact the study results. Third, the change scores did not correlate highly owing to outlying extreme scores. Fourth, we lacked item-level MMSE scores for a fine-level analysis. One might expect equivalence in well-defined domains of the same aspect of cognitive functioning. Fifth, the combined sample size and distribution did not enable us to analyse the placebo and donepezil groups separately. Sixth, we lacked information on the Montreal Cognitive Assessment (MoCA), which is a widely used measure to index cognitive impairment. Hence, the lack of information on the MoCA restricts the generalisability of the current results.26 Seventh, MMSE was administered before the other rating scales, except for one trial where the SIB was administered before the MMSE.16 This may raise concerns about the temporal order of each rating measure pair and learning cross-over effects between tests. However, at each visit, we consider learning effects are unlikely to adversely impact the result since the questions on the MMSE capture less severe impairment than the other rating scales.

A notable advantage of the current study was the large number of participants making the results robust. Moreover, all rating scale total scores were generally strongly correlated with each other, with little variability over time. This feature reinforces our faith in the robustness of linking functions.

In conclusion, our results provide linkages of the MMSE with the ADAS-Cog and SIB total and change scores often used in clinical trials of Alzheimer’s disease.27 The study results have the potential to contribute to clinical cross-walks when different clinicians use different scales and to meta-analytic calculations when researchers want to pool studies using different scales. Furthermore, our results can assist in data harmonisation and thus increase the statistical power of analyses that combine data from multiple sources. To facilitate the uptake of our results in practice, we have provided detailed conversion tables to directly link between rating scores of cognitive impairment in Alzheimer’s disease. From a clinical point of view, individual priorities and goals of care will vary significantly depending on the stage of dementia, and discussions need to be tailored to the stage of illness. It is important, however, that complex decisions are discussed at a relatively early stage, when a person with dementia may be actively involved in care planning.28

Data availability statement

Data are available upon reasonable request. Data requests can be submitted via

Ethics statements

Ethics approval

Institutional review boards approved each study, and all patients had given their written informed consent.


We acknowledge Eisai Co for providing us with the study data. Eisai Co did not provide study design, critical input or manuscript review for the study.



  • SZL and KY are joint first authors.

  • Twitter @szlevine, @And_Cipriani, @Toshi_FRKW

  • Contributors SZL contributed to manuscript drafting, statistical analysis, data management and study conceptualisation. KY contributed to critical manuscript feedback, data management and statistical analysis. YG contributed to critical manuscript feedback and statistical analysis. MS and AC contributed to study conceptualisation, interpretation and critical manuscript feedback. OE contributed to critical manuscript feedback and statistical interpretation. TI and SL contributed to study conceptualisation, interpretation and critical manuscript feedback. SL contributed to study conceptualisation, interpretation and critical manuscript feedback. TAF contributed to critical manuscript feedback, statistical review, study conceptualisation and mentorship.

  • Funding AC is supported by the National Institute for Health Research (NIHR) Oxford Cognitive Health Clinical Research Facility, by an NIHR Research Professorship (grant RP-2017-08-ST2-006), by the NIHR Oxford and Thames Valley Applied Research Collaboration and by the NIHR Oxford Health Biomedical Research Centre (grant BRC-1215-20005). OE is supported by project grant No 180 083 from the Swiss National Science Foundation (SNSF).

  • Competing interests TI has served as a consultant of Eisai, Roche and Biogen in the last 3 years. AC has received research and consultancy fees from INCiPiT (Italian Network for Paediatric Trials), CARIPLO Foundation and Angelini Pharma. In the past 3 years, SL has received honoraria as a consultant or for lectures for LB Pharma, Otsuka, Lundbeck, Boehringer Ingelheim, LTS Lohmann, Janssen, Johnson&Johnson, TEVA, MSD, Sandoz, SanofiAventis, Angelini, Sunovion, Recordati and Geodon Richter. TAF reports personal fees from Mitsubishi-Tanabe, MSD and Shionogi, and a grant from Mitsubishi-Tanabe, outside the submitted work; TAF has a patent 2018-177688 pending.

  • Provenance and peer review Not commissioned; externally peer reviewed.