Abstract
The number of prediction models for suicide-related outcomes has grown substantially in recent years. These models aim to assist in stratifying risk, improve clinical decision-making, and facilitate a personalised medicine approach to the prevention of suicidal behaviour. However, there are contrasting views as to whether prediction models have potential to inform and improve assessment of suicide risk. In this perspective, we discuss common misconceptions that characterise criticisms of suicide risk prediction research. First, we discuss the limitations of a classification approach to risk assessment (eg, categorising individuals as low-risk vs high-risk), and highlight the benefits of probability estimation. Second, we argue that the preoccupation with classification measures (such as positive predictive value) when assessing a model’s predictive performance is inappropriate, and discuss the importance of clinical context in determining the most appropriate risk threshold for a given model. Third, we highlight that adequate discriminative ability for a prediction model depends on the clinical area, and emphasise the importance of calibration, which is almost entirely overlooked in the suicide risk prediction literature. Finally, we point out that conclusions about the clinical utility and health-economic value of suicide prediction models should be based on appropriate measures (such as net benefit and decision-analytic modelling), and highlight the role of impact assessment studies. We conclude that the discussion around using suicide prediction models and risk assessment tools requires more nuance and statistical expertise, and that guidelines and suicide prevention strategies should be informed by the new and higher quality evidence in the field.
- Suicide & self-harm
- Data Interpretation, Statistical
- Machine Learning
This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 International (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.
The growing interest in precision psychiatry in recent years has led to a plethora of risk prediction models, both for the onset of mental illness and for a wide range of course-of-illness outcomes. Predicting the risk of suicide and self-harm has been an area of particular interest. However, there are contrasting views on whether prediction models should be used to assist in suicide risk assessment, with some experts questioning the predictive performance and clinical utility of these models. Here, we discuss four common misconceptions that dominate criticisms of suicide risk assessment tools and prediction models. These misconceptions have been repeated following the publication in BMJ Mental Health of the OxSATS risk calculator,1 a novel, scalable and evidence-based approach for estimating 12-month risk of suicide death following self-harm. The OxSATS model was developed in a sample of over 37 000 individuals with hospital presentations of self-harm, using data from Swedish population-based registers. The final 11-item model includes routinely collected sociodemographic and clinical predictors, and showed good discrimination (c-index 0.77, 95% CI 0.75 to 0.78) and calibration (assessed using the calibration slope, intercept and calibration plots) in external validation.1 To our knowledge, it is the first prediction model in this population that provides probability scores for suicide risk and has been assessed on a full range of performance measures.
The first common misconception among critics of suicide risk prediction, particularly in the UK and Australia,2 3 is that all prediction tools invariably have to classify individuals into risk categories (eg, low vs high). This is not the case, as exemplified by one of the most widely advocated prognostic tools in medicine, the Framingham score, which estimates an individual’s probability of developing cardiovascular disease in the next 10 years. In our view, the focus of suicide prediction should shift from classification of individuals (into low-risk vs high-risk groups) to estimating probabilities. Classification implies that all individuals within a risk group should be treated as if they have the same predicted suicide risk. Conversely, two individuals with risk estimates just below and just above a classification threshold are assumed to have different levels of risk (and may receive different interventions as a result).4 Probability estimates, on the other hand, allow for more personalised decision-making at the individual patient level and hence are more informative.4 This is an important distinction between OxSATS (a risk prediction model) and some earlier tools that are classifiers (ie, they do not produce probability estimates).2 3 5 In some contexts, guidelines may need to specify a probability threshold for recommending interventions in clinical practice. However, defining risk groups in such contexts still relies on accurate estimation of probabilities.6 Furthermore, comparing an individual’s personalised probability estimate with the proposed threshold could improve decision-making in these situations.4 An important area for future research is how best to communicate probability estimates in clinical practice to support decision-making around suicide risk management.
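To make the distinction concrete, the following is a minimal, hypothetical sketch (in Python, with invented risk estimates and an arbitrary cut-off, none of which come from OxSATS or any other published model) of the information lost when a predicted probability is collapsed into a binary risk category.

```python
# Minimal, hypothetical illustration of the information lost when predicted
# probabilities are collapsed into binary risk categories (all values invented).
predicted_risk = {"patient_A": 0.049, "patient_B": 0.051, "patient_C": 0.30}

THRESHOLD = 0.05  # an arbitrary classification cut-off

for patient, p in predicted_risk.items():
    label = "high risk" if p >= THRESHOLD else "low risk"
    print(f"{patient}: predicted 12-month risk = {p:.1%} -> classified as {label}")

# Patients A and B have nearly identical predicted risks (4.9% vs 5.1%) but fall on
# opposite sides of the cut-off, while B and C are grouped together despite a roughly
# six-fold difference in predicted risk. Reporting the probability itself avoids both
# distortions.
```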
Second, arguments against the use of suicide prediction models have largely been based on measures of classification, including sensitivity, specificity and positive predictive value (PPV). However, the overwhelming focus on classification measures when assessing model predictive performance is problematic. While these measures are easily interpretable, their values are strongly dependent on the chosen threshold or cut-off.4 There is often no universally optimal threshold for a prediction model, as the choice of threshold should be determined by the clinical context, including the benefits of true positives and the costs of false positive and false negative classifications.4 Different clinicians and patients will likely differ in their attitudes towards the costs of misclassification (and therefore risk thresholds for intervention), and any prediction model should be able to accommodate these.6 7 For instance, if the intervention involves referral or admission to psychiatric services, the threshold for a given patient may partly be determined by the level of social support (eg, using a higher threshold for referral or admission if the patient has a high degree of social support). Clinicians may also vary in their general propensity to intervene, with some having a lower threshold for intervention (ie, more concerned about missing a suicide or self-harm event), while others are more conservative (ie, prioritising avoiding unnecessary interventions).8 As such, when evaluating the predictive performance of a model, the primary focus should be on measures of discrimination, such as the area under the receiver operating characteristic curve (AUC), and on calibration (ie, the agreement between predicted and observed risks), assessed, for example, with calibration plots. These measures are threshold-independent and assess the quality of predictions across the entire range of model-predicted probabilities.9 Ideally, assessment of a model’s performance should also involve examining the (in)stability of its predictions, that is, the extent to which the estimated risks for an individual may differ depending on the particular sample used for model development.10 Measures for quantifying model instability at the development stage have recently been proposed by Riley and Collins.10 These instability checks can help users decide whether model predictions are likely to be reliable enough in new individuals from the population in which the model was developed.
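As an illustration of this threshold dependence, the sketch below (Python, using simulated data rather than output from any real suicide prediction model) computes sensitivity, specificity and PPV for the same set of predictions at several cut-offs; all three shift as the cut-off moves, even though the underlying predictions are unchanged.

```python
# Hypothetical sketch: sensitivity, specificity and PPV all change with the chosen
# risk threshold. y_true and y_prob are simulated, not output from any real model.
import numpy as np

rng = np.random.default_rng(0)
n = 10_000
y_true = rng.binomial(1, 0.01, size=n)                           # rare outcome, ~1% prevalence
y_prob = np.clip(rng.beta(1, 60, size=n) + 0.03 * y_true, 0, 1)  # events score higher on average

for threshold in (0.01, 0.03, 0.05, 0.10):
    y_pred = y_prob >= threshold
    tp = np.sum(y_pred & (y_true == 1))
    fp = np.sum(y_pred & (y_true == 0))
    fn = np.sum(~y_pred & (y_true == 1))
    tn = np.sum(~y_pred & (y_true == 0))
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    ppv = tp / (tp + fp) if (tp + fp) > 0 else float("nan")
    print(f"threshold={threshold:.2f}  sensitivity={sensitivity:.2f}  "
          f"specificity={specificity:.2f}  PPV={ppv:.3f}")
```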
Third, some critics have suggested that the AUC values for suicide prediction models are too low to be useful.11 However, as has been discussed by de Hond et al,12 the practice of labelling specific AUC values (eg, as poor, moderate, good or excellent) is discouraged as such value judgements are often arbitrary. What is considered ‘good’ discriminative ability for a model depends on the clinical area and on the available alternatives.13 While very high AUC values (eg, above 0.90) are sometimes possible in diagnostic prediction modelling (such as the ADNEX model for preoperative diagnosis of ovarian tumours14), such values are rare in the context of prognostic prediction. For instance, the most promising models for predicting a range of adverse health outcomes (including mortality) in hospitalised COVID-19 patients, as identified in a recent systematic review,15 have AUCs ranging from 0.76 to 0.79. An important related issue that these criticisms fail to recognise is that two models can have similar AUCs despite very different calibration performance. For instance, OxSATS shows reasonably good calibration in external validation,1 while some of the first-generation scales5 cannot even be assessed on their calibration performance because they do not provide probability estimates. Calibration is a key performance criterion for any model intended to support clinical decision-making, as poorly calibrated risk predictions can be misleading and lead to overtreatment or undertreatment, potentially causing patient harm.16 This has been emphasised in numerous methodological and reporting guidelines for prognostic modelling studies,9 17 but it is almost entirely overlooked in the suicide prediction literature.
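A simple way to see what calibration assessment adds beyond the AUC is to compare mean predicted risk with the observed event rate within groups of predicted risk. The hypothetical sketch below (simulated data, with deliberately miscalibrated predictions) illustrates the idea behind such a grouped calibration check, which calibration plots and the calibration slope and intercept formalise.

```python
# Hypothetical grouped calibration check on simulated data. The 'model' below is
# deliberately miscalibrated (it overstates every risk by 50%) to show what a
# calibration check detects and an AUC does not.
import numpy as np

rng = np.random.default_rng(1)
n = 20_000
true_risk = rng.beta(2, 200, size=n)          # simulated true 12-month risks
y = rng.binomial(1, true_risk)                # observed outcomes
predicted = np.clip(1.5 * true_risk, 0, 1)    # miscalibrated predictions (same ranking as true_risk)

# Compare mean predicted risk with the observed event rate in tenths of predicted risk.
order = np.argsort(predicted)
for i, group in enumerate(np.array_split(order, 10), start=1):
    print(f"group {i:2d}: mean predicted = {predicted[group].mean():.3f}  "
          f"observed rate = {y[group].mean():.3f}")

# Ranking (and hence the AUC) is unchanged by the 1.5x inflation, but the predicted
# risks systematically exceed the observed rates; calibration plots, slopes and
# intercepts quantify exactly this kind of disagreement.
```

Because the inflation does not change the ordering of patients, a purely discrimination-based evaluation would rate the miscalibrated model and a well-calibrated one identically.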
Fourth, whether or not a model should be used to support clinical decision-making around suicide risk (eg, to support safety planning, to screen for more detailed clinical and/or psychosocial assessment or to determine treatment) is an empirical question which requires specific measures beyond discrimination and calibration. Basing such conclusions on AUC values alone is misguided and conflates model predictive performance with clinical utility. One approach that can be used to evaluate the clinical usefulness of a model for decision-making is to plot the net benefit of the model across a range of clinically reasonable risk thresholds (ie, a decision curve analysis).18 As an example, this approach has recently been used to assess the net benefit of a prediction model for violence risk (OxMIV) in a first-episode psychosis population in England.19
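For readers unfamiliar with decision curve analysis, the sketch below implements the standard net benefit calculation, NB(t) = (TP - FP * t/(1 - t)) / N at risk threshold t, on simulated data; the thresholds and predictions are invented and are not taken from OxSATS, OxMIV or any cited study.

```python
# Hypothetical decision curve analysis sketch using the standard net benefit formula
# NB(t) = (TP - FP * t / (1 - t)) / N. All data are simulated and illustrative only.
import numpy as np

def net_benefit(y_true, y_prob, threshold):
    """Net benefit of intervening on everyone whose predicted risk is at or above 'threshold'."""
    treat = y_prob >= threshold
    tp = np.sum(treat & (y_true == 1))
    fp = np.sum(treat & (y_true == 0))
    return (tp - fp * threshold / (1 - threshold)) / len(y_true)

rng = np.random.default_rng(2)
n = 50_000
y_true = rng.binomial(1, 0.01, size=n)
y_prob = np.clip(0.005 + 0.03 * y_true + rng.normal(0, 0.004, size=n), 0.001, 0.999)

prevalence = y_true.mean()
for t in (0.005, 0.01, 0.02, 0.05):
    nb_model = net_benefit(y_true, y_prob, t)
    nb_treat_all = prevalence - (1 - prevalence) * t / (1 - t)   # intervene on everyone
    print(f"t={t:.3f}  model NB={nb_model:.4f}  treat-all NB={nb_treat_all:.4f}  treat-none NB=0.0000")

# The model adds value at a given threshold only if its net benefit exceeds both the
# 'treat all' and 'treat none' (net benefit 0) strategies at that threshold.
```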
From a health economics perspective, decision analytical modelling has been used to evaluate the cost-effectiveness of implementing suicide prediction models in different populations and settings. These analyses require a risk threshold to be specified as they reflect the consequences of using the model for decision-making. For instance, it has been shown that implementation of OxMIS20—a tool which estimates the probability of suicide in people with severe mental illness—in secondary care in England can lead to cost savings and a small improvement in health outcomes compared with usual care (using a 1% risk threshold to target a high-risk management strategy).21 Another economic evaluation study estimated threshold classification accuracy values required for a suicide prediction model to be cost-effective in US primary care.22 The analyses showed that for targeting a safety planning and telephone call intervention, at a specificity of 95%, the required PPVs to achieve cost-effectiveness were 0.8% for suicide attempts and 0.07% for suicide deaths. The threshold PPVs were higher for a more resource-intensive intervention (cognitive–behavioural therapy), namely, 1.7% for suicide attempts and 0.2% for suicide deaths.
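To give a sense of the kind of reasoning that lies behind such threshold accuracy estimates, the following is a deliberately stylised back-of-envelope calculation; all costs, effect sizes and monetary values are invented for illustration and are not the inputs used by Ross et al or in the OxMIS evaluation. The logic is simply that a targeted intervention offered to everyone flagged as high risk breaks even when the expected benefit per true positive, weighted by the PPV, offsets the per-person cost of intervening.

```python
# Stylised, hypothetical break-even calculation for a targeted intervention.
# All numbers below are invented for illustration only.
cost_per_flagged_person = 150.0     # cost of delivering the intervention to one flagged patient
relative_risk_reduction = 0.25      # assumed effect of the intervention on the outcome
value_of_averted_event = 50_000.0   # assumed monetary value of preventing one event

# Expected benefit generated by one true positive who receives the intervention.
benefit_per_true_positive = relative_risk_reduction * value_of_averted_event

# The strategy breaks even when PPV * benefit_per_true_positive >= cost_per_flagged_person.
break_even_ppv = cost_per_flagged_person / benefit_per_true_positive
print(f"break-even PPV = {break_even_ppv:.1%}")   # ~1.2% under these invented assumptions
```

Under these assumptions, even a very low PPV can be compatible with cost-effectiveness when the intervention is cheap; the break-even PPV rises as the intervention becomes more resource-intensive, mirroring the pattern reported by Ross et al.22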
For low prevalence outcomes like suicide, the PPV of any prediction model, at any given threshold, will be low, and the associated high false positive rate could lead to ‘alarm fatigue’ in clinical practice. However, as highlighted in the study by Ross et al,22 measures such as PPV and false positive rate cannot be interpreted without considering the clinical context (including the target population, the specific decision that the model is intended to inform, and the relative importance of true vs false positive classifications in that context). This suggests that the same prediction model may have clinical utility and be cost-effective for targeting one particular suicide risk reduction intervention but not another. For instance, if the consequences of being classified as high risk of suicide are not harmful and the target interventions have additional benefits (eg, reducing risk of self-harm or accidental deaths), then a low PPV may not be problematic. Furthermore, there may be specific patient populations (eg, those with a higher prevalence of suicide or non-fatal self-harm) where prediction models are more likely to be clinically useful and/or cost-effective. This further emphasises the point that the most appropriate risk threshold for a given prediction model may be specific to the intervention and population of interest, and should only be determined after the predictive performance of the model (in terms of discrimination and calibration) is thoroughly investigated.4
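The arithmetic behind the low-PPV problem follows directly from Bayes' theorem. The worked example below uses illustrative sensitivity, specificity and prevalence values (not estimates from any published model) to show that for an outcome with roughly 1% prevalence, PPV remains modest even at very high specificity.

```python
# Hypothetical worked example of Bayes' theorem for PPV. Sensitivity, specificity and
# prevalence values are illustrative only, not estimates from any published model.
def ppv(sensitivity, specificity, prevalence):
    """PPV = sens * prev / (sens * prev + (1 - spec) * (1 - prev))."""
    true_positives = sensitivity * prevalence
    false_positives = (1 - specificity) * (1 - prevalence)
    return true_positives / (true_positives + false_positives)

# Assume a rare outcome (~1% over the prediction horizon) and 70% sensitivity.
for specificity in (0.90, 0.95, 0.99):
    print(f"specificity={specificity:.2f}  PPV={ppv(0.70, specificity, 0.01):.3f}")

# Even at 99% specificity the PPV stays below 50%: a direct consequence of the low
# prevalence, not of any deficiency specific to suicide prediction models.
```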
Ultimately, suicide prediction models are only useful in practice if they are linked to effective and scalable interventions,1 and if their implementation has a positive impact on clinical decision-making, patient outcomes and cost-effectiveness of care. Quantifying the impact of a prediction model on these outcomes requires evidence from prospective impact studies (ideally a cluster randomised trial), which are costly and time-consuming.13 Such impact studies are rare in prognostic model research, and to our knowledge have not been conducted for any suicide prediction model. However, they are an important step towards implementation for adequately validated models which show evidence of net benefit and the potential for improved patient outcomes and/or favourable cost-effectiveness in decision analytical modelling.13 23
In conclusion, we agree with critics of suicide risk prediction that identifying individuals who go on to self-harm or die from suicide is challenging; this is precisely the rationale for modelling risk with complex multivariable models developed using high-quality methods on very large datasets. There is much to be criticised about the suicide prediction modelling literature; the field must prioritise improved methodological rigour and adherence to best-practice reporting guidelines in the development of new models. There is also a clear need for high-quality external validations in large samples (followed by model updating if necessary), as well as more research assessing the clinical utility and impact of promising models. However, researchers and experts should bring statistical expertise and more nuance to the discussion around using prediction models and risk assessment tools for self-harm and suicide. The field needs to move beyond simplistic blanket statements suggesting that we abandon the endeavour of risk prediction in this area altogether.1 24 As discussed here, such statements are not evidence based, do not align with the rest of medicine and come across as ideological. Further, without proper assessment of clinical utility and cost-effectiveness, assertions that the PPVs or AUCs of suicide prediction models are too low to be useful should be avoided. Instead, clinical guidelines and suicide prevention strategies should be based on emerging high-quality evidence in the field, and consider a range of issues related to model predictive performance and clinical usefulness.
Ethics statements
Patient consent for publication
Ethics approval
Not applicable.
Footnotes
Contributors AS and SF jointly conceived, drafted and revised the editorial.
Funding AS is funded by a Department of Psychiatry Studentship (University of Oxford), the Clarendon Fund and the Robert Oxlade Scholarship (St John’s College, Oxford). SF is supported by the NIHR Oxford Health Biomedical Research Centre.
Disclaimer The views expressed are those of the authors and not necessarily those of the NIHR or the Department of Health and Social Care.
Competing interests SF is part of the team that developed OxSATS, OxMIS, OxMIV and other OxRisk calculators. AS has no competing interests.
Provenance and peer review Not commissioned; externally peer reviewed.