Evaluating the clinical safety of large language models in response to high-risk mental health disclosures

João M. Santos; Siddharth Shah; Amit Gupta; Aarav Mann; Alexandre Vaz; Benjamin E. Caldwell; Robert Scholz; Peter Awad; Rocky Allemandi; Doug Faust; Harshita Banka; Tony Rousmaniere

Ciência_Iscte Publications Publication Detailed Description

Scientific journal paper

Evaluating the clinical safety of large language models in response to high-risk mental health disclosures

João M. Santos (Santos, J. M.); Siddharth Shah (Shah, S.); Amit Gupta (Gupta, A.); Aarav Mann (Mann, A.); Alexandre Vaz (Vaz, A.); Benjamin E. Caldwell (Caldwell, B. E.); Robert Scholz (Scholz, R.); Peter Awad (Awad, P.); Rocky Allemandi (Allemandi, R.); Doug Faust (Faust, D.); Harshita Banka (Banka, H.); Tony Rousmaniere (Rousmaniere, T.); et al.

Journal Title

Practice Innovations

Year (definitive publication)

N/A

Language

English

Country

United States of America

More Information

Visit Link

Web of Science®

Times Cited: 0

(Last checked: 2026-05-29 02:59)

View record in Web of Science®

Scopus

This publication is not indexed in Scopus

Google Scholar

Times Cited: 0

(Last checked: 2026-05-27 18:28)

View record in Google Scholar

Overton

This publication is not indexed in Overton

Abstract

As large language models increasingly mediate emotionally sensitive conversations, especially in mental health contexts, their ability to recognize and respond to high-risk situations becomes a matter of public safety. This study evaluates the responses of six popular large language models—Claude, Gemini, DeepSeek, ChatGPT, Grok 3, and LLAMA—to user prompts simulating crisis-level mental health disclosures. Drawing on a coding framework developed by licensed clinicians, five safety-oriented behaviors were assessed: explicit risk acknowledgment, empathy, encouragement to seek help, provision of specific resources, and invitation to continue the conversation. Claude outperformed all others in a global assessment, while Grok 3, ChatGPT, and LLAMA underperformed across multiple domains. Notably, most models exhibited empathy, but few consistently provided practical support or kept the conversation open. These findings suggest that while large language models show potential for emotionally attuned communication, none currently meet satisfactory clinical standards for crisis response. Ongoing development and targeted fine-tuning are essential to ensure ethical deployment of AI in mental health settings.

Acknowledgements

Keywords

Large language models,Crisis intervention,Ethics,Mental health

Publication Identifiers

Other ID (source: External)	cv-prod-id-5154095
DOI (source: author)	10.1037/pri0000316
WoS (source: Ciência_Iscte)	RC:164925880_S24
DOI (source: other)	10.1037/pri0000316
WoS (source: author)	RC:164925880_S24
Ciência_Iscte ID	ci-pub-117447
Handle (source: Ciência-IUL)	http://hdl.handle.net/10071/36775

Other Publication Details

Online Publication Year	2026
Publisher	American Psychological Association
Indexes	Web of Science©;
ISSN	2377-889X (print) 2377-8903 (online)
ISBN	--
Impact Factor	--
Volume	N/A	Number
Series
Article Number
Pages	--
Peer Reviewed	Yes
ISCTE-IUL Repository	Link to the repository
Publication Date (online)	2026-02-16
Publication Date (print)

Altmetric

Dimensions

PlumX Metrics