Do Large Language Models Align with Core Mental Health Counseling Competencies?

Nguyen, Viet Cuong; Taher, Mohammad; Hong, Dongwan; Possobom, Vinicius Konkolics; Gopalakrishnan, Vibha Thirunellayi; Raj, Ekta; Li, Zihang; Soled, Heather J.; Birnbaum, Michael L.; Kumar, Srijan; De Choudhury, Munmun

Computer Science > Computation and Language

arXiv:2410.22446 (cs)

[Submitted on 29 Oct 2024 (v1), last revised 26 Feb 2025 (this version, v2)]

Title:Do Large Language Models Align with Core Mental Health Counseling Competencies?

Authors:Viet Cuong Nguyen, Mohammad Taher, Dongwan Hong, Vinicius Konkolics Possobom, Vibha Thirunellayi Gopalakrishnan, Ekta Raj, Zihang Li, Heather J. Soled, Michael L. Birnbaum, Srijan Kumar, Munmun De Choudhury

View PDF

Abstract:The rapid evolution of Large Language Models (LLMs) presents a promising solution to the global shortage of mental health professionals. However, their alignment with essential counseling competencies remains underexplored. We introduce CounselingBench, a novel NCMHCE-based benchmark evaluating 22 general-purpose and medical-finetuned LLMs across five key competencies. While frontier models surpass minimum aptitude thresholds, they fall short of expert-level performance, excelling in Intake, Assessment & Diagnosis but struggling with Core Counseling Attributes and Professional Practice & Ethics. Surprisingly, medical LLMs do not outperform generalist models in accuracy, though they provide slightly better justifications while making more context-related errors. These findings highlight the challenges of developing AI for mental health counseling, particularly in competencies requiring empathy and nuanced reasoning. Our results underscore the need for specialized, fine-tuned models aligned with core mental health counseling competencies and supported by human oversight before real-world deployment. Code and data associated with this manuscript can be found at: this https URL

Comments:	10 Pages, Accepted to Findings of NAACL 2025
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2410.22446 [cs.CL]
	(or arXiv:2410.22446v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.22446

Submission history

From: Viet Cuong Nguyen [view email]
[v1] Tue, 29 Oct 2024 18:27:11 UTC (187 KB)
[v2] Wed, 26 Feb 2025 21:37:16 UTC (189 KB)

Computer Science > Computation and Language

Title:Do Large Language Models Align with Core Mental Health Counseling Competencies?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Do Large Language Models Align with Core Mental Health Counseling Competencies?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators