
‘AI Psychosis’ and the Limits of Empathetic AI

By David Proulx, Chief AI Officer & Cofounder, HoloMD

Imagine this: A friend of yours has been struggling with their mental health. In search of support, they turn to a generic AI chatbot that's always available to listen. At first, it doesn't seem like an issue. The bot responds with what sounds like empathy. It never judges. It mirrors their mood and says exactly what they need to hear in the moment.

Over time, though, you start to notice something's off. Your friend seems more withdrawn and fixated on their conversations with the chatbot. Their thinking grows cloudy, and when you gently question some of the things they say the bot has told them, they get defensive. They become more protective of the chatbot than of their own clarity. The AI, meant to soothe, has started to echo their confusion, offering reassurance without discernment and blurring the line between validation and shared delusion.

This is AI psychosis, a growing concern among researchers and users: prolonged AI interactions appear to trigger or intensify delusional thinking. The worry is that, as AI grows more empathetic in tone while remaining context-blind, it risks becoming a co-conspirator in mental health deterioration.

General-purpose AI is not built for people struggling with mental health challenges. They need purpose-built tools with the oversight and guardrails to prevent users from ending up in an unsafe loop with an overly agreeable machine.

What Is AI Psychosis?

AI psychosis is a unique phenomenon. It occurs when a person develops delusion-like symptoms that arise from, or are worsened by, extended, unsupervised chatbot interactions. While not a clinical diagnosis, the term captures a dangerous feedback loop between a person in distress and a context-limited machine. An early case was reported in 2023, when a Belgian man ended his life after six weeks of conversation about the climate crisis with an AI chatbot.

In ongoing assessments, researchers have found that many chatbots still fail the basic crisis-response benchmarks necessary to protect users. The vulnerabilities include:

  • No specialized training. Chatbots are trained on diverse datasets and fine-tuned for engagement and response fluency, not for domain-specific mental health interventions or clinical assessment.
  • No real-time crisis detection. Chatbots can miss or mishandle escalating risk in crisis scenarios.
  • Emotional dependence and reinforcement of distorted thoughts. Unchecked empathy can validate delusional or self-harming narratives, which is especially dangerous with unregulated AI support.

The Technical Roots of the Problem

LLMs operate with a limited context window: a finite number of tokens they can process at once. When conversations exceed this limit, earlier content is no longer accessible to the model during inference. In extended sessions, this constraint can lead to inconsistent responses as the model loses access to earlier context, potentially allowing contradictory or unsafe narratives to develop without the grounding of initial system instructions or conversation history. This limitation can result in what appears as "memory drift," where safety guardrails and conversational coherence degrade over time.
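To make the failure mode concrete, here is a minimal sketch, written in plain Python with a made-up message format and token budget rather than any particular vendor's API, of how naive history truncation can silently evict the very system instructions that carry the safety guardrails, and how pinning them avoids that:

```python
# Hypothetical illustration only: a naive sliding-window truncation that
# drops the oldest messages once a token budget is exceeded. If the system
# prompt is treated like any other message, the safety instructions are the
# first thing to disappear in a long session.

MAX_TOKENS = 4096  # illustrative budget, not any real model's limit

def count_tokens(message: dict) -> int:
    # Crude stand-in for a real tokenizer: roughly one token per 4 characters.
    return max(1, len(message["content"]) // 4)

def naive_truncate(messages: list[dict]) -> list[dict]:
    """Drop the oldest messages until the conversation fits the budget."""
    history = list(messages)
    while history and sum(count_tokens(m) for m in history) > MAX_TOKENS:
        history.pop(0)  # the system prompt sits at index 0, so it goes first
    return history

def pinned_truncate(messages: list[dict]) -> list[dict]:
    """Keep the system prompt, then trim only the user/assistant turns."""
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    budget = MAX_TOKENS - sum(count_tokens(m) for m in system)
    while rest and sum(count_tokens(m) for m in rest) > budget:
        rest.pop(0)
    return system + rest
```

Pinning the system prompt does not solve attention-related "lost-in-the-middle" effects, but it at least keeps the guardrail text inside the window as the session grows.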

In building AI systems, developers must balance performance and safety. Because inference costs (computational resources and latency) are significant economic factors, many models prioritize response speed and throughput. This optimization often comes at the expense of more comprehensive safety checks, multi-step reasoning validation, or resource-intensive content filtering that could catch problematic responses before they reach users.
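As a rough sketch of that tradeoff (the function names and keyword list below are hypothetical placeholders, not a real moderation API), routing every draft reply through a second screening pass before it reaches the user roughly doubles the inference work per turn, which is exactly the kind of overhead throughput-focused deployments tend to trim:

```python
# Hypothetical two-pass pipeline: a screening step reviews the draft reply
# before it is released. The second pass adds the latency and cost that
# performance-optimized systems often skip.

def generate_draft(conversation: list[str]) -> str:
    # Stand-in for the model's fast generation pass.
    return "Draft reply to: " + conversation[-1]

def screen_draft(conversation: list[str], draft: str) -> bool:
    # Stand-in for a slower safety pass (a classifier or second model call).
    # Returns True when the draft should be withheld and escalated.
    risky_cues = ("hopeless", "no point", "end it all")
    text = " ".join(conversation + [draft]).lower()
    return any(cue in text for cue in risky_cues)

def respond(conversation: list[str]) -> str:
    draft = generate_draft(conversation)    # inference call 1
    if screen_draft(conversation, draft):   # inference call 2: the "extra" cost
        return ("I'm concerned about what you've shared. "
                "Please consider reaching out to a crisis line or a clinician.")
    return draft
```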

The Illusion of Empathy

Another problem with AI is the illusion of empathy. An empathetic tone doesn't equate to empathetic understanding. Chatbots may validate emotions or mimic therapeutic language, but they lack the clinical insight to distinguish between ordinary distress and a potential crisis. As a result, they can unintentionally reinforce delusional thinking or provide false comfort.

That's why many clinicians and mental health advocates are skeptical of inflated emotional intelligence claims. Safety and security, not emotional intelligence, should be the core focus of AI developed for mental health.

That raises the question: is it responsible to deploy "empathetic" AI systems without crisis-awareness mechanisms or escalation protocols? Likely not. It's imperative that the right controls are in place to protect the health and well-being of their users.

Rethinking Design: What Safe Mental Health AI Should Include

Designing AI systems that engage with mental health topics demands that boundaries, accountability, and supervision be built in from day one.

The foundational design principles for AI tools that engage in mental health conversations should be as follows:

  • Human-in-the-loop: Every AI conversation should be reviewed by a qualified clinician who can intervene and adjust care plans.
  • Mental health fine-tuning: Models must be trained on therapeutic frameworks, not just general dialogue, and equipped to assess risk, set boundaries, and follow de-escalation scripts.
  • Context window limits: Systems should prevent users from expanding the context window in ways that let the AI override its guardrails. Retrieval in particular needs stable safety rules that account for "lost-in-the-middle" effects.
  • Crisis alerts: Models should have built-in detection for self-harm risk, suicidal ideation, and negative sentiment around medication. Benchmarks need to capture subtle, cumulative risk, not just explicit trigger phrases; a minimal sketch of that approach follows this list.
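Here is that cumulative-scoring idea as a minimal sketch, using invented keyword weights and a toy decay factor purely for illustration; a production system would rely on a clinically validated classifier, not hand-picked phrases:

```python
# Hypothetical cumulative risk tracking: each turn contributes a small score,
# and an alert fires when the running total crosses a threshold, even if no
# single message contains an explicit trigger phrase.

RISK_WEIGHTS = {                 # invented weights, for illustration only
    "hopeless": 0.4,
    "i'm a burden": 0.5,
    "stopped my medication": 0.5,
    "no reason to go on": 0.8,
}
ALERT_THRESHOLD = 1.0
DECAY = 0.9                      # older turns count less, but never vanish

def score_turn(text: str) -> float:
    lowered = text.lower()
    return sum(w for phrase, w in RISK_WEIGHTS.items() if phrase in lowered)

def cumulative_risk(turns: list[str]) -> float:
    total = 0.0
    for turn in turns:
        total = total * DECAY + score_turn(turn)
    return total

def should_escalate(turns: list[str]) -> bool:
    # True when subtle signals accumulate across a session.
    return cumulative_risk(turns) >= ALERT_THRESHOLD
```

The point is architectural: risk state should persist across turns and route to a human reviewer, rather than being re-derived from each message in isolation.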

These design elements aren't widely adopted, due to technical constraints or commercial pressures, but they are non-negotiable for safety. That's why mental health AI platforms must be clinician-led. This is a paradigm shift away from performance-first development, and its costs are worth bearing when the well-being of real users is at stake.

Moving the Industry Forward

As AI grows more persuasive and emotionally intelligent, its responsibility to users, especially those in crisis, must scale accordingly. "AI psychosis" may be an emerging term, but the pattern it names, a mutual hallucination between human and machine, is already of real consequence.

My challenge to developers is simple: do not confuse warm language with care. Build systems that know their limits, intervene when risk rises, and put clinicians in a position to help. In mental health, the shift from the appearance of empathy to empathy as a safeguarded workflow will be the difference between positive and negative outcomes.

The AI industry must invest in mental health-specific safeguards, not just performance metrics, to ensure technology heals rather than harms. Doing so will protect users and give confidence that AI platforms can respond to crisis cues in a way that benefits everyone.
