CommunityNews

Training language models to be warm and empathetic makes them less reliable and more sycophantic

Artificial intelligence (AI) developers are increasingly building language models with warm and empathetic personas that millions of people now use for advice, therapy, and companionship. Here, we show how this creates a significant trade-off: optimizing language models for warmth undermines their reliability, especially when users express vulnerability. We conducted controlled experiments on five language models of varying sizes and architectures, training them to produce warmer, more empathetic responses, then evaluating them on safety-critical tasks. Warm models showed substantially higher error rates (+10 to +30 percentage points) than their original counterparts, promoting conspiracy theories, providing incorrect factual information, and offering problematic medical advice. They were also significantly more likely to validate incorrect user beliefs, particularly when user messages expressed sadness. Importantly, these effects were consistent across different model architectures, and occurred despite preserved performance on standard benchmarks, revealing systematic risks that current evaluation practices may fail to detect. As human-like AI systems are deployed at an unprecedented scale, our findings indicate a need to rethink how we develop and oversee these systems that are reshaping human relationships and social interaction.

Read in full here:

View thread on forum

#training

0 119 0

2025-08-13 23:48:20 UTC

Where Next?

View thread on forum

training

Home AI>In The News

#training

0 119 0

Last post

Popular Ai topics

AI>In The News

Nvidia Uses AI to Slash Bandwidth on Video Calls

NVIDIA Uses AI to Slash Bandwidth on Video Calls. NVIDIA Research has invented a way to use AI to dramatically reduce video call bandwid...

petapixel.com

#video #nvidia

1 832 0

2020-10-09 15:35:49 UTC

New

AI>In The News

Everyone wants to do the model work, not the data work: Data Cascades in High-Stakes AI (pdf)

AI models are increasingly applied in high-stakes domains like health and conservation. Data quality carries an elevated signifi- cance i...

storage.googleapis.com

#pdf

0 1440 0

2021-03-30 14:41:01 UTC

New

AI>In The News

Google AI tool can help patients identify skin conditions

Google has unveiled a tool that uses artificial intelligence to help spot skin, hair and nail conditions, based on images uploaded by pat...

bbc.co.uk

#google

0 1172 0

2021-05-20 19:24:41 UTC

New

AI>In The News

DeepMind says reinforcement learning is ‘enough’ to reach general AI

In their decades-long chase to create artificial intelligence, computer scientists have designed and developed all kinds of complicated m...

venturebeat.com

#learning #deepmind #general

0 1351 0

2021-06-12 03:16:30 UTC

New

AI>In The News

The Evolution of AI in the USA, 1956-1996

BROKEN PROMISES & EMPTY THREATS: THE EVOLUTION OF AI IN THE USA, 1956-1996 Artificial Intelligence (AI) is once again a promising tec...

technologystories.org

0 1453 0

2021-12-06 23:09:27 UTC

New

AI>In The News

Will Transformers Take Over Artificial Intelligence?

A simple algorithm that revolutionizes how neural networks approach language is now taking on image classification as well. It may not st...

quantamagazine.org

1 1100 0

2022-03-11 03:13:33 UTC

New

AI>In The News

Hyundai announces $400M AI, robotics institute powered by Boston Dynamics

When Hyundai acquired Boston Dynamics at the end of 2020, there were plenty of open questions. Chief among them was why we should assume ...

techcrunch.com

#robotics

0 788 0

2022-08-15 13:27:08 UTC

New

AI>In The News

OpenAI invites everyone to test new AI-powered chatbot—with amusing results

ChatGPT aims to produce accurate and harmless talk—but it’s a work in progress.

arstechnica.com

#test

0 674 0

2022-12-02 01:54:01 UTC

New

AI>In The News

Klarna CEO says the company stopped hiring a year ago because AI 'can already do all of the jobs'

Klarna CEO says the company stopped hiring a year ago because AI ‘can already do all of the jobs’. Klarna CEO Sebastian Siemiatkowski sa...

businessinsider.com

/erlang #jobs #klarna

2 627 2

2024-12-24 16:46:22 UTC

New

AI>In The News

Local LLM for Coding with Ollama on macOS

With all the AI buzz around coding assistants, and being a bit concerned about being dependent on third-party cloud providers here, I dec...

#helix

2 278 1

2025-08-09 05:07:32 UTC

New

Other popular topics

General Dev>Dev Chat

Which vertical monitor do you use?

I’m thinking of buying a monitor that I can rotate to use as a vertical monitor? Also, I want to know if someone is using it for program...

#monitors #programming

51 4622 20

2023-06-28 07:23:42 UTC

New

Game Dev>Learning Resources

Apple Game Frameworks and Technologies

Design and develop sophisticated 2D games that are as much fun to make as they are to play. From particle effects and pathfinding to soci...

pragprog.com

#pragprog #ios #game-dev #macos /swift #published-book #apple /book-apple-game-frameworks-and-technologies

30 6234 10

2021-04-22 16:51:02 UTC

New

General Dev>Hardware

Keyboard thock (sound)

I’ve been hearing quite a lot of comments relating to the sound of a keyboard, with one of the most desirable of these called ‘thock’, he...

/keyboards #mechanical-keyboards

14 9747 8

2020-11-11 11:59:23 UTC

New

General Dev>Dev Chat

The V Programming Language

The V Programming Language Simple language for building maintainable programs V is already mentioned couple of times in the forum, but I...

#programminguages /v

21 12589 7

2021-04-12 15:13:42 UTC

New

General Dev>Dev Chat

PragProg’s Medium Posts

Hello everyone! This thread is to tell you about what authors from The Pragmatic Bookshelf are writing on Medium.

#pragprog #blog-post

1147 28379 760

2025-07-10 13:36:16 UTC

New

Backend>Chat

Using Regular Expressions in Erlang

Intensively researching Erlang books and additional resources on it, I have found that the topic of using Regular Expressions is either c...

/erlang #regular-expressions

91 5416 43

2021-09-06 19:12:48 UTC

New

Community>In The Spotlight

Spotlight: Dmitry Zinoviev (Author) Interview and AMA!

Author Spotlight Dmitry Zinoviev @aqsaqal Today we’re putting our spotlight on Dmitry Zinoviev, author of Data Science Essentials in ...

#author-spotlight /python /book-complex-network-analysis-in-python /book-data-science-essentials-in-python /book-resourceful-code-reuse /book-pythonic-programming

33 5041 14

2022-10-11 20:07:10 UTC

New

Community>In The Spotlight

Spotlight: Jamis Buck (Author) Interview and AMA!

Author Spotlight Jamis Buck @jamis This month, we have the pleasure of spotlighting author Jamis Buck, who has written Mazes for Prog...

#author-spotlight /ruby /book-the-ray-tracer-challenge /book-mazes-for-programmers

21 5823 9

2022-09-28 18:21:15 UTC

New

Backend>Learning Resources

Agile Web Development with Rails 8

Get the comprehensive, insider information you need for Rails 8 with the new edition of this award-winning classic. Sam Ruby @rubys ...

pragprog.com

#pragprog #web-development /ruby /rails #published-book /book-agile-web-development-with-rails-8

12 5049 7

2025-03-27 18:33:39 UTC

New

AI>In The News

DeepSeek (671B) running on a cluster of 8 Mac Mini Pros with 64GB RAM each

This is cool! DEEPSEEK-V3 ON M4 MAC: BLAZING FAST INFERENCE ON APPLE SILICON We just witnessed something incredible: the largest open-s...

#ai #macs /deepseek

0 5576 1

2025-01-29 18:43:37 UTC

New

AI>In The News

A trillion dollars is a terrible thing to waste

AI>In The News

The chip made for the AI inference era – the Google TPU

AI>In The News

AI CEO – Replace your boss before they replace you

AI>In The News

Fara-7B: An Efficient Agentic Model for Computer Use

AI>In The News

The Current State of the Theory that GPL Propagates to AI Models Trained on GPL Code

AI>In The News

We're Losing Our Voice to LLMs

AI>In The News

Slop Detective

AI>In The News

Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos

AI>In The News

MIT study finds AI can replace 11.7% of U.S. workforce

AI>In The News

Google Antigravity Exfiltrates Data

AI>In The News

AI In The News ❯

Latest on Devtalk

What’s new in Svelte: December 2025

Frontend>Official News

webR - R in the browser

General Dev>In The News

Zero knowlege proof of compositeness

General Dev>In The News

Be Like Clippy

General Dev>In The News

All it takes is for one to work out

General Dev>In The News

Had a game idea - what do you think?

Game Dev>Chat

The weirdest tool I own is also one of the most useful (and it's $14 on Amazon)

General Dev>In The News

A first look at Django's new background tasks

Backend>In The News

Introducing the New Runbook Execution Engine

General Dev>In The News

How to use Linux vsock for fast VM communication

Linux>In The News

OS Malevich — how we made a system that embodies the idea of absolute simplicity

General Dev>In The News

A trillion dollars is a terrible thing to waste

AI>In The News

Petition to recognise open source work as civic service in Germany

General Dev>In The News

Swedish publishers file police report against Meta's Zuckerberg for fraud

General Dev>In The News

Advent of Code 2025: A Kotlin Playground

Backend>Official News

Devtalk ❯

We ❤️ helpful members!

We reward our most helpful members via our MOTM scheme - by giving away a whopping 25 books per year!

Sub Categories:

We're in Beta

About us Mission Statement See our Roadmap

Training language models to be warm and empathetic makes them less reliable and more sycophantic

CommunityNews

Training language models to be warm and empathetic makes them less reliable and more sycophantic

Where Next?

Popular Ai topics

Nvidia Uses AI to Slash Bandwidth on Video Calls

Everyone wants to do the model work, not the data work: Data Cascades in High-Stakes AI (pdf)

Google AI tool can help patients identify skin conditions

DeepMind says reinforcement learning is ‘enough’ to reach general AI

The Evolution of AI in the USA, 1956-1996

Will Transformers Take Over Artificial Intelligence?

Hyundai announces $400M AI, robotics institute powered by Boston Dynamics

OpenAI invites everyone to test new AI-powered chatbot—with amusing results

Klarna CEO says the company stopped hiring a year ago because AI 'can already do all of the jobs'

Local LLM for Coding with Ollama on macOS

Other popular topics

Which vertical monitor do you use?

Apple Game Frameworks and Technologies

Keyboard thock (sound)

The V Programming Language

PragProg’s Medium Posts

Using Regular Expressions in Erlang

Spotlight: Dmitry Zinoviev (Author) Interview and AMA!

Spotlight: Jamis Buck (Author) Interview and AMA!

Agile Web Development with Rails 8

DeepSeek (671B) running on a cluster of 8 Mac Mini Pros with 64GB RAM each

Sponsor Spotlight

AI>In The News

Latest on Devtalk

We ❤️ helpful members!

Devtalk Sponsors

Categories:

Sub Categories:

Popular Portals

Devtalk Sponsors

We're in Beta