December 4, 2025

Web and Technology News

Researchers surprised that with AI, toxicity is harder to fake than intelligence

The next time you encounter an unusually polite reply on social media, you might want to check twice. It could be an AI model trying (and failing) to blend in with the crowd.

On Wednesday, researchers from the University of Zurich, University of Amsterdam, Duke University, and New York University released a study revealing that AI models remain easily distinguishable from humans in social media conversations, with an overly friendly emotional tone serving as the most persistent giveaway. The research, which tested nine open-weight models across Twitter/X, Bluesky, and Reddit, found that classifiers developed by the researchers detected AI-generated replies with 70 to 80 percent accuracy.

The study introduces what the authors call a “computational Turing test” to assess how closely AI models approximate human language. Instead of relying on subjective human judgment about whether text sounds authentic, the framework uses automated classifiers and linguistic analysis to identify specific features that distinguish machine-generated from human-authored content.



