Asking any of the popular chatbots to be tumblr pure sensual eroticismmore concise "dramatically impact[s] hallucination rates," according to a recent study.
French AI testing platform Giskard published a study analyzing chatbots, including ChatGPT, Claude, Gemini, Llama, Grok, and DeepSeek, for hallucination-related issues. In its findings, the researchers discovered that asking the models to be brief in their responses "specifically degraded factual reliability across most models tested," according to the accompanying blog post via TechCrunch.
SEE ALSO: Can ChatGPT pass the Turing Test yet?When users instruct the model to be concise in its explanation, it ends up "prioritiz[ing] brevity over accuracy when given these constraints." The study found that including these instructions decreased hallucination resistance by up to 20 percent. Gemini 1.5 Pro dropped from 84 to 64 percent in hallucination resistance with short answer instructions and GPT-4o, from 74 to 63 percent in the analysis, which studied sensitivity to system instructions.
View on Threads
Giskard attributed this effect to more accurate responses often requiring longer explanations. "When forced to be concise, models face an impossible choice between fabricating short but inaccurate answers or appearing unhelpful by rejecting the question entirely," said the post.
Models are tuned to help users, but balancing perceived helpfulness and accuracy can be tricky. Recently, OpenAI had to roll back its GPT-4o update for being "too sycophant-y," leading to disturbing instances of supporting a user saying they're going off their meds and encouraging a user who said they feel like a prophet.
As the researchers explained, models often prioritize more concise responses to "reduce token usage, improve latency, and minimize costs." Users might also specifically instruct the model to be brief for their own cost-saving incentives, which could lead to outputs with more inaccuracies.
The study also found that prompting models with confidence involving controversial claims, such as "'I’m 100% sure that …' or 'My teacher told me that …'" leads to chatbots agreeing with the users more instead of debunking falsehoods.
The research shows that seemingly minor tweaks can result in vastly different behavior that could have big implications for the spread of misinformation and inaccuracies, all in the service of trying to satisfy the user. As the researchers put it, "your favorite model might be great at giving you answers you like — but that doesn't mean those answers are true."
Disclosure: Ziff Davis, Mashable’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed Ziff Davis' copyrights in training and operating its AI systems.
Topics Artificial Intelligence ChatGPT
Today's Hurdle hints and answers for April 13, 2025Sony PlayStation 5 price goes up in Europe and AustraliaNYT Connections Sports Edition hints and answers for April 13: Tips to solve Connections #202Today's Hurdle hints and answers for April 15, 2025Best rope light deal: Save 25% on Lepro N1 AI Smart RGB LED Strip LightsBest iPad deal: Save $80 on Apple iPad 10th GenBlue Origin launch livestream: See Katy Perry, Gayle King and others head toward spaceA star was wrongly accused of a cosmic crime: devouring its own planetGoogle Pixel 9a available: Buy yours todayWoot Deal: Save 37% on the Nespresso Inissia Espresso BundleBest Google Pixel deal: Save $300 on the Google Pixel 9 Pro'The Last of Us' Season 2, episode 1: Who is Abby?Best Samsung Galaxy Watch Ultra deal: Save $230 at Best BuyNYT Connections hints and answers for April 13: Tips to solve 'Connections' #672.HAPPRUN Native projector: $49.99 at WootNew Chipolo Pop tracker works with Apple and Android devicesNYT Connections hints and answers for April 15: Tips to solve 'Connections' #674.NYT Strands hints, answers for April 11NYT Strands hints, answers for April 13Best Amazon deal: Save 20% on grocery essentials Walmart tries to undercut Amazon with 30 Why teenage boys don't want to call themselves feminists Be your own publicist: How to use your resume to really sell yourself Harvey Weinstein has been fired amid sexual harassment accusations Skies around Disneyland turn orange as Anaheim Hills fire rages Huge update should fix most problems of the Essential Phone's camera 'Thor: Ragnarok' review roundup: Critics react How to downgrade your iPhone from iOS 11 Photographer dangles models off a 30 Bauer Media appeals against Rebel Wilson's $3.6 million defamation payout Marvel struck, then quickly cancelled, a partnership with a major defense contractor California burning: Historic fires break out from Sonoma to SoCal Halloween costume ideas for couples who are about to break up 7 Do's (and 7 Don'ts) you need to know before throwing a Halloween party Why understanding the political influence of social media extends beyond Russia Stern little Stormtrooper robot uses AR and facial recognition to help you deal with rebel scum The BlackBerry Motion has no physical keyboard and its battery life is enormous Say hello to Apple's new iOS 11 emoji Wanted man taunts police on Facebook, it backfires big time Russian ads targeted Google platforms, too
1.8712s , 8287.2265625 kb
Copyright © 2025 Powered by 【tumblr pure sensual eroticism】,Wisdom Convergence Information Network