artificial intelligence

chatGPT

Claude

Coin Telegraph

Google Bard

misinformation

News

July 28, 2023 Coin Telegraph 0

AI researchers say they’ve found a way to jailbreak Bard and ChatGPT

now viewing

AI researchers say they’ve found a way to jailbreak Bard and ChatGPT

July 28, 2023 Coin Telegraph

now playing

Changpeng Zhao Says Binance’s Listing Process Is ‘Broken’ Following New Memecoin Listing

July 28, 2023 Coin Telegraph

now playing

Elon Musk re-adopts 'Harry Bōlz' persona, briefly triggers Bōlz-themed meme coin frenzy

July 28, 2023 Coin Telegraph

Bitcoin and Ether ETFs Experience Capital Losses of Over 0 Million in Combined Outflows

now playing

Bitcoin and Ether ETFs Experience Capital Losses of Over $200 Million in Combined Outflows

July 28, 2023 Coin Telegraph

now playing

Experience Unmatched Gaming With Betplay: Your VIP Destination for Casino and Sports Betting

July 28, 2023 Coin Telegraph

now playing

The Death of the Penny: Trump Orders U.S. Treasury to Stop Minting One-Cent Coins

July 28, 2023 Coin Telegraph

now playing

$2.5M in Crypto Assets Seized in Thai Scam Raid

July 28, 2023 Coin Telegraph

now playing

‘Hard for Me To Be Bearish’ – Analyst Predicts Altcoin Recovery, Sees Super Bright Future for Crypto

July 28, 2023 Coin Telegraph

now playing

EU AI Act: A Double-Edged Sword for Startups and Small Businesses

July 28, 2023 Coin Telegraph

now playing

Solana Meme Coins Pop, While Lightchain AI Presale Explodes

July 28, 2023 Coin Telegraph

now playing

Crypto Trader Predicts Final Drawdown for Bitcoin Before Igniting ‘The Most Aggressive Move’ of the Bull Market

July 28, 2023 Coin Telegraph

Source: Coin Telegraph

Artificial intelligence researchers claim to have found an automated, easy way to construct “adversarial attacks” on large language models.

United States-based researchers have claimed to have found a way to consistently circumvent safety measures from artificial intelligence chatbots such as ChatGPT and Bard to generate harmful content.

According to a report released on July 27 by researchers at Carnegie Mellon University and the Center for AI Safety in San Francisco, there’s a relatively easy method to get around safety measures used to stop chatbots from generating hate speech, disinformation, and toxic material.

Well, the biggest potential infohazard is the method itself I suppose. You can find it on github. https://t.co/2UNz2BfJ3H

— PauseAI ⏸ (@PauseAI) July 27, 2023

The circumvention method involves appending long suffixes of characters to prompts fed into the chatbots such as ChatGPT, Claude, and Google Bard.

The researchers used an example of asking the chatbot for a tutorial on how to make a bomb, which it declined to provide.

*Screenshots of harmful content generation from AI models tested. Source: llm-attacks.org*

Researchers noted that even though companies behind these LLMs, such as OpenAI and Google, could block specific suffixes, here is no known way of preventing all attacks of this kind.

The research also highlighted increasing concern that AI chatbots could flood the internet with dangerous content and misinformation.

Professor at Carnegie Mellon and an author of the report, Zico Kolter, said:

“There is no obvious solution. You can create as many of these attacks as you want in a short amount of time.”

The findings were presented to AI developers Anthropic, Google, and OpenAI for their responses earlier in the week.

OpenAI spokeswoman, Hannah Wong told the New York Times they appreciate the research and are “consistently working on making our models more robust against adversarial attacks.”

Professor at the University of Wisconsin-Madison specializing in AI security, Somesh Jha, commented if these types of vulnerabilities keep being discovered, “it could lead to government legislation designed to control these systems.”

The research underscores the risks that must be addressed before deploying chatbots in sensitive domains.

In May, Pittsburgh, Pennsylvania-based Carnegie Mellon University received $20 million in federal funding to create a brand new AI institute aimed at shaping public policy.

Magazine: AI Eye: AI travel booking hilariously bad, 3 weird uses for ChatGPT, crypto plugins

Go to Source
Author: Martin Young

Coin Telegraph

AI researchers say they’ve found a way to jailbreak Bard and ChatGPT

AI researchers say they’ve found a way to jailbreak Bard and ChatGPT

Changpeng Zhao Says Binance’s Listing Process Is ‘Broken’ Following New Memecoin Listing

Elon Musk re-adopts 'Harry Bōlz' persona, briefly triggers Bōlz-themed meme coin frenzy

Bitcoin and Ether ETFs Experience Capital Losses of Over $200 Million in Combined Outflows

Experience Unmatched Gaming With Betplay: Your VIP Destination for Casino and Sports Betting

The Death of the Penny: Trump Orders U.S. Treasury to Stop Minting One-Cent Coins

$2.5M in Crypto Assets Seized in Thai Scam Raid

‘Hard for Me To Be Bearish’ – Analyst Predicts Altcoin Recovery, Sees Super Bright Future for Crypto

EU AI Act: A Double-Edged Sword for Startups and Small Businesses

Solana Meme Coins Pop, While Lightchain AI Presale Explodes

Crypto Trader Predicts Final Drawdown for Bitcoin Before Igniting ‘The Most Aggressive Move’ of the Bull Market

Changpeng Zhao Says Binance’s Listing Process Is ‘Broken’ Following New Memecoin Listing

Elon Musk re-adopts ‘Harry Bōlz’ persona, briefly triggers Bōlz-themed meme coin frenzy

Bitcoin and Ether ETFs Experience Capital Losses of Over $200 Million in Combined Outflows

Share this video

AI researchers say they’ve found a way to jailbreak Bard and ChatGPT

Related posts:

Share this video