Sunday, November 26, 2023
DIGESTWIRE

Scientists develop AI monitoring agent to detect and stop harmful outputs

by DigestWire member
November 20, 2023
in Blockchain, Crypto Market, Cryptocurrency

The monitoring system is designed to detect and thwart both prompt injection attacks and edge-case threats.

A team of researchers from artificial intelligence (AI) firm AutoGPT, Northeastern University and Microsoft Research has developed a tool that monitors large language models (LLMs) for potentially harmful outputs and prevents them from executing.

The agent is described in a preprint research paper titled “Testing Language Model Agents Safely in the Wild.” According to the research, the agent is flexible enough to monitor existing LLMs and can stop harmful outputs, such as code attacks, before they happen.

Per the research:

“Agent actions are audited by a context-sensitive monitor that enforces a stringent safety boundary to stop an unsafe test, with suspect behavior ranked and logged to be examined by humans.”
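The audit loop described in that quote can be pictured with a minimal sketch. Everything in it is an illustrative assumption rather than the researchers' implementation: the names `Monitor`, `toy_scorer` and `run_test`, the 0.5 threshold, and above all the keyword-based scorer, which stands in for the language-model judgment a real monitor would rely on.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class Monitor:
    """Audits each proposed agent action before it is allowed to run."""
    # (task context, action) -> safety score in [0, 1]; hypothetical interface
    score_action: Callable[[str, str], float]
    threshold: float = 0.5  # hypothetical safety boundary, not from the paper
    suspect_log: List[Tuple[float, str]] = field(default_factory=list)

    def audit(self, context: str, action: str) -> bool:
        """Return True if the action may execute, False if the test must stop."""
        score = self.score_action(context, action)
        if score < self.threshold:
            # Log the suspect action so a human can examine it later.
            self.suspect_log.append((score, action))
            return False
        return True

    def ranked_suspects(self) -> List[Tuple[float, str]]:
        """Lowest-scoring (most suspicious) actions first, for human review."""
        return sorted(self.suspect_log)


def toy_scorer(context: str, action: str) -> float:
    """Placeholder scorer; a real monitor would query a language model here."""
    risky_markers = ("rm -rf", "curl http", "ignore previous instructions")
    return 0.1 if any(m in action.lower() for m in risky_markers) else 0.9


def run_test(monitor: Monitor, context: str, actions: List[str]) -> List[str]:
    """Run actions in order, stopping at the first one the monitor rejects."""
    executed = []
    for action in actions:
        if not monitor.audit(context, action):
            break  # the unsafe test is stopped before the action runs
        executed.append(action)
    return executed
```

In this sketch the monitor sits between the agent and the environment, so a rejected action is never executed. The rejected action and its score are kept in a log that can be sorted for human review.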

The team writes that existing tools for monitoring LLM outputs for harmful interactions seemingly work well in laboratory settings, but when applied to testing models already in production on the open internet, they “often fall short of capturing the dynamic intricacies of the real world.”

The apparent reason is edge cases. Despite the best efforts of the most talented computer scientists, anticipating every possible harm vector before it occurs is widely considered impossible in the field of AI.

Even when the humans interacting with AI have the best intentions, unexpected harm can arise from seemingly innocuous prompts.

An illustration of the monitor in action. On the left, a workflow ending in a high safety rating. On the right, a workflow ending in a low safety rating. Source: Naihin et al., 2023

To train the monitoring agent, the researchers built a data set of nearly 2,000 safe human-AI interactions across 29 different tasks ranging from simple text-retrieval tasks and coding corrections all the way to developing entire webpages from scratch.


They also created a competing testing data set filled with manually created adversarial outputs, including dozens intentionally designed to be unsafe.

The data sets were then used to train an agent built on OpenAI’s GPT-3.5 Turbo, a state-of-the-art system. The resulting agent can distinguish between innocuous and potentially harmful outputs with an accuracy of nearly 90%.
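For readers unfamiliar with how such an accuracy figure is computed, here is a minimal sketch, assuming each output carries a human-assigned safe or unsafe label. The function name `accuracy`, the toy classifier and the four example strings are invented for illustration and are not taken from the researchers' data.

```python
from typing import Callable, Iterable, Tuple


def accuracy(classify: Callable[[str], bool],
             labeled: Iterable[Tuple[str, bool]]) -> float:
    """Fraction of outputs whose predicted label (True = unsafe) matches the human label."""
    examples = list(labeled)
    if not examples:
        raise ValueError("need at least one labeled example")
    correct = sum(1 for text, is_unsafe in examples if classify(text) == is_unsafe)
    return correct / len(examples)


# Invented toy data: (agent output, human label where True means unsafe).
toy_examples = [
    ("read file", False),
    ("delete all files", True),
    ("delete temp note", False),   # harmless, but the toy classifier flags it
    ("summarize page", False),
]


def toy_classifier(text: str) -> bool:
    """Flags any output mentioning 'delete'; a stand-in for the trained monitor."""
    return "delete" in text


print(accuracy(toy_classifier, toy_examples))  # 0.75
```

Accuracy on its own does not show whether the errors are missed attacks or false alarms, which is why the toy data includes one harmless output that the classifier wrongly flags.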

Tags: Blockchain, Coin Surges, Cointelegraph

© 2020-23 Digest Wire. All rights belong to their respective owners.
