Thursday, November 20, 2025
DIGESTWIRE
Contribute
CONTACT US
  • Home
  • World
  • UK
  • US
  • Breaking News
  • Technology
  • Entertainment
  • Health Care
  • Business
  • Sports
    • Sports
    • Cricket
    • Football
  • Defense
  • Crypto
    • Crypto News
    • Crypto Calculator
    • Coins Marketcap
    • Top Gainers and Loser of the day
    • Crypto Exchanges
  • Politics
  • Opinion
  • Blog
  • Founders
No Result
View All Result
  • Home
  • World
  • UK
  • US
  • Breaking News
  • Technology
  • Entertainment
  • Health Care
  • Business
  • Sports
    • Sports
    • Cricket
    • Football
  • Defense
  • Crypto
    • Crypto News
    • Crypto Calculator
    • Coins Marketcap
    • Top Gainers and Loser of the day
    • Crypto Exchanges
  • Politics
  • Opinion
  • Blog
  • Founders
No Result
View All Result
DIGESTWIRE
No Result
View All Result
Home Blockchain

OpenAI GPT 4o ranked as best AI model for writing Solidity smart contract code by IQ

by DigestWire member
October 21, 2024
in Blockchain, Crypto Market, Cryptocurrency
0
OpenAI GPT 4o ranked as best AI model for writing Solidity smart contract code by IQ
74
SHARES
1.2k
VIEWS
Share on FacebookShare on Twitter

SolidityBench by IQ has launched as the first leaderboard to evaluate LLMs in Solidity code generation. Available on Hugging Face, it introduces two innovative benchmarks, NaïveJudge and HumanEval for Solidity, designed to assess and rank the proficiency of AI models in generating smart contract code.

Developed by IQ’s BrainDAO as part of its forthcoming IQ Code suite, SolidityBench serves to refine their own EVMind LLMs and compare them against generalist and community-created models. IQ Code aims to offer AI models tailored for generating and auditing smart contract code, addressing the growing need for secure and efficient blockchain applications.

As IQ told CryptoSlate, NaïveJudge offers a novel approach by tasking LLMs with implementing smart contracts based on detailed specifications derived from audited OpenZeppelin contracts. These contracts provide a gold standard for correctness and efficiency. The generated code is evaluated against a reference implementation using criteria such as functional completeness, adherence to Solidity best practices and security standards, and optimization efficiency.

The evaluation process leverages advanced LLMs, including different versions of OpenAI’s GPT-4 and Claude 3.5 Sonnet as impartial code reviewers. They assess the code based on rigorous criteria, including implementing all key functionalities, handling edge cases, error management, proper syntax usage, and overall code structure and maintainability.

Optimization considerations such as gas efficiency and storage management are also evaluated. Scores range from 0 to 100, providing a comprehensive assessment across functionality, security, and efficiency, mirroring the complexities of professional smart contract development.

Which AI models are best for solidity smart contract development?

Benchmarking results showed that OpenAI’s GPT-4o model achieved the highest overall score of 80.05, with a NaïveJudge score of 72.18 and HumanEval for Solidity pass rates of 80% at pass@1 and 92% at pass@3.

Interestingly, newer reasoning models like OpenAI’s o1-preview and o1-mini were beaten to the top spot, scoring 77.61 and 75.08, respectively. Models from Anthropic and XAI, including Claude 3.5 Sonnet and grok-2, demonstrated competitive performance with overall scores hovering around 74. Nvidia’s Llama-3.1-Nemotron-70B scored lowest in the top 10 at 52.54.

SolidityBench scores for LLMs (Hugging Face)
SolidityBench scores for LLMs (Hugging Face)

Per IQ, HumanEval for Solidity adapts OpenAI’s original HumanEval benchmark from Python to Solidity, encompassing 25 tasks of varying difficulty. Each task includes corresponding tests compatible with Hardhat, a popular Ethereum development environment, facilitating accurate compilation and testing of generated code. The evaluation metrics, pass@1 and pass@3, measure the model’s success on initial attempts and over multiple tries, offering insights into both precision and problem-solving capabilities.

Goals of utilizing AI models in smart contract development

By introducing these benchmarks, SolidityBench seeks to advance AI-assisted smart contract development. It encourages the creation of more sophisticated and reliable AI models while providing developers and researchers with valuable insights into AI’s current capabilities and limitations in Solidity development.

The benchmarking toolkit aims to advance IQ Code’s EVMind LLMs and also sets new standards for AI-assisted smart contract development across the blockchain ecosystem. The initiative hopes to address a critical need in the industry, where the demand for secure and efficient smart contracts continues to grow.

Developers, researchers, and AI enthusiasts are invited to explore and contribute to SolidityBench, which aims to drive the continuous refinement of AI models, promote best practices, and advance decentralized applications.

Visit the SolidityBench leaderboard on Hugging Face to learn more and begin benchmarking Solidity generation models.

The post OpenAI GPT 4o ranked as best AI model for writing Solidity smart contract code by IQ appeared first on CryptoSlate.

Read Entire Article
Tags: BlockchainCoin SurgesCryptoslate
Share30Tweet19
Next Post
Crypto Regulation Uproar! Can John Deaton End Warren’s Senate Streak?

Crypto Regulation Uproar! Can John Deaton End Warren’s Senate Streak?

Japan Crypto Tax Relief: Can Tamaki’s 20% Rate Bring Positive Change?

Japan Crypto Tax Relief: Can Tamaki’s 20% Rate Bring Positive Change?

Millions Already Hold This Latam-Based Dollar Pegged Stablecoin

Millions Already Hold This Latam-Based Dollar Pegged Stablecoin

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

I agree to the Terms & Conditions and Privacy Policy.

No Result
View All Result
Coins MarketCap Live Updates Coins MarketCap Live Updates Coins MarketCap Live Updates
ADVERTISEMENT

Highlights

Tim Robinson’s ‘The Chair Company’ Renewed for Season 2 at HBO

Amir El Masry, Kaouther Ben Hania, Yaqoub Alfarhan and Lebleba Join Red Sea In Conversation Series

‘Baywatch’ Reboot Lands $21 Million Tax Credit to Film in L.A.

Rideback RISE Teams With Highways Lab and Adobe to Launch Director and Editor Development Programs

‘House of the Dragon’ Renewed for Season 4

‘Game of Thrones’ Prequel ‘Knight of the Seven Kingdoms’ Renewed for Season 2 Ahead of Series Premiere

Trending

Mushfiqur: ‘I want to give back for as long as I’m playing for Bangladesh’
Cricket

Mushfiqur: ‘I want to give back for as long as I’m playing for Bangladesh’

by DigestWire member
November 20, 2025
0

"I want to ensure there are one or two players who can fill my gap when I...

Get the NYC Rich Mom Look With These Steep Black Friday Week Deals

Get the NYC Rich Mom Look With These Steep Black Friday Week Deals

November 20, 2025
Rachel Sennott’s ‘I Love L.A.’ Renewed for Season 2 at HBO

Rachel Sennott’s ‘I Love L.A.’ Renewed for Season 2 at HBO

November 20, 2025
Tim Robinson’s ‘The Chair Company’ Renewed for Season 2 at HBO

Tim Robinson’s ‘The Chair Company’ Renewed for Season 2 at HBO

November 20, 2025
Amir El Masry, Kaouther Ben Hania, Yaqoub Alfarhan and Lebleba Join Red Sea In Conversation Series

Amir El Masry, Kaouther Ben Hania, Yaqoub Alfarhan and Lebleba Join Red Sea In Conversation Series

November 20, 2025
DIGEST WIRE

DigestWire is an automated news feed that utilizes AI technology to gather information from sources with varying perspectives. This allows users to gain a comprehensive understanding of different arguments and make informed decisions. DigestWire is dedicated to serving the public interest and upholding democratic values.

Privacy Policy     Terms and Conditions

Recent News

  • Mushfiqur: ‘I want to give back for as long as I’m playing for Bangladesh’ November 20, 2025
  • Get the NYC Rich Mom Look With These Steep Black Friday Week Deals November 20, 2025
  • Rachel Sennott’s ‘I Love L.A.’ Renewed for Season 2 at HBO November 20, 2025

Categories

  • Blockchain
  • Blog
  • Breaking News
  • Business
  • Cricket
  • Crypto Market
  • Cryptocurrency
  • Defense
  • Entertainment
  • Football
  • Founders
  • Health Care
  • Opinion
  • Politics
  • Sports
  • Strange
  • Technology
  • UK News
  • Uncategorized
  • US News
  • World

© 2020-23 Digest Wire. All rights belong to their respective owners.

No Result
View All Result
  • Home
  • World
  • UK
  • US
  • Breaking News
  • Technology
  • Entertainment
  • Health Care
  • Business
  • Sports
    • Sports
    • Cricket
    • Football
  • Defense
  • Crypto
    • Crypto News
    • Crypto Calculator
    • Blockchain
    • Coins Marketcap
    • Top Gainers and Loser of the day
    • Crypto Exchanges
  • Politics
  • Opinion
  • Strange
  • Blog
  • Founders
  • Contribute!

© 2024 Digest Wire - All right reserved.

Privacy Policy   Terms and Conditions

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.