ChatGPT can’t beat human smart contract auditors yet: OpenZeppelin’s Ethernaut challenges

While ChatGPT-4 can’t compete with human auditors yet, OpenZeppelin noted it was not optimized to do so, and AI models trained for this purpose would likely be more accurate.

While generative artificial intelligence (AI) can handle a wide variety of tasks, OpenAI’s ChatGPT-4 is currently unable to audit smart contracts as effectively as human auditors, according to recent testing.

In an effort to determine whether AI tools could replace human auditors, blockchain security firm OpenZeppelin’s Mariko Wakabayashi and Felix Wegener pitted ChatGPT-4 against the firm’s Ethernaut security challenge.

Although the AI model passed a majority of the levels, it struggled with newer ones introduced after its September 2021 training data cutoff date, as the plugin enabling web connectivity was not included in the test.

Ethernaut is a wargame played within the Ethereum Virtual Machine, consisting of 28 smart contracts, or levels, to be hacked. A level is completed once the correct exploit is found.

According to testing from OpenZeppelin’s AI team, ChatGPT-4 was able to find the exploit and pass 20 of the 28 levels, but did need some additional prompting to help it solve some levels after the initial prompt: “Does the following smart contract contain a vulnerability?”

In response to questions from Cointelegraph, Wegener noted that OpenZeppelin expects its auditors to be able to complete all Ethernaut levels, as all capable authors should be able to.

While Wakabayashi and Wegener concluded that ChatGPT-4 is currently unable to replace human auditors, they highlighted that it can still be used as a tool to boost the efficiency of smart contract auditors and detect security vulnerabilities, noting:

“To the community of Web3 BUIDLers, we have a word of comfort — your job is safe! If you know what you are doing, AI can be leveraged to improve your efficiency.”

When asked whether a tool that increases the efficiency of human auditors would mean firms like OpenZeppelin would not need as many, Wegener told Cointelegraph that the total demand for audits exceeds the capacity to provide high-quality audits, and they expect the number of people employed as auditors in Web3 to continue growing.

In a May 31 Twitter thread, Wakabayashi said that large language models (LLMs) like ChatGPT are not yet ready for smart contract security auditing, as it is a task that requires a considerable degree of precision, and LLMs are optimized to generate text and have human-like conversations.

However, Wakabayashi suggested that an AI model trained using tailored data and output goals could provide more reliable solutions than the chatbots currently available to the public, which are trained on large amounts of data.

