Lifestyle

AI vs. Humans: Which Performs Certain Skills Better?

With ChatGPT’s explosive rise, AI has been making its presence felt for the masses, especially in traditional bastions of human capabilities—reading comprehension, speech recognition and image identification.

This article was written by Mark Belan and originally published by Visual Capitalist.

In fact, in the chart above it’s clear that AI has surpassed human performance in quite a few areas, and looks set to overtake humans elsewhere.

How Performance Gets Tested

Using data from Contextual AI, we visualize how quickly AI models have started to beat database benchmarks, as well as whether or not they’ve yet reached human levels of skill.

Each database is devised around a certain skill, like handwriting recognition, language understanding, or reading comprehension, while each percentage score contrasts with the following benchmarks:

  • 0% or “maximally performing baseline”
    This is equal to the best-known performance by AI at the time of dataset creation.
  • 100%
    This mark is equal to human performance on the dataset.

By creating a scale between these two points, the progress of AI models on each dataset could be tracked. Each point on a line signifies a best result and as the line trends upwards, AI models get closer and closer to matching human performance.

Below is a table of when AI started matching human performance across all eight skills:

Skill Matched Human
Performance
Database Used
Handwriting Recognition 2018 MNIST
Speech Recognition 2017 Switchboard
Image Recognition 2015 ImageNet
Reading Comprehension 2018 SQuAD 1.1, 2.0
Language
Understanding
2020 GLUE
Common Sense
Completion
2023 HellaSwag
Grade School Math N/A GSK8k
Code Generation N/A HumanEval

A key observation from the chart is how much progress has been made since 2010. In fact many of these databases—like SQuAD, GLUE, and HellaSwag—didn’t exist before 2015.

In response to benchmarks being rendered obsolete, some of the newer databases are constantly being updated with new and relevant data points. This is why AI models technically haven’t matched human performance in some areas (grade school math and code generation) yet—though they are well on their way.

What’s Led to AI Outperforming Humans?

But what has led to such speedy growth in AI’s abilities in the last few years?

Thanks to revolutions in computing power, data availability, and better algorithms, AI models are faster, have bigger datasets to learn from, and are optimized for efficiency compared to even a decade ago.

This is why headlines routinely talk about AI language models matching or beating human performance on standardized tests. In fact, a key problem for AI developers is that their models keep beating benchmark databases devised to test them, but still somehow fail real world tests.

Since further computing and algorithmic gains are expected in the next few years, this rapid progress is likely to continue. However, the next potential bottleneck to AI’s progress might not be AI itself, but a lack of data for models to train on.

Share
U Cast Studios

Recent Posts

  • I Read It On The Internet

America’s Dairy Cow Replacement Inventory Collapses To Two-Decade Low

The nation's food supply chain remains under stress. We've been sounding the alarm on America's beef… Read More

23 hours ago
  • Business

Mapped: The Top 10 U.S. States, By Lowest Real GDP Growth

While the U.S. economy defied expectations in 2023, posting 2.5% in real GDP growth, several states lagged… Read More

2 days ago
  • I Read It On The Internet

Concepts In Quantum Materials And Computing: From Dreams Toward Use

You likely have never heard a student exclaim at a school career day, "I want… Read More

2 days ago
  • Business

Soaring Inflation Is Making Home & Car Insurance Unaffordable

American car owners are facing a wall of bad debt to finance vehicles they can’t afford —… Read More

3 days ago
  • I Read It On The Internet

Astronomers Discover 27,500 New Asteroids Lurking In Archival Images

There are well over a million asteroids in the solar system. Most don’t cross paths… Read More

3 days ago
  • LA/Ventura

US Regulator Opens Safety Probe Into Waymo Robotaxis

The top US auto safety regulator on Tuesday said it had opened an investigation into… Read More

4 days ago

This website uses cookies.