ChatGPT successfully navigated a radiology board exam, marking a milestone in the integration of AI into the medical field, but also exposing challenges. Reuters

ChatGPT passes radiology board exam, but still has limitations


Marwa Hassan

The latest version of the artificial intelligence chatbot ChatGPT has passed a radiology board-style examination.

Researchers put ChatGPT to the test using 150 multiple-choice questions modelled on the Canadian Royal College and American Board of Radiology exams.

This breakthrough underscores the vast potential of AI in medical fields, yet it also reveals certain limitations that affect its dependability, two studies said.

ChatGPT, a deep-learning model developed by OpenAI, is known for generating humanlike responses based on the input it receives.

Its pattern recognition abilities allow it to interpret and respond to vast amounts of data, but it sometimes produces factually incorrect responses because of the absence of a source of truth in its training data.

“The use of large language models like ChatGPT is rapidly expanding and will only continue to grow,” said Dr Rajesh Bhayana, an abdominal radiologist and technology lead at University Medical Imaging Toronto, Toronto General Hospital.

“Our research offers valuable insight into how ChatGPT performs in a radiology setting, emphasising its immense potential while shedding light on current reliability issues.”

ChatGPT's usage and influence have grown rapidly. It was recently named the fastest-growing consumer application in history, and similar chatbots are being integrated into popular search engines like Google and Bing, which both physicians and patients use for medical inquiries.


In the first study, ChatGPT based on GPT-3.5 correctly answered 69 per cent of the questions, just short of the passing grade of 70 per cent.

However, it showed a noticeable gap in performance between lower-order thinking (84 per cent) and higher-order thinking questions (60 per cent), particularly struggling with descriptions of imaging findings, calculations and classifications, and the application of concepts.

Given that the AI has not received any radiology-specific training, these struggles were not unexpected.

A newer version, GPT-4, was released in March with improved advanced reasoning capabilities. In a follow-up study, GPT-4 answered 81 per cent of the same questions correctly, exceeding the passing threshold and outperforming its predecessor, GPT-3.5.

Despite these improvements, GPT-4 did not show any progress on lower-order thinking questions and answered 12 questions incorrectly that GPT-3.5 had answered correctly. This inconsistency raises questions about the AI's reliability in information gathering.

“ChatGPT gave accurate and confident answers to some challenging radiology questions, but then made some very illogical and inaccurate assertions,” said Dr Bhayana.

“Given how these models function, the inaccurate responses should not be surprising.”

The studies noted a tendency of ChatGPT to produce inaccurate responses, termed hallucinations. Although less frequent in GPT-4, this tendency still limits the chatbot's current usability in medical education and practice.

Despite the limitations, the researchers see potential in using ChatGPT to spark ideas and aid in the medical writing process and data summarisation, as long as the information is fact-checked.

“To me, this is its biggest limitation. At present, ChatGPT is best used to spark ideas, help start the medical writing process and in data summarisation. If used for quick information recall, it always needs to be fact-checked,” Dr Bhayana said.


Updated: May 16, 2023, 2:12 PM