In the MedQA benchmark, AI systems have shown significant progress. The best-performing GPT-4 Medprompt model in 2023 achieved an accuracy of 90.2%, an increase of 22.6 percentage points from the highest score in 2022. AI performance has nearly tripled since the benchmark was introduced in 2019.