In this follow-up experiment, we tested how generative AI models handle handwritten text when provided as image input rather than PDF. Unlike in the earlier test, all models were able to process the images and return readable outputs.
Performance Comparison
đ Champion as of October 2: Copilot, Gemini 2.5 Pro, GPT-5
Model | Grade |
---|---|
Copilot | A |
Gemini 2.5 Pro | A |
GPT-5 | A |
Claude Sonnet 4.5 | B |
Mistral | B |
DeepSeek-V3.2 | B |
Grok-4 | C |
Qwen3-Max | C |
Three modelsâGPT-5, Copilot, and Gemini 2.5 Proâproduced fully accurate results across English, French, and Hungarian, with Gemini showing a marked improvement compared to the PDF case. Claude Sonnet 4.5, Mistral, and DeepSeek-V3.2 performed consistently with their earlier results, handling English and French reliably but introducing errors in the Hungarian version. Qwen3-Max also advanced compared to the PDF test, correctly reproducing the English and French texts from the images, though still struggling with Hungarian. Grok-4, by contrast, showed weaker performance on image input than on PDFs, with more substantial rewriting of the Hungarian text.
Input
For this experiment, we used the same three handwritten samples as in the previous testâone each in English, French, and Hungarianâbut provided them as image files rather than PDFs. The instruction given to each model was deliberately minimal: âReturn the text from the attached image.â

Output
GPT-5


Gemini 2.5 Pro

Claude Sonnet 4.5, Mistral, DeepSeek-V3.2, Qwen3-Max

Grok-4

Recommendations
This comparison underlines that the format of the input file can shape OCR performance. While image-based input proved more universally accessibleâenabling all models to return readable textâit also raises practical considerations. For short handwritten notes, supplying images is straightforward, but for longer or multi-page documents the need to scan or photograph each page makes this approach less efficient than extracting text directly from PDFs.
The authors used Copilot [Microsoft (2025) Copilot (accessed on 2 October 2025), Large language model (LLM), available at: https://copilot.microsoft.com] to generate the output.