GPT-4o GPT-4.5 file handling

Exploring Input File Support in GenAI Models: What Works and What Doesn’t

by Rebeka Kiss

Mar 31, 2025

4 min read

Exploring Input File Support in GenAI Models: What Works and What Doesn’t — Source: Freepik via freepik licence

Generative AI models have become indispensable tools across industries, but their effectiveness often hinges on the types of input files they can process. Here, we assess the capabilities of leading GenAI interfaces—including Claude, ChatGPT, Gemini, Grok, etc.—examining their support for formats like PDF, JSON, and Python files, and uncovering the current limitations.

To understand the versatility of GenAI models, we evaluated their ability to handle a range of common and specialised file formats. Our analysis covered plain text (.txt), Microsoft Word documents (.docx), PDFs (.pdf), images (.png), structured data formats like JSON (.json), Excel spreadsheets (.xlsx), and comma-separated values (.csv). We also tested support for technical formats such as LaTeX (.tex) for mathematical notation, Python scripts (.py) for code, PowerPoint presentations (.pptx), and even the more unique .dat files often used for raw data storage. This broad assessment reveals how well these models accommodate diverse inputs in real-world applications.

Interface comparison

Interface	.txt	.docx	.pdf	.png/.jpg	.JSON	.xlsx	.csv	.LaTeX	.py	.pptx	.dat
Claude	Yes	Yes	Yes	Yes	Yes	Limited	Limited	Yes	Yes	No	Yes
Copilot	Yes	Yes	Yes	Yes	Yes	Yes	Yes	No	No	Limited	No
DeepSeek	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	No
Gemini	Yes	Yes	Yes	Yes	Yes	Yes	Yes	No	Yes	Yes	No
ChatGPT	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes
Grok	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	No	Yes
LeChat (Mistral)	Yes	Yes	Yes	Yes	Yes	Yes	Yes	No	Yes	Yes	No
Qwen	Yes	Yes	Yes	Yes	Yes	Yes	Yes	No	No	No	No

Claude

Claude handles .txt, .docx, .pdf, .png/.jpg, .json, .latex, .py, and .dat files easily, demonstrating strong performance in processing text, image, and technical formats. It has difficulty with .csv and .xlsx files, often failing to read them reliably, although occasional success is possible after multiple attempts. A notable limitation is the lack of support for .pptx files.

Claude initially encountered multiple errors when attempting to process the .xlsx file, struggling with library imports and binary parsing issues. After several retries, it eventually managed to extract the content successfully—something it failed to achieve with .csv files, which remained unreadable despite repeated attempts.

The model faced difficulties when attempting to read the PowerPoint file (.pptx), extracting primarily binary and XML structure data rather than the readable text content. The actual text from the slides was not visible in the extracted data, and the program identified internal XML structures and relationships between slide elements.

Copilot

Copilot faced limitations when attempting to read .latex and .py files, displaying an error message indicating it couldn't process these types. The .dat file format posed a challenge, as it couldn't be processed either. Similarly, when extracting content from .pptx files, Copilot encountered issues with file loading, requiring multiple attempts to process the file successfully.

DeepSeek

Deepseek successfully handled most file formats, including .pdf, .doc, .xlsx, .ppt, images, text, and code. However, it encountered difficulties with the .dat file format, which it could not process. Other supported formats were processed without difficulties.

Gemini

Gemini encountered difficulties with .latex and .dat file formats, which it could not process. However, it successfully handled all other supported formats without any issues.

Grok

Grok encountered issues with the .pptx file format, displaying an error message and failing to process it. However, it successfully handled all other supported formats without any problems.

Mistral (Le Chat)

Mistral struggled with processing .latex and .dat file formats, which it was unable to handle. Nonetheless, it managed to process all other supported formats without any difficulties.

Mistral's performance (accessed on 6 April 2025)

Qwen

The Qwen models are unable to process .pptx, .latex, and .dat input files.

The models and interfaces have made significant progress, with each iteration expanding their ability to handle a wider range of input file formats. Among the tested interfaces ChatGPT were the only that successfully processed all of them without difficulties or limitations. This demonstrates their advanced capabilities in managing diverse and complex file types.

The authors used GPT-4.5 [OpenAI (2025), GPT-4.5 (accessed on 6 April 2025), Large language model (LLM), available at: https://openai.com] to generate the output.