PROMPT REVOLUTION

Hands-On Prompt-Tutorials for Using GenAI in Research and Education

Setting Up Claude Code for Local File Querying: A Step-by-Step Guide

Claude Code is Anthropic's command line-integrated agent that runs directly on your computer, enabling terminal-based interactions with local files and applications. While the name suggests technical complexity, Claude Code is surprisingly accessible—offering the same conversational interface as chat-based models, but with the ability to execute code and

by Dávid László

Benchmarking AI Audio Transcription: How Leading Models Handle English and Hungarian Speech

As audio becomes an increasingly common input for AI assistants, it becomes natural to ask whether these platforms can also transcribe pre-recorded audio accurately. In this benchmark, we tested five leading generative AI models on identical English and Hungarian audio files to assess their transcription accuracy, language support, and practical

by Dávid László

Testing PangramLabs for AI Text Detection: Impressive but Imperfect Performance

Can AI detection tools reliably distinguish between human and machine-generated text? In our previous testing, we found that platforms like Originality.ai and ZeroGPT and GenAI models, frequently misclassified both AI and human text. Now, PangramLabs has gained attention with claims of near-perfect accuracy, verified by third-party organizations. In this

by Dávid László

Understanding Model Confidence Through Log Probabilities: A Practical OpenAI API Implementation

Log probabilities (logprobs) provide a window into how confident a language model is about its predictions. In this technical implementation, we demonstrate how to access and interpret logprobs via the OpenAI API, using a series of increasingly difficult multiplication tasks. Our experiment reveals that declining confidence scores can effectively signal

by Dávid László

Testing Claude Projects' New RAG Feature: Fast Setup, Accurate Retrieval Across 113 Articles

Claude Projects recently introduced a Retrieval-Augmented Generation (RAG) feature that extends its capabilities beyond the standard context window. Unlike traditional approaches that require custom vector databases and API integration, Claude's implementation offers a streamlined, no-code solution for handling large knowledge bases directly within the Projects interface. In this

by Dávid László

Evolving File Handling in GenAI Models: Stronger Input Support, Persistent Output Limitations

In a previous blog, we examined the file handling capabilities of leading GenAI interfaces. That analysis detailed which formats they could process reliably and where they encountered difficulties—particularly with structured data and technical file types. Since then, the landscape has shifted. While downloadable file generation still faces notable constraints,

by Rebeka Kiss