Miklós Sebők - Rebeka Kiss

Gemini’s ‘Audio Overview’ as a Tool for Open Science: Turning Scientific Papers into Accessible Audio

Can artificial intelligence make academic research more accessible to non-specialist audiences—or even to busy researchers on the go? Gemini’s new ‘Audio Overview’ feature provides a novel way to experience scientific papers: through short, conversational audio summaries. Available even in the free version of Gemini 2.5 Flash, this

Testing the Limits of AI Peer Review: When Even Ian Goodfellow Gets Rejected by OpenReviewer

High-quality feedback is essential for researchers aiming to improve their work and navigate the peer review process more effectively. Ideally, such feedback would be available before formal submission—allowing authors to identify the strengths and weaknesses of their research early on. This is precisely the promise of OpenReviewer, an automated

Structured Legal Summarisation of Constitutional Court Decisions from PDF Using Gemini 2.5 Pro

This post presents a structured application of the Gemini 2.5 Pro language model for the bilingual summarisation of selected Hungarian Constitutional Court decisions. Working exclusively from an official PDF file containing full-text rulings, the model was prompted to generate parallel summaries in Hungarian and UK English, along with a

Slide Generation from Scientific Articles: Putting Manus’s New Slide Generator to the Test

In this post, we examine the performance of Manus’s newly updated slide generation tool when applied to a peer-reviewed scientific article. The developers state that recent updates focused on improving the tool’s support for academic communication. To test these capabilities, we selected a published study in political science

Step-by-Step Guide to Integrating GPT and Claude into Google Docs for Custom AI-Automated Editing

This post shows you how to connect large language models—such as GPT or Claude—directly to Google Docs using Apps Script. With just a few setup steps, you can send selected text to your preferred AI model, receive improved versions, summaries, stylistic rewrites, or even answers to complex questions—

Testing Publisher AI Tools for Journal Selection: A Guide for Researchers

As AI-assisted tools become increasingly embedded in academic publishing, most major journal platforms now offer automated systems that claim to recommend suitable outlets based on a manuscript’s abstract. But how well do these tools perform in practice? To explore this, we tested five journal finder platforms — four developed by

Using Gemini for Grammar and Style Correction in Google Docs

Integrating Google’s Gemini assistant into Google Docs offers a lightweight yet effective solution for academic and professional editing tasks. Rather than relying on traditional spelling and grammar checkers, users can now issue custom prompts to Gemini—transforming the assistant into a real-time editorial aid capable of producing high-quality revisions

Transforming Academic References into Structured HTML with Mistral Le Chat

Academic writing increasingly relies on consistent, machine-readable formatting, especially when preparing manuscripts for digital publication, automated parsing, or citation indexing. This post demonstrates how Mistral Le Chat can accurately convert plain-text bibliographic entries into structured HTML, generating both inline (short-form) citations and full bibliographic records with cross-linked anchors. This

Automated LaTeX Generation From an Academic PDF: Practical Workflow Using GPT-4.1

Formatting academic manuscripts in LaTeX can be both laborious and technically demanding—especially when converting raw text into structured, publication-ready documents. This post presents a practical workflow using GPT-4.1 to automate manuscript formatting with remarkable precision. With a single, well-crafted prompt, the model generates clean LaTeX code with proper

Prompt-Driven Video Analysis of Animal Behaviour Using Gemini 2.5 Pro in Google AI Studio and via API

Can state-of-the-art multimodal models analyse animal behaviour directly from video footage? In this study, we tested Google’s Gemini 2.5 Pro — both in AI Studio and via its API — to assess whether it can produce structured ethological descriptions based purely on short animal-related videos. By applying a consistent prompt

Prompt-Based Hamster Ovary Cell Segmentation with OpenAI o4-mini-high on Microscopy Images

Recent advancements in multimodal language models have opened new avenues for analysing scientific image data using natural language instructions. In this post, we explore the capabilities of OpenAI’s o4-mini-high model for performing cell segmentation tasks on microscopy images through prompt-based interaction. Rather than relying on traditional computer vision techniques

Zero-Shot Distal Fibula Fracture Detection: Gemini 2.5 Flash Delivers Spot-On Results on German Radiology Reports

Identifying specific clinical findings in unstructured medical texts is a common challenge in healthcare data science. In this post, we benchmark Google’s Gemini 2.5 Flash language model on a zero-shot classification task: detecting the presence or absence of distal fibula fractures in real-world German radiology reports. Without any