PROMPT REVOLUTION

Prompt-Driven Video Analysis of Animal Behaviour Using Gemini 2.5 Pro in Google AI Studio and via API

Can state-of-the-art multimodal models analyse animal behaviour directly from video footage? In this study, we tested Google’s Gemini 2.5 Pro — both in AI Studio and via its API — to assess whether it can produce structured ethological descriptions based purely on short animal-related videos. By applying a consistent prompt

by Miklós Sebők - Rebeka Kiss • Jun 3, 2025

Cell Segmentation Bioimage Analysis OpenAI

Prompt-Based Hamster Ovary Cell Segmentation with OpenAI o4-mini-high on Microscopy Images

Recent advancements in multimodal language models have opened new avenues for analysing scientific image data using natural language instructions. In this post, we explore the capabilities of OpenAI’s o4-mini-high model for performing cell segmentation tasks on microscopy images through prompt-based interaction. Rather than relying on traditional computer vision techniques

by Miklós Sebők - Rebeka Kiss • Jun 2, 2025

Gemini 2.5 Flash Radiology Life Sciences

Zero-Shot Distal Fibula Fracture Detection: Gemini 2.5 Flash Delivers Spot-On Results on German Radiology Reports

Identifying specific clinical findings in unstructured medical texts is a common challenge in healthcare data science. In this post, we benchmark Google’s Gemini 2.5 Flash language model on a zero-shot classification task: detecting the presence or absence of distal fibula fractures in real-world German radiology reports. Without any

by Miklós Sebők - Rebeka Kiss • Jun 2, 2025

Life Sciences Natural Sciences Neurobiology

Benchmarking Claude 4 Sonnet and GPT-4o for Brain MRI Image Labelling: Comparing Chat Interface and API Results

We assessed the ability of Claude 4 Sonnet and GPT-4o to classify brain MRI images as healthy or tumorous for research labelling purposes, using both the chat interface (10 images) and the API (125 images). Claude 4 Sonnet achieved perfect accuracy (10/10) in chat, but its API refused to

by Miklós Sebők - Rebeka Kiss • Jun 1, 2025

data extraction Named Entity Recognition DeepSeek-V3

Prompt-Based Disease Mention Extraction with DeepSeek-V3: A Biomedical NER Case Study on a Structured NCBI Test Set

Prompt-based methods are becoming increasingly relevant in biomedical text mining, offering flexible ways to perform tasks such as named entity recognition without explicit model training. In this case study, we assess the performance of DeepSeek-V3 on a structured disease mention extraction task using a curated subset of the NCBI Disease

by Miklós Sebők - Rebeka Kiss • Jun 1, 2025

API model comparison recommendations

Unlocking Large Language Models via API: Capabilities, Access, and Practical Considerations

Accessing large language models (LLMs) through APIs opens up research and development opportunities that go well beyond the limits of traditional chat interfaces. For researchers, API access enables language models to be built directly into bespoke workflows, analytical tools, or automated processes, supporting tasks such as large-scale text analysis, data

by Miklós Sebők - Rebeka Kiss • May 31, 2025

Ollama Open WebUI Docker

Practical Guide to Running Open-Weight Large Language Models Locally Using Ollama and Open WebUI

Recent advances in open-weight large language models have made it possible to run powerful AI tools entirely on local machines. In this article, we outline how researchers can set up and interact with models such as DeepSeek, Mistral, or Llama using Ollama for local model management and Open WebUI for

by Miklós Sebők - Rebeka Kiss • May 31, 2025

Grok-3 DeepSeek-V3 Qwen2.5-Max

Benchmarking GenAI Models for Penguin Species Prediction: Grok 3, DeepSeek-V3, and Qwen2.5-Max Delivered Top Results

How well can today’s leading GenAI models classify real-world biodiversity data—without bespoke code or traditional machine learning pipelines? In this study, we benchmarked a range of large language models on the task of predicting penguin species from tabular ecological measurements, including both numerical and categorical variables. Using a

by Miklós Sebők - Rebeka Kiss • May 30, 2025

DeepSeek-V3 Earth and Environmental Sciences Geoinformatics

Argania Detection from Sentinel-2 Spectral Data: DeepSeek-V3 Excels with Prompt-Based Labelling of Structured Data

How far can today’s large language models go in scientific data analysis—without bespoke coding or deep learning pipelines? In this experiment, we explore the ability of DeepSeek-V3 to perform pixel-level detection of Argania trees (i.e., binary classification for each pixel) using only tabular Sentinel-2 spectral data. By

by Miklós Sebők - Rebeka Kiss • May 29, 2025

data pre-processing prompt library recommendations

Linking Headlines to Article Bodies for Stance Detection: A Structured Pre-processing Workflow Using GPT-4o

Working with real-world text data often means dealing with structures that are not yet analysis-ready. In our case, the dataset included headlines and full article texts stored separately, across two different tables. The only link between them was a shared identifier field: Body ID. Before we could begin any further

by Miklós Sebők - Rebeka Kiss • May 28, 2025

plant disease detection image classification classification

Automating Plant Disease Detection at Scale: From Prompt Limitations to a High-Accuracy API Workflow with GPT-4o

Image-based classification is increasingly used across biology, ecology, and agriculture—from identifying animal species to detecting plant diseases. One common use case is the analysis of leaf images to distinguish between healthy and diseased plants. In this post, we compare two approaches to classifying strawberry leaves as either fresh (healthy)

by Miklós Sebők - Rebeka Kiss • May 28, 2025

RStudio API OpenAI

Integrating OpenAI’s GPT API into RStudio with Shiny: Real-Time Code Generation from Natural Language

This post presents a practical solution for integrating OpenAI’s GPT API into RStudio using a custom Shiny interface. The tool enables real-time code generation from natural language instructions, allowing users to interact with GPT-4 directly within their R workflow—without leaving the environment or blocking the console. We demonstrate

by Miklós Sebők - Rebeka Kiss • May 27, 2025