PROMPT REVOLUTION

Prompt-Based Disease Mention Extraction with DeepSeek-V3: A Biomedical NER Case Study on a Structured NCBI Test Set

Prompt-based methods are becoming increasingly relevant in biomedical text mining, offering flexible ways to perform tasks such as named entity recognition without explicit model training. In this case study, we assess the performance of DeepSeek-V3 on a structured disease mention extraction task using a curated subset of the NCBI Disease

by Miklós Sebők - Rebeka Kiss • Jun 1, 2025

Grok-3 • DeepSeek-V3 • Qwen2.5-Max

Benchmarking GenAI Models for Penguin Species Prediction: Grok 3, DeepSeek-V3, and Qwen2.5-Max Delivered Top Results

How well can today’s leading GenAI models classify real-world biodiversity data—without bespoke code or traditional machine learning pipelines? In this study, we benchmarked a range of large language models on the task of predicting penguin species from tabular ecological measurements, including both numerical and categorical variables. Using a

by Miklós Sebők - Rebeka Kiss • May 30, 2025

DeepSeek-V3 • Earth and Environmental Sciences • Geoinformatics

Argania Detection from Sentinel-2 Spectral Data: DeepSeek-V3 Excels with Prompt-Based Labelling of Structured Data

How far can today’s large language models go in scientific data analysis—without bespoke coding or deep learning pipelines? In this experiment, we explore the ability of DeepSeek-V3 to perform pixel-level detection of Argania trees (i.e., binary classification for each pixel) using only tabular Sentinel-2 spectral data. By

by Miklós Sebők - Rebeka Kiss • May 29, 2025

data pre-processing • prompt library • recommendations

Linking Headlines to Article Bodies for Stance Detection: A Structured Pre-processing Workflow Using GPT-4o

Working with real-world text data often means dealing with structures that are not yet analysis-ready. In our case, the dataset included headlines and full article texts stored separately, across two different tables. The only link between them was a shared identifier field: Body ID. Before we could begin any further

by Miklós Sebők - Rebeka Kiss • May 28, 2025

plant disease detection • image classification • classification

Automating Plant Disease Detection at Scale: From Prompt Limitations to a High-Accuracy API Workflow with GPT-4o

Image-based classification is increasingly used across biology, ecology, and agriculture—from identifying animal species to detecting plant diseases. One common use case is the analysis of leaf images to distinguish between healthy and diseased plants. In this post, we compare two approaches to classifying strawberry leaves as either fresh (healthy)

by Miklós Sebők - Rebeka Kiss • May 28, 2025

data pre-processing • data transformation • recommendations

No-Code Transformation of the NCBI Disease Corpus into a Structured CSV

Working with biomedical corpora often requires programming skills, specialised formats, and time-consuming preprocessing. But what if you could transform a complex annotated dataset—like the NCBI Disease Corpus—into a structured, analysis-ready CSV using nothing more than a single, well-designed prompt? In this post, we demonstrate how a no-code, GenAI-powered

by Miklós Sebők - Rebeka Kiss • May 27, 2025

agents • FutureHouse Platform • searching for literature

Comparing the FutureHouse Platform’s Falcon Agent and OpenAI’s o3 for Literature Search on Machine Coding for the Comparative Agendas Project

Having previously explored the FutureHouse Platform’s agents in tasks such as identifying tailor-made laws and generating a literature review on legislative backsliding, we now directly compare its Falcon agent and OpenAI’s o3. Our aim was to assess their performance on a focused literature search task: compiling a ranked

by Miklós Sebők - Rebeka Kiss • May 19, 2025

Copilot • limitations • presentation

Can Copilot Create Scientific Presentations? Not Quite Yet

Generative AI has made extraordinary strides in recent years, reshaping how researchers write, code, and even analyse data. But a key question remains for academics hoping to automate more of their daily work: Can popular GenAI models like Copilot, Claude 3.7 Sonnet, Gemini 2.0 Flash, GPT-4o, Grok 3

by Miklós Sebők - Rebeka Kiss • May 14, 2025

data collection • limitations • prompt library

Manus AI Will Handle It? How (Not) to Retrieve GDP Data from the World Bank Open Data

Manus is a general-purpose AI agent designed to carry out multi-step tasks on behalf of users. It promises to understand instructions, break down complex goals, and deliver results—whether that means writing code, automating research workflows, or, in our case, retrieving public data from the web. Touted as the world’

by Miklós Sebők - Rebeka Kiss • May 12, 2025

thematic analysis • Grok-3 • recommendations

Thematic Content Analysis of Martin Luther King Jr.'s "I Have a Dream" Speech Using Grok 3

Thematic content analysis is a key method in political text interpretation, but it typically requires human judgement to define categories and trace meaning. In this study, we explored whether Grok 3, a state-of-the-art large language model, can carry out this task autonomously—without predefined themes or external guidance. Using Martin

by Miklós Sebők - Rebeka Kiss • May 8, 2025

DeepSeek-V3 • classification • recommendations

Zero-Shot PRO/CON Classification: DeepSeek Achieved 100% Accuracy in Labelling Claims

Can a language model accurately classify argumentative claims without any prior examples or fine-tuning? We put DeepSeek-V3 to the test on a real-world stance classification task involving 200 claims from a structured dataset. The model was asked to determine, for each claim, whether it supported (PRO) or opposed (CON) a

by Miklós Sebők - Rebeka Kiss • May 7, 2025

Claude 3.7 Sonnet • argument mapping • data visualisation

Exploring the Argument Mapping and Visualisation Capabilities of Claude 3.7 Sonnet

Argument mapping is a useful method for visualising the logical structure of reasoning, particularly in complex or multi-step arguments. In this post, we examine how Claude 3.7 Sonnet performs when prompted to identify the structure of arguments and represent them visually. The model was given five tasks, each involving

by Miklós Sebők - Rebeka Kiss • May 6, 2025