Recommendations

Solving Health Insurance Demand and Social Loss Models Using Manus AI

Can a large language model accurately solve a university-level microeconomics exercise on health insurance? We tested Manus AI on a multi-part problem involving demand curves, list prices, out-of-pocket prices, and the calculation of social loss under various insurance schemes – including full insurance, coinsurance, and copayment plans. Not only did the

Introducing poltextLAB QuantiCheck: A Custom GPT for Evaluating Quantitative Research Rigour

In our earlier posts, we explored how useful existing Custom GPTs are for academic tasks and explained how to create your own GPT from scratch. This follow-up post puts those insights into practice by introducing QuantiCheck—a Custom GPT we developed specifically to assess the methodological rigour and reproducibility of

Thematic Content Analysis of Martin Luther King Jr.'s "I Have a Dream" Speech Using Grok 3

Thematic content analysis is a key method in political text interpretation, but it typically requires human judgement to define categories and trace meaning. In this study, we explored whether Grok 3, a state-of-the-art large language model, can carry out this task autonomously—without predefined themes or external guidance. Using Martin

Zero-Shot PRO/CON Classification: DeepSeek Achieved 100% Accuracy in Labelling Claims

Can a language model accurately classify argumentative claims without any prior examples or fine-tuning? We put DeepSeek-V3 to the test on a real-world stance classification task involving 200 claims from a structured dataset. The model was asked to determine, for each claim, whether it supported (PRO) or opposed (CON) a

Exploring the Argument Mapping and Visualisation Capabilities of Claude 3.7 Sonnet

Argument mapping is a useful method for visualising the logical structure of reasoning, particularly in complex or multi-step arguments. In this post, we examine how Claude 3.7 Sonnet performs when prompted to identify the structure of arguments and represent them visually. The model was given five tasks, each involving

Using GenAI for Critical Discourse Analysis: OpenAI o3 vs. Fairclough on Tony Blair’s 1996 Stakeholder Economy Speech

Can a generative AI model perform critical discourse analysis at a level comparable to a trained scholar? This case study puts OpenAI’s o3 model to the test by asking it to analyse Tony Blair’s 1996 “stakeholder economy” speech—a foundational text in New Labour rhetoric. The model'

Identifying Primary Texts Where Deleuze and Foucault Critique Marxist Theory: A Literature Search Prompt

Can large language models return accurate results when asked to find real academic texts written by specific philosophers on a specific topic? In this case, the topic was Marxist theory — and the instruction was to list peer-reviewed publications in which Gilles Deleuze or Michel Foucault offer a critique of it.

Can a Philosopher Be Simulated? Testing AI Argumentation on Nozick’s Experience Machine

Can a language model reason like a philosopher—not just mimic their tone, but follow and evaluate arguments with analytic precision? To explore this, we designed a two-step prompt experiment using Robert Nozick’s famous “Experience Machine” thought experiment. First, we instructed OpenAI’s o3 model to adopt the persona