Augmented Data Science: Hypothesis search agent
Augmented Data Science: Hypothesis search agent tl;dr (never ai;dr) Generating testable hypotheses has mostly relied on the data scientist's experience and a literature review (in line with guidance from the business team). An LLM agent skill can structure and expand that search. We built a three-step skill ( context gathering , causal vs. predictive framing , and evidence-backed hypothesis table ) and tested it on two retail business questions using Claude Opus 4.6 and GPT 5.4. Framing the problem and providing the agent with available variables did most of the work: with minimal context, the causal/predictive distinction produced useful, literature-backed hypotheses. 86% of references checked out; directional claims were reliable, but effect sizes were not. Consistent with prior work, we found that LLMs exaggerate the findings in existing research. This confirms our earlier conclusion: data science teams need to establish...