Marketing TestingA/BExperiments

A/B Test Setup

Plan, design, and implement A/B tests and experiments.

$skillfish add ~skills/ab-test-setup

Real Conversations

See how the A/B Test Setup skill responds to actual requests. These are representative examples of what your AI agent will produce.

You

We have a 'Start Free Trial' CTA on our pricing page converting at 3.2%. I want to test it against 'Get Started Free' and 'Try Pipedream Free'. We get about 4,000 unique visitors/month to that page. How do I set this up properly?

Agent

Test Design: 3-way CTA copy test

Sample size calculation:

  • Baseline: 3.2% conversion, MDE: 20% relative lift (0.64pp), α=0.05, power=80%
  • Required per variant: ~7,400 visitors
  • At 4,000/month split 3 ways (1,333/variant/month): **5.5 months to significance**

Recommendation: Reduce to 2 variants (control vs. best challenger) to reach significance in ~2.8 months.

GTM setup:

Trigger: Page URL contains /pricing
Variable: Custom JS → Math.random() < 0.5 ? 'variant' : 'control'
Tag: DOM manipulation → document.querySelector('.cta-primary').innerText
GA4 Event: experiment_assigned {experiment_id, variant}

What to track:

  • Primary: CTA click → trial signup (full funnel, not just click)
  • Secondary: scroll depth, time on page (detect sample pollution)

Guard rails:

  • Run minimum 2 full business cycles before calling
  • Check for novelty effect spike in week 1
  • Segment results by traffic source — paid vs. organic often behave differently

Get Started

How to install and use this skill in your preferred environment.

Skills are designed for AI coding agents (Claude Code, Cursor, Windsurf) and IDE-based workflows where the agent can read files, run scripts, and act on your codebase. Web-based AI can use the knowledge and frameworks, but won't have tool access.

Models & Context

Which AI models and context windows work best with this skill.

Recommended Models

Best
Claude Opus 4 Claude Sonnet 4 GPT-4.1 Gemini 2.5 Pro Grok 3 Kimi K2
Good
Claude Haiku 4.5 GPT-4.1 mini Gemini 2.5 Flash Grok 3 mini

Larger models produce more detailed, production-ready outputs.

Context Window

This skill's SKILL.md is typically 3–10 KB — fits in any modern context window.

8K Skill only
32K+ Skill + conversation
100K+ Skill + references + codebase

All current frontier models (Claude, GPT, Gemini) support 100K+ context. Use the full window for complex multi-service work.

Pro tips for best results

1

Be specific

Include numbers — users, budget, RPS — so the skill can size the architecture.

2

Share constraints

Compliance needs, team size, and existing stack all improve the output.

3

Iterate

Start with a high-level design, then ask follow-ups for IaC, cost analysis, or security review.

4

Combine skills

Pair with companion skills below for end-to-end coverage.

Ready to try A/B Test Setup?

Install the skill and start getting expert-level guidance in your workflow — any agent, any IDE.

$skillfish add ~skills/ab-test-setup
← Browse all 169 skills