Engineering Data ScienceStatisticsML

Senior Data Scientist

Statistical modeling, A/B testing, causal inference, and advanced analytics.

$skillfish add ~skills/senior-data-scientist

Real Conversations

See how the Senior Data Scientist skill responds to actual requests. These are representative examples of what your AI agent will produce.

You

We want to A/B test removing the coupon code field from checkout. Baseline conversion is 3.2%. We need 95% confidence. How do I size the experiment?

Agent

Sample Size Calculation:

from scipy.stats import norm
import math

def sample_size_per_variant(p_baseline, mde, alpha=0.05, power=0.80):
    """Calculate required n per variant for a two-proportion z-test."""
    p_treatment = p_baseline * (1 + mde)
    p_avg = (p_baseline + p_treatment) / 2
    
    z_alpha = norm.ppf(1 - alpha / 2)  # two-tailed
    z_beta = norm.ppf(power)
    
    n = (z_alpha + z_beta)**2 * (p_baseline*(1-p_baseline) + p_treatment*(1-p_treatment)) / (p_treatment - p_baseline)**2
    return math.ceil(n)

# Your numbers:
n = sample_size_per_variant(p_baseline=0.032, mde=0.10)  # detect 10% relative lift
# → ~8,300 users per variant = 16,600 total

At what traffic volume does this complete?

  • 1,000 checkouts/day → 17 days
  • 5,000 checkouts/day → 4 days

Key decisions before you launch:

  1. MDE choice matters more than alpha: A 10% relative lift on 3.2% = detecting 3.52% vs 3.2% — is that a meaningful business outcome? If not, set MDE to 5% (doubles sample size).
  2. Randomization unit: Randomize by user_id, not session — a user seeing both variants inflates false positives
  3. Guardrail metrics: Track add_to_cart rate and revenue_per_visitor alongside conversion — removing the coupon field could hurt AOV even if it lifts conversion

Analysis (post-experiment):

from scipy.stats import chi2_contingency
contingency = [[control_converted, control_not], [treat_converted, treat_not]]
chi2, p_value, _, _ = chi2_contingency(contingency)

Get Started

How to install and use this skill in your preferred environment.

Skills are designed for AI coding agents (Claude Code, Cursor, Windsurf) and IDE-based workflows where the agent can read files, run scripts, and act on your codebase. Web-based AI can use the knowledge and frameworks, but won't have tool access.

Models & Context

Which AI models and context windows work best with this skill.

Recommended Models

Best
Claude Opus 4 Claude Sonnet 4 GPT-4.1 Gemini 2.5 Pro Grok 3 Kimi K2
Good
Claude Haiku 4.5 GPT-4.1 mini Gemini 2.5 Flash Grok 3 mini

Larger models produce more detailed, production-ready outputs.

Context Window

This skill's SKILL.md is typically 3–10 KB — fits in any modern context window.

8K Skill only
32K+ Skill + conversation
100K+ Skill + references + codebase

All current frontier models (Claude, GPT, Gemini) support 100K+ context. Use the full window for complex multi-service work.

Pro tips for best results

1

Be specific

Include numbers — users, budget, RPS — so the skill can size the architecture.

2

Share constraints

Compliance needs, team size, and existing stack all improve the output.

3

Iterate

Start with a high-level design, then ask follow-ups for IaC, cost analysis, or security review.

4

Combine skills

Pair with companion skills below for end-to-end coverage.

Ready to try Senior Data Scientist?

Install the skill and start getting expert-level guidance in your workflow — any agent, any IDE.

$skillfish add ~skills/senior-data-scientist
← Browse all 169 skills