Senior Data Scientist
Statistical modeling, A/B testing, causal inference, and advanced analytics.
$skillfish add ~skills/senior-data-scientist Real Conversations
See how the Senior Data Scientist skill responds to actual requests. These are representative examples of what your AI agent will produce.
We want to A/B test removing the coupon code field from checkout. Baseline conversion is 3.2%. We need 95% confidence. How do I size the experiment?
Sample Size Calculation:
from scipy.stats import norm
import math
def sample_size_per_variant(p_baseline, mde, alpha=0.05, power=0.80):
"""Calculate required n per variant for a two-proportion z-test."""
p_treatment = p_baseline * (1 + mde)
p_avg = (p_baseline + p_treatment) / 2
z_alpha = norm.ppf(1 - alpha / 2) # two-tailed
z_beta = norm.ppf(power)
n = (z_alpha + z_beta)**2 * (p_baseline*(1-p_baseline) + p_treatment*(1-p_treatment)) / (p_treatment - p_baseline)**2
return math.ceil(n)
# Your numbers:
n = sample_size_per_variant(p_baseline=0.032, mde=0.10) # detect 10% relative lift
# → ~8,300 users per variant = 16,600 total
At what traffic volume does this complete?
- 1,000 checkouts/day → 17 days
- 5,000 checkouts/day → 4 days
Key decisions before you launch:
- MDE choice matters more than alpha: A 10% relative lift on 3.2% = detecting 3.52% vs 3.2% — is that a meaningful business outcome? If not, set MDE to 5% (doubles sample size).
- Randomization unit: Randomize by
user_id, not session — a user seeing both variants inflates false positives - Guardrail metrics: Track
add_to_cartrate andrevenue_per_visitoralongside conversion — removing the coupon field could hurt AOV even if it lifts conversion
Analysis (post-experiment):
from scipy.stats import chi2_contingency
contingency = [[control_converted, control_not], [treat_converted, treat_not]]
chi2, p_value, _, _ = chi2_contingency(contingency)
Get Started
How to install and use this skill in your preferred environment.
Skills are designed for AI coding agents (Claude Code, Cursor, Windsurf) and IDE-based workflows where the agent can read files, run scripts, and act on your codebase. Web-based AI can use the knowledge and frameworks, but won't have tool access.
Models & Context
Which AI models and context windows work best with this skill.
Recommended Models
Larger models produce more detailed, production-ready outputs.
Context Window
This skill's SKILL.md is typically 3–10 KB — fits in any modern context window.
All current frontier models (Claude, GPT, Gemini) support 100K+ context. Use the full window for complex multi-service work.
Pro tips for best results
Be specific
Include numbers — users, budget, RPS — so the skill can size the architecture.
Share constraints
Compliance needs, team size, and existing stack all improve the output.
Iterate
Start with a high-level design, then ask follow-ups for IaC, cost analysis, or security review.
Combine skills
Pair with companion skills below for end-to-end coverage.
Ready to try Senior Data Scientist?
Install the skill and start getting expert-level guidance in your workflow — any agent, any IDE.
$skillfish add ~skills/senior-data-scientist