An introduction to A-B test

An introduction to A-B test, Data Mining Group, Wang Ben (王犇, garfieldwang), 2014-10

Upload: yihucha

Post on 02-Aug-2015



Controlled experiment

example

example

• random variable

• null hypothesis

• Z-score approximation
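The three bullets above can be tied together with a small sketch. It assumes the metric is a conversion rate (each user converts or does not), takes "the two variants have the same rate" as the null hypothesis, and uses the normal approximation to get a z-score. The function name and the counts are hypothetical and do not come from the slides.

```python
import math
from statistics import NormalDist


def two_proportion_z(conv_a, n_a, conv_b, n_b):
    """z-score for H0: rate_a == rate_b, using the pooled standard error."""
    rate_a = conv_a / n_a
    rate_b = conv_b / n_b
    # Under H0 both groups share one rate, so pool the counts to estimate it.
    pooled = (conv_a + conv_b) / (n_a + n_b)
    std_error = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    return (rate_b - rate_a) / std_error


# Hypothetical counts: 200 of 10,000 convert in A, 250 of 10,000 in B.
z = two_proportion_z(conv_a=200, n_a=10_000, conv_b=250, n_b=10_000)
p_one_tailed = 1 - NormalDist().cdf(z)  # P(Z >= z) under H0
print(f"z = {z:.2f}, one-tailed p = {p_one_tailed:.4f}")
```

The approximation is only reasonable when both groups have enough conversions and non-conversions; for very small counts an exact test would be a safer choice.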

example

Hypothesis testing

1. State the null and alternative hypotheses clearly (one-tailed or two-tailed test), e.g. one-tailed.

2. Determine the test size (significance level), e.g. test size (alpha) = 0.05, critical value = 1.645.

3. Make a decision: reject or do not reject the null hypothesis, e.g. test statistic = 2.25, p-value = 0.02 …

4. Draw a conclusion and interpret it substantively.
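As a minimal sketch of steps 2 and 3, the snippet below derives the critical value from alpha and compares a z statistic against it, assuming a standard normal test statistic. The function name is hypothetical. For the numbers used in the steps, alpha = 0.05 gives a one-tailed critical value of about 1.645, and a statistic of 2.25 leads to rejection; the two-tailed p-value is simply twice the one-tailed one.

```python
from statistics import NormalDist


def z_test_decision(z, alpha=0.05):
    """Return (critical value, one-tailed p, two-tailed p, reject H0?) for an upper-tailed z-test."""
    std_normal = NormalDist()
    critical = std_normal.inv_cdf(1 - alpha)   # step 2: critical value from the test size
    p_one_tailed = 1 - std_normal.cdf(z)       # step 3: probability of a statistic at least this large
    p_two_tailed = 2 * (1 - std_normal.cdf(abs(z)))
    return critical, p_one_tailed, p_two_tailed, z > critical


critical, p_one, p_two, reject = z_test_decision(2.25)
print(f"critical = {critical:.3f}, one-tailed p = {p_one:.4f}, "
      f"two-tailed p = {p_two:.4f}, reject H0 = {reject}")
```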

Statistical Power

• Type I error (α): the probability of rejecting the null hypothesis when it is true

• Type II error (β): the probability of failing to reject the null hypothesis when it is false

• Power of a test (1 − β): the probability that it correctly rejects a false null hypothesis
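To make the relation between α, β and power concrete, here is a rough sketch for an upper-tailed z-test. It assumes the test statistic is normal with a known standard error; `true_effect` and `std_error` are illustrative parameter names, not values from the slides.

```python
from statistics import NormalDist


def power_one_tailed(true_effect, std_error, alpha=0.05):
    """Power (1 - beta) of an upper-tailed z-test when the real difference is true_effect."""
    std_normal = NormalDist()
    critical = std_normal.inv_cdf(1 - alpha)
    # Under the alternative the statistic is centred at true_effect / std_error,
    # so power is the mass of that shifted curve lying above the critical value.
    return 1 - std_normal.cdf(critical - true_effect / std_error)


# With no real effect, the rejection probability equals alpha (the Type I error rate).
print(power_one_tailed(true_effect=0.0, std_error=1.0))
# A larger effect relative to the standard error gives more power, i.e. a smaller beta.
print(power_one_tailed(true_effect=2.5, std_error=1.0))
```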

Determining sample size

• Formula 1

Determining sample size

• the point where the upper value of α on the null curve and the value for β on the alternative curve meet

• 80% power, 95% confidence level (Lehr's equation)

• assume that the distribution of the mean is normal
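The formula on this slide did not survive extraction. Lehr's equation is commonly written as n ≈ 16σ²/Δ² per group, where σ is the standard deviation of the metric and Δ is the smallest difference worth detecting; the constant 16 is a rounding of 2(z₀.₉₇₅ + z₀.₈₀)². The sketch below compares the two under those assumptions; the function names and the example numbers are hypothetical.

```python
import math
from statistics import NormalDist


def lehr_sample_size(sigma, delta):
    """Rule-of-thumb sample size per group for 80% power at a two-sided 5% level."""
    return math.ceil(16 * sigma ** 2 / delta ** 2)


def normal_sample_size(sigma, delta, alpha=0.05, power=0.80):
    """Same calculation without rounding the constant, assuming a normal mean."""
    std_normal = NormalDist()
    z_alpha = std_normal.inv_cdf(1 - alpha / 2)
    z_beta = std_normal.inv_cdf(power)
    return math.ceil(2 * (z_alpha + z_beta) ** 2 * sigma ** 2 / delta ** 2)


# Hypothetical metric: standard deviation 2.0, minimum detectable difference 0.5.
print(lehr_sample_size(sigma=2.0, delta=0.5))
print(normal_sample_size(sigma=2.0, delta=0.5))
```

Halving Δ quadruples the required sample size, which is why small expected effects need large experiments.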

Determining sample size

• Formula 2

– When |skewness| > 1, use at least 355 × S^2 samples for each variant, where S is the skewness

– The goal is for the distribution of the mean to be close to a normal distribution

– Skewness: a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. [from Wikipedia]
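A small sketch of how the 355 × S^2 rule could be applied, assuming S denotes the skewness coefficient of the metric as in reference [3]. The skewness here is the simple moment-based estimate (third central moment over the 1.5 power of the second); the function names and the sample data are hypothetical.

```python
import math


def sample_skewness(values):
    """Moment-based skewness: m3 / m2**1.5, with central moments m2 and m3."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5  # undefined when all values are identical (m2 == 0)


def min_samples_per_variant(skewness):
    """Rule of thumb from the slide: 355 * S^2 samples for each variant."""
    return math.ceil(355 * skewness ** 2)


# Hypothetical revenue-per-user data: most users spend nothing, a few spend a lot.
revenue = [0] * 90 + [10] * 9 + [100]
skew = sample_skewness(revenue)
print(f"skewness = {skew:.2f}, samples per variant >= {min_samples_per_variant(skew)}")
```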

Rules - Small Changes can have a Big Impact to Key Metrics

Session success rate improved, time-to-success improved, +$10M annually. This kind of success is rare.

Rules - Speed Matters a LOT

• every 100msec speedup improves revenue by 0.6%

Rules - Reducing Abandonment is Hard, Shifting Clicks is Easy

• local improvements are easy

• global improvements are much harder

• Successes:

– significant improvements to relevance

– anti-malware flight

More Tips

• A-A test

• Primacy & newness effects

• Robots

• Long-term goals
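The A-A test in the list above can be illustrated with a simulation: both groups receive the same experience, so any "significant" result is a false positive, and a healthy setup should flag roughly alpha of the runs. This is only a sketch with hypothetical parameters (500 users per group, a 10% conversion rate, a two-tailed z-test), not the procedure used in the slides.

```python
import math
import random
from statistics import NormalDist


def aa_false_positive_rate(n_users=500, rate=0.1, runs=1000, alpha=0.05, seed=0):
    """Fraction of simulated A-A tests that wrongly report a significant difference."""
    rng = random.Random(seed)  # fixed seed so the simulation is repeatable
    critical = NormalDist().inv_cdf(1 - alpha / 2)  # two-tailed critical value
    rejections = 0
    for _ in range(runs):
        # Both groups draw conversions from the same underlying rate.
        conv_a = sum(rng.random() < rate for _ in range(n_users))
        conv_b = sum(rng.random() < rate for _ in range(n_users))
        pooled = (conv_a + conv_b) / (2 * n_users)
        std_error = math.sqrt(pooled * (1 - pooled) * 2 / n_users)
        if std_error > 0 and abs(conv_b - conv_a) / n_users / std_error > critical:
            rejections += 1
    return rejections / runs


false_positive_rate = aa_false_positive_rate()
print(f"false positive rate = {false_positive_rate:.3f}")
```

A rate far from alpha in a real A-A test would point to a problem in assignment, logging, or the variance estimate rather than a real effect.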

Beyond A-B test

• Overlapping Experiment Infrastructure: More, Better, Faster

Reference

• [1] Jesse Farmer. Statistical Analysis and A/B Testing

• [2] Ron Kohavi. Controlled Experiments on the Web: Survey and Practical Guide

• [3] Ron Kohavi. Seven Rules of Thumb for Web Site Experimenters. KDD 2014

• [4] Diane Tang. Overlapping Experiment Infrastructure: More, Better, Faster Experimentation. KDD 2010

• [5] Charles DiMaggio. Power Tools for Epidemiologists. 2014

• [6] Gerald van Belle. Statistical Rules of Thumb

RTX: garfieldwang

mail: [email protected]

Thanks
