Skip to content

Latest commit

 

History

History
30 lines (12 loc) · 1.71 KB

README.md

File metadata and controls

30 lines (12 loc) · 1.71 KB

Fundraising Modeling Task

This project contains code and instructions for completing the Wikimedia Foundation's task for new Research Scientists.

To complete the task, clone this repository, edit task.R to implement an improved test function and submit a pull request to this repositiory or email your solution to [email protected].

The Problem

The Fundraising team at the Wikimedia Foundation needs to be able to accurately model the donated dollars per banner impression in order to choose the banners that will encourage the highest value. In other words, the team would like to maximize the number of $ per impression.

But modeling the number of dollars per impression is hard. The vast majority of impressions (~99.9%) do not lead to a donation. Due to this strong skew in the data, a simple T test won't serve the teams needs (as is demonstrated in task.R).

Histogram of donation amounts

histogram

Histogram (log scaled) of donation amounts

log-scaled histogram

The Task

Your job is to write an improved test. A skeletion of the function my_test() is provided in task.R. You're also provided with run_test_suite(), a function that when provided a set of test functions will simulate sample draws from a population and apply your test function. The test suite will determine how often your 95% confidence intervals correspond to reality and report the results of the test.