Statistics Project

Problem: Consider one way of modeling a problem. Say you have a device and in any given year, there is a 1 in 6 chance that the device will fail. One question is, “On average, how long will it be before such a device fails?” This sort of problem is what will be modeled and analyzed here.

The problem above can be modeled as follows: Roll a normal fair six-sided die. Rolling a 1 will count as “fail”. Anything else will count as “not-fail”. The random variable we will use will be

 = # of rolls required until a 1 is rolled = “time until failure”

Picture 2This is a random variable, so in each experiment (for each device), this will produce a value. For example, if we roll                                   , then  = 5 (we count the final roll). 

Part 1 [Excel] (Empirical Analysis): Simulate 20 repetitions of obtaining a value for , that is, roll until a 1 is rolled, record the result and repeat this 20 times. For example, if you roll   , then record 4 since 4 rolls were required. You can complete this using a die or simulating using Excel, Python, or some other option, e.g., Random.org.

  • Record your rolls and counts.
  • Compute the mean and standard deviation of your values for .
  • Given the modeling problem we started with, interpret the mean and standard deviation you found in terms of how many years are expected before the device fails.

Excel might be used for this part as it makes the calculations, recording of data, etc. very simple. However, it is not required.

Part 2 [Excel] (Theoretical Analysis – Approximations): Compute approximations to expected value, variance, and standard deviation of the random variable . See the additional notes for discussion on these computations, in brief:

  •  = the probability of rolling something other than a 1 )-times, then rolling a 1 on the th throw. Make a table for the first 20 values:
i1220Totals

Use this table to compute

  • , the probability that you get the first 1 in at most 20 rolls.
    • An approximation of   using the first 20 values for , namely                .
    • An approximation to the variance of  using the first 20 terms. For this, use  and then use this and the above to get a rough approximation of .
    • Use the preceding to approximate the standard deviation  using the first 20 terms.

Again, using Excel for this will simplify what needs to be done.

  • Repeat the third item of Part 1:  Given the modeling problem we started with, interpret  and   in terms of how many years are expected before the device fails.

Part 3 [Written] (Theoretical Analysis – Exact Computations – (Optional)): Compute the expected value, variance, and standard deviation of the random variable . See the additional notes for discussion on these computations, in brief:

  • (Expected Value)  
  • (Variance)  
  • (Standard Deviation)  

Hint: For computing these without directly manipulating the infinite summations that appear in the definitions of  and , let  be the event that a 1 is rolled on the first throw and  be the complementary event, namely, that a 1 is not rolled on the first throw. It is clear that  since what is rolled after the first roll is just like starting over. It is also clear that . The Law of Total Expectation (see notes) gives:

This makes it quite simple to find .

A similar “trick” can be used to find , here you will use .   Again, the Law of Total Expectation gives:

and from this it is simple to compute .