article

Sometimes when choosing a coin (particularly for a coin flip), it may be desirable to determine if the coin is fair – that is, if the probability of obtaining a given side (commonly heads or tails) in the toss is 50%.

Preamble


This article is about how to determine if a coin is fair. There are many methods for doing this and this article describes two methods for doing so.

Both methods described in this article involve performing an experiment (or trial) which involves tossing the coin many times and recording down the result of each and every toss. Based on the mathematical analysis of the result of all the tosses, a decision can then be made on whether the coin could be declared "fair".

  • Posterior probability density function. This method assumes that the number of tosses is fixed and is not under the direct control of the experimenter. The method requires the probability density function (PDF) of the true value of obtaining a particular side be derived and the probability of a "fair coin" be obtained by integrating the PDF by the relevant interval. Based on result, a decision can then be made whether to proclaim the coin as a fair coin.

  • Estimator of true probability. This method assumes that the experimenter can decide and implement any number of coin tosses for the particular coin. The experimenter decides on what level of confidence they require and how large the margin of error they are willing to tolerate. This then determines the minimum number of tosses that needs to be perform in the experiment.

Posterior probability density function


One way of verifying this is to calculate the posterior probability density function of Bayesian probability theory.

A test is performed by tossing the coin n times and noting the number of heads h and tails t:

H = h (Total number of heads is h)
T = t (Total number of tails is t)
N = n = h + t (Total number of tosses is n)

Next, let r be the actual probability of obtaining heads in a single toss of the coin. This is the value desired. Using Bayes' theorem, posterior probability of r conditional on H and T is expressed as follows:

f(r | H=h, T=t) =
\frac {\Pr(H=h | r, N=h+t) \, f(r)} {\int_0^1 \Pr(H=h |r, N=h+t) \, f(r) \, dr}. \!

The prior summarizes what is known about the distribution of r in the absence of any observation. We could assume (but it would be rather ridiculous) that the prior distribution of r is uniform over the interval 1. That is, f(r) = 1. In fact, we ought to use a prior distribution that reflects our experience with real coins.

The probability of obtaining h heads in n tosses of a coin with a probability of heads equal to r is given by a binomial distribution:

\Pr(H=h | r, N=h+t) = {h+t \choose h} \, r^h \, (1-r)^t. \!

Putting it all together:

f(r | H=h, T=t) = \frac| < E where p\,\! is the estimated probability of obtaining head. Note: p_{\mathrm{actual}}\,\! is the same actual probability (for obtaining head) as the term r\,\! of the previous section in this article.

  • In statistics, the estimate of a proportion of a sample (denoted by p) has a standard error (standard deviation of error) given by:

s_p = \sqrt{ \frac {p \, (1-p) } {n} }

This standard error s_p will have a maximum theoretical value if p = (1-p) = 0.5.

Hence , assuming the worse case , p is set to 0.5 to get the maximum possible value of s_p.

s_p\,\! = \sqrt{ \frac {p \, (1-p) } {n} } = \sqrt{ \frac {0.5 \times 0.5 } {n} }
= \sqrt{ \frac { 1 } {4 \, n} } = \frac {1}{2 \, \sqrt{n}}

And hence the value of maximum error (E) is given by

E\,\! = Z \, s_p = \frac {Z}{2 \, \sqrt{n}}

Therefore, the final formula for the number of coin tosses for the estimator p\,\! is

E = \frac {Z}{2 \, \sqrt{n}} \quad \quad \mbox{or} \quad \quad n = \frac {Z^2} {4 \, E^2} \!

provided that n \cdot p \ge 5 and n \cdot q \ge 5 where q = (1-p)\, to satisfy the Central Limit Theorem.

Example

1. If a maximum error of 0.01 is desired, how many times should the coin be tossed?

n = \frac {Z^2} {4 \, E^2} = \frac {Z^2} {4 \times 0.01^2} = 2500 \ Z^2

n = 2500\, at 68.27% level of confidence (Z=1)
n = 10000\, at 95.45% level of confidence (Z=2)
n = 27225\, at 99.90% level of confidence (Z=3.3)

2. If the coin is tossed 10000 times, what is the maximum error of the estimator p\,\! on the value of r\,\! (the actual probability of obtaining head in a coin toss)?

E = \frac {Z}{ 2 \, \sqrt{n} }
E = \frac {Z}{ 2 \, \sqrt{ 10000 } } = \frac {Z}{ 200 }
E = 0.0050\, at 68.27% level of confidence (Z=1)
E = 0.0100\, at 95.45% level of confidence (Z=2)
E = 0.0165\, at 99.90% level of confidence (Z=3.3)

3. The coin is tossed 12000 times with a result of 5961 heads (and 6039 tails). What interval does the value of r\,\! (the true probability of obtaining head) lie within if a confidence level of 99.999% is desired?

p = \frac{h}{h+t} \, = \frac{5961}{12000} \, = 0.4968
{| border="0" cellpadding="5" cellspacing="0" align="none" Now find the value of Z corresponding to 99.999% level of confidence.
Z = 4.4172 \,\!
{| border="0" cellpadding="5" cellspacing="0" align="none" Now calculate E
E = \frac{Z}{2 \, \sqrt{n}} \, = \frac{4.4172}{2 \, \sqrt{12000}} \, = 0.0202
{| border="0" cellpadding="5" cellspacing="0" align="none" The interval which contains r is thus:
p - E < r < p + E \,\!

0.4766 < r < 0.5169 \,\!

Hence, 99.999% of the time, the interval above would contain r\,\! which is the true value of obtaining head in a single toss.

Other applications


The above mathematical analysis for determining if a coin is fair can also be applied to other uses. For example:

  • Determining the product defective rates of a product when subjected to a particular (but well defined) condition. Sometimes a product can be very difficult or expensive to produce. Furthermore if testing such products will result in their destruction, a minimum amount of products should be tested. Using the same analysis the probability density function of the product defect rate can be found.

  • Two party polling. If a small random sample poll is taken where the there are only two mutually exclusive choices, then this is equivalent to tossing a single coin multiple times using a bias coin. The same analysis can therefore be applied to determine actual voting ratio.

  • Finding the proportion of females in an animal group. Determining the gender ratio in a large group of an animal species. Provided that a very small random sample is taken when performing the random sampling of the population, the analysis is similar to determining the probability of obtaining heads in a coin toss.

See also


External links


Probability theoryStatistics

 

This article is licensed under the GNU Free Documentation License. It uses material from the "Checking if a coin is fair".

Home Pageartsbusinesscomputersgameshealthhospitalshomekids & teensnewsphysiciansrecreationreferenceregionalscienceshoppingsocietysportsworld