Data & Statistics

Complex Coin-Toss

September 17, 2023

Here is a game. If you win the game, you get a dollar; else, you lose one. What is the probability of winning the game?

The game involves a fair coin and two urns.
Urn 1: 3 red balls; 1 blue ball.
Urn 2: 1 red ball; 3 blue balls.
You toss the coin first. If heads, you draw a ball from urn 1 and if tails, urn 2. Drawing a red ball wins the game.

The marginal probability of getting a head is 1/2, and the conditional probability of drawing a red ball from Urn 1 is 3/4. Therefore, the joint probability of getting heads and a red ball from Urn 1 is (1/2)x(3/4) = (3/8). Similarly, the joint probability of getting tails and a red ball from Urn 2 is (1/2)x(1/4) = (1/8). The overall probability of drawing a red is

(3/8) + (1/8) = (4/8) = (1/2), same as flipping a coin.
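The same calculation can be checked numerically. The following is a minimal sketch, not part of the original post: the variable names, the seed and the number of simulated games are all assumptions chosen for illustration.

```r
# Exact win probability via the law of total probability
p_heads    <- 1 / 2
p_red_urn1 <- 3 / 4   # urn 1: 3 red, 1 blue
p_red_urn2 <- 1 / 4   # urn 2: 1 red, 3 blue
p_win <- p_heads * p_red_urn1 + (1 - p_heads) * p_red_urn2

# Monte Carlo check (seed and sample size are arbitrary choices)
set.seed(1)
n_games <- 100000
heads   <- runif(n_games) < p_heads                 # coin toss per game
p_red   <- ifelse(heads, p_red_urn1, p_red_urn2)    # urn chosen by the coin
sim_win <- mean(runif(n_games) < p_red)             # share of games won

p_win
sim_win
```

The simulated share should land close to the exact value of 1/2, with small run-to-run noise.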


Program Crashes

September 15, 2023

A company has bought three software packages for its operations. They are Abacus, Biscuit and Circuit. On average, Abacus crashes 1 in 200 times, Biscuit 1 in 10 times, and Circuit 1 in 50. Of the ten employees, two were assigned Abacus, five got Biscuit, and three received Circuit. If Sophia’s program crashed on the first trial, what is the probability that she got Abacus?

Use Bayes’ equation to get the answer:

$\\ P(A|CR) = \frac{P(CR|A) * P(A)}{P(CR|A) * P(A) + P(CR|B) * P(B) + P(CR|C) * P(C)} \\ \\ = \frac{(1/200) * (0.2)}{(1/200) * (0.2) + (1/10) * (0.5) + (1/50) * (0.3)} = 0.01754$
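As a quick check of the arithmetic, here is a short sketch of the same Bayes calculation for all three packages at once. The vector names `prior`, `crash` and `posterior` are illustrative and not from the original post.

```r
# Prior: share of employees assigned each package
prior <- c(Abacus = 2, Biscuit = 5, Circuit = 3) / 10

# Likelihood: probability that each package crashes on a trial
crash <- c(Abacus = 1 / 200, Biscuit = 1 / 10, Circuit = 1 / 50)

# Posterior: probability of each package given that a crash was observed
posterior <- prior * crash / sum(prior * crash)
round(posterior, 5)
```

The Abacus entry reproduces the value above, and the three posteriors sum to one.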

Reference

An Introduction to Probability and Inductive Logic by Ian Hacking


Random Walk to -30

September 13, 2023

If a 1-dimensional random walk starts at 0, with steps of one (to the right or left), what is the probability of reaching -30 before reaching 10?

Suppose P₃₀ is the probability of reaching -30 first, and (1−P₃₀) is the probability of reaching 10 first.

Let X be the position on the x-axis at the end of this game
E[X] = -30 x P₃₀ + 10 x (1-P₃₀)
For a random walk with equally likely steps (+1 or -1), the expected position stays at the starting point, so E[X] = 0.
0 = -30 x P₃₀ + 10 x (1-P₃₀)
-10 = P₃₀(-30 -10)
P₃₀ = 1/4 = 0.25 = 25%
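The result can be cross-checked by simulation. This is a rough sketch under assumed settings: the function name `walk_hits_lower`, the seed and the number of walks are illustrative choices, not from the original post.

```r
set.seed(42)

# Run one walk from 0 until it reaches either barrier;
# return TRUE if the lower barrier is reached first.
walk_hits_lower <- function(lower = -30, upper = 10) {
  pos <- 0
  while (pos > lower && pos < upper) {
    step <- if (runif(1) < 0.5) -1 else 1   # equally likely left or right
    pos <- pos + step
  }
  pos == lower
}

n_walks <- 2000
p_lower <- mean(replicate(n_walks, walk_hits_lower()))
p_lower
```

With only a few thousand walks the estimate is noisy, but it should fall near 0.25. Solving the same expectation equation for general barriers -a and b gives b/(a + b), which is 10/40 here.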


Random Sampling

September 12, 2023

We know the ‘sample’ function creates a random sample of elements from a vector. But if you want to get a random sample between two limits, ‘runif’ is the function. Here is a plot of 1000 samples between 0 and 1.

runif(1000, min = 0, max = 1)

Now, here is a question. If A and B are two random points between 0 & 1, what is the probability A / B lies between 1 and 2?

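itr <- 1000000

# Each trial draws A and B uniformly from (0, 1) and records 1 if 1 <= A/B <= 2
toss <- replicate(itr, {
  sa_A <- runif(1)
  sa_B <- runif(1)
  sam <- sa_A / sa_B

  if (sam >= 1 && sam <= 2) 1 else 0
})

# Fraction of trials in which the ratio fell between 1 and 2
mean(toss)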

0.25

Here is a graphical representation, with A on the X-axis and B on the Y-axis. X/Y between 1 and 2 corresponds to the area between the two lines X/Y = 1 (Y = X) and X/Y = 2 (Y = X/2) inside the unit square.

The area between the two lines = 1 x 1 – (1/2) x 1 x 1 – (1/2) x 1 x (1/2) = 1 – 0.5 – 0.25 = 0.25.


Dice Polynomial – More Types

September 11, 2023

We have seen how dice values are expressed as polynomials and how, in the product, the exponents become the sums and the coefficients become the number of ways of obtaining each sum. Let’s extend this further and use dice rolling as a technique to estimate the product of polynomials.

(x + x + x^3 + x^4 + x^6 + x^6) * (x + x^2 + x^3 + x^3 + x^5 + x^6)

This is equivalent to two six-sided dice with the following numbers
dice 1: [1, 1, 3, 4, 6, 6]
dice 2: [1, 2, 3, 3, 5, 6]

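Throw them a million times, estimate the probability of each sum and convert the probabilities into whole numbers by multiplying them by the number of equally likely face combinations (here, 6 x 6 = 36).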

dice_1 <- c(1, 1, 3, 4, 6, 6)
dice_2 <- c(1, 2, 3, 3, 5, 6)
prob_1 <- rep(1/6,6)
prob_2 <- rep(1/6,6)

itr <- 1000000

toss <- replicate(itr, {
sam1 <- sample(dice_1, 1, prob = prob_1, replace = TRUE)
sam2 <- sample(dice_2, 1, prob = prob_2, replace = TRUE)
sam <- sam1 + sam2
})
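The conversion from simulated frequencies to whole-number coefficients is not shown in the post. One way it might be done is sketched below, with vectorised rolls for speed; the names `n_rolls`, `sums` and `coefs`, the seed and the smaller number of rolls are assumptions.

```r
set.seed(2023)
dice_1 <- c(1, 1, 3, 4, 6, 6)
dice_2 <- c(1, 2, 3, 3, 5, 6)
n_rolls <- 100000

# Roll both dice n_rolls times and add the faces
sums <- sample(dice_1, n_rolls, replace = TRUE) +
  sample(dice_2, n_rolls, replace = TRUE)

# Relative frequency of each sum, scaled by the number of face pairs (6 x 6)
coefs <- round(table(sums) / n_rolls * length(dice_1) * length(dice_2))
coefs
```

The names of `coefs` are the exponents and its values are the estimated coefficients of the product.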

Let’s write down what we see above:

2x² + 2x³ + 5x⁴ + 2x⁵ + 5x⁶+ 6x⁷ + 3x⁸ + 6x⁹ + x¹⁰ + 2x¹¹ + 2x¹²

Compute the product manually (or run it through the ‘Wolfram’ calculator):
2 x^2 + 2 x^3 + 5 x^4 + 2 x^5 + 5 x^6 + 6 x^7 + 3 x^8 + 6 x^9 + x^10 + 2 x^11 + 2 x^12
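The exact coefficients can also be obtained in R without simulation, by counting how often each sum occurs over all pairs of faces. The helper name `poly_product` below is hypothetical and only a sketch of this counting idea.

```r
# Count the ways each sum can occur over all pairs of faces.
# The names of the result are the exponents; the values are the coefficients.
poly_product <- function(dice_1, dice_2) {
  sums <- as.vector(outer(dice_1, dice_2, "+"))
  table(sums)
}

coef_1 <- poly_product(c(1, 1, 3, 4, 6, 6), c(1, 2, 3, 3, 5, 6))
coef_1
```

Because every face is equally likely, each coefficient divided by the total number of face pairs is the probability of that sum, which is why the scaled simulation recovers the same numbers.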

(x + x + x^3 + x^4 )*(x + x^2 + x^3 + x^3 + x^5 + x^6)

dice_1 <- c(1, 1, 3, 4)
dice_2 <- c(1, 2, 3, 3, 5, 6)
prob_1 <- rep(1/4,4)
prob_2 <- rep(1/6,6)

itr <- 1000000

toss <- replicate(itr, {
sam1 <- sample(dice_1, 1, prob = prob_1, replace = TRUE)
sam2 <- sample(dice_2, 1, prob = prob_2, replace = TRUE)
sam <- sam1 + sam2
})

2x² + 2x³ + 5x⁴ + 2x⁵ + 5x⁶ + 4x⁷ + x⁸ + 2x⁹ + x¹⁰
And the manual calculation gives:
2 x^2 + 2 x^3 + 5 x^4 + 2 x^5 + 5 x^6 + 4 x^7 + x^8 + 2 x^9 + x^10

(x + x + x³ + x⁴ )*(x + x² + x³ + x³)

dice_1 <- c(1, 1, 3, 4)
dice_2 <- c(1, 2, 3, 3)
prob_1 <- rep(1/4,4)
prob_2 <- rep(1/4,4)

itr <- 1000000

toss <- replicate(itr, {
sam1 <- sample(dice_1, 1, prob = prob_1, replace = TRUE)
sam2 <- sample(dice_2, 1, prob = prob_2, replace = TRUE)
sam <- sam1 + sam2
})

2 x² + 2 x³ + 5 x⁴ + 2 x⁵ + 3 x⁶ + 2 x⁷

Online Factoring Calculator: Wolfram


Dice Polynomial – Sicherman Dice

September 10, 2023

We have seen how one can describe a die with a polynomial. A well-known example is the roll of two (regular) dice. The distribution of the sum of the two dice is given by:

f(x) x f(x)= (x⁶ + x⁵ + x⁴ + x³ + x² + x¹) (x⁶ + x⁵ + x⁴ + x³ + x² + x¹)

x¹²+ 2x¹¹ + 3x¹⁰ + 4x⁹+ 5x⁸ + 6x⁷ + 5x⁶+ 4x⁵ + 3x⁴ + 2x³+ x²

Here, the exponents of x are the possible sums (the X-values), and the coefficients are the number of ways of obtaining each sum out of 36 (the Y-values).

Now, a question arises: Can we find another pair of dice with the same distribution for the sums? One way to find out is to factorise the polynomial, x¹² + 2x¹¹ + 3x¹⁰ + 4x⁹ + 5x⁸ + 6x⁷ + 5x⁶ + 4x⁵ + 3x⁴ + 2x³ + x². George Sicherman discovered that another pair of dice leads to the same outcome. They are:

f(x) x g(x)= (x⁴ + x³ + x³ + x² + x² + x¹) (x⁸ + x⁶ + x⁵ + x⁴ + x³ + x¹)

They represent two cubes with the following numbering.
Cube 1: 1, 2, 2, 3, 3, 4
Cube 2: 1, 3, 4, 5, 6, 8

Let’s roll these dice a million times and find out.

dice_1 <- c(1, 2, 2, 3, 3, 4)
dice_2 <- c(1, 3, 4, 5, 6, 8)
prob_1 <- rep(1/6,6)
prob_2 <- rep(1/6,6)

itr <- 1000000

toss <- replicate(itr, {
sam1 <- sample(dice_1, 1, prob = prob_1, replace = TRUE)
sam2 <- sample(dice_2, 1, prob = prob_2, replace = TRUE)
sam <- sam1 + sam2
})

Here is the comparison of a pair of Sicherman dice with the regular.
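Besides the simulation, the two pairs can be compared exactly by counting sums over all 36 face pairs. The helper `sum_counts` is a hypothetical name used only for this sketch.

```r
# Number of ways to obtain each sum with a given pair of dice
sum_counts <- function(dice_1, dice_2) {
  table(as.vector(outer(dice_1, dice_2, "+")))
}

regular   <- sum_counts(1:6, 1:6)
sicherman <- sum_counts(c(1, 2, 2, 3, 3, 4), c(1, 3, 4, 5, 6, 8))

# Side-by-side counts for sums 2 to 12
rbind(regular, sicherman)
```

Both rows read 1, 2, 3, 4, 5, 6, 5, 4, 3, 2, 1, matching the coefficients of the polynomial above.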


Derangements

September 8, 2023

If n letters are placed randomly into n envelopes (with address), what is the expected number of envelopes with the correct letter inside?

Before addressing that, let’s look at a derangement problem. A derangement is an arrangement in which no item ends up in its own place, so the question becomes the probability of no match. For n items, it is the number of derangements divided by the number of permutations.

!n/n! ≈ (n!/e)/n! = 1/e ≈ 0.37
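To see how quickly the ratio approaches 1/e, one can evaluate the standard inclusion-exclusion series !n/n! = Σ (-1)^k / k! for k = 0 to n. This series and the function name `derangement_prob` are not from the original post, so treat the sketch as an illustration.

```r
# Probability that a random permutation of n items has no fixed point,
# computed from the inclusion-exclusion series sum_{k=0}^{n} (-1)^k / k!
derangement_prob <- function(n) {
  k <- 0:n
  sum((-1)^k / factorial(k))
}

sapply(c(2, 3, 4, 10, 100), derangement_prob)
exp(-1)
```

Even for n = 10 the value already agrees with 1/e to several decimal places.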

Let’s do a Monte Carlo and see what we get

itr <- 100000

let_env <- replicate(itr, {
  n <- 100

  env <- seq_len(n)   # envelope i expects letter i
  let <- sample(n)    # random assignment of letters to envelopes

  # Count the envelopes that hold the correct letter
  counter <- sum(env == let)

  # A derangement has no match at all
  if (counter == 0) 1 else 0
})

mean(let_env)

0.36827

So what about the original question of the expected number?

itr <- 100000

let_env <- replicate(itr, {
  n <- 100

  env <- seq_len(n)   # envelope i expects letter i
  let <- sample(n)    # random assignment of letters to envelopes

  # Number of envelopes holding the correct letter
  sum(env == let)
})

mean(let_env)

 1.00014
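The simulated value of about 1 agrees with a short argument based on the linearity of expectation. The indicator notation is introduced here for illustration and is not from the original post. Let I_i be 1 if envelope i holds its own letter and 0 otherwise. Each letter is equally likely to land in any of the n envelopes, so P(I_i = 1) = 1/n, and

$E\left[\sum\limits_{i=1}^{n} I_i\right] = \sum\limits_{i=1}^{n} E[I_i] = n \times \frac{1}{n} = 1$

The expected number of matches is therefore exactly 1 for every n.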


Entropy and Information

September 7, 2023

We have seen how the entropy of a system is derived as the surprise element of a system. The higher the entropy, the higher the surprise, ignorance or the degree of disorder of the system.

As an extreme example, the entropy of a double-headed coin is zero: it always lands on heads, so a toss carries no information.

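$\\ H = \sum\limits_{x=0}^{n} p(x) \log_2\left[\frac{1}{p(x)}\right] \\\\ = 1 \times \log_2\left[\frac{1}{1}\right] + 0 \times \log_2\left[\frac{1}{0}\right] = 0$

The second term is taken as zero by the usual convention that an impossible outcome contributes nothing, since p log₂(1/p) tends to 0 as p tends to 0.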

On the other hand, a fair coin (50-50) produces a non-zero entropy. The full spectrum of entropy for a coin toss is:
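The curve can be tabulated with a few lines of R. This is a sketch only: the function name `binary_entropy` and the grid of probabilities are assumptions, and the end points are set to zero by the convention that an impossible outcome contributes no entropy.

```r
# Entropy (in bits) of a coin that shows heads with probability p
binary_entropy <- function(p) {
  h <- -(p * log2(p) + (1 - p) * log2(1 - p))
  h[p == 0 | p == 1] <- 0   # convention: 0 * log2(1/0) = 0
  h
}

p <- seq(0, 1, by = 0.1)
data.frame(p = p, entropy = round(binary_entropy(p), 3))
```

The entropy is zero at p = 0 and p = 1, rises symmetrically, and peaks at 1 bit for the fair coin (p = 0.5). Passing a finer grid to `plot()` draws the full curve.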


The Surprising Story of Entropy

September 6, 2023

Entropy is a concept in data science that helps in building classification trees. The concept of entropy is often explained as an element of ‘surprise’. Let’s understand why.

Suppose there is a coin that lands on heads nine times out of ten, i.e., the probability of heads, p(H) = 0.9. So, if one tosses the coin and gets heads, it is less of a surprise, as we expect this outcome more often, whereas when it shows tails, it is more surprising. In other words, surprise is somewhat an inverse of the probability, i.e. S = 1/p. But that has a problem.

If the probability of something is 1 (100% certain), 1/p becomes 1/1 = 1. Since we know the chance of that outcome is 100%, it should not be a surprise at all, but we get 1. To avoid that situation, S is defined as log (1/p).
p = 1; S = log (1/1) = 0.
On the other hand,
p = 0; S = log(1/0) = log(1) – log(0) = undefined.

It is common practice to use log base 2 for calculating surprise when there are two possible outcomes.

Surprise = log₂(1 / Probability)

Now, let’s return to the coin with a 0.9 chance of showing heads. The surprise for getting heads is log₂(1/0.9) = 0.15, and for tails it is log₂(1/0.1) = 3.32. As expected, the surprise of getting the rarer outcome (tails) is larger.

If the coin is flipped 100 times, the expected number of heads = 100 x 0.9 and the expected number of tails = 100 x 0.1.
The total surprise of heads = 100 x 0.9 x 0.15
The total surprise of tails = 100 x 0.1 x 3.32
The total surprise = 100 x 0.9 x 0.15 + 100 x 0.1 x 3.32
The total surprise per flip = (100 x 0.9 x 0.15 + 100 x 0.1 x 3.32)/100 = 0.9 x 0.15 + 0.1 x 3.32 = 0.47

This is entropy – the expected value of the surprise.
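The arithmetic above can be reproduced in a few lines. The vector names `p`, `surprise` and `entropy` are illustrative choices, not from the original post.

```r
# Probabilities of the two outcomes of the biased coin
p <- c(heads = 0.9, tails = 0.1)

# Surprise of each outcome, in bits
surprise <- log2(1 / p)
round(surprise, 2)

# Entropy: the probability-weighted average surprise
entropy <- sum(p * surprise)
round(entropy, 2)
```

Weighting each surprise by its probability is exactly the "total surprise per flip" computed above.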


Climate Change – Pew Research Survey

September 5, 2023

Motivated reasoning is the tendency to favour conclusions we want to believe despite substantial evidence to the contrary. A famous example is climate change. In the US, for example, Democrats and Republicans disagree on the scientific consensus. A recent Pew Research survey on climate change presents the magnitude of this divide.

Prioritise alternative energy

Overall, 67% of people support this view, which is pretty impressive. But that breaks down into 90% of Democrats (and Democrat-leaning) and 42% of Republicans (and Republican-leaning). The only silver lining is that 67% of Republicans under age 30 support alternative energy developments.

Climate change – a major threat to the well-being

Here again, the difference between the two parties is stark. In the last 13 years, the share of Democrats acknowledging climate change as a major threat has steadily increased from 61% to 78%. It has remained steady and low for the Republicans, at 25% in 2010 and 23% in 2022.
Interestingly, 81% of French and 73% of Germans regard it as a threat.

Americans’ views of climate change: Pew
