The problem is to determine the optimal strategy of either switching doors or not switching doors in each round starting from the initial pick from one of the original M doors. Also note that Monty is not allowed to switch which door has the new car behind it.
The key here is to have a probability that the door one has picked has the car behind it is a far from 1/2 as possible going into the two-door round. The further this probability is from 1/2, the more confident one can be in either switching one's pick or keeping one's pick as appropriate to win the car.
My proof of optimal strategy is as follows. Suppose that you go into round N+1 (here N>=2) with a probability x of having picked a door with a goat behind it.
- If you don't switch your door, Monty opens a door with a goat behind it and you go into round N with a probability x of having picked a door with a goat behind it.
- If you switch your door, a fraction
- (N-1)/N of people who picked doors with goats behind them switch to another door with a goat behind it;
- 1/N of people who picked doors with goats behind them switch to the door with the car behind it;
- all people who picked the door with the car behind it switch to a door with a goat behind it.
If you switch doors, you go into round N with a probability x*((N-1)/N)+1-x = (N-x)/N of having picked a door with a goat behind it.
Now (N-x)/N can equal 1/2 if N=2 and x=1. That is, if you were absolutely guarenteed that you picked the goat going into the 3 door game, then switching doors would reduce your chance of winning the goat to 1/2. In this case, the optimal strategy is to not switch doors, let Monty open one of the two remaining doors with goats behind them, and then switch doors to win the car with absolute certainty. In any other case other than N=2 and x=1, since x is at most 1, it follows that (N-x)/N is larger than 1/2.
Under what conditions is (N-x)/N closer to 1/2 than x? Start with the inequality
x - 1/2 > ((N-x)/N) - 1/2
This inequality is satisifed if
x > N/(N+1)
In other words, if x > N/(N+1), then switching doors in the N+1 round and then never switching doors again until a final switch in the 2-door final round will lead to a reduced probability of winning the car.
Notice also that if one switches in the N+1 round, thus altering the probability that one has picked a door with a goat behind it from x to (N-x)/N, it is also the case that for 1 > x,
(N-x)/N > (N-1)/N > (N-2)/(N-1) > ... > 2/3
Of course, if x=1 going into the N+1 round, then the optimal strategy is to never switch doors until the 2-door final round, thus winning the car with absolute certainty. What this means is that, if you go into the N+1 round with a probability 1 > x > N/(N+1) of having picked a door with a goat behind it, then switch doors in the N+1 round, then no matter how many rounds you refuse to switch doors before the two door finale, the next switch of doors still further hurts your chance of winning the car. This same argument can be repeated as many times as necessary to show that, as long as one starts with 1 > x > N/(N+1) going into the N+1 round, every time you switch doors before the 2-door final round, no matter when, you hurt your chance of winning the car.
So the conclusion is that, if you go into the N+1 round with N>=2 with x > N/(N+1), every switch of the doors before the 2-door final round, no matter when, will always hurt your chance of winning the car. Notice that your probability of having picked a goat is always going to be at least 2/3, so you will always want to switch doors in the final 2-door round.
Now, one starts the game by picking from M doors with M>=3, then allowing Monty to open a door to reveal the goat behind it. The probability of picking a door with a goat out of a selection of M doors with M-1 doors having goats behind them is x = (M-1)/M. You're forced to allow Monty to open at at least one door before your first switch. So you go into the M-1 round with a probability x = (M-1)/M of having picked a door with a goat behind it. Since
(M-1)/M > (M-2)/(M-1)
it follows that switching doors in the M-1 round or any subsequent round hurts your chances of winning, no matter how many rounds you wait and refuse to switch doors. So the optimal strategy must be to never switch any doors until reaching the two round finale, and then and only then switching doors. This is consistent with all of the special cases above as well, so this is the optimal strategy.
No comments:
Post a Comment