Nate Silver and his fivethirtyeight site has achieved significant notoriety for developing a system to carefully aggregate election polls to create well-calibrated statistical forecasts of outcome elections; his site publishes daily updates to predictions for primary and general elections in House, Senate and Presidential races.
Prediction markets have offered an alternative to poll aggregation in forecasting elections. Markets such as (the now defunct) InTrade, the Iowa Electronic Markets, PredictIt, and others ask users to buy and sell shares assigned to each candidate in each race, so that the price point corresponds to the probability of victory. In this question we focus on PredictIt, which allows users to place relatively small real-money bets on candidates.
Both fivethirtyeight.com and PredictIt have published probabilities for each of the 14 Super Tuesday Primaries on the Democratic side. (The races are California, Texas, North Carolina, Virginia, Massachusetts, Minnesota, Colorado, Tennessee, Alabama, Oklahoma, Arkansas, Utah, Maine, Vermont.)
Which forecasts will prove to be more accurate?
To compare, we will score each set of predictions using a Brier score averaged over all races, computed as where j enumerates the possible outcomes (i.e. possible winners) in the ith race out of N, where is the forecast probability of candidate j winning the ith race, and is assigned 1 if candidate j wins the ith race, and 0 otherwise.
For example, if PredictIt assigned 52% to Sanders and 48% to Warren in the New York Democratic Primary and if Sanders won then PredictIt would achieve a Brier Score of A lower Brier score is better, with perfect predictions corresponding to . (In the case where PredictIt's prices do not add up to $1, we will normalize them to $1 to convert to probabilities.)
This question resolves positively if the Brier score for the 14 races is lower for PredictIt's probabilities than for fivethirtyeight.com's probabilities, where we will take values as of noon EST on 3/2/2020, and election outcomes as reported on 3/3-3/4.