a frustratingly well-paywalled, yet exhaustive, complete and informative overview of the IARPA's FOCUS tournament
Since you quote from a section that is behind the paywall, I assume you have access to the article. If so, could you make it available? Or just send it to me (name@surname.com) and I'll upload it to my site and post a link to it here and on LW. Thanks!
I got sent it to me by the author of the article with the explicit request not to do that. I tried to check whether I could access it through any of my usual methods (disabling javascript, looking in the internet archive, using various extensions etc.), but realized I couldn't.
I thought about not adding it to the newsletter at all, but realized that in this case, I actually respect their monetization model, and I liked the piece. In particular, this piece doesn't seem particularly clickbaity, a la SSC's Problems With Paywalls; instead it's a pretty good and lengthy feature article which took someone maybe a week (?) to write (the pdf version of the article is 16 pages). In contrast, other non-paywalled news media (I'm thinking of Forbes here) sometimes/usually cover forecasting questions so, so terribly.
So that's my starting point. If you or other readers prefer not to see this kind of thing, I'm all ears.
Highlights
Index
Sign up here or browse past newsletters here.
Prediction Markets & Forecasting Platforms
Metaculus updated their track record page. You can now look at accuracy across time, at the distribution of brier scores, and a calibration graph. They also have a new black swan question: When will US metaculus users face an emigration crisis?.
Good Judgement Open has a thread in which forecasters share and discuss tips, tricks and experiences. An account is needed to browse it.
Augur modifications in response to higher ETH prices. Some unfiltered comments on reddit
An overview of PlotX, a new decentralized prediction protocol/marketplace. PlotX focuses on non-subjective markets that can be programmatically determined, like the exchange rate between currencies or tokens.
A Replication Markets participant wrote What's Wrong with Social Science and How to Fix It: Reflections After Reading 2578 Papers. See also: An old long-form introduction to Replication Markets.
Georgetown's CSET is attempting to use forecasting to influence policy. A seminar discussing their approach Using Crowd Forecasting to Inform Policy with Jason Matheny is scheduled for the 19th of October. But their current forecasting tournament, foretell, isn't yet very well populated, and the aggregate isn't that good because participants don't update all that often, leading to sometimes clearly outdated aggregates. Perhaps because of this relative lack of competition, my team is in 2nd place at the time of this writting (with myself at #6, Eli Lifland at #12 and Misha Yagudin at #21). You can join foretell here.
There is a new contest on Hypermind, The Long Fork Project, which aims to predict the impact of a Trump or a Biden victory in November, with $20k in prize money. H/t to user ChickCounterfly.
The University of Chicago's Effective Altruism group is hosting a forecasting tournament between all interested EA college groups starting October 12th, 2020. More details here
In the News
News media sensationalizes essentially random fluctuations on US election odds caused by big bettors entering prediction markets such as Betfair, where bets on the order of $50k can visibly alter the market price. Simultaneously, polls/models and prediction market odds have diverged, because a substantial fraction of bettors lend credence to the thesis that polls will be biased as in the previous elections, even though polling firms seem to have improved their methods.
Red Cross and Red Crescent societies have been trying out forecast based financing. The idea is to create forecasts and early warning indicators for some negative outcome, such as a flood, using weather forecasts, satellite imagery, climate models, etc, and then release funds automatically if the forecast reaches a given threshold, allowing the funds to be put to work before the disaster happens in a more automatic, fast and efficient manner. Goals and modus operandi might resonate with the Effective Altruism community: > "In the precious window of time between a forecast and a potential disaster, FbF releases resources to take early action. Ultimately, we hope this early action will be more effective at reducing suffering, compared to waiting until the disaster happens and then doing only disaster response. For example, in Bangladesh, people who received a forecast-based cash transfer were less malnourished during a flood in 2017." (bold not mine)
Prediction Markets' Time Has Come, but They Aren't Ready for It. Prediction markets could have been useful for predicting the spread of the pandemic (see: coronainformationmarkets.com), or for informing presidential election consequences (see: Hypermind above), but their relatively small size makes them less informative. Blockchain based prediction technologies, like Augur, Gnosis or Omen could have helped bypass US regulatory hurdles (which ban many kinds of gambling), but the recent increase in transaction fees means that "everything below a $1,000 bet is basically economically unfeasible"
Floods in India and Bangladesh:
The many tribes of 2020 election worriers: An ethnographic report by the Washington Post.
Electricity time series demand and supply forecasting startup raises $8 million. I keep seeing this kind of announcement; doing forecasting well in an underforecasted domain seems to be somewhat profitable right now, and it's not like there is an absence of domains to which forecasting can be applied. This might be a good idea for an earning-to-give startup.
NSF and NASA partner to address space weather research and forecasting. Together, NSF and NASA are investing over $17 million into six, three-year awards, each of which contributes to key research that can expand the nation's space weather prediction capabilities.
In its monthly report, OPEC said it expects the pandemic to reduce demand by 9.5 million barrels a day, forecasting a fall in demand of 9.5% from last year, reports the Wall Street Journal
Some criticism of Gnosis, a decentralized prediction markets startup, by early investors who want to cash out. Here is a blog post by said early investors; they claim that "Gnosis took out what was in effect a 3+ year interest-free loan from token holders and failed to deliver the products laid out in its fundraising whitepaper, quintupled the size of its balance sheet due simply to positive price fluctuations in ETH, and then launched products that accrue value only to Gnosis management."
What a study of video games can tell us about being better decision makers ($), a frustratingly well-paywalled, yet exhaustive, complete and informative overview of the IARPA's FOCUS tournament:
Negative Examples
Why Donald Trump Isn’t A Real Candidate, In One Chart, wrote 538 in 2015.
Travel CFOs Hesitant on Forecasts as Pandemic Fogs Outlook, reports the Wall Street Journal.
Long Content
Andrew Gelman et al. release Information, incentives, and goals in election forecasts.
Neither The Economist's model nor 538's are fully Bayesian. In particular, they are not martingales, that is, their current probability is not the expected value of their future probability.
The Literary Digest Poll of 1936. A poll so bad that it destroyed the magazine.
Compare the Literary Digest and Gallup polls of 1936 with The New York Times's model of 2016 and 538's 2016 forecast, respectively.
Feynman in 1985, answering questions about whether machines will ever be more intelligent than humans.
Why Most Published Research Findings Are False, back from 2005. The abstract reads:
Reference class forecasting. Reference class forecasting or comparison class forecasting is a method of predicting the future by looking at similar past situations and their outcomes. The theories behind reference class forecasting were developed by Daniel Kahneman and Amos Tversky. The theoretical work helped Kahneman win the Nobel Prize in Economics.Reference class forecasting is so named as it predicts the outcome of a planned action based on actual outcomes in a reference class of similar actions to that being forecast.
Reference class problem
The Base Rate Book by Credit Suisse.
Hard To Categorize
Improving decisions with market information: an experiment on corporate prediction markets (sci-hub; archive link)
The nonprofit Ought organized a forecasting thread on existential risk, where participants display and discuss their probability distributions for existential risk, and outline some reflections on a previous forecasting thread on AI timelines.
A draft report on AI timelines, summarized in the comments
Gregory Lewis has a series of posts related to forecasting and uncertainty:
Estimation of probabilities to get tenure track in academia: baseline and publications during the PhD.
How to think about an uncertain future: lessons from other sectors & mistakes of longtermist EAs. The central thesis is:
Discussion by a PredictIt bettor on how he made money by following Nate Silver's predictions, from r/TheMotte.
Also on r/TheMotte, on the promises and deficiencies of prediction markets:
Nate Silver on a small twitter thread on prediction markets: "Most of what makes political prediction markets dumb is that people assume they have expertise about election forecasting because they a) follow politics and b) understand "data" and "markets". Without more specific domain knowledge, though, that combo is a recipe for stupidity."
Deloitte forecasts US holiday season retail sales (but doesn't provide confidence intervals.)
Solar forecast. Sun to leave the quietest part of its cycle, but still remain relatively quiet and not produce world-ending coronal mass ejections, the New York Times reports.
The Foresight Insitute organizes weekly talks; here is one with Samo Burja on long-lived institutions.
Some examples of failed technology predictions.
Last, but not least, Ozzie Gooen on Multivariate estimation & the Squiggly language:
Note to the future: All links are added automatically to the Internet Archive. In case of link rot, go there and input the dead link.