All of VincentYu's Comments + Replies

Thanks for writing such a comprehensive explanation!

Why is downvoting disabled, for how long has it been like this, and when will it be back?

Viliam
110

The original purpose of downvoting was to allow community moderation. Here, "moderation" means two things: (1) Giving higher visibility to high-quality content. This functionality we still have, it's the upvotes. (2) Removing low-quality content. Comments with karma below -5 and their whole subthreads are collapsed by default. This is especially important when some newcomers start spamming LW with a lot of low-quality comments. It happened more frequently in the past when LW was more popular.

And the "community" aspect means that these d... (read more)

0Erfeyah
I am very new here but my impression from reading around is that people were taking advantage of the system by creating multiple accounts and downvoting comments that opposed them in order to appear to be right. I am not sure though.

In support of your point, MIRI itself changed (in the opposite direction) from its former stance on AI research.

You've been around long enough to know this, but for others: The former ambition of MIRI in the early 2000s—back when it was called the SIAI—was to create artificial superintelligence, but that ambition changed to ensuring AI friendliness after considering the "terrible consequences [now] feared by the likes of MIRI".

In the words of Zack_M_Davis 6 years ago:

(Disclaimer: I don't speak for SingInst, nor am I presently affiliated with th

... (read more)
2Lumifer
Or maybe because SIAI realized their ability to actually create an AI is non-existent
  • who on lesswrong tracks their predictions outside of predictionbook, and their thoughts on that method

Just adding to the other responses: I also use Metaculus and like it a lot. In another thread, I posted a rough note about its community's calibration.

Compared to PredictionBook, the major limitation of Metaculus is that users cannot create and predict on arbitrary questions, because questions are curated. This is an inherent limitation/feature for a website like Metaculus because they want the community to focus on a set of questions of general inter... (read more)

Here is the full text article that was actually published by Kahneman et al. (2011) in Harvard Business Review, and here is the figure that was in HBR:

Is there any information on how well-calibrated the community predictions are on Metaculus?

Great question! Yes. There was a post on the official Metaculus blog that addressed this, though this was back in Oct 2016. In the past, they've also sent to subscribed users a few emails that looked at community calibration.

I've actually done my own analysis on this around two months ago, in private communication. Let me just copy two of the plots I created and what I said there. You might want to ignore the plots and details, and just skip to the "brief sum... (read more)

That's some neat data and observation! Could there be other substantial moderating differences between the days when you generate ~900 kJ and the days when you don't? (E.g., does your mental state before you ride affect how much energy you generate? This could suggest a different causal relationship.) If there are, maybe some of these effects can be removed if you independently randomize the energy you generate each time you ride, so that you don't get to choose how much you ride.

To make this a single-blinded experiment, just wear a blindfold; to double blind, add a high-beam lamp to your bike; and to triple blind, equip and direct high beams both front and rear.

… okay, there will be no blinding.

4Richard Korzekwa
This could be the case, or there could be a common cause between the total work I do and my mood for the day. What makes me think this is less likely is that, when I'm following a training plan, the total work for the ride is largely determined days or weeks ahead of time. Then again, I will modify the day's workout on a training plan if I'm feeling shitty. Or it could just be that I noticed the pattern once when it happened by chance, then I expected it to continue, so it did (that is, it's more of a placebo than anything else). Then again, it wouldn't be hard for small effects like this to add up to the observed effect. I actually did think about blinding it. I could modify some existing software to give me an intensity or duration that I don't know ahead of time, and that I don't have in front of me while I'm riding, and I could even not look at what it was until days or weeks later when I'm analyzing the results (or I could get even more hardcore and have someone else analyze it). The problem is that most of the motivation mechanisms I have for actually doing a worthwhile ride indoors require me to have access to a lot of this data. It would sort of be like trying to stay motivated in a game where you have no access to your score or whether you've eliminated another player.

Polled.

  1. I generally do only a quick skim of post titles and open threads (edit: maybe twice a month on average; I'll try visiting more often). I used to check LW compulsively prior to 2013, but now I think both LW and I have changed a lot and diverged from each other. No hard feelings, though.

  2. I rarely click link posts on LW. I seldom find them interesting, but I don't mind them as long as other LWers like them.

  3. I mostly check LW through a desktop browser. Back in 2011–2012, I used Wei Dai's "Power Reader" script to read all comments. I also u

... (read more)
29eB1
Is there any information on how well-calibrated the community predictions are on Metaculus? I couldn't find anything on the site. Also, if one wanted to get into it, could you describe what your process is?

I haven't been around for a while, but I expect to start fulfilling the backlog of requests after Christmas. Sorry for the long wait.

Do we know which country Wright was living in during 2010?

The article is available on various websites by exact phrase searching, but there are some minor transcription errors in these copies. I've transcribed it below using Google's copy of the scanned article to correct these errors. There seems to be a relevant captioned figure (maybe a photo of Fuller?) on p. 63 of the magazine that is missing from the scan.


Dymaxion Sleep

Sleep is just a bad habit. So said Socrates and Samuel Johnson, and so for years has thought grey-haired Richard Buckminster Fuller, futurific [sic] inventor of the Dymaxion* house (Time, A... (read more)

Here. Figures 4 and 5 are missing from the scan that I received. Dope ads.

0gwern
Thanks. Looks like figure 4/5 were not important going by the article text.

From the linked Wired article:

The PGP key associated with Nakamoto’s email address and references to an upcoming “cryptocurrency paper” and “triple entry accounting” were added sometime after 2013.

Gwern's comment in the Reddit thread:

[...] this is why we put our effort into nailing down the creation and modification dates of the blog post in third-party archives like the IA and Google Reader.

These comments seem to partly refer to the 2013 mass archive of Google Reader just before it was discontinued. For others who want to examine the data: the rel... (read more)

3. Here.

Huh. I never knew there were so many other plants that had similar effects on cats.

Anyway, best of luck getting Todd's work… and getting cats high.

0gwern
Thanks. Funny story - morphine addict with other issues for which the treatment was... morphine. Clever solution.
4gwern
Thanks. I wasted a bunch of money on catnip when it turned out my cat was immune, which I didn't even know was a thing. After reading up on it, it seemed like there were gaps in the research literature - most of it was hopelessly old and inaccessible, there was no single estimate for how frequently cats respond to catnip and substitutes (so I could meta-analyze/multilevel-model this easily), and no data on the relationships of responses within a cat (so if your cat is immune to catnip, what do you optimally try next?) but this is easy to experiment with since cats are common (I've already gotten set up with several cat toys impregnated with catnip/valerian/honeysuckle so I can test each cat I run into with a battery of stimulants). So after jailbreaking all the relevant literature, maybe run an online survey of catowners, then combine everything to get the population frequency of catnip response, and then begin experimenting with available cats to get an idea of whether responses are correlated and how frequently cats respond to each stimulant. Then catowners will know the risk of catnip immunity and each stimulant they should try next. See http://www.gwern.net/catnip A minor contribution, perhaps, but there are a lot of catowners out there and it would be nice to bring some clarity to this area.
  1. Requested.
  2. Sadly, I can't request entire dissertations. I'm sure there are Harvard students on LW; maybe try asking for help in the open thread?
  3. Requested.
0gwern
Thanks. As far as the Todd dissertation goes, I know someone who can request it for me and I've asked them, so hopefully! (I really want it since it seems to be the most comprehensive set of experiments ever done on catnip and any analysis of mine would be crippled without it.)

Still can't get it. I should be able to access it through an institutional subscription to the EBSCO database once the paper is assigned to an issue, replacing its current "online first" designation.

0gwern
I think it might never have been published. I tried going again via Libgen and my university proxy and Ebscohost, and it just doesn't show up as anything but that abstract despite supposedly being published 3 years ago. Oh well. It wasn't that important.

Nice paper.

p. 558 (Study 4):

Participants also completed a ten item personality scale (Gosling, Rentfrow & Swann, 2003) [the TIPI; an alternative is Rammstedt and John's BFI-10] that indexes individual differences in the Big Five personality traits (extraversion, agreeableness, conscientiousness, emotional stability, and openness). These data will not be considered further.

It's strange not to say why the data will not be considered further. The data are available, the reduction is clean, but the keys look a bit too skeletal given that copies of the ... (read more)

0gwern
Thanks. That's going to take a while to read.

it blew up to 14M

The object streams for indirect objects have been unpacked and stripped away, leaving their contents uncompressed. Use qpdf to regenerate compressed object streams:

qpdf --object-streams=generate in.pdf out.pdf

(The --stream-data=compress option is already set by default.)

While you are at it, might as well re-linearize the PDF for online readers with low bandwidth:

qpdf --object-streams=generate --linearize in.pdf out.pdf
0gwern
That seems to work. I tried gs, Gscan2pdf, and pdf2djvu but they all either didn't reduce size or segfaulted.

Got the whole PDF from HathiTrust. I think Chart I is missing from the scan.

0gwern
Thanks. I added some metadata and it blew up to 14M, which is unfortunate. Chart I does seem to be missing in both the PDF and the online version; I suspect that it's missing from the physical copy at UMich ('pocket' sounds like something that might go missing).
VincentYu
120

Downvoted. I'm sorry to be so critical, but this is the prototypical LW mischaracterization of utility functions. I'm not sure where this comes from, when the VNM theorem gets so many mentions on LW.

A utility function is, by definition, that which the corresponding rational agent maximizes the expectation of, by choosing among its possible actions. It is not "optimal as the number of bets you take approaches infinity": first, it is not 'optimal' in any reasonable sense of the word, as it is simply an encoding of the actions which a rational agent... (read more)

-3Houshalter
I understand the VNM theorem. I'm objecting to it. If you want to argue "by definition", then yes, according to your definition utility functions can't be used in anything other than expected utility. I'm saying that's silly. Not all rational agents, as my post demonstrates. An agent following median maximizing would not be describable by any utility function maximized with expected utility. I showed how to generalize this to describe more kinds of rational agents. Regular expected utility becomes a special case of this system. I think generalizing existing ideas and mathematics is a desirable thing sometimes. Yes, it is. If you assign some subjective "value" to different outcomes, and to different things, then maximizing expected u̶t̶i̶l̶i̶t̶y̶ value, will maximize it, as the number of decisions approaches infinity. For every bet I lose at certain odds, I will gain more from others some predictable percent of the time. On average it cancels out. This might not be the standard way of explaining expected utility, but it's very simple and intuitive, and shows exactly where the problem is. It's certainly sufficient for the explanation in my post. That's quite irrelevant. Sure humans are irrational and make inconsistencies and errors in counterfactual situations. We should strive to be more consistent though. We should strive to figure out the utility function that most represents what we want. And if we program an AI, we certainly want it to behave consistently. Again, back to arguing by definition. I don't care what the definition of "utility" is. If it would please you to use a different word, then we can do so. Maybe "value function" or something. I'm trying to come up with a system that will tell us what decisions we should make, or program an AI to make. One that fits our behavior and preferences the best. One that is consistent and converges to some answer given a reasonable prior. You haven't made any arguments against my idea or my criticisms of expected
VincentYu
110

I see. Looking into this, it seems that the (mis)use of the phrase "confidence interval" to mean "credible interval" is endemic on LW. A Google search for "confidence interval" on LW yields more than 200 results, of which many—perhaps most—should say "credible interval" instead. The corresponding search for "credible interval" yields less than 20 results.

I briefly skimmed the paper and don't see how you are getting this impression. Confidence intervals are—if we force the dichotomy—considered a frequentist rather than Bayesian tool. They point out that others are trying to squish a Bayesian interpretation on a frequentist tool by treating confidence intervals as though they are credible intervals, and they state this quite explicitly (p.17–18, emphasis mine):

Finally, we believe that in science, the meaning of our inferences are important. Bayesian credible intervals support an interpretation of probabili

... (read more)
4[anonymous]
Hmmm, yes, I suppose I was making the same mistake they were... I thought that what confidence intervals were are actually what credible intervals are.

Here. Sorry about the horrible format; I didn't see a better way to download the content or print the page. In addition, I couldn't access the figures.

0Morendil
Awesome, thanks! (ETA) I have the figures already from a secondary source, so that's OK.

Page-by-page .djvu scans are available here (found via this search; edit: it seems to appear sporadically in the search results). Full sequence of download links is http://202.116.13.3/ebook%5C24/24000522/ptiff/00000{001..744}.djvu


I wrote the following just before finding the scan of the book. I'll post it anyway.

I've used 1DollarScan for about 50 books, including math/stat textbooks, and the quality is consistently good (unless you need accurate color reproduction) even with the cheapest option (i.e., $1 per 100 pages), but you'll need to do your own po... (read more)

0gwern
Huh. Weird. I did not see that IP-server, I don't think, and I'm surprised that such a thing exists. I also don't see it in your linked search! Seems to be... maybe some sort of scan prepared by a Chinese university library, going by http://202.116.13.3/detail.asp?id=120 ("Library of JI'NAN University") ? Easy enough to get, combine, and add the metadata: $ for i in {001..744}; do wget "http://202.116.13.3/ebook%5C24/24000522/ptiff/00000"$i".djvu"; done $ djvm -c 1959-schlaifer-probabilitystatisticsbusinessdecisions.djvu 000*.djvu $ djvused 1959-schlaifer-probabilitystatisticsbusinessdecisions.djvu set-meta Title Probability and Statistics for Business Decisions: An Introduction to Managerial Economics Under Uncertainty Author Robert Schlaifer Publisher McGraw-Hill Book Company Subject statistics Keywords decision theory, subjective Bayesianism, value of information URL http://202.116.13.3/fulltext.asp?id=120 Creator Library of JI'NAN University CreationDate 1959 . save ^Z Browsing, it looks nice. Only 21MB, and the OCR looks good. Thanks! I think there was a cheaper one on Amazon, but in any case, that was only if I couldn't find a digital copy.

You take the probability of A not happening and multiply by the probability of B not happening. That gives you P(not A and not B).

Only if A and B are independent.

No. "Expected value" refers to the expectation of a variable under a probability distribution, whereas "expected utility" refers specifically to the expectation of a utility function under a probability distribution. That is, expected utility is a specific instantiation of an expected value; expected value is more general than expected utility and can refer to things other than utility.

The importance of this distinction often arises when considering the utility of large sums of money: a person may well decline a deal or gamble with posi... (read more)

See also this highly-upvoted question on the Physics Stack Exchange, which deals with your question.

Interesting. Thanks for posting this!

I received exactly the same number of SNPs from BGI, so it looks like our data were processed under the same pipeline. I've found three people who have publicly posted their BGI data: two at the Personal Genome Project (hu2FEC01 and hu41F03B, each with 5,095,048 SNPs), and one on a personal website (with 18,217,058 SNPs).

Then there are a few thousand SNPs that one or other analysis (in 26 cases, both) list in their output but don't report anything for. What causes this?

The double dashes are no calls. 23andme reports... (read more)

ILL couldn't get Schretlen et al. Can try again once the paper is included in the print journal, but I'd recommend just asking the authors for a copy.

0gwern
The PDF/journal copy seems to be up now: http://clinicalschizophrenia.org/doi/abs/10.3371/CSRP.SCST.103114?journalCode=csrp (PDF). Can't get to it through Sci-hub, but maybe your university access can get it now?
0gwern
Mm. I don't want to ask because then I can't post a copy publicly. Maybe I'll just drop that one - tDCS is not that important to me that I really need every paper.

Dale and Krueger's paper was revised and published in the Journal of Human Resources under the new title "Estimating the effects of college characteristics over the career using administrative earnings data".

I see. In GAZP vs. GLUT, Eliezer argues that the only way to feasibly create a perfect imitation of a specific human brain is to do computations that correspond in some way to the functional roles behind mental states, which will produce identical conscious experiences according to functionalism.

For uploading, that means whole brain emulation. In my underinformed opinion, whole brain emulation is not the Way to AGI if you just want AGI. At some point, then, AGI will be available while WBE systems will be way behind; and so, uploaders will at least temporarily face a deeply serious choice on this issue.

Are you suggesting that mind uploading to a non-WBE platform will be available before WBE? I don't think this is a common belief; uploading almost exclusively refers to WBE. See, for instance, Sandberg and Bostrom (2008), who don't distinguish bet... (read more)

2torekp
Good question, thanks. Yes, I do think that "mind uploading", suitably loosely defined, will be available first on a non-WBE platform. I'm assuming that human-level AGI relatively quickly becomes superhuman-level, to the point where imitating the behavior of a specific individual becomes a possibility.

The relevant paragraph is in Section 2.2.5:

OCD is ranked by the WHO in the top ten of the most disabling illnesses by lost income and decreased quality of life (Bobes et al., 2001). The severity of OCD differs markedly from one person to another. While some people may be able to hide their OCD from their own family, the disorder may have a major negative impact on social relationships leading to frequent family and marital discord or dissatisfaction, separation or divorce (Koran, 2000). It also interferes with leisure activities (Antony et al., 1998) and

... (read more)

The last one.

I don't think I can get the two dissertations. I'll put in ILL requests for the other papers over the next week.

0gwern
Thanks.

Chapter 3 is available from the publisher as a sample.

(BTW, this is an old help desk thread; the newest one is here.)

Is there any chance your sequencing had greater than 4x coverage?

I don't know. How do I find out?

0rseiter
I think the VCF would tell you if you had it. Another possibility would be using a lower quality threshold for calling SNPs, but that seems unlikely.

There is the References & Resources for LessWrong (last updated in 2011), which has a good selection of older posts and other resources by subject.

Are you sure you've downloaded your entire genome file? My uncompressed file is about 500 MB, and I got about 26000 annotations on Promethease. It seems like your file might have gotten truncated during the download.

Short step-by-step guide for those who want to get their genome annotated by Promethease:

  1. Use the 'Download All Files' link on the SpiderOak page to download your genome file.*
  2. Unzip then gunzip to get the raw text file genome.txt.
  3. Open the file in a text editor. Remove all the commented lines at beginning of the file except the last one (i.e.
... (read more)
4rseiter
Thanks for the explanation and tips! I used your procedure and ended up with the same 131MB file. Interestingly I did not need to remove the "--" entries. I have been exchanging email with BGI and they indicated files could have significantly different number of entries (but I am surprised at >3x!). Is there any chance your sequencing had greater than 4x coverage? My VCF file is queued up and should be available in a few months which should help clarify what I am seeing.

Not directly answering your conundrum on wrist computers, but—I go trail running frequently (in Hong Kong), so I've thought a bit about wearable devices and safety. Here are some of my solutions and thoughts:

  • I use a forearm armband (example) to hold my phone in a position that allows me to use and see the touchscreen while running. I find this incredibly useful for checking a GPS map on the run while keeping both hands free for falls. I worry that the current generation of watches are nowhere near as capable as my phone.

  • I rely a lot on Strava's online

... (read more)
2DataPacRat
Indeed so - but as far as I've been able to dig up so far, they require a bit more gold than I can afford. Such beacons are required to be (re)programmed with a serial number appropriate for the country they're to be used in, which can only be done at an authorized dealer, which makes online purchases from other countries almost pointless. As near as I can tell, the nearest place I can get such a beacon is at mec.ca , where the least expensive example I can find is $265, above my budget for camping electronics. I'd be happy to have such a device; I just don't see how I can acquire one with my particular level of fixed income.
Load More