You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

gwern comments on Open thread, July 29-August 4, 2013 - Less Wrong Discussion

3 Post author: David_Gerard 29 July 2013 10:26PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (381)

You are viewing a single comment's thread.

Comment author: gwern 03 August 2013 02:23:59AM 10 points [-]

Question: where can I upload jailbroken PDFs that is public & Google-visible?

For a job, I compiled ~100MB of lipreading research, some of them extremely obscure & hard to find (I also have some Japanese literature PDFs in a similar situation); while I have no personal interest in the topic and do not want to host indefinitely the PDFs on gwern.net, I feel it would be a massive waste to simply delete them.

I cannot simply put them in a Dropbox public folder because they wouldn't show up in Google, and Scribd is an abomination I despise.

(crosspost from Google+)

Comment author: Douglas_Knight 13 September 2013 05:45:51PM 2 points [-]

wordpress.com has 3gb quota and pdfs are visible to google.

Comment author: gwern 13 September 2013 06:31:40PM 0 points [-]

Interesting. I am giving it a try at http://gwern0.wordpress.com/ . We'll see in a month if any of the PDFs show up in Google.

Comment author: Douglas_Knight 14 September 2013 09:37:10PM 1 point [-]

Where are the links to the documents?

Comment author: gwern 14 September 2013 10:22:36PM 0 points [-]

I don't know. I uploaded the PDFs and 'attached' them to a post. I'm not sure what I'm supposed to do beyond that.

Comment author: Douglas_Knight 14 September 2013 11:30:08PM *  1 point [-]

How to use wordpress to upload and publicize files:

Files show up at gwern0.files.wordpress.com/2013/09/original_name.
There's also an "attachment page" at gwern0.wordpress.com/?attachment_id=##, but only after you publish the associated post, while the file is immediately world readable after upload, just secret.

To get wordpress to populate the post with links:

  1. Edit post
  2. "add media"
  3. (upload files via "upload files" pane)
  4. choose "media files" pane, if necessary
  5. select all files
  6. click "insert into post" at bottom.
Comment author: gwern 15 September 2013 06:24:43PM 0 points [-]

I see, thanks. It looks like that works - I see PDF links in both posts now.

Comment author: Douglas_Knight 15 September 2013 10:06:32PM 1 point [-]

OK, now I can find the links, but can google? It's not supposed to follow links from LW. I think WP advertises new accounts somewhere, but I don't think it's worth much. I suggest you link to it from gwern.net and/or google plus. Also that you link to your google drive public folder.

(I predict that if you don't link to the WP page, google will eventually find it and index it, but not index the pdfs. So if someone searches for the title of the article, google will produce the hit, but google scholar won't have it. And "eventually" might be more than month.)

Comment author: gwern 13 October 2013 10:25:35PM 0 points [-]

So, I just opened up the WP blog and did Scholar searches for 3 or 4 of the lipreading PDFs. Not a single hit.

Comment author: gwern 15 September 2013 10:16:30PM 0 points [-]

We'll see in a month.

Comment author: Douglas_Knight 14 September 2013 10:38:00PM 0 points [-]

Let me get back to you about wordpress, but I wonder if this explains why google drive didn't work for you, when it did work for WB? Google could find everything on the google drive, unlike wp, but maybe they only look via links.

Comment author: hg00 07 August 2013 05:24:38AM 1 point [-]

Scribd is an abomination I despise.

Hm? As far as I can tell, the worst thing they do is sometimes charge users to access older uploaded documents. They have to make money somehow. Would you rather them insert full-page ads in documents the way YouTube now plays ads before video clips?

Anyway, one idea is to find people who run sites on topics related to the PDFs and suggest that they upload them to their sites. Should increase the google juice of both the documents and the sites of those who upload them, so win/win, right?

Comment author: gwern 07 August 2013 11:31:25PM 2 points [-]

As far as I can tell, the worst thing they do is sometimes charge users to access older uploaded documents.

Money which they have zero right to collect and which breaks the implied contract they had with their previous users who uploaded those documents.

And their interface is butt-ugly with PDFs completely unreadable in their HTML version - but of course they don't let you download the PDFs because they're all behind the Scribd paywall.

Hosting documents. A pretty simple task, one would think, and yet Scribd manages to do it both scuzzily and poorly.

They have to make money somehow.

A fully-general excuse. But they are not owed a living.

Comment author: DanielLC 03 August 2013 03:57:02AM 1 point [-]

I'd guess Google Drive.

You could get a website that points to wherever the download actually is.

Comment author: gwern 03 August 2013 02:39:46PM 1 point [-]

That's one of the suggestions on G+ too. I didn't think that they would show up in Google proper and get indexed, but someone said they had for him, so maybe I will go with that. (Even if it doesn't work, I can always redownload and upload somewhere else, presumably.)

Comment author: gwern 26 May 2014 02:22:56AM 0 points [-]

I'm currently trying http://pdf.yt/ for PDF hosting. It seems to talk the talk.