I've seen that. He basically looks through his history with a script and then wgets it, as well as submitting to archive systems. That's wasteful on bandwidth, since everything is downloaded twice, and anything not public needs to be done manually with cookies. He also can't prove that the pages came from a site, even if it used HTTPS.
I was thinking of something like a browser extension that just made sure nothing downloaded was ever deleted. I wonder if Chrome has a hook for when it internally deletes something, so that a program could instead copy it and convert it to some format.
That's wasteful on bandwidth, since everything is downloaded twice, and anything not public needs to be done manually with cookies.
But it's dead-simple and robust compared to some sort of in-browser extension which saves the rendered DOM in the background.
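For concreteness, here is a rough sketch of what the history-then-wget approach amounts to. It is not the actual script being discussed. The function names, the `archive` output directory and the example URLs are made up for illustration, and the Wayback Machine "save" URL pattern is an assumption about how the submission step might be done. It only builds the commands and prints them, without running anything.

```python
from urllib.parse import urlsplit


def select_urls(history, skip_hosts=frozenset()):
    """Keep unique http(s) URLs in first-seen order, dropping skipped hosts."""
    seen = set()
    kept = []
    for url in history:
        parts = urlsplit(url)
        if parts.scheme not in ("http", "https"):
            continue  # skip chrome://, about:, file:, and similar entries
        if parts.hostname in skip_hosts:
            continue
        if url in seen:
            continue  # each page only needs to be fetched once
        seen.add(url)
        kept.append(url)
    return kept


def wget_command(url, out_dir="archive", cookies=None):
    """Build a wget invocation that saves a page plus the files it needs."""
    cmd = [
        "wget",
        "--page-requisites",   # also fetch images, CSS, and so on
        "--convert-links",     # rewrite links so the local copy is browsable
        "--adjust-extension",  # add .html where the server omitted it
        "--directory-prefix=" + out_dir,
    ]
    if cookies:
        # Non-public pages need a cookies file exported by hand (hypothetical path).
        cmd.append("--load-cookies=" + cookies)
    cmd.append(url)
    return cmd


def wayback_save_url(url):
    """Assumed Wayback Machine 'save' URL for requesting a public snapshot."""
    return "https://web.archive.org/save/" + url


if __name__ == "__main__":
    # Hypothetical history entries; a real script would read the browser's database.
    history = [
        "https://example.com/a",
        "chrome://settings",
        "https://example.com/a",
        "http://localhost:8000/x",
    ]
    for url in select_urls(history, skip_hosts={"localhost"}):
        print(" ".join(wget_command(url)))
        print(wayback_save_url(url))
```

The double download is visible here: the browser already fetched each page once, and wget fetches it again. The cookies argument is the manual step needed for anything not public.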
He also can't prove that the pages came from a site, even if it used HTTPS.
I've never needed to prove that. My concern is usually having a copy at all, and the Internet Archive (IA) is trusted enough that its copy is de facto proof.
But it's possible:
If it's worth saying, but not worth its own post (even in Discussion), then it goes here.
Notes for future OT posters:
1. Please add the 'open_thread' tag.
2. Check if there is an active Open Thread before posting a new one. (Immediately before; refresh the list-of-threads page before posting.)
3. Open Threads should be posted in Discussion, and not Main.
4. Open Threads should start on Monday, and end on Sunday.