Hi, I'm new here. I find this site while looking for information about A.I. I read a few articles and couldn't help but smile to myself and think 'wasn't this what to the Internet was suppose to be. I had no idea this site existed and I'm honestly glad to have found stacks of future reading, you know that feeling. I never really post on sites and would have usually have lurked myself silly but I've been promted into action by a question. I posted this to reddit in the shower thoughts section because it seemed appropriate but I'd like to ask you (more).
I was reading about Orthogonality thesis, and Oracle A.I.'s as warnings and attempted precaution to potential hostile outcomes. I've recently finished Robots and Empires and couldn't help but think that something like the Zeroth law could further complicate trying to restrain A.I.'s with begin laws like do no harm or seemingly innocent tasks like acquire paper clips. To me it seemed trying to stop A.I.'s from harming us whilst also completing another task would always end up with us in the way. So I thought perhaps we should try to give the A.I. a goal that would not benefit from violence in anyway. Try to make it Buddha-like. To become all knowing and one with all things? Would a statement like that even mean anything to a computer? The one critism I receive was "what would be the point of that?" I don't know. But I'm curious.
What do you think?
a goal that would not benefit from violence in anyway...To become all knowing
I have bad news for you. People have described ideas for an AI that only seeks knowledge (though I can't find the best link to explain it now). I think this design would calmly kill us all to see what would happen, if we'd somehow prevented it from dropping an anvil on its own head.
To "become one with all things" does not seem sufficiently well-specified to stop either from happening. In general, if we can reasonably interpret the goal as something that's already true, then the AI will do nothing to achieve it (nothing being the most efficient action).
A few notes about the site mechanics
A few notes about the community
If English is not your first language, don't let that make you afraid to post or comment. You can get English help on Discussion- or Main-level posts by sending a PM to one of the following users (use the "send message" link on the upper right of their user page). Either put the text of the post in the PM, or just say that you'd like English help and you'll get a response with an email address.
* Normal_Anomaly
* Randaly
* shokwave
* Barry Cotter
A note for theists: you will find the Less Wrong community to be predominantly atheist, though not completely so, and most of us are genuinely respectful of religious people who keep the usual community norms. It's worth saying that we might think religion is off-topic in some places where you think it's on-topic, so be thoughtful about where and how you start explicitly talking about it; some of us are happy to talk about religion, some of us aren't interested. Bear in mind that many of us really, truly have given full consideration to theistic claims and found them to be false, so starting with the most common arguments is pretty likely just to annoy people. Anyhow, it's absolutely OK to mention that you're religious in your welcome post and to invite a discussion there.
A list of some posts that are pretty awesome
I recommend the major sequences to everybody, but I realize how daunting they look at first. So for purposes of immediate gratification, the following posts are particularly interesting/illuminating/provocative and don't require any previous reading:
More suggestions are welcome! Or just check out the top-rated posts from the history of Less Wrong. Most posts at +50 or more are well worth your time.
Welcome to Less Wrong, and we look forward to hearing from you throughout the site!
Once a post gets over 500 comments, the site stops showing them all by default. If this post has 500 comments and you have 20 karma, please do start the next welcome post; a new post is a good perennial way to encourage newcomers and lurkers to introduce themselves. (Step-by-step, foolproof instructions here; takes <180seconds.)
If there's anything I should add or update on this post (especially broken links), please send me a private message—I may not notice a comment on the post.
Finally, a big thank you to everyone that helped write this post via its predecessors!