danfaggella
Personal Universes: A Solution to the Multi-Agent Value Alignment Problem
Since the birth of the field of Artificial Intelligence (AI), researchers have worked on creating ever more capable machines, but with recent successes in multiple subdomains of AI [1-7], the safety and security of such systems and of predicted future superintelligences [8, 9] have become paramount [10, 11]. While many diverse safety mechanisms are being investigated [12, 13], the ultimate goal is to align AI with the goals, values, and preferences of its users, who are likely to include all of humanity. The value alignment problem [14] can be decomposed into three sub-problems: extraction of personal values from individual persons, combination of such personal preferences in a way that is acceptable to all, and finally production of an intelligent system that implements the combined values of humanity. A number of approaches for extracting values [15-17] from people have been investigated, including inverse reinforcement learning [18, 19], brain scanning [20], value learning from literature [21], and understanding of human cognitive limitations [22]. Assessing the potential for success of particular value extraction techniques is beyond the scope of this paper; we simply assume that one of the current methods, a combination of them, or some future approach will allow us to accurately learn the values of given people. Likewise, we will not directly address how such values, once learned, can be represented or encoded in computer systems for storage and processing.
Can Artificial Intelligence Make the World a Better Place?
My most recent TEDx is titled "Can AI Make the World a Better Place?", but this title is somewhat misleading. The talk is really about how the transition beyond humanity will take place. Those who've followed TechEmergence since the early days are aware of the broader moral vision behind the company: "To proliferate the conversation about determining and moving towards the most beneficial transition beyond humanity." I have never identified as a transhumanist, but I see the transition beyond humanity as literally inevitable, and I believe we should guide this transition rather than be taken for a ride inadvertently. Because the TEDx format is so short, I'm never given the time I would like to fully flesh out my ideas and to reference the sources and people I have drawn from in putting those ideas together.