论文标题
通过消极和积极的激励措施调解人工智能的发展
Mediating Artificial Intelligence Developments through Negative and Positive Incentives
论文作者
论文摘要
人工智能领域(AI)正在经历一段巨大期望的时期,在研究,商业和政策中引入了一定程度的焦虑。 AI种族的叙述使人们认为他们可能会错过。无论是真实的,对这种叙述的信念是否有害,因为某些利益持有人会感到有义务削减安全预防措施,或者只是为了“赢”而忽略社会后果。从描述广泛的技术种族的基线模型开始,与其他人相比,赢家(例如AI进步,专利种族,药品技术)带来了重大好处,我们在这里调查了积极(奖励)和负面(惩罚)激励措施如何有利地影响兴奋成果。我们发现惩罚能够降低不安全参与者的发展速度或通过过度监管减少创新的能力的条件。另外,我们表明,在几种情况下,奖励遵循安全措施的人可能会提高开发速度,同时确保安全选择。此外,在{后者}政权中,奖励不会像惩罚的情况那样遭受过度监管的问题。总体而言,我们的发现为在平稳和突然的技术转变的背景下最适合提高安全性合规性的性质和调节行动提供了宝贵的见解。
The field of Artificial Intelligence (AI) is going through a period of great expectations, introducing a certain level of anxiety in research, business and also policy. This anxiety is further energised by an AI race narrative that makes people believe they might be missing out. Whether real or not, a belief in this narrative may be detrimental as some stake-holders will feel obliged to cut corners on safety precautions, or ignore societal consequences just to "win". Starting from a baseline model that describes a broad class of technology races where winners draw a significant benefit compared to others (such as AI advances, patent race, pharmaceutical technologies), we investigate here how positive (rewards) and negative (punishments) incentives may beneficially influence the outcomes. We uncover conditions in which punishment is either capable of reducing the development speed of unsafe participants or has the capacity to reduce innovation through over-regulation. Alternatively, we show that, in several scenarios, rewarding those that follow safety measures may increase the development speed while ensuring safe choices. Moreover, in {the latter} regimes, rewards do not suffer from the issue of over-regulation as is the case for punishment. Overall, our findings provide valuable insights into the nature and kinds of regulatory actions most suitable to improve safety compliance in the contexts of both smooth and sudden technological shifts.