Artificial intelligence (AI) increasingly makes autonomous decisions that affect our daily lives. Its actions might cause accidents or harm or, more generally, violate regulations. Determining whether an AI caused a specific event and, if so, what triggered the AI's action are key forensic questions. We provide a conceptualization of these problems and of strategies for forensic investigation. We focus on AI that is potentially "malicious by design" and on grey-box analysis. Our evaluation using convolutional neural networks illustrates challenges and ideas for identifying malicious AI.