Corinne Yorkman and Mark Reith
Air Force Institute of Technology, WPAFB, USA
corinne.yorkman.1@us.af.mil
mark.reith.3@us.af.mil

Abstract: Great power competition has escalated globally, making it increasingly important for the Department of Defense (DoD) to adopt artificial intelligence (AI) technologies that are advanced and secure. Large language models (LLMs), which generate text, code, images, and other digital content based on the data sets used in training, have gained attention for their potential in DoD applications such as data analysis, intelligence processing, and communication. However, due to the complex architecture and extensive data dependency of LLMs, integrating them into defense operations presents unique cybersecurity challenges. These risks, if not properly managed, could pose severe threats to national security and mission integrity. This survey paper categorizes these challenges into vulnerability-centric risks, such as data leakage and misinformation, and threat-centric risks, including prompt manipulation and data poisoning, providing a comprehensive overview of these risks.

Keywords: Large language models, Cybersecurity challenges, Department of Defense

1. Introduction

The integration of artificial intelligence (AI) into critical operations has transformed numerous sectors, and the Department of Defense (DoD) is no exception. Large language models (LLMs) represent significant advancements in natural language processing (NLP). These models have the potential to revolutionize decision-making processes, intelligence analysis, and communication strategies within the DoD (Caballero & Jenkins, 2024). However, with these opportunities come substantial cybersecurity risks, as the DoD operates in environments with high stakes for national security and sensitive data protection. The expansive capabilities of LLMs also introduce novel vulnerabilities and threats that demand rigorous examination and tailored mitigation strategies (Department of Defense, 2023).
While the field of cybersecurity has seen a growing body of research, the specific risks that LLMs introduce in defense contexts remain comparatively underexplored.

1.1 Framework

A key aspect of this survey is the categorization of risks into two primary frameworks: vulnerability-centric risks and threat-centric risks. Vulnerability-centric risks address the systemic weaknesses within LLMs, such as data leakage, unintended biases, and misinformation (Ganguli et al., 2023). In contrast, threat-centric risks pertain to external adversarial threats, including prompt manipulation and data poisoning. Threat-centric and vulnerability-centric approaches are among the most common perspectives used to address risk (Silva & Jacob, 2018). This dual framework is rooted in established practices within cybersecurity and risk management, which emphasize the interplay between mitigating internal weaknesses and addressing external threats. Further, it follows the NIST Cybersecurity Framework, which identifies vulnerabilities and threats as key risk factors. The classification also provides a structured basis for the analysis that follows.

2. Background

LLMs are "a category of foundation models trained on immense amounts of data making them capable of understanding and generating natural language and other types of content to perform a wide range of tasks" (IBM, 2024). These capabilities include text summarization, question answering, language translation, and more. LLMs are built on deep learning architectures and trained on vast datasets, which enable them to perform such tasks at scale.

Recent advancements in LLMs have pushed the boundaries of what these systems can achieve and highly diversified their applications. The introduction of transformers has revolutionized LLM architectures by enabling models to understand context through mechanisms like self-attention, significantly enhancing their performance (Vaswani et al., 2017).
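To make the self-attention mechanism concrete, the sketch below implements single-head scaled dot-product attention in plain Python. The token embeddings are toy values chosen for illustration, and the query/key/value projections are left as identities to keep the sketch minimal; real transformer layers use learned projection matrices and many attention heads.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(tokens):
    """Single-head scaled dot-product self-attention (Vaswani et al., 2017).
    Each token's output is a softmax-weighted mixture of all token vectors,
    weighted by query-key similarity, so every position sees its context."""
    d_k = len(tokens[0])
    outputs = []
    for q in tokens:  # each token acts as a query against every key
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in tokens]
        weights = softmax(scores)  # attention distribution over the sequence
        outputs.append([sum(w * v[j] for w, v in zip(weights, tokens))
                        for j in range(d_k)])
    return outputs

# Toy sequence: three 2-dimensional "token embeddings" (illustrative values).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
out = self_attention(tokens)
print(len(out), len(out[0]))  # 3 2: three context-mixed vectors of dimension 2
```

Because each output row is a convex combination of the input vectors, every token's representation is pulled toward the tokens it attends to most; this is the context-understanding property the paragraph above refers to.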
Advancements in model fine-tuning and transfer learning also allow LLMs to be adapted to specialized domains and tasks.

2.1 Role of LLMs in DoD Operations

The DoD has recognized the transformative potential of LLMs for enhancing operational efficiency and decision-making processes. As stated by the Deputy Secretary of Defense (2023), "the DoD faces an imperative to explore the use of this technology and the potential of these models' scale, speed, and interactive capabilities to improve the Department's mission effectiveness while simultaneously identifying proper protection measures and mitigating a variety of related risks." In an effort to advance safe and responsible AI technology within the USAF, the Air Force Research Laboratory developed NIPRGPT, an LLM cleared for installation on unclassified USAF systems (Secretary of the Air Force Public Affairs, 2024). Additional LLM use cases include those by the Air Mobility Command, which has leveraged LLMs to generate campaign simulations, and those by the US Air Forces Central, which uses LLMs to expedite routine maintenance of software tools (Caballero & Jenkins, 2024). LLMs have the potential to transform processes such as information analysis, planning, and decision-making, and to aid in military exercises. They can be used to synthesize intelligence.