On Large Language Models in National Security Applications
Abstract: The overwhelming success of GPT-4 in early 2023 highlighted the transformative potential of LLMs across various sectors, including national security. This article explores the implications of LLM integration within national security contexts, analyzing their potential to revolutionize information processing, decision-making, and operational efficiency. Whereas LLMs offer substantial benefits, such as automating tasks and enhancing data analysis, they also pose significant risks, including hallucinations, data privacy concerns, and vulnerability to adversarial attacks. Through their coupling with decision-theoretic principles and Bayesian reasoning, LLMs can significantly improve decision-making processes within national security organizations. Namely, LLMs can facilitate the transition from data to actionable decisions, enabling decision-makers to quickly receive and distill available information with less manpower. Current applications within the US Department of Defense and beyond are explored, e.g., the USAF's use of LLMs for wargaming and automatic summarization, that illustrate their potential to streamline operations and support decision-making. However, these applications necessitate rigorous safeguards to ensure accuracy and reliability. The broader implications of LLM integration extend to strategic planning, international relations, and the broader geopolitical landscape, with adversarial nations leveraging LLMs for disinformation and cyber operations, emphasizing the need for robust countermeasures. Despite exhibiting "sparks" of artificial general intelligence, LLMs are best suited for supporting roles rather than leading strategic decisions. Their use in training and wargaming can provide valuable insights and personalized learning experiences for military personnel, thereby improving operational readiness.
- Spinning language models: Risks of propaganda-as-a-service and countermeasures. In 2022 IEEE Symposium on Security and Privacy (SP), pages 769–786. IEEE, 2022.
- Frank Bajak. U.s. intelligence agencies’ embrace of generative ai is at once wary and urgent, 2024. URL https://www.pbs.org/newshour/world/u-s-intelligence-agencies-embrace-of-generative-ai-is-at-once-wary-and-urgent. Accessed: 2024-06-18.
- Ylli Bajraktari. The us and australia need generative ai to give their forces a vital edge. The Strategist, 2024. URL https://www.aspistrategist.org.au/the-us-and-australia-need-generative-ai-to-give-their-forces-a-vital-edge/. Accessed: 2024-06-18.
- Aaron Bazin. How to build a virtual clausewitz. The Strategy Bridge, March 2017. URL https://thestrategybridge.org/the-bridge/2017/3/21/how-to-build-a-virtual-clausewitz.
- Joe Biden. Executive order 14110: Safe, secure, and trustworthy development and use of artificial intelligence, October 2023.
- Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712, 2023.
- Booookscore: A systematic exploration of book-length summarization in the era of llms. arXiv preprint arXiv:2310.00785, 2023.
- Can llm-generated misinformation be detected? arXiv preprint arXiv:2309.13788, 2023.
- This looks like that: deep learning for interpretable image recognition. Advances in neural information processing systems, 32, 2019.
- Large language models in wargaming: Methodology application and robustness. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2894–2903, 2024.
- Us air force hackathon: How large language models will revolutionize usaf flight test. Databricks Blog, March 2024. URL https://www.databricks.com/blog/us-air-force-hackathon-how-large-language-models-will-revolutionize-usaf-flight-test.
- Michael Dobuski. Google makes adjustments to ai overviews after a rocky rollout. ABC News, June 2024. URL https://abcnews.go.com/Technology/google-makes-adjustments-ai-overviews-rocky-rollout/story?id=99876432.
- Kevin Ellis. Human-like few-shot learning via bayesian reasoning over natural language. Advances in Neural Information Processing Systems, 36, 2024.
- Kirsten Errick. Navy discourages military generative ai, llm usage, October 2023. URL https://federalnewsnetwork.com/artificial-intelligence/2023/10/navy-discourages-military-generative-ai-llm-usage/. Accessed: 2024-06-18.
- Coa-gpt: Generative pre-trained transformers for accelerated course of action development in military operations. arXiv preprint arXiv:2402.01786, 2024.
- Disasterresponsegpt: Large language models for accelerated plan of action development in disaster response scenarios. arXiv preprint arXiv:2306.17271, 2023.
- Generative language models and automated influence operations: Emerging threats and potential mitigations. arXiv preprint arXiv:2301.04246, 2023.
- Elias Groll. State-backed hackers are experimenting with openai models. CyberScoop, February 14 2024. URL https://www.cyberscoop.com/openai-microsoft-apt-llm/.
- Insikt Group. Russia-linked copycat uses llms to weaponize influence content at scale. https://www.recordedfuture.com/russia-linked-copycop-uses-llms-to-weaponize-influence-content-at-scale, 2024. Accessed: 2024-06-15.
- John Harper. Nato on the hunt for new ai cognitive agent. DefenseScoop, November 14 2023. URL https://defensescoop.com/2023/11/14/nato-on-the-hunt-for-new-ai-cognitive-agent/.
- Jon Harper. Ai wargaming: Air force futures at mit. https://defensescoop.com/2024/04/12/ai-wargaming-air-force-futures-mit/, April 12 2024a. Accessed: 2024-06-17.
- Jon Harper. Army set to issue new policy guidance on use of large language models, 2024b. URL https://defensescoop.com/2024/05/09/army-policy-guidance-use-large-language-models-llm/. Accessed: 2024-06-19.
- Unshin Lee Harpley. Space force pumps the brakes on chatgpt-like technology with temporary ban. https://www.airandspaceforces.com/space-force-chatgpt-technology-temporary-ban/, 2023. Accessed: 2024-06-19.
- Unshin Lee Harpley. Air force launches its own generative ai chatbot. experts see promise and challenges. https://www.airandspaceforces.com/air-force-launches-generative-ai-chatbot/, 2024. Accessed: 2024-06-12.
- John Hill. Hadean builds large language model for british army virtual training space. https://www.army-technology.com/news/hadean-builds-large-language-model-for-british-army-virtual-training-space/, 2024. Accessed: 2024-06-15.
- Open-ended wargames with large language models. arXiv preprint arXiv:2404.11446, 2024.
- Jie Huang and Kevin Chen-Chuan Chang. Towards reasoning in large language models: A survey. arXiv preprint arXiv:2212.10403, 2022.
- C. Hunter and B. E. Bowen. We’ll never have a model of an ai major-general: Artificial intelligence, command decisions, and kitsch visions of war. Journal of Strategic Studies, pages 1–31, 2022.
- How large-language models can revolutionize military planning, 2023. URL https://warontherocks.com/2023/04/how-large-language-models-can-revolutionize-military-planning/. Accessed: 2024-06-19.
- Large language models in healthcare: Are we there yet?, May 2024. URL https://hai.stanford.edu/news/large-language-models-healthcare-are-we-there-yet. Stanford University Human-centered Artificial Intelligence.
- Human vs. machine: Language models and wargames. arXiv preprint arXiv:2403.03407, 2024.
- Leidos. Hackathon produces ai-enabled cjadc2 solutions for the battlefield. Leidos Insights, 2024. URL https://www.leidos.com/insights/hackathon-produces-ai-enabled-cjadc2-solutions-battlefield.
- Automatic and universal prompt injection attacks against large language models. arXiv preprint arXiv:2403.04957, 2024.
- Jwen Fai Low. Automated Information Warfare: For and Against Saturation Attacks. PhD thesis, McGill University, 2023.
- Are emergent abilities in large language models just in-context learning? arXiv preprint arXiv:2309.01809, 2023.
- A unified approach to interpreting model predictions. Advances in neural information processing systems, 30, 2017.
- Duncan MacRae. Large language models could ‘revolutionise the finance sector within two years, March 2024. URL https://www.artificialintelligence-news.com/2024/03/27/large-language-models-could-revolutionsise-the-finance-sector-within-two-years/. AINews.
- Anne J. Manning. What is ‘original scholarship’ in the age of ai?, May 2024. URL https://www.news.harvard.edu/gazette/story/2024/05/how-is-generative-ai-changing-education-artificial-intelligence/. The Harvard Gazette.
- The pentagon’s ’terminator conundrum’: Robots that could kill on their own. New York Times, October 25 2016. URL https://www.nytimes.com/2016/10/26/us/pentagon-artificial-intelligence-terminator.html.
- Jim Mattis. Summary of the 2018 national defense strategy of the united states of america. Technical report, Department of Defense Washington United States, 2018.
- Christopher McFadden. China is training ai to predict human behavior on the battlefield. https://interestingengineering.com/military/china-training-ai-predict-humans, 2024. Accessed: 2024-06-15.
- Recent advances in natural language processing via large pre-trained language models: A survey. ACM Computing Surveys, 56(2):1–40, 2023.
- Large language models: A survey. arXiv preprint arXiv:2402.06196, 2024.
- Phil Muncaster. Ai-powered russian network spreads fake news. https://www.infosecurity-magazine.com/news/aipowered-russian-network-fake-news/, 2024. Accessed: 2024-06-15.
- Are we winning? a brief history of military operations assessment. DTIC Document, DOP-2014-U-008512-1Rev, 2014.
- Fakes of varying shades: How warning affects human perception and engagement regarding llm hallucinations. arXiv preprint arXiv:2404.03745, 2024.
- NATO. Nato 2022 strategic concept. https://www.nato.int/nato_static_fl2014/assets/pdf/2022/6/pdf/290622-strategic-concept.pdf, 2022. Accessed: 2024-06-19.
- NATO Allied Command Transformation. TIDE Hackathon: Spotlight on the Wargaming LLM Challenge. NATO ACT, October 20 2023. URL https://www.act.nato.int/article/tide-hackathon-spotlight-wargaming-llm-challenge/.
- Anastasia Obis. Air force unveils new generative ai platform. https://federalnewsnetwork.com/defense-main/2024/06/air-force-unveils-new-generative-ai-platform/, June 2024. Accessed: 2024-06-17.
- Brian O’Connell. U.s. navy not ready to set sail on ai just yet. AI Finance Today, October 2023. URL https://aifinancetoday.com/u-s-navy-not-ready-to-set-sale-on-ai-just-yet/. Accessed: 2024-06-18.
- Adversarial machine learning: A taxonomy and terminology of attacks and mitigations. Technical report, National Institute of Standards and Technology, 2023.
- Is llm-as-a-judge robust? investigating universal adversarial attacks on zero-shot llm assessment. arXiv preprint arXiv:2402.14016, 2024.
- Jane O. Rathbun. Guidance on the use of generative ai and large language models. Memorandum, Department of the Navy, Chief Information Officer, September 2023. URL file:///C:/Users/Cpt%20Jenkins1/Downloads/DONCIOMemoGuidanceonUseofGenerativeAIandLargeLanguageModels06Sept20231.pdf. Accessed: 2024-06-18.
- Shreyas Reddy. North korean hackers take phishing efforts to next level with ai tools. https://www.nknews.org/2024/02/north-korean-hackers-take-phishing-efforts-to-next-level-with-ai-tools-report/#:~:text=called%20%E2%80%9Cfrightening.%E2%80%9D-,Microsoft%20reported%20that%20it%20observed%20North%20Korean%20threat%20group%20Kimsuky,generate%20content%20for%20phishing%20campaigns, 2024. Accessed: 2024-06-15.
- Escalation risks from language models in military and diplomatic decision-making. arXiv preprint arXiv:2401.03408, 2024.
- Kevin Roose. Gpt-4 is exciting and scary. The New York Times, 15, 2023.
- Justin Ross. Bravo 11 hackathon trip report. Technical report, USAF, Air Mobility Command, 2024.
- Walid S Saba. Stochastic llms do not understand language: Towards symbolic, explainable and ontologically based llms. In International Conference on Conceptual Modeling, pages 3–19. Springer, 2023.
- Are emergent abilities of large language models a mirage? Advances in Neural Information Processing Systems, 36, 2024.
- Applying large language models to dod software acquisition: An initial experiment. Carnegie Mellon University, Software Engineering Institute’s Insights, Apr 2024. URL https://doi.org/10.58012/s32x-gc46. Accessed: 2024-Jun-21.
- Science Desk. Us army experimenting with generative ai chatbots in war games: Report, 2024. URL https://indianexpress.com/article/technology/science/us-army-ai-chatbot-war-games-9199456/. Accessed: 2024-06-19.
- Jim Sciutto. The Return of Great Powers: Russia, China, and the End of American Exceptionalism. Penguin Press, New York, 2023. ISBN 9780593474132.
- Ai-powered autonomous weapons risk geopolitical instability and threaten ai research. arXiv preprint arXiv:2405.01859, 2024.
- Peter Suciu. Cia adopts microsoft’s generative ai model for sensitive data analysis, 2024. URL https://www.clearancejobs.com/news/cia-adopts-microsofts-generative-ai-model-for-sensitive-data-analysis/. Accessed: 2024-06-18.
- Sindhu Sundar. Air force and defense department using ai and llm with humans. https://www.businessinsider.com/air-force-defense-department-using-ai-artificial-intelligence-llm-humans-2023-7, July 2023.
- Azmi Tamin. Baidu’s ernie bot better at accuracy than chatgpt but lingers in politics. https://interestingengineering.com/innovation/baidus-ernie-bot-better-at-accuracy-than-chatgpt-but-lingers-in-politics, 2024. Accessed: 2024-06-15.
- TF Lima. Task force lima and gpt. https://chatgpt.com/g/g-v12me2Sha-task-force-lima-gpt, 2024.
- Catherine Trifiletti. Nato leverages mitre’s ai expertise. MITRE, 2023. URL https://www.mitre.org/news-insights/impact-story/nato-leverages-mitres-ai-expertise.
- Does human collaboration enhance the accuracy of identifying llm-generated deepfake texts? In Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, volume 11, pages 163–174, 2023.
- US Department of Defense. DoD Responsible AI (RAI) Strategy and Implementation Pathway, 2022. Accessed 17 Jun 2024.
- U.S. Department of Defense. Dod data analytics and ai adoption strategy. https://media.defense.gov/2023/Nov/02/2003333300/-1/-1/1/DOD_DATA_ANALYTICS_AI_ADOPTION_STRATEGY.PDF, November 2 2023.
- U.S. Department of State. Political declaration on responsible military use of artificial intelligence and autonomy. https://www.state.gov/political-declaration-on-responsible-military-use-of-artificial-intelligence-and-autonomy/, 2024. Accessed: 2024-06-19.
- US Joint Chiefs of Staff. Joint doctrine note 1-18. Technical report, US Joint Chiefs of Staff, 2018. URL https://www.jcs.mil/Portals/36/Documents/Doctrine/jdn_jg/jdn1_18.pdf.
- US Joint Chiefs of Staff. Joint Publication 5-0: Joint Planning. U.S. Department of Defense, 2020. URL https://www.jcs.mil/Portals/36/Documents/Doctrine/pubs/jp5_0.pdf.
- US Joint Chiefs of Staff. Joint publication 3-0: Joint operations. Technical report, US Joint Chiefs of Staff, 2022. URL https://www.jcs.mil/Doctrine/Joint-Doctrine-Pubs/3-0-Operations-Series/.
- David Vergun. Darpa aims to develop ai autonomy applications warfighters can trust, 2024. URL https://www.defense.gov/News/News-Stories/Article/Article/3722849/darpa-aims-to-develop-ai-autonomy-applications-warfighters-can-trust/#:~:text=An%20example%20of%20that%2C%20he,software%20that%20underlies%20critical%20infrastructure. Accessed: 2024-06-20.
- Brandi Vincent. Air and space forces lean into data-informed decision making. https://defensescoop.com/2023/03/22/air-and-space-forces-lean-into-data-informed-decision-making/, March 22 2023a. Accessed: 2024-06-17.
- Brandi Vincent. Inside task force lima’s exploration of 180-plus generative ai use cases for dod. DefenseScoop, 2023b. URL https://defensescoop.com/2023/11/06/inside-task-force-limas-exploration-of-180-plus-generative-ai-use-cases-for-dod/.
- Brandi Vincent. How marine corps university is experimenting with generative ai in simulations and wargaming, 2023c. URL https://defensescoop.com/2023/06/28/how-marine-corps-university-is-experimenting-with-generative-ai-in-simulations-and-wargaming/. Accessed: 2024-06-18.
- Brandi Vincent. Task force lima preps new space for generative ai experimentation. DefenseScoop, April 2 2024a. URL https://defensescoop.com/2024/04/02/task-force-lima-preps-new-space-generative-ai-experimentation/#:~:text=AI-,Task%20Force%20Lima%20preps%20new%20space%20for%20generative%20AI%20experimentation,to%20unleash%20an%20experimental%20sandbox.
- Brandi Vincent. Cdao developing ‘classification guide’ for large language models. DefenseScoop, 2024b. URL https://defensescoop.com/2024/02/21/cdao-classification-guide-large-language-models-lugo/.
- Dell: Generating reactions and explanations for llm-based misinformation detection. arXiv preprint arXiv:2402.10426, 2024.
- Emergent abilities of large language models. arXiv preprint arXiv:2206.07682, 2022.
- Emma Woollacott. Russian trolls outsource disinformation campaigns to africa. Forbes, 2020. Senior Contributor.
- Combating misinformation in the era of generative ai models. In Proceedings of the 31st ACM International Conference on Multimedia, pages 9291–9298, 2023.
- A survey on game playing agents and large models: Methods, applications, and challenges. arXiv preprint arXiv:2403.10249, 2024.
- Ilker Yildirim and LAÂ Paul. From task structures to world models: what do llms know? Trends in Cognitive Sciences, 2024.
- Marc Zao-Sanders. How People Are Really Using GenAI. Harvard Business Review, March 2024. URL https://hbr.org/2024/03/how-people-are-really-using-genai.
- Detecting fake news for reducing misinformation risks using analytics approaches. European Journal of Operational Research, 279(3):1036–1052, 2019.
- Benchmarking large language models for news summarization. Transactions of the Association for Computational Linguistics, 12:39–57, 2024.
- Characterizing political bias in automatic summaries: A case study of trump and biden. arXiv preprint arXiv:2305.02321, 2023.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.