Analyzing and Mitigating (with LLMs) the Security Misconfigurations of Helm Charts from Artifact Hub
Abstract: Background: Helm is a package manager that allows defining, installing, and upgrading applications with Kubernetes (K8s), a popular container orchestration platform. A Helm chart is a collection of files describing all dependencies, resources, and parameters required for deploying an application within a K8s cluster. Objective: The goal of this study is to mine and empirically evaluate the security of Helm charts, comparing the performance of existing tools in terms of misconfigurations reported by policies available by default, and measure to what extent LLMs could be used for removing misconfiguration. We also want to investigate whether there are false positives in both the LLM refactorings and the tool outputs. Method: We propose a pipeline to mine Helm charts from Artifact Hub, a popular centralized repository, and analyze them using state-of-the-art open-source tools, such as Checkov and KICS. First, such a pipeline will run several chart analyzers and identify the common and unique misconfigurations reported by each tool. Secondly, it will use LLMs to suggest mitigation for each misconfiguration. Finally, the chart refactoring previously generated will be analyzed again by the same tools to see whether it satisfies the tool's policies. At the same time, we will also perform a manual analysis on a subset of charts to evaluate whether there are false positive misconfigurations from the tool's reporting and in the LLM refactoring.
- Armo. 2023. Kubescape. https://www.armosec.io/kubescape/. Dec. 2023.
- Som Biswas. 2023. Role of ChatGPT in Computer Programming: ChatGPT in Computer Programming. Mesopotamian Journal of Computer Science 2023 (Feb. 2023), 8–16. https://doi.org/10.58496/MJCSC/2023/002
- Agathe Blaise and Filippo Rebecchi. 2022. Stay at the Helm: secure Kubernetes deployments via graph generation and attack reconstruction. In 2022 CLOUD. IEEE, Barcelona, 59–69. https://doi.org/10.1109/CLOUD55607.2022.00022
- Under-reported Security Defects in Kubernetes Manifests. In 2021 EnCyCriS. IEEE/ACM, Madrid, 9–12. https://doi.org/10.1109/EnCyCriS52570.2021.00009
- Checkmarx. 2023. KICS. https://kics.io. Dec. 2023.
- Prisma Cloud. 2023. Checkov. https://www.checkov.io. Dec. 2023.
- CNCF. 2022. CNCF 2022 Annual Survey. https://www.cncf.io/reports/cncf-annual-survey-2022/. Dec. 2023.
- Datree. 2023. Datree. https://www.datree.io. Dec. 2023.
- Large Language Models for Software Engineering: Survey and Open Problems. arXiv:2310.03533Â [cs.SE]
- VulRepair: A T5-Based Automated Software Vulnerability Repair. In ESEC/FSE 2022 (Singapore, Singapore). Association for Computing Machinery, New York, NY, USA, 935–947. https://doi.org/10.1145/3540250.3549098
- Google. 2022. Gemini - Chat Based AI Tool from Google. https://gemini.google.com. Dec. 2023.
- Emilia Hansson and Oliwer Ellreus. 2023. Code Correctness and Quality in the Era of AI Code Generation : Examining ChatGPT and GitHub Copilot. , 69Â pages.
- A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions. arXiv:2311.05232Â [cs.CL]
- XI Commandments of Kubernetes Security: A Systematization of Knowledge Related to Kubernetes Security Practices. In 2020 SecDev. IEEE, Virtual, 58–64. https://doi.org/10.1109/SecDev45635.2020.00025
- Matt Johnson. 2021. Top trends from analyzing the security posture of open-source Helm charts. Technical Report. Bridgecrew. Available on the web at https://bridgecrew.io/blog/open-source-helm-security-research/.
- OpenAI. 2022. ChatGPT. https://chat.openai.com/. Dec. 2023.
- Asleep at the Keyboard? Assessing the Security of GitHub Copilot’s Code Contributions. In 2022 IEEE Symposium on Security and Privacy (SP). IEEE, San Francisco, 754–768. https://doi.org/10.1109/SP46214.2022.9833571
- Maestro: A Platform for Benchmarking Automatic Program Repair Tools on Software Vulnerabilities. In 2022 ISSTA (Virtual, South Korea). ACM, New York, NY, USA, 789–792. https://doi.org/10.1145/3533767.3543291
- Akond Rahman. 2018. Characteristics of Defective Infrastructure as Code Scripts in DevOps. In 2018 ICSE (Gothenburg, Sweden). ACM, New York, NY, USA, 476–479. https://doi.org/10.1145/3183440.3183452
- The Seven Sins: Security Smells in Infrastructure as Code Scripts. In 2019 IEEE/ACM ICSE. IEEE, Montreal, 164–175. https://doi.org/10.1109/ICSE.2019.00033
- Security Misconfigurations in Open Source Kubernetes Manifests: An Empirical Study. ACM TOSEM 33, 1 (jan 2023), 37Â pages. https://doi.org/10.1145/3579639
- Defect Prediction Metrics for Infrastructure as Code Scripts in DevOps. In 2018 ICSE (Gothenburg, Sweden). Association for Computing Machinery, New York, NY, USA, 414–415. https://doi.org/10.1145/3183440.3195034
- Akond Rahman and Laurie Williams. 2021. Different Kind of Smells: Security Smells in Infrastructure as Code Scripts. IEEE S&P 19, 3 (2021), 33–41. https://doi.org/10.1109/MSEC.2021.3065190
- Lost at C: A User Study on the Security Implications of Large Language Model Code Assistants. In 32nd USENIX Security Symposium (USENIX Security 23). USENIX Association, Anaheim, CA, 2205–2222. https://www.usenix.org/conference/usenixsecurity23/presentation/sandoval
- Shopify. 2023. Kubeaudit. https://github.com/Shopify/kubeaudit. Dec. 2023.
- Snyk. 2023. AI code, security, and trust in modern development. https://snyk.io/de/reports/ai-code-security/. Dec. 2023.
- Daniel Sokolowski and Guido Salvaneschi. 2023. Towards Reliable Infrastructure as Code. In 2023 IEEE 20th International Conference on Software Architecture Companion (ICSA-C). IEEE, Italy, 318–321. https://doi.org/10.1109/ICSA-C57050.2023.00072
- StackRox. 2023. KubeLinter. https://github.com/stackrox/kube-linter. Dec. 2023.
- Tenable. 2023. Terrascan. https://runterrascan.io. Dec. 2023.
- The Linux Foundation. 2022. Artifact HUB. https://artifacthub.io. Oct. 2023.
- Use of General Repair Tool for Fixing Security Vulnerabilities. In 2022 ICITRI. IEEE, online, 135–140. https://doi.org/10.1109/ICITRI56423.2022.9970223
- ChatGPT Prompt Patterns for Improving Code Quality, Refactoring, Requirements Elicitation, and Software Design. arXiv:2303.07839Â [cs.SE]
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.