Papers
Topics
Authors
Recent
Search
2000 character limit reached

Large Language Models for Computer-Aided Design: A Survey

Published 13 May 2025 in cs.LG, cs.CL, cs.GR, and cs.MM | (2505.08137v1)

Abstract: LLMs have seen rapid advancements in recent years, with models like ChatGPT and DeepSeek, showcasing their remarkable capabilities across diverse domains. While substantial research has been conducted on LLMs in various fields, a comprehensive review focusing on their integration with Computer-Aided Design (CAD) remains notably absent. CAD is the industry standard for 3D modeling and plays a vital role in the design and development of products across different industries. As the complexity of modern designs increases, the potential for LLMs to enhance and streamline CAD workflows presents an exciting frontier. This article presents the first systematic survey exploring the intersection of LLMs and CAD. We begin by outlining the industrial significance of CAD, highlighting the need for AI-driven innovation. Next, we provide a detailed overview of the foundation of LLMs. We also examine both closed-source LLMs as well as publicly available models. The core of this review focuses on the various applications of LLMs in CAD, providing a taxonomy of six key areas where these models are making considerable impact. Finally, we propose several promising future directions for further advancements, which offer vast opportunities for innovation and are poised to shape the future of CAD technology. Github: https://github.com/lichengzhanguom/LLMs-CAD-Survey-Taxonomy

Summary

  • The paper offers an exhaustive taxonomy of six LLM-CAD application domains, mapping from code generation to evaluation.
  • It details methodologies like automated CAD code and parametric data generation using multimodal LLM processing.
  • The paper quantifies industrial benefits and outlines future research directions for enhanced CAD design automation.

LLMs for Computer-Aided Design: An Expert Survey

Overview and Taxonomy

This survey offers an exhaustive analysis of the integration of LLMs within Computer-Aided Design (CAD) ecosystems. The authors delineate the industrial relevance of CAD, systematically review foundational LLM architectures, and classify state-of-the-art LLM families (closed-source and open models). The core contribution is a taxonomy of six primary application areas where LLMs intersect with CAD, shaping both current practice and future directions. Figure 1

Figure 1: Taxonomy of the review, enumerating six major thematic areas in LLM–CAD integration.

Industrial Significance of CAD

CAD is established as an indispensable infrastructure across sectors such as manufacturing, architecture, aerospace, electronics, and biomedical engineering. The migration from manual to digital design tools facilitates a reduction in design cycle time by 30–50% and enables programmatic, accurate design workflows. The annual global market for 3D CAD software is forecast to more than double, attributed to the adoption of increasingly complex modeling and automation methods.

Foundations and Scaling of LLMs

LLMs are presented as high-capacity, transformer-based neural architectures with billions to trillions of parameters. The survey carefully discusses core concepts pertinent to CAD researchers: tokenization, context window, pretraining, fine-tuning (especially instruction-based and RLHF), alignment, and advanced prompting paradigms (e.g., Chain-of-Thought, in-context learning).

Scaling up LLMs invokes stringent prerequisites in data quality, diversity, privacy, and bias mitigation. The survey underscores that >90% of web-scraped content may be discarded during curation for optimal model performance, and that scaling laws drive the empirical improvement of LLMs as a function of compute and training data. Distributed training is identified as a critical engineering challenge, with recent models (e.g., LLaMA-3, DeepSeek-V3) utilizing tens of thousands of GPUs and sophisticated mixture-of-experts architectures.

Classification of Mainstream LLMs for CAD

The authors systematically group key LLMs into closed and open-source families:

  • Closed-source: GPT (various generations up to GPT-4.5o), PaLM and derivatives (PaLM-E, Flan-PaLM, Med-PaLM), Gemini series, Claude 3, Grok, BloombergGPT.
  • Open models: LLaMA (LLaMA, LLaMA-2, LLaMA-3/3.1/4 and many fine-tuned derivatives), DeepSeek family, Code LLaMA, Mistral, Vicuna, Alpaca, Qwen, Koala, Orca, WizardLM, MiniGPT-4, Stable Beluga, Gemma, FLM, Pythia, Baichuan2.

The proliferation of instruction-following, multimodal, mixture-of-experts, and domain-specialized models is established as a hallmark of recent progress.

Applications of LLMs in the CAD Domain

The survey's central contribution is a rigorous mapping of LLM capabilities to CAD applications, grouped into six classes:

1. Data Generation

LLMs are leveraged for synthetic dataset creation, especially textual instruction hierarchies and multimodal object descriptions. Prompts, multi-view images, and parameter sequences are paired to produce large, structured corpora. State-of-the-art works primarily utilize LLMs for textual annotation, and there is no evidence yet of direct generation of CAD images or point clouds using LLMs in this application category.

2. CAD Code Generation

LLMs pretrained on code corpora (notably Codex, GPT-4, and GPT-4V) are used for automatic generation of executable CAD scripts. Multimodal LLMs parse inputs ranging from text, sketches, and images to produce modeling macros in Python, SQL, or CAD-specific languages. Iterative refinement processes—debugging, error feedback, and program verification—are widely adopted to improve execution success and semantic fidelity. Figure 2

Figure 2: Canonical CAD code generation workflow with feedback-driven code refinement and visual similarity evaluation.

3. Parametric CAD Generation

LLMs generate parametric sequences (typically JSON or XML formats) that encode CAD construction histories. Models process multimodal inputs (text, images, point clouds) to propose symbolic or continuous parameter tuples, enabling flexible downstream parsing and controllable CAD model synthesis. Figure 3

Figure 3: Parametric CAD generation pipeline, mapping prompt and optional image features via VLM/LLM to structured parametric outputs for 3D model synthesis.

4. Image Generation

Application of LLMs for end-to-end CAD image synthesis remains nascent. One notable instance is ChatCAD—a multi-agent LLM framework for zero-shot digital drawing restoration, utilizing domain-specific Retrieval-Augmented Generation.

5. Model Evaluation

LLMs and VLMs are increasingly adopted to assess generated CAD models both visually and semantically. Evaluation tasks encompass automated scoring of visual fidelity, attributes (size, color, material), and textual description similarity—enabling scalable, human-in-the-loop and auto-feedback loops.

6. Text Generation

Text generation encompasses automatic creation of design descriptions, semantic code block comments, plan generation, and question answering. Recent research demonstrates LLMs decompose complex CAD programs into labeled semantic parts, generate photorealistic prompts, and assist in manufacturing feature recognition from multi-view images.

The survey numerically quantifies that the most frequent application of LLMs in CAD is code generation, with GPT-4o dominating tool selection despite closed-access limitations—indicative of its superior performance on CAD-centric tasks.

A crucial observation is that almost all workflows are mediated via intermediate formats (code, parametric data) rather than direct-to-model synthesis. Multimodality is pervasive, but text and image are the most exploited channels. The manufacturing sector is by far the most active domain, while sectors such as shipbuilding and textiles are underrepresented in current efforts. The authors assert there is no current evidence of direct LLM-based synthetic point cloud or image dataset generation for CAD.

Future Directions

The paper identifies several promising avenues for further research:

  • Interior and home design: LLM-driven interior design solutions exist but lack open-source 3D CAD support, motivating future work on semantically conditioned scene graph and parametric model generation.
  • Multi-format data generation: Projection towards LLMs that generate not only text and code but point clouds and full 3D models for direct use in CAD environments.
  • Automatic compliance checking: Deployment of LLMs for regulatory adherence verification in AEC workflows, with notable progress in BIM interpretation but a gap for direct 3D CAD analysis.
  • Fashion and textile CAD modeling: CAD-enabled fashion workflow automation via LLM-driven parametric design remains underexplored and is highlighted as a high-impact area for computational creativity.

Theoretical and Practical Implications

The survey consolidates the position of LLMs as universal mediators between humans and CAD systems, capable of both semantic understanding and programmatic generation. Practically, the emergence of multimodal prompt-conditioned code and parametric model generation is altering design automation, iterative optimization, and cross-domain interoperability. Theoretically, the failure to directly synthesize 3D model outputs via LLMs marks a significant research challenge, likely requiring deeper architectural and representation advances. The authors further speculate that advances in alignment and vision-language fusion (e.g., VLMs with robust spatial reasoning) will be required for fully automated CAD workflows.

Conclusion

This survey systematically analyzes the current intersection of LLMs and CAD, detailing foundational architectures, application taxonomies, observed trends, and future prospects. The most prominent contributions of LLMs to CAD are in text-centric dataset construction, code and parametric data generation informed by multimodal inputs, and automated evaluation. The survey highlights both technical challenges and untapped sectors, mapping a clear trajectory for innovation and theoretical progress in the automated design domain (2505.08137).

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 2 likes about this paper.