

# AI for Automatic HDL Code Generation from High-Level Descriptions

Md Ruhul Kuddus Saleh

Department Lippstadt 2, Advance Hardware Engineering

Hamm-Lippstadt University of Applied Sciences

md-ruhul-kuddus.saleh@stud.hshl.de

Mat.Nr.: 2219956

Prof. Dr.-Ing. Ali Hayek

Department Lippstadt 2

Hamm-Lippstadt University of Applied Sciences

Supervisor

**Abstract**—Development of a Hardware Description Language (HDL) is among the most time-consuming and expert-resource-intensive steps of FPGA and ASIC designs. With the growing complexity of digital systems and pressing demands of time-to-market, conventional manual HDL designs using the Register Transfer-Level (RTL) approach are no longer adequately scalable. However, with recent AI technology and advancements achieved through the use of large language models (LLMs) in the processing and translation of natural language descriptions, there is a new potential channel of automatic generation of HDL from a natural language description or from an algorithmic description.

The HDL design synthesis comparison between professionally developed HDL designs and designs generated with AI on a target Xilinx Artix-7 FPGA evaluates synthesis feasibility, resource usage, power performance, and possible implementation on silicon. Although the AI designs have correct functional intent and proper HDL syntax, important deficiencies are found in its support for design tools, power performance, overall system architecture understanding, and constraints on implementation. There are instances where synthesis fails or goes beyond device capabilities, making it unimplementable in hardware.

The outcome of the study clearly states that the current application of AI is more optimal in the role of an HDL code assistant, rather than being an independent hardware designer. Also, there is an imperative need for major breakthroughs in the field of device-aware training, power-efficient designs, and verification.

## I. INTRODUCTION

Moore's Law and Dennard scaling have traditionally been the pillars sustaining the semiconductor industry. However, lately, the rate of progress has gradually declined [1]. This has increased the focus on the concept of heterogeneous computing architecture, which lays the foundation using combined CPU architecture specific hardware such as GPUs, FPGAs, or ASIC [2]. This redistribution of workload on platforms aligned for the specific function helps focially maximize performance along with power savings [3].

FPGAs are considered the brightest component within the industry. This assists in the processing associated with specialized calculations [4]. However, FPGAs are complex in nature. They require knowhow in the realm of HDL programming, specifically Verilog or VHDL programming [10]. Manual processing within the handling associated with high-level algorithms in terms of generating the RTL model has always proven time-consuming and error-prone [6], [7]. Moreover, the High-Level Synthesis (HLS) tools available are often limited in handling specialized software-centric processing efficiently [8], [9].

This has sparked the idea of using AI, specifically Large Language Models (LLMs), in the processing [5]. This paper aims at exploring the feasibility associated with AI-synthesized HDL. They compare the AI-synthesized designs with the designs embedded by professionals. This paper aims at exploring the implementation of AI in the hardware description within the industry [10].

## HIGH-LEVEL DESCRIPTIONS FOR FPGA DESIGN

### tocsectionHigh-Level Descriptions for FPGA Design

High-level descriptions form the bridge between the algorithmic intention and the hardware implementation in both conventional high-level synthesis and AI-based design flows. The different descriptions vary based on the level of abstraction, expressiveness, and ability to model hardware constructs like parallelism, time, and data transport. The description approach used has a major impact on design productivity and the quality of the produced RTL.

#### A. Software-Like Languages

The traditional HLS flow relies on software-oriented programming languages like C, C++, OpenCL, and SystemC to program hardware functionality [1]. The mentioned programming languages are widely understandable and supported, although not optimal in their implementation on FPGA architectures. They are natural

and suited for software design, which hides hardware parallelizations, making developers entirely dependent on implementation-specific directives to manage parallelization and pipelining [6]. Moreover, these software-oriented design languages lack sufficient control over memory, and there are no timing-related constructs that make cycle-accurate implementation and optimization difficult [8], [10].

### *B. Domain Specific Languages*

Domain-specific languages (DSLs) such as Chisel, Bluespec, MyHDL, and PyMTL have overcome these challenges through the use of hardware-oriented abstraction constructs [2]. The mentioned languages facilitate hardwired structural description and offer parallelism as a primitive, hence offering a simpler description of a hardware design [4]. The modeling tools, such as MATLAB and Simulink, facilitate fast functional modeling, while efficient implementation, in most cases, demands optimization [9].

### *C. Behavioral vs. Functional Descriptions*

High-level descriptions can thus be categorized into Behavioral or Functional. Behavioral styles are good at explaining the behavior of algorithms, while having little architectural relevance. This results in data paths and control logic being abstracted by the synthesizer or the AI tools, which proves inefficient at times [5]. Functional styles, which include dataflow-based models, address the dependency architecture comprehensively, thus being relatively free from ambiguities in the synthesis process [3]. Hence, description style selection assumes importance in synthesizing hardware at the desired level, either using AI or the conventional approach at times [7].

## TRADITIONAL HIGH-LEVEL SYNTHESIS TECHNIQUES

### *tocsectionIV. Traditional High-Level Synthesis Techniques*

High-Level Synthesis (HLS) has also been recognized as an innovative approach for Electronic Design Automation (EDA) in bridging the increasing productivity gap in hardware design. Since HLS enables designers to model their hardware designs in high-level languages (HLLs), it automatically translates the description of an algorithm into an RTL description, thus offering faster design development time as it does not require RTL coding manually. However, the broader design space associated with high-level synthesis also brings in its own design difficulties.

### *A. Core HLS Workflow*

The translation of high-level code to hardware involves a streamlined set of steps for synthesis and optimization. Scheduling involves identifying the clock cycle during

which an operation should take place, trading off between clock cycles, throughput, and timing constraints, keeping data dependencies in mind.

Resource allocation involves specifications related to hardware resource types and quantities, which could range from arithmetic units, memory blocks, to specifying which operations and variables bind to resource instances, also taking into account resource constraints for implementation. For exploiting parallelism, optimization techniques involving unrolling and pipelining are carried out, which enable concurrent execution, hence speeding up execution by executing a number of iterations concurrently for loops. High-level data storage is then assigned to physical memory resources, which range from register memory to Block RAM (BRAM), also requiring manual partitioning to satisfy bandwidth constraints. Finally, specifying control logic through finite state machines (FSMs) for controlling datapath execution and data transfer is required for designing digital systems by controlling data transfer between different stages through FSMs that implement control flow during system execution.

### *B. Existing HLS Tools*

The HLS environment encompasses a range of tools, including proprietary tools and academic tools. The proprietary tools include Xilinx Vivado HLS (Vitis HLS) and the Intel HLS Compiler, which are designed for specific FPGA architectures. Cadence Stratus HLS and Mentor Catapult HLS enable a wider design space exploration of FPGAs and ASICs. At the same time, open-source academic tools like LegUp and ROCCC enable a flexible environment for customizing the tools. While the environment provides more insight into the design process, the tools lack the timing closure and design-specific optimization of the proprietary tools.

### *C. Limitations of Traditional HLS*

However, the existing traditional HLS tools have various drawbacks that impede their autonomy. The HLS tools tend to parse software-centric code literally and lack the capability to infer the optimal hardware architecture without the involvement of the designer. Obtaining optimal performance requires the tedious insertion of directives by the designer, which makes the exploration of the design space longer and repetitive.

The existing HLS tools tend to behave inefficiently while processing irregular control flow or dynamically varying memory access patterns, which lead to suboptimal hardware consumption and resource usage. The longer synthesis and analysis time introduces hurdles in the optimization process by leading to delayed feedback on power consumption, performance, and area. The above

hindrances open up avenues for exploring the usage of AI-based approaches that facilitate the efficient inference and optimization of architecture.

## V. AI TECHNIQUES FOR HDL CODE GENERATION

### tocsectionV. AI Techniques for HDL Code Generation

The shortcomings of conventional High-Level Synthesis (HLS), specifically the need for human optimization and the expensiveness of design space exploration, have necessitated the involvement of Artificial Intelligence (AI) in the design flow process. More advanced AI methods, based on predictive machine learning and generative Large Language Models (LLM), seek to automate the optimization process and minimize human involvement during hardware design [5], [10].

#### A. Machine Learning for Performance Prediction

These models use machine learning algorithms to approximate the Quality of Results (QoR) attributes such as area, latency, and power consumption without the need for complete synthesis [6]. These models are trained using data from past designs to provide feedback efficiently and aid the selection of pragmas for loop pipelining and memory partitioning [2], [8].

#### B. Deep Learning

Deep learning techniques make it possible to better model complex hardware graphs. Graph Neural Networks (GNNs) analyze structural graphs like Abstract Syntax Trees (ASTs) and Dataflow Graphs (DFGs), focusing on dependencies that are useful in understanding resource utilization and timing information in code [8].

Models based on the transformer neural network further enhance this analysis by finding long-term dependencies in code that are significant in determining hardware efficiency [5].

#### C. Large Language Models

Large Language Models (LLMs), namely ChatGPT, CodeLlama, and StarCoder, bring generative capabilities to hardware EDA. They have the ability to transform natural-language descriptions into synthesizable HDL and assist with code completion and refactoring [5], [9], [11]. The limitations of these approaches lie in hallucination, lack of awareness regarding timing aspects, and ignorance of specific device constraints [10], [11].

## VI. EXPERIMENTAL ANALYSIS AND RESULTS

### tocsectionVI. Experimental Analysis and Results

For a practical estimate of the viability of using AI-based hardware descriptions, a comparative experimental test was done on professionally designed “golden reference” designs and HDL generated using state-of-the-art large

language models. The test was done on a Xilinx Artix-7 FPGA technology using the Vivado 2025.1 design tools. The memory inference HDL was tested using ChatGPT-5.2, while the more complex control HDL was generated using Claude Sonnet 4.5.

#### A. Experiment 1: Memory Inference Using ChatGPT-5.2

The initial experiment tested if it was possible for an AI model to deduce a correct dual-port BRAM and to implement conventional power management techniques, such as clock enables. The reference design used was that of a professional reference design taken from a Xilinx memory template.

TABLE I: BRAM Resource Utilization and Workflow Outcome

| Metric           | Professional (Xilinx Template) | AI-Generated (ChatGPT-5.2) | Difference       |
|------------------|--------------------------------|----------------------------|------------------|
| Block RAM Tiles  | 1 (1.00%)                      | 1 (1.00%)                  | 0%               |
| Bonded IOB       | 158                            | 152                        | -3.8%            |
| Synthesis Status | Pass                           | Fail(Syntax Error)         | Workflow Failure |



(a) Professional BRAM Utilization



(b) AI BRAM Utilization

Fig. 1: Utilization reports for BRAM inference showing identical memory resource usage.

ChatGPT-5.2 was able to correctly infer the RAMB36E1 primitive. Unfortunately, there was an issue with the tool flow compatibility during the synthesis step. ChatGPT-5.2 was able to produce SystemVerilog language constructs while assigning the Verilog-2001 file extension. This resulted in the design being rejected by the Vivado compiler because of the improper syntax.



Fig. 2: Vivado synthesis failure screenshot showing the syntax errors in the AI-generated HDL.

There was a more critical difference between the power analysis. Even though both designs used the same memory resources, the AI-designed architecture used nearly twice the on-chip power.

TABLE II: Power Consumption Comparison for BRAM Design

| Power Component            | Professional    | AI-Generated    | Increase    |
|----------------------------|-----------------|-----------------|-------------|
| BRAM Power                 | 0.160 W         | 0.321 W         | +100% (2x)  |
| I/O Power                  | 30.726 W        | 61.414 W        | +100% (2x)  |
| <b>Total On-Chip Power</b> | <b>32.562 W</b> | <b>64.294 W</b> | <b>+97%</b> |



(a) Professional Power Report



(b) AI-Generated Power Report

Fig. 3: Comparison of on-chip power consumption; the AI design (b) exceeds junction temperature limits.

The professionally designed code incorporated clock-enabling signals (ena/enb) that disabled the BRAM during idling cycles. The generated code lacked the enabling signals, which caused the memory and I/O to run continuously. The effect caused the dynamic power consumption to increase by almost twice the original amount, thus lacking power consumption design reasoning.

#### B. Experiment 2: Generation of Control Logic Using Claude Sonnet 4.5

The second experiment involved the generation of a GPIO controller with a masked write capability. This was an exercise in understanding system-level interfaces as well as realizing efficient hardware-level parallelism. The original design was based on an OpenTitan project.

TABLE III: Logic Resource and Physical Constraint Utilization

| Metric            | Professional (OpenTitan) | AI-Generated (Claude 4.5) | Status               |
|-------------------|--------------------------|---------------------------|----------------------|
| Slice LUTs        | 68                       | 193                       | 2.8× Increase        |
| Slice Registers   | 160                      | 192                       | 1.2× Increase        |
| Bonded IOB        | 203                      | 521                       | <b>FAILED (148%)</b> |
| Max IOB Available | 210                      | 210                       | —                    |
| Total Leaf Cells  | 434                      | 908                       | <b>2.1x Area</b>     |



Fig. 4: Utilization reports highlighting the massive increase in Bonded IOB and LUTs in the AI-generated design.

The hardware unimplementability of the design created by the AI involved many I/O pins. In contrast to the professional design approach, the professional design connected the configuration registers using an internal bus. The design created by the AI made many internal signals available through the top-level port. This led to a hardware need for 521 I/O pins, which was above the hardware capability by 148%.

Moreover, the AI implementation utilized software-style “for loops” for the masked writes. Although correct in functionality, the loops were unrolled during the synthesis process, leading to an abundance of combinational logic. In contrast, the professional implementation utilized a single-cycle approach with bitwise masking.

TABLE IV: Power and Thermal Analysis Results

| Metric           | Professional<br>(OpenTitan) | AI-Generated<br>Code | Difference /<br>Status |
|------------------|-----------------------------|----------------------|------------------------|
| Logic Power      | 0.196 W                     | 0.595 W              | +203% (3x)             |
| I/O Power        | 13.847 W                    | 30.232 W             | +118%                  |
| Total            | 15.133 W                    | 32.829 W             | +117%                  |
| On-Chip<br>Power |                             |                      |                        |
| Junction<br>Temp | 94.0 °C (Safe)              | 125.0 °C<br>(Failed) | DESTRUCTIVE            |



(a) Professional Logic Power



(b) AI Logic Power

Fig. 5: Thermal analysis showing logic power and junction temperature deltas.

The generated design was capable of attaining the highest permissible junction temperature of 125°C, which would activate the thermal shutdown in the actual design, thus making it incapable of being manufactured.

### C. Summary of Key Observations

The results obtained from the experiments have shown that there are three basic challenges associated with the use of the AI-HDL. First, the AI model has a lack of system architecture awareness, causing unmanageable I/O. Secondly, the use of other power-aware design methodologies, such as clock gating, contributes to an increase in the net dynamic power. Finally, the lack of software design methodologies, which includes clock gating, causes a major imbalance in logic usage and resultant thermal characteristics.

## VII. TOOLS AND PROTOTYPES AVAILABLE

### tocsectionVII. Tools and Prototypes Already Available

AI-driven hardware design is beginning to move from research into early commercial deployment, promising

to reduce manual effort in traditional EDA design flows and increase efficiency [2], [5].

Graph Neural Networks used by academic prototypes, such as HARP, have already predicted optimization impacts and accelerated design space exploration [6]. Chip-Chat and Autochip provide iterative improvements to HDL code quality using LLM-based generators [5], [10]. Hybrid flows use neural cost models to enable near-real-time timing and area estimation without full synthesis [8].

**Commercial Tools:** AI is being introduced by vendors for managing difficult SoC designs. Synopsys DSO.ai uses reinforcement learning to automatically perform synthesis and floorplanning. Cadence Cerebrus is an AI-driven parameter tuner that seeks the optimal PPA; Siemens EDA uses AI-assisted verification for speed in bug detection. These commercial tools enhance, instead of replacing, human designers [5].

**Open-source Initiatives:** These include datasets like VHDL-Eval and VHDL-Xform, providing support for HDL-specific LLMs and neural cost models, especially for underrepresented languages [9], [4]. Community-driven tools focus on automated refactoring of AI-generated HDL with the goal of improving readability and synthesizability [10].

## VIII. BENEFITS AND CHALLENGES

### Benefits

- **Development Speed:** AI-driven flows speed up development with automatic pragma insertion and parameter tuning [8].
- **Accessibility:** Rapid prototyping is thus possible even for non-experts [1], [10].
- **Consistency:** AI-assisted refactoring and test generation may be used under expert supervision to boost consistency and overall Quality of Results (QoR) [11].

### Challenges

- **Synthesizability:** AI-generated HDL may include non-synthesizable constructs and thus require human review [10].
- **Data Scarcity:** The training data of limited size, especially for VHDL, restricts the generalization capability of the model [9].
- **Physical Constraints:** Most AI models do not consider timing, power, and physical constraints, resulting in syntactically correct but physically infeasible designs [11].

- **Reliability:** The integration of probabilistic AI into deterministic EDA flows raises reliability and verification issues, highlighting the need for human oversight [5], [7].

## CONCLUSION

### Conclusion

The project demonstrates the beginnings of the useful role of AI models, such as ChatGPT or Claude, as a “code assistant” in the design of hardware, inferring primitives effectively and accelerating the prototyping stage. In their present form, they demonstrate the lack of system-level architectural insight, leading to the generation of software-ish logic that suffers from inefficiency and, thus, critical physical hazards, including overly heavy I/O and doubled power.

The neglect of important power management strategies, including clock gating, reiterates the incompleteness of these tools vis-à-vis manufacturability and thermal robustness. Nonetheless, the use of AI-flows, backed by academic efforts, industrial tools, and open-source projects, allows the amalgamation of AI training data that includes a device perspective, along with online EDA feedback, to fill the gap between the syntax of the model and the hardware that needs to be built.

## REFERENCES

- [1] W. Meeus, K. Van Beeck, T. Goedemé, J. Meel, and D. Stroobandt, “An overview of today’s high-level synthesis tools,” *Design Automation for Embedded Systems*, vol. 16, no. 3, pp. 31–51, Aug. 2012.
- [2] M. W. Numan, B. J. Phillips, G. S. PudDY, and K. Falkner, “Towards Automatic High-Level Code Deployment on Reconfigurable Platforms: A Survey of High-Level Synthesis Tools and Toolchains,” *IEEE Access*, vol. 8, pp. 174692–174722, 2020.
- [3] R. Nane et al., “A Survey and Evaluation of FPGA High-Level Synthesis Tools,” *IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems*, vol. 35, no. 10, pp. 1591–1604, Oct. 2016.
- [4] J. Marjanovic, “LOW VS HIGH LEVEL PROGRAMMING FOR FPGA,” in *Proc. IBIC2018*, Shanghai, China, 2018, pp. 1–7.
- [5] S. Alsager, S. Alajmi, I. Ahmad, and M. Alfailakawi, “The potential of LLMs in hardware design,” *Journal of Engineering Research*, vol. 13, no. 1, pp. 2392–2404, 2025.
- [6] Z. Liu, Y. Dou, J. Jiang, and J. Xu, “Automatic Code Generation of Convolutional Neural Networks in FPGA Implementation,” in *Proc. 2016 IEEE International Conference on Parallel and Distributed Processing with Applications*, Changsha, China, 2016, pp. 1–8.
- [7] M. Dossis and G. Dimitriou, “Are HLS Tools Healthy? The C-Cubed Project,” *Engineering, Technology & Applied Science Research*, vol. 5, no. 2, pp. 790–794, 2015.
- [8] Y. Bai et al., “Learning to Compare Hardware Designs for High-Level Synthesis,” in *Proc. 2024 ACM/IEEE International Symposium on Machine Learning for CAD (MLCAD ’24)*, Salt Lake City, UT, USA, Sep. 2024, pp. 1–7.

- [9] P. Vijayaraghavan et al., "Chain-of-Descriptions: Improving Code LLMs for VHDL Code Generation and Summarization," in *Proc. 2024 ACM/IEEE International Symposium on Machine Learning for CAD (MLCAD '24)*, Salt Lake City, UT, USA, Sep. 2024, pp. 1-10.
- [10] R. Al Amin, M. G. R. Lincoln, and R. Obermaisser, "Towards LLM-Assisted HDL Generation and Verification," in *Proc. 2025 14th Mediterranean Conference on Embedded Computing (MECO)*, Budva, Montenegro, Jun. 2025, pp. 1-4.
- [11] S. Thakur et al., "Autochip: Automating hdl generation using llm feedback," *arXiv preprint arXiv:2311.04887*, 2023.