

# CFIRSTNET: Comprehensive Features for Static IR Drop Estimation with Neural Network

Yu-Tung Liu\*

tonyliu.ee09@nycu.edu.tw

Department of Electronics and Electrical Engineering  
National Yang Ming Chiao Tung University  
Hsinchu, Taiwan

Shao-Yu Wu

sean31.ee09@nycu.edu.tw

Department of Electronics and Electrical Engineering  
National Yang Ming Chiao Tung University  
Hsinchu, Taiwan

Yu-Hao Cheng\*

jason.ee09@nycu.edu.tw

Department of Electronics and Electrical Engineering  
National Yang Ming Chiao Tung University  
Hsinchu, Taiwan

Hung-Ming Chen

hmchen@nycu.edu.tw

Institute of Electronics  
National Yang Ming Chiao Tung University  
Hsinchu, Taiwan

## ABSTRACT

IR drop estimation is now considered a first-order metric due to the concern about reliability and performance in modern electronic products. Since traditional solution involves lengthy iteration and simulation flow, how to achieve fast yet accurate estimation has become an essential demand. In this work, with the help of modern AI acceleration techniques, we propose a comprehensive solution to combine both the advantages of image-based and netlist-based features in neural network framework and obtain high-quality IR drop prediction very effectively in modern designs. A customized convolutional neural network (CNN) is developed to extract PDN features and make static IR drop estimations. Trained and evaluated with the open-source dataset, experiment results show that we have obtained the best quality in the benchmark on the problem of IR drop estimation in ICCAD CAD Contest 2023, proving the effectiveness of this important design topic.

## 1 INTRODUCTION

Analyzing the on-chip power delivery network (PDN) is critical to the modern integrated circuit (IC) design flow. Figure 1 is an illustration of a typical PDN. PDNs could be modeled as a network of resistance connecting the power pads (C4 bumps) and the standard cells. Static IR drop verification is an imperative step in PDN analysis. With the advance of technology, the IR drop issue occurs at the lower nodes leading to performance degradation. Moreover, excessive IR drop might also cause functional failure. Chip designers are supposed to have IR drop sign-off before the tape-out process and ensure to meet the IR drop constraint. If there is any IR drop violation, designers might need to perform the optimization step such as Engineer Change Order (ECO) to reorganize the circuit and attempt to resolve the problem.

However, during every iteration of ECO, EDA tools will assess the circuit repeatedly, i.e. the worst-case voltage drop of the circuit will be calculated. The conventional method to retrieve the voltage of every node was solving the linear equation (1) where  $G$  was a conductance matrix,  $V$  the target voltage vector to be solved, and  $J$  the vector of independent current sources. The number of nodes surged as the amount of cells used in modern design increased



Figure 1: Illustration of a typical PDN structure: a resistive network between the power sources (C4 bumps) and the standard cells.

tremendously. The considerable computational time of the conventional nodal analysis method was so expensive that it became a costly overhead. Therefore, several methods were proposed to trade accuracy for speed and replace the traditional solution with a faster and more accurate IR drop estimation.

$$GV = J \quad (1)$$

While previous works have proposed various approaches to trade accuracy with speed, they all faced limitations. Non-machine learning methods were swamped by complex circuits with hefty amounts of nodes. On the other hand, image-based or netlist-based machine learning (ML) methods focused on certain circuit features solely. We propose CFIRSTNET, a CNN approach along with comprehensive feature extraction to speed up the static IR drop estimation problem with minimal accuracy trade-off. In addition to the provided image-based features, netlist-based inputs are analyzed with KVL, KCL, and superposition in the feature extraction stage to model the electrical features of the PDN. The custom CNN model takes image-based features and netlist-based features for input and generates an IR drop prediction map with reasonable scaling, providing more accurate and more efficient estimations. Tested by the ICCAD CAD Contest 2023 open-source benchmark, CFIRSTNET mitigates the overhead of expensive computational resources and provides lower estimation errors in comparison to prior arts. This work could be

\*Both authors contributed equally to this research.

further implemented in modern design flow and relieve the pain of heavy computational expenses.

The main contributions of this paper are listed as follows:

- CFIRSTNET harnesses the benefit of both image-based and netlist-based inputs and introduces a customized CNN model to perform static IR drop estimation. To the best of our knowledge, this is the first work regarding this combination as CNN model inputs that preserve circuit features while ensuring efficiency.
- We propose the Hypothetical IR Drop Distillation that analyzes the power grid from the SPICE-based data in linear time and provides more comprehensive features for the customized CNN simultaneously. Our approach is independent of fabrication technologies, circuit designs (with different current source and voltage source distributions), and PDN structures.
- The proposed model is scalable due to the properties of CNNs. CFIRSTNET has been tested with various chip sizes and has exhibited strong estimation results. The performance of CFIRSTNET is not affected by macros or irregular PDN arrangement. Compared to 1st place winner of ICCAD CAD Contest 2023, CFIRSTNET reduces average error by 60%, max error by 66%, increases F1 score by 59%, and achieves a 21.8x speedup.

The remainder of this paper is organized as follows. Preliminaries to this problem are discussed in section 2. The details of CFIRSTNET will be elaborated in section 3 and 4. Section 5 and 6 present the experimental setup and results. Finally, Section 7 concludes the paper and discusses the possibilities of exploring future works.

## 2 PRELIMINARIES

Previously proposed methods introduced non-ML methods by modeling the whole power grid and finding the solution to Equation (1). Panda et al.[18] provided a modeling way of resistance-inductance-capacitance (RLC) circuits. Moreover, Panda et al.[18] and Zhao et al.[24] ignored the influence of the inductance in lower layers due to the small number compared to resistance and capacitance. This simplicity made some matrices symmetric and positive definite, which enabled Cholesky factorization to speed up the works.

When it comes to tens or even hundreds of millions of nodes in the circuit, flattened circuit models require tremendous memory capacity and computational resources. Zhao et al.[24] proposed a hierarchy method to lower the memory cost, and speed up the circuit simulation. They generated micromodels for local grids and simulated the whole network. Kozhaya et al.[1] assumed that the sink resistance is larger than the grid resistance, which bridged the gap of the voltage drop between nearby grids. Hence, they used a more coarse structure to process the PDN and accelerate the framework by mapping out the solution with interoperation. The methods mentioned above traded off the accuracy to the speed. Other works such as Qian et al.[20] took the hierarchical random-walk approach. However, the run time of these simulated methods was still prone to the increased node number.

Recently, many ML methods have cast light on the EDA problems, few of which explored the static IR drop estimation problem. Compared with the traditional simulation-based methods, the runtime

of ML methods is not affected by the number of circuit nodes. Large circuits could be processed and the predictions could be made in an unprecedentedly short amount of time. The ML models could be classified into two categories according to the selection of input features. Some took the circuit netlist as input and decision-tree models were introduced[3][10][12][19]. Others formulated the problem as an IR drop map prediction model[5][10][23][26]. Moreover, previous ML-based IR drop prediction methods focused on static and dynamic IR drop estimation[3][8][10]. The two categories were based on different problem settings that dynamic IR drop estimation captures the peak transient current values based on switching activities, which is not available in open-source datasets.

Previous ML works could be separated by the use of input data. Fang et al.[10] involved CNN to perform feature extraction and make predictions with XGBoost[4]. Ho et al.[12] assessed the circuit netlist and the technology libraries and extracted more features with superposition and partition. However, the feature extraction algorithms took great time complexity. On the other hand, Chhabria et al.[5] took advantage of the CNNs with image-based inputs. Unfortunately, the resolution of the proposed image-based data for the experiment was too coarse to indicate the actual IR drop values, which will be discussed in section 5.

To unleash the possibilities of ML methods in static IR drop estimation problems, substantial training data is necessary. Though the approaches mentioned above have achieved the best performance with their restricted evaluation data points, open-source benchmark data is crucial to make a direct comparison among all methods. However, publicly available real PDN circuit netlists are limited due to the confidentiality agreement and some constraints from EDA vendors. Kadagala et al.[13] and Chhabria et al.[6] noticed this lack of an open-source dataset and proposed the dataset consists of real and fake circuits. Chhabria et al.[6] address this issue with Generative Adversarial Networks (GANs) connected with OpeNPDN[7] to generate current maps and power girds with netlist form and release thousands of generated data in different technologies. In our work, we collect PDN circuit data from [13] and [6] under the open-source NanGate 45nm technology[2] for our experiments.

### Problem Formulation

**Given:** Circuit netlist in SPICE format, current map, effective distance to voltage source map, and PDN density map.

**Train:** Extract comprehensive circuit features to train a custom CNN model to make static IR drop estimations.

**Output:** Static IR drop estimation map that matches the size of the ground truth.

## 3 COMPREHENSIVE FEATURES

CFIRSTNET combines the best of the netlist-based and image-based features. An overview of the CFIRSTNET prediction flow and the roles of the feature maps is illustrated in Figure 2. The current map, PDN density map, and effective distance to voltage source map are provided in the open-source benchmark[13]. Additionally, we propose augmented feature maps such as the wire resistance maps, the resistive distance maps, and the hypothetical IR drop maps extracted from the SPICE netlist to provide more information to our custom CNN. The comprehensive feature maps will be



**Figure 2: Extracted features and the CFIRSTNET prediction flow:** Along with the provided image-based data, CFIRSTNET extracts augmented features from the netlist-based data and processed by the custom CNN.

processed by the encoder-decoder CNN. The encoder is comprised of a modified ConvNeXtV2[22] encoder block, while the decoder includes a custom feature pyramid network (FPN) and an IR drop reconstruction module. The output IR drop prediction map will be generated from CFIRSTNET and evaluated by the open-source benchmark.

The additional feature maps represent the electrical and geometric features of the power delivery network. They are collected as feature maps that could be concatenated with the provided image-based features as the model inputs. The following sections will elaborate on the comprehensive model input features.

### 3.1 Provided Image-Based Maps

The benchmark dataset[13] used in this work provides three image-based data: a current map, an effective distance to voltage source map, and a PDN density map. The current map provides the power-consuming hotspots and the effective distance to voltage source map indicates the distance to the power sources. Notably, the benchmark dataset in [13] is comprised of regular and irregular PDN designs. Therefore, the PDN density map offers the wire density information of every part of the circuit.

### 3.2 Wire Resistance Maps

The challenge of estimating IR drop in ICs hinges on understanding the interplay between the power source, power-consuming cells, and the underlying power grid. In essence, the distribution of the power grid and the actual power consumption significantly impact the IR drop. Previous research[5][10][23][26] has leveraged the localization and feature extraction capabilities of CNNs to pinpoint the power source, delineate the power grid, and quantify current consumption. These efforts have primarily focused on 2D feature maps for image-to-image IR drop prediction networks.

However, when dealing with complex ICs, which are inherently 3D structures with multiple metal layers connecting transistors and I/O pins, relying solely on 2D feature maps may overlook crucial details. For instance, the intricate arrangement of different metal layers and the vias (connections) between them demands a more nuanced approach.

CFIRSTNET models the power grid with multiple-layer resistance maps derived from the spice netlist. By concatenating these maps with other relevant features, we aim to capture finer-scale information such as the precise distribution of the power grid and the positions of vias for our subsequent CNN model. This approach promises a more accurate representation of the complex 3D circuitry, enhancing our ability to predict IR drop effectively.

### 3.3 Resistive Distance Maps

When the distance from a cell, a power sink, to the power source, is considerable, it might require longer metals of different layers to connect the cell to the power source, and longer metals indicate larger resistance, which will lead to a larger IR drop. To model this property, we calculate the shortest Manhattan distance from every part of the power delivery network to every metal wire and via point. Furthermore, we collect this information and concatenate it with other feature maps.

### 3.4 Hypothetical IR Drop Distillation

The IR drop of each power node is greatly affected by nearby nodes and power pads due to the local characteristics of PDNs. Traditional IR drop solvers[1][24] often partition the whole PDN into several smaller circuit segments to reduce complexity. Meanwhile, an increased error of IR drop value might be found after partitioning since the PDN is not considered as a whole. To mitigate this issue,

previous work [25] introduced iterative methods, which will impact the overall runtime considerably.

CFIRSTNET introduces an innovative methodology that partitions the PDN into smaller segments with some additional assumptions. The main idea is that these hypothetical IR drops could be hints for the customized CNN model to make more precise estimations. The hypothetical IR drop values of these circuit segments can be retrieved in linear time complexity. Furthermore, these hypothetical IR drops will be part of our model input features.

**3.4.1 Localized IR Drop.** First of all, PDNs consist of multiple metal layers connected with vias. Each layer could be viewed as a plane with plentiful metal stripes. Due to preferred metal wire direction, there is no interconnection between each stripe. Thus, we narrow our scope to a single metal stripe with wire resistances, current sources, and voltage sources.

The computation of localized IR drops hinges on the assumption that vias connecting to the upper layer function as voltage sources and each has the value of VDD, while those linking to the lower layer serve as current sources. The computation of the localized IR drops and current of each instance starts from the lowermost layer. The retrieved current to the upper vias will be the current sources in the layer above. The current source at node  $n$  is defined as  $I_n$ , and the voltage as  $V_n$ . Node  $n$  is the  $n_{th}$  node on the N-node stripe, and the wire resistance between node  $m$  and node  $n$  is represented by  $r_{m,n}$ .

Furthermore, we can analyze the circuit separately by superposition. As shown in Figure 3, each smaller circuit segment contains one or two voltage sources.



**Figure 3: Metal stripe and circuit segments.**

The equivalent resistance  $R_n$  between node  $n$  and voltage sources could be obtained by Equation (2), where  $P$  is the set of voltage sources in the circuit segment.

$$R_n = \frac{1}{\sum_{m \in P} \frac{1}{r_{m,n}}} \quad (2)$$

With superposition, each current source  $I_n$  could be evaluated independently as well. The current  $i_{n,(k,l)}$  caused by  $I_n$  from node  $k$  to  $l$  could be calculated by Equation (3).

$$i_{n,(k,l)} = \begin{cases} I_n \frac{R_n}{r_{m,n}}, & \text{if } k \text{ and } l \text{ are between } m \text{ and } n \\ 0, & \text{otherwise} \end{cases} \quad (3)$$

The current  $\tilde{i}_{k,l}$  response with all current sources in effect between each node could be sum algebraically in (4).

$$\tilde{i}_{k,l} = \sum_{n=1}^N i_{n,(k,l)} \quad (4)$$

According to KVL, the voltage drop from node  $n$  to  $n+1$  is represented in Equation (5).

$$V_{n+1} = V_n - \tilde{i}_{n,n+1} r_{n,n+1} \quad (5)$$

Equation (5) can be reformulated as Equation (6) after applied recursively, where  $p \in P$  and  $IR_{pn} = \sum_{k=p}^{n-1} \tilde{i}_{k,k+1} r_{k,k+1}$  is the localized IR drop voltage. Consequently, the voltage response on node  $n$  is presented in Equation (6).

$$V_n = V_p - IR_{pn} \quad (6)$$

The current  $\hat{I}_n$  on voltage source  $n$ , could be obtained by Equation (7), where  $I_n$  is the current source on node  $n$ .

$$\hat{I}_n = I_n + \tilde{i}_{n,n-1} + \tilde{i}_{n,n+1} \quad (7)$$

Eventually, the IR drop  $IR_{via,n}$  across via  $n$  is expressed in Equation (8), and the IR drop on every node is acquired. We could pass these properties to the next stage and retrieve the overall circuit feature.

$$IR_{via,n} = \hat{I}_n R_{via,n} \quad (8)$$

**3.4.2 Hypothetical IR Drop.** Upon analyzing each metal layer, we perceive the PDN as a cohesive unit. Our primary goal is to determine the IR drop specifically on the cell level. Having already obtained the localized IR drop values for each node of the circuit individually, we aim to figure out how these cumulative IR drops from each metal layer impact the base layer. Starting from the top layer, consider the IR drop at each node associated with a via. These IR drops effectively act as voltage sources for the subsequent layer. According to KCL, the aggregate current at node  $i$  of metal stripe  $s$  in layer  $m$  must be zero (as shown in Equation (9)), where  $V_{m,s,i}$  represents the voltage on node  $i$  of metal stripe  $s$  in layer  $m$  and  $R_{m,s,i,k}$  the resistance between node  $i$  and  $k$  on stripe  $s$  of metal  $m$ .

$$\sum_{k \in N_p} \frac{V_{m,s,i} - V_{m,s,k}}{R_{m,s,i,k}} = 0 \quad (9)$$

Consequently, Equation (9) could be elaborated as Equation (10) by means of voltage division.

$$V_{m,s,i} = \sum_{k \in N_p} V_{m,s,k} \frac{\frac{1}{R_{m,s,i,k}}}{\sum_{j \in N_p} \frac{1}{R_{m,s,i,j}}} \quad (10)$$



**Figure 4:** The detailed illustration of custom CNN model architecture in Figure 2. CFIRSTNET extracts the latent representation of the PDN from the model input features in the encoder stage (purple), aggregates features of various resolutions with the FPN (blue), and generates a high-resolution IR drop estimation map with the custom reconstructor.

Eventually, we plot the hypothetical IR drop, which is the effect of each metal layer on the cell level, separately. Applying the one-dimensional interpolation on two axes could foster the hypothetical IR drop map. As a result, the generated feature could be transferred to the intrinsic model input.

## 4 MODEL ARCHITECTURE

CFIRSTNET includes a CNN with an encoder-decoder architecture, commonly used in computer vision tasks. Unlike traditional symmetric CNNs, CFIRSTNET features a customized encoder and decoder, specifically designed for the problem at hand. The tailored encoder extracts both shallow local features and deep global features. Moreover, the amalgamative decoder merges features from multiple resolutions, combining global features with local features. Figure 4 is a visual representation of our model. CFIRSTNET extracts the latent representation of the PDN from the model input features in the encoder stage (purple), aggregates features of various resolutions with the FPN [14], and generates a high-resolution IR drop estimation map with the custom reconstructor. The unique design of CFIRSTNET ensures efficient feature extraction and resolution adaptation.

### 4.1 ConvNeXtV2 Inherited Encoder

We aim to derive a hierarchical feature maps that captures the global representation of individual IR drop values across various scales. Unlike image classification tasks, which often benefit from extensive annotated datasets, the scarcity of large annotated datasets for IR drop poses a challenge. To address this, we opt for ConvNeXtV2[22], an enhanced variant of the original ConvNeXt architecture[17]. ConvNeXtV2 is specifically chosen for its ability to extract 2-D spatial information effectively. Its inductive bias towards convolution helps prevent overfitting, and its large receptive field ensures robust global representation. In the context of static IR drop estimation, it is essential to analyze the circuit using small segments to capture fine-grained local features. Simultaneously, we must consider the entire circuit hierarchy through global representations. The CFIRSTNET encoder comprises three major working blocks: coordinate encoding, local feature extraction, and global feature extraction.

**4.1.1 Coordinate Encoding.** A typical CNN exhibits translation invariance, meaning that its output remains consistent regardless of the spatial position of input objects. However, in the context of PDN synthesis patterns used in EDA tools, the issue of IR drop becomes highly relevant to the placement of circuit components. To

address this issue, we introduce a coordinate encoding layer[15] in CFIRSTNET prior to the feature extraction stage. This layer encodes translation information related to the input data and concatenates it as augmented input channels. Consequently, CFIRSTNET effectively maps the spatial position of each pixel.

**4.1.2 Local Feature Extraction.** We employ a standard convolutional layer with a 3x3 kernel size to extract local features. This layer provides CFIRSTNET with the initial latent representation of the input circuit.

**4.1.3 Global Feature Extraction.** ConvNeXt[17] is a modernized ConvNet for image classification and segmentation. It combines large receptive fields with scalability, drawing inspiration from Vision and Swin Transformers[9][16]. At the macro design level, ConvNeXt follows Swin Transformer's stage ratio and patchify stem design. For block design, it increases the kernel size to 7x7 (compared to original ResNet[11]) while using depthwise convolution to maintain FLOPS. The inverted bottleneck design from Transformers transforms the block design into a depthwise convolution layer followed by a multilayer perceptron (MLP) with an expansion ratio of 4. At the micro design level, ConvNeXt employs GELU activation and Layer Normalization (LN). ConvNeXtV2[22] enhances this by adding a Global Response Normalization (GRN) layer after the GELU activation function and utilizing Fully Convolutional Masked Autoencoder (FCMAE) pre-training for improved performance compared to supervised pre-training.

## 4.2 Feature Fusion Decoder

To fuse features across different hierarchies and capture fine-grained details of IR drop values, we design our decoder in two parts: a FPN[14] and an IR drop reconstruction module. The FPN enables feature fusion at various levels, while the IR drop reconstruction module reconstructs the IR drop value using the fused features.

**4.2.1 Feature Pyramid Network.** The FPN in CFIRSTNET leverages multi-level features obtained from the backbone. These features are computed at several scales with a scaling step of 2. The bottom-up pathway computes a feature hierarchy, resulting in feature maps at different levels. FPN then transforms features from each level into a unified latent space using convolution and upsampling. This process ensures that latent features maintain their original sizing. The top-down pathway enhances spatially coarser, but semantically stronger, feature maps from higher pyramid levels. The result is a set of proportionally sized feature maps at multiple levels. In our custom FPN design, we input both local features and fine-grained and global features. FPN acts as a generic solution, independent of the backbone convolutional architectures, and provides fused features for IR drop map reconstruction.

**4.2.2 IR Drop Reconstruction.** We develop the IR drop reconstruction module by employing a series of four cascaded depth-wise convolutions with a 7x7 kernel size, complemented by LN. The activation function layers utilize GELU, and we establish a residual connection from the base layer of the FPN. Ultimately, a 3x3 convolutional layer produces the final IR drop prediction map. This module design maintains the crucial fine-grained feature encoding necessary for generating high-resolution IR drop maps.

**Table 1: Detail Information of PDNs for Testing.**

| Instance   | With macros | Chip size (mm <sup>2</sup> ) | PDN type  |
|------------|-------------|------------------------------|-----------|
| testcase7  | O           | 0.361                        | irregular |
| testcase8  |             |                              | regular   |
| testcase9  | O           | 0.697                        | irregular |
| testcase10 |             |                              | regular   |
| testcase13 | X           | 0.066                        | irregular |
| testcase14 |             |                              | regular   |
| testcase15 | X           | 0.239                        | irregular |
| testcase16 |             |                              | regular   |
| testcase19 | O           | 0.757                        | irregular |
| testcase20 |             |                              | regular   |



**Figure 5: Comparison between IR drop maps under different resolutions. A lower resolution (12μm x 12μm) can not represent the actual IR drop values.**

## 5 EXPERIMENT

### 5.1 Experiment Setup

CFIRSTNET is implemented with C++, Python3, and PyTorch. All evaluation is conducted on an Intel Xeon E3-1230 v6 CPU server with an NVIDIA Quadro P4000 graphic card. The implementation details can be found at <https://github.com/jason122490/CFIRSTNET>.

During the training process, a cached dataset is involved in the framework to avoid redundant feature map generation stages. Our selected loss functions are the root-mean-square error (RMSE) and the Dice Loss[21]. This technique guides the model to minimize the error and find the hotspot accurately. Ultimately, the model with the lowest validation loss is saved for testing. On the other hand, all features are resized or generated in the size of 384 x 384 pixels, and z-score normalization is applied to the input features and ground truth.

The performance of CFIRSTNET is evaluated by an open-source IR drop estimation benchmark[13]. The dataset consists of both irregular PDNs and regular PDNs, including 6100 synthetic data points from BeGAN[6] and 20 real circuits from the OpenROAD project[2]. Each of them comes with a current map, a PDN density map, an effective distance map, an IR drop map, and a PDN netlist. Furthermore, the dataset is split into a training set, a validation set, and a testing set. To ensure the feasibility of CFIRSTNET on

**Table 2: Overall performance for IR drop estimation.**

| Instance   | HSPICE             |                    |                |                  | IREDGe            |                   |          |                  | 1st Place of CAD Contest |                    |              |                  | CFIRSTNET          |                    |              |                  |                        |
|------------|--------------------|--------------------|----------------|------------------|-------------------|-------------------|----------|------------------|--------------------------|--------------------|--------------|------------------|--------------------|--------------------|--------------|------------------|------------------------|
|            | $IR_{avg}$<br>(mV) | $IR_{max}$<br>(mV) | Hotspot<br>(%) | Runtime<br>(sec) | $e_{avg}$<br>(mV) | $e_{max}$<br>(mV) | F1 Score | runtime<br>(sec) | $IR_{avg}$<br>(mV)       | $IR_{max}$<br>(mV) | F1 Score     | Runtime<br>(sec) | $IR_{avg}$<br>(mV) | $IR_{max}$<br>(mV) | F1 Score     | Runtime<br>(sec) | Speedup<br>(vs HSPICE) |
| testcase7  | 1.0439             | 4.3045             | 0.378          | 8.94             | 0.6218            | 1.2305            | 0.142    | <b>0.150</b>     | 0.0656                   | 1.2115             | 0.783        | 7.996            | <b>0.0177</b>      | <b>0.3458</b>      | <b>0.923</b> | 0.366            | 24.43x                 |
| testcase8  | 1.5540             | 4.8994             | 0.535          | 8.86             | 0.3845            | 2.2659            | 0.419    | <b>0.149</b>     | 0.0815                   | 1.0416             | 0.816        | 8.396            | <b>0.0257</b>      | <b>0.5214</b>      | <b>0.916</b> | 0.367            | 24.14x                 |
| testcase9  | 1.1811             | 3.7932             | 0.034          | 21.07            | 0.4538            | 1.2780            | 0        | <b>0.264</b>     | 0.0406                   | 0.8755             | <b>0.589</b> | 11.417           | <b>0.0278</b>      | <b>0.4664</b>      | 0.526        | 0.572            | 36.84x                 |
| testcase10 | 1.9483             | 4.5327             | 0.086          | 17.53            | 0.2426            | 1.2721            | 0        | <b>0.262</b>     | 0.0659                   | 0.8547             | <b>0.532</b> | 11.270           | <b>0.0459</b>      | <b>0.5741</b>      | 0.464        | 0.551            | 31.81x                 |
| testcase13 | 2.2638             | 10.5650            | 0.080          | 3.22             | 0.2441            | 3.8389            | 0        | <b>0.033</b>     | 0.2068                   | 7.2341             | 0            | 5.452            | <b>0.0774</b>      | <b>2.1299</b>      | <b>0.680</b> | 0.204            | 15.78x                 |
| testcase14 | 3.2298             | 13.1495            | 0.088          | 2.81             | 0.3138            | 5.5321            | 0        | <b>0.033</b>     | 0.4215                   | 8.8334             | 0            | 5.463            | <b>0.1895</b>      | <b>2.1035</b>      | <b>0.678</b> | 0.203            | 13.84x                 |
| testcase15 | 2.7686             | 5.7812             | 0.067          | 8.86             | 0.1530            | 1.7691            | 0        | <b>0.105</b>     | 0.0968                   | 1.5265             | 0.088        | 8.137            | <b>0.0353</b>      | <b>0.6735</b>      | <b>0.733</b> | 0.327            | 27.09x                 |
| testcase16 | 4.6521             | 7.5669             | 0.345          | 8.47             | 0.2675            | 1.6696            | 0.258    | <b>0.102</b>     | 0.1601                   | 1.5487             | 0.529        | 7.413            | <b>0.0763</b>      | <b>0.8393</b>      | <b>0.785</b> | 0.324            | 26.14x                 |
| testcase19 | 0.4442             | 1.7226             | 0.057          | 23.33            | 1.0649            | 1.4689            | 0        | <b>0.281</b>     | 0.0905                   | 0.5148             | 0.501        | 11.905           | <b>0.0187</b>      | <b>0.3187</b>      | <b>0.752</b> | 0.596            | 39.14x                 |
| testcase20 | 0.6994             | 2.4261             | 0.010          | 18.91            | 0.8204            | 1.4209            | 0        | <b>0.279</b>     | 0.1180                   | 0.5003             | 0.711        | 11.758           | <b>0.0191</b>      | <b>0.3026</b>      | <b>0.773</b> | 0.576            | 32.83x                 |
| Average    | 1.9785             | 5.8741             | 0.168          | 12.2             | 0.4566            | 2.1746            | 0.082    | <b>0.166</b>     | 0.1347                   | 2.4141             | 0.455        | 8.921            | <b>0.0533</b>      | <b>0.8275</b>      | <b>0.723</b> | 0.409            | 27.20x                 |

**Figure 6: (a) CFIRSTNET prediction and the golden IR drop map of testcase 9. (b) Error distribution of benchmark testing.**

real circuits, synthetic circuits are assigned to the training set, and the real circuits are distributed to the validation set and testing set equally to match the benchmark condition[13]. The circuits in this dataset are synthesized with open-source NanGate 45nm technology[2]. The detail of each test case is provided in Table 1. CFIRSTNET is tested by circuits of various chip areas and PDN planning. Additionally, some of them even involve macros.

Moreover, it is important to mention that the scaling of the IR drop map must be chosen carefully. Though Chhabria et al.[5] claimed that higher resolution is inefficient for accuracy, the disproportionate area-to-pixel ratio might not indicate the accurate worst-case IR drop. As exhibited in Figure 5, if we set the resolution too low, we are unable to identify the exact worst-case IR drop nor the precise IR drop value from the downsampled IR drop map. With a more coarse IR drop map, even if significant performance could be achieved, the estimated IR drop values might be far from those in the original circuit. Thus, the circuit resolution ( $1\mu\text{m} \times 1\mu\text{m}$  per pixel) in the open-source benchmark dataset[13] is more reasonable for practical IR drop estimation compared with previous works. As a result, we suggest the resolution used in the evaluation process of CFIRSTNET be identical to the benchmark.

## 5.2 Evaluation Metrics

To assess the performance of CFIRSTNET in comparison to other approaches, we utilized four key evaluation metrics: mean absolute error (MAE), max error, F1 score, and runtime. These metrics align with those reported in the work by Kadagalal et al.[13].

MAE represents the discrepancy between the predicted IR drop and the actual value, calculated as the sum of absolute errors (Manhattan distance) divided by the sample size Equation (11).

$$MAE = \frac{\sum_{i=1}^N |x_i - y_i|}{N} \quad (11)$$

The F1 score provides a comprehensive view of a model's ability to correctly classify positive instances while minimizing false positives and false negatives. It combines both precision and recall into a single value, providing a balanced assessment of a model's performance. In the IR drop estimation task, F1 score could evaluate the model's ability to capture the IR drop hot spot, i.e., the region with IR drop larger than 90% of the testcase, according to Kadagalal et al.[13]. As shown in Equation (12), the F1 score is obtained with true positive (TP), false positive (FP), and false negative (FN) regions.

$$F1Score = \frac{2 * TP}{2 * TP + FP + FN} \quad (12)$$

Moreover, the runtime is also measured in order to show whether CFIRSTNET could speed up the PDN analysis time. It is imperative to mention that all experiments are carried out on the same machine to make fair comparisons.

## 6 EVALUATION RESULT

### 6.1 Benchmark Performance

Figure 6(a) is an example of the predicted IR drop map and the golden one. According to Figure 6(b), which is the error distribution of benchmark testing, the errors of the majority of predicted IR drops are within 0.2mV. Numerically, Table 2 presents the MAE  $e_{avg}$ , maximum absolute error,  $e_{max}$ , F1 score, and runtime of each approach. HSPICE is selected as the golden IR drop tool. CFIRSTNET is compared with IREDGe from [5] and 1st Place of CAD Contest from [13]. IREDGe<sup>1</sup> can make IR drop estimations instantly but the  $e_{avg}$  and  $e_{max}$  is not negligible. Though the 1st place winner in the ICCAD CAD Contest 2023[13] made good estimations, the  $e_{avg}$  is still improvable and the hotspot in some testcases could not be located. CFIRSTNET achieves a 13-39x speedup compared to HSPICE,  $e_{avg}$  lower than 0.19mV,  $e_{max}$  lower than 2.13mV. Given that the IR drop constraint normally falls into 1% to 2.5% of VDD (1.1V in this case), the result indicates that the estimations made by CFIRSTNET could provide lower  $e_{avg}$  with reasonable  $e_{max}$  and meanwhile ensure efficiency. Moreover, owing to the properties of ML methods, the runtime of CFIRSTNET does not exhibit the same increase as HSPICE does with respect to the growth of chip size.

### 6.2 Compared with CAD Contest Results

**Table 3: Compared with winners of ICCAD CAD Contest 2023**

|                        | $e_{avg}$ (mV) | F1 score     | runtime (sec) |
|------------------------|----------------|--------------|---------------|
| 1st place winner       | 0.1347         | 0.455        | 8.921         |
| 2nd place winner       | 0.1498         | 0.455        | N/A           |
| <b>CFIRSTNET(Ours)</b> | <b>0.0533</b>  | <b>0.723</b> | <b>0.409</b>  |

Table 3 shows the evaluation result of CFIRSTNET compared with the hitherto best solutions presented in the CAD Contest<sup>2</sup>[13]. By virtue of the comprehensive features and tailored model, CFIRSTNET reduces  $e_{avg}$  by 60%, increases F1 score by 59%, and achieves a 21.8x speedup compared to 1st Place winner. Moreover, though the CAD Contest didn't evaluate  $e_{max}$ , CFIRSTNET reduces  $e_{max}$  by 66% (Table 2). After making a direct comparison under identical benchmark conditions, we could state that CFIRSTNET is currently the state-of-the-art solution to the static IR drop estimation problems.

### 6.3 Ablation Studies of Comprehensive Features

Table 4 compares the effectiveness of the proposed features gradually. The image-based input established a baseline performance. The proposed Hypothetical IR Drop Distillation (HIRD) boosted the

<sup>1</sup>IREDGe method is reproduced with its source code from GitHub despite the discrepancy between the code and the paper.

<sup>2</sup>We have obtained the 1st Place's executable but that of 2nd Place is not available.

**Table 4: Performance with features involved gradually.**

|                               | $e_{avg}$     | $e_{max}$     | F1 score     | runtime      |
|-------------------------------|---------------|---------------|--------------|--------------|
| Image-based                   | 0.1864        | 2.2650        | 0.217        | <b>0.290</b> |
| HIRD                          | 0.0795        | 0.9341        | 0.677        | 0.347        |
| HIRD+WR+RD                    | 0.0612        | 0.8691        | 0.682        | 0.356        |
| All Features                  | 0.0559        | 0.9351        | 0.674        | 0.486        |
| <b>Comprehensive Features</b> | <b>0.0533</b> | <b>0.8275</b> | <b>0.723</b> | 0.409        |

experimental result substantially. With the Wire Resistance (WR) maps and the Resistive Distance (RD) maps coming into practice,  $e_{avg}$  and  $e_{max}$  become lower and a higher F1 score is achieved. When all features are taken, CFIRSTNET achieves significant improvement compared to the baseline.



**Figure 7: IR drop estimation maps with or without involving current map.**

Moreover, refer to Figure 7, we noticed that incorporating the current map could cause some abnormal spikes in terms of IR drop estimation. Thus, since the Hypothetical IR Drop Distillation has incorporated information on the cell-level current, the Comprehensive features would involve all features but the current map. As a result, the performance was improved slightly without the current map as input.

## 7 CONCLUSION

In this paper, we present CFIRSTNET, a novel approach for static IR drop estimation. CFIRSTNET includes comprehensive feature extraction and a custom-designed CNN. Consequently, CFIRSTNET could make IR drop estimations within 0.6 seconds and achieve a 13-39x speedup compared to HSPICE with  $e_{avg}$  below 0.19mV and maximum error less than 2.13mV. When evaluated with the open-source ICCAD CAD Contest 2023 benchmark, CFIRSTNET outperformed previous works in terms of MAE, max error, F1 score, and runtime. Moreover, CFIRSTNET achieved recognizable results compared to previous works. In terms of the possibilities of future works, more open-source PDNs could be incorporated to further explore the improbability of CFIRSTNET on this problem or even dynamic IR drop estimation.

## ACKNOWLEDGMENTS

This work was supported by the National Science and Technology Council of Taiwan under Grant 112-2221-E-A49-156-MY3.

## REFERENCES

- [1] 2002. A multigrid-like technique for power grid analysis. *IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems* 21, 10 (2002), 1148–1160.
- [2] T Ajayi, D Blaauw, TB Chan, CK Cheng, VA Chhabria, DK Choo, M Coltell, S Dobre, R Dreslinski, M Fogaça, et al. 2019. OpenROAD: Toward a Self-Driving, Open-Source Digital Layout Implementation Tool Chain. *Proc. GOMACTECH* (2019), 1105–1110.
- [3] Jia-Xian Chen, Shi-Tang Liu, Yu-Tsung Wu, Mu-Ting Wu, Chieo-Mo Li, Norman Chang, Ying-Shiu Li, and Wen-Tze Chuang. 2022. Vector-based dynamic IR-drop prediction using machine learning. In *2022 27th Asia and South Pacific Design Automation Conference (ASP-DAC)*. IEEE, 202–207.
- [4] Tianqi Chen and Carlos Guestrin. 2016. Xgboost: A scalable tree boosting system. In *Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining*, 785–794.
- [5] Vidya A Chhabria, Vipul Ahuja, Ashwath Prabhu, Nikhil Patil, Palkesh Jain, and Sachin S Sapatnekar. 2021. Thermal and IR drop analysis using convolutional encoder-decoder networks. In *Proceedings of the 26th Asia and South Pacific Design Automation Conference*. 690–696.
- [6] Vidya A Chhabria, Kishor Kunal, Masoud Zabihi, and Sachin S Sapatnekar. 2021. BeGAN: Power grid benchmark generation using a process-portable GAN-based methodology. In *2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD)*. IEEE, 1–8.
- [7] Vidya A Chhabria and Sachin S Sapatnekar. 2022. OpeNPDN: A Neural-Network-Based Framework for Power Delivery Network Synthesis. *IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems* 41, 10 (2022), 3515–3528. <https://doi.org/10.1109/TCD.2021.3132554>
- [8] Vidya A Chhabria, Yanqing Zhang, Haoxing Ren, Ben Keller, Brucek Khailany, and Sachin S Sapatnekar. 2021. MAVIREC: ML-aided vectored IR-drop estimation and classification. In *2021 Design, Automation & Test in Europe Conference & Exhibition (DATE)*. IEEE, 1825–1828.
- [9] Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, et al. 2020. An image is worth 16x16 words: Transformers for image recognition at scale. *arXiv preprint arXiv:2010.11929* (2020).
- [10] Yen-Chun Fang, Heng-Yi Lin, Min-Yan Sui, Chien-Mo Li, and Eric Jia-Wei Fang. 2018. Machine-learning-based dynamic IR drop prediction for ECO. In *2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)*. IEEE, 1–7.
- [11] Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In *Proceedings of the IEEE conference on computer vision and pattern recognition*, 770–778.
- [12] Chia-Tung Ho and Andrew B Kahng. 2019. IncPIRD: Fast learning-based prediction of incremental IR drop. In *2019 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)*. IEEE, 1–8.
- [13] Gana Surya Prakash Kadagala and Vidya A Chhabria. 2023. 2023 ICCAD CAD Contest Problem C: Static IR Drop Estimation Using Machine Learning. In *2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD)*. IEEE, 1–5.
- [14] Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, and Serge Belongie. 2017. Feature pyramid networks for object detection. In *Proceedings of the IEEE conference on computer vision and pattern recognition*, 2117–2125.
- [15] Rosanne Liu, Joel Lehman, Piero Molino, Felipe Petroski Such, Eric Frank, Alex Sergeev, and Jason Yosinski. 2018. An intriguing failing of convolutional neural networks and the coordconv solution. *Advances in neural information processing systems* 31 (2018).
- [16] Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. 2021. Swin transformer: Hierarchical vision transformer using shifted windows. In *Proceedings of the IEEE/CVF international conference on computer vision*, 10012–10022.
- [17] Zhuang Liu, Hanzi Mao, Chao-Yuan Wu, Christoph Feichtenhofer, Trevor Darrell, and Saining Xie. 2022. A convnet for the 2020s. In *Proceedings of the IEEE/CVF conference on computer vision and pattern recognition*, 11976–11986.
- [18] Rajendran Panda, David Blaauw, Rajat Chaudhry, Vladimir Zolotov, Brian Young, and Ravi Ramaraju. 2000. Model and analysis for combined package and on-chip power grid simulation. In *Proceedings of the 2000 international symposium on Low power electronics and design*. 179–184.
- [19] Chi-Hsien Pao, An-Yu Su, and Yu-Min Lee. 2020. XGBIR: An XGBoost-based IR drop predictor for power delivery network. In *2020 Design, Automation & Test in Europe Conference & Exhibition (DATE)*. IEEE, 1307–1310.
- [20] Haifeng Qian, Sani R Nassif, and Sachin S Sapatnekar. 2005. Power grid analysis using random walks. *IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems* 24, 8 (2005), 1204–1224.
- [21] Carole H. Sudre, Wensi Li, Tom Vercauteren, Sébastien Ourselin, and M. Jorge Cardoso. 2017. *Generalized Dice Overlap as a Deep Learning Loss Function for Highly Unbalanced Segmentations*. Springer International Publishing, 240–248. [https://doi.org/10.1007/978-3-319-67558-9\\_28](https://doi.org/10.1007/978-3-319-67558-9_28)
- [22] Sanghyun Woo, Shoubhik Debnath, Ronghang Hu, Xinlei Chen, Zhuang Liu, In So Kweon, and Saining Xie. 2023. Convnext v2: Co-designing and scaling convnets with masked autoencoders. In *Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition*, 16133–16142.
- [23] Zhiyao Xie, Haoxing Ren, Brucek Khailany, Ye Sheng, Santosh Santosh, Jiang Hu, and Yiran Chen. 2020. PowerNet: Transferable dynamic IR drop estimation via maximum convolutional neural network. In *2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC)*. IEEE, 13–18.
- [24] M. Zhao, R.V. Panda, S.S. Sapatnekar, and D. Blaauw. 2002. Hierarchical analysis of power distribution networks. *IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems* 21, 2 (2002), 159–168. <https://doi.org/10.1109/43.980256>
- [25] Yu Zhong and Martin DF Wong. 2005. Fast algorithms for IR drop analysis in large power grid. In *ICCAD-2005. IEEE/ACM International Conference on Computer-Aided Design, 2005*. IEEE, 351–357.
- [26] Han Zhou, Wentian Jin, and Sheldon X-D Tan. 2020. GridNet: Fast data-driven EM-induced IR drop prediction and localized fixing for on-chip power grid networks. In *Proceedings of the 39th International Conference on Computer-Aided Design*, 1–9.