

# Clock Tree Synthesis Tutorial and iCTS Software

Weiguo Li  
Minnan Normal University  
[dawnli619215645@gmail.com](mailto:dawnli619215645@gmail.com)

# Overview

- 1 **Introduction**
- 2 **Preliminaries**
- 3 **iCTS Structure**
- 4 **Routing Topology**
- 5 **Buffering Optimization**
- 6 **Framework**
- 7 **Experimental Results**
- 8 **Future Works**

# Where's CTS



**Floorplan**  
determining the area of the Die and Core, and placing the macro cell.



**Placement** placing standard cells into legal positions to achieve the optimal design object.



**CTS** designs a clock network, ensuring clock synchronization, achieving low-power and high-performance.



**Routing** materializes all nets to keep the wirelength as short as possible and adhere to the design rules.



# What's CTS?



- CTS, the bridge between Placement and Routing
- Achieving **skew balance** and **minimize design resource** (buffers, wirelength)

# What's our concern



- Timing of Clock Tree
  - skew
  - latency
  - worst negative slack (WNS)
  - total negative slack (TNS)
  - ...
- Design Resource
  - num of buffer insertions
  - wirelength
  - capacitance
  - power/area
  - ...

# What's the operation in CTS

- Routing Topology
    - starting from the **driver pin**
    - connect all **load pins**
    - Steiner tree properties
  - Buffering
    - decompose the net
    - reduce fanout ( $4 \rightarrow 2$ )
    - enhanced driving capability
- > *delay/wirelength/capacitance/...*
- > *cell delay/area/power/slew/...*



# What's we expecting



# Overview

- 1 Introduction
- 2 Preliminaries
- 3 iCTS Structure
- 4 Routing Topology
- 5 Buffering Optimization
- 6 Framework
- 7 Experimental Results
- 8 Future Works

# CTS Formula

- We define the initial distribution of **flip-flops (sinks)** in the CTS stage as  $\mathcal{S}$ . Given a clock source (root), we can establish **clock skew** constraint as follow:

$$\Delta \text{delay}(s_i) = \max_{s_i \in \mathcal{S}} \{ \text{delay}(s_i) \} - \min_{s_i \in \mathcal{S}} \{ \text{delay}(s_i) \} \leq \text{skew\_bound},$$

where  $\text{delay}(s_i)$  represents the delay from clock source to sink  $s_i$  (latency).

We define the set of load pins for each clock net as a **cluster**, i.e.,  $u_j \in \mathcal{U}$ , and generate a clock net, i.e.,  $n_j \in \mathcal{N}$ .



# CTS Formula

- CTS needs to satisfy additional design constraints, such as the **fanout** constraint:

$$|u_j| \leq \text{max\_fanout},$$

the maximum **wirelength** constraint:

$$WL(n_j) \leq \text{max\_length},$$

and the maximum clock net **capacitance** constraint:

$$\sum_{s_i \in u_j} Cap_{pin}(s_i) + c \cdot WL(n_j) \leq \text{max\_cap}.$$



# CTS Formula

- CTS problem can be defined as a multi-objective optimization problem:

$$\begin{aligned} \min \quad & f(\mathcal{N}, \mathcal{U}, \mathcal{S}) \\ \text{s.t.} \quad & \Delta \text{delay}(s_i) \leq \text{skew\_bound}, \\ & |u_j| \leq \text{max\_fanout}, \\ & WL(n_j) \leq \text{max\_length}, \\ & \sum_{s_i \in u_j} Cap_{pin}(s_i) + c \cdot WL(n_j) \leq \text{max\_cap}, \end{aligned}$$

$f(\mathcal{N}, \mathcal{U}, \mathcal{S})$  represents the objective function, such as **design resources** and **power**.

**It's a complex multi-objective optimization problem**

# Load Capacitance

- Equivalent Capacitance (Linear)

$$cap_{load}(s) = cap_{pin}(s) + \sum_{t \in ch(s)} (Cap_{load}(t) + c \cdot L(s, t)),$$

Pin Cap

Capacitance per unit length/area  
Tree (Recursive)  
Wire Cap

Bottom-up Calculation



# Wire (Interconnect) Delay

- Elmore  $\pi$ -Model

$$D_{wire}(s, t) = r \cdot L(s, t) \cdot \left[ \underbrace{\frac{1}{2} \cdot c \cdot L(s, t)}_{\text{Res}} + \underbrace{Cap_{load}(t)}_{\text{Cap}} \right],$$

Resistance per unit length/area      **Res**      **Cap**



Equivalent  $\pi$ -model of a wire segment



# Slew (Transition)

- Bakoglu Metric<sup>[1]</sup> & PERI Model<sup>[2]</sup>

$$Slew_{ideal}(s, t) = \ln 9 \cdot D_{wire}(s, t),$$

$$Slew_{wire}(t) = \sqrt{Slew_{wire}^2(s) + Slew_{ideal}^2(s, t)}.$$



Top-down  
Calculation

# Look-up Table (LUT)

- $Delay/Slew_{out} = LUT(Slew_{in}, Cap_{load})$ 
  - upstream input slew
  - downstream load capacitance
  - not reliant on complex computational models
- Model
  - bilinear interpolation
  - ML fitting...



# Buffer Delay

- Linear Fitting<sup>[3]</sup>

$$D_{buf}(\ast) = LUT_{delay}(Slew_{in}(\ast), Cap_{load}(\ast)),$$



$$D_{buf}(\ast) = \alpha \cdot Slew_{in}(\ast) + \beta \cdot Cap_{load}(\ast) + \gamma.$$



Intrinsic  
Delay

**This type of modeling is only effective for clock buffers**

# Buffer Slew

- Linear Fitting[3]

$$Slew_{out}(\ast) = LUT_{slew}(Slew_{in}(\ast), Cap_{load}(\ast)),$$



$$Slew_{out}(\ast) = \alpha \cdot Cap_{load}(\ast) + \beta.$$

- Benefit

- sufficient accuracy
- independence from the upstream information (under certain conditions)



Fitting results under the 28nm process library

# Clock Topology



H-Tree<sup>[4]</sup>



Deferred-Merge Embedding<sup>[5]</sup>  
(DME)



Fishbone/Spine<sup>[6]</sup>



Mesh Grid<sup>[7]</sup>



Hybrid<sup>[8]</sup>



# Buffering

- Effect
  - reduce fanout
  - affect path delay
  - enhance driving capability (transition)
- Property
  - size/area
  - power/timing
  - library (for LUT)
- Optimization
  - insertion
  - resizing



Buffering (Insertion)



Timing (Buffering)  
Optimization

# Overview

- 1 Introduction
- 2 Preliminaries
- 3 iCTS Structure
- 4 Routing Topology
- 5 Buffering Optimization
- 6 Framework
- 7 Experimental Results
- 8 Future Works

# CTS Components

- Clock Network
- Physicalization
- Optimization
- Skew Schedule



# iCTS Software

- api
- data manager
- module
- solver
- external lib



# Overview

- 1 Introduction
- 2 Preliminaries
- 3 iCTS Structure
- 4 Routing Topology
- 5 Buffering Optimization
- 6 Framework
- 7 Experimental Results
- 8 Future Works

# Bound Skew Tree (BST/DME)<sup>[9]</sup>

- Bottom-up Stage
  - merge two sub-tree
  - determine merge region
- Top-down Stage
  - location embedding
  - nearest endpoint
- Benefit
  - zero/bound skew
  - topology based



Bottom-up



Top-down

# Fundamentals of Steiner Trees

- $PL(s_i)$ , the **path length** from the source
- $MD(s_i)$ , the **Manhattan distance** from the source
- $WL(T)$ , the **total wirelength** of clock tree  $T$
- $WL(T_{FLUTE})$ , the **wirelength of FLUTE<sup>[10]</sup> tree**



# SALT[11]

- Shallowness (path length)

$$\alpha = \max_{s_i \in \mathcal{S}} \left\{ \frac{PL(s_i)}{MD(s_i)} \right\},$$

- Lightness (tree weight)

$$\beta = \frac{WL(T)}{WL(MST(G))} \approx \frac{WL(T)}{WL(T_{FLUTE})}.$$



Different Shallow-Light Tree on the same net

Given  $\epsilon$ , SALT realizes  $\alpha \leq 1 + \epsilon$  and  $\beta \leq 2 + \lceil \log \frac{2}{\epsilon} \rceil$

# Metric Mapping



$$\frac{PL_i}{MD_i} \leq 1$$

Shallowness

**Latency (Delay)**



$$\min\{\sum WL\}$$

Lightness

**Load Capacitance**



$$skew \leq skew_{bound}$$

?

**Skew**

# Skewness\*

- The skewness of the Steiner tree is defined as:

$$\gamma = \frac{\max_{s_i \in \mathcal{S}} \{ PL(s_i) \}}{\overline{PL}}.$$



# Skew-Latency-Load Tree (SLLT)\*

- A rectilinear Steiner tree with  $\alpha \leq \bar{\alpha}$ ,  $\beta \leq \bar{\beta}$ , and  $\gamma \leq \bar{\gamma}$ , is denoted as  $(\bar{\alpha}, \bar{\beta}, \bar{\gamma}) - SLLT$ .



| Algorithm | Max PL | Min PL | Total WL | Mean PL | $\alpha$    | $\beta$     | $\gamma$    | Mean        | Skew Control |
|-----------|--------|--------|----------|---------|-------------|-------------|-------------|-------------|--------------|
| H-tree    | 10     | 9      | 55.5     | 9.75    | 2           | 1.32        | 1.03        | 1.45        | ✓            |
| GH-tree   | 10     | 7      | 47.5     | 8.5     | 1.6         | 1.13        | 1.18        | 1.3         | ✓            |
| ZST       | 10.5   | 10.5   | 55.5     | 10.5    | 2.33        | 1.32        | <b>1.00</b> | 1.55        | ✓            |
| BST       | 10     | 8      | 50       | 9.25    | 2.25        | 1.19        | 1.08        | 1.51        | ✓            |
| FLUTE     | 9      | 5      | 42       | 7.44    | 1.4         | <b>1.00</b> | 1.21        | 1.2         | ✗            |
| R-SALT    | 9      | 5      | 43       | 7.06    | <b>1.00</b> | 1.02        | 1.27        | <b>1.10</b> | ✗            |
| CBS*      | 9      | 7      | 45       | 8.13    | 1.4         | 1.07        | 1.11        | <b>1.19</b> | ✓            |

# Concurrent BST and SALT (CBS)\*

- Bound Skew Tree (BST)
  - **Benefit**
    - Provides bounded skew control
  - **Weakness**
    - High cost of net (wirelength, cap)
- Steiner SLT (SALT)
  - **Benefit**
    - Acceptable total wirelength and smaller path length (delay)
  - **Weakness**
    - Lacks the ability to balance skew



# Comparison

Table 1: Wirelength ( $\mu m$ ) comparison between R-SALT and CBS.

|                  | <b>GreedyDist</b> |              |               | <b>GreedyMerge</b> |              |              | <b>BiPartition</b> |              |              |
|------------------|-------------------|--------------|---------------|--------------------|--------------|--------------|--------------------|--------------|--------------|
| <b>Skew (ps)</b> | 80                | 10           | 5             | 80                 | 10           | 5            | 80                 | 10           | 5            |
| <b>R-SALT</b>    | 314.4             | 314.3        | 315.1         | 312.6              | 313.0        | 315.6        | 312.2              | 312.4        | 312.7        |
| <b>CBS</b>       | 306.0             | 307.1        | 316.1         | 305.2              | 306.3        | 314.3        | 305.3              | 305.6        | 312.7        |
| <b>Reduce</b>    | <b>2.67%</b>      | <b>2.29%</b> | <b>-0.32%</b> | <b>2.37%</b>       | <b>2.14%</b> | <b>0.41%</b> | <b>2.21%</b>       | <b>2.18%</b> | <b>0.00%</b> |

Table 2: Comparison on wirelength, cap and delay between BST-DME and CBS.

|                  | <b>Wirelength (<math>\mu m</math>)</b> |               |               | <b>Cap (<math>fF</math>)</b> |               |               | <b>Wire Delay (ps)</b> |               |               |
|------------------|----------------------------------------|---------------|---------------|------------------------------|---------------|---------------|------------------------|---------------|---------------|
| <b>Skew (ps)</b> | 80                                     | 10            | 5             | 80                           | 10            | 5             | 80                     | 10            | 5             |
| <b>BST-DME</b>   | 363.3                                  | 367.6         | 373.2         | 77.4                         | 78.1          | 79.1          | 15.3                   | 11.5          | 10.2          |
| <b>CBS</b>       | 305.6                                  | 306.1         | 314           | 67.5                         | 67.6          | 68.9          | 11.2                   | 9.2           | 7.4           |
| <b>Reduce</b>    | <b>15.90%</b>                          | <b>16.70%</b> | <b>15.90%</b> | <b>12.80%</b>                | <b>13.50%</b> | <b>12.80%</b> | <b>26.60%</b>          | <b>20.50%</b> | <b>26.80%</b> |

# Overview

- 1 Introduction
- 2 Preliminaries
- 3 iCTS Structure
- 4 Routing Topology
- 5 Buffering Optimization
- 6 Framework
- 7 Experimental Results
- 8 Future Works

# Critical Wirelength Model\*

- Special Net: Buffering between Buffers.

$$T(i,j) = D_{buf}(i) + D_{wire}(i,j) + D_{buf}(j),$$

$$T'(i,j) = D'_{buf}(i) + D'_{wire}(i,k) + D'_{buf}(k) + D'_{wire}(k,j) + D'_{buf}(j).$$



# Critical Wirelength Model\*

- Special Net: Buffering between Buffers.

$$T(i,j) - T'(i,j) = x \cdot (1-x) \cdot r \cdot c \cdot (\ln 9 \cdot \alpha + 1) \cdot L(i,j)^2 - \beta \cdot Cap_{pin} - \gamma,$$

$$\begin{aligned} CL_{buf}(i,j) &= \sqrt{\frac{\beta \cdot Cap_{pin} + \gamma}{x \cdot (1-x) \cdot r \cdot c \cdot (\ln 9 \cdot \alpha + 1)}} \\ &\geq 2 \cdot \sqrt{\frac{\beta \cdot Cap_{pin} + \gamma}{r \cdot c \cdot (\ln 9 \cdot \alpha + 1)}}. \end{aligned}$$



# Critical Wirelength Model\*

- General Net: Buffering between Steiner Points.

$$T(i,j) = D_{wire}(i,j),$$

$$T'(i,j) = D'_{wire}(i,k) + D'_{buf}(k) + D_{wire}(k,j).$$



# Critical Wirelength Model\*

- General Net: Buffering between Steiner Points.

$$\begin{aligned}\Delta T(i,j) = & \frac{1}{2} \cdot r \cdot c \cdot [(2 + \alpha \cdot \ln 9) \cdot x^2 - 2 \cdot x] \cdot L(i,j)^2 \\ & + \{r \cdot x \cdot [(1 + \alpha \cdot \ln 9) \cdot Cap_{pin} - Cap_{load}^*(j)] \\ & + \beta \cdot c \cdot (1 - x)\} \cdot L(i,j) \\ & + \beta \cdot Cap_{load}^*(j) + \gamma \\ & - \beta \cdot [c \cdot (1 - x) \cdot L(i,j) + Cap_{load}^*(j)] \leq 0,\end{aligned}$$



# Critical Wirelength Model\*

- General Net: Buffering between Steiner Points.

$$\widehat{CL}_{stnr}(i,j) = 2\sqrt{\frac{\beta \cdot \frac{cap_{max}}{2} + \gamma}{r \cdot c \cdot (\ln 9 \cdot \alpha + 1)}},$$

Table 3: Critical wirelength statistics.

| Cell      | $CL_{buf}$ | $\widehat{CL}_{stnr}$ | $CL_{stnr}$ | $x$  |
|-----------|------------|-----------------------|-------------|------|
| BUFFD-X3  | 157.043    | 211.842               | 351.185     | 0.27 |
| BUFFD-X4  | 146.492    | 206.286               | 275.137     | 0.31 |
| BUFFD-X6  | 174.212    | 242.255               | 279.531     | 0.35 |
| BUFFD-X8  | 187.388    | 257.016               | 270.459     | 0.38 |
| BUFFD-X12 | 195.638    | 269.881               | 254.932     | 0.41 |
| BUFFD-X16 | 211.98     | 287.216               | 258.906     | 0.42 |

# Insertion Delay Estimation Model\*

- Estimate the contribution of downstream load capacitance to delay:

$$\begin{aligned} \text{delay}_{est} &= LUT(\text{Cap}_{load}, \text{Slew}_{in}) - \omega_s \cdot \text{Slew}_{in} \\ &\approx \omega_c \cdot \text{Cap}_{load} + \omega_{inherent} \end{aligned}$$



# Overview

- 1 Introduction
- 2 Preliminaries
- 3 iCTS Structure
- 4 Routing Topology
- 5 Buffering Optimization
- 6 Framework
- 7 Experimental Results
- 8 Future Works

# Hierarchical Framework\*



# Overview

- 1 Introduction
- 2 Preliminaries
- 3 iCTS Structure
- 4 Routing Topology
- 5 Buffering Optimization
- 6 Framework
- 7 Experimental Results
- 8 Future Works

# Comparison of Open Source Test Cases

Table 4: Comparison between clock tree solutions from ours, commercial tool, and OpenROAD.

| Case     | Latency (ps) |       |       | Skew (ps)    |           |           | #Buffers     |           |       | Buf Area ( $\mu m^2$ ) |       |       | Clk Cap (fF) |       |       | Clk WL ( $\mu m$ ) |                |         |
|----------|--------------|-------|-------|--------------|-----------|-----------|--------------|-----------|-------|------------------------|-------|-------|--------------|-------|-------|--------------------|----------------|---------|
|          | Ours         | Com.  | OR.   | Ours         | Com.      | OR.       | Ours         | Com.      | OR.   | Ours                   | Com.  | OR.   | Ours         | Com.  | OR.   | Ours               | Com.           | OR.     |
| s38584   | <b>71</b>    | 76    | 93    | 13           | <b>8</b>  | 17        | <b>43</b>    | <b>43</b> | 45    | <b>26.6</b>            | 26.8  | 39.7  | <b>904</b>   | 1019  | 1087  | 3382.0             | <b>3366.5</b>  | 3478.9  |
| s38417   | <b>75</b>    | 84    | 94    | <b>10</b>    | 19        | 16        | <b>53</b>    | 54        | 55    | <b>32.4</b>            | 33.6  | 48.5  | <b>1083</b>  | 1235  | 1284  | <b>3755.6</b>      | 3867.4         | 3870.5  |
| s35932   | <b>80</b>    | 81    | 100   | 13           | <b>10</b> | 17        | <b>58</b>    | 59        | 64    | <b>35.5</b>            | 36.5  | 56.4  | <b>1217</b>  | 1380  | 1433  | <b>4380.0</b>      | 4420.9         | 4449.4  |
| salsa20  | <b>82</b>    | 87    | 112   | <b>19</b>    | 21        | 29        | <b>81</b>    | 83        | 109   | <b>49.6</b>            | 51.8  | 96.1  | <b>1715</b>  | 2050  | 2160  | <b>6446.9</b>      | 6580.9         | 6863.5  |
| ethernet | <b>97</b>    | 104   | 159   | 34           | <b>31</b> | 51        | <b>337</b>   | 352       | 455   | <b>315.5</b>           | 320.4 | 408.9 | <b>7314</b>  | 8823  | 9210  | 26113.9            | <b>26105.5</b> | 27248.8 |
| vga_lcd  | <b>134</b>   | 146   | 206   | <b>41</b>    | 49        | <b>92</b> | <b>575</b>   | 597       | 775   | <b>416.9</b>           | 451.8 | 812.1 | <b>12380</b> | 14920 | 15815 | 46763.1            | <b>45969.8</b> | 47484.1 |
| Avg.     | <b>1.000</b> | 1.072 | 1.417 | <b>1.000</b> | 1.062     | 1.708     | <b>1.000</b> | 1.036     | 1.310 | <b>1.000</b>           | 1.051 | 1.668 | <b>1.000</b> | 1.196 | 1.259 | 1.000              | <b>0.994</b>   | 1.028   |

# Comparison of ysyx designs

Table 5: Comparison of ysyx designs among clock tree solutions from ours, commercial tool, and OpenROAD.

| Case   | Latency (ps) |            |       | Skew (ps) |             |            | #Buffers     |       |       | Buf Area ( $\mu m^2$ ) |              |        | Clk Cap (fF) |       |              | Clk WL ( $\mu m$ ) |                 |          |
|--------|--------------|------------|-------|-----------|-------------|------------|--------------|-------|-------|------------------------|--------------|--------|--------------|-------|--------------|--------------------|-----------------|----------|
|        | Ours         | Com.       | OR.   | Ours      | Com.        | OR.        | Ours         | Com.  | OR.   | Ours                   | Com.         | OR.    | Ours         | Com.  | OR.          | Ours               | Com.            | OR.      |
| ysyx_0 | <b>113</b>   | 120        | 156   | 31        | <b>15</b>   | 63         | <b>639</b>   | 654   | 799   | <b>550.4</b>           | 561.5        | 1719.0 | 29032        | 31662 | <b>17130</b> | <b>42698.49</b>    | 42935.09        | 45389.69 |
| ysyx_1 | <b>118</b>   | 122        | 206   | 29        | <b>12</b>   | <b>110</b> | <b>656</b>   | 674   | 863   | 568.4                  | <b>568.3</b> | 1896.4 | 29455        | 32204 | <b>17907</b> | <b>44538.49</b>    | 45142.38        | 48113.24 |
| ysyx_2 | <b>140</b>   | 143        | 191   | 38        | <b>16</b>   | 68         | <b>943</b>   | 958   | 1113  | <b>801.2</b>           | 814.6        | 2401.7 | 35272        | 39256 | <b>25491</b> | <b>64884.11</b>    | 64901.76        | 69753.65 |
| ysyx_3 | 144          | <b>139</b> | 193   | 36        | <b>16</b>   | 59         | <b>798</b>   | 808   | 914   | <b>671.8</b>           | 689.0        | 1970.4 | 32483        | 35830 | <b>21484</b> | 57229.57           | <b>56918.98</b> | 59314.99 |
| Avg.   | <b>1.000</b> | 1.017      | 1.449 | 1.000     | <b>0.44</b> | 2.239      | <b>1.000</b> | 1.019 | 1.215 | <b>1.000</b>           | 1.016        | 3.082  | 1.000        | 1.101 | <b>0.65</b>  | <b>1.000</b>       | 1.003           | 1.063    |

# Topology & Routing



s38417 Topology



s38417 Routing



vga\_enh\_top Topology



vga\_enh\_top Routing

# Overview

- 1 Introduction
- 2 Preliminaries
- 3 iCTS Structure
- 4 Routing Topology
- 5 Buffering Optimization
- 6 Framework
- 7 Experimental Results
- 8 Future Works

# Future

- Software
  - Clock Topology
  - Community
  - iCTS API
- Flow
  - Timing Representation
  - Clock Routing
  - Buffering Optimization
- Technology
  - DSE
  - CCOpt
  - AI for CTS



# Reference

- [1] Bakoglu H B. Circuits, interconnections, and packaging for VLSI[J]. (No Title), 1990.
- [2] Kashyap C V, Alpert C J, Liu F, et al. Closed-form expressions for extending step delay and slew metrics to ramp inputs for RC trees[J]. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2004, 23(4): 509-516.
- [3] Sitik C, Lerner S, Taskin B. Timing characterization of clock buffers for clock tree synthesis[C]. 2014 IEEE 32nd International Conference on Computer Design (ICCD), 2014: 230-236.
- [4] Bakoglu H B. Circuits, interconnections, and packaging for VLSI[J]. 1990.
- [5] Boese K D, Kahng A B. Zero-skew clock routing trees with minimum wirelength[C]. [1992] Proceedings. Fifth Annual IEEE International ASIC Conference and Exhibit, 1992: 17-21.
- [6] Andreev A, Nikishin A, Gribok S, et al. Clock network fishbone architecture for a structured ASIC manufactured on a 28 NM CMOS process lithographic node: Google Patents, 2014.
- [7] Chakrabarti P, Bhatt V, Hill D, et al. Clock mesh framework[C]. Thirteenth International Symposium on Quality Electronic Design (ISQED), 2012: 424-431.
- [8] Abdelhadi A, Ginosar R, Kolodny A, et al. Timing-driven variation-aware synthesis of hybrid mesh/tree clock distribution networks[J]. Integration, 2013, 46(4): 382-391.
- [9] Cong J, Kahng A B, Koh C-K, et al. Bounded-skew clock and Steiner routing[J]. ACM Transactions on Design Automation of Electronic Systems (TODAES), 1998, 3(3): 341-388.
- [10] Chu C, Wong Y-C. FLUTE: Fast lookup table based rectilinear Steiner minimal tree algorithm for VLSI design[J]. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2007, 27(1): 70-83.
- [11] Chen G, Young E F. Salt: provably good routing topology by a novel steiner shallow-light tree algorithm[J]. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, 2019, 39(6): 1217-1230.

# Thanks