

# A Class-D Tri-Phasing CMOS Power Amplifier With an Extended Marchand-Balun Power Combiner

Mikko Martelius<sup>ID</sup>, Member, IEEE, Kari Stadius<sup>ID</sup>, Member, IEEE, Jerry Lemberg, Enrico Roverato<sup>ID</sup>, Tero Nieminen, Yury Antonov<sup>ID</sup>, Member, IEEE, Lauri Anttila<sup>ID</sup>, Member, IEEE, Mikko Valkama, Senior Member, IEEE, Marko Kosunen<sup>ID</sup>, Member, IEEE, and Jussi Ryyynänen<sup>ID</sup>, Senior Member, IEEE

**Abstract**—This article presents a power amplifier (PA) design, which consists of eight class-D PA units on a single 28-nm CMOS die and a coupled-line power combiner on printed circuit board. The PA utilizes tri-phasing modulation, which combines polar and outphasing components in a way that eliminates linearity-degrading effects of multilevel outphasing while maintaining the back off efficiency. Each PA unit contains a cascoded output stage with a 3.6-V supply voltage, and multilevel operation is enabled by ON/OFF logic circuitry. Our analysis shows that the choice of power-combiner type is vital for reducing PA supply and ground ripple and thus ensuring reliable operation. Accordingly, the power combiner is implemented with extended Marchand baluns, which consist of input transmission lines and coupled-line sections. Unlike the original Marchand balun, our new topology is feasible for implementation under the layout restrictions caused by the multiple-unit PA on a single die. Measurement results show the PA achieving a peak output power of 29.7 dBm with a 34.7% efficiency, and operation with aggregated Long Term Evolution (LTE) signals at 1.7-GHz carrier frequency is verified with bandwidths up to 100 MHz.

**Index Terms**—CMOS integrated circuits, Marchand balun, outphasing, power amplifiers (PA), power combiners, radio transmitters, tri-phasing.

## I. INTRODUCTION

ONE of the long-standing challenges in designing radio transmitters and particularly their power amplifiers (PA) is achieving high efficiency while fulfilling the increasingly

Manuscript received August 13, 2019; revised October 7, 2019; accepted October 29, 2019. Date of publication December 16, 2019; date of current version March 4, 2020. This work was supported in part by Nokia, in part by the Finnish Funding Agency for Technology and Innovation (Tekes), in part by the European Union's Horizon 2020 Research and Innovation Program under the Marie Skłodowska-Curie Grant 704947, and in part by the Academy of Finland. (*Corresponding author: Mikko Martelius*)

M. Martelius and Y. Antonov were with the Department of Electronics and Nanoengineering, Aalto University, 02150 Espoo, Finland. They are now with CoreHW, 00100 Helsinki, Finland (e-mail: mikko.martelius@corehw.com).

K. Stadius, M. Kosunen, and J. Ryyynänen are with the Department of Electronics and Nanoengineering, School of Electrical Engineering, Aalto University, 02150 Espoo, Finland.

J. Lemberg was with the Department of Electronics and Nanoengineering, Aalto University, 02150 Espoo, Finland. He is now with LG Electronics, 20520 Turku, Finland.

E. Roverato and T. Nieminen were with the Department of Electronics and Nanoengineering, Aalto University, 02150 Espoo, Finland. They are now with CoreHW, 33720 Tampere, Finland.

L. Anttila and M. Valkama are with the Department of Electrical Engineering, Tampere University, 33720 Tampere, Finland.

Color versions of one or more of the figures in this article are available online at <http://ieeexplore.ieee.org>.

Digital Object Identifier 10.1109/TMTT.2019.2952771

demanding standards of linearity and spectral purity. Meanwhile, the development toward higher integration levels and system-on-chip (SoC) implementations, along with the decreasing supply voltages in the latest CMOS processes, leads to digital-intensive and time-based solutions being increasingly favored over analog circuit structures. As a result, there is a growing interest in discarding the conventional Cartesian transmitter architecture in favor of alternative structures that employ constant-envelope PA input signals, which enables using highly efficient but nonlinear switch-mode PA classes. One technique for producing a phase- and amplitude-modulated signal with such PAs is outphasing [1], [2], in which the output amplitude is modulated by altering the phase offset between two constant-envelope signal components. While outphasing transmitters can achieve high peak efficiency, they tend to suffer from declining efficiency in power back off.

A previously proposed solution for improving the back-off efficiency is multilevel outphasing, in which the output amplitude is modulated by discrete amplitude levels in addition to a phase offset [3], [4]. In our earlier work, we have developed new circuit solutions for phase modulators [5] and PAs [6] intended for highly integrated wideband multilevel outphasing transmitters [7]. However, while analyzing the characteristics of multilevel outphasing, we have found that amplitude-level transitions inherently cause discontinuities in the harmonic content of the combined output signal [8]. This appears in the spectrum as additional noise at a wide range of frequencies, including the signal band, where it can limit the achievable adjacent-channel leakage ratio (ACLR). As a solution to this tradeoff between efficiency and linearity, we proposed a new transmitter architecture called tri-phasing [8]. This technique combines coarse-amplitude polar modulation with outphasing, resulting in smooth amplitude-level transitions, and thus, improved spectral performance, while simultaneously achieving back-off efficiency equal to multilevel outphasing.

In this article, we present a tri-phasing PA consisting of eight PA units on a single 28-nm CMOS die and a coupled-line power combiner on the printed circuit board (PCB). We place particular emphasis on the analysis and design of the power combiner, which has only been superficially described by our previous articles focusing on the integrated-circuit (IC) design details of the PA [9] and the complete transmitter included on the same die [10]. The combiner is an essential part of



Fig. 1. Schematic of a generic wire-bonded multiple-unit PA system.

the system, largely defining the available carrier-frequency range and many characteristics of the wide spectrum. Furthermore, we present analysis demonstrating that the choice of power-combiner type is critical for PA reliability due to its effects on voltage ripple at the PA supply and ground. This consequential finding justifies our decision to discard the simple transmission-line combiner structure in favor of a notably more complex voltage-subtracting type.

The task of designing a coupled-line combiner is further complicated by the layout restrictions caused by all eight PA units being on a single die, rendering the Marchand balun [11] unfeasible in its original form. As a solution, we developed a new topology, the extended Marchand balun, which is extensively analyzed in this article and forms the basis of the presented combiner design. In addition to the coupled lines, this structure features input lines of an arbitrary length, making it suitable for a wider range of applications.

This article is organized as follows. Section II examines voltage ripple at the PA supply and ground, motivating our decision on the power-combiner type. Section III summarizes the structure and design of the integrated tri-phasing PA. In Section IV, we analyze the extended Marchand balun and present the power-combiner design. Measurement results are reported in Section V, and Section VI concludes this article.

## II. ANALYSIS OF PA SUPPLY AND GROUND RIPPLE

In order to explain why the power-combiner type is critically important for reliable PA operation, this section analyzes how the combiner affects voltage ripple at wire-bonded PA supply and ground nodes. The first part of this analysis examines the effects of common-mode and differential current at the supply and ground, and the second part establishes the dependence between these currents and the power-combiner type. While we apply the results to an outphasing-based PA, the analysis is equally valid for any fully integrated multiple-unit PA.

Fig. 1 shows the schematic of a generic wire-bonded  $N$ -unit PA system, drawn to highlight the supply and ground currents. The bonding wires at the supply and ground nodes are modeled

as inductors  $L_{dd}$  and  $L_{ss}$ , respectively, and the on-chip bypass capacitor  $C$  is included in order to stabilize the voltage difference between the internal supply and ground nodes ( $V_{dd} - V_{ss}$ ). However, the capacitor cannot suppress common-mode ripple at  $V_{dd}$  and  $V_{ss}$ . Despite not affecting the internal operation of the PA, such ripple can nonetheless cause problems in a larger context. First, in an SoC with multiple supply domains and separated ground nodes, the PA needs to operate with input signals that are not yet affected by the ripple. Consequently, the signals would be distorted by the PA input stages due to the relative fluctuation between the two ground levels. This could lead to suboptimal performance or even failure to produce a recognizable output. Second, the ripple would appear as an unwanted component in each PA-unit output voltage, and unless canceled by the combiner, this component would also degrade the combined output signal.

We begin the analysis by examining the currents shown in Fig. 1. Only the ac components of all voltages and currents are included in all following equations. At this point, each PA unit can be modeled as two current sources drawing  $I_{dd,i}$  from the internal supply node ( $V_{dd}$ ) and  $I_{ss,i}$  from the internal ground ( $V_{ss}$ ), while making no assumptions about the characteristics of these currents. Furthermore, to obtain more illustrative equations, we will use common-mode–differential notation for the total internal supply and ground currents

$$I_{dd} = \sum_{i=1}^N I_{dd,i} = I_{CM} + I_D \quad (1)$$

$$I_{ss} = \sum_{i=1}^N I_{ss,i} = I_{CM} - I_D \quad (2)$$

as shown in Fig. 1. Similarly, we separate the effects of the difference between the bonding-wire inductances by expressing them as

$$L_{dd} = L + \Delta L \quad (3)$$

$$L_{ss} = L - \Delta L. \quad (4)$$

First, we can observe that the output current of the  $i$ th PA unit is

$$I_{PA,i} = I_{dd,i} + I_{ss,i} \quad (5)$$

assuming that its input current is negligible. Thus, the sum of all PA-unit output currents is

$$\sum_{i=1}^N I_{PA,i} = I_{dd} + I_{ss} = 2I_{CM}. \quad (6)$$

This helps to describe some of the factors that affect the common-mode and differential current components. As (6) shows,  $I_{CM}$  is directly defined by the sum of PA-unit output currents. Those currents depend partly on the characteristics of the power combiner, as will be shown in a later part of this analysis. On the other hand,  $I_D$  represents the current that flows through the PA units from supply to ground or vice versa. This current does not interact with the combiner and is, thus, primarily determined by the PA units themselves.

With straightforward circuit analysis, the capacitor and bonding-wire currents can be solved as

$$I_C = \frac{2\omega^2 C}{1 - 2\omega^2 LC} (\Delta L I_{CM} + L I_D) \quad (7)$$

$$I_{Ldd} = I_{CM} \left( 1 + \frac{2\omega^2 \Delta L C}{1 - 2\omega^2 LC} \right) + \frac{I_D}{1 - 2\omega^2 LC} \quad (8)$$

$$I_{Lss} = I_{CM} \left( 1 - \frac{2\omega^2 \Delta L C}{1 - 2\omega^2 LC} \right) - \frac{I_D}{1 - 2\omega^2 LC}. \quad (9)$$

From these results, we can derive the supply and ground voltage ripple. By using the common-mode–differential notation

$$V_{dd} = V_{CM} + V_D \quad (10)$$

$$V_{ss} = V_{CM} - V_D, \quad (11)$$

the voltage components are expressed as

$$V_{CM} = -j\omega \left[ I_{CM} \left( L + \Delta L \frac{2\omega^2 \Delta L C}{1 - 2\omega^2 LC} \right) + \frac{\Delta L I_D}{1 - 2\omega^2 LC} \right] \quad (12)$$

$$V_D = -j\omega \frac{\Delta L I_{CM} + L I_D}{1 - 2\omega^2 LC}. \quad (13)$$

In the general case, as shown in (12) and (13), both  $V_{CM}$  and  $V_D$  include components induced by both  $I_{CM}$  and  $I_D$ . Increasing  $C$  suppresses the entirety of  $V_D$  and the part of  $V_{CM}$  that is caused by  $I_D$ , provided that  $C > 1/(2\omega^2 L)$ . Thus, the part of  $V_{CM}$  induced by  $I_{CM}$  is the most difficult component to reduce. This component could theoretically be eliminated at a single frequency by choosing

$$C = \frac{L}{2\omega^2(L^2 - (\Delta L)^2)} \quad (14)$$

but given that  $\Delta L$  is typically much smaller than  $L$ , this value of  $C$  would be severely insufficient for suppressing the other voltage components.

Based on the preceding analysis, we conclude that, in practice, a large value of  $I_{CM}$  prevents effective suppression of supply and ground voltage ripple. Given that  $I_{CM}$  is defined by the sum of PA-unit output currents, it might seem that reducing this current always leads to lower output power. However, this is only true if all PA-unit output voltages are equal in both amplitude and phase, which does not need to be the case. For example, with two fully differential voltages, the output power only depends on the difference between the currents, not the sum. Thus, with a well-informed choice of power-combiner type, it may be possible to design a system in which the PA-unit output currents are fully or mostly differential, leading to a small  $I_{CM}$  even with a high output power.

In order to further examine the dependence between power-combiner type and  $I_{CM}$ , we will next derive the sum of PA-unit output currents with two common types of combiners: a voltage-adding transmission-line combiner [12]–[15] and a voltage-subtracting Marchand balun [11], [16]–[20]. This analysis uses arbitrary combiner input voltages, and thus, applies to all methods of signal construction.

The generic schematic of a voltage-adding transmission-line power combiner with  $N$  inputs is shown in Fig. 2. Assuming lossless transmission lines, the combiner input



Fig. 2. Generic schematic of a voltage-adding transmission-line power combiner.



Fig. 3. Schematic of a Marchand balun.

currents (i.e., PA output currents) are defined by the ABCD parameters of a quarter-wave line

$$\begin{bmatrix} V_i \\ I_i \end{bmatrix} = \begin{bmatrix} 0 & jZ_0 \\ j\frac{1}{Z_0} & 0 \end{bmatrix} \begin{bmatrix} V_{out} \\ I_{o,i} \end{bmatrix}, \quad i = 1, 2, \dots, N. \quad (15)$$

We can derive from the sum of combiner input currents that

$$I_{CM} = \frac{1}{2} \sum_{i=1}^N I_i = j \frac{N}{2Z_0} V_{out}. \quad (16)$$

When applied to (12), this means that the common-mode voltage ripple contains a component that is proportional to the combiner output voltage and cannot be suppressed by a large on-chip capacitance. Thus, when using this type of combiner, the aforementioned ripple component can only be reduced by decreasing the output power.

The Marchand balun, shown in Fig. 3, is a typical coupled-line structure used in voltage-subtracting power combiners. It consists of two pairs of coupled quarter-wave lines, which are here assumed to be lossless and symmetrical. The  $Y$ -parameter equations describing the operation, derived from the general-case equations of coupled lines [21], can be expressed as

$$\begin{bmatrix} I_{1+} \\ I_{2+} \\ I_{3+} \\ -I_L \end{bmatrix} = \begin{bmatrix} 0 & Y_{12} & 0 & Y_{14} \\ Y_{12} & 0 & Y_{14} & 0 \\ 0 & Y_{14} & 0 & Y_{12} \\ Y_{14} & 0 & Y_{12} & 0 \end{bmatrix} \begin{bmatrix} V_1 \\ 0 \\ V_m \\ V_{out} \end{bmatrix} \quad (17)$$

$$\begin{bmatrix} I_{1-} \\ I_{2-} \\ -I_{3+} \\ 0 \end{bmatrix} = \begin{bmatrix} 0 & Y_{12} & 0 & Y_{14} \\ Y_{12} & 0 & Y_{14} & 0 \\ 0 & Y_{14} & 0 & Y_{12} \\ Y_{14} & 0 & Y_{12} & 0 \end{bmatrix} \begin{bmatrix} V_2 \\ 0 \\ V_m \\ V_{oc} \end{bmatrix}. \quad (18)$$

First, we can determine the relation between  $V_{\text{out}}$  and  $V_{\text{oc}}$  from the two equations for  $I_{3+}$

$$V_{\text{oc}} = -V_{\text{out}}. \quad (19)$$

Based on this, we find that regardless of the input voltages, the balun input currents are

$$I_{1+} = Y_{14} V_{\text{out}} \quad (20)$$

$$I_{1-} = -Y_{14} V_{\text{out}}. \quad (21)$$

These equations show that the balun input currents are differential and their sum is always zero, even when the input voltages are not fully differential. Consequently,  $I_{CM}$  is inherently zero with such combiners, and all remaining voltage-ripple components in (12) and (13) can be indefinitely reduced by increasing the on-chip capacitance. Finally, from (20) and (21), we can derive the total input power of the balun

$$P_{\text{in}} = \frac{1}{2} \operatorname{Re}[V_1(Y_{14} V_{\text{out}})^*] + \frac{1}{2} \operatorname{Re}[V_2(-Y_{14} V_{\text{out}})^*] \quad (22)$$

$$P_{\text{in}} = \frac{1}{2} \operatorname{Re}[(V_1 - V_2)(Y_{14} V_{\text{out}})^*]. \quad (23)$$

This shows that despite the zero sum of input currents, the input power is not zero, provided that  $V_{\text{out}} \neq 0$  and  $(V_1 - V_2) \neq 0$ . These results apply to combiners consisting of any number of Marchand baluns whose output currents are summed at a single load.

We summarize the key findings of this section as follows.

- 1) The sum of PA-unit output currents defines the common-mode current at the PA supply and ground, which in turn causes the supply and ground ripple component that is not suppressed by on-chip capacitors. Thus, this sum of currents must be minimized to avoid excessive ripple.
- 2) The PA units can produce a large output power with a small or even zero-sum of output currents if the currents are mostly or fully differential. This indicates that the sum of currents can be minimized by using a voltage-subtracting power combiner that produces the maximum output power with differential voltages and currents.

To support this suggestion, we derived the input currents of two common types of transmission-line-based combiners with arbitrary input voltages, with the following results.

- 1) With a voltage-adding transmission-line combiner, the sum of currents is proportional to the output voltage.
- 2) With a voltage-subtracting Marchand-balun combiner, the currents are always differential, which means that their sum is zero regardless of output power.

Thus, we conclude that by utilizing a Marchand-balun-based power combiner and a sufficiently large on-chip bypass capacitor, we can minimize all components of the PA supply and ground ripple without needing to limit the output power.

### III. TRI-PHASING POWER AMPLIFIER

This section presents an overview of the integrated tri-phasing PA. The PA greatly informs the combiner design, and these circuits can be more meaningfully characterized as a



Fig. 4. (a) Block diagram of the designed tri-phasing PA system. (b) Time-domain example of the tri-phasing signal composition, showing a coarse-level transition (vertical line) with no instantaneous change in the combined waveform.

single entity, which is why this article covers the entire PA system. However, instead of repeating all previously published details of the tri-phasing concept [8] or the PA design [9], this section only summarizes the main points to provide context for the combiner work and the measurement results.

The designed tri-phasing PA system, as shown by the block diagram in Fig. 4(a), contains one always-active outphasing PA pair and a polar section of three PA pairs. The phase-modulated input signals are

$$S_0(t) = \cos(\omega t + \phi(t)) \quad (24)$$

$$S_1(t) = \cos(\omega t + \phi(t) + \theta(t)) \quad (25)$$

$$S_2(t) = \cos(\omega t + \phi(t) - \theta(t)) \quad (26)$$

where  $\phi(t)$  is the polar phase and  $\theta(t)$  is the outphasing angle. The polar section creates coarse-level amplitude modulation, and the outphasing pair covers the gaps between coarse levels with high resolution. The signal composition is illustrated by example waveforms in Fig. 4(b), where  $A_p$  is the number of active polar PA pairs. The waveforms illustrate the main advantage of tri-phasing—the smooth transitions between coarse levels, with no harmonic discontinuities. As demonstrated by analysis and simulations in [8], this can potentially improve the spectral performance compared with multilevel outphasing while maintaining the back-off efficiency benefit.

Fig. 5 shows the schematic of a single-PA unit featuring a cascaded class-D output stage. In order to switch the output stage ON and OFF according to the signal  $A$ , the XOR and NAND gates construct the desired signals in both “ON” and “OFF” states in a manner that allows constant bias voltages [6], [9]. This solution enables multilevel operation with a relatively high 3.6-V supply voltage in an integrated system, where all PA input signals are generated in the 1.0-V domain.

The output-stage transistor sizes were chosen such that their ON-resistances do not cause significant output-voltage



Fig. 5. Simplified schematic of the PA unit.



Fig. 6. Die micrograph of the PA.

saturation at the peak output power. Building on our analysis in [6], this means that the vast majority of power loss in the PA units occurs due to the charging and discharging of the relatively large drain–source capacitances of the output transistors. By assuming that other losses are negligible, we can, thus, derive a simple efficiency model in which the loss in each active PA unit,  $P_{loss}$ , is constant and independent of output power. According to this model, the total efficiency is

$$\eta = \frac{P_{out}}{P_{out} + 2(A_p + 1)P_{loss}} \quad (27)$$

which is equal between tri-phasing and multilevel outphasing at any value of  $P_{out}$ .

The PA was implemented in 28-nm CMOS as an integrated part of the tri-phasing transmitter presented in [10]. Fig. 6 shows the die micrograph of the PA. The shown die area, excluding pads, is  $1.31 \text{ mm}^2$ , of which the PA units occupy  $0.44 \text{ mm}^2$ . Each of the eight PA-unit outputs is connected to two bonding pads, which are directly wire bonded to the power-combiner inputs on the PCB. This configuration causes the main restriction on the combiner layout, requiring its inputs to be located within a relatively small area near the die.

#### IV. EXTENDED MARCHAND-BALUN POWER COMBINER

This section presents the analysis and design of the coupled-line power combiner, implemented as part of the tri-phasing PA. In this system, the combiner sets the limit for the range of carrier frequencies, which is why we chose to realize the combiner on the PCB. This arrangement enables using the same IC in different products by only changing the PCB, saving the expenses of fabricating several variants of the IC.



Fig. 7. Schematic of the extended Marchand balun. The load resistor is shown for illustrative purposes only and can be replaced by any nonopen circuit.

Isolating combiners, such as the Wilkinson combiner [22], are beneficial for linearity but inherently lossy at back off and, thus, discarded. Among nonisolating combiners, one of the simplest types to realize on PCB is the voltage-adding transmission-line combiner shown in Fig. 2, which is often accompanied by Chireix compensation for improved back-off efficiency [12]–[15]. However, we concluded in Section II that using such a combiner with a relatively high-power wire-bonded PA may cause excessive voltage ripple at supply and ground, which can degrade the PA performance or even inhibit proper operation. Thus, we chose to design a voltage-subtracting coupled-line combiner, which can also improve the output spectrum by canceling undesired common-mode signal components [23]. Voltage-subtracting transformer combiners have rarely been implemented on PCB [24], and due to their typical shape, designing the layout for eight inputs connected to a single IC would be difficult. In contrast, coupled lines are long but narrow, which enables placing several of them in a relatively narrow area. Therefore, we chose the Marchand balun, a widely used coupled-line structure, as the basis of our combiner development.

As mentioned in Section III, the integrated PA necessitates locating all combiner inputs within a narrow area. Moreover, the bonding wires are inevitably part of the combiner, functionally resembling short transmission lines. Under these circumstances, the Marchand balun in its original form (Fig. 3) would be difficult to implement. Therefore, we introduce the extended Marchand balun, shown in Fig. 7, which performs the same basic function while containing arbitrarily long input lines before the coupled-line section. The input lines can, thus, be utilized to increase the line spacing, which reduces unwanted coupling and creates sufficient space for dc-block capacitors. By including the input lines, we introduce a degree of freedom in designing the balun, which will reduce the limitations of layout design and, thus, expand the range of potential applications. The coupled-line length is reduced in a way that resembles the previously presented method of compensating for a connecting line between the two coupled-line sections [25]–[27]. However, the inclusion of the input lines is a distinct concept that addresses different practical needs, and it has not been thoroughly analyzed in earlier publications.

##### A. Analysis of the Extended Marchand Balun

In a voltage-subtracting power combiner with multiple pairs of inputs, the purpose of a balun is to produce an output current



Fig. 8. Coupled transmission lines as a four-port network.

that is proportional to the difference of its two input voltages and independent of the output voltage. Thus, the output currents can be summed by connecting all balun outputs to a single load resistor. In this section, we analyze the operation of a single extended Marchand balun and derive the rules by which it needs to be designed.

Fig. 7 shows the schematic of the extended Marchand balun. The original Marchand balun can be considered a special case of this circuit, in which the line lengths in radians are  $\beta_il_i = 0$  and  $\beta_c l_c = \pi/2$ , where  $\beta = 2\pi/\lambda$ . In this case, the current in any port of the quarter-wavelength coupled lines is defined by the voltages at the opposite end of the lines. Therefore, the zero current at the open circuit ( $V_{oc}$ ) consists of components induced by  $V_-$  and  $V_m$ , which must cancel each other, resulting in a  $V_m$  that is proportional to  $-V_-$ .  $V_+$  and  $V_m$  affect the output current in the same proportion as  $V_-$  and  $V_m$  affect the zero current, and thus,  $I_{out}$  is proportional to  $V_+ - V_-$ .

When input lines with nonzero length are included, quarter-wavelength coupled lines no longer result in the desired operation, and therefore, the circuit is analyzed with general line lengths. Now  $I_{out}$  depends not only on  $V_{1+} - V_{1-}$  but also on  $I_{1+} - I_{1-}$ , both of which are affected by input voltages and input currents. The goal in designing the balun is to choose line lengths so that the effect of  $I_{\pm}$  on  $V_{1\pm}$  and the effect of  $I_{\pm}$  on  $I_{1\pm}$  cancel each other in the output current, resulting in  $I_{out}$  depending only on  $V_+ - V_-$ .

The dependencies between the voltages and currents at both ends of a pair of coupled transmission lines, as shown in Fig. 8, are described by the  $Y$ -parameter equation

$$\begin{bmatrix} I_1 \\ I_2 \\ I_3 \\ I_4 \end{bmatrix} = \begin{bmatrix} Y_{11,c} & Y_{12,c} & Y_{13,c} & Y_{14,c} \\ Y_{21,c} & Y_{22,c} & Y_{23,c} & Y_{24,c} \\ Y_{31,c} & Y_{32,c} & Y_{33,c} & Y_{34,c} \\ Y_{41,c} & Y_{42,c} & Y_{43,c} & Y_{44,c} \end{bmatrix} \begin{bmatrix} V_1 \\ V_2 \\ V_3 \\ V_4 \end{bmatrix}. \quad (28)$$

The derivation of the  $Y$ -parameters of a generic coupled-line pair has been presented in [21] and will not be repeated here. For simplicity, the following analysis assumes TEM transmission lines, while the implemented lines operate in quasi-TEM mode. In addition, the input lines are assumed to be lossless. As such, the exact results may be slightly different from the realistic case, but the principles still apply.

The operation of the extended balun is described by two  $Y$ -parameter equations, both related to one input line and one pair of coupled lines

$$\begin{bmatrix} I_+ \\ I_{2+} \\ I_{3+} \\ -I_{out} \end{bmatrix} = \begin{bmatrix} Y_{11,e} & Y_{12,e} & Y_{13,e} & Y_{14,e} \\ Y_{21,e} & Y_{22,e} & Y_{23,e} & Y_{24,e} \\ Y_{31,e} & Y_{32,e} & Y_{33,e} & Y_{34,e} \\ Y_{41,e} & Y_{42,e} & Y_{43,e} & Y_{44,e} \end{bmatrix} \begin{bmatrix} V_+ \\ 0 \\ V_m \\ V_{out} \end{bmatrix} \quad (29)$$

$$\begin{bmatrix} I_- \\ I_{2-} \\ -I_{3+} \\ 0 \end{bmatrix} = \begin{bmatrix} Y_{11,e} & Y_{12,e} & Y_{13,e} & Y_{14,e} \\ Y_{21,e} & Y_{22,e} & Y_{23,e} & Y_{24,e} \\ Y_{31,e} & Y_{32,e} & Y_{33,e} & Y_{34,e} \\ Y_{41,e} & Y_{42,e} & Y_{43,e} & Y_{44,e} \end{bmatrix} \begin{bmatrix} V_- \\ 0 \\ V_m \\ V_{oc} \end{bmatrix}. \quad (30)$$

The analysis is kept generally applicable by examining the output current and voltage as two independent variables instead of assuming any specific dependence between them, such as the load resistor ( $R_L$ ), as shown in Fig. 7. In order to derive these  $Y$ -parameters, we apply the coupled-line  $Y$ -parameters in (28) and the ABCD-parameters of the input line

$$\begin{bmatrix} V_{1\pm} \\ I_{1\pm} \end{bmatrix} = \begin{bmatrix} A' & B' \\ C' & D' \end{bmatrix} \begin{bmatrix} V_{\pm} \\ I_{\pm} \end{bmatrix} \quad (31)$$

$$\begin{bmatrix} A' & B' \\ C' & D' \end{bmatrix} = \begin{bmatrix} \cos(\beta_il_i) & -jZ_{0i}\sin(\beta_il_i) \\ -j\frac{1}{Z_{0i}}\sin(\beta_il_i) & \cos(\beta_il_i) \end{bmatrix}. \quad (32)$$

By solving the currents, taking into account that  $Y_{mn,c} = Y_{nm,c}$ , the following equations are obtained:

$$Y_{11,e} = \frac{A'Y_{11,c} - C'}{D' - B'Y_{11,c}} \quad (33)$$

$$Y_{1m,e} = Y_{m1,e} = \frac{Y_{1m,c}}{D' - B'Y_{11,c}}, \quad m = 2, 3, 4 \quad (34)$$

$$Y_{mn,e} = \frac{B'Y_{1n,c}Y_{m1,c}}{D' - B'Y_{11,c}} + Y_{mn,c}, \quad m, n = 2, 3, 4. \quad (35)$$

In general,  $Y_{m1,c} = Y_{1m,c}$  [21], and thus, (34) and (35) show that the  $Y$ -parameter matrix is symmetric, i.e.,  $Y_{mn,e} = Y_{nm,e}$ . Hereafter,  $Y_{mn,e}$  and  $Y_{nm,e}$  are both expressed as  $Y_{mn,e}$ , where  $m < n$ .

Next, we derive the condition for the output current depending only on input voltages and not on input currents. From (29) and (30), we obtain the equation for the output current

$$I_{out} = \left( \frac{Y_{11,e}Y_{44,e}}{Y_{14,e}} - Y_{14,e} \right) (V_+ - V_-) - \frac{Y_{44,e}}{Y_{14,e}} (I_+ - I_-). \quad (36)$$

The coefficient of  $I_+ - I_-$  has to be zero, which is true if  $Y_{44,e} = 0$ . By using (35) to express  $Y_{44,e}$ , this leads to the general condition for the input-line length

$$\tan(\beta_il_i) = \frac{j}{Z_{0i}} \cdot \frac{Y_{44,c}}{Y_{11,c}Y_{44,c} - Y_{14,c}^2}. \quad (37)$$

Beginning from this point, the analysis is simplified by assuming that the coupled lines are lossless and symmetrical. Thus, the  $Y$ -parameters of the coupled lines are

$$Y_{11,c} = Y_{22,c} = Y_{33,c} = Y_{44,c} = -j \frac{(Y_e + Y_o)\cot(\beta_c l_c)}{2} \quad (38)$$

$$Y_{12,c} = Y_{21,c} = Y_{34,c} = Y_{43,c} = j \frac{Y_e + Y_o}{2\sin(\beta_c l_c)} \quad (39)$$

$$Y_{13,c} = Y_{24,c} = Y_{31,c} = Y_{42,c} = j \frac{(Y_o - Y_e)\cot(\beta_c l_c)}{2} \quad (40)$$

$$Y_{14,c} = Y_{23,c} = Y_{32,c} = Y_{41,c} = j \frac{Y_e - Y_o}{2\sin(\beta_c l_c)} \quad (41)$$

and the condition of (37) can be written as a function of the coupled-line length and admittances with the help of (38)

and (41)

$$\tan(\beta_il_i) = \frac{2}{Z_{0i}} \cdot \frac{(Y_e + Y_o) \sin(\beta_il_c) \cos(\beta_il_c)}{(Y_e - Y_o)^2 - (Y_e + Y_o)^2 \cos^2(\beta_il_c)}. \quad (42)$$

Having defined that  $Y_{44,e} = 0$ , we will finally derive the  $Y$ -parameters of the whole balun, describing  $I_+$ ,  $I_-$ , and  $-I_{\text{out}}$  as functions of  $V_+$ ,  $V_-$ , and  $V_{\text{out}}$ . This requires solving  $V_m$  and  $V_{\text{oc}}$  first. Two equations for  $V_m$  can be obtained, one from the sum of  $I_{3+}$  and  $-I_{3+}$ , and the other from the open-circuit zero current

$$V_m = -\frac{Y_{13,e}}{2Y_{33,e}}(V_+ + V_-) - \frac{Y_{34,e}}{2Y_{33,e}}(V_{\text{out}} + V_{\text{oc}}) \quad (43)$$

$$V_m = -\frac{Y_{14,e}}{Y_{34,e}}V_-.$$

From these two equations, we can derive

$$V_{\text{oc}} = -\frac{Y_{13,e}}{Y_{34,e}}V_+ + \frac{2Y_{14,e}Y_{33,e} - Y_{13,e}Y_{34,e}}{Y_{34,e}^2}V_- - V_{\text{out}}. \quad (45)$$

Now, we can write the equations for  $I_+$ ,  $I_-$ , and  $I_{\text{out}}$  in (29) and (30) by substituting  $V_m$  from (44) and  $V_{\text{oc}}$  from (45). After deriving from (38)–(41) that  $Y_{13,c}Y_{34,c} = Y_{14,c}Y_{33,c}$ , the resulting  $Y$ -parameter equation simplifies into

$$\begin{bmatrix} I_+ \\ I_- \\ -I_{\text{out}} \end{bmatrix} = \begin{bmatrix} Y_{11,e} & -\frac{Y_{13,e}Y_{14,e}}{Y_{34,e}} & Y_{14,e} \\ -\frac{Y_{13,e}Y_{14,e}}{Y_{34,e}} & Y_{11,e} & -Y_{14,e} \\ \frac{Y_{13,e}Y_{14,e}}{Y_{34,e}} & -Y_{14,e} & 0 \end{bmatrix} \begin{bmatrix} V_+ \\ V_- \\ V_{\text{out}} \end{bmatrix}. \quad (46)$$

This leads to the equations for the output current and the sum of the input currents:

$$I_{\text{out}} = -Y_{14,e}(V_+ - V_-) \quad (47)$$

$$I_+ + I_- = \left( Y_{11,e} - \frac{Y_{13,e}Y_{14,e}}{Y_{34,e}} \right) (V_+ + V_-).$$

In the special case that the input voltages are fully differential and the output is connected to a load resistance  $R_L$ , the impedance seen at each input is

$$Z_{\text{in}} = \frac{1}{Y_{11,e} + Y_{13,e}Y_{14,e}/Y_{34,e} - 2R_L Y_{14,e}^2}. \quad (49)$$

The output current is proportional to the difference between the input voltages, as seen in (47). This is the most important result of this analysis, as it demonstrates that the extended Marchand balun performs its intended function regardless of the input-line length as long as the condition of (42) is met. The input-current behavior is more complex and is therefore illustrated graphically in Fig. 9. In this example,  $Z_{0i} = 50 \Omega$ ,  $\beta_il_i = 28.5^\circ$ ,  $Z_e = 125 \Omega$ ,  $Z_o = 20 \Omega$ ,  $\beta_{clc} = 70^\circ$ , and  $R_L = 50 \Omega$ . The magnitude of both input voltages is  $V_{\text{in}} = 1V$ , and their phase difference is defined by the outphasing angle  $\theta$  as

$$V_{\pm} = \pm V_{\text{in}} e^{\pm j\theta}. \quad (50)$$

The results show that the magnitudes of the input currents are not equal and their phase difference is not precisely  $180^\circ$  unless the input voltages are purely differential, i.e., the



Fig. 9. (a) Magnitude and (b) phase of the input currents of an ideal extended Marchand balun in outphasing.



Fig. 10. Sum of combiner input currents in tri-phasing with (a) an extended Marchand-balun combiner and (b) a transmission-line combiner with equal output power.

outphasing angle is zero. This is the most notable effect caused by the additional input lines, compared with the conventional Marchand balun. The shorter the input lines are, the closer to differential the input currents become at all outphasing angles. The mismatch between currents does not directly affect the combiner output, but in a realistic case, it can have an indirect effect by changing the PA-unit output voltages due to nonzero transistor ON-resistances. However, the effect on linearity is minor, provided that the PA is designed to tolerate sufficiently large currents without a significant effect on the output voltage. As a general design guideline, an excessive input-line length should nonetheless be avoided, because the mismatch also leads to a nonzero sum of PA-unit output currents and, thus, common-mode supply and ground ripple.

Finally, Fig. 10(a) shows the sum of combiner input currents in tri-phasing operation, using a power combiner consisting of four ideal extended Marchand baluns with previously stated example parameter values and input-voltage magnitudes. While individual input currents up to 157 mA appear, they are fully differential in the polar section and mostly differential in the outphasing pair such that the sum of all currents is at most 10.3 mA. For comparison, Fig. 10(b) shows the equivalent results with a voltage-adding transmission-line combiner (Fig. 2) that produces an equal output voltage. This comparison shows that although the total PA-unit output current is not precisely zero with extended Marchand baluns, its maximum value is only 0.8% of the maximum with an equivalent transmission-line combiner. Thus, with regard to supply and ground voltage ripple, the effect of the input lines is insignificant compared with the advantage over a voltage-adding combiner.

To analyze the frequency-dependent effects of the input lines, Fig. 11 compares the key characteristics of the extended Marchand balun, based on the output-current



Fig. 11. Calculated balun characteristics as a function of frequency, based on (36). (a) Input-voltage-to-output-current transadmittance. (b) Input-current-to-output-current multiplier. The design-parameter values are shown in Table I.

TABLE I  
DESIGN PARAMETERS USED IN THE FREQUENCY-DOMAIN COMPARISON

| $\beta_i l_i$  | $0^\circ$  | $30^\circ$    | $60^\circ$    |
|----------------|------------|---------------|---------------|
| $Z_{0i}$       | —          | $39.8 \Omega$ | $42.0 \Omega$ |
| $\beta_{cl,c}$ | $90^\circ$ | $72.4^\circ$  | $56.4^\circ$  |



Fig. 12. Schematic of the designed power combiner.

equation (36), in the three example cases. Fig. 11(a) shows the absolute value of the transadmittance of  $(V_+ - V_-)$ , and 11(b) shows the multiplier of  $(I_+ - I_-)$ , both as a function of frequency. In all compared cases,  $Z_e = 125\ \Omega$  and  $Z_o = 20\ \Omega$ . Three different values are chosen for  $\beta_i l_i$ , including  $0^\circ$ , which corresponds to a conventional Marchand balun. The remaining variables are shown in Table I, defined such that  $Y_{14,e}$  at the center frequency is equal between all cases. Based on the results in Fig. 11, the bandwidth effect of the input lines is minor, especially if the lines are relatively short.

### B. Power-Combiner Design

Fig. 12 shows the schematic of the designed power combiner, which consists of four extended Marchand baluns with dc-block capacitors between the input lines and the coupled lines, and an output line with a  $50\ \Omega$  characteristic impedance connected to the load. As indicated by the analysis in Section IV-A, the output-current behavior of each balun is independent of its effective load impedance, which is  $200\ \Omega$  at the peak output power. The layout is presented in Fig. 13, including the top metal layer in Fig. 13(a), the middle layer in Fig. 13(b), and the vias connecting these two layers to the ground plane below them [light-gray circles in Fig. 13(a) and (b)]. The photograph of the fabricated power combiner on PCB is shown in Fig. 14(a). The input lines are



Fig. 13. Layout of the designed power combiner. (a) Top layer. (b) Middle layer.



Fig. 14. Photographs of (a) power combiner on PCB and (b) bonding wires connecting the PA to the combiner.

implemented as equally long microstrip lines with the middle layer used as the ground plane, and they are followed by surface-mounted  $10\text{-pF}$  dc-block capacitors. The coupled-line section is realized with broadside-coupled lines, with coupling between the baluns reduced by ground lines between them on both layers. The balun outputs are connected together on the middle layer, and the combined signal is brought to the SMA connector with a coplanar waveguide.

The layout design was finalized using momentum for electromagnetic (EM) simulations, and the resulting  $S$ -parameters were utilized to simulate the combiner together with the PA. The 16 bonding wires connecting the PA directly to the combiner, as shown in Fig. 14(b), were included in the simulations



Fig. 15. (a) Verification of the theoretical design rules by comparing the left and right side of (37). (b) Calculated input-line length as a function of desired operation frequency.

by using the model included in Keysight ADS [28], which considers the length and approximate shape of the wires and also models mutual coupling between them. The capacitors were modeled by *S*-parameters provided by the manufacturer. In the designed combiner, the input-line width and length are 0.2 and 7.0 mm, respectively, and the coupled-line width and length are 0.25 and 17.2 mm, respectively. To examine the effects of undesired coupling, the combiner was also simulated without the grounded lines between baluns, showing a maximum output-power mismatch of 0.38 dB between two baluns when ideal voltage sources are used. When the grounded lines are included, the mismatch is reduced to 0.17 dB.

In order to verify the theoretical design rules, we calculated the characteristic impedance of the input lines ( $45.1\ \Omega$ ) with ADS and obtained the *Y*-parameters of a single coupled-line pair with an EM simulation. With the help of the bonding-wire model, we also determined that the average bonding-wire pair can be approximated as a 3.4-mm extension of the input line. Based on these results and the actual line lengths, Fig. 15(a) compares the left-hand and right-hand sides of (37), which should be equal to ensure the desired operation. The imaginary part of the right-hand side is negligible, and its real part crosses  $\tan(\beta_i l_i)$  at 1.78 GHz, which is very close to the desired center frequency of 1.8 GHz. Fig. 15(b) shows the calculated dependence between the input-line length and the operation frequency based on (37) and the simulated line characteristics. The dashed line shows the direct result of the calculation, and the solid line represents the actual line length without the 3.4-mm extension modeling the bonding wires.

## V. MEASUREMENT RESULTS

All measurements presented in this section were conducted using the on-chip phase modulators [10] to generate the PA input signals. The 1.0-V supply domain is shared with the modulators, which dominate the power consumption in that domain. Therefore, the power-consumption figures used in PA-efficiency calculations include all circuitry in the 1.8- and 3.6-V supply domains but not the XOR gates. The presented output-power figures are measured at the combiner output.

### A. Continuous-Wave Measurements

Fig. 16 shows the peak output power and corresponding PA efficiency as a function of frequency in continuous-wave



Fig. 16. CW measurements of (a) output power and (b) PA efficiency as a function of frequency.



Fig. 17. CW measurement results at 1.7 GHz. (a) Efficiency as a function of output power. (b) Output power as a function of outphasing angle.

(CW) operation. The dependence between output power and frequency, as shown in Fig. 16(a), demonstrates the power-combiner frequency response, which is the primary limiting factor for the operation frequency. The highest output power of 29.7 dBm (0.93 W) is achieved at 1.77 GHz, with an efficiency of 34.7%. This frequency matches well with the value calculated from (37) and shown in Fig. 15 (1.78 GHz). The 3-dB bandwidth is 476 MHz, ranging between 1.48–1.96 GHz (fractional bandwidth 28%) as illustrated by the dotted lines. Within this range, the PA efficiency remains above 19.8%, as shown in Fig. 16(b). The bandwidth is considerably wider than what was seen in simulations, while the peak output power is lower. Although this resembles the simulated effects of changing the characteristic impedances, we were not able to confirm such inaccuracies in the PCB manufacturing. On the other hand, the measured efficiency is higher than simulated. This is at least partly explained by power loss on IC, which in simulations is larger than the total measured power loss of the IC and the combiner. The measured insertion loss of the combiner without bonding wires is 1.0 dB at the optimal operating frequency. For subsequent measurements, we selected the center frequency of 1.7 GHz, which is near the middle of the 3-dB band.

Tri-phasing operation is demonstrated with CW signals in Fig. 17. The PA efficiency as a function of output power is shown in Fig. 17(a), comparing the results in outphasing and tri-phasing modes. Fig. 17(a) shows that the back-off efficiency is significantly improved by tri-phasing, being at most  $3.9 \times$  the efficiency with outphasing at the same output power. The measurements are also compared with the results of the constant-loss efficiency model in (27), where the PA-unit power loss of 228 mW is chosen to match with the measured peak output power and efficiency. The model is unrelated to



Fig. 18. Simulated output voltages and currents of the outphasing PA pair with  $A_p = 3$ ,  $\theta = 90^\circ$ , illustrating the cause of output-power discrepancy between theory and measurements.

the simulation results, as shown in Fig. 16. Close agreement between the model and measured efficiency confirms the validity of the model and supports the claim of equal efficiency between tri-phasing and multilevel outphasing.

Fig. 17(b) shows the output power as a function of the outphasing angle at all four coarse amplitude levels. The theoretical output power

$$P_{\text{out}} = \frac{P_{\text{peak}}}{16} (A_p + \cos \theta)^2 \quad (51)$$

is also shown for comparison, with a peak value equal to measurements. The minimum output power of each coarse level is slightly lower than the maximum output power of the previous level, which means that there are no gaps in achievable levels of output power. This result differs from (51), in which switching an additional polar PA pair on and changing the outphasing angle from  $0^\circ$  to  $90^\circ$  has no effect on the output power. The cause of this nonideality is shown in Fig. 18, which shows simulated waveforms of the outphasing PA pair with a  $90^\circ$  outphasing angle while all polar PA pairs are active. Due to the nonisolating power combiner, rising and falling edges under these circumstances occur when the output current is positive in one PA unit and negative in the other. This causes a mismatch in rise and fall times, leading to a nonzero contribution to the combined output voltage and reduced output power. Nevertheless, the overlap between coarse levels is significantly smaller than in multilevel outphasing, where an outphasing angle of  $90^\circ$  always corresponds to a zero output power. Furthermore, this nonideality has been taken into account in the following measurements by performing signal-component separation according to the measured minimum and maximum output powers at each coarse level.

### B. Measurements With Modulated Signals

The modulation performance of the PA was evaluated with 64-QAM Long Term Evolution (LTE) signals at a carrier frequency of 1.7 GHz. Apart from using measured CW power figures in signal-component separation, no digital predistortion was used in any of the presented measurements. Fig. 19 shows the output spectra with 20-MHz LTE and 60-MHz aggregated LTE signals. In these measurements, the peak-to-average power ratio (PAPR) is reduced with iterative clipping and filtering, which increases the output power with virtually no effect on the ACLR, while gradually increasing the EVM.



Fig. 19. Measured spectra and constellations of (a) 20-MHz LTE and (b) 60-MHz aggregated LTE signals. The PAPR has been limited to maximize output power and efficiency within the 8% EVM specification.



Fig. 20. Measured spectrum and constellations of a 100-MHz aggregated LTE signal.



Fig. 21. Wide-span spectra of 20-MHz LTE and 60-MHz aggregated LTE signals.

To maximize output power and efficiency, we chose the lowest PAPR that maintains an EVM of at most 8% in each carrier, in accordance with LTE specifications. With a 20-MHz bandwidth, the PA achieves an average output power of 23.1 dBm with an efficiency of 15.3% and an ACLR of  $-37.6$  dBc. By increasing the PAPR, an EVM of 2.4% is achieved at an output power of 21.4 dBm and efficiency of 12.3%. With a 60-MHz bandwidth and an 8% EVM, the average output power is 22.4 dBm, the efficiency is 13.8%, and the ACLR is  $-32.5$  dBc. The PA is also demonstrated to operate with a 100-MHz aggregated LTE signal in Fig. 20, with an output power of 19.0 dBm and efficiency of 9.0%.

The wide-span spectra of the 20-MHz and 60-MHz signals are shown in Fig. 21. The small bumps on both sides of the

TABLE II  
COMPARISON OF MEASUREMENT RESULTS

|                                                    | JSSC [29]        | JSSC [30]         | ESSCIRC [31]      | JSSC [32]     | JSSC [33]         | JSSC [34]    | RFIC [35]         | This work                      |
|----------------------------------------------------|------------------|-------------------|-------------------|---------------|-------------------|--------------|-------------------|--------------------------------|
| <b>CMOS tech. (nm)</b>                             | 32               | 45                | 45                | 90            | 65                | 55           | 65                | <b>28</b>                      |
| <b>Architecture</b>                                | Outphasing       | Multil. outph.    | Outphasing        | Polar         | Polar             | Polar        | IQ cell sharing   | <b>Tri-phasing</b>             |
| <b>PA class</b>                                    | Class D          | Class D           | Class E           | Switched-cap. | Inv. class D      | Inv. class D | Class G           | <b>Class D</b>                 |
| <b>Power combiner</b>                              | On-chip          | On-chip           | PCB               | —             | On-chip           | —            | On-chip           | PCB                            |
| <b>Combiner loss (dB)</b>                          | 1.3 <sup>1</sup> | 1.4 <sup>1</sup>  | N/A               | —             | 1.5 <sup>1</sup>  | —            | N/A               | <b>1.0</b>                     |
| <b>Peak <math>P_{out}</math> (dBm)</b>             | 25.3             | 31.5              | 31.6              | 25.2          | 23.3              | 21.9         | 30.1              | <b>29.7</b>                    |
| <b>Peak efficiency (%)</b>                         | 35               | 27                | 43.7              | 45            | 43                | 41           | 37.0              | <b>34.7</b>                    |
| <b>Carrier freq. (GHz)</b>                         | 2.4              | 2.4               | 2.4               | 2.25          | 2.2               | 2.08         | 2.2               | <b>1.7</b>                     |
| <b>64-QAM signal</b>                               | WiFi             | WiFi              | LTE               | IEEE 802.11g  | IEEE 802.11g      | LTE          | IEEE 802.11g      | <b>LTE</b>                     |
| <b>Mod. <math>P_{out}</math> (dBm)<sup>2</sup></b> | 19.6             | 24.8 <sup>3</sup> | 25.6 <sup>4</sup> | 17.7          | 16.8 <sup>5</sup> | N/A          | 19.5 <sup>6</sup> | <b>23.1 / 21.4<sup>7</sup></b> |
| <b>PAPR (dB)<sup>2</sup></b>                       | 5.7              | 6.7 <sup>3</sup>  | 6.0 <sup>4</sup>  | 7.5           | 6.5 <sup>5</sup>  | N/A          | 10.6 <sup>6</sup> | <b>6.6 / 8.3<sup>7</sup></b>   |
| <b>Mod. efficiency (%)<sup>2</sup></b>             | 21.8             | 16 <sup>3</sup>   | 20.1 <sup>4</sup> | 27            | 21.8 <sup>5</sup> | N/A          | 14.7 <sup>6</sup> | <b>15.3 / 12.3<sup>7</sup></b> |
| <b>ACLR (dBc)<sup>2</sup></b>                      | N/A              | N/A               | -53               | N/A           | N/A               | -35.4        | N/A               | <b>-37.6</b>                   |
| <b>Min. EVM (%)<sup>2</sup></b>                    | 5.6              | 3.5               | N/A               | 2.6           | 2.3               | 2.5          | 0.7               | <b>2.4</b>                     |
| <b>Predistortion</b>                               | Yes              | No                | Yes               | Yes           | Yes               | Yes          | Yes               | <b>No</b>                      |
| <b>Max. BW (MHz)</b>                               | 20               | 20                | 40                | 20            | 20                | 40           | 20                | <b>100</b>                     |

<sup>1</sup> Simulated      <sup>2</sup> Bandwidth = 20 MHz; power and efficiency results at minimum EVM unless noted otherwise

<sup>3</sup> EVM = 5.6%      <sup>4</sup> EVM not reported      <sup>5</sup> EVM = 4%      <sup>6</sup> EVM = 0.9%      <sup>7</sup> EVM = 8% / 2.4%

signal band can be explained by supply ringing arising from sudden current-consumption changes at coarse amplitude-level transitions, which results in some unwanted amplitude modulation [30]. In future work, this ringing could be suppressed, for example, with damping legs [36]. Phase-modulator nonidealities are another source of noise near the signal band [10]. In PA simulations of a 20-MHz LTE signal without the aforementioned effects, tri-phasing improves the ACLR by 3.0 dB and reduces the noise at a 40-MHz offset from the signal band (ACLR2) by 9.1 dB compared with multilevel outphasing. Both spectra in Fig. 21 display a peak around 460 MHz, which is a local maximum in noise level caused by the power-combiner frequency response. While the second harmonic is clearly visible, the third harmonic has been considerably attenuated, although the combiner does not contain any structures specifically intended for harmonic filtering.

Table II summarizes the presented measurement results and compares them with previously published CMOS PAs that integrate all of their PA units on a single die. While many of them report a higher efficiency than our PA, most of these ones either have a significantly lower output power or use PA classes that are generally more efficient than class-D but feature other limitations. Some of them also use flip-chip packaging [29], [31] or probing at the output [32], which reduces losses compared with wire bonding. The insertion loss of our combiner is lower than the simulated values reported by other compared articles, which is expected due to the off-chip implementation but nonetheless indicates viability for high-efficiency systems. The most outstanding result of our circuit is the widest reported signal bandwidth of 100 MHz, in contrast to other compared PAs only presenting results at bandwidths up to 20 or 40 MHz. The EVM of 2.4% is also among the best figures reported with polar and outphasing-based techniques. Our results were achieved without predistortion, which is used by most of the compared circuits.

## VI. CONCLUSION

In this article, we have presented an eight-unit class-D tri-phasing PA implemented in 28-nm CMOS, with a coupled-line power combiner on the PCB. The PA utilizes tri-phasing modulation, a newly developed technique that eliminates the inherent harmonic discontinuities of multilevel outphasing, thus potentially improving the spectral performance without compromising the back-off efficiency. Each PA unit contains a cascoded output stage supporting a 3.6-V supply voltage, and ON/OFF logic circuitry that enables multilevel operation with low-voltage input signals.

The essential outcome of the analysis presented in this article is that the choice of power-combiner type is critical in terms of minimizing voltage ripple at the PA supply and ground. Accordingly, we chose to design a voltage-subtracting coupled-line combiner instead of a simpler voltage-adding one. In order to enable PCB implementation with eight narrowly spaced inputs, as dictated by the integrated PA, we developed and analyzed the extended Marchand balun, which forms the basis of the combiner. This new coupled-line structure with arbitrarily long input lines introduces an additional degree of freedom to the balun layout, which expands the available applications and, thus, enables the advantages of a voltage-subtracting combiner in the context of this article.

The measurement results of the complete PA system demonstrate a peak output power of 29.7 dBm at 1.77 GHz, with an efficiency of 34.7%. Operation with modulated signals is also verified at bandwidths up to 100 MHz, which is the widest reported bandwidth among similar PAs and demonstrates the suitability of tri-phasing for wideband operation.

## REFERENCES

- [1] H. Chireix, "High power outphasing modulation," *Proc. IRE*, vol. 23, no. 11, pp. 1370–1392, Nov. 1935.
- [2] D. Cox, "Linear amplification with nonlinear components," *IEEE Trans. Commun.*, vol. COMM-22, no. 12, pp. 1942–1945, Dec. 1974.

- [3] Y.-J. Chen, K.-Y. Jheng, A.-Y. Wu, H.-W. Tsao, and B. Tzeng, "Multilevel LINC system design for wireless transmitters," in *Proc. Int. Symp. VLSI Des., Automat. Test (VLSI-DAT)*, Apr. 2007, pp. 1–4.
- [4] K.-Y. Jheng, Y.-J. Chen, and A.-Y. Wu, "Multilevel LINC system designs for power efficiency enhancement of transmitters," *IEEE J. Sel. Topics Signal Process.*, vol. 3, no. 3, pp. 523–532, Jun. 2009.
- [5] J. Lemberg *et al.*, "Digital interpolating phase modulator for wideband outphasing transmitters," *IEEE Trans. Circuits Syst. I, Reg. Papers*, vol. 63, no. 5, pp. 705–715, May 2016.
- [6] M. Martelius *et al.*, "Class D CMOS power amplifier with on/off logic for a multilevel outphasing transmitter," in *Proc. IEEE Int. Symp. Circuits Syst. (ISCAS)*, May 2016, pp. 710–713.
- [7] M. Kosunen *et al.*, "A 0.35-to-2.6 GHz multilevel outphasing transmitter with a digital interpolating phase modulator enabling up to 400 MHz instantaneous bandwidth," in *IEEE Int. Solid-State Circuits Conf. (ISSCC) Dig. Tech. Papers*, Feb. 2017, pp. 224–225.
- [8] J. Lemberg *et al.*, "Tri-phasing modulation for efficient and wideband radio transmitters," *IEEE Trans. Circuits Syst. I: Reg. Papers*, vol. 65, no. 9, pp. 3085–3098, Sep. 2018.
- [9] M. Martelius *et al.*, "A 30-dBm Class-D power amplifier with on/off logic for an integrated tri-phasing transmitter in 28-nm CMOS," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2018, pp. 136–139.
- [10] J. Lemberg *et al.*, "A 1.5–1.9-GHz all-digital tri-phasing transmitter with an integrated multilevel Class-D power amplifier achieving 100-MHz RF bandwidth," *IEEE J. Solid-State Circuits*, vol. 54, no. 6, pp. 1517–1527, Jun. 2019.
- [11] N. Marchand, "Transmission line conversion transformers," *Electronics*, vol. 17, no. 12, pp. 142–144, Dec. 1944.
- [12] J. Hur *et al.*, "A multilevel Class-D CMOS power amplifier for an outphasing transmitter with a nonisolated power combiner," *IEEE Trans. Circuits Syst. II: Exp. Briefs*, vol. 63, no. 7, pp. 618–622, Jul. 2016.
- [13] T. W. Barton, A. S. Jurkov, P. H. Pednekar, and D. J. Perreault, "Multi-way lossless outphasing system based on an all-transmission-line combiner," *IEEE Trans. Microw. Theory Techn.*, vol. 64, no. 4, pp. 1313–1326, Apr. 2016.
- [14] T. Hwang, K. Azadet, R. S. Wilson, and J. Lin, "Nonlinearity modeling of a chireix outphasing power amplifier," *IEEE Trans. Circuits Syst. I: Reg. Papers*, vol. 62, no. 12, pp. 2898–2907, Dec. 2015.
- [15] T.-P. Hung, D. K. Choi, L. E. Larson, and P. M. Asbeck, "CMOS outphasing Class-D amplifier with Chireix combiner," *IEEE Microw. Wireless Compon. Lett.*, vol. 17, no. 8, pp. 619–621, Aug. 2007.
- [16] H. Jia, B. Chi, L. Kuang, and Z. Wang, "A W-band power amplifier utilizing a miniaturized marchand balun combiner," *IEEE Trans. Microw. Theory Techn.*, vol. 63, no. 2, pp. 719–725, Feb. 2015.
- [17] S. Muralidharan, K. Wu, and M. Hella, "A compact low loss single-ended to two-way differential power divider/combiner," *IEEE Microw. Wireless Compon. Lett.*, vol. 25, no. 2, pp. 103–105, Feb. 2015.
- [18] H.-C. Park, S. Daneshgar, Z. Griffith, M. Urteaga, B.-S. Kim, and M. Rodwell, "Millimeter-wave series power combining using sub-quarter-wavelength baluns," *IEEE J. Solid-State Circuits*, vol. 49, no. 10, pp. 2089–2102, Oct. 2014.
- [19] A. N. Stameroff, H. H. Ta, A.-V. Pham, and R. E. Leoni, III, "Widebandwidth power-combining and inverse Class-F GaN power amplifier at X-band," *IEEE Trans. Microw. Theory Techn.*, vol. 61, no. 3, pp. 1291–1300, Mar. 2013.
- [20] M. P. van der Heijden, M. Acar, J. S. Vromans, and D. A. Calvillo-Cortes, "A 19W high-efficiency wide-band CMOS-GaN Class-E Chireix RF outphasing power amplifier," in *IEEE MTT-S Int. Microw. Symp. Dig.*, Jun. 2011, pp. 1–4.
- [21] R. K. Mongia, I. J. Bahl, and P. Bhartia, *RF and Microwave Coupled-Line Circuits*. Norwood, MA, USA: Artech House, 2007.
- [22] E. J. Wilkinson, "An *N*-way hybrid power divider," *IRE Trans. Microw. Theory Techn.*, vol. 8, no. 1, pp. 116–118, Jan. 1960.
- [23] M. Martelius *et al.*, "Spectral effects of discrete-time amplitude levels in digital-intensive wideband radio transmitters," in *Proc. IEEE Int. Symp. Circuits Syst. (ISCAS)*, May 2018, pp. 1–5.
- [24] J. Jang, C. Park, H. Kim, and S. Hong, "A CMOS RF power amplifier using an off-chip transmission line transformer with 62% PAE," *IEEE Microw. Wireless Compon. Lett.*, vol. 17, no. 5, pp. 385–387, May 2007.
- [25] C.-H. Lin, C.-H. Wu, G.-T. Zhou, and T.-G. Ma, "General compensation method for a marchand balun with an arbitrary connecting segment between the balance ports," *IEEE Trans. Microw. Theory Techn.*, vol. 61, no. 8, pp. 2821–2830, Aug. 2013.
- [26] G. Yang, Z. Wang, Z. Li, Q. Li, and F. Liu, "Balance-compensated asymmetric marchand baluns on silicon for MMICs," *IEEE Microw. Wireless Compon. Lett.*, vol. 24, no. 6, pp. 391–393, Jun. 2014.
- [27] I. Piekarz, J. Sorocki, S. Gruszcynski, and K. Wincza, "Input match and output balance improvement of marchand balun with connecting line," *IEEE Microw. Wireless Compon. Lett.*, vol. 24, no. 10, pp. 683–685, Oct. 2014.
- [28] A. L. Nazarian, L. F. Tiemeijer, D. L. John, J. A. van Steenwijk, M. de Langen, and R. M. T. Pijper, "A physics-based causal bond-wire model for RF applications," *IEEE Trans. Microw. Theory Techn.*, vol. 60, no. 12, pp. 3683–3692, Dec. 2012.
- [29] H. Xu, Y. Palaskas, A. Ravi, M. Sajadieh, M. A. El-Tanani, and K. Soumyanath, "A flip-chip-packaged 25.3 dBm Class-D outphasing power amplifier in 32 nm CMOS for WLAN application," *IEEE J. Solid-State Circuits*, vol. 46, no. 7, pp. 1596–1605, Jul. 2011.
- [30] W. Tai *et al.*, "A transformer-combined 31.5 dBm outphasing power amplifier in 45 nm LP CMOS with dynamic power control for back-off power efficiency enhancement," *IEEE J. Solid-State Circuits*, vol. 47, no. 7, pp. 1646–1658, Jul. 2012.
- [31] A. Banerjee, L. Ding, and R. Hezar, "High efficiency multi-mode outphasing RF power amplifier in 45nm CMOS," in *Proc. 41st Eur. Solid-State Circuits Conf. (ESSCIRC)*, Sep. 2015, pp. 168–171.
- [32] S.-M. Yoo, J. S. Walling, E. C. Woo, B. Jann, and D. J. Allstot, "A switched-capacitor RF power amplifier," *IEEE J. Solid State Circuits*, vol. 46, no. 12, pp. 2977–2987, Dec. 2011.
- [33] L. Ye, J. Chen, L. Kong, E. Alon, and A. M. Niknejad, "Design considerations for a direct digitally modulated WLAN transmitter with integrated phase path and dynamic impedance modulation," *IEEE J. Solid-State Circuits*, vol. 48, no. 12, pp. 3160–3177, Dec. 2013.
- [34] Q. Zhu *et al.*, "A digital polar transmitter with DC-DC converter supporting 256-QAM WLAN and 40-MHz LTE-A carrier aggregation," *IEEE J. Solid-State Circuits*, vol. 52, no. 5, pp. 1196–1209, May 2017.
- [35] S.-W. Yoo, S.-C. Hung, and S.-M. Yoo, "A 1W quadrature Class-G switched-capacitor power amplifier with merged cell switching and linearization techniques," in *Proc. IEEE Radio Freq. Integr. Circuits Symp. (RFIC)*, Jun. 2018, pp. 124–127.
- [36] P. A. Godoy, S. Chung, T. W. Barton, D. J. Perreault, and J. Dawson, "A 2.4-GHz, 27-dBm asymmetric multilevel outphasing power amplifier in 65-nm CMOS," *IEEE J. Solid-State Circuits*, vol. 47, no. 10, pp. 2372–2384, Oct. 2012.



**Mikko Martelius** (S'16–M'19) was born in Laitila, Finland, in 1988. He received the B.Sc., M.Sc. (Hons.), and D.Sc. degrees in electrical engineering from Aalto University, Espoo, Finland, in 2012, 2015, and 2019, respectively.

From 2013 to 2019, he was with the Department of Electronics and Nanoengineering, Aalto University, where his research focused on switch-mode power amplifiers, digital-intensive transmitters, and frequency synthesizers. Since 2019, he has been an RFIC Design Engineer with CoreHW, Helsinki, Finland, where he is currently involved in mm-wave 5G transceivers.



**Kari Stadius** (S'95–M'03) received the M.Sc., Lic.Tech., and D.Sc. degrees in electrical engineering from the Helsinki University of Technology, Espoo, Finland, in 1994, 1997, and 2010, respectively.

He is currently a Staff Scientist with the Department of Electronics and Nanoengineering, School of Electrical Engineering, Aalto University, Espoo. He has authored or coauthored over 100 refereed journal articles and conference papers in the areas of analog and RF circuit design. His current research interests include the design and analysis of RF transceiver blocks with special emphasis on frequency synthesis.



**Jerry Lemberg** received the M.Sc. and D.Sc. degrees from Aalto University, Espoo, Finland, in 2013 and 2018, respectively.

From 2010 to 2018, he was with the Department of Electronics and Nanoengineering, Aalto University. Since 2018, he has been a Design Engineer with LG Electronics, Turku, Finland. His current research interests include digital-intensive and high-data-rate RF transceivers.



**Lauri Anttila** (S'06–M'11) received the M.Sc. and the D.Sc. (Hons.) degrees in electrical engineering from the Tampere University of Technology (TUT), Tampere, Finland, in 2004 and 2011, respectively.

Since 2016, he has been a University Researcher with the Department of Electrical Engineering, Tampere University (formerly TUT). From 2016 to 2017, he was a Visiting Research Fellow with the Department of Electronics and Nanoengineering, Aalto University, Finland. He has coauthored over 100 refereed articles and three book chapters. His

current research interests include signal processing for wireless communications, transmitter and receiver linearization, and radio implementation challenges in 5G cellular radio, full-duplex radio, and large-scale antenna systems.



**Enrico Roverato** was born in Padua, Italy, in 1988. He received the B.Sc. degree *cum laude* in information engineering from the University of Padua, Padua, in 2010, and the M.Sc. and D.Sc. degrees (Hons.) in electrical engineering from Aalto University, Espoo, Finland, in 2012 and 2017, respectively.

From 2012 to 2018, he was with the Department of Electronics and Nanoengineering, Aalto University, where he was involved in research on high-speed DSP algorithms for all-digital RF transmitters. He is currently a Senior Digital IC Designer with CoreHW,

Tampere, Finland. He is also a Consultant with Huawei Technologies, Helsinki, Finland, where he is involved in the next-generation 5G cellular receivers.



**Tero Nieminen** received the M.Sc. degree in electrical engineering from the Helsinki University of Technology, Espoo, Finland, in 2007, and the Lic.Tech. and D.Sc. degrees in electrical engineering from Aalto University, Espoo, in 2012 and 2016, respectively.

From 2005 to 2016, he was with the Electronic Circuit Design Laboratory, Helsinki University of Technology (later with Aalto University), where he was involved in analog-/mixed-signal CMOS circuits, including analog-to-digital converters (ADCs), digital-to-time converters, and serial input-output interfaces. Since 2017, he has been with CoreHW, Tampere, Finland, providing analog CMOS design at Huawei Technologies, Helsinki, Finland, where he has been involved in ADCs for receiver circuits.



**Mikko Valkama** (S'99–M'02–SM'15) received the M.Sc. and Ph.D. degrees (Hons.) in electrical engineering (EE) from the Tampere University of Technology, Tampere, Finland, in 2000 and 2001, respectively. His Ph.D. dissertation was on advanced I/Q signal processing for wideband receivers: models and algorithms.

In 2003, he was a Visiting Post-Doctoral Research Fellow with the Communications Systems and Signal Processing Institute, San Diego State University (SDSU), San Diego, CA, USA. He is currently a Full

Professor and the Department Head of EE, Tampere University, Tampere. His current research interests include radio communications, radio localization, and radio-based sensing, with an emphasis on 5G and beyond mobile radio networks.

Dr. Valkama received the Best Ph.D. Thesis Award from the Finnish Academy of Science and Letters in 2002.



**Marko Kosunen** (S'97–M'07) received the M.Sc., Lic.Sc., and D.Sc. (Hons.) degrees from the Helsinki University of Technology, Espoo, Finland, in 1998, 2001, and 2006, respectively.

He is currently a Senior Researcher with the Department of Electronics and Nanoengineering, Aalto University, Espoo. He is currently involved in implementations of digital-intensive transceiver circuits and medical sensor electronics. His current research interests include in the implementation of the wireless transceiver DSP algorithms and communication circuits.



**Yury Antonov** (S'15–M'18) received the Engineer's degree in computer design and technology from Bauman Moscow State Technical University, Moscow, Russia, in 2007, and the M.Sc. degree (Hons.) in electrical engineering from Aalto University, Espoo, Finland, in 2014.

From 2004 to 2010, he was a circuit designer with several research and development companies in Moscow, where he was involved in developing complex programmable logic device (CPLD)/field-programmable gate array (FPGA)-based mixed-signal circuits for the synchronization of scalable radar-phased arrays and designing downconversion front ends for lidar-based measurement solutions. From 2014 to 2019, he was with Aalto University, where he was involved in multiple successful tape-outs from 28 to 65 nm. He is currently a Senior RFIC Design Engineer with CoreHW, Helsinki, Finland, where he is involved in the mm-wave frequency synthesizer design. His current research interests include multiphase and spur-free frequency synthesis for RF/mm-wavefront ends with digital assistance and on-chip calibration loops.



**Jussi Rynnänen** (S'99–M'04–SM'16) was born in Ilmajoki, Finland, in 1973. He received the M.Sc. and D.Sc. degrees in electrical engineering from the Helsinki University of Technology, Espoo, Finland, in 1998 and 2004, respectively.

He is currently a Full Professor and the Head of the Department of Electronics and Nanoengineering, Aalto University, Espoo. He has authored or coauthored more than 140 refereed journal articles and conference papers in analog and RF circuit design. He holds seven patents on RF circuits. His current research interest includes integrated transceiver circuits for wireless applications.

Prof. Rynnänen has served as a TPC Member for the European Solid-State Circuits Conference and the IEEE International Solid-State Circuits Conference. He has served as a Guest Editor for the IEEE JOURNAL OF SOLID-STATE CIRCUITS.