

# A back-end, CMOS compatible ferroelectric Field Effect Transistor for synaptic weights

Mattia Halter,<sup>\*,†,‡</sup> Laura Bégon-Lours,<sup>†</sup> Valeria Bragaglia,<sup>†</sup> Marilyne Sousa,<sup>†</sup> Bert Jan Offrein,<sup>†</sup> Stefan Abel,<sup>†</sup> Mathieu Luisier,<sup>‡</sup> and Jean Fompeyrine<sup>†</sup>

<sup>†</sup>*IBM Research GmbH - Zurich Research Laboratory, CH-8803 Rüschlikon, Switzerland*

<sup>‡</sup>*Integrated Systems Laboratory, ETH Zurich, CH-8092 Zurich, Switzerland*

E-mail: att@zurich.ibm.com

## Abstract

Neuromorphic computing architectures enable the dense co-location of memory and processing elements within a single circuit. This co-location removes the communication bottleneck of transferring data between separate memory and computing units as in standard von Neuman architectures for data-critical applications including machine learning. The essential building blocks of neuromorphic systems are non-volatile synaptic elements such as memristors. Key memristor properties include a suitable non-volatile resistance range, continuous linear resistance modulation and symmetric switching. In this work, we demonstrate voltage-controlled, symmetric and analog potentiation and depression of a ferroelectric  $\text{Hf}_{0.57}\text{Zr}_{0.43}\text{O}_2$  (HZO) field effect transistor (FeFET) with good linearity. Our FeFET operates with a low writing energy (fJ) and fast programming time (40 ns). Retention measurements have been done over 4-bits depth with low noise (1 %) in the tungsten oxide ( $\text{WO}_x$ ) read out channel. By adjusting the channel thickness from 15 nm to 8 nm, the on/off ratio of the FeFET can be engineered from 1 % to 200 % with an on-resistance ideally  $>100\text{ k}\Omega$ , depending on the channel geometry. The device concept is using earth-abundant materials, and is

compatible with a back end of line (BEOL) integration into complementary metal-oxide-semiconductor (CMOS) processes. It has therefore a great potential for the fabrication of high density, large-scale integrated arrays of artificial analog synapses.

## Keywords

ferroelectric switching, hafnium zirconium oxide, tungsten oxide, BEOL, ferroelectric field-effect transistor, memristor

## 1 Introduction

The development of new computing architectures has seen a substantial push since the scaling of conventional CMOS technology has come to its limits and cannot keep up with the always increasing demand for computational power. A large part of today's computing resources is dedicated to processing large amounts of data, such as images, videos, or sensor outputs. For all these workloads, conventional von Neuman architectures are limited by a fundamental, time and power consuming task of transferring data between the processor and the memory.<sup>1</sup> Brain-inspired neuromorphic architectures with co-located computation and memory units appear as promising candidates to overcome this issue.<sup>2</sup> Such architectures consist of neurons that are interconnected by plastic synapses, which can be arranged in a crossbar topology to efficiently perform matrix-vector multiplications<sup>3</sup> – a key computing task when executing neural networks.<sup>4,5</sup> In recent years, much progress has been made in neuromorphic hardware, in particular in creating crossbar arrays of artificial synapses connected to CMOS neurons.<sup>6–12</sup> Multiple device concepts have been proposed in order to realize the required artificial synapse, such as phase change memory (PCM),<sup>13–15</sup> filamentary and non-filamentary resistive switching memory (RRAM),<sup>16–18</sup> electro-chemical memory (ECRAM),<sup>19–21</sup> and ferroelectric (FE)<sup>7,22–25</sup> based memory cells. Unlike classical memory elements, such devices are characterized by the stronger need for multilevel or analog pro-

gramming capabilities to define the synaptic weight. While PCM and RRAM devices are essentially current controlled, the different states in ferroelectric memory elements are controlled by applying an electric field. The states are linked to the partial switching of the ferroelectric polarization, which allows to fine tune the synaptic weights in analog computing approaches, with fast and low-power writing.<sup>26</sup>

For circuits solving real world applications, the number of required synapses rapidly explodes with the complexity of the task at hand. Solving even a simple task such as the MNIST database of handwritten digits requires  $\approx 10^5$  synapses,<sup>27</sup> while the training of a deep neural network (DNN) relies on up to millions of synapses. Such numbers of hardware synapses can only be obtained in densely integrated circuits such as fabricated using modern CMOS technology. Part of the functions in neural networks can also be implemented using CMOS circuits (e.g. activation). Therefore, it is important that materials and processes are CMOS compatible. The recent discovery of ferroelectricity in hafnia composites,<sup>28</sup> a material already present in CMOS lines, has revived research activity in the field of integrated ferroelectrics. Artificial ferroelectric synapses have been realized based on two device concepts, namely two-terminal ferroelectric tunneling junctions (FTJ)<sup>29–33</sup> and three-terminal ferroelectric field-effect transistors (FeFET).<sup>7,22,24,34–37</sup> Hafnia-based FTJs remain a challenge as the stabilization of the ferroelectric phase in sub-5 nm thick structures becomes difficult and polarization drops at film thicknesses relevant for tunneling.<sup>33,38,39</sup> Using a transistor instead has the advantage of separating the write process (low power write through high impedance gate<sup>26</sup>) and the read process (through source-drain resistance). It also permits to tune synaptic resistance by changing the channel geometry. Hafnia-based FeFETs were demonstrated mainly as non-volatile memory cells,<sup>34</sup> steep-slope field-effect transistors,<sup>35,36</sup> and artificial neurons.<sup>8</sup> These concepts usually are implemented on the front end of line (FEOL) and use Si as a channel. Because of the constraints imposed by the FEOL on the thermal budget and on the device geometry, an integration in the back end of line (BEOL) can be advantageous. E.g., an integration in the BEOL enable a larger device area with

respect to the size of the ferroelectric domains, which can translate into a larger number of states. Recently, analog synaptic behavior has been shown in a hafnia-based FeFET with indium gallium zinc oxide (IGZO) and poly-Si channels fabricated in the BEOL.<sup>22,24,37</sup> The combination of a hafnia-based ferroelectric with an oxide channel is expected to alleviate the known issues associated with Si-based FeFETs such as unintended low-k interfacial layers formed at the Si interface. On Si based channels, buffer layers have been used as a solution, but they have the disadvantage of reducing the effective field over the ferroelectric layer.<sup>40–43</sup> For neuromorphic applications the absolute resistance should be in the MΩ range<sup>3</sup> and the relative change in resistance ideally within a window of 8<sup>3</sup> up to 20-50.<sup>44</sup> Those values are a compromise between being large enough for performing learning tasks, and low enough to avoid one synaptic element to dominate the response of a whole column/row of the overall crossbar array.<sup>3,44</sup> Here, we report on a Hf<sub>0.57</sub>Zr<sub>0.43</sub>O<sub>2</sub> (HZO) based FeFET utilizing a tungsten oxide (WO<sub>x</sub>) channel. We demonstrate the impact of the ferroelectric polarization on the channel resistance, the influence of the channel thickness on the on/off ratio, ferroelectric HZO with a long endurance, the stabilization of multiple differentiable states, a good retention as well as a continuous potentiation and depression. By using a BEOL compatible process and by using only abundant and CMOS friendly materials, the proposed HZO/WO<sub>x</sub> stack is very promising for large-scale integrated neuromorphic hardware based on ferroelectrics.

## 2 Results and discussion

For our study, we designed FeFET devices similar to back gated PseudoMOS<sup>45</sup> with an HZO (10 nm)/TiN (10 nm)/n<sup>+</sup> Si gate stack and an 8 nm thick WO<sub>x</sub> channel<sup>20,21</sup> (Figure 1a). The channel is formed by oxidizing 2.5 nm of W after the formation of the ferroelectric HZO.<sup>46</sup> The source and drain contacts are deposited on the WO<sub>x</sub> channel through lift-off. The device is encapsulated by a 5 nm Al<sub>2</sub>O<sub>3</sub> and a 100 nm SiO<sub>2</sub> passivation layer. Contact

pads are formed on top of the passivation layers and routed through openings to source and drain. The gate is accessible through the highly  $n^+$  doped Si substrate and is shared between all devices on our chip. As visible in the bright field scanning transmission electron microscopy (BF-STEM), our fabrication process results in sharp interfaces between the layers and crystalline  $\text{WO}_x$  grains (Figure 1b). The energy-dispersive X-ray spectroscopy (EDS) line profile confirms the targeted elemental distributions and reveals regions of intermixing between the various layers. After the low temperature crystallization of HZO by a millisecond flash lamp technique described elsewhere,<sup>46</sup> grazing incidence X-ray diffraction (GIXRD) analysis shows the characteristic peak at  $30.6\text{\AA}$  of the orthorhombic/tetragonal phase in HZO (Figure 1c). The diffractogram is consistent with data from metal-ferroelectric-metal (MFM) structures with the same HZO published in Ref. (46). No monoclinic phase (peaks at  $28.2\text{\AA}$  and  $31.8\text{\AA}$ )<sup>47</sup> is present in our samples, which is a consequence of the low temperature crystallization technique. Following the oxidation and crystallization of W to  $\text{WO}_x$ , GIXRD still shows no monoclinic HZO phase, but displays two additional peaks at  $28.8\text{\AA}$  and  $33.6\text{\AA}$  that can be attributed to the monoclinic P121/c1 phase of  $\text{WO}_x$  (ICSD-647640).<sup>48</sup>

For the electrical characterization of HZO in our FeFET devices, additional metal-semiconductor-ferroelectric-metal (MSFM) capacitor structures have been processed on the same sample. "Capacitance versus voltage" ( $C - V$ ) measurements on a  $60\text{ }\mu\text{m} \times 60\text{ }\mu\text{m}$  capacitor reveal a ferroelectric typical butterfly-shaped hysteresis curve typical of ferroelectrics, with a capacitance per unit area of  $C_{OX} = 2.7\text{ }\mu\text{F/cm}^2$  (Figure 2a). The asymmetric behavior originates from the asymmetric electrodes ( $\text{WO}_x$ , TiN). "Polarization versus voltage" ( $P - V$ ) measurements were performed on the same capacitor (Figure 2b) and show typical characteristics. In the pristine state, the  $P - V$  curve is anti-ferroelectric (AFE)-like with hysteresis, especially on the negative voltage side.<sup>49</sup> We applied  $10^5$  switching cycles with an amplitude of  $\pm 3.8\text{ V}$  at a frequency of  $100\text{ kHz}$ , resulting in a pinched  $P - V$  curve with a positive (negative) remanent polarization  $+P_r = 12.4\text{ }\mu\text{C/cm}^2$  ( $-P_r = 11.8\text{ }\mu\text{C/cm}^2$ ). Furthermore, a slight imprint with a positive coercive voltage of  $+V_C = 0.91\text{ V}$  and a negative



Figure 1: Structural data of the FeFET. (a) Schematic illustration of a FeFET, indicating source (S), drain (D), gate (G), a  $\text{WO}_x$  channel and a ferroelectric HZO gate dielectric. (b) Cross-sectional BF-STEM image with energy-dispersive X-ray spectroscopy (EDS) line profile of the  $\text{SiO}_2/\text{Al}_2\text{O}_3/\text{WO}_x/\text{HZO}/\text{TiN}/n^+ \text{Si}$  gate region. (c) GIXRD for a diffraction angle ( $2\theta$ ) from  $26^\circ$  to  $38^\circ$  showing the presence of the orthorhombic/tetragonal crystalline phase in HZO after crystallization and after the W layer was oxidized to  $\text{WO}_x$ .

one of  $-V_C = -1.27 \text{ V}$  are observed due to the asymmetric electrodes. The cycling endurance of our HZO is  $10^8$  for an MFM structure and  $8 \times 10^6$  in the case of the MSFM configuration present in our FeFET (Figure S1).

Having confirmed the ferroelectric nature of our HZO gate dielectric, the electrical characterization of the  $\text{WO}_x$  channel in a FeFET device was performed next, by investigating the influence of  $P_r$ , channel thickness ( $d_{\text{WO}_x}$ ), and the channel carrier concentration ( $N_D$ ) on the channel resistance ( $R_{\text{DS}}$ ). For that, three samples with different  $d_{\text{WO}_x}$  and one with a non-ferroelectric  $\text{HfO}_2$  gate dielectric were realized.  $R_{\text{DS}}$  was measured between source and drain after each  $2 \mu\text{s}$  long write pulse ( $V_{\text{write}}$ ) applied to the gate (measurement scheme can be seen in Figure S4). For ease of comparison,  $R_{\text{DS}}$  is normalized by  $R_{\text{ON}}$  (Figure 2c, d, e, f). A clear hysteresis in  $R_{\text{DS}}$  is observed for devices with a ferroelectric HZO gate dielectric. To confirm that the modulation of the channel resistance originates from  $P_r$  and not from another effect, an identical device with a non-ferroelectric  $\text{HfO}_2$  gate dielectric was measured. Both have an 8 nm thick  $\text{WO}_x$  channel.  $R_{\text{DS}}$  shows no hysteresis in the non-ferroelectric

$\text{HfO}_2$  sample (Figure 2c) and further proves that the hysteresis originates from the ferroelectricity in HZO. In addition to the polarization in the HZO, the type and concentration of the free charge carriers<sup>50,51</sup> as well as  $d_{\text{WO}_x}$  influence the on/off ratio. For a maximum reduction in the channel off-current, the polarization-field induced depletion width ( $x_d$ ) should be larger than  $d_{\text{WO}_x}$ . Using Poisson's equation, the relationship between  $x_d$  and  $N_D$  can be expressed as follows:<sup>51–53</sup>

$$x_d = \frac{\epsilon_0 \epsilon_{\text{WO}_x}}{C_{\text{HZO}}} \left[ \left( 1 + \frac{2C_{\text{HZO}}^2 V_{GS}}{q N_D \epsilon_0 \epsilon_{\text{WO}_x}} \right)^{1/2} - 1 \right], \quad (1)$$

where  $\epsilon_0$  is the vacuum permittivity,  $\epsilon_{\text{WO}_x}$  the permittivity of  $\text{WO}_x$  ( $\epsilon_{\text{WO}_x} = 189$ , see supplementary information),  $C_{\text{HZO}}$  is the HZO capacitance per unit area (  $C_{\text{HZO}} = 3.14 \mu\text{F}/\text{cm}^2$ , Figure S5b), and  $V_{GS}$  is the polarization charge-induced potential across HZO. The carrier concentration ( $N_D = 1.01 \times 10^{20} \text{ cm}^{-3}$ ), the channel resistivity ( $\rho_H = 3.27 \times 10^{-1} \Omega \text{ cm}$ ) and its mobility ( $\mu_H = 0.19 \text{ cm}^2 \text{ V}^{-1} \text{ s}$ ) were determined by Hall measurements carried out on a similar sample. Using Eq. (1), a depletion width  $x_d = 1.7 \text{ nm}, 3.3 \text{ nm}, 4.8 \text{ nm}$  and  $6.4 \text{ nm}$  for  $V_{GS} = 1 \text{ V}, 2 \text{ V}, 3 \text{ V}$  and  $4 \text{ V}$ , respectively was calculated (Figure S6b). For a constant polarization, the largest effect is obtained if  $d_{\text{WO}_x} < x_d = 6.4 \text{ nm}$  or  $N_D < 1 \times 10^{20} \text{ cm}^{-3}$ . Three samples with different  $d_{\text{WO}_x}$  were realized to benchmark this estimation with experimental data. BF-STEM measurements reveal  $d_{\text{WO}_x} = 8 \text{ nm}, 11.3 \text{ nm}$  and  $15 \text{ nm}$ , as reported in Figures 1b and S2a,b, respectively. The polarization does not change between the three structures (Figures 2a and S3a,b). By decreasing  $d_{\text{WO}_x}$  from  $15 \text{ nm}$  to  $11.3 \text{ nm}$  and  $8 \text{ nm}$  the on/off ratio increases from  $\approx 1\%$  to  $\approx 5\%$  and  $\approx 90\%$ , respectively. Those results agree well with the  $x_d$  calculated by Eq. (1).

For neuromorphic applications multiple (analog) levels of the channel resistance, good retention properties, low device-to-device and cycle-to-cycle variations, fast updates, and low power consumption are important characteristics of ideal devices.<sup>3,4,44,54</sup> The exact requirements vary depending on the details of operation and from one implementation to the other. As an example, inference workloads would use off-line trained weights transferred to the chip



Figure 2: Capacitance and polarization behavior of a  $60\text{ }\mu\text{m} \times 60\text{ }\mu\text{m}$   $\text{W}/\text{WO}_x/\text{HZO}/\text{TiN}/\text{n}^+\text{Si}$  MSFM structure and  $\text{WO}_x$  channel resistance hysteresis: (a) Capacitance versus voltage ( $C - V$ ) measurements after the HZO was woken up by 20  $C - V$  cycles. (b) Polarization versus voltage ( $P - V$ ) characteristics in the pristine state and after  $10^5$  cycles. (c, f) Comparison of simultaneously processed samples with HZO and  $\text{HfO}_2$  gate dielectric. The non ferroelectric  $\text{HfO}_2$  sample does not show any channel resistance hysteresis. (d, e, f) Influence of the channel thickness ( $d_{\text{WO}_x}$ ) on the on/off ratio.

to operate the network, and the precision of the weights ( $\geq 3$ bit) is more relaxed as in the case of a chip designed to perform on-line learning.<sup>55</sup> In our device structure, weights are defined through the intermediate states of the channel resistance, enabled via the multi-domain nature of the ferroelectric HZO layer.<sup>25,29,56</sup> By switching only a subset of the domains, a state between  $R_{\text{ON}}$  and  $R_{\text{OFF}}$  can be set.<sup>29</sup> The fraction of the switched ferroelectric domains



Figure 3: Analog multi-level behavior of a FeFET of  $20\text{ }\mu\text{m}$  width and  $5\text{ }\mu\text{m}$  length. **(a)** The channel resistance ( $R_{DS}$ ) after the application of  $5\text{ }\mu\text{s}$  write pulses ( $V_{write}$ ) of varying amplitudes. The different curves correspond to different consecutive measurements with reducing  $V_{write}$  range. **(b)** Retention measurement for  $1500\text{ s}$ .  $V_{read,D} = 200\text{ mV}$  was uninterruptedly applied while  $R_{DS}$  was measured every  $5\text{ s}$ .

depends on the amplitude, width, and number of the applied write pulses. Different pulsing schemes on HZO have been investigated in the past.<sup>22</sup> For on-line learning algorithms running on crossbar arrays integrated on CMOS, potentiation and depression pulse schemes with a constant pulse amplitude and width are preferred to those with varying amplitude. Nevertheless, for the proof of concept the multi-state nature of a  $20\text{ }\mu\text{m}$  wide and  $5\text{ }\mu\text{m}$  long FeFET was investigated by applying voltage pulses of varying amplitudes, while keeping a fixed pulse duration of  $5\text{ }\mu\text{s}$  (Figure 3a). This pulse scheme results in the best linearity in potentiation and depression.<sup>22</sup> By sweeping  $V_{write}$  from  $-4\text{ V}$  to  $4\text{ V}$ ,  $R_{DS}$  shows a hysteretic cycle from  $80\text{ k}\Omega$  to  $125\text{ k}\Omega$  with various intermediate states ( $\text{on/off} \approx 1.55$ ). By reducing the range of  $V_{write}$  numerous  $R_{DS}$  sub-loops can be accessed, as shown in Figure 3a. The asymmetry in the hysteresis loop is due to the imprint in the ferroelectric layer. Furthermore, the retention properties have been studied, as demonstrated in Figure 3b. First, an intermediate state was written by a  $5\text{ }\mu\text{s}$  pulse. Then, a source-to-drain voltage  $V_{DS} = 200\text{ mV}$  was applied for  $1500\text{ s}$ , while  $R_{DS}$  was measured every  $5\text{ s}$ . Between each measured intermediate state the FeFET was reset to its low resistive state ( $R_{ON}$ ) by setting  $V_{write} = -4\text{ V}$  during  $1\text{ ms}$ . The



Figure 4: Potentiation and depression of a 20  $\mu$ m wide and 5  $\mu$ m long FeFET. **(a)** The top panel shows multiple potentiation and depression cycles of the channel resistance ( $R_{DS}$ ) with varying pulse amplitude ( $V_{write}$ ) and constant pulse width ( $t_{write}$ ). The bottom panel shows the corresponding write pulse sequence . After each pulse  $R_{DS}$  was measured. **(b)** Absolute cycle-to-cycle variation of  $R_{DS}$  showing the data, average and standard deviation error bars. **(c)** Standard deviation of the  $R_{DS}$  cycle-to-cycle variation in percent. **(d)** Multiple potentiation and depression cycles of  $R_{DS}$  with increasing  $t_{write}$  from 40 ns to 250 ns and constant  $V_{write}$ .

FeFET showed stable retention properties for 18 differentiable channel resistances ( $>4$ bit) for the full 1500 s. The good retention measurement hints to an absence of depolarization or other screening mechanisms. The obtained multistate storage capability, the long retention and rather fast programming speed makes this FeFET suited for inference applications.

For on-chip learning, artificial synapses require a finer mesh of intermediate levels. In addition, symmetric and linear potentiation and depression are desirable. With respect to symmetry the field-driven ferroelectric switching is advantageous to other technologies that often show abrupt or unidirectional switching.<sup>22,44</sup> The requirement of low variability is relaxed as the training occurs on a specific hardware and thus incorporates the variability in its solution.<sup>55</sup> To investigate the linearity and symmetry of the potentiation and depression,

multiple write pulses of increasing and decreasing amplitude were applied. For the potentiation  $V_{\text{write}}$  was increased from 0 V to 3.5 V and for the depression decreased from 0 V to -3 V with 100 mV steps (Figure 4a). The duration of the write pulses was kept constant at 10  $\mu$ s. When averaging over several cycles (Figure 4b), multiple states with small standard deviation are observed. Normalizing the cycle-to-cycle standard deviation by  $R_{\text{ON}}$  reveals a constant value of about 1% (Figure 4c). The number and overlap of states are defined by the potentiation and depression step size. The latter could be reduced further to increase the resolution. When fitting the potentiation range from 1 V to 3.1 V and depression range from -0.9 V to -3.0 V by linear regression (Figure 5a), an adjusted residual-square value of 0.952 is obtained. The residuals normalized by the  $R_{\text{DS}}$  window as a function of pulse number is depicted in Figure 5b. For a more detailed analysis of the symmetry, Gaussian process regression (GPR) was used to predict a noise free signal (Figure 5c).<sup>57</sup> Plotting  $\Delta R$  (Figure 5e) and the signal to noise ratio (SNR, Figure 5d) as a function of pulse number reveals diminishing  $\Delta R$  and noisier signals towards the extremes. The symmetry factor (SF) was then calculated using the following equation:<sup>57</sup>

$$SF = \left| \frac{\Delta R_+ - \Delta R_-}{\Delta R_+ + \Delta R_-} \right|, \quad (2)$$

where  $\Delta R_+$  is the potentiation and  $\Delta R_-$  is the depression change in resistance at a certain resistance level. By this definition, SF can take values between 0 and 1 where 0 is the perfect symmetry. The less linear the range of the data becomes, the larger is SF (Figure 5d). The average across the full resistance range is  $SF = 0.20$  while the most linear part in the center reaches a very good symmetry factor of  $SF = 0.08$ .

Short programming pulses are advantageous as fast writing and low-power consumption are important for neuromorphic applications. By varying the pulse width from 40 ns to 250 ns with a fixed amplitude (Figure 4d), already the shortest applied pulse of 40 ns (equipment limit) changes the resistance and demonstrates very fast writing capabilities of the FeFET. It is expected that even shorter pulses could successfully program the device.<sup>29</sup> In our device,



Figure 5: Extraction of linearity and symmetry metric. Linear regression and the GPR methodology<sup>57</sup> is applied to our FeFET data from multiple cycles with: 22 potentiation pulses (blue) with increasing amplitude (1 V to 3.1 V) and 22 depression pulses (black) with decreasing negative amplitude (−0.9 V to −3 V). (a)  $R_{DS}$  as a function of pulse number and the linear regression fit (red). (b) Absolute residuals  $r$  normalized by the channel resistance window. (c) Channel resistance ( $R_{DS}$ ) as a function of pulse number and the GPR predicted noise free signal (red). (d) Absolute SNR for each potentiation and depression pulse. (e) Absolute change of  $R_{DS}$  after each potentiation and depression pulse. (f) Symmetry factor (SF) as a function of  $R_{DS}$ .

little energy is consumed while writing a state. When applying  $V_{\text{write}} = 3.5 \text{ V}$  a gate current of  $I_{\text{gate}} = 3.02 \times 10^{-8} \text{ A}$  is measured. Applying a write pulse duration of  $t_{\text{write}} = 200 \text{ ns}$  results in  $E = \frac{V_{\text{write}} \cdot I_{\text{gate}} \cdot t_{\text{write}}}{w \cdot l} = 2.1 \times 10^{-17} \text{ J } \mu\text{m}^{-2}$ , where  $l$  is the length and  $w$  the width of the gate.

### 3 Conclusion

We propose a device concept based on the ferroelectric field effect into a thin  $\text{WO}_x$  channel using HZO gate dielectric, that can be used as a synaptic element in hardware-supported neural networks. The fabrication process is compatible with the integration in the Back End Of Line of CMOS technology and is using earth-abundant materials, which is making

it attractive for large-scale integration. By comparing HZO and  $\text{HfO}_2$  based devices, and carefully analyzing capacitor and transistor data, we unambiguously show that the channel resistance is directly coupled to the polarization of the HZO layer and can be programmed in a non-volatile manner. Multilevel states programmed over more than 4-bits depth with a good retention and an almost symmetric potentiation and depression is obtained, together with a low programming energy. The property of the  $\text{WO}_x$  layer and the geometry of the device can be arranged so that a well-suited resistance range is obtained, favorable to build large scale arrays. The proposed device exhibit therefore promising metrics when considered as a synaptic element for processing cores supporting artificial neural networks. Future work will concentrate on controlling the channel thickness and the carrier concentration of  $\text{WO}_x$  to increase the on/off ratio, so that the device can be operated strictly in the linear region, without ever fully switching all the domains to the same polarization. This is expected to improve symmetry and to allow a constant pulse scheme for potentiation and depression, which is more friendly to learning algorithms.

## 4 Experimental

**Sample preparation.** Our FeFET is a bottom/gate device with shared gate. The gate contact is accessed through the Si  $n^+$  substrate. First, 10 nm TiN was deposited using a tetrakis(dimethylamino)titanium (TDMAT) precursor and  $\text{N}_2/\text{H}_2$  plasma in an Oxford Instruments plasma enhanced atomic layer deposition (PEALD) system. An approximately 10 nm thick layer of HZO was grown in a process using alternating cycles of tetrakis(ethylmethylamino)hafnium (TEMAH), and ZrCMMM ((MeCp) $_2$ Zr(OMe)(Me)) at 300 °C. Rutherford Back Scattering (RBS) analysis of the film (not shown) indicated an actual film composition of  $\text{Hf}_{0.57}\text{Zr}_{0.43}\text{O}_2$ . The sample was then immediately transferred to a sputter chamber for the deposition of 4 nm W. For the crystallization of HZO a millisecond flash lamp anneal (ms-FLA)<sup>46</sup> with a background temperature of 375 °C was performed.

After crystallization the 4 nm W was reduced to  $\approx$ 2.5 nm by Ar sputtering. The W was then crystallized and oxidized to 10 nm  $\text{WO}_3$  in a rapid thermal annealer (RTA) at 350 °C for 6 min with 50 sccm  $\text{O}_2$ . Afterwards a reduction of the  $\text{WO}_3$  to  $\text{WO}_x$  was performed in a RTA by  $\text{H}_2$  annealing at 150 °C and vacuum annealing at 350 °C.  $\text{WO}_x$  was further thinned by Ar sputtering to 8 nm. Source and drain were deposited by sputtering and liftoff. The passivation consists of 5 nm  $\text{Al}_2\text{O}_3$  by thermal ALD (precursor) and 100 nm  $\text{SiO}_2$  by plasma-enhanced chemical vapor deposition (PECVD). Vias were etched using a reactive ion etcher (RIE) with a  $\text{CHF}_3/\text{O}_2$  plasma. Finally, the contacts were realized by depositing 100 nm W by sputtering and defined in an RIE with a  $\text{SF}_6/\text{O}_2$  plasma.

**Structural Characterization.** Grazing incidence X-ray diffraction (GIXRD) measurements were performed in a Bruker D8 Discover diffractometer equipped with a rotating anode generator. TEM lamellas have been prepared by Focused Ion Beam using a FEI FIB Helios FEI Helios NanoLab 450S and investigated with a double spherical aberration-corrected JEOL JEM-ARM200F microscope. Bright field STEM (BF-STEM) images have been acquired at 200 kV and Energy Dispersive x-ray Spectroscopy (EDS) line profiles have been performed using a liquid-nitrogen-free silicon drift detector.

**Electrical Characterization.**  $R_{\text{DS}} - V_{\text{write}}$  and retention were measured using an Agilent B1500.  $V_{\text{write}}$  pulses were generated by a WGFMU and RSU module for the Agilent B1500 and applied to source and drain simultaneously while grounding the gate (Figure S4a).  $R_{\text{DS}}$  was measured by applying an IV-sweep from  $-200 \text{ mV}$  to  $200 \text{ mV}$  to the drain while having the source connected to ground (Figure S4b).  $R_{\text{DS}}$  was then determined by averaging the resistance at  $\pm 200 \text{ mV}$ .  $P - V$  loops on HZO were recorded using a TF Analyzer 2000 from AixAct. The signal of 5 kHz was applied to the top W/WO<sub>x</sub> contact while the bottom TiN/ $n^+$  Si contact (substrate) was grounded. For the wake-up of HZO,  $10^5$  cycles of  $\pm 3.8 \text{ V}$  and 100 kHz were applied.

## Supporting Information Available

The following files are available free of charge.

The Supporting Information is available free of charge on the ACS Publications website at DOI:

- Additional data concerning the endurance of MFM and MSFM structures, BF-STEM and  $P - V$  measurements on additional samples, electrical measuring schemes, capacitance measurements and permittivity and depletion width calculations. (PDF)

## Acknowledgement

We acknowledge helpful discussions with Nanbo Gong and Takashi Ando. This project has received funding from the European Commission under grant agreement H2020-ICT-2016-1-732642 (ULPEC).

## References

- (1) Wong, H. S.; Salahuddin, S. Memory leads the way to better computing. *Nat. Nanotechnol.* **2015**, *10*, 191–194.
- (2) Poon, C. S.; Zhou, K. Neuromorphic silicon neurons and large-scale neural networks: Challenges and opportunities. *Front. Neurosci.* **2011**, *5*, 2009–2011.
- (3) Gokmen, T.; Vlasov, Y. Acceleration of Deep Neural Network Training with Resistive Cross-Point Devices: Design Considerations. *Front. Neurosci.* **2016**, *10*, 1–13.
- (4) Kim, S.; Gokmen, T.; Lee, H.-M.; Haensch, W. E. Analog CMOS-based resistive processing unit for deep neural network training. 2017 IEEE 60th Int. Midwest Symp. Circuits Syst. 2017; pp 422–425.

- (5) Yu, S. Neuro-Inspired Computing With Emerging Nonvolatile Memorys. *Proc. IEEE* **2018**, *106*, 260–285.
- (6) Likharev, K.; Mayr, A.; Muckra, I.; Türel, Ö. CrossNets: High-Performance Neuromorphic Architectures for CMOL Circuits. *Ann. N. Y. Acad. Sci.* **2003**, *1006*, 146–163.
- (7) Nishitani, Y.; Kaneko, Y.; Ueda, M.; Morie, T.; Fujii, E. Three-terminal ferroelectric synapse device with concurrent learning function for artificial neural networks. *J. Appl. Phys.* **2012**, *111*, 1–7.
- (8) Mulaosmanovic, H.; Chicca, E.; Bertele, M.; Mikolajick, T.; Slesazeck, S. Mimicking biological neurons with a nanoscale ferroelectric transistor. *Nanoscale* **2018**, *10*, 21755–21763.
- (9) Burr, G. W. et al. Neuromorphic computing using non-volatile memory. *Adv. Phys. X* **2017**, *2*, 89–124.
- (10) Ha, S. D.; Ramanathan, S. Adaptive oxide electronics: A review. *J. Appl. Phys.* **2011**, *110*.
- (11) Chen, X.; Yin, X.; Niemier, M.; Hu, X. S. Design and optimization of FeFET-based crossbars for binary convolution neural networks. *Proc. 2018 Des. Autom. Test Eur. Conf. Exhib. DATE 2018* **2018**, *2018-Janua*, 1205–1210.
- (12) Hansen, M.; Zahari, F.; Kohlstedt, H.; Ziegler, M. Unsupervised Hebbian learning experimentally realized with analogue memristive crossbar arrays. *Sci. Rep.* **2018**, *8*, 8914.
- (13) Lacaita, A. Phase change memories: State-of-the-art, challenges and perspectives. *Solid. State. Electron.* **2006**, *50*, 24–31.
- (14) Raoux, S.; Wełnic, W.; Ielmini, D. Phase Change Materials and Their Application to Nonvolatile Memories. *Chem. Rev.* **2010**, *110*, 240–267.

- (15) Boybat, I.; Le Gallo, M.; Nandakumar, S. R.; Moraitis, T.; Parnell, T.; Tuma, T.; Rajendran, B.; Leblebici, Y.; Sebastian, A.; Eleftheriou, E. Neuromorphic computing with multi-memristive synapses. *Nat. Commun.* **2018**, *9*, 2514.
- (16) Baek, I.; Lee, M.; Sco, S.; Lee, M.; Seo, D.; Suh, D.-S.; Park, J.; Park, S.; Kim, H.; Yoo, I.; Chung, U.-I.; Moon, J. Highly scalable non-volatile resistive memory using simple binary oxide driven by asymmetric unipolar voltage pulses. IEDM Tech. Dig. IEEE Int. Electron Devices Meet. 2004. 2004; pp 587–590.
- (17) Lee, H. Y.; Chen, P. S.; Wu, T. Y.; Chen, Y. S.; Wang, C. C.; Tzeng, P. J.; Lin, C. H.; Chen, F.; Lien, C. H.; Tsai, M.-J. Low power and high speed bipolar switching with a thin reactive Ti buffer layer in robust HfO<sub>2</sub> based RRAM. 2008 IEEE Int. Electron Devices Meet. 2008; pp 1–4.
- (18) Waser, R.; Dittmann, R.; Staikov, C.; Szot, K. Redox-based resistive switching memories nanoionic mechanisms, prospects, and challenges. *Adv. Mater.* **2009**, *21*, 2632–2663.
- (19) Fuller, E. J.; Gabaly, F. E.; Léonard, F.; Agarwal, S.; Plimpton, S. J.; Jacobs-Gedrim, R. B.; James, C. D.; Marinella, M. J.; Talin, A. A. Li-Ion Synaptic Transistor for Low Power Analog Computing. *Adv. Mater.* **2017**, *29*, 1604310.
- (20) Tang, J.; Bishop, D.; Kim, S.; Copel, M.; Gokmen, T.; Todorov, T.; Shin, S.; Lee, K.-T.; Solomon, P.; Chan, K.; Haensch, W.; Rozen, J. ECRAM as Scalable Synaptic Cell for High-Speed, Low-Power Neuromorphic Computing. 2018 IEEE Int. Electron Devices Meet. 2018; pp 13.1.1–13.1.4.
- (21) Kim, S.; Todorov, T.; Onen, M.; Gokmen, T.; Bishop, D.; Solomon, P.; Lee, K.-t.; Copel, M.; Farmer, D.; John Ott, T.; Ando, A.; Miyazoe, H.; Narayanan, V.; Rozen, J. Metal-oxide Based, CMOS-compatible ECRAM for Deep Learning Accelerator. 2019 IEEE Int. Electron Devices Meet. 2019.

- (22) Jerry, M.; Chen, P.-Y.; Zhang, J.; Sharma, P.; Ni, K.; Yu, S.; Datta, S. Ferroelectric FET analog synapse for acceleration of deep neural network training. 2017 IEEE Int. Electron Devices Meet. 2017; pp 6.2.1–6.2.4.
- (23) Aziz, A. et al. Computing with ferroelectric FETs: Devices, models, systems, and applications. *Proc. 2018 Des. Autom. Test Eur. Conf. Exhib. DATE 2018*, 2018-Janua, 1289–1298.
- (24) Kim, M.-K.; Lee, J.-S. Ferroelectric Analog Synaptic Transistors. *Nano Lett.* **2019**, *19*, 2044–2050.
- (25) Mulaosmanovic, H.; Ocker, J.; Muller, S.; Noack, M.; Muller, J.; Polakowski, P.; Mikolajick, T.; Slesazeck, S. Novel ferroelectric FET based synapse for neuromorphic systems. *Dig. Tech. Pap. - Symp. VLSI Technol.* **2017**, T176–T177.
- (26) Slesazeck, S.; Schroeder, U.; Mikolajick, T. Embedding hafnium oxide based FeFETs in the memory landscape. 2018 Int. Conf. IC Des. Technol. 2018; pp 121–124.
- (27) Burr, G. W.; Narayanan, P.; Shelby, R. M.; Sidler, S.; Boybat, I.; di Nolfo, C.; Leblebici, Y. Large-scale neural networks implemented with non-volatile memory as the synaptic weight element: Comparative performance analysis (accuracy, speed, and power). 2015 IEEE Int. Electron Devices Meet. 2015; pp 4.4.1–4.4.4.
- (28) Böscke, T. S.; Müller, J.; Bräuhaus, D.; Schröder, U.; Böttger, U. Ferroelectricity in hafnium oxide thin films. *Appl. Phys. Lett.* **2011**, *99*, 0–3.
- (29) Chanthbouala, A.; Garcia, V.; Cherifi, R. O.; Bouzehouane, K.; Fusil, S.; Moya, X.; Xavier, S.; Yamada, H.; Deranlot, C.; Mathur, N. D.; Bibes, M.; Barthélémy, A.; Grollier, J. A ferroelectric memristor. *Nat. Mater.* **2012**, *11*, 860–864.
- (30) Ambriz-Vargas, F.; Kolhatkar, G.; Broyer, M.; Hadj-Youssef, A.; Nouar, R.; Sarkissian, A.; Thomas, R.; Gomez-Yáñez, C.; Gauthier, M. A.; Ruediger, A. A Com-

- plementary Metal Oxide Semiconductor Process-Compatible Ferroelectric Tunnel Junction. *ACS Appl. Mater. Interfaces* **2017**, *9*, 13262–13268.
- (31) Chen, L.; Wang, T.-Y.; Dai, Y.-W.; Cha, M.-Y.; Zhu, H.; Sun, Q.-Q.; Ding, S.-J.; Zhou, P.; Chua, L.; Zhang, D. W. Ultra-low power  $\text{Hf}_{0.5}\text{Zr}_{0.5}\text{O}_2$  based ferroelectric tunnel junction synapses for hardware neural network applications. *Nanoscale* **2018**, *10*, 15826–15833.
- (32) Tian, X.; Toriumi, A. New opportunity of ferroelectric tunnel junction memory with ultrathin  $\text{HfO}_2$ -based oxides. 2017 IEEE Electron Devices Technol. Manuf. Conf. 2017; pp 36–64.
- (33) Goh, Y.; Jeon, S. The effect of the bottom electrode on ferroelectric tunnel junctions based on CMOS-compatible  $\text{HfO}_2$ . *Nanotechnology* **2018**, *29*.
- (34) Mulaosmanovic, H.; Ocker, J.; Müller, S.; Schroeder, U.; Müller, J.; Polakowski, P.; Flachowsky, S.; van Bentum, R.; Mikolajick, T.; Slesazeck, S. Switching Kinetics in Nanoscale Hafnium Oxide Based Ferroelectric Field-Effect Transistors. *ACS Appl. Mater. Interfaces* **2017**, *9*, 3792–3798.
- (35) Sharma, P.; Tapily, K.; Saha, A. K.; Zhang, J.; Shaughnessy, A.; Aziz, A.; Snider, G. L.; Gupta, S.; Clark, R. D.; Datta, S. Impact of total and partial dipole switching on the switching slope of gate-last negative capacitance FETs with ferroelectric hafnium zirconium oxide gate stack. Dig. Tech. Pap. - Symp. VLSI Technol. 2017; pp T154–T155.
- (36) Krivokapic, Z. et al. 14nm Ferroelectric FinFET technology with steep subthreshold slope for ultra low power applications. 2017 IEEE Int. Electron Devices Meet. 2017; pp 15.1.1–15.1.4.
- (37) Mo, F.; Tagawa, Y.; Jin, C.; Ahn, M.; Saraya, T.; Hiramoto, T.; Kobayashi, M. Experimental Demonstration of Ferroelectric  $\text{HfO}_2$  FET with Ultrathin-body IGZO for

High-Density and Low-Power Memory Application. *2019 Symp. VLSI Technol.* **2019**, T42–T43.

- (38) Tian, X.; Shibayama, S.; Nishimura, T.; Yajima, T.; Migita, S.; Toriumi, A. Evolution of ferroelectric HfO<sub>2</sub> in ultrathin region down to 3 nm. *Appl. Phys. Lett.* **2018**, *112*, 102902.
- (39) Chernikova, A.; Kozodaev, M.; Markeev, A.; Negrov, D.; Spiridonov, M.; Zarubin, S.; Bak, O.; Buragohain, P.; Lu, H.; Suvorova, E.; Gruverman, A.; Zenkevich, A. Ultrathin Hf<sub>0.5</sub>Zr<sub>0.5</sub>O<sub>2</sub> Ferroelectric Films on Si. *ACS Appl. Mater. Interfaces* **2016**, *8*, 7232–7237.
- (40) Kwang-Ho Kim,; Jin-Ping Han,; Soon-Won Jung,; Tso-Ping Ma, Ferroelectric DRAM (FEDRAM) FET with metal/SrBi<sub>2</sub>Ta<sub>2</sub>O<sub>9</sub>/SiN/Si gate structure. *IEEE Electron Device Lett.* **2002**, *23*, 82–84.
- (41) Sakai, S.; Ilangovan, R. Metal–Ferroelectric–Insulator–Semiconductor Memory FET With Long Retention and High Endurance. *IEEE Electron Device Lett.* **2004**, *25*, 369–371.
- (42) Takahashi, K.; Aizawa, K.; Park, B.-E.; Ishiwara, H. Thirty-Day-Long Data Retention in Ferroelectric-Gate Field-Effect Transistors with HfO<sub>2</sub> Buffer Layers. *Jpn. J. Appl. Phys.* **2005**, *44*, 6218–6220.
- (43) Kaneko, Y.; Nishitani, Y.; Tanaka, H.; Ueda, M.; Kato, Y.; Tokumitsu, E.; Fujii, E. Correlated motion dynamics of electron channels and domain walls in a ferroelectric-gate thin-film transistor consisting of a ZnO/Pb(Zr,Ti)O<sub>3</sub> stacked structure. *J. Appl. Phys.* **2011**, *110*.
- (44) Yu, S.; Chen, P. Y.; Cao, Y.; Xia, L.; Wang, Y.; Wu, H. Scaling-up resistive synaptic arrays for neuro-inspired architecture: Challenges and prospect. *Tech. Dig. - Int. Electron Devices Meet. IEDM* **2015**, 2016-Febru, 17.3.1–17.3.4.

- (45) Cristoloveanu, S.; Munteanu, D.; Liu, M. A review of the pseudo-MOS transistor in SOI wafers: operation, parameter extraction, and applications. *IEEE Trans. Electron Devices* **2000**, *47*, 1018–1027.
- (46) O'Connor, É.; Halter, M.; Eltes, F.; Sousa, M.; Kellock, A.; Abel, S.; Fompeyrine, J. Stabilization of ferroelectric  $\text{Hf}_x\text{Zr}_{1-x}\text{O}_2$  films using a millisecond flash lamp annealing technique. *APL Mater.* **2018**, *6*, 121103.
- (47) Materlik, R.; Künneth, C.; Kersch, A. The origin of ferroelectricity in  $\text{Hf}_{1-x}\text{Zr}_x\text{O}_2$  : A computational investigation and a surface energy model. *J. Appl. Phys.* **2015**, *117*, 134109.
- (48) Salje, E.; Viswanathan, K. Physical properties and phase transitions in  $\text{WO}_3$ . *Acta Crystallogr. Sect. A* **1975**, *31*, 356–359.
- (49) Kim, H. J.; Park, M. H.; Kim, Y. J.; Lee, Y. H.; Moon, T.; Kim, K. D.; Hyun, S. D.; Hwang, C. S. A study on the wake-up effect of ferroelectric  $\text{Hf}_{0.5}\text{Zr}_{0.5}\text{O}_2$  films by pulse-switching measurement. *Nanoscale* **2016**, *8*, 1383–1389.
- (50) Brotherton, S. D. *Introd. to Thin Film Transistors*; Springer International Publishing: Heidelberg, 2013; Vol. 9783319000; pp 9–44.
- (51) Bang, S.; Lee, S.; Park, J.; Park, S.; Jeong, W.; Jeon, H. Investigation of the effects of interface carrier concentration on ZnO thin film transistors fabricated by atomic layer deposition. *J. Phys. D. Appl. Phys.* **2009**, *42*, 235102.
- (52) Nakata, M.; Tsuji, H.; Sato, H.; Nakajima, Y.; Fujisaki, Y.; Takei, T.; Yamamoto, T.; Fujikake, H. Influence of Oxide Semiconductor Thickness on Thin-Film Transistor Characteristics. *Jpn. J. Appl. Phys.* **2013**, *52*, 03BB04.
- (53) Goetzberger, A.; Nicollian, E. H. TRANSIENT VOLTAGE BREAKDOWN DUE TO AVALANCHE IN MIS CAPACITORS. *Appl. Phys. Lett.* **1966**, *9*, 444–446.

- (54) Gokmen, T.; Onen, O. M.; Haensch, W. Training Deep Convolutional Neural Networks with Resistive Cross-Point Devices. **2017**, 1–22.
- (55) Obradovic, B.; Rakshit, T.; Hatcher, R.; Kittl, J.; Sengupta, R.; Hong, J. G.; Rodder, M. S. A Multi-Bit Neuromorphic Weight Cell Using Ferroelectric FETs, suitable for SoC Integration. *IEEE J. Electron Devices Soc.* **2018**, *6*, 438–448.
- (56) Oh, S.; Kim, T.; Kwak, M.; Song, J.; Woo, J.; Jeon, S.; Yoo, I. K.; Hwang, H.  $\text{HfZrO}_x$  - based ferroelectric synapse device with 32 levels of conductance states for neuromorphic applications. *IEEE Electron Device Lett.* **2017**, *38*, 732–735.
- (57) Gong, N.; Idé, T.; Kim, S.; Boybat, I.; Sebastian, A.; Narayanan, V.; Ando, T. Signal and noise extraction from analog memory elements for neuromorphic computing. *Nat. Commun.* **2018**, *9*, 2102.

# Supporting information for: A back-end, CMOS compatible ferroelectric Field Effect Transistor for synaptic weights

Mattia Halter,<sup>\*,†,‡</sup> Laura Bégon-Lours,<sup>†</sup> Valeria Bragaglia,<sup>†</sup> Marilyne Sousa,<sup>†</sup> Bert  
Jan Offrein,<sup>†</sup> Stefan Abel,<sup>†</sup> Mathieu Luisier,<sup>‡</sup> and Jean Fompeyrine<sup>†</sup>

<sup>†</sup>*IBM Research GmbH - Zurich Research Laboratory, CH-8803 Rüschlikon, Switzerland*

<sup>‡</sup>*Integrated Systems Laboratory, ETH Zurich, CH-8092 Zurich, Switzerland*

E-mail: att@zurich.ibm.com



Figure S1: Endurance measurements of 10 nm HZO. The total remanent polarization ( $P = |P_{r-}| + |P_{r+}|$ ) was determined by positive up negative down (PUND) measurements with 1 kHz and  $\pm 3.5$  V. The cycling frequency was set to 1 kHz up to  $10^4$  cycles, 10 kHz up to  $10^5$  cycles and 100 kHz for cycles above  $10^5$ : (a) The TiN/HZO/TiN MFM configuration was cycled at  $\pm 3.5$  V. (b) The W/WO<sub>x</sub>/HZO/TiN MSFM configuration was cycled at  $\pm 3.0$  V with a  $-0.5$  V offset.



Figure S2: Cross-sectional BF-STEM images of the samples from the WO<sub>x</sub> thickness series: (a)  $d_{\text{WO}_x} = 11.3$  nm, (b)  $d_{\text{WO}_x} = 15$  nm



Figure S3: P-V measurements on the samples from the  $WO_x$  thickness series. Polarization versus voltage ( $P - V$ ) characteristics measured on  $60 \mu\text{m} \times 60 \mu\text{m}$  W/ $WO_x$ /HZO/TiN/n<sup>+</sup>Si MFM structures at 5 kHz in the pristine state and after  $10^5$  cycles: (a)  $d_{WO_x} = 11.3 \text{ nm}$ , (b)  $d_{WO_x} = 15 \text{ nm}$ .



Figure S4: Write and read schematic showing a semiconductor parameter analyzer (B1500) with the waveform generator/fast measurement unit (WGFMU) and its two remote-sense and switch units (RSU): A state is written by applying a pulse to source (S) and drain (D) while the gate (G) is grounded. The channel resistance is read by applying an IV-sweep from -200 mV to 200 mV to D while having S connected to the common ground through SMU2.



Figure S5: Capacitance measurements on a  $60\text{ }\mu\text{m} \times 60\text{ }\mu\text{m}$  (a) TiN/WO<sub>x</sub>/TiN MFM and (b) W/WO<sub>x</sub>/HZO/TiN/n<sup>+</sup>Si MSFM structure

## WO<sub>x</sub> permittivity

The permittivity of WO<sub>x</sub> ( $\epsilon_{WOx} = 189$ ) was calculated using the following equation of two capacitances in series:

$$\frac{1}{C_{WOxHZO}} = \frac{1}{C_{HZO}} + \frac{d_{WOx}}{\epsilon_0 * \epsilon_{WOx} * A}, \quad (1)$$

where  $C_{WOxHZO}$  is the capacitance of the W/WO<sub>x</sub>/HZO/TiN stack,  $C_{HZO}$  the capacitance of TiN/HZO/TiN stack,  $d_{WOx} = 8\text{ nm}$  the thickness of the WO<sub>x</sub> channel,  $\epsilon_0$  the vacuum permittivity and  $A = 3600\text{ }\mu\text{m}^2$  the area of the capacitor. From Figure S5a we get  $C_{HZO} = 1.13 \times 10^{-10}\text{ F}$  and from Figure S5b we get  $C_{WOxHZO} = 9.9 \times 10^{-11}\text{ F}$ .



Figure S6: Depletion width as a function of carrier concentration  $N_D$  calculated for different  $V_{GS}$