

# Introduction to Microfabrication

# Introduction to Microfabrication

## Second Edition

Sami Franssila

*Professor of Materials Science at Aalto University  
and Adjunct Professor of Micro- and Nanotechnology  
at University of Helsinki, Finland*



A John Wiley and Sons, Ltd., Publication

This edition first published 2010  
© 2010, John Wiley & Sons, Ltd

First Edition published in 2004

*Registered office*

John Wiley & Sons Ltd, The Atrium, Southern Gate, Chichester, West Sussex, PO19 8SQ, United Kingdom

For details of our global editorial offices, for customer services and for information about how to apply for permission to reuse the copyright material in this book please see our website at [www.wiley.com](http://www.wiley.com).

The right of the author to be identified as the author of this work has been asserted in accordance with the Copyright, Designs and Patents Act 1988.

All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, except as permitted by the UK Copyright, Designs and Patents Act 1988, without the prior permission of the publisher.

Wiley also publishes its books in a variety of electronic formats. Some content that appears in print may not be available in electronic books.

Designations used by companies to distinguish their products are often claimed as trademarks. All brand names and product names used in this book are trade names, service marks, trademarks or registered trademarks of their respective owners. The publisher is not associated with any product or vendor mentioned in this book. This publication is designed to provide accurate and authoritative information in regard to the subject matter covered. It is sold on the understanding that the publisher is not engaged in rendering professional services. If professional advice or other expert assistance is required, the services of a competent professional should be sought.

*Library of Congress Cataloguing-in-Publication Data*

Franssila, Sami.

Introduction to microfabrication / Sami Franssila. – 2nd ed.

p. cm.

Includes index.

ISBN 978-0-470-74983-8 (cloth)

1. Microelectromechanical systems. 2. Integrated circuits. 3. Semiconductor processing. 4. Nanotechnology.

5. Microfabrication. I. Title.

TK7875.F73 2010

621.381 – dc22

2010010076

A catalogue record for this book is available from the British Library.

ISBN: 978-0-470-74983-8

Set in 9/11pt Times by Laserwords Private Limited, Chennai, India  
Printed and Bound in Markono Print Media Pte Ltd, Singapore

# Contents

|                                                                                                                                                                                                                                                                                          |      |                                                                                                                                                                                                                                                                                                                                                                                                         |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <i>Preface to the First Edition</i>                                                                                                                                                                                                                                                      | ix   | Polysilicon, Oxide and Nitride Thin Films, Polymer Films, Advanced Thin Films, Exercises                                                                                                                                                                                                                                                                                                                |
| <i>Preface to the Second Edition</i>                                                                                                                                                                                                                                                     | xiii | References and Related Reading                                                                                                                                                                                                                                                                                                                                                                          |
| <i>Acknowledgements</i>                                                                                                                                                                                                                                                                  | xv   |                                                                                                                                                                                                                                                                                                                                                                                                         |
| <b>1 Introduction</b>                                                                                                                                                                                                                                                                    | 1    | <b>6 Epitaxy</b> 69<br>Heteroepitaxy, Epitaxial Deposition, CVD<br>Homoepitaxy of Silicon, Doping of Epilayers,<br>Measurement of Epitaxial Deposition, Simulation of<br>Epitaxy, Advanced Epitaxy, Exercises<br>References and Related Reading                                                                                                                                                         |
| Substrates, Thin Films, Processes, Dimensions,<br>Devices, MOS Transistor, Cleanliness and Yield,<br>Industries, Exercises<br>References and Related Reading                                                                                                                             |      |                                                                                                                                                                                                                                                                                                                                                                                                         |
| <b>2 Micrometrology and Materials<br/>Characterization</b>                                                                                                                                                                                                                               | 15   | <b>7 Advanced Thin Films</b> 77<br>General Features of Thin-Film Processes, Film<br>Growth and Structure, Thin-Film Structure<br>Characterization, Surfaces and Interfaces, Adhesion,<br>Two-Layer Films, Alloys and Doped Films,<br>Multilayer Films, Selective Deposition, Reacted<br>Films, Simulation of Deposition, Thickness Limits<br>of Thin Films, Exercises<br>References and Related Reading |
| Microscopy and Visualization, Lateral and Vertical<br>Dimensions, Optical Techniques, Electrical<br>Measurements, Physical and Chemical Analyses,<br>Practical Issues with Micrometrology,<br>Measurements Everywhere, Exercises<br>References and Related Reading                       |      |                                                                                                                                                                                                                                                                                                                                                                                                         |
| <b>3 Simulation of Microfabrication Processes</b>                                                                                                                                                                                                                                        | 29   | <b>8 Pattern Generation</b> 93<br>Pattern Generators, Electron Beam Lithography,<br>Laser Pattern Generators, Photomask Fabrication,<br>Photomask Inspection, Defects and Repair,<br>Photomasks as Tools, Other Pattern Generation<br>Methods, Exercises<br>References and Related Reading                                                                                                              |
| Simulator Types, Levels of Simulation, The 1D<br>Simulators, The 2D Simulators, The 3D Simulators,<br>Other Simulation Needs in Microfabrication,<br>Exercises<br>References and Related Reading                                                                                         |      |                                                                                                                                                                                                                                                                                                                                                                                                         |
| <b>4 Silicon</b>                                                                                                                                                                                                                                                                         | 35   | <b>9 Optical Lithography</b> 103<br>Lithography Process Flow, Resist Chemistry, Resist<br>Application, Alignment and Overlay, Exposure,<br>Resist Profile, Resolution, Process Latitude, Basic<br>Pattern Shapes, Lithography Practice, Photoresist<br>Stripping, Exercises<br>References and Related Reading                                                                                           |
| Silicon Material Properties, Silicon Crystal Growth,<br>Silicon Crystal Structure, Silicon Wafering Process,<br>Defects and Non-Idealities in Silicon Crystals,<br>Advanced Wafers, Exercises<br>References and Related Reading                                                          |      |                                                                                                                                                                                                                                                                                                                                                                                                         |
| <b>5 Thin-Film Materials and Processes</b>                                                                                                                                                                                                                                               | 47   | <b>10 Advanced Lithography</b> 115<br>Projection Optical Systems, Resolution of Projection<br>Optical Systems, Resists, Thin-Film Optics in<br>Resists, Lithography Over Steps, Optical Extensions                                                                                                                                                                                                      |
| Thin Films vs. Bulk Materials, Physical Vapor<br>Deposition, Chemical Vapor Deposition, PECVD:<br>Plasma-Enhanced CVD, ALD: Atomic Layer<br>Deposition, Electrochemical Deposition (ECD),<br>Other Methods, Thin Films Over Topography: Step<br>Coverage, Stresses, Metallic Thin Films, |      |                                                                                                                                                                                                                                                                                                                                                                                                         |

|                                                                                                                                                                                                                                                                                                                                                 |  |  |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--|--|
| of Optical Lithography, Non-Optical Extension of Optical Lithography, Lithography Simulation, Lithography Triangles, Exercises References and Related Reading                                                                                                                                                                                   |  |  |
| <b>11 Etching</b> 127                                                                                                                                                                                                                                                                                                                           |  |  |
| Etch Mechanisms, Etching Profiles, Anisotropic Wet Etching, Wet Etching, Plasma Etching (RIE), Isotropic Dry Etching, Etch Masks, Non-Masked Etching, Multistep and Multilayer Etching, Etch Processes for Common Materials, Ion Beam Etching, Etch Process Characteristics, Selecting Etch Processes, Exercises References and Related Reading |  |  |
| <b>12 Wafer Cleaning and Surface Preparation</b> 143                                                                                                                                                                                                                                                                                            |  |  |
| Classes of Contamination, Chemical Wet Cleaning, Physical Wet Cleaning, Rinsing and Drying, Dry Cleaning, Particle Removal, Organics Removal, Metal Removal, Contact Angle, Surface Preparation, Exercises References and Related Reading                                                                                                       |  |  |
| <b>13 Thermal Oxidation</b> 153                                                                                                                                                                                                                                                                                                                 |  |  |
| Thermal Oxidation Process, Deal–Grove Oxidation Model, Oxidation of Polysilicon, Oxide Structure, Local Oxidation of Silicon, Stress and Pattern Effects in Oxidation, Simulation of Oxidation, Thermal Oxides vs. other Oxides, Exercises References and Related Reading                                                                       |  |  |
| <b>14 Diffusion</b> 165                                                                                                                                                                                                                                                                                                                         |  |  |
| Diffusion Process, Diffusion Mechanisms, Doping of Polysilicon, Doping Profiles in Diffusion, Diffusion Applications, Simulation of Diffusion, Diffusion at Large, Exercises References and Related Reading                                                                                                                                     |  |  |
| <b>15 Ion Implantation</b> 173                                                                                                                                                                                                                                                                                                                  |  |  |
| The Implantation Process, Implant Applications, Implant Damage and Damage Annealing, Tools for Ion Implantation, Ion Implantation Simulation, Implantation Further, Exercises References and Related Reading                                                                                                                                    |  |  |
| <b>16 CMP: Chemical–Mechanical Polishing</b> 181                                                                                                                                                                                                                                                                                                |  |  |
| CMP Process and Tool, Mechanics of CMP, Chemistry of CMP, Non-Idealities in CMP, Monitoring CMP Processes, Applications of CMP, CMP as a Whole, Exercises References and Related Reading                                                                                                                                                        |  |  |
| <b>17 Bonding</b> 191                                                                                                                                                                                                                                                                                                                           |  |  |
| Bonding Basics, Fusion Bonding Blanket Silicon Wafers, Anodic Bonding, Metallic Bonding, Adhesive Bonding, Layer Transfer and Temporary Bonding, Bonding of Structured Wafers, Bond Quality Measurements, Bonding for Packaging, Bonding at Large, Exercises References and Related Reading                                                     |  |  |
| <b>18 Polymer Microprocessing</b> 203                                                                                                                                                                                                                                                                                                           |  |  |
| Polymer Materials, Polymer Thermal Properties, Thick-Resist Lithography, Molding Techniques, Hot Embossing, Nanoimprint Lithography, Masters for Replication, Processing on Polymers, Polymer Bonding, Polymer Devices, Polymer Overview, Exercises References and Related Reading                                                              |  |  |
| <b>19 Glass Microprocessing</b> 225                                                                                                                                                                                                                                                                                                             |  |  |
| Structure and Properties of Glasses, Glass Substrates, General Processing Issues with Glasses, Glass Etching, Glass Bonding, Glass Devices, Specialty Glasses, Exercises References and Further Reading                                                                                                                                         |  |  |
| <b>20 Anisotropic Wet Etching</b> 237                                                                                                                                                                                                                                                                                                           |  |  |
| Basic Structures on <100> Silicon, Etchants, Etch Masks and Protective Coatings, Etch Rate and Etch Stop, Front-Side Processed Structures, Convex Corner Etching, Membrane Fabrication, Through-Wafer Structures, <110> Etching, <111> Silicon Etching, Comparison of <100>, <110> and <111> Etching, Exercises References and Related Reading  |  |  |
| <b>21 Deep Reactive Ion Etching</b> 255                                                                                                                                                                                                                                                                                                         |  |  |
| RIE Process Capabilities, RIE Process Physics and Chemistry, Deep Etching, Combining Anisotropic and Isotropic DRIE, Microneedles and Nozzles, Sidewall Quality, Pattern Size and Pattern Density Effects, Etch Residues and Damage, DRIE vs. Wet Etching, Exercises References and Related Reading                                             |  |  |
| <b>22 Wafer Engineering</b> 271                                                                                                                                                                                                                                                                                                                 |  |  |
| Silicon Crystals, Gettering, Wafer Mechanical Specifications, Epitaxial Wafers, SOI Wafers, Bonding Mechanics, Advanced Wafers, Variety of Wafers, Exercises References and Further Reading                                                                                                                                                     |  |  |

---

|                                           |     |                                                                                                                                                                                                                                                                                                                                                                            |
|-------------------------------------------|-----|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <b>23 Special Processes and Materials</b> | 283 | Membranes and Bridges, Stiction, Multiple Layer Structures, Rotating Structures, Hinged Structures, CMOS Wafers as Substrates, Exercises<br>References and Related Reading                                                                                                                                                                                                 |
| <b>24 Serial Microprocessing</b>          | 299 | Focused Ion Beam (FIB) Processing, Focused Electron Beam (FEB) Processing, Laser Direct Writing, AFM Patterning, Ink Jetting, Mechanical Structuring, Chemical and Chemomechanical Machining Scaled Down, Conclusions, Exercises<br>References and Further Reading                                                                                                         |
| <b>25 Process Integration</b>             | 313 | The Two Sides of the Wafer, Device Example 1: Solar Cell, Device Example 2: Microfluidic Sieves, Wafer Selection, Masks and Lithography, Design Rules, Resistors, Device Example 3: PCR Reactor, Device Example 4: Integrated Passive Chip, Contamination Budget, Thermal Processes, Metallization, Passivation and Packaging, Exercises<br>References and Related Reading |
| <b>26 MOS Transistor Fabrication</b>      | 329 | Polysilicon Gate CMOS, Polysilicon Gate CMOS: 10 $\mu\text{m}$ to 1 $\mu\text{m}$ Generations, MOS Transistor Scaling, CMOS from 0.8 $\mu\text{m}$ to 65 nm, Gate Module, SOI MOSFETs, Thin-Film Transistors, Integrated Circuits, Exercises<br>References and Related Reading                                                                                             |
| <b>27 Bipolar Transistors</b>             | 347 | Fabrication Process of SBC Bipolar Transistor, Advanced Bipolar Structures, Lateral Isolation, BiCMOS Technology, Cost of Integration, Exercises<br>References and Related Reading                                                                                                                                                                                         |
| <b>28 Multilevel Metallization</b>        | 357 | Two-Level Metallization, Planarized Multilevel Metallization, Copper Metallization, Dual Damascene Metallization, Low- $k$ Dielectrics, Metallization Scaling, Exercises<br>References and Related Reading                                                                                                                                                                 |
| <b>29 Surface Micromachining</b>          | 369 | Single Structural Layer Devices, Materials for Surface Micromachining, Mechanics of Free-Standing Films, Cantilever Structures,<br>References and Related Reading                                                                                                                                                                                                          |
| <b>30 MEMS Process Integration</b>        | 387 | Silicon Microbridges, Double-Sided Processing, Membrane Structures, Piezoresistive Pressure Sensor, Tilting and Bending Through-Wafer Etched Structures, Needles and Tips, Channels and Nozzles, Bonded Structures, Surface Micromachining Combined with Bulk Micromachining, MEMS Packaging, Microsystems, Exercises<br>References and Related Reading                    |
| <b>31 Process Equipment</b>               | 409 | Batch Processing vs. Single Wafer Processing, Process Regimes: Temperature and Pressure, Cluster Tools and Integrated Processing, Measuring Fabrication Processes, Equipment Figures of Merit, Simulation of Process Equipment, Tool Lifecycles, Cost of Ownership, Exercises<br>References and Related Reading                                                            |
| <b>32 Equipment for Hot Processes</b>     | 419 | High-Temperature Equipment: Hot Wall vs. Cold Wall, Furnace Processes, Rapid Thermal Processing/Rapid Thermal Annealing, Furnaces vs. RTP Systems, Exercises<br>References and Related Reading                                                                                                                                                                             |
| <b>33 Vacuum and Plasmas</b>              | 425 | Vacuum Physics and Kinetic Theory of Gases, Vacuum Production, Plasma Etching, Sputtering, Residual Gas Incorporation into Deposited Film, PECVD, Residence Time, Exercises<br>References and Related Reading                                                                                                                                                              |
| <b>34 CVD and Epitaxy Equipment</b>       | 433 | Deposition Rate, CVD Rate Modeling, CVD Reactors, CVD with Liquid Sources, Silicon CVD Epitaxy, Epitaxial Reactors, Control of CVD Reactions, Exercises<br>References and Related Reading                                                                                                                                                                                  |
| <b>35 Cleanrooms</b>                      | 441 | Cleanroom Construction, Cleanroom Standards, Cleanroom Subsystems, Environment, Safety and Health (ESH), Cleanroom Operating Procedures, Mini-Environments, Exercises<br>References and Related Reading                                                                                                                                                                    |

|                                                                                                                                                                                                                                        |     |                                                                                                                                                                                                                 |
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <b>36 Yield and Reliability</b>                                                                                                                                                                                                        | 449 | Materials Challenges, Statistics and Yield, Limits of Scaling, Exercises<br>References and Related Reading                                                                                                      |
| Yield Definitions and Formulas, Yield Models,<br>Yield Ramping, Package Reliability, Metallization<br>Reliability, Dielectric Defects and Quality,<br>Stress Migration, Die Yield Loss,<br>Exercises<br>References and Related Reading |     |                                                                                                                                                                                                                 |
| <b>37 Economics of Microfabrication</b>                                                                                                                                                                                                | 457 | New Devices, Proliferation of MEMS,<br>Microfluidics, BioMEMS, Bonding and 3D<br>Integration, IC-MEMS Integration, Microfabricated<br>Devices for Microfabrication, Exercises<br>References and Related Reading |
| Silicon, IC Costs and Prices, IC Industry, IC Wafer<br>fabs, MEMS Industry, Flat-Panel Display Industry,<br>Solar Cells, Magnetic Data Storage, Short Term and<br>Long Term, Exercises<br>References and Related Reading               |     |                                                                                                                                                                                                                 |
| <b>38 Moore's Law and Scaling Trends</b>                                                                                                                                                                                               | 469 | From Transistor to Integrated Circuit, Historical<br>Development of IC Manufacturing, MOS Scaling,<br>Departure from Planar Bulk Technology,<br>Memories, Lithography Future, Moore's Law,                      |
|                                                                                                                                                                                                                                        |     |                                                                                                                                                                                                                 |
|                                                                                                                                                                                                                                        |     | <b>Appendix A Properties of Silicon</b> <span style="float: right;">499</span>                                                                                                                                  |
|                                                                                                                                                                                                                                        |     | <b>Appendix B Constants and Conversion<br/>Factors</b> <span style="float: right;">501</span>                                                                                                                   |
|                                                                                                                                                                                                                                        |     | <b>Appendix C Oxide and Nitride Thickness<br/>by Color</b> <span style="float: right;">503</span>                                                                                                               |
|                                                                                                                                                                                                                                        |     | <b>Index</b> <span style="float: right;">505</span>                                                                                                                                                             |

# Preface to the First Edition

Microfabrication is generic: its applications include integrated circuits, MEMS, microfluidics, micro-optics, nanotechnology and countless others. Microfabrication is encountered in slightly different guises in all of these applications: electroplating is essential for deep sub-micron IC metallization and for LIGA-microstructures; deep-RIE is a key technology in trench DRAMs and in MEMS; imprint lithography is utilized in microfluidics where typical dimensions are 100 µm, as well as in nanotechnology, where feature sizes are down to 10 nm. This book is unique because it treats microfabrication in its own right, independent of applications, and therefore it can be used in electrical engineering, materials science, physics and chemistry classes alike.

Instead of looking at devices, I have chosen to concentrate on microstructures on the wafer: lines and trenches, membranes and cantilevers, cavities and nozzles, diffusions and epilayers. Lines are sometimes isolated and sometimes in dense arrays, irrespective of linewidths; membranes can be made by timed etching or by etch stop; source/drain diffusions can be aligned to the gate in a mask aligner or made in a self-aligned fashion; oxidation on a planar surface is easy, but the oxidation of topographic features is tricky. The microstructure-view of microfabrication is a solution against outdated: alignment must be considered for both 100 µm fluidic channels and 100 nm CMOS gates, etch undercutting target may be 10 nm or 10 µm, but it is there; dopants will diffuse during high temperature anneals, but the junction depth target may be tens of nanometres or tens of micrometres.

A common feature of older textbooks is concentration on physics and chemistry: plasma potentials, boundary layers, diffusion mechanisms, Rayleigh resolution, thermodynamic stability and the like. This is certainly a guarantee against outdated in rapidly evolving technologies, but microfabrication is an engineering discipline, not physics and chemistry. CMOS scaling trends have in fact been more reliable than basic physics and chemistry in the past 40 years: optical lithography was predicted to be unable to print submicron lines and

gate oxides today are thinner than the ultimate limits conceived in the 1970s. And it is pedagogically better to show applications of CVD films before plunging into pressure dependence of deposition rate, and to discuss metal film functionalities before embracing sputtering yield models.

In this book, another major emphasis is on materials. Materials are universal, and not outdated rapidly. New materials are, of course, being introduced all the time, but the basic materials properties like resistivity, dielectric constant, coefficient of thermal expansion and Young's modulus must always be considered for low-k and high-k dielectrics, SnO<sub>2</sub> sensor films, diamond coatings and 100 µm-thick photoresists alike. Silicon, silicon dioxide, silicon nitride, aluminium, tungsten, copper and photoresist will be met again in various applications: nitride is used not only in LOCOS isolation, but also in MEMS thermal isolation; aluminium not only serves as a conductor in ICs but also as a mirror in MOEMS; copper is used for IC metallization and also as a sacrificial layer under nickel in metal MEMS; photoresist acts not only as a photoactive material but also as an adhesive in wafer bonding.

Devices are, of course, discussed but from the fabrication viewpoint, without thorough device physics. The unifying idea is to discuss the commonalities and generic features of the fabrication processes. Resistors and capacitors serve to exemplify concepts like alignment sequence and design rules, or interface stability. After basic processes and concepts have been introduced, process integration examples show a wide spectrum of full process flows: for example, solar cell, piezoresistive pressure sensor, CMOS, AFM cantilever tip, microfluidic out-of-plane needle and super-self-aligned bipolar transistor. Small process-sequence examples include, similarly, a variety of structures: replacement gate, cavity sealing, self-aligned rotors and dual damascene-low-k options are among the others.

Older textbooks present microfabrication as a toolbox of MEMS or as the technology for CMOS manufacturing. Both approaches lead to unsatisfactory views on

microfabrication. Ten years ago, chemical-mechanical polishing was not detailed in textbooks, and five years ago discussion on CMP was included in multilevel metallization chapter. Today, CMP is a generic technology that has applications in CMOS front-end device isolation and surface micromechanics, and is used to fabricate photonic crystals and superconducting devices. It therefore deserves a chapter of its own, independent of actual or potential applications. Similarly, wafer cleaning used to be presented as a preparatory step for oxidation, but it is also essential for epitaxy, wafer bonding and CMP. Device-view, be it CMOS or some other, limits processes and materials to a few known practices, and excludes many important aspects that are fruitful in other applications.

The aim of the book is for the student to feel comfortable both in a megafab and in a student lab. This means that both research-oriented and manufacturing-driven aspects of microfabrication must be covered. In order to keep the amount of material manageable, many things have had to be left out: high density plasmas are mentioned, but the emphasis is on plasma processing in general; KOH and TMAH etching are both described, but commonalities rather than differences are shown; imprint lithography and hot embossing are discussed but polymer rheology is neglected; alternatives to optical lithography are mentioned, but discussed only briefly. Emphasis is on common and conceptual principles, and not on the latest technologies, which hopefully extends the usable life of the book.

## Structure of the Book

The structure of this book differs from the traditional structure in many ways. Instead of discussing individual process steps at length first and putting full processes together in the last chapter, applications are presented throughout the book. The chapters on equipment are separated from the chapters on processes in order to keep the basic concepts and current practical implementations apart.

The introduction covers materials, processes, devices and industries. Measurements are presented next, and more examples of measurement needs in microfabrication are presented in almost every chapter. A general discussion of simulation follows, and more specific simulation cases are presented in the chapters that follow.

Materials of microfabrication are presented next: silicon and thin films. Silicon crystal growth is shortly covered but from the very beginning, the discussion centres on wafers and structures on wafers: therefore, silicon wafering process, and resulting wafer properties

are emphasized. Epitaxy, CVD, PVD, spin coating and electroplating are discussed, with resulting materials properties and microstructures on the centre stage, rather than equipment themselves. Lithography and etching then follow. This order of presentation enables more realistic examples to be discussed early on.

The basic steps in silicon technology, such as oxidation, diffusion and ion implantation are discussed next, followed by CMP and bonding. Moulding and stamping techniques have also been included. In contrast to older books, and to books with CMOS device emphasis, this book is strong in back-end steps, thin films, etching, planarization and novel materials. This reflects the growing importance of multilevel metallization in ICs as well as the generic nature of etch and deposition processes, and their wide applicability in almost all microfabrication fields. Packaging is not dealt with, again in line with wafer-level view of microfabrication. This also excludes stereomicrolithography and many miniaturized traditional techniques like microelectrodischarge machining.

Microfabrication is an engineering discipline, and volume manufacturing of microdevices must be discussed. Discussions on process equipment have often been bogged by the sheer number of different designs: should the students be shown both 13.56 MHz diode etcher, triode, microwave, ECR, ICP and helicon plasmas, and should APCVD, LPCVD, SA-CVD, UHV-CVD and PECVD reactors all be presented? In this book, the process equipment discussion is again tied to structures that result on wafers, rather than in the equipment *per se*: base vacuum interaction with thin-film purity is discussed; the role of RTP temperature uniformity on wafer stresses is considered; and surface reaction versus transport controlled growth in different CVD reactors is analysed. Cleanroom technology, wafer fab operations, yield and cost are also covered. Moore's law and other trends expose students to some current and future issues in microfabrication processes, materials and applications.

In many cases, treatment has been divided into two chapters: for example, Chapter 5 treats thin film basics, and Chapter 7 deals with more advanced topics. Lithography and etching have been divided similarly. This enables short or long course versions to be designed around the book. The figures from the book are available to teachers via the Internet. Please register at Wiley for access [www.wileyeurope.com/go/microfabrication](http://www.wileyeurope.com/go/microfabrication).

## Advice to Students

This book is an introductory text. Basic university physics and chemistry suffices for background. Materials science and electronics courses will of course make many aspects

easier to understand, but the structure of the book does not necessitate them. The book contains 250 homework problems, and in line with the idea of microfabrication as an independent discipline, they are about fabrication processes and microstructures; not about devices. Problems fall mainly in three categories: process design/analysis, simulations and back-of-the-envelope calculations. The problems that are designed to be solved with a simulator are marked by "S". A simple one-dimensional simulator will do. The "ordinary" problems are designed to develop a feeling for orders of magnitude in the microworld: linewidths, resistances, film thicknesses, deposition rates, stresses etc. It is often enough to understand if a process can be done in seconds, minutes or hours; or whether resistance range is milliohms, ohms or kiloohms. You must learn to make simplifying assumptions, and to live with uncertain data. Searching the Internet for answers is no substitute to simple calculations that can be done in minutes because the simple estimates are often as accurate (or inaccurate) as answers culled from Internet. It should be borne in mind that even constants are often not well known: for instance, recent measurements

of silicon melting point have resulted in values 1408 °C by one group, 1410 °C by one, 1412 °C by seven groups, 1413 °C by eight groups and 1416 °C by three groups, and if older works are encountered, values range from 1396 °C to 1444 °C. With thin film materials properties are very much deposition process dependent, and different workers have measured widely different values for such basic properties as resistivity or thermal conductivity. Even larger differences will pop up, if, for instance, the phase of metal film changes from body-centered cubic to  $\beta$ -phase: temperature coefficient of resistivity can then be off by a factor of ten. Polymeric materials, too, exhibit large variation in properties and processing. There are also calculations of economic aspects of microfabrication: wafer cost, chip size and yield. A bit of memory costs next to nothing, but the fabs (fab is short for fabrication facility) that churn out these chip are enormously expensive.

Comments and hints to selected homework problems are given in Appendix A. In Appendix B you can find useful physical constants, silicon material properties and unit conversion factors.

# Preface to the Second Edition

If you search on “microfabrication” in Google Scholar, there will be over 100 000 hits; if you type in “MEMS” in Science Direct, you will get in excess of 300 000 hits; “transistor” in IEEE Xplore produces 60 000 articles; and “thin film” in Scirus, over a million articles. There is obviously no problem finding the scientific literature.

This book is a primer that prepares you to access the primary literature. It is a textbook, not a reference work or an encyclopedia, and even less a review article. This means that the topics and examples are chosen on pedagogic grounds, so many remarkable seminal works and recent breakthroughs are not addressed. It also means that most fabrication processes are not shown in full detail, rather fundamental ideas are described, and nuances and special features are left out, in order to highlight the main ideas. The articles and books in the “References and Further Reading” sections have been selected to be accessible to students, and serve as supplementary reading for assignments and exercises.

This is a fabrication text, and device physics is discussed only briefly. It would be impossible to include device physics because microfabrication can be applied to hundreds of different microdevices. In this book the examples are drawn mostly from CMOS, MEMS, microfluidics and solar cells, but there are examples from hard disk drives, flat panel displays, optics and optoelectronics, DNA chips, bipolar and power semiconductor devices, and nanotechnology.

## What is New in this Second Edition?

There is about 25% more material, and the text and figures have been revised throughout. The basic structure of the book remains unchanged, and the order of chapters is mostly as in the first edition.

Silicon wafer basics are discussed in Chapter 4. More advanced wafers, and wafer behavior during processing, are discussed elsewhere, in Chapters 6 (epitaxy), 17 (bonding) and 22 (wafer engineering). Chapters 5 and

7 on thin films have been reorganized into basic and advanced chapters, with step coverage and stresses now included in the basic chapter.

Pattern generation (Chapter 8) now includes additional material on electron beam lithography. Chapter 9 introduces photoresists and optical lithography, but it is limited to  $1\times$  contact/proximity lithography only. Chapter 10 deals with advanced IC fabrication lithography with reduction steppers and scanners.

Chapter 12, on wafer cleaning and surface preparation, has been expanded to include polymer surface treatments and aspects of fluid behavior on surfaces that are important in fluidics.

Chapter 17 on bonding has been completely rewritten. It now covers fusion, anodic, metallic and adhesive bonding, including process physics and chemistry as well as technical implementations. Much of the more specialized material on silicon fusion bonding and SOI wafers has been moved to Chapter 22 on wafer engineering.

Chapter 18 has been completely rewritten and is now called “Polymer Microprocessing.” In addition to the old material on embossing, imprinting and replica molding, it now contains more on polymer materials properties, thick-resist lithography, polymer bonding and a great number of device examples, especially in fluidics.

Chapter 19, on glass microprocessing, is new, but it incorporates some elements of a dismantled chapter “Processing on Non-silicon Substrates”. The new chapter, however, concentrates on processing glass itself, to make microdevices of glass, in addition to fabricating devices on glass.

Chapter 23, on special processes, is a collection of techniques which are not in the mainstream of microfabrication: namely, niche techniques, non-standard approaches, special materials, and the like. For instance, porous silicon by electrochemical etching has been moved from the etching chapter to this chapter, in order to streamline the basic etching chapter.

Another new chapter, Chapter 24, deals with serial microprocessing. This includes many direct writing and

machining techniques, like focused ion beam processing, laser microfabrication and micromilling and machining.

The core of the book, Chapter 25, on process integration, is now hopefully more general because CMOS- and MEMS-specific issues have been moved to their respective chapters, Chapters 26 and 30. Examples include solar cell devices, fluidic filters, resistors and integrated passive devices and PCR chips for DNA amplification. The MOS chapter now covers thin film transistors (TFTs) as well. Advanced CMOS is treated in Chapter 38 in order to keep the introduction to CMOS simple enough.

The old Chapter 22, on sacrificial structures, has been moved, greatly expanded and renamed. It is now Chapter 29, entitled “Surface Micromachining.” Chapter 30 concerns MEMS process integration. It has been completely rewritten and contains almost 100% more material with many device examples.

Chapter 31 on process equipment has been rewritten with elements from a dismantled chapter on integrated

processing, and with parts from an old Chapter 37, Wafer fab.

Chapter 36 is now called “Yield and Reliability.” It draws on the old chapters on yield, process integration and wafer fab. The discussion has been expanded, especially regarding MEMS reliability.

Each of Chapters 37, 38 and 39 can serve as a concluding chapter, with a slightly different emphasis: Chapter 37, on the economics of microfabrication, centers on costs and markets for CMOS, MEMS, solar and flat-panel displays, and magnetic data storage; Chapter 38, on Moore’s law, deals with scaling trends in ICs; and Chapter 39, on microfabrication at large, concerns the integration of different technologies, materials and functionalities, scaling with “More than Moore.”

Teachers adopting the text will have access to all the figures and tables in the book as PDF slides. Additionally, a solutions manual will be available at [www.wiley.com/go/Franssila\\_Micro2e](http://www.wiley.com/go/Franssila_Micro2e).

# Acknowledgements

A number of people have contributed to this 2<sup>nd</sup> edition in various capacities. Sardar Bilal Alam, Susanna Aura, Kestas Grigoras, Eero Haimi, Klas Hjort, Ville Joksa Jokinen, Jari Koskinen, Heikki Kuisma, Tomi Laurila, Marianne Leinikka, Antti Niskanen, Victor Ovtchinnikov, Ville Saarela, Lauri Sainiemi, Ali Shah, Gianmario Scotti, Pia Suvanto, Markku Tilli and Santeri Tuomikoski each read part of the manuscript and gave input that led to rewriting and reorganizing of the text.

Numerous colleagues and friends have provided assistance in finding material, editing figures, helping with software, looking for articles, and contributing their SEMs and other primary material. These people include Veli-Matti Airaksinen, Tapani Alasaarela, Florence Amez-Droz, Nikolai Chekurov, Nico de Rooij, Kai-Erik Elers, Jean-Christophe Eloy, Martin Gijs, Leif Grönberg, Ulrika Gyllenberg, Atte Haapalinna, Kalle Hanhijärvi, Ole Hansen, Paula Heikkilä, Ari Hokkanen, Scotten

W. Jones, Tord Karlin, Ivan Kassamakov, Hannu Kattelus, Marianna Kemell, Kimmo Kokkonen, Kai Kolari, Jorma Koskinen, Anders Kristensen, Anu Kärkkäinen, Boris Lamontagne, Volker Lerche, Lauri Lipiäinen, Laura Luosujärvi, Merja Markkanen, Jyrki Molarius, Juha Muonen, Joachim Oberhammer, Peter Ochojski, Antti Peltonen, Tuomas Pensala, Risto J. Puhakka, Mikko Ritala, Tapani Ryhänen, Henrik Rödjegård, Tomi Salo, Anke Sanz-Velasco, Jens Schmid, Pekka Seppälä, Andreas Stamm, Andrey Timofeev, Daniel Tracy, Esa Tuovinen, Albert van den Berg, Brandon Van Leer and Matthias Worgull.

Simone Taylor, Nicky Skinner, Laura Bell and Clarissa Lim at Wiley have been instrumental in pulling it all together.

Last but not least, thanks are due to Anna, Aku, Atte, Kiira and Oliver.

# Introduction

Integrated circuits, microsensors, microfluidics, solar cells, flat-panel displays and optoelectronics rely on microfabrication technologies. Typical dimensions are around 1 micrometer in the plane of the wafer (the range is rather wide, from 0.02 to 100  $\mu\text{m}$ ). Vertical dimensions range from atomic layer thickness (0.1 nm) to hundreds of micrometers, but thicknesses from 0.01 to 10  $\mu\text{m}$  are most typical. Microfabrication is the collection of techniques used to fabricate devices in the micrometer range.

The historical developments of microfabrication-related disciplines are shown below. The invention of the transistor in 1947 sparked a revolution. The transistor was born out of the fusion of radar technology (fast crystal detectors for electromagnetic radiation) and solid state physics. Developments of microfabrication methods enabled the fabrication of many transistors on a single piece of semiconductor and, a few years later, the fabrication of integrated circuits; that is, transistors were connected to each other on the wafer, rather than separated from each other and reconnected on the circuit board.

Microelectronics makes use of the semiconductor properties of silicon, but it is also important that silicon dioxide is such a useful material, for passivating silicon surfaces and protecting silicon during wafer processing. Silicon dioxide is readily formed on silicon, and it is high-quality electrical insulator. In addition to silicon transistors, integrated circuits require multiple levels of metal wiring, to route signals. Silicon microelectronic devices today are characterized by their immense complexity and miniaturization: a billion transistors fit on a chip the size of a fingernail.

Micromechanics makes use of the mechanical properties of silicon. Silicon is extremely strong, and flexible beams, cantilevers and membranes can be made from it. Pressure sensors, resonators, gyroscopes, switches and other mechanical and electromechanical devices utilize the excellent mechanical properties of silicon. Microelectromechanical systems (MEMS) or



**Figure 1.1** Evolution of microtechnology subfields from the 1960s onwards

microsystems, as they are also called, have expanded in every possible direction: microfluidics, microacoustics, biomedical microdevices, DNA microarrays, microreactors and microrockets to name a few. New subfields have emerged: BioMEMS, PowerMEMS, RF MEMS, as shown in Figure 1.1.

Silicon optoelectronic devices can be used as light detectors like diodes and solar cells, but light emitters like lasers and LEDs are made of gallium arsenide and indium phosphide semiconductors. Micro-optics makes use of silicon in another way: silicon, silicon dioxide and silicon nitride are used as waveguides and mirrors. MOEMS, or optical MEMS, utilize silicon in yet another way: silicon can be machined to make tilting mirrors, adjustable gratings and adaptive optical elements. The micromirror of Figure 1.2 takes advantage of silicon's smoothness and flatness for optics and its mechanical strength for tilting.

Microtechnology has evolved into nanotechnology in many respects. Some of the tools are common, like electron beam lithography machines, which were used to draw nanometer-sized structures long before the term nanotechnology was coined. Electron beam and ion beam defined nanostructures are shown in Figure 1.3. Thin films down to atomic layer thicknesses have been grown



**Figure 1.2** Micromirror made of silicon, 1 mm in diameter, is supported by torsion bars 1.2  $\mu\text{m}$  wide and 4  $\mu\text{m}$  thick (detail figure). Reproduced from Greywall *et al.* (2003), Copyright 2003, by permission of IEEE



**Figure 1.3** Electron microscope image of an electron beam defined gold–palladium horizontal nanobridge (courtesy Juha Muhonen, Aalto University) and vertical ion beam patterned nanopillars (courtesy Nikolai Chekurov, Aalto University); 100 nm minimum dimension in both

and deposited in the microfabrication communities for decades. Novel ways of creating nanostructures by self-assembly (self-organization) are being continuously adopted by the microfabrication community as tools to extend the capabilities of microfabrication. The tools of nanotechnology, such as the atomic force microscope (AFM), have been adopted in microfabrication as a way to characterize microstructures.

Solar cells and flat-panel displays can be large in area, but the crucial microstructures are similar to those in microdevices. Hard disks, and hard disk read/write heads especially, are microfabricated devices, with some of the

most demanding feature size and film thickness issues anywhere in microfabrication.

Listed in the references and further reading section at the end of this chapter are a number of books and review articles on microfabrication in diverse disciplines.

## 1.1 Substrates

Silicon is the workhorse of microfabrication. Silicon is a semiconductor, and the power of microelectronics arises strongly from the fact that silicon is available in both p-type (holes as charge carriers) and n-type (electrons as



**Figure 1.4** Silicon wafer, 100 mm in diameter, with about 200 device chips and a dozen test chips on it. Courtesy VTT

charge carriers), and its resistivity can be tailored over a wide range, from 0.001 to 20 000 ohm-cm. Silicon wafers are available in 100, 125, 150, 200 and 300 mm diameters and various thicknesses. Silicon is available in different crystal orientations, and the control of its crystal quality is very advanced.

Bulk silicon wafers (Figure 1.4) are single crystal pieces cut from larger single crystal ingots and polished. Silicon is extremely strong, on a par with steel, and it also retains its elasticity to much higher temperatures than metals. However, single crystalline (also known as monocrystalline) silicon wafers are fragile: once a fracture starts, it immediately develops across the wafer because covalent bonds do not allow dislocation movements.

Many microfabrication disciplines use silicon for convenience: it is available in a wide variety of sizes and resistivities; it is smooth, flat, mechanically strong and fairly cheap. Most of the machinery for microfabrication was originally developed for silicon ICs and newer technologies ride on those developments.

Single crystalline substrates include silicon, quartz (crystalline  $\text{SiO}_2$ ), gallium arsenide ( $\text{GaAs}$ ), silicon carbide ( $\text{SiC}$ ), lithium niobate ( $\text{LiNbO}_3$ ) and sapphire ( $\text{Al}_2\text{O}_3$ ). Polycrystalline silicon is widely used in solar cell production. Amorphous substrates are also common: glass (which is  $\text{SiO}_2$  mixed with metal oxides like  $\text{Na}_2\text{O}$ ), fused silica (pure  $\text{SiO}_2$ ; chemically it is identical to quartz) and alumina ( $\text{Al}_2\text{O}_3$ ) are used in microfluidics, optics and microwave circuits, respectively. Sheets of polyimides, acrylates and many other polymers are also used as substrates. Substrates must be evaluated for available sizes, purities, smoothness, thermal stability,

mechanical strength, etc. Round substrates are compatible with silicon, but square and rectangular ones need special processing because tools for microfabrication are geared for round silicon wafers.

## 1.2 Thin Films

More functionality is built on the substrates by deposition (and further processing) of thin films: various conducting, semiconducting, insulating, transparent, superconducting, catalytic, piezoelectric and other layers are deposited on the substrates. Thin films for microfabrication include a wide variety of elements: metals of common usage include aluminum, copper, tungsten, titanium, nickel, gold and platinum. Metallic alloys and compounds commonly encountered include Al–0.5% Cu, TiW, titanium silicide ( $\text{TiSi}_2$ ), tungsten silicide ( $\text{WSi}_2$ ) and titanium nitride ( $\text{TiN}$ ). The compound is stoichiometric if its composition matches the chemical formula; for example, there is one nitrogen atom for each titanium atom in  $\text{TiN}$ . In practice, however, titanium nitride is more accurately described as  $\text{TiN}_x$ , with the exact value of  $x$  determined by the details of the deposition process. The most common dielectric thin films are silicon dioxide ( $\text{SiO}_2$ ) and silicon nitride ( $\text{Si}_3\text{N}_4$ ). Other dielectrics include aluminum oxide ( $\text{Al}_2\text{O}_3$ ), hafnium dioxide ( $\text{HfO}_2$ ), diamond, aluminum nitride (AlN) and many polymers.

A special case of thin-film deposition is epitaxy: the deposited film registers the crystalline structure of the underlying substrate, and, for example, more single crystal silicon can be deposited on a silicon wafer but with different dopant atoms and different dopant concentration.

The general material structure of a microfabricated device is shown in Figure 1.5. Interfaces between the thin film and bulk, and between films, are important for the stability of structures. Wafers experience a number



**Figure 1.5** Materials and interfaces in a schematic microstructure



**Figure 1.6** Single crystalline, polycrystalline and amorphous materials

of thermal treatments during their fabrication, and various chemical and physical processes are operative at interfaces, for example chemical reactions and diffusion. Sometimes reactions between films are desired, but most often they should be prevented. This can be achieved by adding extra films, known as barriers, in between films.

For example, thin film 1 might present an aluminum conductor and thin film 2 the passivation layer of silicon nitride; or films 1 and 2 are antireflective and scratch-resistant coatings in optics; or film 1 is thin tunnel oxide and film 2 a charge storage layer (as in memory cards).

Surface physical properties like roughness and reflectivity are material and fabrication process dependent. The chemical nature of the surface is important: some surfaces are reactive, others passive. Many surfaces will be covered by native oxide films if left unattended for some time: for example, silicon, aluminum and titanium form surface oxides over a time scale of hours. Water vapor adsorbed on surfaces must be eliminated before the wafers are processed further.

Thick substrates are not immune to thin films: a thin film  $0.1\text{ }\mu\text{m}$  thick may have such a high stress that a silicon wafer  $500\text{ }\mu\text{m}$  thick will be curved by tens of micrometers; or minute iron contamination on the surface will diffuse through a wafer  $500\text{ }\mu\text{m}$  thick during a fairly moderate thermal treatment.

Just like substrate wafers, the grown and deposited thin films can be

- single crystalline
- polycrystalline
- amorphous

as shown in Figure 1.6. During wafer processing, single crystal films usually stay single crystalline, but they can be amorphized by ion bombardment for example. Polycrystalline films experience grain growth, for instance during heat treatments. Foreign atoms, dopants and



**Figure 1.7** Atomic layer deposited aluminum oxide and titanium oxide thin films over silicon waveguide ridges. Courtesy Tapani Alasaarela, Aalto University

alloying atoms do not distribute themselves uniformly in polycrystalline material but aggregate at grain boundaries, which can lead to both beneficial and detrimental effects, depending on the particular materials and process conditions. Amorphous films can stay amorphous or they can crystallize during high-temperature steps, usually into the polycrystalline state.

Sometimes it is enough to deposit films on flat, planar wafers, but most often the films have to extend over steps and into trenches (Figure 1.7). These severe topographies introduce further deposition process-dependent subtleties.

#### Note on notations

---

|                         |                                                                      |
|-------------------------|----------------------------------------------------------------------|
| $<\text{Si}>$           | single crystal material                                              |
| c-Si                    | single crystal material                                              |
| $\alpha\text{-Si/a-Si}$ | amorphous material                                                   |
| a-Si:H                  | amorphous material with imbedded hydrogen (atomic % sometimes given) |

---

---

|                                                     |                                                                                   |
|-----------------------------------------------------|-----------------------------------------------------------------------------------|
| nc-Si                                               | nanocrystalline material (grain size a few nanometers)                            |
| $\mu$ c-Si                                          | microcrystalline material (grain size in the range of tens of nanometers)         |
| mc-Si                                               | multicrystalline (large-grained polycrystalline, grain size $\gg$ film thickness) |
| Al–0.5% Cu                                          | aluminum alloy with 0.5% copper                                                   |
| W <sub>2</sub> N, Si <sub>3</sub> N <sub>4</sub>    | stoichiometric compounds                                                          |
| SiN <sub>x</sub> , $x \approx 0.8$                  | non-stoichiometric compound                                                       |
| W:N                                                 | stuffed material, nitrogen at grain boundaries (non-stoichiometric)               |
| WF <sub>6</sub> (g)                                 | material in gas phase (for WF <sub>6</sub> , boiling point 17 °C)                 |
| WF <sub>6</sub> (l)                                 | material in liquid phase                                                          |
| W (s)                                               | material in solid phase                                                           |
| H <sub>2</sub> SiOF <sub>6</sub> (aq)               | aqueous solution                                                                  |
| SiH <sub>2</sub> (ad)                               | material adsorbed on a surface                                                    |
| Si/SiO <sub>2</sub> /Si <sub>3</sub> N <sub>4</sub> | film stacks are marked with substrate or bottom film on the left                  |

---

### 1.3 Processes

Microfabrication processes consist of four basic operations:

1. High-temperature processes to modify the substrate.
2. Thin-film deposition on the substrate.
3. Patterning of thin films and the substrate.
4. Bonding and layer transfer.

Under each basic operation there are many specific technologies, which are suitable for certain devices, substrates, linewidths or cost levels. Some techniques work well in research, producing a few devices with elaborate features, but completely different methods may be required if those devices are to be mass produced.

Surface preparation and wafer cleaning could be termed the fifth basic operation but, unlike the other four, no permanent structures are made. Surfaces are modified by etching away a few atomic layers, or by depositing one molecular layer. Surface preparation requirements are widely different in different process steps: in wafer bonding it is paramount to eliminate particles that would create voids if left between the wafers, while in oxidation it is important to eliminate metallic contamination and in epitaxy to ensure that native oxides are removed.

High-temperature steps are used to oxidize silicon and to dope silicon by diffusion, and they are crucial for making transistor, diodes and other electronic devices.



**Figure 1.8** Diffusion process: the 2.2 eV barrier can be crossed at ease at 900 °C but the frequency of crossing the 3.5 eV barrier is low. A higher temperature, for example 1050 °C, would be needed for the 3.5 eV barrier to be crossed at ease

Devices like piezoresistive pressure sensors also rely on high-temperature steps, with epitaxy and resistor diffusion as the key processes. High-temperature steps can be simulated extensively, by solving diffusion equations on a computer. The high-temperature regime in micro-fabrication is ca. 900 °C to 1200 °C, temperatures where dopants readily diffuse and the silicon oxidation rate is technically relevant.

Many chemical and physical processes are exponentially temperature dependent. The Arrhenius equation (Equation 1.1) is a very general and very useful description of the rates of thermally activated processes. Activation energy can be illustrated as a jumping process over a barrier (Figure 1.8). According to the Boltzmann distribution, an atom at temperature  $T$  has an excess of energy  $E_a$  with a probability  $\exp(-E_a/kT)$ . Higher temperature leads higher barrier crossing probability:

$$\text{rate} = z(T) e^{(-E_a/kT)}$$

$$k = 1.38 \times 10^{-23} \text{ J/K} = 8.62 \times 10^{-5} \text{ eV/K} \quad (1.1)$$

The magnitude of the pre-exponential factor  $z(T)$  and the activation energy  $E_a$  vary a lot.

In etching reactions, the activation energy is below 1 eV, in polysilicon chemical vapor deposition  $E_a$  is 1.7 eV, in substitutional dopant diffusion it is 3.5–4 eV and in silicon self-diffusion 5 eV. For a silicon etching process with 0.7 eV activation energy, raising the temperature from 20 to 40 °C results in a rate six times higher.

A great many microfabrication processes show Arrhenius-type dependence: etching, resist development, oxidation, epitaxy, chemical vapor deposition (which are chemical processes) are all governed by exponential temperature dependencies, as are diffusion, electromigration and grain growth (which are physical processes).

Low-temperature processes leave metal-to-silicon interfaces stable, and generally 450 °C is regarded as the upper limit for low temperatures. Between 450 and 900 °C there

is a middle range which must be discussed with specific materials and interfaces in mind.

The high-temperature regime is also known as the front-end of the line (FEOL) in the silicon IC business and the low-temperature regime as the back-end of the line (BEOL). But these terms have other meanings as well: for many people in the electronics industry outside silicon wafer fabrication, front-end includes all processing on wafers, and back-end refers to dicing, testing, encapsulation and assembly. We will use the first definition.

Many thin-film steps can be carried out identically on silicon wafers and other substrates; by definition they are layers deposited on top of a substrate. Thin-film steps do not affect the dopant distribution inside silicon; that is, diodes and transistors are unaffected by them.

Processes act on whole wafers – this is the basic premise. The whole wafer is subject to, for instance, diffusion from the gas phase, and metal is evaporated everywhere. Either selected areas must be protected by masks before the process, or else the material must be removed from selected areas afterward, by etching or polishing.

Patterning processes define structures usually in two steps: polymer processing to form an intermediate pattern which then acts as a mask for etching, deposition, ion implantation or other modification of the underlying material; and after the pattern has been transferred to solid material, the intermittent polymer mask is removed.

The main patterning technique in microfabrication is optical lithography, also known as photolithography. In Figure 1.9 photolithography is shown side by side with the thermal imprint/embossing process. In both processes a polymer film is modified locally to create patterns. In lithography, photosensitive polymer film is exposed to UV light, which hardens the polymer by crosslinking (so-called negative resists). In imprinting, a thermoplastic polymer softens upon heating, and a master stamp is pressed against it. The system is allowed to cool down before the stamp is released, and then the polymer retains its imprinted shape.

Many old methods have been successfully scaled down to micrometer and nanometer scales. Etching was once used by knights to engrave their armor with their coats-of-arms, and metal etching with similar acidic solutions can make aluminum patterns in the micrometer range. Once an original microstamp or nanostamp has been made, its replication into polymers is fairly easy (it is actually the detachment that is difficult). Electroplating is likewise easily applicable to nanometer structures. Casting polymers into micromolds is also popular in microfabrication: the elastomeric (rubber-like) material PDMS (poly(dimethyl)siloxane) is a favorite material for simple microfluidic devices.



**Figure 1.9** Optical lithography (left) and thermal imprint (right): UV light crosslinks photosensitive polymer, and unexposed parts are developed away (in so-called negative resists). In imprinting, softened polymer is forced to shape, and after cooling the shape is retained even though the master is removed. In imprinting, some material remains at the bottom and must be cleared by etching

Wafer bonding and layer transfer enable more complex structures to be made. Bonding a wafer on top of a trench turns it into a channel, useful for microfluidics. Bonding more wafers can lead to elaborate fluidic channel patterns, as in the burner of a flame ionization detector, Figure 1.10. Bonding two wafers with electrodes creates a capacitor, for instance for pressure sensing. Bonding two different wafers can also be used simply as a method to create a new kind of a starting wafer, with the best properties of the two wafers combined.

These elementary operations of patterning, modification, deposition and bonding are combined many times over to create devices. Process complexity is often discussed in terms of the number of lithography steps (the



**Figure 1.10** Oxyhydrogen burner of a flame ionization detector by Pyrex–glass/silicon/Pyrex–glass bonding. Reproduced from Zimmermann *et al.* (2002), Copyright © 2002 by permission of Elsevier Science Ltd

term mask levels is also used): five lithography steps are enough for a simple PMOS transistor (late 1960s’ technology, and still used as a student lab process in many universities), and many MEMS, solar cell and flat-panel display devices can be made with two to six photolithography steps, but 32 nm linewidth microprocessors and logic circuits require over 30 patterning steps.

## 1.4 Dimensions

Microfabricated systems have minimum dimensions from 20 nm to 50 µm, depending on the device types. Advanced microprocessors and memories and the read/write heads of hard disk drives must have features <100 nm to be competitive. In Figure 1.11 the transmission electron micrograph shows the cross-section of a 65 nm MOS gate. Many other electronic devices like RF and power transistors make do with 100 nm to 1 µm dimensions. MEMS devices typically have 1–10 µm minimum lines and microfluidic devices might have 50 µm as the smallest feature.

Microfabricated device sizes are compared to physical, chemical and biological small objects in Figure 1.12, with microscopy methods capable of observing them.

Narrow individual lines can be made by a variety of methods; what really counts is resolution, the power to resolve two neighboring structures. It determines the device packing density. Resolution usually gets most attention when microscopic dimensions are discussed, but alignment between structures in different lithography steps is equally important. Alignment is, as a rule of thumb, one-third of minimum linewidth. High resolution but poor alignment can result in inferior device packing density compared to poorer resolution but tighter alignment.

As another rule of thumb, vertical and lateral dimensions of microdevices are similar. If the height-to-width



**Figure 1.11** Transmission electron microscope image of 65 nm MOS transistor gates. Courtesy Young-Chung Wang, FEI Company

or aspect ratio is more than 2:1, special processing is needed, and new phenomena need to be addressed in such three-dimensional devices. Highly 3D structures are used extensively in both deep submicron ICs and in MEMS, for example in the microneedle of Figure 1.13.

Oxide thicknesses below 5 nm are used in CMOS manufacturing as gate oxides and as flash-memory tunnel oxides. Epitaxial layer thicknesses go down to the atomic layer and up to 100 µm in the thick end. There are also self-limiting deposition processes which enable extremely thin films to be made, often at the expense of deposition



**Figure 1.12** Dimensions in the microworld: electromagnetic radiation, natural objects, humanmade devices, microscopy methods and dirt



**Figure 1.13** Silicon microneedle, about 100 μm. Reproduced from Griss and Stemme (2003), Copyright 2003, by permission of IEEE

rate. Chemical vapor deposition (CVD) can be used for anything from a few nanometers to a few micrometers. Sputtering also produces films from 0.5 nm to 5 μm. Spin coating is able to produce films as thin as 100 nm, or as thick as 100 μm. Typical applications include polymer spinning. Electroplating (galvanic deposition) can produce metal layers of almost any thickness, from a few nanometers up to hundreds of micrometers.

But almost every device includes structures with dimensions of about 100 μm. These are needed to interface the microdevices to the outside world: most devices need electrical connections (by a wire-bonding or bumping

process); microfluidic devices must be connected to capillaries or liquid reservoirs; solar cells and power semiconductors must have thick and large metal areas to bring in and take out the high currents involved; and connections to and from optical fibers require structures about the size of fibers, which is also on the order of 100 μm.

## 1.5 Devices

Microfabricated device can be classified in many ways:

- material: silicon, III–V, wide band gap (SiC, diamond), polymer, glass
- integration: monolithic integration, hybrid integration, discrete devices
- active vs. passive: transistor vs. resistor, valve vs. sieve
- interfacing: externally (e.g., sensor) vs. internally (e.g., processor).

The above classifications are based on device material or functionality. In this book we are concentrating on fabrication technologies, so the following classification is more useful:

- volume (bulk) devices
- surface devices
- thin-film devices
- stacked devices.

Power transistors, thyristors, radiation detectors and solar cells (Figure 1.14) are volume devices: currents are generated and transported (vertically) through the wafer, or, alternatively, device structures extend through the wafer, as in many bulk micromechanical devices. The starting



(a)



(b)

**Figure 1.14** Volume devices: (a) passivated emitter, rear locally diffused solar cell, reproduced from Green (1995) by permission of University of New South Wales; (b) n-channel power MOSFET cross-section, reproduced from Yilmaz *et al.* (1991) by permission of IEEE

wafers for volume devices need to be uniform throughout. Patterns are often made on both sides of the wafer and it is important to note that some processes affect both sides of the wafer and some are one sided.

Surface devices make use of the material properties of the substrate but generally only a fraction of wafer thickness is utilized in making the devices. However, device structure or operation is connected with the properties of the substrate. Most ICs fall under this category: namely, MOS and bipolar transistors, photodiodes, CCD image sensors as well as III–V optoelectronic devices.

In silicon CMOS, only the top 5  $\mu\text{m}$  layer of the wafer is used in making the active devices, the remaining 500  $\mu\text{m}$  of wafer thickness being for support: that is, mechanical strength and impurity control. Shown in Figure 1.15 are CMOS polysilicon gates of 0.5  $\mu\text{m}$  width and 0.25  $\mu\text{m}$



**Figure 1.15** Surface devices: 0.5  $\mu\text{m}$  minimum linewidth CMOS in a scanning electron microscope (SEM) view



**Figure 1.16** Curl switch. Reproduced from Oberhamer and Stemme (2004), Copyright 2004, by permission of IEEE

height. Surface devices can have very elaborate 3D structures, like multilevel metallization in logic circuits, which can be 10  $\mu\text{m}$  thick, but this is still only a fraction of wafer thickness; therefore the term surface device applies.

Devices can be built by depositing and patterning thin films on the wafers, where the wafer has no role in device operation. Wafer properties like thermal conductivity or optical transparency may be part of device operation, but another substrate could be used instead. Thin-film transistors (TFTs) are most often fabricated on non-semiconductor substrates of glass, plastic or steel. Devices like RF switches and relays, optical modulators or DNA arrays are often fabricated on silicon wafers for convenience, but they could be fabricated on glass or polymer substrates as well. Figure 1.16 shows a RF switch: the silicon nitride/gold thin film flap curls up because of film stresses, but can be forced flat by electrostatic actuation.

In MEMS devices with free-standing elements, membranes and cantilevers pose certain processing limitations



**Figure 1.17** A microturbine by five-wafer silicon-to-silicon bonding. Reproduced from Lin *et al.* (1999) by permission of IEEE

of their own. Many processes cannot be done on movable, bending structures because they are not stable enough, and therefore the release step is often the very last process step. Similarly, devices with through-wafer holes pose serious limitations in many process steps.

Stacked devices are made by layer transfer and bonding techniques. Two or more wafers are joined together permanently. Devices with vacuum cavities, for example absolute pressure sensors, accelerometers and gyroscopes, are stacked devices made of bonded silicon/glass wafer pairs. Micropumps and valves are typically stacks of many wafers. Figure 1.17 shows a microturbine. It is made by bonding together five wafers. More and more layer transfer and wafer bonding techniques are being developed, and stacked devices of various sorts are expected to be appear, for example GaAs optical devices bonded to Si-based electronics, or MEMS devices bonded to ICs.

## 1.6 MOS Transistor

The MOS transistor is a capacitor with a silicon substrate as the bottom electrode, the gate oxide as the capacitor dielectric and the gate metal as the top electrode (Figure 1.18). The MOS transistor has been the driving force of the microfabrication industries. It is the top device by all measures: number of devices sold, the narrowest linewidths and the thinnest oxides in mass production, as well as dollar value of production. Most equipment for microfabrication was originally designed for MOS IC fabrication, and later adapted to other applications.

Despite the name MOS, the gate electrode is usually made of phosphorus-doped polycrystalline silicon, not of



**Figure 1.18** Schematic of a MOS transistor: gate, source (S) and drain (D) in an active area defined by thick isolation oxide

metal. The basic function of a MOS transistor is to control the flow of electrons from the source to drain by the gate voltage and the field it generates in the channel. In a NMOS transistor, a positive voltage on the gate pulls electrons from the p-type channel to the Si/SiO<sub>2</sub> interface where an overabundance of electrons inverts the region under the gate to n-type, enabling electrons to flow from the n+ source to the n+ drain.

Transistors are isolated electrically from neighboring transistors by SiO<sub>2</sub> field oxide areas. This isolation takes up a lot of area, and therefore the transistor packing density on a chip does not depend on transistor dimensions alone.

Scaling down MOS transistor channel length makes the transistors faster. The other main aspect is area scaling: a factor  $N$  linear dimension scaling reduces the area to  $A/N^2$ . Gate width, gate oxide thickness and source/drain diffusion depths are closely related, and the ratios are more or less unchanged when the transistors are scaled down. As a rough guide, for a gate width of  $L$ , the oxide thickness is  $L/50$ , and the source/drain junction depth is  $L/5$ .

## 1.7 Cleanliness and Yield

Microfabrication takes place under carefully controlled conditions of constantly circulating purified airflow, with temperature, humidity and vibrations also under strict control because micrometer-scale structures would otherwise be destroyed by particles or else the lithography process would be ruined by vibrations or temperature and humidity fluctuations. Personnel in cleanrooms wear gowns to prevent particle emissions (Figure 1.19), and work procedures have been developed to minimize all disturbances.

Wafers are cleaned actively during processing: thousands of liters of ultrapure water (UPW, as known as de-ionized water, DIW) is used for each wafer during



**Figure 1.19** Working in a cleanroom, courtesy VTT

device fabrication. This is the dynamic part of particle cleanliness: the passive part comes from careful selection of materials for cleanroom walls, floors and ceilings, including sealants and paints, as well as selection of materials for design and for process equipment, wafer storage boxes and all associated tools, fixtures and jigs.

Even though extreme care is taken to ensure cleanliness in microprocessing, some devices will always be defective. As the number of process steps increases, yield  $Y$  goes down according to

$$Y = Y_0^n \quad (1.2)$$

where  $Y_0$  is the yield of a single process step and  $n$  is the number of steps. With 100 process steps and 99% yield in each individual step, this results in a 37% yield (representative of a 64 k DRAM chip), but a 99% yield for a 500-step process (representative of a 16 Mbit DRAM) results in a yield <1%. Clearly a 99% yield is not enough for modern memory fabrication. Chip design also affects yield through area:

$$Y = e^{-DA} \quad (1.3)$$

where  $A$  is the chip area and  $D$  is the defect density: making small chips is much easier than making big chips.

Yield has two major components: stochastic and systematic. Stochastic (random) defects are unpredictable occurrences of pinholes in protective films, particle adhesion on the wafer, corrosion of metal lines, etc. Systematic defects come from equipment performance limitations, impurities in starting materials and design errors: two features may be placed so close to each other that they inadvertently touch, or impurities in chemicals do not allow low enough leakage currents.



**Figure 1.20** Silicon wafer with chips, alignment marks and edge exclusion. The scribe line area is reserved for dicing the chips separately

IC wafers contain typically a hundred or hundreds of chips (also called die). This is depicted in Figure 1.20. It also shows the other elements: alignment marks for registering structures on different layers to each other; and scribe lines, the space reserved for dicing the wafer into separate chips after completing the processing. The number of chips on a wafer has remained more or less unchanged for decades because chip size and wafer size have grown in parallel:  $0.2\text{ cm}^2$  chips were made on 100 mm wafers while  $2\text{ cm}^2$  chips are usual on 300 mm wafers. In extreme cases only one chip fits the wafer, for example a solar cell, a thyristor or a position-sensitive radiation detector. Microfluidic separation devices with channels 5 cm long and optical waveguide devices with large radii of curvature can have a handful of devices per wafer. With standard logic chips or micromechanical pressure sensors and accelerometers, thousands can be crammed together to fit a wafer.

## 1.8 Industries

Worldwide, about \$6 billion is spent on silicon wafers annually. These are used to make \$250 billion worth of semiconductor devices, which fuel the \$1200 billion electronics industry. In 2009, about  $10^{19}$  transistors were shipped, approximately 1 billion devices for each and every person on Earth. As recently as 1968 it was one transistor per year per person. Price of course explains a

**Table 1.1** Wafer size and linewidth distribution in 2009

| Wafer size | % of area | % of wafers |
|------------|-----------|-------------|
| 300 mm     | 38        | 15          |
| 200 mm     | 38        | 35          |
| 150 mm     | 17        | 30          |
| <150 mm    | 7         | 20          |

  

| Linewidth distribution in production: |     |
|---------------------------------------|-----|
| $\geq 400$ nm                         | 10% |
| 160–400 nm                            | 15% |
| 80–160 nm                             | 20% |
| 65–80 nm                              | 20% |
| <65 nm                                | 35% |

Data from SEMI and SICAS.

lot: in 1968 transistors cost about 1\$ a piece; in 2009 the cost was less than one-millionth of a cent.

Device density on chips is quadrupling every three years, a trend known as Moore's law.

Scaling has continued relentlessly for the past 50 years. Linewidths were in the 30  $\mu\text{m}$  range in the early 1960s, and they are 30 nm in the year 2010. Lithographic scaling has thus improved packing density by a factor of a million (linear dimension scaling by a factor of 1000 equals area density scaling by a factor of 1 000 000). The number of transistors on a chip has increased from one to one billion, however. The other two main factors have been an increase in chip size and in circuit cleverness: new designs, new fabrication processes and novel materials use less area for the same functionality.

IC technology generations are classified by their linewidths and each new generation has dimensions roughly 30% smaller than the previous one. In 2010 the minimum linewidth in production is about 30 nm, but this represents just a small fraction of all ICs manufactured. The 200 million wafers (approximately 6 square kilometers) are distributed as shown in Table 1.1. Small wafers are used to fabricate specialty ICs, MEMS and power devices. Gigabit memories and latest generation processors are made on 300 mm wafers. When counted as silicon area, the smaller linewidths gain in importance because linewidth scaling has been accompanied by an increase in wafer size, which means that 65 nm devices are fabricated on 300 mm wafers but 1  $\mu\text{m}$  devices on 100 mm wafers. Only 35% of all devices are fabricated with the latest 45 nm and 32 nm generation technologies, but the fabrication facilities for the 65 nm technology generation (commonly called

"wafer fabs" or simply "fabs") are very new: they were mostly built in 2005–2007.

The microsystems/MEMS industry as such does not exist: microsystems are more a technology than an industry, therefore statistics are erratic. Some estimates put microsystem sales at \$8 billion. The flat-panel display industry has sales of close to \$100 billion. The solar cell industry is growing rapidly, with sales of \$30 billion. The magnetic data storage industry is similar in size but consists of a limited number of established players, while the solar industry is mostly populated by start-up companies.

### Note on drawings

The  $z$ -dimension is enlarged relative to the  $x$ - and  $y$ -directions to make drawings easier to read. For example, in bulk micromechanics the diaphragm of a piezoresistive sensor is 20  $\mu\text{m}$ , or 5% of wafer thickness, and the piezoresistor diffusion depth is 5% of diaphragm thickness, that is 1  $\mu\text{m}$ .

## 1.9 Exercises

1. Silicon atom density is  $5 \times 10^{22} \text{ cm}^{-3}$ . If the boron dopant concentration is  $10^{15} \text{ cm}^{-3}$ , how far are the boron atoms from each other?
2. IC chips are getting larger even though the linewidths are scaled down because more functions are integrated on a chip. Calculate the signal line resistance for:
  - (a) aluminum conductors 1  $\mu\text{m}$  thick, 3  $\mu\text{m}$  wide and 500  $\mu\text{m}$  long (resistivity 3  $\mu\text{ohm}\cdot\text{cm}$ );
  - (b) copper conductors 0.3  $\mu\text{m}$  wide, 0.5  $\mu\text{m}$  thick and 1 mm long (resistivity 2  $\mu\text{ohm}\cdot\text{cm}$ ).
3. Silicon dioxide can sustain a 10 MV/cm electric field. Calculate oxide thickness regimes for:
  - (a) CMOS ICs where the operating voltages are 1–5 V;
  - (b) capillary electrophoresis (CE) microfluidic chips where 500–5000 V is used.
4. DRAM memory is a capacitor. How many electrons are stored in a DRAM capacitor if it has an area of 1  $\mu\text{m}^2$  and a silicon dioxide dielectric 5 nm thick?
5. A micromechanical pressure sensor consists of a 1  $\text{mm}^2$  movable silicon electrode and 1  $\mu\text{m}$  air gap as the dielectric. What is the capacitance? If femtofarad capacitance change can be measured, what is the corresponding displacement of the movable capacitor electrode?
6. Aluminum wires do not tolerate current densities higher than 1 MA/cm<sup>2</sup>. What are the maximum currents that can run in micrometer aluminum wiring?

7. If a silicon etching reaction has an activation energy of 0.7 eV, and the etch rate at 80 °C is 1.3 µm/min, how much is it at 100 °C?
8. What defect densities are typical of modern IC production?
9. CMOS linewidths have been scaled down steadily by 30% every three years. In the year 2010 linewidths were in the range of 32 nm. When will the linewidths equal atomic dimensions?

## References and Related Reading

- Elwenspoek, M. (2001) **Mechanical microsensors**, Springer.
- Geschke, O., H. Klank and P. Telleman (2004) **Microsystem Engineering of Lab-on-a-chip Devices**, 2nd edn, Wiley-VCH Verlag GmbH.
- Green, A.M. (1995) **Silicon Solar Cells**, University of New South Wales Press.
- Greywall, D.S. *et al.* (2003) Crystalline silicon tilting mirrors for optical cross-connect switches, *J. Microelectromech. Syst.*, **12**, 708–712.
- Griss, P. and G. Stemme (2003) Side-opened out-of-plane microneedles for microfluidic transdermal liquid transfer, *J. Microelectromech. Syst.*, **12**, 296.
- Hierold, C. (2004) From micro- to nanosystems: mechanical sensors go nano, *J. Micromech. Microeng.*, **14**, S1–S11.
- Lin, C.-C. *et al.* (1999) Fabrication and characterization of a micro turbine/bearing rig, Proceedings of MEMS'99, p. 529.
- Maluf, N. and K. Williams (2004) **An Introduction to Microelectromechanical Systems Engineering**, 2nd ed, Artech House.
- Motamedi, M.E. (2005) **MOEMS: Micro-Opto-Electro-Mechanical Systems**, SPIE.
- Oberhammer, J. and G. Stemme (2004) Design and fabrication aspects of an S-shaped film actuator based DC to RF MEMS switch, *J. Microelectromech. Syst.*, **13**, 421–428.
- Plummer, J.D., M.D. Deal and P.B. Griffin (2000) **Silicon VLSI Technology**, Prentice Hall.
- Poortmans, J. and V. Arkhipov (eds.) (2006) **Thin Film Solar Cells**, Wiley.
- Rebeiz, G. (2002) **RF MEMS: Theory, Design and Technology**, John Wiley & Sons, Inc.
- Sikanen, T. *et al.* (2010) Microchip technology in mass spectrometry, *Mass Spectrom. Rev.*, **29**, 351–391.
- Solgaard, O. (2008) **Photonic Microsystems: Micro and Nanotechnology Applied to Optical Devices and Systems**, Springer.
- Tanaka, Y. *et al.* (2007) Biological cells on microchips: new technologies and applications, *Biosens. Bioelectron.*, **23**, 449–458.
- Tsuchizawa, T. *et al.* (2005) Microphotronics devices based on silicon microfabrication technology, *IEEE J. Sel. Top. Quantum Electron.*, **11**, 232–240.
- Weibel, D.B., W.R. DiLuzio and G.M. Whitesides (2007) Microfabrication meets microbiology, *Nature Rev. Microbiol.*, **5**, 209–218.
- Yilmaz, H. *et al.* (1991) 2.5 million cell/in<sup>2</sup>, low voltage DMOS FET technology, Proceedings of IEEE APEC 1991, p. 513.
- Ziaie, B. *et al.* (2004) Hard and soft micromachining for BioMEMS: review of techniques and examples of applications in microfluidics and drug delivery, *Adv. Drug Delivery Rev.*, **56**, 145–172.
- Zimmermann, S., A. Vogel and J. Müller (2002) Miniaturized flame ionization detector for gas chromatography, *Sens. Actuators*, **B83**, 285–289.

# Micrometrology and Materials Characterization

When micrometer lines are patterned, and nanometer films grown, measurement tools have to be available to characterize those processes. The measurement spot must sometimes be as small as possible, to obtain information on details of microstructures, but sometimes we can use blanket test wafers for ease of measurement. Often we want to scan and map large areas for uniformity and to get statistical data, and then speed of measurement is paramount. Data is needed on electrical properties like resistivity, optical properties like refractive index, chemical properties like bond identification, physical properties like crystal structure, and mechanical properties like size and shape. Amazingly accurate and precise measurements can be done, but it should be borne in mind that routine monitoring is needed together with complex, highly detailed measurements.

## 2.1 Microscopy and Visualization

Optical microscopy resolution is roughly a micrometer, i.e. similar to wavelength. This is useful for practically all MEMS and solar cell and display devices, while for modern ICs and nanotechnology it is hopelessly inadequate. It is a rough and ready method for checking, for example, thin-film delamination, color changes due to corrosion, scratches from mechanical handling and similar major faults, and these can be observed even if the details of the structures are smaller than the resolution. Defects can also be made larger by etching or deposition, and in this way very small pinhole defects in thin films, for instance, can be visually observed.

Optical microscope performance has been extended by many simple and ingenious ways over the years. In the near-field scanning optical microscopes (NSOM) light is

collected through a small optical aperture, for example 50 nm, and therefore resolution is determined by this size, not by wavelength. In confocal microscopes light is collected only from the focal depth, and vertical information can be obtained. In dark-field microscopes illumination is from the side, and detection is from above as usual. This gives enhanced information about edges and steps on the wafer. Fluorescence microscopes measure fluorescence, and in the best case the signal is generated by a single molecule, which is definitely smaller than the optical resolution of the microscope. Fluorescence is therefore able to detect very small amounts of material, for example photoresist residues.

Electron microscopes can resolve much smaller features: the scanning electron microscope (SEM) can see 5 nm objects. In top view imaging the SEM is very much like the optical microscope, but with two advantages: higher magnification and better depth of field (Figure 2.1). Its real power, however, comes into play in tilted and cross-sectional views (Figure 2.2). Cross-sectional images reveal information like the sidewall angle of microstructures, and sidewall surface quality, or film thickness variation over a step. SEM resolution is sufficient for film thickness determination in the hundreds of nanometers range, but below that its accuracy is inferior to other techniques.

The transmission electron microscope (TEM) provides the ultimate image resolution, down to atomic imaging. The high-resolution TEM (HRTEM) has a special advantage in calibration: the lattice spacing of atoms can be used as accurate internal calibration standards (Figure 2.3). The drawbacks of the TEM are manifold: sample preparation is difficult and time consuming, and only a very small area is imaged. In the TEM each picture takes



**Figure 2.1** Optical microscope and scanning electron microscope (SEM) views of a spiral antenna bolometer. Courtesy Leif Grönberg, VTT



**Figure 2.2** SEM: left, a  $5\text{ }\mu\text{m}$  minimum linewidth stress test structure, courtesy Lauri Sainiemi Aalto University; right,  $100\text{ nm}$  nanobridges, courtesy Nikolai Chekurov Aalto University

hours, but minutes with the SEM and seconds with an optical microscope.

Scanning probe microscopy (SPM) is a collection of techniques which use a sharp nanoneedle tip on a soft cantilever to scan the sample (Figure 2.4), while the interaction of the needle tip with the surface is measured. This interaction can be physical distance, which is sensed as a tunneling current (between the metal surface and metal tip in the scanning tunneling microscope, STM), or magnetic, capacitive, chemical or many other interactions. The STM can have atomic resolution. It is a research tool for surface science, but its relative, the atomic force microscope (AFM), which has nanometer resolution, is a

favorite metrology tool in microfabrication. In the AFM the force in question is the repulsive force between atoms at very short distances. Because this repulsive force is universal, many materials can be used for tips, and silicon and silicon nitride are standard choices, with silicon for cantilevers. Either constant force mode or constant height mode can be used. The former is slow and suitable for hard materials, the latter better for rapid scanning. Cantilever deflection is measured (usually optically) to  $0.01\text{ nm}$ , and the piezoelectric stage covers a scan area of  $5 \times 5\text{ }\mu\text{m}$  typically. The AFM is popular as a surface characterization tool but it is also very good for measuring



**Figure 2.3** High-resolution transmission electron micrograph of single crystal silicon/amorphous silicon dioxide/polycrystalline silicon structure. Reproduced from Buchanan (1999) by permission of IBM

lateral and vertical dimensions in the micrometer and nanometer range.

## 2.2 Lateral and Vertical Dimensions

For device lateral dimensions, 10% deviation is usually accepted as a fabrication tolerance. Measurement precision should be 10% of that variation, that is 10 nm for 1  $\mu\text{m}$  structures. For 100 nm structures this translates to 1 nm, which is very difficult indeed to obtain.

Linewidth is often known as a critical dimension (CD). All major CD measurements rely on scanning, where an

optical slit or aperture, a laser or electron beam spot or a mechanical stylus is scanned over the line. Linewidth measurement depends on edge detection in all these methods. This has both inherent and microstructure-related limitations. The signal from the edge is not a sharp delta function, even in the case of a perfectly vertical sidewall. Beam spot and mechanical stylus alike have finite dimensions which blur line edge detection. Both electromechanical stylus systems and AFMs can be used. The former have a tip radius of curvature of 1–10  $\mu\text{m}$ , the latter 1–10 nm. Stylus instruments are often called profilometers, but, as shown in Figure 2.5, they do not necessarily provide any information about profiles!

Film thicknesses range from one atomic layer to hundreds of micrometers, and no single method can cover such a thickness range. Conductive and dielectric films must often be measured by different techniques even though scanning probe methods are quite universal: a step is formed by etching and a probe tip scans the step. Z-scale precision can be 1 nm or even down to 1 Å (0.1 nm), but in most practical cases surface roughness sets the lower limit for step height/film thickness measurement.

Commonly used optical thickness measuring methods are ellipsometry and reflectometry. In ellipsometry the change in polarization is measured and the amplitude ratio of two different polarizations is computed. Film thickness can be obtained when the optical constants of the film are known. Ellipsometry can be used to measure those constants too, but then additional measurements are needed: for example, multiple angles or multiple wavelengths must be used. Ellipsometry works best for film thicknesses below the measurement wavelength (633 nm, most commonly) because periodicity makes interpretation



**Figure 2.4** (a) AFM cantilever tip scanning the topography; (b) AFM micrograph of a single electron transistor, reproduced from Timofeev *et al.* (2009) by permission of A Timofeev. Copyright 2009 by The American Physical Society



**Figure 2.5** Scanning linewidth measurement, with scan profile shown below. Feature dimensions can be measured but no information is obtained on sidewall angle because the measurement instrument tip/spot itself has non-zero size

of thick-film thicknesses difficult. The low end is just a few nanometers, but for very thin films uncertainty is introduced because optical constants are not really constants, but depend on film thickness.

In reflectometry a wavelength scan is made (e.g., 300–800 nm) and this is fitted to a reflection model. Reflectometry can measure films from a few nanometers up to 50  $\mu\text{m}$  thick.

X-ray reflection (XRR) can be used to measure very thin films. Unlike optical methods, XRR is insensitive to changes in film refractive index. Measurement time, however, is minutes, or even hours, compared to seconds for optical tools. XRR is amenable only to surfaces with a roughness no more than a nanometer and thickness less than a hundred nanometers. There are many such applications in microtechnology: for example, CMOS gate oxides on single crystal silicon are thin and extremely smooth.

## 2.3 Optical Techniques

In addition to microscopy, optics has many attractive qualities for monitoring microfabrication: optical measurements are mostly quick, non-contact and accurate. Lasers can be focused down to micrometer spots, and rapidly scanned over the full wafer, proving both local and global information. Surface roughness, particles and defects can be measured by scatterometry (Figure 2.6). Laser light is scattered from the wafer surface, and multiple detectors measure the reflected and scattered light. Directly reflected (specular) light gives



**Figure 2.6** Scatterometry measurement with three different detectors

information on surface steps and mounds, backscattered light contains information on subsurface defects and diffuse scattering (by the dark-field detector) is indicative of surface particles and microroughness. A general way to obtain more information in optical measurements is to use two different wavelengths (in laser systems) or to scan wavelengths, and to use multiple incidence and detection angles. In scatterometry varying the incidence and detection angles reveals differences between pits and particles for example.

Optical measurements provide information on layer thicknesses (ellipsometry, reflectometry, interferometry) and on optical properties of thin films and surface layers (refractive index, absorption coefficient). They can measure step heights (optical profilometry), surface planarity and wafer curvature (interferometry), and membrane vibrations (interferometry, vibrometry, holography). Interferometry is useful for both static measurements of membrane planarity and cantilever curvature, as well as dynamic properties. Optical probing can be used to measure vibration fields under real operating conditions without disturbing the device. In this way, the mechanical vibration field can be measured, revealing, for example, unwanted vibration modes or energy leakage from a resonator. Optical measurements can thus provide valuable feedback to refine simulations and the design of vibrating MEMS structures (Figure 2.7).

Optical measurements can provide a wealth of information on materials properties. For example, amorphous and crystalline phases have different optical properties. Porous material optical properties can be gauged by modeling the material as a composite of solid and air (or liquid, if the pores are filled with toluene, for example). Multilayer thin films can be measured, but increased reliance on modeling makes the results somewhat inaccurate.



**Figure 2.7** Scanning laser interferometer measurement of a vibration mode in a MEMS resonator: (a) light power image of the resonator; (b) relative vibration amplitude field at 68 MHz; (c) instantaneous surface deflection. Courtesy Kimmo Kokkonen and Lauri Lipiäinen, Aalto University

Optical excitation is used to generate charge carriers in semiconductors, and measuring these provides information about electrical properties. Laser pulse generated local heating causes acoustic waves in solids which will be reflected from material interfaces, which is utilized in thickness measurements. Similarly, thermal pulses provide information on changes in material properties, for example amorphization caused by ion implantation.

## 2.4 Electrical Measurements

A number of electrical measurements can be used to characterize substrates and deposited thin films: namely, resistivity, conductivity type, carrier density and lifetime, mobility, contact resistance or barrier height. Resistivity ( $\rho$ ) depends on charge, carrier mobility ( $\mu$ ) and carrier number ( $n$ ), as follows:

$$\rho = \frac{1}{nq\mu} \quad (2.1)$$

This equation has a catch, however, because mobility ( $\mu$ ) decreases with increasing carrier concentration ( $n$ ) and experimental data is needed to correct for this. Carrier concentration is measured either by mercury probe capacitance–voltage (Hg-CV) measurement or by surface photovoltage (SPV). In the Hg-CV method the capacitance of the depletion region in a mercury semiconductor Schottky diode is measured. This method is accurate but slow and there is the danger of mercury contamination, although no mechanical damage.

Resistivity is an important property of conducting layers but resistance is the property that can be measured easily. For a rectangular piece of conducting material, resistance is given by

$$R = \frac{\rho L}{WT} \quad (2.2)$$

where  $\rho$  is the resistivity,  $L$  the length,  $T$  the thickness and  $W$  the width. If we consider a square piece of conductor,  $L = W$ , we can then define sheet resistance,  $R_s$ , as

$$R_s \equiv \rho / T \quad (2.3)$$

$R_s$  is in units of ohms, but it is usually denoted by ohms/square to emphasize the concept of sheet resistance. The resistance of a conductor line can now be easily calculated by breaking down the conductor into  $n$  squares:  $R = nR_s$  (Figure 2.8). Sheet resistance of doped semiconductor layers will be discussed in Chapter 14.

Measurement of  $R_s$  can be done in several ways: direct measurement necessitates the fabrication of a metal line (lithography and etching steps), but the result follows easily:

$$R_s = R/n = V/nI \quad (2.4)$$

The four-point probe (4PP) method uses two outer probe needles to feed current through the sample and two inner needles to measure voltage, as shown in Figure 2.9.

In the case of a large and thick substrate, resistivity is given by

$$\rho = 2\pi s \frac{V}{I} \quad (2.5)$$



**Figure 2.8** Conceptualizing metal line resistance: four squares with sheet resistance  $R_s$  in series gives resistance as  $R = 4R_s$



**Figure 2.9** The four-point probe measurement set-up with identically spaced needles

In the case of a thin film of thickness  $T$  on an insulating substrate (e.g., Al film on  $\text{SiO}_2$ ), the resistivity is given by

$$\rho = \frac{\pi}{\ln 2} \frac{V}{I} T = 4.53 \frac{V}{I} T \quad \text{or} \quad R_s = 4.53 \frac{V}{I} \quad (2.6)$$

When the sample size is 15 times larger than the probe spacing, resistivity is correct to within 1%. For smaller samples, geometric correction factors need to be applied.

Thickness has to be measured independently, for example by profilometer. Alternatively, sheet resistance can be used to calculate thickness once thin-film resistivity is known (bulk values cannot be used for thin-film resistivities, as will be discussed in Chapter 5).

If depth information of carrier concentration is needed, spreading resistance profiling (SRP) can be used (Figure 2.10). Angle lapping is used to produce a bevel, and resistance is probed along the bevel down to the substrate. An accurate bevel angle is needed to convert data into a thickness profile, and the layer-by-layer resistance data must be converted to resistivity by rather

complex models. SRP is difficult and expensive but it provides data that is not available from other methods.

Many electrical test structures have been devised for conductive films and doping structures. These are fast measurements, ideally suited for wafer mapping: sheet resistance measurement requires four pads for probe needles, and electrical linewidth measurement similarly. Contact chains make do with two pads but generally four-pad measurements, with separate feeds for current and voltage measurements, eliminate contact resistance parasitics. A combined six-pad structure (Figure 2.11) can be used to measure both sheet resistance  $R_s$  and electrical linewidth.

In the six-terminal structure, current  $I_c$  is driven through terminals 2 and 3 and voltage drop  $V_c$  is measured across terminals 5 and 6. Sheet resistance  $R_s$  is given by

$$R_s = \frac{\pi}{\ln 2} \frac{V_c}{I_c} \quad (2.7)$$

Bridge resistance  $R_b$  is the voltage drop between terminals 4 and 5,  $V_{45}$ , divided by current  $I_{13}$  driven through terminals 1 and 3. Linewidth  $W$  is then given

$$W = \frac{R_s \cdot L}{R_b} \quad (2.8)$$

Assuming a rectangular cross-sectional profile usually holds fairly well for plasma-etched lines. Line length  $L$  is fixed on the photomask, and if  $L \gg W$ , minor inaccuracies in lithography (e.g., corner rounding) can be



**Figure 2.10** Spreading resistance profiling (SRP): two-point electrical measurements on a beveled sample. Resistance and carrier concentration profiles of  $n^-/n^+$  epi on  $p^+$  substrate. Courtesy Atte Haapalinna, Okemtic



**Figure 2.11** Electrical six-terminal test structure for sheet resistance and linewidth

ignored. Diffusion can be measured similarly, but the assumption of the profile needs to be accounted for.

Electrical test structures are implemented in test chips on the wafer, or, alternatively, they can be imbedded in the scribe lines between chips. Test structures for wafer fabrication measurements can thus be discarded after the fabrication is completed. This saves area because the dicing saw requires a margin of about a hundred micrometers between the chips anyway, as shown in Figure 1.20.

## 2.5 Physical and Chemical Analyses

One special aspect of materials in microfabrication is their extreme purity: impurities are specified even in parts per trillion (ppt,  $10^{-12}$  relative abundance). This is a relief in some cases because background signals are very low, but if the impurities themselves need to be measured, then we face some tough challenges.

Elemental concentrations are often needed: nitrogen in TiN thin films (50% for stoichiometric film), copper in aluminum (0.5–4% Cu), phosphorus in oxide (5% by weight), boron in silicon wafers ( $1 \times 10^{16} \text{ cm}^{-3}$ ), oxygen in silicon (10–20 ppma, parts per million atoms), sodium impurities in tungsten sputtering target (ppb, parts per billion), or iron in silicon (ppt). These different concentration levels mean that different analytical methods must be employed.

Elemental detection can be accomplished with many methods quite readily, but quantification is often difficult. Comparative results are often presented: treatments A, B, C vs. reference sample. Treatments might represent new plasma CVD oxide processes and thermal oxide used as a reference, or different thermal treatments are compared to the as-deposited sample as a reference.

Measurement and characterization of microstructures differ from macroscopic structures and bulk materials in many respects. Small analysis areas and volumes limit the available methods and sensitivities. The signal-to-noise

**Table 2.1** Physical and chemical analytical methods

| Incoming species | Outgoing species and method name                                                                                                                                                                                                    |
|------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Ions             | Ions: SIMS (Secondary Ion Mass Spectrometry)<br>Ions: RBS (Rutherford Backscattering)                                                                                                                                               |
| Electrons        | Electrons: AES (Auger Electron Spectroscopy)<br>X-rays: EDX (Energy Dispersive X-ray spectroscopy)                                                                                                                                  |
| X-ray photons    | Electrons: XPS (X-ray Photoelectron Spectroscopy)<br>X-rays: XRD (X-ray Diffraction)<br>X-rays: XRF (X-ray Fluorescence)<br>X-rays: TXRF (Total reflection X-ray Fluorescence)<br>X-rays: GXRR (Grazing incidence X-ray Reflection) |

ratio (S/N) is proportional to the square root of the number of atoms probed:

$$S/N \propto \sqrt{\text{number of atoms probed}} \propto R \cdot \sqrt{z} \quad (2.9)$$

where  $R$  is the probing radius and  $z$  is the depth of analysis (cylinder volume  $\propto R^2 z$ ). Equation 2.9 explains why no single method can fulfill all microcharacterization needs.

A large number of techniques are available for chemical and physical analyses. We can excite the sample with electrons, photons or ions, and measure electrons, photons or ions ejected from the sample. Table 2.1 collects some of the main methods, classified according to excitation/emission species. The list is by no means exhaustive; you can find many more techniques in a dedicated book.

Unfortunately, most methods are limited to certain elements only. The only exception is SIMS, which can detect every element from hydrogen to uranium. Auger spectroscopy cannot detect hydrogen, helium or lithium due to a fundamental limitation of the three electron Auger processes, but all other elements are detectable. X-ray methods are insensitive to light elements: depending on X-ray window design, boron ( $m = 11$ ) can be detected, but sometimes fluorine ( $m = 19$ ) and sodium ( $m = 23$ ) are the lightest detectable elements.

### 2.5.1 Analysis area and depth

Optical methods cover a wide range of analysis areas: sometimes micrometer laser spots or large areas from lamp illumination, or scanning a wafer by laser beams.

X-rays cannot be focused, and X-ray methods require typically rather large areas, in the millimeter range. Ion beams can be focused to submicron spots in focused ion beam (FIB) equipment, but most applications use broad beams, in the millimeter range. Electron beam excitation is the most accurate, and when combined with electron detection, the smallest possible analysis volume is obtained.

Analysis depths vary a lot. Some methods are sensitive to a few atomic layers only, while others extend to a micrometer or so. Electrons have very small mean free paths in solids, so only electrons from the top few nanometers can escape and be detected, therefore electron spectroscopies are very surface sensitive. X-rays penetrate deep into matter, and the volume probed is much larger. Ion penetration depth is easily varied by changing energy. In all cases the incident angle can be used to modify the penetration depth: glancing angle incidence leads to surface sensitive analysis when total reflection takes place.

Diffusion depths and film thicknesses are often on the order of a micrometer. Analysis techniques that extend this deep would be very useful, but only a few exist. Rutherford backscattering spectrometry (RBS) has a typical analysis depth of around a micron (for helium ion energy of 2 MeV). Electron beams also penetrate by about a micrometer into solids, and generate a signal from that depth. A combination of surface erosion and surface sensitive analysis is commonly adopted for top micrometer analysis: the ion beam removes material and the newly formed surface is probed by AES or SIMS.

Analysis must be done not only on microfabricated structures themselves but also on defects and non-idealities which are smaller than the device dimensions. If the chemical composition or structure of defects has to be identified, this is even more demanding than analysis

of regular microstructures. Contaminants often come in quantities too small for even the best analytical methods. Vacancies and other point defects are smaller than the resolution of even the best microscopic methods. Indirect methods must be used, like carrier lifetime measurements (defects act as traps for charge carriers), positron annihilation spectroscopy (PAS) (positron lifetime is longer in a material with voids) or photoluminescence (identification of defects by their recombination radiation) or Raman spectroscopy (structural defects, implant damage, local stresses shifting photon energy).

### 2.5.2 Secondary Ion Mass Spectrometry (SIMS)

In SIMS the surface to be analyzed is bombarded by ions which knock atoms from the surface. Some of these are ionized, and subsequently mass analyzed, giving their identity. SIMS is thus a surface sensitive technique, but another important SIMS application is depth profiling: an ion beam erodes the surface and layers beneath the surface become available to analysis. When the erosion rate is known, SIMS data provides information on atomic concentrations as a function of depth (Figure 2.12).

SIMS measurement is slow and expensive, but it is the accepted standard for dopant depth distribution measurement (even though we are most often interested in electrically active dopants, whereas SIMS only counts atoms). SIMS offers nanometer depth resolution and  $10^6$  dynamic range.

### 2.5.3 Rutherford Backscattering (RBS)

RBS is based on elastic recoil collisions. Helium ions (alpha particles) penetrate matter and slow down, but one ion



**Figure 2.12** SIMS data of arsenic ion depth distribution inside silicon. Two different ion implantation energies have been used: (a) arsenic depth distribution immediately after implantation; (b) after  $1050^\circ\text{C}$ , 10 s heat treatment. Reproduced from Plummer and Griffin (2001) by permission of IEEE



**Figure 2.13** RBS spectrum of Si/Ta/Cu (20 nm/100 nm) sample. Even though the tantalum is beneath the copper, its signal is at a higher energy because tantalum is so much heavier. Courtesy Jaakko Saarilahti, VTT

in a million experiences  $180^\circ$  elastic recoil and bounces back toward the surface, slows down on the way, and finally emerges from the solid to reach the detector. All these steps can be handled by calculations and RBS is a quantitative method. Elastic recoil from heavy atoms is more pronounced, and RBS is ideally suited for heavy atoms like arsenic, tantalum, copper and tungsten.

Signal energy is sometimes confusing because it depends not only on the depth from where it originates, but also on the mass of the atom that caused backscattering. In Figure 2.13 a tantalum layer beneath copper has been measured by RBS. The silicon signal is weak because silicon is a light atom and it is deep beneath the copper and tantalum. Copper is the topmost layer, but because it is lighter than tantalum, its peak is lower in energy.

RBS detectivity depends on the matrix: elements lighter than the matrix are not readily detectable. Oxygen and nitrogen analyses on top of silicon wafers are therefore difficult for RBS. Mass separation between neighboring elements is poor in RBS, therefore silicon, aluminum and phosphorus cannot readily be resolved. RBS detection limits are around  $10^{20} \text{ cm}^{-3}$  (0.5%), but with heavy elements RBS can detect concentrations even down to  $10^{17} \text{ cm}^{-3}$  (0.001%).

#### 2.5.4 Auger Electron Spectroscopy (AES)

In Auger measurement an electron beam (3–5 keV) hits the surface and an inner core electron is ejected. The vacancy left behind is filled by an electron from an outer shell, which gives off excess energy during transition. Another outer shell electron receives this energy, and escapes. The energy of the escaping electron is determined by the atomic energy levels and can be uniquely identified. This third electron is called the Auger electron. The escape depth of low-energy Auger electrons is on the order of a nanometer only, which makes AES a truly surface sensitive technique. In Figure 2.14 residues from TiW etching are analyzed by AES: etching is incomplete



**Figure 2.14** Auger analysis of silicon dioxide surface: (a) evidence of titanium and tungsten residues; (b) after sputter etching has removed 100 Å (10 nm) surface layer, the sample has been reanalyzed and found to be free of Ti and W. Reproduced from Schaffner (2000) by permission of IEEE

because both tungsten and titanium are detected on the surface (and carbon, but carbon is a very common contaminant). Etching 100 Å (10 nm) or more removes all traces of tungsten and titanium.

With the aid of sample erosion techniques (similar to SIMS), AES can be transformed into a depth profiling technique: after surface analysis, sputtering removes some material, and Auger measurement of the newly formed surface is carried out. This is continued until the desired sample depth is probed. An example of an Auger depth profile is shown in Figure 7.11.

#### 2.5.5 Energy Dispersive X-ray spectroscopy (EDX)

When the electron energy is high enough to ionize an atom, X-rays can be generated in the recombination process. Electron beams can be focused down to 5 nm spots and devices can be probed for localized analysis. The electron beam diverges as it interacts with the matter. The scattering of electrons spreads the beam over a volume much larger than the beam spot on the surface, as



**Figure 2.15** Finely focused electron beam hits the sample surface, and low-energy secondary electrons escape from the surface only, but backscattered and inelastically scattered electrons contribute to the signal deep inside the sample. Reproduced from Schaffner (2000) by permission of IEEE

shown in the Figure 2.15. Auger electrons which originate at the very surface are unaffected by this spreading, but X-rays and backscattered electrons generated deep inside the sample can escape and reach the detector.

The radius of the X-ray signal is given by

$$R_x(\mu\text{m}) = \frac{0.04 \cdot V^{1.75}}{\rho} \quad (2.10)$$

where acceleration voltage  $V$  is in kilovolts and density  $\rho$  in  $\text{g}/\text{cm}^3$ . Analysis radius  $R$  is given by

$$R = \sqrt{R_x^2 + d^2} \quad (2.11)$$

where  $d$  is beam spot diameter.

This analytical radius of EDX (also known as electron microprobe analysis, EMPA) can be orders of magnitude bigger than the electron beam spot size.

EDX can detect elemental concentrations at the 1% level. Examples of suitable analytical tasks include phosphorus determination in CVD oxide (5% wt typical) or copper concentration in aluminum film (0.5–4% Cu typical). EDX is most often connected to a SEM which is used to image the area of interest first and then subjected to elemental analysis by EDX. If the sample is made thin, on the order of 100 nm, electron scattering effects can be eliminated. This is utilized in the TEM and electron energy loss spectroscopy (EELS).

### 2.5.6 X-ray Photoelectron Spectroscopy (XPS)

XPS is closely related to AES in two ways: low-energy electrons are analyzed, and because their escape depth

is so small, the method is surface sensitive, but XPS excitation is by X-rays. This has important ramifications for the analysis area: X-ray spots are fairly large, in the hundred micrometer range, and large areas are needed for analysis.

Primary X-rays (a few kilovolts) eject electrons from the sample. The energy of the ejected electrons is related to their binding energy, and this enables not only elemental identification but also chemical bond identification. The electron energy is slightly different depending on bonding, so for example C–O, C–F and C–C bonds can be distinguished. The other name for XPS, ESCA (Electron Spectroscopy for Chemical Analysis) emphasizes this important feature of XPS. Figure 2.16 shows a XPS analysis of platinum etching in  $\text{Cl}_2/\text{CO}$ . Initially the surface is of course covered with platinum, with a trace amount of carbon (from atmospheric contamination). After partial etching some chlorine is detected, indicating that chlorine is active in the surface reaction. After overetching, all traces of platinum and chlorine disappear and only silicon and oxygen from the underlying  $\text{SiO}_2$  film are detected.

### 2.5.7 X-ray Diffraction (XRD)

Structural information such as crystal orientation, texture and grain size is important in a number of cases. The resistivity of metal film can increase by an order of magnitude upon phase change, and the initial state, amorphous and polycrystalline, affects thin-film polycrystalline silicon final grain size after annealing. Piezoelectric properties are very sensitive to crystal structure, and



**Figure 2.16** XPS analysis of platinum etching. Reproduced from Kim and Woo (1998), Copyright 1998, American Chemical Society



**Figure 2.17** Aluminum nitride thin film on aluminum thin film on silicon wafer – crystallinity checked by XRD. Reproduced from Naik *et al.* (1999) by permission of ECS, The Electrochemical Society

XRD is used to measure crystallinity (Figure 2.17). XRD provides structural information of aluminum nitride crystal. Auger could indicate that aluminum and nitrogen are present in 1:1 ratio, but it does not indicate whether AlN crystals are present: amorphous AlN also has 1:1 elemental ratio. TEM also provides similar crystallinity information, but TEM analysis area is only tens of nanometers, whereas XRD gives an average over hundreds of micrometers. TEM requires tedious sample preparation while XRD needs none.

### 2.5.8 X-ray Fluorescence (XRF)

Each element gives off characteristic X-rays, as discussed above for the electron beam-excited case. But X-ray excitation can be used, too. This is a very simple yet powerful analytical technique throughout science and engineering. It is a bulk analytical method, with limited use in microfabrication. Its variant, total reflection XRF (TXRF), however, is very important. X-rays hit the wafer at a grazing angle (almost parallel to the wafer surface) and penetrate only a few atomic layers. Therefore the fluorescence signal they excite concerns surface atoms only. TXRF can measure surface impurities at the  $10^{10} \text{ cm}^{-2}$  level.

### 2.5.9 Other methods

Thermal desorption spectroscopy (TDS; also known as Temperature Programmed Desorption) analyses desorption products upon sample heating using a mass spectrometer. TDS can identify and quantify the desorbed species. If the material from wafer surface can be dissolved in acid, atomic adsorption spectroscopy (AAS), and many other standard methods of chemical analysis become available.

## 2.6 Practical Issues with Micrometrology

Many analytical methods can produce accurate results only at the expense of great time and effort: for instance, TEM can image individual atoms but the analysis time is days (it consists mostly of tedious sample preparation but also complicated analysis). TEM analysis costs about \$1000–2000 per sample if bought as a service.

Monitoring must be preferably so fast that whole wafer mapping can be performed to check for uniformity. Mapping measurement also requires that the analytical equipment can handle whole wafers. Many optical and electrical measurements are suitable for mapping, but most physical and chemical methods require the wafer to be broken into ca. square centimeter pieces.

Uniformity can be defined across the wafer (within-wafer non-uniformity, WIWNU), wafer-to-wafer (WTWNU) and lot-to-lot. The definitions for uniformity are given by

$$U = \frac{\max - \min}{2 \times \text{average}} \quad \text{and} \quad U = \frac{\max - \min}{\max + \min} \quad (2.12)$$

The former is applied when five measurements are taken, one at the wafer centre and four at  $90^\circ$  from each other at half-radius; the latter when the four points are at wafer edges.

A uniformity of 5% with 6 mm edge exclusion was long accepted as a typical process performance nowadays 3% with 3 mm exclusion might be required. Some processes are inherently better, for example thermal oxidation, ALD and photoresist spinning routinely produce better than 1% uniformity. On the other hand, CMP is notoriously non-uniform, with 10% considered as good uniformity.

### 2.6.1 Contact vs. non-contact measurements

Measurements can be divided into two categories: contact and non-contact (non-invasive). Both laser-induced thermal pulses and a four-point probe can be used to monitor ion implant dose, but 4PP makes physical contact with the wafer by metal (tungsten) needles, and the wafer is deemed contaminated – it is not allowed to continue to high-temperature steps. Linewidth measurement by SEM is non-contact, as opposed to the stylus profiler or AFM, which make contact with the wafer. Because full wafers are analyzed in a linewidth SEM, only top view pictures are possible, and no cross-sectional information can be obtained.

### 2.6.2 Blanket vs. patterned wafer analysis

Both in R&D and in production, analytical methods are bound by a number of practical constraints related to the number of data points, measurement spot size and speed of measurement. Blanket wafer measurements are simple to perform and many basic studies in film deposition, diffusion, ion implantation, polishing or bonding can be done on blanket wafers, but in many cases structured wafers are indispensable. Linewidths and spacings need to be identical to product wafers, but they must be more accessible to probing, by optical or electron beams, or by mechanical probes. Test structure size needs to be matched to design complexity: if the product chip has 1 million contact holes, how should this be extrapolated from a 1000-hole test structure?

### 2.6.3 Destructive vs. non-destructive analysis

Equipment and labor costs of measurements range from a few cents to a few dollars per wafer, but if the measurement is wafer destructive, its cost is at least the wafer cost, or \$10–100 per sample. Many physical analysis systems require samples of about 1 cm<sup>2</sup>, and the wafer has to be broken, for example for RBS, AES or XPS analysis. If cross-sectional information is needed, wafer breakage is also then necessary.

### 2.6.4 Standards and reference materials

Calibration standards (with traceability to NIST, the National Institute of Standards and Technology) and reference materials (which are supplier certified) are available for all major wafer-level measurements: namely, film thickness and step height, dimensions, electrical resistivity and particles. Reference materials are sufficient for daily work but they must be calibrated regularly against traceable standards. The standards and references are silicon wafers with dedicated test patterns for the quantities in question. A single wafer can provide a series of standards, like different resistivity windows or step heights.

### 2.6.5 Failure analysis and reverse engineering

Analytical methods are needed not only during fabrication, but also after wafer processing has been completed. When circuits are found to malfunction, either in testing or after field return, the causes must be identified. Hard errors, that is consistent failures, are much easier to locate and understand than soft errors, that is intermittent failures, which may take place only under certain operating conditions (e.g., above a certain temperature or frequency). As in wafer-level analysis, non-destructive methods are tried first and destructive ones only afterward.

In reverse engineering, a chip is “disassembled” step by step, and the structures, materials and functions are recorded (see Figure 28.6 for IC metallization stripped of all dielectric films). This is practiced for example for competitive intelligence or patent infringement examination. Methods like electron beam-induced current (EBIC) and voltage contrast SEM can be used to probe the electrical functions of a circuit.

## 2.7 Measurements Everywhere

No microfabrication process works without measurements, either in the research phase, or in manufacturing. Measurement needs change when processes evolve from novelty to routine, but there are some constants, such as the need to be able to measure ever finer details: for example, not only linewidth, but also line edge roughness; and not only film thickness, but also interface quality; and not only membrane deflection statically but at megahertz frequency.

It is not unusual to find that no analytical method is able to do the job: either the quantity involved or the analysis area is too small. Sometimes it is possible to use devices themselves as measuring instruments. Device

performance degradation is attributed to minute effects which are not amenable to direct physical measurements. MOS transistors are sensitive to metal contamination at levels below analytical detection limits (in the  $10^9 \text{ cm}^{-3}$  range). Microscopic vacuum cavities are created by wafer bonding or deposition, and no pressure gauge is small enough to probe these cavities, but the mechanical quality factor  $Q$  of microfabricated mechanical resonators in the cavities is indicative of cavity pressure.

## 2.8 Exercises

1. The sheet resistance of typical aluminum metallization is 0.03 ohms/square. What is the aluminum thickness?
2. The resistance of copper lines  $200 \mu\text{m}$  long was measured as 40 ohms. From the copper deposition process we know that the thickness is 300 nm. What is the linewidth?
3. AFM scan area is  $1 \times 1 \mu\text{m}$ , which corresponds to  $512 \times 512$  pixels. What must the AFM tip radius be so that resolution is tip limited?
4. Estimate the analytical radius of an electron microprobe (EMPA).
5. Can RBS be used to measure dopant profiles?
6. If an electron beam is focused to a  $15 \text{ nm}$  spot, and at least 100 Auger events (electrons) must be collected to get a signal, what is the detection limit of an Auger microprobe?
7. SIMS raw data consists of ion counts vs. erosion time. How can these be converted to concentration vs. depth data?
8. What is the acceleration voltage of an atomic resolution TEM?

## References and Related Reading

Bubert, H. and H. Jenett (eds) (2002) **Surface and Thin Film Analysis**, Wiley-VCH Verlag GmbH.

- Buchanan, M. (1999) Scaling the gate dielectric: materials, integration and reliability, *IBM J. Res. Dev.*, **43**, 245.
- Bunday, B.D. *et al.* (2007) Value-added metrology, *IEEE Trans. Semicond. Manuf.*, **20**, 266–277.
- Diebold, A.C. (1994) Materials and failure analysis methods and systems used in the development of and manufacture of silicon integrated circuits, *J. Vac. Sci. Technol.*, **B12**, 2768.
- Holmgren, O. *et al.* (2009) Analysis of vibration modes in a micromechanical square-plate resonator, *J. Micromech. Microeng.*, **19**, 015028.
- Kim, J.H. and S.I. Woo (1998) Chemical dry etching of platinum using  $\text{Cl}_2/\text{CO}$  gas mixture, *Chem. Mater.*, **10**, 3576–3582.
- Naik, R.S. *et al.* (1999) Low-temperature deposition of highly textured aluminum nitride by direct current magnetron sputtering for applications in thin-film resonators, *J. Electrochem. Soc.*, **146**, 691–696.
- Plummer, J.D. and P.B. Griffin (2001) Material and process limits in silicon VLSI technology, *Proc. IEEE*, **89**, 240.
- Runyan, W.R. and T.J. Schaffner (1998) **Semiconductor Measurements and Instrumentation**, McGraw-Hill.
- Schaffner, T.J. (2000) Semiconductor characterization and analytical technology, *Proc. IEEE*, **88**, 416.
- Schroder, D.K. (1998) **Semiconductor Material and Device Characterization**, 2nd edn, John Wiley & Sons, Inc.
- Tiggelaar, R.M. *et al.* (2009) Stability of thin platinum films implemented in high-temperature microdevices, *Sens. Actuators*, **A152**, 39–47.
- Timofeev, A.V. *et al.* (2009) Electronic refrigeration at the quantum limit, *Phys. Rev. Lett.*, **102**, 200801.
- Yu, G.-Q., S.-H. Lee and J.-J. Lee (2002) Effects of thermal annealing on amorphous carbon nitride films by r.f. PECVD, *Diamond Relat. Mater.*, **11**, 1633–1637.

# 3

## Simulation of Microfabrication Processes

Microfabrication processes consist of tens or hundreds of steps which take weeks or months to complete, and development cycles easily become very long. Simulation is one way to shorten this time. Simulation accuracy is strongly dependent on the details of the process to be simulated, and even a simple simulator can be extremely valuable if it saves enough experimentation time and effort. Simulators can provide meaningful trend data and comparisons between different process options, even though accuracy might be less than perfect. Simulators can be used to explore possibilities and narrow down options before experimental work is begun. Simulation can provide information which is not experimentally available or difficult to measure. For instance, there is no dopant profiling method with sub-10 nm resolution in both vertical and lateral directions, and therefore simulation is the de facto method for 2D dopant distribution analysis.

### 3.1 Simulator Types

There are two breeds of process simulators: integrated packages that can be used to simulate the whole fabrication process with many different steps in sequence; and dedicated simulators for specific process steps. Dedicated simulators are available for almost all processes, ranging from ion implantation damage production to lithography defect modeling, to crystal structure prediction of deposited films. A silicon anisotropic wet etching simulator can use crystal lattice information (bond energies and densities) to predict which atoms will be removed by the etchant. Dedicated simulators are more detailed, more accurate and more computationally intensive. A basic principles diffusion simulation would start from lattice parameters, interatomic potentials, vacancy production and annihilation rates and atom–defect interactions, and provide diffusion data as an output. Integrated simulation packages use simpler models, but offer

seamless stitching of different process steps into whole processes. For instance, macroscopic phenomenological diffusion models based on Fick's equations are used. Bulk silicon process steps, that is high-temperature steps that affect the dopant distribution inside silicon, epitaxy, diffusion, implantation and oxidation, can be analyzed by solving the relevant diffusion equations.

Etching and deposition produce the topography on a wafer. This build-up of topography is difficult to simulate because it involves physics (e.g., plasma generation) and chemistry (e.g., surface reactions) at many time scales and physical sizes. Film deposition simulators depend on atom arrival angles which are not physical constants like diffusivities, but parameters sensitive to experimental conditions. Etching reactions are complex interactions between the chemical contributions (spontaneous etching, free energy considerations) and physical processes (e.g., ion bombardment-enhanced desorption). Topography process simulators are usually semiempirical: some important model parameters are extracted from experiments without fundamental physical validation.

Even though simulation is fast, building a simulator is slow and tedious. It is not possible to build simulators for all possible new materials, processes and devices because calibration data needs to be available, and it is readily available only for those materials, processes and devices that are widely studied and used. In this sense the predictive power of process simulation remains poor.

### 3.2 Levels of Simulation

Process simulation, device simulation and circuit simulation together are termed TCAD, for technology CAD, in contrast to the more established ECAD, or electronic simulations, which involves logic and system simulations. Process simulation deals with physical structures, namely atoms and their distributions; device simulation deals with



**Figure 3.1** Levels of simulators

currents and potentials in devices; and circuit simulation is used to study larger circuit blocks (Figure 3.1). The dopant concentrations produced by a process simulator are used as input for the device simulator, and device simulator results form the starting material for circuit simulation.

Circuit simulation is the most advanced of the three, and process simulation is the least developed. Device simulators for CMOS today are predictive because CMOS device physics is well understood. Of course, continuous scaling to smaller linewidths means that new phenomena must be implemented regularly into process and device simulators.

MEMS simulators must deal with a lot more varied phenomena (e.g., electrical, mechanical, thermal, magnetic, optical, etc., phenomena). MEMS fabrication simulators include silicon etching simulators, which take mask design as input and produce silicon 3D shape as output. Most device simulators, however, are FEM simulators which use drawn geometry as a starting point, instead of process simulator output.

### 3.3 The 1D Simulators

A 1D simulator treats matter as layers, and the simulation outputs are layer thicknesses and dopant distributions in the vertical direction (Figure 3.2). Such 1D simulation has been used since the 1970s when SUPREM from Stanford University emerged. Diffusion, ion implantation, oxidation and epitaxy are treated. Two additional, non-physical process steps are included, namely film deposition and etching, but these are just geometrical steps, like “add 500 nm of undoped oxide on silicon” or “remove top 50 nm of silicon by etching.” These steps are needed for more realistic models of surfaces and interfaces, but they do not reveal anything about the deposition or etching processes.

Over the years more layers and more realistic models have been added to 1D simulators. For instance, some simulators can handle the oxidation and doping of polycrystalline silicon. Polycrystalline materials require more inputs than single crystals, for example grain size and texture, and assumptions of grain boundary diffusion vs. bulk diffusion among others. ICECREM (from Fraunhofer Institute IISB, <http://www.iisb.fraunhofer.de>) is an advanced 1D simulator. It can simulate the following processes:

- epitaxy
- oxidation
- diffusion
- ion implantation.

Additionally, the following steps can be used to generate geometry:

1. Deposition of undoped oxide films (protective capping layers).
2. Deposition of doped oxide films (diffusion sources).
3. Etching (removal of oxide and silicon).



**Figure 3.2** Cross-section of an npn bipolar transistor and its 1D simulation model of dopant concentrations along the cut line



**Figure 3.3** A 1D simulation (ICECREM) of (a) arsenic (50 keV energy) and boron (50 keV) ion implantation into silicon, dose  $10^{15}$  ions/cm $^2$ , and (b) dry oxidation of BF $_2^+$  implanted silicon (20 keV,  $10^{15}$  ions/cm $^2$ )

ICECREM models can account for a number of important real-life effects like high phosphorus concentration in diffusion, implantation through the oxide and oxidation-enhanced diffusion (OED). These features will be discussed in the relevant chapters on basic processes. ICECREM output consists of diffusion profiles, oxide thicknesses, sheet resistances and junction depths. Sensitivity analysis can be carried out to study both process parameter and model parameter changes.

A typical simulator input file begins with the substrate definition (crystal orientation  $<100>$  or  $<111>$ , doping type and level/resistivity). The grid is defined next: that is, the depth into silicon that needs to be simulated (e.g., down to 5 μm), division of matter into layers (e.g., 10 nm thick) and concentrations that need to be calculated (concentrations vary between  $10^{21}$  cm $^{-3}$ , limited by dopant solubility, and  $10^{15}$  cm $^{-3}$ , which is a typical substrate doping level). Process steps are then defined in sequence, followed by output commands. Model parameters can be modified by the user, but default parameters are good for initial simulations and novice users. Simulation examples in the chapters on epitaxy, oxidation, diffusion and implantation are discussed using ICECREM.

A 1D simulator output can visualize dopant depth distributions and film thicknesses, as shown in Figure 3.3. There are two important points in concentration curves: the maximum concentration and its depth, and the junction depth where substrate dopant level and diffused dopant levels match. Junction depths range from tens of nanometers to many micrometers.

### 3.4 The 2D Simulators

A 2D simulation is indispensable because a 1D simulation of more slices cannot predict 2D profiles. This is illustrated in Figure 3.4 for a simple 5 μm linewidth MOS transistor: the 1D simulation produces accurate doping profiles and oxide thicknesses along lines A, B and D, but it cannot produce any meaningful results for C (where implanted dopant spreads laterally under the mask) or E (where oxidation occurs under a protective nitride layer). The 1D results for A, B and D are valid for 5 μm transistors, but as the device is scaled to smaller linewidths, more and more 2D effects arise, and a 2D simulator will be needed for profiles along B and D as well.

The 2D diffusion simulators take into account oxide and polysilicon structures on top of silicon, and produce dopant profiles that extend under the gate and masking



**Figure 3.4** Vertical profiles of a MOS transistor: film thicknesses and dopant distributions along lines A, B and D can be simulated with a 1D simulator, but profiles along C and E require 2D simulation



**Figure 3.5** A 2D simulation: dopant concentration profiles of a CMOS transistor with a gate length of 25 nm. Reproduced from Taur *et al.* (1998) by permission of IEEE

layer, as shown in Figure 3.5, for example. The structures above the silicon surface are usually not simulated but simply drawn geometries. They are tools to add realism, not unlike the deposition and etching steps in 1D simulators.

The 2D simulators are about cross-sections of structures where 1D ones were only about layers. A 2D simulation enables topography simulation: in a 1D simulation it is not possible to study the deposition of films over patterns, neither are cross-sections relevant. Continuum simulator SAMPLE and atomistic simulator SIMBAD are compared in Figure 3.6 for metal sputtering. In both cases metal is deposited over a trench, and thickness of metal on sidewalls and trench bottom is predicted. A continuum simulator which predicts thicknesses from atom arrival angle distribution and surface mobility considerations is useful but the result is geometric only. An atomistic simulator can show crystal structure, and reveal differences in film quality on horizontal and vertical walls.

2D simulation is computationally intensive, and 2D simulators usually have a 1D simulation tool imbedded in them, for quick and easy initial 1D tests. Savings in computational time can be orders of magnitude. A grid, or simulation mesh, in a 1D simulator is regular and easy to generate, but in 2D simulators mesh generation is much more difficult. In order to reduce computational time, a dense grid is used where abrupt changes are expected, and a sparse grid where gradients are not steep. Instead of a rectangular grid, triangular grids are often employed.

### 3.5 The 3D Simulators

When scaling to smaller and smaller dimensions continues, 3D simulation becomes mandatory. Narrow but long transistors can be simulated by a 2D simulator, but a narrow and short transistor with similar dimensions in both the  $x$ - and  $y$ -directions really needs 3D treatment. Again, complexity and time of simulation increase drastically in the 2D case. If a layer 1  $\mu\text{m}$  deep is simulated in a 1D simulator with 10 nm grid spacing, 100 layers need to be calculated. A similar grid size in 2D simulation requires  $100 \times 100$  squares ( $10^4$ ), and in 3D simulation it equals  $10^6$  cubes. Roughly speaking, if 1D simulation takes seconds, 2D takes minutes and 3D hours.

However, a 10 nm grid is no good for 3D simulation because this simulation is used especially for 100 nm devices and the like, and perhaps a 1 nm grid is used. But the question is not just computational: additional physical models need to be developed, because more and more atomistic models must be used, and continuum approximation fails due to the atomic nature of matter. In order to take advantage of 3D process simulation, 3D device simulators must be used, just like 2D process simulators feed into 2D device simulators. Advanced device simulators must similarly account for the fact that electric current is not a continuous variable but a stream of charged packets of  $1.6 \times 10^{-19} \text{ C}$ .



**Figure 3.6** Continuum and atomistic metal step coverage simulation: (a) SAMPLE 2D simulation of metal deposition 0.5  $\mu\text{m}$  thick into a trench 1  $\mu\text{m}$  wide and 1  $\mu\text{m}$  deep; only film thickness is simulated; (b) SIMBAD: sputtered tungsten into a trench with prediction of columnar grain structure. Reproduced from Dew *et al.* (1991) by permission of AIP

### 3.6 Other Simulation Needs in Microfabrication

Simulation needs extend from the atomic scale to reactor scale. At a meter scale, simulation is needed to predict gas flows and temperature distributions inside a reactor; at the centimeter level it can be used to study uniformity across the wafer; at the micrometer scale, simulation is needed to predict doping profiles and step coverage; and atomic-level simulation is needed to understand the details of film growth and diffusion. For thin-film deposition such a simulator would produce a relation between process parameters and film properties. At the present time, such multiscale simulation remains a faraway goal.

Optical lithography simulation is a self-contained regime in process simulation. Its main output is a resist profile with optics, resist photochemistry and development as the main modules. It will be discussed in Chapter 10. Tool simulation will be briefly discussed in Chapter 31 and deposition simulation in Chapter 7.

### 3.7 Exercises

These exercises require a simulator, for example ICECREM or SUPREM, but any 1D simulator will do. ICECREM can be obtained free of charge from: [icecrem@iisb.fraunhofer.de](mailto:icecrem@iisb.fraunhofer.de).

- How much difference is there between the oxidation rates of boron-, phosphorus- and arsenic-doped wafers when all have identical doping levels?

- Use a 1D simulator to find the step height resulting from oxidation of phosphorus-doped silicon depicted below:



- How does thermal oxide thickness on a phosphorus-doped wafer change with dopant concentration?
- What energy must phosphorus ions have to pass through 200 nm oxide?
- Compare your simulator to other simulators for ion implantation of arsenic into silicon:

| $E$<br>(keV) | Dose<br>( $\text{cm}^{-2}$ ) | Simulator | Range<br>( $\text{\AA}$ ) | Peak<br>concentration<br>( $\text{cm}^{-3}$ ) |
|--------------|------------------------------|-----------|---------------------------|-----------------------------------------------|
| 40           | $1.4 \times 10^{13}$         | TRIM      | 332                       | $6.0 \times 10^{17}$                          |
| 40           | $1.4 \times 10^{13}$         | PREDICT   | 268                       | $3.8 \times 10^{18}$                          |
| 40           | $1.4 \times 10^{13}$         | CUSTOM    | 270                       | $4.6 \times 10^{18}$                          |
| 90           | $7.2 \times 10^{14}$         | TRIM      | 636                       | $8.6 \times 10^{18}$                          |
| 90           | $7.2 \times 10^{14}$         | PREDICT   | 603                       | $9.9 \times 10^{19}$                          |
| 90           | $7.2 \times 10^{14}$         | CUSTOM    | 530                       | $1.2 \times 10^{20}$                          |

- Calculate the oxide thickness for 10, 100, 1000 and 10 000 minutes of oxidation at 1100  $^{\circ}\text{C}$ .

## References and Related Reading

- Bechtold, T., E.B. Rudnyi and J.G. Korvink (2005) Dynamic electro-thermal simulation of microsystems – a review, *J. Micromech. Microeng.*, **15**, R17–R31.
- Dabrowski, J., E.R. Weber and J. Dabrowski (eds) (2004) **Predictive Simulation of Semiconductor Processing: Status and Challenges**, Springer.
- Dew, S.K. *et al.* (1991) Modeling bias sputter planarization of metal films using ballistic deposition simulation, *J. Vac. Sci. Technol.*, **A9**, 519–523.
- Ho, C.P. *et al.* (1983) VLSI process modeling – SUPREM III, *IEEE Trans. Electron Devices*, **30**, 1438.
- Law, M. (2002) Process modeling for future technologies, *IBM J. Res. Dev.*, **46**, 339–346.
- Lorentz, J. *et al.* (1996) Three-dimensional process simulation, *Microelectron. Eng.*, **34**, 85.
- Taur, Y. *et al.* (1998) 25 nm CMOS design considerations, International Electron Devices Meeting, p. 789.

# 4

# Silicon

The first transistor was made of polycrystalline germanium in 1947. Electron mobility in germanium is higher than in silicon, and germanium was readily available. However, silicon, with its larger band gap, was favoured because of smaller leakage currents. Initially there was no consensus whether single crystalline or polycrystalline material was better, but the rapid development of single crystal silicon growth in the 1950's soon dominated the market. The real breakthrough came when the beneficial role of silicon dioxide was recognized: it provided passivation of semiconductor surfaces, and it resulted in improved transistor reliability. When it was further noticed that the  $\text{SiO}_2$  layer could act as a diffusion mask and as isolation for integrated metallization, the way was open for the invention of the integrated circuit.

Steady increases in wafer size from half an inch have continued up to this day, with wafers of 300 mm diameter now in production, and the first samples of 450 mm diameter being readied in 2009. For other substrates smaller sizes are still widely used and when new materials like silicon carbide ( $\text{SiC}$ ) or gallium nitride ( $\text{GaN}$ ) are introduced, crystal growth and wafer yield are so low that only small ingots and small wafers make sense.

Silicon is the basis of microelectronics, with over 90% of the semiconductor market. MEMS are largely based on silicon wafers, and silicon is used everywhere in microtechnology because of its availability and excellent properties, even though sometimes only a fraction of those properties are utilized. Over 80% of solar cells are made of crystalline silicon today, but thin-film cells are rapidly gaining popularity. Flat-panel displays are made on glass panels but the transistors controlling the pixels in active matrix LCDs are made of polycrystalline silicon.

## Note on Terminology

Single crystal silicon (SCS for short) is known by many names. Some people prefer the term monocrystalline silicon, though the abbreviation mc-Si refers to

multicrystalline silicon! As short forms, c-Si is used for crystalline silicon and a-Si for amorphous silicon, while polycrystalline silicon is known simply as poly. In the solar cell industry crystalline silicon is sometimes called X-Si.

## 4.1 Silicon Material Properties

Silicon material properties are an excellent compromise between performance and stability. An energy gap of 1.12 eV makes silicon devices less prone to thermal noise than germanium devices with a 0.67 eV gap. Silicon is transparent in the infrared (above 1.1  $\mu\text{m}$  wavelength) which means that it can be used as an optical material at 1.55  $\mu\text{m}$  telecom wavelength applications.

Silicon source gases can be purified to extremely high degrees of purity. Silicon resistivity ranges from 10 000 Ohm-cm (dopant concentration  $10^{12} \text{ cm}^{-3}$ ) to 0.0001 Ohm-cm (dopant concentrations and  $10^{21} \text{ cm}^{-3}$ ), Figure 4.1. There are slight differences between different dopants but the limits are the same: solid solubility sets the upper dopant concentration limit and materials purity (of silicon itself and of materials used in the fabrication process) sets the lower limit.

Silicon is strong: its Young's modulus (elastic modulus) can be as high as 190 GPa for the <111> orientation and less for other orientations, but it is at least 132 GPa. Young's modulus ( $E$ ) is the ratio of stress ( $\sigma$ ) to strain ( $\epsilon$ , elongation):

$$E = \sigma/\epsilon \quad (4.1)$$

It is important in assessing for example thermal stresses that the wafers experience during processing.

The excellent mechanical properties of silicon have been utilized since the 1960s in micromechanical pressure and force sensors which rely on bending beams and diaphragms. Piezoresistive detection depends on doped



**Figure 4.1** Silicon resistivity can be varied over eight orders of magnitude by doping

resistors, and capacitive detection relies on the ability to micromachine shallow air gaps on the order of 1  $\mu\text{m}$ . Both are standard processes in silicon microfabrication.

Silicon is as strong as steel but this fact is disguised by two factors: most of us do not have experience of steel plates 0.5 mm thick; and, since silicon is brittle, the breakage pattern is therefore different from the ductile fracture of multicrystalline steel. Silicon is almost ideally elastic (obeying Hooke's law) up to the yield point, and after that a catastrophic failure takes place. Most metals and oxides obey Hooke's law initially, but then deform plastically before fracture. The yield strength of silicon is 7 GPa at room temperature, different steel varieties have yield strengths of 2–4 GPa, while the yield strength of aluminum is only 0.17 GPa. Fracture strain (elongation before breakage) for single crystal silicon is 4%, an exceptionally large value. More silicon properties are listed in Table 4.1.

## 4.2 Silicon Crystal Growth

### 4.2.1 Purification of silicon

Silicon wafer manufacture is a multistep process which begins with sand purification and ends up with final polishing and defect inspection. Silica sand,  $\text{SiO}_2$ , is reduced by carbon, yielding 98% pure silicon according to the reaction



This material is known as metallurgical grade silicon (MGS). MGS is converted to gaseous trichlorosilane  $\text{SiHCl}_3$  (boiling point 31.8 °C) according to the reaction



The main impurities in MGS (Fe, B, P) react to form  $\text{FeCl}_3$ ,  $\text{BCl}_3$  and  $\text{PCl}_3/\text{PCl}_5$ . Trichlorosilane gas is purified by distillation during which  $\text{FeCl}_3$  and  $\text{PCl}_3/\text{PCl}_5$  are removed as high-boiling-point contaminations and  $\text{BCl}_3$  as low-boiling-point contamination. Trichlorosilane is converted back to solid silicon by the decomposition of  $\text{SiHCl}_3$  on hot silicon rods by the reaction



This material is of extremely high purity and is known as electronic grade silicon (EGS). It is a polycrystalline material which will be used as a source material in single crystal growth.

### 4.2.2 Czochralski (CZ) crystal growth

In CZ growth a silica crucible ( $\text{SiO}_2$ ) is filled with undoped electronic grade polysilicon. Dopant is introduced by adding pieces of doped silicon (for low doping concentration) or elemental dopants P, B, Sb or As (for high doping concentration). The crucible is heated in a vacuum

**Table 4.1** Properties of silicon at 300 K

| <b>Structural and mechanical</b>                           |                                                                                                                                  |                                                                                                                                                                 |
|------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Atomic weight                                              | 28.09                                                                                                                            |                                                                                                                                                                 |
| Atoms, total ( $\text{cm}^{-3}$ )                          | $4.995 \times 10^{22}$                                                                                                           |                                                                                                                                                                 |
| Crystal structure                                          | diamond (FCC)                                                                                                                    |                                                                                                                                                                 |
| Lattice constant ( $\text{\AA}$ )                          | 5.43                                                                                                                             |                                                                                                                                                                 |
| Density ( $\text{g/cm}^3$ )                                | 2.33                                                                                                                             |                                                                                                                                                                 |
| Density of surface atoms ( $\text{cm}^{-2}$ )              | (100) $6.78 \times 10^{14}$<br>(110) $9.59 \times 10^{14}$<br>(111) $7.83 \times 10^{14}$                                        | (111) crystal orientation                                                                                                                                       |
| Young's modulus (GPa)                                      | 190                                                                                                                              |                                                                                                                                                                 |
| Yield strength (GPa)                                       | 7                                                                                                                                |                                                                                                                                                                 |
| Fracture strain                                            | 4%                                                                                                                               |                                                                                                                                                                 |
| Poisson ratio, $\nu$                                       | 0.27                                                                                                                             |                                                                                                                                                                 |
| Knoop hardness ( $\text{kg/mm}^2$ )                        | 850                                                                                                                              |                                                                                                                                                                 |
| <b>Electrical</b>                                          |                                                                                                                                  |                                                                                                                                                                 |
| Energy gap (eV)                                            | 1.12                                                                                                                             |                                                                                                                                                                 |
| Intrinsic carrier concentration ( $\text{cm}^{-3}$ )       | $1.38 \times 10^{10}$                                                                                                            |                                                                                                                                                                 |
| Intrinsic resistivity (ohm-cm)                             | $2.3 \times 10^5$                                                                                                                |                                                                                                                                                                 |
| Dielectric constant                                        | 11.8                                                                                                                             |                                                                                                                                                                 |
| Intrinsic Debye length (nm)                                | 24                                                                                                                               |                                                                                                                                                                 |
| Mobility (drift) ( $\text{cm}^2/\text{V}\cdot\text{s}$ )   | 1500 (electrons)<br>475 (holes)                                                                                                  |                                                                                                                                                                 |
| Temperature coeff. of resistivity ( $\text{K}^{-1}$ )      | 0.0017                                                                                                                           |                                                                                                                                                                 |
| <b>Thermal</b>                                             |                                                                                                                                  |                                                                                                                                                                 |
| Coefficient of thermal expansion ( $^\circ\text{C}^{-1}$ ) | $2.6 \times 10^{-6}$                                                                                                             |                                                                                                                                                                 |
| Melting point ( $^\circ\text{C}$ )                         | 1421                                                                                                                             |                                                                                                                                                                 |
| Specific heat (J/kg-K)                                     | 700                                                                                                                              |                                                                                                                                                                 |
| Thermal conductivity (W/m-K)                               | 150                                                                                                                              |                                                                                                                                                                 |
| Thermal diffusivity                                        | $0.8 \text{ cm}^2/\text{s}$                                                                                                      |                                                                                                                                                                 |
| <b>Optical</b>                                             |                                                                                                                                  |                                                                                                                                                                 |
| Index of refraction                                        | 3.42                                                                                                                             | $\lambda = 632 \text{ nm}$                                                                                                                                      |
|                                                            | 3.48                                                                                                                             | $\lambda = 1550 \text{ nm}$                                                                                                                                     |
| Energy gap wavelength                                      | $1.1 \mu\text{m}$                                                                                                                | (transparent at larger wavelengths)                                                                                                                             |
| Absorption                                                 | $>10^6 \text{ cm}^{-1}$<br>$10^5 \text{ cm}^{-1}$<br>$10^4 \text{ cm}^{-1}$<br>$10^3 \text{ cm}^{-1}$<br>$<0.01 \text{ cm}^{-1}$ | $\lambda = 200\text{--}360 \text{ nm}$<br>$\lambda = 420 \text{ nm}$<br>$\lambda = 550 \text{ nm}$<br>$\lambda = 800 \text{ nm}$<br>$\lambda = 1550 \text{ nm}$ |

to about  $1420^\circ\text{C}$  to melt the silicon. A single crystalline seed of known crystal orientation is dipped into the silicon melt. Silicon solidifies into a crystal structure determined by the seed crystal. Figure 4.2 describes the crystal puller. A thin neck is quickly drawn to suppress defects which develop because of the high-temperature difference between the seed and the melt, and then the pulling rate is lowered. Both the ingot and the crucible are rotated (in opposite directions), ingot rotation at about 20 rpm, crucible rotation at about 10 rpm. Molten silicon is very fluid,

and the rotations induce flows in the crucible; temperature differences are also a source of convection currents. Much engineering has gone into understanding the behavior of molten silicon.

Ingot diameter is determined by ingot pull rate. The pulling rate is limited by heat conduction away from the crystallization interface, and therefore large-diameter ingots have lower pulling rates. While an ingot of 100 mm diameter can be pulled at 1.4 mm/min, the 200 mm ingot pull rate is 0.8 mm/min. Typical pulling time is 30 hours



**Figure 4.2** Czochralski crystal pulling: silicon (melting point 1421 °C) solidifies as it is pulled up. Pulling speed (mm/min), ingot rotation speed (20 rpm) and crucible counter-rotation speed (10 rpm) together determine ingot diameter

(corresponding to ingot lengths of 2 m), not including heating and cooling, which add another 30 hours to the process, for 200 mm ingots. A finished ingot is seen in Figure 4.3.

Ingot length is determined by the yield strength of the silicon neck and crucible size. The thin neck is not perfect material because it has defects arising from thermal shock, and torsional forces act on it during rotation (Figure 4.3). Silicon yield strength is significantly lower at high temperatures, but 300 mm ingots can weigh up to 300 kg.

Not all polysilicon can be utilized: about 10% of original polysilicon remains in the crucible. Crucibles cannot be reused and are extremely expensive disposable objects.

EGS is extremely pure, for instance boron, phosphorus and iron levels can be as low as 0.01–0.02 ppb. However, the crucible is a source of impurities, and for boron, sodium and aluminum it is the crucible and not EGS that determines ingot purity. If synthetic silica is used for crucibles, much higher purity CZ ingots can be pulled.

There is inevitable contamination of the growing silicon crystal from the materials that are essential to the growth set-up: the silica crucible is slightly dissolved during the crystal growth process, and therefore oxygen is always present in CZ silicon. Oxygen concentrations are 5–20 ppma. Carbon from graphite is present at concentrations below 1 ppm.

#### 4.2.3 Dopant incorporation

Impurities are incorporated from the melt into the ingot, but different dopants have widely different segregation coefficients. The segregation coefficient is defined as

$$k_0 = \frac{\text{concentration}_{\text{solid}}}{\text{concentration}_{\text{liquid}}} \quad (4.5)$$

All dopants and metallic impurities are enriched in the melt, with oxygen as perhaps the only material which is incorporated preferentially into the silicon solid phase.

Because dopant segregation coefficients are less than unity (Table 4.2), excess dopant is needed in the melt,



**Figure 4.3** Ingot: narrow neck, shoulder and body

**Table 4.2** Segregation of dopants and impurities at silicon melt/solid interface

| Dopants    | Others                      |
|------------|-----------------------------|
| Boron      | $k_0 = 0.8$                 |
| Phosphorus | $k_0 = 0.35$                |
| Arsenic    | $k_0 = 0.3$                 |
| Antimony   | $k_0 = 0.023$               |
| Gallium    | $k_0 = 0.0072$              |
| Aluminum   | $k_0 = 0.002$               |
| Iron       | $k_0 = 6.4 \times 10^{-6}$  |
| Copper     | $k_0 = 8 \times 10^{-4}$    |
| Nickel     | $k_0 = 1.3 \times 10^{-4}$  |
| Gold       | $k_0 = 2.25 \times 10^{-5}$ |
| Oxygen     | $k_0 = 1.25$                |
| Carbon     | $k_0 = 0.07$                |

compared to the final ingot. This can be calculated from  $k_0$  values easily using Equation 4.6. As pulling advances, the melt volume decreases, dopant concentration in the melt increases and therefore dopant concentration  $C_s$  in the ingot increases along its length:

$$C_s = k_0 C_0 (1 - X)^{k_0 - 1} \quad (4.6)$$

where  $C_0$  is the initial dopant concentration in the melt,  $X$  is the fraction solidified and  $k_0$  is the segregation coefficient.

The concentration of oxygen in the ingot decreases as pulling advances because its segregation coefficient is above unity. There is another reason, too: as the melt is consumed, there is less contact between the silica crucible and melt and less dissolution of oxygen. Ingot resistivity and oxygen concentration will thus vary, and if tight specifications are set, not all of the ingot will be within the specifications.

Because the crystal is rotated during growth, there are also radial non-uniformities in the ingots. One source for these is simply cooling rate: the edges cool faster than the centre (because of radiation at 1400 °C) and therefore any process that depends on cooling rate will be affected. For instance, vacancies and many other defects show a radial distribution.

Because molten silicon is electrically conductive, magnetic fields can be used to control melt behavior. Magnetic fields reduce local temperature fluctuations and thermal convection currents, which leads to more stable melt and consequently to more uniform growth. Magnetic Czochralski (MCZ) growth enables better control of oxygen levels in the crystal, probably because it suppresses flows that distribute oxygen from the crucible. At least a more uniform melt enables other process parameters, like argon gas flow, to be varied over a larger range.

#### 4.2.4 Float zone crystal growth

If high-resistivity silicon is needed, float zone (FZ) crystal growth is used. In the FZ method a polysilicon ingot is

placed on top of a single crystal seed. The polycrystalline ingot is melted by an external RF coil, and the solidifying silicon copies the crystal structure of the seed. The coil is moved up, the melted zone advances upward, and the single crystal solidifies along the ingot.

The highest FZ silicon resistivities are on the order of 20 000 ohm-cm, compared to 1000 ohm-cm for CZ. Because there is no silica crucible, there is much less oxygen ( $10^{16} \text{ cm}^{-3}$ , compared to  $10^{18} \text{ cm}^{-3}$  for CZ) and metal contamination from the crucible is also eliminated. FZ wafers, however, are mechanically weaker than CZ wafers because oxygen mechanically strengthens silicon. FZ wafers are available only in smaller diameters, 150 mm maximum, with 200 mm FZ demonstrated but not used in device manufacturing. When doped FZ silicon is made, dopants are introduced by flushing the melt zone with gaseous dopants like phosphine ( $\text{PH}_3$ ) or diborane ( $\text{B}_2\text{H}_6$ ). High-resistivity FZ is often doped via neutron transmutation doping (NTD) according to the reaction



A silicon nucleus captures a neutron, and the newly formed nucleus decays by  $\beta$ -decay. This doping method explains why high-resistivity silicon (5–20 kohm-cm) is available only in n-type.

## 4.3 Silicon Crystal Structure

Silicon has a cubic diamond lattice structure. The unit cell can be thought of as two interleaved face-centered cubic (FCC) lattices with their origins at (0,0,0) and (1/4, 1/4, 1/4). The distance between two atoms is  $(\sqrt{3}/4)a$ , and radius  $(\sqrt{3}/8)a$ , where  $a$  is the unit cell edge length, 5.43095 Å. As shown in Figure 4.4 there are 18 atoms to



**Figure 4.4** Silicon lattice: the unit cell consists of eight atoms. Reproduced from Jenkins (1995) by permission of Prentice Hall



**Figure 4.5** Some important silicon crystal planes with their Miller indices

be considered: eight at vertices (they are shared between eight unit cells and therefore contribute one atom to the unit cell); six face atoms are shared between two neighboring unit cells and contribute three atoms; and there are four atoms fully inside the unit cell. The volume fraction of the space filled by silicon atoms is 34%, very low compared to hexagonal close packing, which fills 74% of the space. This open structure leads to fast diffusion of dopants (and impurities) in silicon.

Miller indices define the planes of a crystal. The plane that defines the faces of the cube (see (Figure 4.5)) intersects axes 1, 2, 3 at  $(1, \infty, \infty)$ , respectively. The Miller index of a plane is given by the reciprocal of these intersects, namely  $(1,0,0)$ . The planes tying edges are designated  $(1,1,0)$  and diagonal planes  $(1,1,1)$ . The set of six equivalent planes (the six faces of the cube) together are designated  $\{100\}$ . There are twelve  $(1,1,0)$  and eight  $(1,1,1)$  planes. The unit vectors can also be negative: they are designated with a bar, e.g.  $(1,\bar{1},1)$ .

Fourfold symmetry of (100) and sixfold symmetry of (110) and (111) (Figure 4.6) will become apparent in anisotropic wet etching of silicon (to be discussed in Chapter 20).

The angles between the planes can be calculated from the scalar product of the normal vectors:

$$\mathbf{a} \cdot \mathbf{b} = |\mathbf{a}| |\mathbf{b}| \cos(\mathbf{a}, \mathbf{b}) \quad (4.8)$$

Visual examination shows that (100) and (110) planes meet at  $45^\circ$  and all the other angles can be calculated easily when negative unit vectors are accounted for:  $\bar{1}10$  is  $(-1,1,0)$ . The angle between (111) and (100) planes is calculated from  $1 = \sqrt{3} \cos \alpha$ , giving  $\alpha = 54.7^\circ$ .

In order to become familiar with silicon crystal structure, the paper fold model (Figure 4.7) can be handy. Copying the model on an overhead transparency and gluing it together will result in a 26-gon which visualizes the crystal planes nicely. It will be indispensable when



**Figure 4.6** Silicon crystal viewed from different angles: (a) face view (100); (b) edge view (110); (c) vertex view (111). Courtesy Ville Voipio, Aalto University



**Figure 4.7** Fold-up paper model of silicon crystal planes. Courtesy Hiroshi Toshiyoshi

crystal plane-dependent etching of silicon is discussed in Chapter 20.

Two crystal orientations are widely used in microfabrication:  $<100>$  and  $<111>$ . The former is the main material for CMOS, MEMS and solar cells; the latter for bipolar transistors, power semiconductor devices and radiation detectors which rely on epitaxial deposition. Wafers for special applications can also be cut to other index planes, like (311) and (511).

## 4.4 Silicon Wafering Process

Silicon ingots are transformed into wafers by a multistep process which includes mechanical, thermal and chemical treatments, and many cleaning and inspections steps. Some 200 million silicon wafers are produced annually. The main steps in the wafering process are listed in Table 4.3.

The silicon crystal orientation is determined by the seed crystal. After the ingot has cooled down, it is ground to exact diameter and cut into 50 cm stocks, which are measured for crystal orientation by X-ray diffraction. A flat (Figure 1.20) or a notch is then ground into the ingot to establish orientation. The flat or notch of a  $<100>$  wafer is oriented along the [110] direction.

The ingot is then sawn into slices. These slices are for example 600  $\mu\text{m}$  thick when the final target thickness of the wafers is 400  $\mu\text{m}$ . The wire saw is made of kilometers-long continuous wire (diameter 80–250  $\mu\text{m}$ ) which uses

**Table 4.3** Silicon wafering process

- Ingot diameter grinding
- Ingot crystal orientation by XRD
- Flat grinding
- Sawing ingot into wafers
- Lapping
- Edge contouring
- Laser marking
- Etching
- Donor annealing
- Final polishing
- Inspections

mechanical abrasion and a chemically active slurry to cut through the ingot. In practice multiwire saws are used to increase throughput (Figure 4.8). Thousands of wafers can be cut in a single process 12 hours long.

The surface of a  $<100>$  wafer is a (100) plane with [100] surface normal vector, usually cut as precisely as practical (Figure 4.9). Sometimes wafers are intentionally miscut, for example by  $3^\circ$ . The silicon surface resulting from such a miscut is shown in Figure 4.10 (top). In a perfectly cut wafer an atomically flat surface would result, but in a miscut wafer atomic terraces are formed. The terrace length depends on the miscut angle. If the wafer is additionally off-oriented (the flat is not exactly along [110]), kinks will be created, as shown in Figure 4.10



**Figure 4.8** Multiwire saw. Courtesy Jorma Koskinen



**Figure 4.9** A  $<100>$  silicon wafer is cut so that one of the  $(100)$  planes defines the wafer surface; the vector normal to the surface is in the direction  $[100]$  and the flat is along the direction  $[110]$



**Figure 4.10** Miscut wafer surface in atomistic view (top); miscut and off-orientation wafer with kinks along terrace edges (bottom). Reproduced from Butt *et al.* (2006), by permission of Wiley-VCH GmbH



**Figure 4.11** HRTEM micrograph of silicon lattice. Courtesy Young-Chung Wang, FEI Company

(bottom). These will act as preferential sites for deposited atoms to attach to, as will be discussed in Chapters 6 and 7. A high-resolution TEM micrograph (Figure 4.11) of a silicon lattice reveals the atomic planes and terraces, and nicely shows the difference between single crystalline silicon and the amorphous oxide on top of it.

Flats and notches are used by automatic wafer handlers to orient wafers inside equipment. In lithography flats enable patterns to be aligned to crystal planes. This latter aspect is especially important in micromechanics where crystal plane-dependent anisotropic etching is a major technique (Chapter 20).

The next step is lapping: waviness and taper from sawing are removed by lapping. In this step the wafers are rotating between two massive steel plates in an alumina slurry. Lapping ensures parallelism of wafer surfaces. The surface roughness is about  $0.1\text{--}0.3 \mu\text{m}$  after lapping. The edges of the wafers are then rounded (Figure 4.12a) in order to prevent chipping of silicon during wafer handling and to eliminate water marks during drying steps. Wafer breakage often starts from a crack at the edge and, because silicon is brittle, the crack propagates through the whole wafer. Sometimes the edges are polished in order to improve mechanical strength.

Wafers are marked by laser scribing. Subsequent cleaning steps remove the silicon dust generated by marking. Alphanumeric or bar code marking enable wafer identity tracking during processing, and back tracing to the wafering process and to the ingot, ensuring total traceability.

Etching is then used to remove lapping damage: both alkaline ( $\text{KOH}$ ) and acidic ( $\text{HF-HNO}_3$ ) etches can be used. Roughness is reduced somewhat by acid etching, but not by alkaline etching. The SEM micrograph in Figure 4.12b shows the back of a wafer after alkaline etching.



**Figure 4.12** (a) Different edge roundings (b) Back of an alkaline etched (non-polished) silicon wafer in SEM. Reproduced from Abedrabbo *et al.* (1998), copyright © 1999 by permission of Elsevier Science Ltd

Thermal donors are charged interstitial oxygen complexes which behave like n-type dopants. An annealing step at 600–800 °C destroys thermal donors and stabilizes wafer resistivity.

Final polishing with 10 nm silica slurry in alkaline solution removes about 20 μm of silicon and results in a root mean square (RMS) surface roughness of 0.1–0.2 nm. The formulas for calculating roughness are given in Equations 4.9a–c below. Surface height  $z$  is measured over a sampling distance  $L$ , and RMS roughness is calculated in Equation 4.9c using mean height as a reference plane:

$$\sigma^2 = \frac{1}{L} \int_0^L (z - m)^2 dx \quad \text{standard deviation} \quad (4.9a)$$

$$m = \frac{1}{L} \int_0^L z dx \quad \text{mean height} \quad (4.9b)$$

$$\sigma = \sqrt{\frac{1}{L} \int_0^L z^2 dx} \quad \text{RMS roughness} \quad (4.9c)$$

Inspection and cleaning steps constitute a major fraction of all wafering steps. Wafers are measured for mechanical and electrical properties. Contactless measurements, for example capacitance, optical and eddy current methods, are preferred because contact methods introduce contamination and damage. Wafers are specified for particle cleanliness. Laser light scattering

can be used to measure particle size distributions down to 60 nm, but even the unaided eye can detect particles larger than about 0.3 μm because of their scattering under intense light (e.g., from a slide projector).

Wafers are specified for a number of electrical, mechanical, contamination and other properties as agreed between the wafer supplier and customer. The specifications in Table 4.4 are an example of fairly standard 100 mm wafers. Wafer specifications for ICs and MEMS are discussed in more detail in Chapters 26 and 30, respectively.

Thickness refers to wafer centre point thickness only, and other numbers are needed to account for thickness

**Table 4.4** Basic specifications for 100 mm wafers for integrated circuits and microsystems

|                  | IC             | MEMS           |
|------------------|----------------|----------------|
| Growth method    | CZ             | CZ             |
| Type/dopant      | P/boron        | P/boron        |
| Orientation      | 100            | 100            |
| Off-orientation  | 0.0 ± 1.0°     | 0.0 ± 0.2°     |
| Resistivity      | 16–24 ohm-cm   | 1–10 ohm-cm    |
| Diameter         | 100.0 ± 0.5 mm | 100.0 ± 0.5 mm |
| Thickness        | 525 ± 25 μm    | 380 ± 10 μm    |
| Front side       | Polished       | Polished       |
| Back side        | Etched         | Polished       |
| Flat orientation | <110> ± 1°     | <110> ± 0.2°   |
| Oxygen level     | 13–16 ppma     | 6–10 ppma      |
| Particles        | <20 at 0.3 μm  | 20 at 0.3 μm   |



**Figure 4.13** Thickness and total thickness variation (TTV). Wafer flattened to chuck; that is, back-side reference

**Table 4.5** Resistivity vs. dopant concentration

| Dopant level      | Short    | Concentration ( $\text{cm}^{-3}$ ) | Resistivity (ohm-cm) |
|-------------------|----------|------------------------------------|----------------------|
| Lightly doped     | n-, p-   | $10^{13}\text{--}10^{15}$          | >10                  |
| Moderately doped  | n, p     | $10^{15}\text{--}10^{18}$          | 10–0.1               |
| Highly doped      | n+, p+   | $10^{18}\text{--}10^{20}$          | 0.1–0.001            |
| Very highly doped | n++, p++ | > $10^{20}$                        | 0.001                |

variation and geometric distortions (Figure 4.13). Total thickness variation, TTV, is defined as the difference between the maximum and minimum values of thickness encountered in the wafer. Wafers come in standard sizes and thicknesses, for example 100 mm and 525  $\mu\text{m}$ , or 200 mm and 725  $\mu\text{m}$ . In IC fabrication or many thin-film devices, wafer thickness is not an issue, but in bulk MEMS applications through-wafer etching is standard, and it depends critically on wafer thickness control.

The resistivity of silicon can be varied over many orders of magnitude, but typical wafers come with dopant concentrations of  $10^{15}$  to  $10^{18}$   $\text{cm}^{-3}$ , corresponding to resistivities of 0.1–10 ohm-cm. Resistivity ranges and shorthand designations for them are listed in Table 4.5.

EGS purification, CZ crystal growth and the multistep wafering process produce high-quality silicon wafers, but the process is energy intensive and expensive. Silicon for solar cells needs to be considerably cheaper. That topic will be discussed in Chapter 37.

## 4.5 Defects and Non-Idealities in Silicon Crystals

Even though silicon wafer fabrication results in wafers with extremely well-defined properties, some defects are bound to be found. These defects can be classified according to their origin into three categories:

1. Grown-in defects from crystal pulling.

2. Defects resulting from wafering.
3. Process-induced defects.

The first two classes are the responsibility of the wafer manufacturer and the third class requires wafer and device manufacturer co-operation because process-induced defects depend on the starting material. For instance, heating and cooling rates, and maximum temperatures experienced by the wafers, determine how the defects in the wafers will behave, or how new defects are generated.

Defects are further classified into point defects (e.g., vacancies, empty lattice sites), line defects (e.g., stacking faults, extra rows of atoms in an otherwise perfect crystal), area defects (e.g., dislocation loops) and bulk defects (e.g., precipitates). The main types of defects are shown in Figure 4.14.

There are many sources for vacancies: some result from imperfections of crystal growth, and some are created during wafer processing; for example, ion implantation doping of silicon involves high-energy ion collisions in silicon, and ions displace silicon atoms from their lattice sites, creating vacancies. Some vacancies are always present because of statistics: above absolute zero, lattice vibrations displace atoms and create vacancies and interstitials. This becomes significant for silicon around 900 °C, which is the temperature where diffusion becomes fast enough to be technologically relevant.

Haze is defined as light scattering from surface defects, for example scratches, surface roughness or crystal



**Figure 4.14** Defects in single crystalline material: a, foreign interstitial; b, dislocation; c, self-interstitial; d, precipitate; e, stacking fault (external); f, foreign substitutional; g, vacancy; h, stacking fault (internal); i, foreign substitutional. Reproduced from Green (1995) by permission of University of New South Wales

**Table 4.6** Sources of non-idealities in silicon wafers

|                    |                                                 |
|--------------------|-------------------------------------------------|
| EGS polysilicon    | Dopants (B, P) and other impurities (C, metals) |
| Czochralski growth | Impurities from quartz                          |
|                    | Oxygen from quartz                              |
|                    | Carbon from graphite and SiC                    |
|                    | Vacancies and interstitials                     |
|                    | Precipitates                                    |
|                    | Dislocations                                    |
| Wafering process   | Contamination from tools                        |
|                    | Mechanical distortions                          |
| Wafer processing   | Contamination                                   |
|                    | Crystallinity defects                           |
|                    | Precipitation                                   |
|                    | Mechanical distortions                          |
|                    | Dislocations                                    |

defects. Haze measurement is done by scatterometry, where the whole wafer is scanned, in contrast to roughness measurement, which is local area only, for instance a  $5 \times 5 \mu\text{m}$  area by a AFM.

Table 4.6 lists some non-idealities found in silicon wafers and their most common sources. Some of these can be avoided or limited to acceptable levels by careful choice of processing conditions, including wafer cleaning, temperature ramp rates and materials that make contact with the wafers, for instance wafer boats in furnaces. Defects will be discussed in more detail in Chapter 22.

## 4.6 Advanced Wafers

ICs are made in the thin surface layer of a silicon wafer, while many MEMS devices extend throughout the wafer. Lithography is carried out also on the back side of the MEMS wafer and double side polished (DSP) wafers are then needed. While single side polished (SSP) wafers exhibit for example  $0.5 \mu\text{m}$  RMS roughness on the back side, that of the DSP wafer is nearly as perfect as the front side, but not completely, because it is attached to a holder during front-side polishing and some damage from the holder remains.

Bulk micromechanics relies on crystal plane-dependent etching: silicon (100) planes etch 100 times faster than (111) planes, offering intriguing structuring possibilities. These will be discussed in Chapter 20. Most surface micromachining (Chapter 29) is done on SSP silicon wafers, even though device functionality is determined by thin films deposited on top of silicon wafers, and other substrates could be used. Because most of the processing



**Figure 4.15** Epitaxial wafer with p-epitaxial layer on p+ substrate; and silicon-on-insulator (SOI) wafer with a p-type device layer and n-type handle wafer, separated by an oxide layer

equipment in microtechnology has been developed for silicon processing, compatibility favors silicon use even when silicon properties per se are not required.

Epitaxy is a process for growing more crystalline silicon on top of a silicon wafer, with doping level and/or dopant type different from the substrate wafer (Figure 4.15). Multilayer structures with many layers of different doping levels and types are equally easy to make. This will be discussed in more detail in Chapters 6 and 22.

Silicon-on-insulator (SOI) wafers can be made for example by bonding a silicon wafer to an oxidized wafer. SOI wafers offer the possibility to optimize processes on top and bottom wafers separately, as shown in Figure 4.15. It is also possible to reduce process steps and minimize leakage currents because pn junctions can be eliminated. Advanced wafers will be discussed in Chapter 22.

The more special wafers naturally cost more. Double side polishing adds 50% to the price, epitaxy can double the wafer price and SOI can cost anything from three to ten times the price of a basic bulk wafer. The price also goes up with wafer size: 100 mm wafers cost \$10 and 200 mm wafers (with four times the silicon area compared to 100 mm) cost \$40; 300 mm wafers are more expensive per cm<sup>2</sup>, and 450 mm wafers are only available in test quantities, so prices will not stabilize before volume production starts.

## 4.7 Exercises

1. Estimate the silicon lattice constant from atomic mass and density.
2. Consider an Olympic swimming pool filled with golf balls and one squash ball. If golf balls represent silicon atoms, and the squash ball represents a phosphorus atom, what would the resistivity of a silicon piece be with a similar doping level?
3. Electronic grade polysilicon is available with 0.01 ppb phosphorus concentration. What is the highest ingot

- resistivity that can be pulled from such a starting material?
4. If 50 kg of ultrapure polysilicon is loaded into a CZ crystal puller, how much boron should be added if the target doping level of the ingot is 10 ohm-cm?
  5. Convert CZ and FZ silicon maximum resistivities 1000 ohm-cm and 20 000 ohm-cm into dopant concentrations.
  6. If wafer resistivity specifications are 5–10 ohm-cm (phosphorus), calculate the fraction of the ingot that yields wafers within this specification.
  7. Calculate the RMS roughness for the three surfaces shown below. Compare RMS to peak-to-valley roughness.



## References and Related Reading

- Abedrabbo, S. *et al.* (1998) Perspectives on emissivity measurements and modeling in silicon, *Mater. Sci. Semicond. Process.*, **1**, 187–193.
- Bullis, W.M. (2000) Current trends in silicon defect technology, *Mater. Sci. Eng.*, **B72**, 93–98.
- Butt, H.-J., K. Graf and M. Kappl (2006) **Physics and Chemistry of Interfaces**, Wiley-VCH Verlag GmbH.
- Green, M.A. (1995) **Silicon Solar Cells**, Centre for Photovoltaic Devices and Systems, Sydney.
- Hahn, P.O. (2001) The 300 mm silicon wafer – a cost and technology challenge, *Microelectron. Eng.*, **56**, 3–13.
- Hull, R. (1999) **Properties of Crystalline Silicon**, IEE Publishing.
- Irwin, J. (1962) Resistivity of bulk silicon, *Bell. Syst. Tech. J.*, **41**, 387.
- Jenkins, T. (1995) **Semiconductor Science**, Prentice Hall.
- Lindroos, V. *et al.* (2010) **Handbook of Silicon Based MEMS Materials and Technologies**, Elsevier.
- Markov, I.V. (2003) **Crystal Growth for Beginners: Fundamentals of Nucleation, Crystal Growth, and Epitaxy**, World Scientific.
- Petersen, K. (1982) Silicon as a mechanical material, *Proc. IEEE*, **70**, 420. Reprinted in W. Trimmer (ed.), **Micromechanics and MEMS: Classic and Seminal Papers to 1990**, IEEE Press, 1997.
- Porrini, M. *et al.* (2005) Measurement of boron and phosphorus concentration in silicon by low-temperature FTIR spectroscopy, *Appl. Phys.*, **A81**, 1187–1190.
- Shimura, F. (1997) **Semiconductor Silicon Crystal Technology**, Academic Press.

# 5

## Thin-Film Materials and Processes

Thin films are needed to make metal wires and to insulate those wires, to make capacitors, resistors, inductors, membranes, channels, nozzles, mirrors, beams and plates, and to protect those structures against mechanical and chemical damage. Thin films have roles as permanent parts of finished devices, but they are also used intermittently during wafer processing, as protective films, as sacrificial layers, and as etch and diffusion masks.

A great many solid materials are available as thin films: aluminum, gold, copper, tungsten and nickel are routinely used in microfabrication. Oxides of silicon, aluminum, hafnium and tantalum are used, as are nitrides of silicon and titanium. Diamond-like and Teflon-like films offer special properties, as do various alloys like PtMn, TiW, SiGe and CoFe.

The same thin film can serve in many different functions: for example, silicon nitride is the traditional passivation film in integrated circuits, providing both chemical endurance and mechanical scratch protection. But silicon nitride is also used as a mask for oxidation (because oxygen diffusion through nitride is practically nil), as a capacitor dielectric (nitride dielectric constant is higher than that of silicon dioxide), as a suspended membrane (because nitride stress can be made small and tensile), and as optical material (with refractive index matching nicely between silicon and air). Similarly, silicon is used not only for its electronic properties, but also for its mechanical strength (micromechanics), optical absorption at visible wavelengths (solar cells, photodetectors), low absorption in the infrared (waveguides for 1.55  $\mu\text{m}$  optical telecom applications), high Seebeck coefficient (thermoelectric devices) and high thermal conductivity (uniform temperature distribution in micro-hotplate sensors).

This chapter deals with the most common deposition processes for thin films, with the basic characteristics which make thin films different from the bulk, as well as some important applications.

### 5.1 Thin Films vs. Bulk Materials

In thin films at least one dimension of the material is small, the thickness. For narrow lines, two dimensions are small, and for dots all three dimensions are small. This gives rise to the prominence of surface effects like surface scattering of electrons, leading to size-dependent resistivity or, at very small dimensions, to quantum effects. The size scale for quantum effects is estimated by Debye lengths, which are of the order of 10–100 nm at room temperature.

The density of thin films is often very low compared to bulk materials. Sputtered tungsten films can have a density as low as 12 g/cm<sup>3</sup> compared to a bulk value of 19.5 g/cm<sup>3</sup> and evaporated gold can have a density as low as 3 g/cm<sup>3</sup> but then it was optimized for infrared absorption. Usually porosity should be avoided as it leads to long-term instability: water vapor can adsorb in the pores, and high surface area makes the films reactive: such films oxidize and corrode readily.

Many thin-film properties, resistivity, dielectric constant, coefficient of thermal expansion and refractive index are thickness dependent. Additionally, the film properties depend on deposition processes in profound ways. The example in Table 5.1 gives resistivities of sputtered molybdenum films prepared under slightly different process conditions in different sputtering systems. In addition to thickness dependence, three other facts emerge from

**Table 5.1** Resistivity of sputtered molybdenum

| Thickness            | Underlayer     | Conditions       | Resistivity ( $\mu\text{ohm}\cdot\text{cm}$ ) |
|----------------------|----------------|------------------|-----------------------------------------------|
| Bulk                 | —              | —                | 5.6                                           |
| Thin film,<br>50 nm  | $\text{SiO}_2$ | System 1, 20 °C  | 17                                            |
| Thin film,<br>300 nm | $\text{SiO}_2$ | System 1, 20 °C  | 12                                            |
| Thin film,<br>300 nm | TiW            | System 1, 20 °C  | 9                                             |
| Thin film,<br>300 nm | $\text{SiO}_2$ | System 2, 20 °C  | 15                                            |
| Thin film,<br>300 nm | $\text{SiO}_2$ | System 3, 150 °C | 9                                             |
| Thin film,<br>300 nm | $\text{SiO}_2$ | System 3, 450 °C | 8                                             |

**Figure 5.1** X-ray diffraction (XRD) of tantalum thin films: crystal structure and resistivity depend on the underlying material. Reproduced from Ohmi (2001) by permission of IEEE

this table: thin-film resistivity is always more than that of the bulk material; different deposition systems result in very different thin-film properties; and the underlying material has a major effect on film properties.

In Figure 5.1 the tantalum thin-film structure (as measured by X-ray diffraction) and resistivity are similarly seen to depend on the underlying layer: tantalum film on tantalum nitride is low resistivity, while tantalum on oxide has a resistivity an order of magnitude higher. The difference is due to the different crystalline phase, which is due to the different underlying layer.

Structure depends on film thickness, and it may be that very thin films (e.g., 50 nm) are amorphous, yet thicker films (e.g., 200 nm) are polycrystalline. This is shown in Figure 5.2 for  $\text{SrTiO}_3$  film. XRD peaks indicative of

**Figure 5.2** Thickness dependence of  $\text{SrTiO}_3$  films by XRD. The 90 nm thick film is amorphous but thicker films are polycrystalline. Dielectric constant depends on structure, which depends on thickness. Reproduced from Vehkämäki *et al.* (2001) by permission of Wiley-VCH

crystallinity only appear for thicker films. The dielectric constant  $\epsilon_r$  is also a strong function of thickness.

Thin films can be amorphous, polycrystalline or single crystalline (epitaxial) as deposited. Epitaxial films remain single crystalline during annealing; polycrystalline films experience grain growth and sometimes phase transitions into other crystal structures; amorphous films either stay amorphous or crystallize (into a polycrystalline material, and under very special conditions to single crystalline material). Silicon dioxide, silicon nitride and aluminum oxide are exceptional amorphous films because they remain amorphous in all typical microfabrication processes. Pictured in Figure 5.3 are SEM cross-sections of  $\text{Al}_2\text{O}_3$  and  $\text{SrTiO}_3$  films: aluminum oxide is amorphous and strontium titanate is polycrystalline.

Films prepared by different sputtering systems are different, and films prepared by two completely different deposition processes will differ even more. Copper films made by sputtering, evaporation, electroplating or CVD can have a factor of two differences in resistivity or grain size.

## 5.2 Physical Vapor Deposition

The general idea of physical vapor deposition (PVD) is material ejection from a solid target material, transported in a vacuum to the substrate surface where film deposition takes place. Atoms can be ejected from the target by various means: resistive heating, electron beam heating, ion bombardment or laser beam bombardment (known as laser ablation). All aluminum films in microfabrication



**Figure 5.3** SEM micrographs of thin-film structure: (a) amorphous aluminum oxide, reproduced from Ritala *et al.* (1999) by permission of Wiley-VCH; (b) polycrystalline strontium titanate, reproduced from Vehkämäki *et al.* (2001) by permission of Wiley-VCH

are deposited by PVD, and PVD is used for copper, refractory metals and for metal alloys and compounds like TiW, WN, TiN, MoSi<sub>2</sub>, ZnO and AlN.

### 5.2.1 Evaporation

Evaporation of elemental metals is fairly straightforward: hot metals have high vapor pressures and in a high vacuum the evaporated atoms will be transported to the substrate.

Typical deposition rates in evaporation are 0.1–1 nm/s, which is very slow.

Evaporation systems are either high vacuum (HV) or ultrahigh vacuum (UHV) systems, with the best UHV deposition systems with 10<sup>-11</sup> torr base pressures, and 10<sup>-12</sup> torr oxygen partial pressures. In (ultra)high vacuum the atoms do not experience collisions, and therefore they take a line-of-sight route from source to substrate. The mean free path (MFP) is the measure of collisionless transport, and below about 10<sup>-4</sup> torr the MFP is larger than the size of a typical deposition chamber (for more

discussion on vacuum science and technology, refer to Chapter 33). Low deposition temperature combined with line-of-sight transport means that evaporated films will not coat sidewalls of holes and ridges well, even though film quality on planar surfaces is good.

There are very few parameters in evaporation that can be used to tailor film properties. Atoms arrive at thermal speeds, which results in basically room temperature deposition. There is no bombardment in addition to the thermalized atoms themselves, which bring very little energy to the surface. Substrate heating can be done to improve film quality. This works because impurities are desorbed and adsorbed atoms can diffuse and find energetically favorable lattice sites.

Low-melting-point metals, such as gold and aluminum, can easily be evaporated, but refractory metals require more sophisticated heating methods. Localized heating by an electron beam (Figure 5.4) can vaporize even tungsten (m.p. 3650 K), but deposition rates are, however, very low. Additionally, X-rays will be generated, which can damage sensitive devices.

Evaporation of alloys and compounds is tricky: the component with higher vapor pressure will evaporate more readily, and it can happen that the minority atoms in the starting material end up as the majority atoms in the thin film. Most compounds decompose when heated, therefore evaporation of compounds is limited to a few special cases, like silicon dioxide.

It is possible that the molten metal reacts with the crucible because temperatures are very high, even though this



**Figure 5.4** Electron beam evaporation: heated metal vaporizes and the evaporated atoms are transported in high vacuum to the substrate wafer



**Figure 5.5** Sputtering: argon ions knock atoms out of a target, and the ejected atoms travel in a vacuum and deposit on the wafer

is being minimized by the use of refractory materials for crucibles: namely, Mo, Ta, W, graphite, BN,  $\text{SiO}_2$  and  $\text{ZrO}_2$ . Some crucible material can be incorporated into film also in the case of electron beam misalignment: if a misaligned e-beam hits the crucible, crucible material will be evaporated and incorporated in the deposited film.

### 5.2.2 Sputtering

Sputtering (Figure 5.5) is the most important PVD method. Argon ions ( $\text{Ar}^+$ ) from a glow discharge plasma hit the negatively biased target and eject typically one target atom. The ejected target atoms will be transported to the substrate wafers in a vacuum. These atoms are energetic and hit the substrate with considerable energy, which has both beneficial and detrimental effects on the growing film. Typical sputtering rates are 1–10 nm/s, significantly higher than in evaporation. Sputtering of nonconductive films necessitates use of RF fields to prevent charging of the target. Further discussion on sputtering systems technology can be found in Chapter 33.

Because sputtering pressures are quite high, 1–10 mtorr (cf. evaporation  $10^{-6}$  torr), sputtered atoms will experience many collisions before reaching the substrate. In a process called thermalization, the high-energy sputtered particles (5 eV corresponds to about 60 000 K!) collide with argon gas ( $T = 300\text{ K}$ ) and cool down. Thermalization occurs also in other species present in the plasma, namely the reflected neutrals (some argon ions are neutralized upon target collision). These neutrals provide energy

to the substrate. Thermalization reduces the energy of particles reaching the substrate and it reduces the flux of particles to the substrate. Lower flux means lower deposition rate.

In contrast to evaporation, the energy flux to the substrate wafer can be substantial. This has both beneficial and detrimental effects: loosely bound atoms (both film forming atoms as well as unwanted impurities) will be knocked out, improving adhesion and making the film denser. But energies that are too high can cause damage to the film, the substrate and underlying structures (thin-oxide breakdown because of high voltages). There will always be some argon trapped in the film, but its effect can usually be neglected because argon is a noble gas and therefore non-reactive. Incorporation of residual oxygen or nitrogen is much more pronounced because they are reactive and form oxides and nitrides.

Sputtering yield is the number of target atoms ejected per incident ion. Sputtering yields of metals range from about 0.5 (for carbon, silicon and refractory metals Ti, Nb, Ta, W) to 1–2 for aluminum and copper, to 4 for silver at 1000 eV argon ion energy. Refractory metals have low sputtering yields, which is the fundamental reason for lower deposition rates. In practice, there is another reason which further lowers the deposition rate: refractory metals tend to have higher resistivity and thus lower thermal conductivity, which means that high sputtering powers cannot be applied to refractory sputtering targets. For heavy metals like tungsten and tantalum, sputtering yields are higher with xenon and krypton: these heavy gases transfer energy more efficiently to similar mass target atoms. However, argon is almost exclusively used.

If oxygen is added to the sputtering atmosphere intentionally (usually together with argon), oxide films will result. The method is called reactive sputtering. In similar vein, nitrogen additions lead to nitrides. This is the way for example that  $\text{Ta}_2\text{O}_5$  and TiN are made by sputtering.

## 5.3 Chemical Vapor Deposition

In chemical vapor deposition (CVD) the source materials are brought into the reactor in the gas phase, they are activated in the plasma, diffuse to the wafer surface, and react there to deposit film. Byproducts are desorbed and pumped away as shown in Figure 5.6. Deposition rates are temperature dependent according to the Arrhenius equation (Equation 1.1), but they are on the order of 0.1–10 nm/s.

Common CVD processes include





CVD processes depend on both chemical reactions and flow dynamics. There are two main cases: high flow rate supplies enough reactants and film deposition is limited by slow surface chemical reactions (termed “surface reaction limited”); or fast surface reaction consumes source gas rapidly and the deposition rate is limited by gas supply. This is termed “mass transport limited” or “diffusion limited”. These two cases will be discussed in more detail in Chapter 34.

Silicon deposition (Equation 5.1 or 5.2) on a single crystalline silicon wafer can result in a single crystalline thin film. This is termed epitaxy and it is an important special case of thin-film deposition. The next chapter is devoted to epitaxial deposition. Most deposition processes lead to amorphous or polycrystalline films.

Silicon dioxide can be deposited by many reactions, Equations 5.3, 5.5, 5.6 and 5.7, for example:



The simple reaction in Equation 5.6 is, however, problematic. Silane and oxygen can already react in the gas phase, which means that solid oxide particles are formed in the gas stream. These will then float around the reactor and sporadically deposit on the wafers. In the nitrous oxide process, oxide is formed by a surface reaction, therefore particle contamination is reduced (but in both cases oxide is formed on the reactor walls, and these films will be a source of flakes and particles if the reactor is not cleaned regularly).

The names for CVD oxides are unfortunately many. LTO, for low-temperature oxide, refers to oxide deposited by the reaction in Equation 5.6. The low deposition temperature of  $425^\circ\text{C}$  is desirable in many cases. HTO obviously stands for high-temperature oxide (Equation 5.3), but the difference is deeper: different source gases are used, and the resulting film quality is much better at high temperatures. TEOS (Equation 5.7) is the name of the precursor molecule tetraethoxysilane  $\text{Si(OC}_2\text{H}_5)_4$ , but it is used as the name for the resulting oxide too (deposited at  $700^\circ\text{C}$ , resulting in high-quality oxide). Sometimes the name USG is used: it stands for undoped silica glass. However, there are no metals in USG, so it is not glass in the traditional sense (see Chapter 19 for glass microprocessing).

The addition of  $\text{POCl}_3$  gas to the source gas flow leads to phosphorus-doped oxide deposition. The resulting film is called PSG, for phosphorus-doped silica glass. A few



**Figure 5.6** CVD: source gas molecules adsorb and react on surface to form a film, and the reaction products are desorbed, diffused and pumped away

percent of phosphorus (5 atomic % maximum) modifies the oxide in many ways. Phosphorous getters sodium ions which are detrimental to MOS transistors, and therefore PSG is used as a passivation layer in integrated circuits. In MEMS PSG is used as a sacrificial layer because its etch rate in hydrofluoric acid is much faster than that of undoped CVD oxide. Phosphorus also lowers the glass transition temperature of PSG, making it possible to flow PSG at about 1000°C. If both boron and phosphorus are added, we get BPSG. This oxide film flows at about 950°C, resulting in smoothly sloping walls.

CVD tungsten is deposited in two steps. The silane reduction step (Equation 5.8) deposits a thin nucleation layer over every surface in the system and high-rate blanket deposition with hydrogen reduction (Equation 5.9) is used to achieve the desired total thickness:



This process is able to fill holes and trenches (Figure 5.19) and is very important in multilevel metallization (to be discussed in Chapter 28).

#### 5.4 PECVD: Plasma-Enhanced CVD

Because high temperatures cannot be used in many cases, for example when oxide needs to be deposited on aluminum (m.p. 650°C), one has to find new solutions. New source gas chemistries which enable lower deposition temperatures is one way to go. Another solution is to enhance source gas decomposition and reactions by plasmas. This results in deposition rates similar to thermal CVD, 0.1–10 nm/s, at much lower temperatures, typically around 300°C, enabling deposition on most metals, for instance. Unfortunately lower deposition temperature results in less dense films.

A simple parallel-plate diode reactor for PECVD is shown in Figure 5.7. Wafers are placed on a heated bottom electrode, the source gases are introduced from the top, and pumped away around the bottom electrode. The operating frequency is often 400 kHz, which is slow enough for ions to follow the field, which means that heavy ion bombardment is present. At 13.56 MHz only the electrons can follow the field, and the ion bombardment effect is reduced.

In thermal CVD, pressure, temperature, flow rate and flow rate ratio are the main variables. In PECVD there is additionally the RF power that can be varied. In advanced



**Figure 5.7** Schematic PECVD system

PECVD reactors, RF power can be applied to both electrodes, and the two power sources can supply different frequencies, duty cycles and power levels. The ratio of 13.56 MHz power to kilohertz power is important for film stress tailoring. PECVD shares many beneficial features of both thermal CVD and sputtering.

Whereas thermal oxide or LPCVD nitride is stoichiometric SiO<sub>2</sub> and Si<sub>3</sub>N<sub>4</sub>, with ratios 1:2 and 3:4 of atoms, many other (PE)CVD films are non-stoichiometric: for example, plasma nitride is best described as SiN<sub>x</sub> ( $x \approx 0.8$ ). Amorphous silicon, a-Si, or more specifically designated as a-Si:H, is made by PECVD, the overall reaction being the same as that of LPCVD silicon (Equation 5.1). Hydrogen is incorporated into deposited films up to 30 at. %. Hydrogen release during annealing has to be considered: it has both beneficial and detrimental effects.

PECVD can be used to deposit mixed oxides, nitrides and carbides, as well as doped oxides just like thermal CVD. A mixture of silane, nitrous oxide and ammonia will result in oxynitride, SiO<sub>x</sub>N<sub>y</sub>, with varying ratios of nitrogen and oxygen, covering the whole range of compositions (and material properties) between oxide and nitride.

Silicon carbide is deposited via the reaction



Carbon is deposited by the reaction (resembling silicon deposition, Equation 5.1)



Depending on the exact process conditions, many allotropes of carbon can be made. Non-conducting hydrogenated carbon films resemble diamond in some, but not all, respects, and they are known as diamond-like

carbon, DLC. Films with less hydrogen have  $sp^3$  bonds similar to diamond, and they are referred to as ta-C, tetrahedral amorphous carbon. If intense plasma or a hot filament is used, highly reactive atomic hydrogen is produced. In this case it is possible to grow polycrystalline diamond films. Under different CVD conditions carbon nanotubes (CNTs) are made. The important factor for CNT deposition is the presence of metallic catalyst particles, for example iron or nickel.

## 5.5 ALD: Atomic Layer Deposition

In ALD, film is deposited one atomic layer at a time, offering ultimate thickness control. ALD works in pulsed mode: chemical bonds are formed between precursor gas molecules and the surface atoms. Once all possible reaction sites are occupied, no more reactions can take place (Figure 5.8). A purging nitrogen pulse then removes all unreacted precursor molecules. A pulse of second precursor is then introduced. It reacts with the first reacted layer, the surface saturates similarly, and unreacted precursor gases are purged away. Repetition of successive reactant and purge pulses leads to film deposition in a layer-by-layer fashion. The ability of ALD to coat over steps is excellent because all surfaces are coated alike. Figure 1.7 shows ALD alumina and titanium multilayer films deposited over steps. This ability to coat steep topographical features is increasingly in demand as both ICs and MEMS are made more 3D.

As an example of the ALD process, hafnium dioxide deposition is discussed.  $HfO_2$  is a material with a high



**Figure 5.8** ALD: first pulse of precursors saturate wafer surface, and extra precursors are purged away by a nitrogen pulse; second precursor gases react with first layer, and reaction products are purged away

dielectric constant and is being used as the gate oxide in advanced CMOS. Hafnium chloride reacts with surface hydroxyl groups to form  $Hf-O$  bonds. The second precursor is water, and the oxygen in water reacts with the hafnium to form  $Hf-O$  bonds again, with hydrogen chloride formed as a byproduct. The overall reaction for hafnium dioxide deposition is given by Equation 5.12. The notation (ad) emphasizes that the reactions take place between adsorbed molecules on the surface, not in the gas phase:



ALD is free of one of the main mechanisms of irreproducibility in CVD: homogeneous gas phase reactions. Because only one gas is introduced at a time, there cannot be gas phase reactions between precursors.

The layer thickness is given by the number of pulses times the monolayer thickness. In theory one monolayer per pulse is deposited, but in many cases sub-monolayer growth is seen. One explanation is steric hindrance: large precursor molecules take up space, so it is simply impossible for another precursor molecule to come close enough, and some surface atoms will not react with precursor molecules. This is depicted in Figure 5.9. It can also be noted that not all surface sites are reactive enough for the ALD reaction to take place.

Both monolayer and sub-monolayer deposition are self-limiting. Practical growth rates range around  $1 \text{ \AA/cycle}$  ( $0.1 \text{ nm/cycle}$ ): for  $Al_2O_3$  deposition they are  $1.1 \text{ \AA/cycle}$  and for  $TiN$ ,  $0.2 \text{ \AA/cycle}$ . When thickness/cycle numbers are translated into deposition rates, one has to take into account the flushing cycles between the pulses. Overall rates of a few nanometers per minute are typical for ALD. This is slow: for example, the LPCVD rate of polysilicon is typically  $10 \text{ nm/min}$ . But there are many applications where films of a few nanometers are needed, for example CMOS gate oxides and diffusion barriers in copper metallization.



**Figure 5.9** Sub-monolayer deposition in ALD: (a) non-reactive surface site; (b) steric hindrance by a large precursor molecule prevents another precursor molecule from approaching the reactive site

## 5.6 Electrochemical Deposition (ECD)

### 5.6.1 Electroplating/galvanic deposition

Electroplating takes place on a wafer that is connected as a cathode in metal ion-containing electrolyte solution. The counter-electrode is either passive, like platinum, or made of the metal to be deposited (Figure 5.10).

Electroplating can be very simple: copper is deposited on the cathode according to the reduction reaction, Equation 5.13, while at the anode copper is dissolved into the electrolyte solution:



Gold is plated in a two-step process (Equation 5.14) with the second, charge transfer reaction, as the rate limiting step:



Electroplating rates vary a lot but are generally in the range of 0.1–10  $\mu\text{m}/\text{min}$ . Deposited mass is calculated from

$$\text{mass} = \frac{\alpha ItM}{nF} \quad (5.15)$$

where  $I$  is the current,  $t$  the time,  $M$  the molar mass,  $n$  the species charge state,  $\alpha$  the deposition efficiency and  $F$  the Faraday constant, 96 500 coulombs.



**Figure 5.10** Electroplating:  $\text{CuSO}_4$  electrolyte ionizes to produce  $\text{Cu}^{2+}$  and  $\text{SO}_4^{2-}$  ions, copper film deposits at the cathode

Noble metals can be deposited at 100% efficiency ( $\alpha = 1.00$ ). In less noble metal deposition hydrogen evolution makes  $\alpha$  smaller, and for some non-metals like phosphorus co-deposition with cobalt (Co:P 12%, a soft magnetic material),  $\alpha$  can be as low as 0.2. Other typical electroplated metals include nickel and iron–nickel (81% Ni, 19% Fe, Permalloy). Many metals have no plating processes available: aluminum, titanium, tungsten, tantalum and niobium cannot be plated.

Three transport processes are active during ECD: diffusion at the electrodes due to local depletion of the reactant via deposition; migration in the electrolyte; and convective transport in the plating bath. The last is connected to electrochemical cell design, and it is affected by factors such as stirring, heating, recirculation and hydrogen evolution.

Macroscopic current distribution is determined by the plating bath electrode arrangement and wafer and bath conductivity. Electrical contact to the wafer also needs careful consideration. Microscopic (local) current distribution depends on pattern density and pattern shapes. The third scale in ECD is the feature scale: potential gradients inside structures are important, especially when deep and narrow grooves are filled.

In practice the plating solutions are complex mixtures of electrolytes, salts (for conductivity control), modifiers for film uniformity and morphology improvement, as well as surfactants. Accelerators (brighteners) are additives that modify the number of growth sites. Suppressors are additives for surface diffusion control. Taken together, these additives increase the number of nucleation sites and keep the size of each nucleation site small, which drives smooth growth. Pulsed plating can also be used in balancing nucleation and grain growth: high overpotential and low surface diffusion favor nucleation, and the opposite conditions favor grain growth. Many plating solutions are proprietary. Plating baths are rather aggressive solutions, and photoresist leaching into the plating bath or adhesion loss are real concerns for reproducible plating.

### 5.6.2 Plating on structured wafer

Electroplating onto a photoresist pattern easily produces elaborate microstructures, like the gears shown in Figure 5.11. The process is described in Figure 5.12. A conductive seed layer is sputtered on the wafer. This seed layer, also known as the plating base or field metal, can be very thin, tens of nanometers. Photoresist is exposed and developed, and metal plating then follows. Photoresist is then removed and the seed metal is etched away, resulting in metallic microstructures.



**Figure 5.11** Nickel gear structures ( $50\text{ }\mu\text{m}$  high) made by electroplating. Reproduced from Guckel (1998) by permission of IEEE



**Figure 5.12** Resist masked plating (LIGA, for Lithography and Galvanic plating): (a) seed layer deposition and lithography; (b) plating; (c) resist stripping; (d) seed layer removal

The seed layer needs to be removed after plating, otherwise it would electrically short all metallized structures. Often the deposited metal itself can act as an etch mask for seed layer removal because the seed layer is always very thin compared to the plated metal; in many cases the seed layer thickness is less than the plating thickness variation. Thickness uniformity of



**Figure 5.13** (a) Overplating; (b) backplating

plated metals is about 5–10%, so that  $50\text{ nm}$  seed layer thickness is less than thickness fluctuations of plated metal  $1\text{ }\mu\text{m}$  thick. Electroplating is suitable for extremely small structures, too: modern IC metallization is done by electroplating copper into trenches narrower than  $100\text{ nm}$  wide and  $200\text{ nm}$  high. Electroplating can fill trenches  $500\text{ }\mu\text{m}$  deep and  $5\text{ }\mu\text{m}$  wide (aspect ratio 100:1).

Usually plating is allowed to proceed till resist top surface level but not above. It is, however, possible to overplate, and to form mushroom-shaped structures (Figure 5.13). After resist stripping, such a mushroom can be annealed (reflow) to form a ball-like bump. Bumps of Sn–Pb and In are used for flip-chip packaging. Alternatively, plating can be continued until metal fronts touch. Removal of resist underneath results in free-standing metal bridges, or in fluidic channels, depending on design details. The applications can be in RF circuits as air bridges or as cooling channels for high-power electronics.

### 5.6.3 Electroless deposition

Electroless deposition depends on a reduction reaction in an aqueous solution which contains metal salts and a reducing agent. Metal deposition takes place as a result of metal ion reduction. The surface needs to be suitable for electroless deposition and this is achieved by exposing it to a catalyst, such as  $\text{PdCl}_2$ . This reducing agent starts the reduction reaction which then continues locally. Selective deposition is thus possible. Gold, nickel and copper are the usual metals to be deposited by the electroless method. The major advantage of electroless deposition compared to electroplating is elimination of the need to make electrical contacts to the wafer.

Copper electroless deposition chemistries traditionally use sodium hydroxide in the plating bath, but sodium is a contaminant in transistors. Alternative pH adjustment can be done with TMAH (tetramethyl ammonium hydroxide). Copper sulfate  $\text{CuSO}_4$  in formaldehyde (HCHO) and EDTA (ethylene diamine tetraacetic acid) complexing agent are the basic constituents of the bath. Surfactants

(polyethylene glycol) and stabilizers (2,2'-dipyridyl) can be added. The reaction is described by



The deposition rate is on the order of 100 nm/min. The electroless deposition set-up is extremely simple. Selectivity, however, is difficult to maintain. Hydrogen evolution and incorporation into the film are a problem because hydrogen is mobile; carbon incorporation is another problem.

Gold can be deposited from KOH, KCN, KBH<sub>4</sub>, KAu(CN)<sub>2</sub> mixture at rates exceeding 5 μm/min, even though much lower rates are usually used. Temperatures for electrochemical deposition processes range from room temperature to 100 °C.

## 5.7 Other Methods

### 5.7.1 Spin coating

Spin coating is a very widely used method for resist spinning and increasingly for other materials as well, for example spin-on-glasses (SOGs) and polymers (known together as spin-on-dielectrics, SODs) are usually spin coated. Spin coating will be discussed in Chapter 9 in more detail, but briefly the material is dissolved in a suitable solvent, dispensed on a wafer and spun at high speed (e.g., 1000–5000 rpm), see Figure 9.3. Polymeric films

can replace inorganic films, especially when thick films are needed. Thicknesses up to 1000 μm can be made by spin coating; inorganic films made either by CVD or by PVD cannot usually be thicker than a few micrometers. Spin-coated films fill cavities and recesses because they are liquids during spin coating. This is advantageous for filling gaps and smoothing, but if a uniform thickness over the topography is desired, spinning is not ideal. Room temperature spinning is always accompanied by baking (in the range 100–250 °C).

### 5.7.2 Self-limiting methods

After a fashion resembling ALD, monolayer thick polymer films are made by covalently bonding the molecule to a surface. Self-assembled monolayers (SAMs) are made this way. The molecules have a reactive group at one end and a non-reactive group at the other end. In Figure 5.14 the reactive group is the trichloro group (SiCl<sub>3</sub>) and the polymer chain consists of a chain of 18 carbon atoms (octadecane, or C-18) with methyl group (CH<sub>3</sub>) at the other end. Silicon reacts with hydroxyl groups on the silicon surface, forming strong Si–O bonds, and HCl is released. When all hydroxyl groups on the surface have reacted, no more reactions can take place: there are no more reactive sites available once the surface is covered by one molecular layer.

Another SAM is shown by Equation 5.17, where fluorinated SAM reacts to form a hydrophobic (Teflon-like) layer on the surface:



**Figure 5.14** Self-assembled monolayer of octadecane trichlorosilane (OTS) on a silicon surface

Fluorinated SAMs have applications in microfluidics, and they are used as antisticking layers in imprinting (Figure 1.9): the non-stickiness of the fluorinated surface makes detachment of the molded piece from the mold master easier.

## 5.8 Thin Films Over Topography: Step Coverage

Deposition on a patterned substrate introduces new considerations as film must go over steps. These new considerations include the following: atoms arriving in a line-of-sight manner (as in evaporation) and diffusively (as in CVD) will penetrate into grooves and cavities very differently; if there is energetic bombardment present in the process (as in sputtering and PECVD), it will affect horizontal and vertical surfaces differently. And then there is simple geometry: on a horizontal free surface the angle of arrival of atoms is 180°, in convex corners it is 270°, and in the bottom concave corners it is only 90°, as depicted in Figure 5.15. This leads to pronounced deposition at step corners, termed cusping, and reduced deposition in concave (bottom) corners.

If the arriving atoms have high surface diffusivity (e.g., because of high deposition temperature as in thermal CVD) they can move around and find energetically favorable positions and film thickness will be uniform over steps. If the deposition reaction is surface controlled as in ALD, it will naturally follow the existing geometry.

Step coverage is defined as the ratio of film thickness on the sidewall ( $A$ ) to film thickness on the top horizontal surface ( $H$ ), Equation 5.18a. Bottom coverage can be defined similarly, Equation 5.18b. These are depicted in Figure 5.15. That is,

$$\text{step coverage} = A/H \quad (5.18a)$$

$$\text{bottom coverage} = B/H \quad (5.18b)$$

Thermal CVD processes (including TEOS, HTO, nitride, LPCVD poly and CVD-W) have very good step coverage, as shown in Figure 5.16a. This kind of 100% step coverage is termed conformal. ALD has also 100% step coverage, even in sharp corners, as shown for TiN film in Figure 5.6b. PECVD step coverage is highly process dependent: in Figure 5.16c oxide step coverage is ca. 50% but in 5.16d only 25%. In Figure 5.16e PECVD oxide step coverage of an overhang structure is shown: ca. 30% has been achieved even in the shadow areas. Overhang coverage is possible in flow systems, but



**Figure 5.15** Top, arrival angles of depositing species at different positions; bottom, step coverage,  $A/H$ , and bottom coverage,  $B/H$

beam-like deposition systems sputtering and evaporation cannot cover overhangs. Sputtering step coverage is highly variable, but generally it is in the 20–50% range. Simulated examples are shown in Figures 3.6 and 7.17. Step coverage in evaporation is very poor.

Conformal deposition is no guarantee that film quality on the sidewalls is equal to that of planar areas. In sputtering and PECVD particle bombardment hits the horizontal surfaces, densifying the film, and the sidewalls are much less dense material, and the film on the sidewall will etch much faster than the film on horizontal surfaces.

Step coverage is usually not a major problem for low-aspect-ratio structures, for, say, a hole 1 μm wide, 0.5 μm deep hole, but at 1:1 and higher aspect ratios, step coverage deteriorates rapidly. It is important to remember that on real microdevices there are always structures of various shapes and spacings, and film deposition over all these spaces needs to be considered. It is far too simple to consider one size or depth only. It has to be remembered that aspect ratio is a dynamic variable: a contact hole that



**Figure 5.16** Step coverage: (a) conformal coverage in CVD; (b) conformal coverage in ALD; (c) PECVD TEOS oxide with 50% step coverage; (d) PECVD with 25% coverage; (e) PECVD oxide over an overhang structure. Figures a, c and d from ref. Cote; (b) courtesy Kai-Erik Elers; (e) courtesy Oxford Plasma Technology

has 1:1 aspect ratio initially turns into a 2:1 aspect ratio hole as deposition proceeds, and just before closure the aspect ratio approaches infinity.

Good step coverage in metallization is essential for reliability. Even though metal film will be continuous even with, say, 10% step coverage, current density will increase dramatically at the thinnest point, causing a major reliability problem.

## 5.9 Stresses

Thin films are under either compressive or tensile stresses when deposited on the wafers. Stresses consist of extrinsic stresses, caused by thermal expansion mismatch between the film and the substrate, and of intrinsic stresses which depend on film microstructure and the deposition process. Stress at room temperature is the sum of intrinsic and extrinsic stresses.

Stresses in thin films cause the wafer to curve, as shown in Figure 5.17. Thin film can be imagined as a spring, where tensile stress equals spring elongation. If this elongated spring is attached to a much more massive wafer in its elongated, tensile state, it will still try to return to its original size, and in doing so it will bend the wafer. A

film under tensile stress will result in a concave shape for a wafer + film combination. A compressively stressed film will try to return to its shape, too, and the film + wafer will end up with a convex shape. If tensile stress is too high, the film will crack. And too high compressive stresses lead to buckling.

Figure 5.17 gives a macroscopic depiction of stresses, but the same reasoning works on the atomic level as well: the germanium lattice constant is 4.2% larger than that of silicon, and if germanium film is deposited on silicon, it has to be compressed to fit. Conversely, if silicon film is deposited on germanium, it will be under tensile stress.

Extrinsic stresses can be estimated from thermal expansion coefficient differences by

$$\sigma = \frac{E_f}{(1 - \nu)} (\alpha_f - \alpha_s) \cdot \Delta T \quad (5.19)$$

where the indices f and s refer to film and substrate, respectively, and E is Young's modulus of the film,  $\nu$  the Poisson ratio of the film,  $\alpha$  the coefficient of thermal expansion (CTE) and  $\Delta T$  the temperature difference.

By convention, negative stresses are compressive. In a first approximation the temperature difference is the difference between deposition and measurement



**Figure 5.17** Tensile stresses (left) and compressive stresses (right) in thin films. Imagine a free film stretched or compressed to wafer size and attached to a massive wafer. Next, imagine that the films try to return to their original shape. The wafer will curve as a consequence

temperatures, but the situation is really much more complex because stress relaxation can occur during high-temperature deposition.

The coefficient of thermal expansion of silicon is  $2.6 \times 10^{-6}/^{\circ}\text{C}$  (at room temperature). The only other materials used in microfabrication that have smaller coefficients are silicon dioxide, silicon nitride and diamond, which have CTEs of  $0.5 \times 10^{-6}$ ,  $2.4 \times 10^{-6}$  and  $1.1 \times 10^{-6}/^{\circ}\text{C}$ , respectively. Oxide, nitride and diamond are therefore the only materials that can develop compressive extrinsic stresses over silicon substrates. The CTE of aluminum is  $23 \times 10^{-6}/^{\circ}\text{C}$ , which is fairly high, that of tungsten  $4 \times 10^{-6}/^{\circ}\text{C}$  and polymers have CTE values in the range of  $30\text{--}100 \times 10^{-6}/^{\circ}\text{C}$ .

The bimetal thermometer is a classic example of thermal expansion coefficient mismatch. Bimorph structures can be used as sensors and actuators in microsystems, but the initial shape has to be known. As shown

in Figure 5.18,  $\text{SiO}_2/\text{Al}$  and  $\text{SiO}_2/\text{Ti}$  cantilevers are bent because of stresses in the structures, without external sensing or actuation force. In a single material cantilever (e.g., CVD polysilicon) stress gradients can lead to similar bending.

Intrinsic stresses are caused by many mechanisms which are not fully understood. Deposited polycrystalline films are not at their energy minimum. Especially, low deposition temperature means that the arriving atoms do not have enough energy to find energetically favorable positions, and the film builds up without relaxation. Voids and incorporated foreign atoms contribute to intrinsic stresses. Bombardment during deposition has a pronounced effect on many film properties, including stresses, because the bombardment pinches off loosely bound atoms, resulting in a more uniform, less stressed film. Too high a bombardment, on the other hand, implants atoms into the film in a non-equilibrium way, and compressive stresses build up. Crystallization and phase transitions, and other processes which lead to volume changes, such as outgassing, lead to stress changes.

Evaporated metal films are usually under tensile stresses (100 MPa to 1 GPa). Sputtered films can be under tensile or compressive stresses. Sputtering, with ion bombardment during deposition, is a much more complex process than evaporation, and stress tailoring can be achieved by:

- bias power
- argon pressure
- sputtering gas mass
- temperature
- deposition rate.

Sputtered film stress can be tailored by deposition pressure: films are usually under compressive stress if



**Figure 5.18** (a) compressive stress in  $\text{SiO}_2/\text{Al}$  cantilevers causes downward bending; (b) tensile stress in  $\text{SiO}_2/\text{Ti}$  cantilevers leads to upward bending. Reproduced from Fang and Lo (2000) by permission of Elsevier

deposited at low pressure (about 1 mtorr in a magnetron sputtering system) but turn to tensile stress as deposition pressure is raised (to some 10 mtorr). This crossover pressure increases with atomic mass. However, this is not a universal solution, because pressure affects not only film stress but many other properties, like deposition rate and film density.

### 5.9.1 Stress measurement

Thin film stresses are usually measured by methods based on wafer curvature. Optical techniques or stylus profilometers are used for curvature measurement. Because silicon wafers have curvature (e.g., 30 µm), it needs to be eliminated from the results. Therefore, either of two approaches are used: measure wafer curvature, deposit film, measure composite curvature, and deduct silicon wafer curvature; or else measure composite curvature, etch thin film away and measure wafer curvature.

Film stress is given by the Stoney formula:

$$\sigma = \frac{E_s}{6(1-\nu)} \frac{t_s^2}{t_f} \left( \frac{1}{R} - \frac{1}{R_0} \right) \quad (5.20)$$

where  $t_s$  is the substrate thickness,  $\nu$  the Poisson ratio of the substrate (0.27 for silicon),  $t_f$  the film thickness,  $R$  the radius of curvature for the substrate+film system (negative when convex) and  $R_0$  the radius of curvature for the substrate without film (infinite for a flat wafer).

Wafers are about 1000 times thicker than films, and because all solids have similar elastic constants, wafer stresses and strains are about 1000 times less than those of thin films. Thin-film stresses are on the order of 10–1000 MPa (1000 MPa =  $10^{10}$  dyn/cm<sup>2</sup>).

## 5.10 Metallic Thin Films

Metallic thin films have various application in microfabricated devices.

- **Conductors:** Resistivity is the main consideration: aluminum and copper are main choices for most applications, and gold is often used in RF devices, like inductor coils, to minimize resistive losses. Doped silicon and polycrystalline silicon can be used as conductors, but their resistivity is very high compared with metals.
- **Contacts to semiconductors:** Ohmic (metal-like) and Schottky (diode-like) contacts are possible. Aluminum, itself p-type dopant in silicon, makes good ohmic contact to p-type silicon. Platinum silicide is one candidate for silicon Schottky contacts
- **Capacitor electrodes:** Capacitor electrodes need not be highly conductive. The most important capacitor electrode, the MOSFET gate, is chosen to be polycrystalline silicon because its interface with silicon dioxide is stable, and its lithography and etching properties are good.
- **Plug fills:** When vertical holes need to be filled with a conducting material, CVD tungsten and electrodeposition of copper are employed. Because distances are short, it is rather step coverage than resistivity which determines the choices.
- **Resistors:** Doped semiconductors, metals, metal compounds and alloys can be used as resistors. Heating resistors can be made of almost any material, but precision resistors are difficult to make.
- **Adhesion layers:** Noble metals like gold and platinum do not adhere well to substrates, and therefore thin (10–20 nm thick) “glue” layers of titanium or chromium are needed.
- **Barriers:** Barriers are needed to prevent unwanted reactions between thin films or diffusion of unwanted atoms. Amorphous metal alloys and compounds like tungsten nitride W:N, titanium–tungsten TiW, TiN and TaN are used as barriers between metals and silicon.
- **Mechanical materials:** Aluminum, nickel and TiAl alloys are materials for micromechanical free-standing beams and cantilevers, in e.g. micromirrors and resonators. Films like TiN can be used as mechanical stiffening layers to prevent mechanical changes in the underlying, softer films, like aluminum.
- **Optical materials:** In image sensors metals act as light shields, and chromium is used in photomasks to block light. TiN is often deposited on top of aluminum to reduce reflectivity, because lithography is difficult of highly reflecting surface. Transparent conductors like indium doped tin oxide (ITO;  $In_xSn_yO_2$ ) are needed in displays and light emitting devices.
- **Magnetic materials:** Nickel and nickel alloys, Ni:Fe, are used for magnetic structures in microactuators. Cores of microtransformers are also made of these materials, which are usually deposited by electroplating.
- **Catalysts and chemically active layers:** Chemical sensors, microreactors and fuel cells use films like palladium and platinum as catalysts.
- **Electron emitters:** Vacuum microemitter tips are often made of molybdenum because of its high melting point and low work function.
- **Infrared emitters and other IR components:** Heated wires emit infrared, and porous metallic films, like aluminum black, act as IR absorbers. Metallic meshes act as IR filters and aluminum is used as an IR mirror.

**Table 5.2** Properties of metals

| Metal | Resistivity<br>bulk (ohm-cm) | Resistivity thin<br>film (ohm-cm) | CTE (ppm) | Thermal conductivity<br>(W/cm-K) | Melting<br>point (°C) |
|-------|------------------------------|-----------------------------------|-----------|----------------------------------|-----------------------|
| Al    | 3                            | 3–4                               | 23        | 2.4                              | 650                   |
| Cu    | 1.7                          | 2–4                               | 16        | 4                                | 1083                  |
| Mo    | 5.6                          | 10–20                             | 5         | 1.4                              | 2610                  |
| W     | 5.6                          | 10–100 <sup>a</sup>               | 4.5       | 1.7                              | 3387                  |
| Ta    | 12                           | 20–200 <sup>a</sup>               | 6.5       | 0.6                              | 3000                  |
| Ti    | 48                           | 100–200                           | 8.6       | 0.2                              | 1660                  |
| Co    | 6.2                          | 10–20                             | 12.5      | 0.7                              | 1500                  |
| Ni    | 6.8                          | 10–20                             | 13        | 0.9                              | 1455                  |
| Cr    | 13                           | 20–40                             | 6         | 0.7                              | 1875                  |
| Pt    | 10                           | 20–100                            | 9         | 0.7                              | 1769                  |
| Au    | 1.7                          | 2–3                               | 14        | 3                                | 1064                  |

<sup>a</sup>Tungsten and tantalum can exist in two different phases which have different resistivities; a minor change in sputtering conditions can result in either phase.

- **Sacrificial layers:** Many devices require free-standing structures. These must be fabricated on solid films, which will subsequently be etched away. Copper is often used as a sacrificial material under nickel or gold.
- **Protective coatings:** Sometimes the role of the top-most layer is simply to protect the underlying layers from the ambient: from etching agents or environmental stressors. Nickel and chromium are used as masks for etching.
- **X-ray components:** Masks for X-ray lithography require high atomic mass materials which effectively block X-rays. Tungsten, gold and lead are prime candidates. X-ray mirrors are made by alternating heavy (tungsten, molybdenum) and light materials (carbon or silicon) with layer thicknesses in the nanometer range.
- **Bonding layers:** Gold and tin layers are used in eutectic bonding. Tight, hermetic bonds can be obtained at fairly low temperatures when eutectic alloys are formed.
- **Bonding pads:** Wires have to be attached to chips, and this is best achieved with soft metals like aluminum and gold, while hard metals like tungsten or chromium are unsuitable for wire bonding, and also difficult to probe by probe needles.

Deposition process greatly influences the choice of metals. Not all materials are amenable to all deposition methods, and the resulting film properties (resistivity, phase, texture, adhesion, stress, surface morphology) are closely connected with the details of the deposition process, and may well be idiosyncratic with the equipment. Reproducing results that have been obtained with another piece of equipment can be a nightmare.

### 5.10.1 Properties of metallic thin films

Low resistivity is required of metals in thin-film form. Thin-film resistivity is usually much higher than bulk resistivity. Aluminum, copper and gold thin-film resistivities are close to bulk values; for most others thin films, resistivities are factor of two higher. Important microfabrication metals are listed in Table 5.2. Resistivities are strongly deposition process dependent, as was shown in Table 5.1, and the tabulated values should be used as guidelines with every deposition process being characterized individually. It should also be borne in mind that thermal conductivity similarly depends on the details of deposition process and film thickness.

Alloys and compounds TiW, TiN<sub>x</sub> and TaN<sub>x</sub> have resistivities that are even more strongly deposition process dependent than simple metals, and the exact composition will also have a profound effect. Resistivities of TiW, TiN<sub>x</sub> and TaN<sub>x</sub> are usually in the range of 100–500 μohm-cm.

Young's moduli are the same order of magnitude for all metals, from 100 GPa for soft metals to 600 GPa for refractory metals. Many metal properties are related to the melting point. High melting point equals high bond strength, and a stable atomic arrangement in solids. This translates to a high current density tolerance.

## 5.11 Polysilicon

Polysilicon (polycrystalline silicon) is deposited in a low-pressure CVD process (LPCVD, at around 1 torr pressure) by the silane decomposition reaction according to Equation 5.1. A TEM micrograph of polycrystalline silicon on oxide is shown in Figure 2.3. Deposition at 630 °C

leads to polycrystalline films with a grain size of about 100 nm. Between 580 and 600 °C grain size decreases and deposition at about 570 °C results in an amorphous film (a-Si). Because deposition rate is governed by the Arrhenius law, the rate drops dramatically, to a few nanometers per minute.

In a CVD process we can add dopant gases to the gas feed, so dopants are incorporated in the deposited film. Diborane,  $\text{B}_2\text{H}_6$ , dopes the film p-type and  $\text{POCl}_3$  results in n-type polysilicon. In situ boron doping increases deposition rate while phosphorus doping reduces it. Both dopants make the process more difficult to control, in terms of uniformity and reproducibility. Quite often poly is deposited undoped, and doped afterward. The doping methods include thermal diffusion (Chapter 14) and ion implantation (Chapter 15), just like the doping of single crystalline silicon. Polysilicon can be oxidized at high temperatures (more on this in Chapter 13). After all, it is silicon.

High doping levels of  $10^{21} \text{ cm}^{-3}$  result in a polysilicon resistivity of about 500  $\mu\text{ohm}\cdot\text{cm}$ , or 50 ohm/square for film thicknesses of 100 nm. Polysilicon can be used to make conductive wiring when temperatures are too high for metal wires. Obviously the resistance of polysilicon is much higher than that of metallic conductors, but for short wires it is acceptable. Electron mobility in polysilicon is an order of magnitude less than in single crystalline materials,  $10\text{--}50 \text{ cm}^2/\text{V}\cdot\text{s}$ . The polysilicon interface with thermal oxide is well characterized and polysilicon is the “Metal” in MOS transistors. The MOS transistor is a capacitor, and the rather high resistivity of polysilicon is not a major disadvantage.

Poly (625 °C film) and a-Si (570 °C film) differ in their surface smoothness, and many differences remain even when the films are processed further. Amorphous silicon will crystallize when heated above about 600 °C, but then grain size will be different from poly that was polycrystalline to begin with. The resistivity of the originally amorphous film will be lower than that of poly, and stresses will be different (see Figure 25.9).

Polysilicon can be used as a mechanical material just like single crystal silicon. Its mechanical constants are similar to single crystalline material: Young’s modulus is about 160 GPa for both while the yield strength of poly is 2–3 GPa vs. about 7 GPa for single crystalline. The thermal conductivity of polysilicon is 0.2–0.3 W/cm·K vs. 1.57 W/cm·K for single crystal material, and the coefficients of thermal expansions are identical. But CVD deposition offers possibilities for realizing multilayer structures which cannot be made in single crystal material.

### 5.11.1 Amorphous silicon

Amorphous silicon can mean different things to different people. Taken literally, it is just the structure that matters. Amorphous silicon can be made for instance by sputtering. LPCVD amorphous silicon deposited at 570 °C contains minor amounts of hydrogen, but the amount is insignificant for most applications.

In solar cell and flat-panel display manufacturing, amorphous silicon is done by PECVD at low deposition temperature. Lots of hydrogen is incorporated in the films, and thus the proper name is hydrogenated amorphous silicon, a-Si:H. In a-Si:H films the hydrogen content can be up to 30 at. % (and much less in wt %), and they contain considerable amounts of oxygen and carbon impurities ( $10^{19} \text{ cm}^{-3}$ ). PECVD a-Si can be doped in situ like LPCVD silicon, but resistivities are higher than for poly of identical dopant concentration.

## 5.12 Oxide and Nitride Thin Films

Dielectric films have, just like metallic films, a plethora of applications in microdevices. Table 5.3 classifies dielectric film applications into three categories: as structural parts in finished devices, as intermittent layers during wafer processing, and as protective coatings for finished devices. Surprisingly, many films can serve all these roles.

### 5.12.1 Properties of dielectric films

Higher deposition temperature usually leads to denser films which have slower etch and polishing rates, and are less susceptible to moisture absorption. In CVD, and in PECVD in particular, films can have HF etch rates varying enormously depending on particular types of equipment and process conditions (power, flow rate and ratios, temperature). As a rule of thumb, if the thermal  $\text{SiO}_2$  etch rate is 100 nm/min, 300–1000 nm/min is expected for (PE)CVD oxides. Densification annealing of CVD films at high temperature can lower this by a factor of two.

Silicon dioxide and nitride are the most widely used dielectrics in microfabrication, but sometimes polymer films offer exceptional properties. One polymer, BCB, benzo cyclo butadiene, is contrasted with oxide and nitride in Table 5.4.

Films should be free of pinholes, small point-like defects, otherwise they are useless as protective coatings. For PECVD, values less than 0.1 pinholes/ $\text{cm}^2$  are good. If the film is less dense than the bulk, it can be either because of porosity or because of pinholes.

**Table 5.3** Uses of dielectric thin films in microtechnology

| Function                                                           | Examples                                                                                                  |
|--------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------|
| <b>Structural parts of finished devices</b>                        |                                                                                                           |
| Intermetal insulation                                              | SiO <sub>2</sub> , polymers                                                                               |
| Gate oxides in MOS transistors                                     | SiO <sub>2</sub> , HfO <sub>2</sub>                                                                       |
| Capacitor dielectrics                                              | SiO <sub>2</sub> , Si <sub>3</sub> N <sub>4</sub> , Ta <sub>2</sub> O <sub>5</sub> , BaSrTiO <sub>3</sub> |
| Tunnel oxide in EPROMs                                             | SiO <sub>2</sub>                                                                                          |
| Spacers in MOS and bipolar transistors                             | CVD oxide, CVD nitride                                                                                    |
| Ion barriers                                                       | Al <sub>2</sub> O <sub>3</sub> , Si <sub>3</sub> N <sub>4</sub>                                           |
| Gap fill materials                                                 | Oxides, spin-on films                                                                                     |
| Tunnel oxides in Josephson junction devices                        | AlO <sub>x</sub> , NbO <sub>x</sub>                                                                       |
| Dielectric mirrors                                                 | CVD oxide, nitride, polysilicon                                                                           |
| Micromechanical beams and plates                                   | LPCVD nitride                                                                                             |
| Antireflective coatings                                            | PECVD SiN <sub>x</sub> , SiO <sub>2</sub>                                                                 |
| Heat sink for lasers and power devices                             | Diamond                                                                                                   |
| Hydrophobic surfaces                                               | PTFE, PDMS, other polymers                                                                                |
| Microfluidic structures                                            | Polymers, oxide, nitride, diamond                                                                         |
| Microlenses                                                        | Polymers, spin-on glasses                                                                                 |
| <b>Protective coatings against ambient in final devices</b>        |                                                                                                           |
| Passivation layer and metal ion barrier                            | SiO <sub>x</sub> , SiO <sub>x</sub> N <sub>y</sub>                                                        |
| Humidity and scratch protecting barriers                           | PECVD SiN <sub>x</sub> , polyimide                                                                        |
| Tribological coating (wear, friction)                              | Diamond, SiC                                                                                              |
| Corrosion-resistant coatings in harsh environments                 | Ta <sub>2</sub> O <sub>5</sub> , SiC                                                                      |
| <b>Sacrificial and intermittent layers during wafer processing</b> |                                                                                                           |
| Mask for thermal oxidation                                         | Si <sub>3</sub> N <sub>4</sub>                                                                            |
| Diffusion and ion implantation masks                               | SiO <sub>2</sub> , Si <sub>3</sub> N <sub>4</sub>                                                         |
| Dopant evaporation barrier                                         | CVD oxide, SiN <sub>x</sub>                                                                               |
| Etch stop layer                                                    | SiN <sub>x</sub>                                                                                          |
| Etch masks in bulk micromechanics                                  | Oxide, Si <sub>3</sub> N <sub>4</sub>                                                                     |
| Dopant sources                                                     | PSG, BSG                                                                                                  |
| Sacrificial layers in surface micromechanics                       | PSG, resist                                                                                               |
| Release layers                                                     | Teflon, SAMs                                                                                              |

In order to minimize capacitances ( $C = \epsilon A/L$ ,  $\epsilon = \epsilon_r \epsilon_0$ ) between metal layers, it is preferable to use films with a low dielectric constant (known as low- $k$  or low- $\epsilon$  materials). Polymeric materials (e.g., parylene), modified CVD oxides (SiO<sub>x</sub>F<sub>y</sub>, SiO<sub>x</sub>C<sub>y</sub>) and porous spin-on oxides are low- $k$  materials with  $\epsilon_r$  between 2 and 4. The topic of dielectric constant will be discussed in connection with multilevel metallization for ICs in Chapter 28.

Films with a high dielectric constant are required in applications where high capacitance is needed. MOS transistors and DRAMs are capacitors, and in order to make the capacitors smaller, area has been scaled down. To keep capacitance constant, the capacitor dielectric thickness has been scaled down. This approach cannot be continued indefinitely because of tunneling currents through thin oxides. High- $k$  dielectrics for CMOS will be discussed

in Chapter 26. Thin-film dielectrics have a breakdown field in the range of  $10^5$ – $10^7$  V/cm (10–1000 V/ $\mu$ m). This issue is especially important for MOS transistors and flash memories which today have film thicknesses in the sub-10 nm range.

### 5.12.2 Measurements for dielectric films

Thickness and refractive index are basic measurements for lossless dielectric films. Optical methods are accurate, quick, non-contact and suitable for both research and manufacturing control applications. Accuracy of measurement is a fraction of a nanometer for both ellipsometry and reflectometry.

Reflectometry assumes a known index of refraction, but measures real thickness by fitting reflections over a wide wavelength range to the  $d - n_f$  model. Thicknesses

**Table 5.4** Properties of silicon dioxide, silicon nitride and BCB (benzo cyclo butadiene)

|                                           | SiO <sub>2</sub>       | Si <sub>3</sub> N <sub>4</sub> | BCB                 |
|-------------------------------------------|------------------------|--------------------------------|---------------------|
| Resistivity ( $\Omega\text{-cm}$ ), 25 °C | 10 <sup>16</sup>       | 10 <sup>16</sup>               | 10 <sup>19</sup>    |
| Density (g/cm <sup>3</sup> )              | 2.2                    | 2.9-3.1                        | 1.05                |
| Dielectric constant                       | 3.8-3.9                | 6-7                            | 2.65                |
| Dielectric strength (V/cm)                | 12 × 10 <sup>6</sup>   | 10 × 10 <sup>6</sup>           | 3 × 10 <sup>6</sup> |
| Thermal expansion coefficient (ppm/°C)    | 0.5                    | 1.6                            | 52                  |
| Melting point (°C)                        | 1700                   | 1800                           | 350 <sup>a</sup>    |
| Refractive index                          | 1.46                   | 2.00                           | 1.54                |
| Specific heat (J/kg-°C)                   | 1000                   | 700                            | 2180                |
| Young's modulus (GPa)                     | 87                     | 300                            | 2-3                 |
| Yield strength (GPa)                      | 8.4                    | 14                             |                     |
| Stress in film on Si (MPa)                | 200-400 C <sup>b</sup> | 1000 T <sup>b</sup>            | 30 T <sup>b</sup>   |
| Thermal conductivity (W/K-m)              | 1.4                    | 19                             | 0.29                |
| Etch rate in buffered HF (nm/min)         | 100                    | 1                              | 0                   |

<sup>a</sup>Glass transition point.<sup>b</sup>T = tensile, C = compressive.

from 10 nm to 50  $\mu\text{m}$  can be measured, depending on the equipment and algorithm.

Ellipsometry measures thickness and refractive index in a single measurement because both the amplitude and phase of reflected polarized light are measured. In the thin limit ellipsometry is not accurate because optical constants of very thin films, below 10 nm, are not really constants, though precision is very good. For thicker films multiple reflections and interference mean that the solution is periodic and additional information is needed as to which period is in question. See Appendix C for an oxide and nitride color chart which can be used as a quick thickness monitoring method.

Ellipsometry needs a fairly large area for the measurement (e.g., 100 × 100  $\mu\text{m}$ ) while reflectometer spots can be as small as a few micrometers, which enables measurement from the structures themselves, without a dedicated test site. The easiest and quickest way to gauge thickness is from interference colors. The accuracy of this approach is about 10 nm, but the colors repeat at regular intervals, and absolute thickness determination requires additional information.

- sputtering
- evaporation
- CVD and PECVD
- self-limiting vapor phase and liquid phase reactions.

We will discuss polymer thin films briefly here; more information can be found in Chapter 18. Polymer films can offer exceptional properties, like softness, which may be required for low-pressure sensors or sensitive cantilever sensors. Many sensors use polymers as active parts of the devices: for example, capacitive humidity sensors work on the principle that capacitance changes when the polymeric capacitor dielectric absorbs water. Thin-film polymer is paramount for device operation so that humidity rapidly penetrates the whole film. Polymers are also used as structural materials in microsystems. Those structures can be thin or thick, up to millimeters.

Widely used polymer materials in microfabrication include thermally stable aromatic polymers (BCB), epoxies (SU-8) and polyimides (PI). All of these are available as photoresists, too, acting in the negative mode. Non-photoactive polyimides are also widely used. Thermoplastic polymers PMMA, PC and COC are used in embossing/imprinting applications. Parylene is used as a structural material, protective coating and thermal insulator. Various fluoropolymers are used to make hydrophobic surfaces. Perfluorinated films like Teflon have other uses, too, because of their exceptional properties,

## 5.13 Polymer Films

Polymer films can be deposited by a number of methods:

- spin coating
- dip coating

like low water absorption, low friction, extreme chemical tolerance and very good electrical properties at high frequencies.

Polymers are inferior to inorganic films in terms of mechanical strength. Tensile strengths of polymers are in the range of 100–400 MPa, and Young's moduli on the order of 1–10 GPa, compared to 50–500 GPa for inorganic solids and elemental metals. Stresses in polymers are inherently low (<100 MPa) whereas stress minimization in oxides and nitrides is quite a challenge. In addition to normal process-related variation, polymer properties vary from manufacturer to manufacturer, and the listed properties are indicative of some typical values only.

Polymers have thermal limitations: maximum usable temperatures are often in the range of 100–200 °C, but some exceptional polymers tolerate 400 °C. The coefficients of thermal expansion are in the range of 30–100 ppm/°C vs. 1–20 ppm/°C for elemental metal films and simple inorganic compounds, which is a considerable mismatch.

Many of the thin-film deposition methods described in this chapter can be applied to polymer thin films. Evaporation is used to deposit small organic molecules like pentacene ( $C_{14}H_{22}$ ) which will deposit as a conductive thin film and can be used as a channel material in organic electronics (Figure 26.21). The same process conditions and process performance apply for the evaporation as of organics, as for any other material, for example  $5 \times 10^{-7}$  mbar pressure and 0.1 nm/s deposition rate. Polymers cannot be evaporated: they will decompose rather than evaporate.

Parylene is deposited by thermal CVD. Layer thicknesses are similar to other CVD processes in the thin end, but because of polymer softness, stress build-up is less, and layers tens of micrometers thick can be made. These were the traditional parylene applications in microtechnology: thick protecting layers for finished devices. Today, the conformal deposition of thin films, low deposition temperature, basically room temperature, enable novel applications.

Teflon, an insulator, must be sputtered in a RF system (this applies to inorganic insulators as well). Deposition rate tends to be low (e.g., 0.05 nm/s), but high-density, pinhole-free film can then be obtained. Polyimides, polypropylene and polyethylene have also been sputter deposited.

Teflon-like perfluoropolymer films ( $CF_2)_n$  can also be plasma deposited. Fluorine-containing source gases  $CHF_3$  or  $C_4F_8$  (which are readily available because they are used to plasma etch silicon dioxide) are broken down in

plasma, and fluoropolymer is deposited at wafer, at room temperature. FTIR and XPS analyses will reveal the C–F bonds, and also C–H bonds which indicate incomplete fluorination.

## 5.14 Advanced Thin Films

Thin-film deposition is seldom the last process step. The films will be modified intentionally or unintentionally in subsequent process steps. For example, all elevated temperature steps will modify thin films.

So far we have been dealing mostly with single layer films. But processes and structures can be made much more functional and reliable by adopting multilayer films. In IC metallization multiple layers of metal are used for various reasons: titanium improves adhesion, TiN acts as a barrier between materials and prevents reactions, CVD-W is used because it can fill contact holes, etc. Dielectrics are similarly used in double layer structures: passivation is provided by PSG/nitride: phosphorus-doped oxide is a good barrier for sodium ion diffusion, and nitride is an excellent mechanical scratch-protective coating. These are depicted schematically in Figure 5.19 and in the SEM micrograph of Figure 5.20.



**Figure 5.19** Cross-section of multilevel metallization: double layer dielectric ( $SiO_2/SiN_x$ ), triple layer plug fill metallization (Ti/TiN/W) and triple layer top metallization (Ti/TiN/Al)



**Figure 5.20** Contact plug filled by Ti/TiN/CVD-W. Courtesy Brandon Van Leer, FEI Company

## 5.15 Exercises

- What are the resistivities of the tantalum films in Figure 5.1?
- If silane ( $\text{SiH}_4$ ) flow in a single wafer (150 mm) PECVD reactor is 5 sccm ( $\text{cm}^3/\text{min}$ ), what is the theoretical maximum deposition rate of amorphous silicon?
- If the electroplating current density is 100 mA/cm<sup>2</sup> in nickel deposition, what will be the rate?
- Calculate the wafer bow that a thin film 100 nm thick with 100 MPa stress induces on a silicon wafer 675  $\mu\text{m}$  thick and 150 mm in diameter. How about a 100 nm thick film of 500 MPa on a wafer 380  $\mu\text{m}$  thick and 100 mm diameter wafer?
- Draw schematic step coverages for (a) evaporated films and (b) ALD films over the following steps:



- Find out (from the scientific literature) typical deposition rates and film thicknesses for the following processes:
  - evaporation of aluminum
  - sputtering of tungsten
  - CVD of tungsten
  - PECVD a-Si:H

- PECVD of oxide
- electroplated copper
- ALD of aluminum oxide
- reactive sputtering of TiN.

## References and Related Reading

- Briand, D. *et al.* (1999) In situ doping of silicon deposited by LPCVD, *Semicond. Sci. Technol.*, **14**, 173–180.
- Cote, D.R. *et al.* (1999) Plasma-assisted chemical vapor deposition of dielectric thin films for ULSI semiconductor circuits, *IBM J. Res. Dev.*, **43**, (1–2), 5.
- Datta, M. and D. Landolt (2000) Fundamental aspects and applications of electrochemical microfabrication, *Electrochim. Acta*, **45**, 2535–2558.
- Doms, M. *et al.* (2008) Hydrophobic coatings for MEMS applications, *J. Micromech. Microeng.*, 055030.
- Ehrfeld, W. (2003) Electrochemistry and microsystems, *Electrochim. Acta*, **48**, 2857–2868.
- Fang, W. and C.-Y. Lo (2000) On the thermal expansion coefficients of thin films, *Sens. Actuators*, **84**, 310.
- French, P.J. (2002) Polysilicon: a versatile material for microsystems, *Sens. Actuators*, **A99**, 3–12.
- Grigoras, K., V.-M. Airaksinen and S. Franssila (2009) Coating of nanoporous membranes: atomic layer deposition versus sputtering, *J. Nanosci. Nanotechnol.*, **9**, 3763–3770.
- Guckel, H. (1998) High aspect ratio micromachining via deep X-ray lithography, *Proc. IEEE*, **86**, 1586.
- Kamins, T. (1980) Structure and properties of LPCVD polysilicon films, *J. Electrochem. Soc.*, **127**, 686–690.
- Kamins, T. (1998) **Polycrystalline Silicon for Integrated Circuits and Displays**, 2nd edn, Springer.
- Kim, M.-C. *et al.* (2004) Characterization of polymer-like thin films deposited on silicon and glass substrates using PECVD method, *Thin Solid Films*, **447–448**, 592–598.
- Knez, M., K. Nielsch and L. Niinistö (2007) Synthesis and surface engineering of complex nanostructures by atomic layer deposition, *Adv. Mater.*, **19**, 3425–3438.
- Leskelä, M. and M. Ritala (2002) Atomic Layer Deposition (ALD): from precursors to thin film structures, *Thin Solid Films*, **409**, 138.
- Maier-Schneider, D. *et al.* (1996) Elastic properties and microstructure of LPCVD polysilicon films, *J. Micromech. Microeng.*, **6**, 436–446.
- Murarka, S.P. (1993) **Metallization: Theory and Practice for VLSI and ULSI**, Butterworth–Heinemann.
- Ohmi, T. (2001) A new paradigm of silicon technology, *Proc. IEEE*, **89**, 394–412.
- Puurunen, R. (2005) Surface chemistry of atomic layer deposition: a case study for the trimethylaluminum/water process, *J. Appl. Phys.*, **97**, 121301.
- Ritala, M. *et al.* (1999) Perfectly conformal TiN and  $\text{Al}_2\text{O}_3$  film deposited by atomic layer deposition, *Chem. Vapor Depos.*, **5**, 7.
- Ruythooren, W. *et al.* (2001) Electrodeposition for the synthesis of microsystems, *J. Micromech. Microeng.*, **10**, 101.

- Shacham-Diamand, Y. and V.M. Dubin (1997) Copper electroless deposition technology for ultra-large-scale-integration (ULSI) metallization, *Microelectron. Eng.*, **33**, 47.
- Smith, D.L. (1995) **Thin-film Deposition: Principles and Practice**, McGraw-Hill.
- Vehkämäki, M. *et al.* (2001) Atomic layer deposition of SrTiO<sub>3</sub>, *Chem. Vapor Depos.*, **7**, 75.
- Xia, L.-Q. *et al.* (1999) High temperature subatmospheric chemical vapor deposited undoped silicate glass, *J. Electrochem. Soc.*, **146**, 1181–1185.

# 6

## Epitaxy

Epitaxy is a very special case of thin-film deposition: the deposited film will be single crystalline. This can only take place when special conditions are met. The deposited layer registers the crystalline information from the substrate. In order to do so properly, the crystal lattices of the film and the substrate must be identical or closely matching. The simplest case is homoepitaxy: film and substrate are the same material, for example silicon deposition on silicon. Because crystal information is “transmitted” across the substrate/film interface, the surface quality of the starting wafer is of paramount importance. A residual film a few atomic layers thick can prevent epitaxy. Epitaxy reactors are therefore designed with extreme cleanliness in mind, use the highest purity chemicals and are very delicate and expensive pieces of equipment.

Epitaxy is a demanding process and high-quality epitaxial films are difficult to make. Epitaxial deposition can fail partially and result in defective single crystalline material, or it can fail completely and result in polycrystalline or even amorphous film. Whether the defective material is usable for devices depends on the density and location of those defects: if defects are confined to substrate/film interface, and the deposited layer is mostly defect-free, the material may be usable, but this depends on the device operating principle and engineering judgment is needed to decide on acceptable defect levels.

### 6.1 Heteroepitaxy

Epitaxy on dissimilar materials is termed heteroepitaxy, with examples such as AlAs on GaAs, GaN on SiC and SiGe on Si. The lattice constants of various semiconductors are shown in Figure 6.1. The  $\text{Al}_x\text{Ga}_{1-x}\text{As}$  system is favorable because lattice constants of all GaAs and AlAs alloys differ by less than 0.2%, and multiple layers of AlAs/GaAs/AlAs type can be grown easily, with periods

down to atomic layer thickness, equipment limitations allowing. Semiconductor lasers and solar cells can have tens of layers grown epitaxially (Figure 6.2).

Heteroepitaxy of silicon–germanium is an important application. Germanium is a group IV element like silicon, and they have identical lattice structures, so one basic requirement of epitaxy is fulfilled. Their lattice constants are, however, different: that is, silicon 0.543 nm, germanium 0.566 nm. Deposition of silicon with a small concentration of germanium will result in a single crystalline film because the silicon lattice will hold a small number of slightly larger germanium atoms in place (Figure 6.3). Deposition of pure germanium will result in a polycrystalline film because the lattice mismatch is too large to be accommodated.

There exists a critical thickness  $t_c$  (which depends on the  $\text{Si}_x\text{Ge}_{1-x}$  lattice constant and therefore germanium fraction) below which mismatch can be accommodated by elastic deformation, as shown in Figure 6.4. Below the critical thickness the thin epilayer is strained to fit the silicon lattice, and above the critical thickness the lattice relaxes via misfit dislocations, and the film quality may become useless for device applications. In the strained metastable region the epilayer is thermodynamically unstable but kinetically prevented from finding a relaxed state.

It is possible to increase the germanium content gradually, and finally  $\text{Si}_x\text{Ge}_{1-x}$  films with 50% germanium can be deposited epitaxially. SiGe films are under compressive stress, and if a silicon layer is deposited on SiGe, it will be under tensile stress.

There are other applications of heteroepitaxy on silicon: SiC on Si is intensely studied because SiC substrates are expensive, and silicon substrates would be readily available, also in large diameters. Shown in Figure 6.5 is a TEM micrograph of epitaxial oxide  $\text{Y}_2\text{O}_3$  on silicon. As further proof of epitaxial film quality, epitaxial silicon is grown on  $\text{Y}_2\text{O}_3$ . Obviously this has to be the



**Figure 6.1** Lattice constants and band gaps of various semiconductors



**Figure 6.2** Superlattice structure of a quantum well solar cell with 30 periods of GaAs/InGaP. Reproduced from Magnanini *et al.* (2008) by permission of Elsevier



**Figure 6.3** Germanium epitaxy on silicon is impossible because the lattice constants are too different. However, an alloy  $\text{Si}_x\text{Ge}_{1-x}$  can be deposited because the silicon lattice can accommodate some germanium atoms

case: if the two materials have matching lattices and clean surfaces, epitaxy works both ways. Another example of heteroepitaxy is the growth of high-temperature cuprate superconductors on silicon: first single crystalline YSZ, yttria-stabilized zirconia, is grown on single crystalline



**Figure 6.4**  $\text{Si}_x\text{Ge}_{1-x}$  epitaxy on silicon: thicknesses and germanium concentrations refer to 600 °C growth. At higher temperatures critical thickness is smaller



**Figure 6.5** Epitaxial silicon on epitaxial yttrium oxide on silicon, TEM cross-section. Reproduced from Borschel *et al.* (2009), Copyright 2009, American Institute of Physics

silicon, followed by  $\text{CeO}_2$  and finally by  $\text{GdBaCuO}$  superconductor film.

In silicon-on-sapphire (SOS), single crystal silicon is deposited on single crystal sapphire. The lattices of sapphire and silicon are different but if the sapphire crystal is properly oriented the apparent lattice constant is close enough to 0.543 nm of silicon. SOS was the first SOI technology, but it has largely been replaced by other SOI technologies.

## 6.2 Epitaxial Deposition

Silicon epitaxy on silicon enables freedom in doping level and doping-type tailoring. A lightly doped epitaxial p-type layer ( $\rho \sim 10 \text{ ohm}\cdot\text{cm}$ ) can be grown on a heavily p-doped substrate wafer ( $\rho \sim 0.2 \text{ ohm}\cdot\text{cm}$ ). These types of wafers are used for microprocessors and other high-performance logic circuits. An n-silicon epitaxial layer on a p-substrate is used in many micromechanical devices because of electrochemical etch stop (Chapter 20).

In a CZ ingot resistivity depends on the position in the ingot as shown by Equation 4.6. Deposition of the epilayer equalizes wafers in this respect. Epilayer uniformity (both thickness uniformity and doping uniformity) is good, and if very tight resistivity specification is needed, epitaxial wafers override bulk silicon wafers. Another benefit of epitaxy is the absence of oxygen and carbon, which are always present in CZ silicon. About 20% of all starting wafers sold are epiwafers. But epitaxy is also as a part of device processing and this is extensively used in making bipolar transistors, see Chapter 27.

Boron dopant atoms are smaller than silicon, and the resulting boron-doped epilayer will be under tensile stress. Arsenic atoms are larger than silicon, and an arsenic-doped epilayer is under compressive stress. But dopant atom concentrations are fairly small ( $10^{15}\text{--}10^{17} \text{ cm}^{-3}$  vs.  $5 \times 10^{22} \text{ cm}^{-3}$  silicon atom density) and the effect is minor.

Epitaxy depends on crystal information and the energy of arriving atoms: they must have enough energy (surface mobility) to find energetically favorable sites on the surface. All epitaxy processes use elevated temperature, to give atoms surface mobility, and deposition rate also goes up with temperature, but too high a deposition rate is no good: there should be enough time for atoms to find their place on the crystal before the next layer is deposited.

CVD epitaxy of silicon with  $\text{SiH}_{4-x}\text{Cl}_x$  ( $x = 0\text{--}4$ ) source gases has been established since the 1950s. The basic chemical reactions are identical to polysilicon deposition (Equations 5.1 and 5.2). In the compound semiconductor field MOCVD (Metal Organic CVD, also known as MOVPE for Vapor Phase Epitaxy) is common. GaAs is deposited using precursors like  $\text{GaCl}_5$  and  $\text{AsH}_3$ .

Molecular beam epitaxy (MBE) is a variant of evaporation. Instead of an open crucible, the source material is heated in a Knudsen cell. This cell consists of a crucible that is closed except for a small orifice, and atoms can escape from the cell only through the small orifice. The atom beam (in the molecular flow regime, hence the name MBE) emanating from the orifice is much more



**Figure 6.6** Thin heteroepitaxial  $\text{Si}_{1-x}\text{Ge}_x$  layers for high-speed bipolar transistors. The hatched layers are graded epilayers with constantly changing germanium content

stable than a beam evaporating from an open crucible. This stability is tantamount to growing films with atomic layer thickness control.

In solid phase epitaxy (SPE) the solid film registers the crystalline structure from the underlying single crystalline substrate. Amorphous films on single crystalline substrates can thus be converted to single crystalline by annealing. Nucleation is important: a single crystal should be grown from one direction only. If there are many nuclei, the resulting film will be polycrystalline. Of course all the limitations of clean interfaces, matching lattice, etc., still apply. Epitaxy from the liquid phase (LPE) is also possible: both saturated solutions and melts can be used as sources for epitaxial growth. LPE was the dominant technology in the early days of III-V semiconductor laser and LED fabrication, but it has largely been superseded by gas phase and vacuum systems.

The number and thickness of epilayers are practically unlimited: in IGBT power transistors a moderately doped n-layer is grown first, followed by a thicker lightly doped layer. Layer thicknesses in power semiconductors are for example 50  $\mu\text{m}$  (see Figure 2.10). In high-speed bipolar transistors (heterojunction bipolar transistors, HBTs) very thin epitaxial layers are used (in the tens of nanometers range), to engineer the band gap and therefore emitter performance (Figure 6.6).

### 6.3 CVD Homoepitaxy of Silicon

As an example of homoepitaxy, CVD silicon epitaxy is described (epitaxial reactors and growth process details

are presented in Chapter 34). A reactor is heated to about 1200  $^{\circ}\text{C}$  under hydrogen flow, which reduces native oxide, so a clean silicon surface is obtained:



Growth commences when silane gases of type  $\text{SiH}_x\text{Cl}_{4-x}$  ( $x = 0-4$ ) are introduced into the reactor: Equation 6.2 for dichlorosilane (DCS) and Equation 6.3 for silicon tetrachloride. Silane,  $\text{SiH}_4$ , can also be used. Two main differences to poly deposition are temperature and substrate conditioning: higher temperature equals higher deposition rate, higher surface mobility and therefore the possibility to find energy minimum sites on the surface:



Reaction 6.3 is reversible, and cleaning is possible with HCl when the reaction proceeds from right to left (i.e., hydrogen chloride etching of silicon). Excessive etching should be avoided because surface roughness tends to increase in etching. Silicon tetrachloride can also be used as silicon etchant:



This reaction can be prevented when the  $\text{SiCl}_4$  fraction is limited to below 27% (Figure 6.7), and in practice much more dilute gases are used: a typical incoming gas consists of 1% silane and 99% hydrogen.

The  $\text{SiCl}_4$  process temperature is, however, very high and undesirable dopant diffusion takes place during epitaxy. The  $\text{SiH}_4$  reaction is better in this respect, but due to the lower temperature, the deposition rate is lower. Low temperature, and therefore minimal diffusion, are important considerations when sharp interfaces must be made. Trichlorosilane  $\text{SiHCl}_3$  (TCS) and dichlorosilane  $\text{SiH}_2\text{Cl}_2$  (DCS) are good compromises.

Typical epitaxial deposition rates are 1–5  $\mu\text{m}/\text{min}$ . They depend on the silane gas chosen, on temperature and on flows. The deposition rate can be increased by operating at higher temperature, but at very high temperatures polycrystalline, rather than epitaxial, film results. At high temperature the growth rate is so high that there is no time for atoms to arrange themselves into a single crystalline lattice. At low temperatures the film atoms do not have enough energy to diffuse and find suitable lattice sites, and polycrystalline films result.

Perfectly flat surfaces offer no preferred sites for atoms to position themselves, and epi growth is therefore difficult. It can be aided by miscut wafers: instead of slicing



**Figure 6.7** Silicon deposition rate as a function of  $\text{SiCl}_4/\text{H}_2$  flow ratio. Above about  $2\text{--}3 \mu\text{m}/\text{min}$  the resulting film is polycrystalline, not epitaxial. For  $\text{SiCl}_4/\text{H}_2$  (1%/99%) the typical growth condition is  $1 \mu\text{m}/\text{min}$ . Reproduced from Theurer (1961) by permission of Electrochemical Society Inc.

the ingot perfectly, a  $3^\circ$  miscut for example is used (typical for  $<111>$  material specifications), see Figure 4.10. Atomic steps so created act as nucleation sites for epitaxy. Arriving atoms can form more bonds at kinks, and therefore they are more stably bound than atoms on flat areas. Newly arrived atoms therefore arrange themselves regularly according to terraces determined by the crystal structure, while on planar areas various island structures can be formed randomly (Figure 6.8).



**Figure 6.8** Terrace step kink (TSK) growth model of epitaxy: growth proceeds at kinks, and adatoms on flat surface diffuse to energetically favorable positions at kinks. Wafer miscut creates terraced structure. Reproduced from Jenkins (1995)

The term epi-poly is sometimes used in micromechanics. It is an oxymoron: epitaxial films are single crystalline, and poly means polycrystalline. What is meant is that a CVD epireactor has been used to deposit a thick layer of silicon, using epi growth conditions which enable deposition rates  $1\text{--}5 \mu\text{m}/\text{min}$  versus  $10 \text{ nm}/\text{min}$  for LPCVD poly at  $625^\circ\text{C}$ . But the underlying film is amorphous  $\text{SiO}_2$ , resulting in polycrystalline film. Typical epi-poly thicknesses are  $10\text{--}50 \mu\text{m}$ , compared to  $0.1\text{--}2 \mu\text{m}$  typical of LPCVD polysilicon, which is used as a CMOS gate and as a structural layer in surface MEMS. Thick poly is a popular material in micromechanics, combining some of the best sides of both polysilicon thin films and thick silicon layers.

## 6.4 Doping of Epilayers

Epilayer doping level and dopant type can be chosen independently of the substrate. Gaseous dopants,  $\text{PH}_3$ ,  $\text{B}_2\text{H}_6$ ,  $\text{AsH}_3$ , are added to source gas flow, enabling doping during epitaxy. Dopant concentration in the epitaxial film can be varied over seven orders of magnitude ( $10^{13}\text{--}10^{20} \text{ cm}^{-3}$ ). In many applications several epilayers with different doping levels and/or types are grown sequentially, or graded structures where the composition or doping level changes in minor steps, for example from Si to  $\text{Si}_{0.7}\text{Ge}_{0.3}$  in tens of increments of germanium concentration.

Epitaxial deposition need not be the first process step: doped silicon is also single crystalline silicon and epitaxy on it works just as well. In bipolar transistor fabrication, buried layer formation by diffusion is the first step (see Figure 3.2), followed by epitaxial deposition of a lightly doped epilayer on top of a heavily doped buried layer. Base and emitter diffusions will then be done in this lightly doped epitaxial layer. Further discussion of epitaxy on structured wafers can be found in Chapter 27 on bipolar technology.

Because of the high temperatures involved, dopant diffusion will inevitably take place during epitaxy. If the epilayer doping level is lower than that of the substrate, the epilayer will be doped from the substrate through two different mechanisms: (1) solid state diffusion across the substrate/epilayer interface; and 2) dopant atom outdiffusion from the substrate into the gas stream and subsequent vapor phase doping, known as autodoping (Figure 6.9). Autodoping depends on the volatility of dopants, with antimony (Sb) giving the least (the lowest vapor pressure, with arsenic and boron somewhat higher, and phosphorus the highest). Autodoping comes both from substrate itself and from any doped regions that have been made in steps preceding epitaxy.



**Figure 6.9** Autodoping: in substrate autodoping, dopants evaporated from heavily doped substrate add to intentionally added dopant; in lateral autodoping, dopants from heavily doped regions lead to local doping variation



**Figure 6.10** Transition width at substrate/epilayer interface; lightly doped epitaxial layer on heavily doped substrate

The abruptness of the doping at the substrate/epilayer interface depends on both deposition temperature and time, as well as dopant type. The transition width can be taken as the distance where doping concentration drops from a 90% value to a 10% value (Figure 6.10). Depending on whether the substrate or the epilayer is more strongly doped, the transition can take place either inside the epilayer or inside the substrate.

## 6.5 Measurement of Epitaxial Deposition

At least three measurements must be carried out on epitaxial wafers: thickness, resistivity and surface quality. Surface quality is assessed first and foremost by optical inspection: pyramids, mounds and hillocks scatter light which can be detected by optical methods. A Nomarski interference contrast microscope can detect surface height differences and IR depolarization reveals stresses. Laser

scattering measures particles and microroughness. Optical methods are fast, and 100% of wafers are inspected.

The thickness of epilayers can be measured by Fourier transform infrared (FTIR) spectroscopy: constructive and destructive interference from reflections at the surface and at the substrate/epilayer interface are detected. FTIR spectroscopy requires, however, a highly doped substrate (resistivity below 0.025 ohm-cm). On resistive substrates, spreading resistance profiling (SRP) is used. SRP requires sample beveling: that is, it is sample destructive – 1 wafer in 25 or 1 in 100 is measured by SRP. Transition width measurement can be done with SIMS but this is a time-consuming measurement and is done for example once for 1000 wafers.

SRP of course measures resistivity also, but simpler and faster methods are used for routine measurements. Resistivity is measured by the mercury probe capacitance–voltage method (Hg-CV method) for p/p and n/n structures and by the four-point probe method for n/p and p/n structures. In both methods a metal contact is made on silicon, even though liquid mercury drop contact is much more benign than tungsten needle contact of 4PP. Wafers are not usable after metal probes. Non-contact measurements would be much better, but most are rather cumbersome and require special conditions to be fulfilled.

## 6.6 Simulation of Epitaxy

The epitaxy simulators currently used in process integration studies are not physically based. A true physical simulator would use temperature, flow rate and surface reaction rate constants as inputs, and it would reproduce growth rate and dopant distribution as the outputs. Instead, epitaxy simulators are really hybrids between film deposition and diffusion simulators: deposition rate and temperature are given, and dopant profile is calculated from diffusion constants at the relevant temperature.

The input for epitaxy simulator requires:

- dopant type of wafer
- deposition rate and time
- deposition temperature
- dopant type and concentration in the flow.

Such a semiempirical simulator can predict a dopant profile across the substrate/epilayer interface, taking into account both outdiffusion from the substrate and diffusion from the epilayer into the substrate.

Some estimates of gas phase dopant concentration and the resulting epilayer doping are given below:

| Dopant in gas phase | Dopant in epitaxial film  |
|---------------------|---------------------------|
| $10^{-10}$ bar      | $10^{15} \text{ cm}^{-3}$ |
| $10^{-8}$ bar       | $10^{17} \text{ cm}^{-3}$ |
| $10^{-6}$ bar       | $10^{19} \text{ cm}^{-3}$ |

Dopant partial pressures are very small indeed because epitaxy is carried out at atmospheric pressure. Extreme dilution in hydrogen is used to incorporate such dilute dopant flows. Note that phosphorus and boron incorporation into growing silicon is very strong: its concentration in film is much higher than its gas phase concentration. Arsenic incorporated into epitaxial film is even more pronounced.

A 1D simulation of epitaxy is shown in Figure 6.11. Epitaxial films nominally 4 μm thick are deposited. A heavily doped epilayer dopes the substrate, and the electrical junction is deeper than the nominal thickness of the epilayer. When the epilayer is lightly doped and

the substrate heavily doped, the opposite is true. Note that the same deposition rate, 0.2 μm/min, has been used for all temperatures. This is a limitation in epitaxy simulation: rates are really temperature dependent, but they have to be given manually; they do not follow from first principles.

## 6.7 Advanced Epitaxy

If there are both oxide and single crystal silicon areas on the wafer, deposition will be epitaxial on silicon and polycrystalline on oxide. In between there is a region with ill-defined material: both its crystallinity and thickness are difficult to predict. In selective epitaxial growth (SEG) film is deposited only in those areas where single crystal silicon is present (Figure 6.12). SEG is a complex process and not fully understood. One process uses SiH<sub>2</sub>Cl<sub>2</sub>, HCl and H<sub>2</sub> as source gases. Chlorine etches nuclei on oxide faster than on silicon, and deposition is preferentially on silicon. Other factors like temperature, pressure,



**Figure 6.11** ICECREM simulation of epitaxial interface sharpness: three different growth temperatures (1050, 1100, 1150 °C) have been used to grow an epitaxial layer nominally 4 μm thick: (a) heavily doped epilayer on lightly doped substrate; (b) lightly doped epilayer on heavily doped substrate



**Figure 6.12** SEG, selective epitaxial growth, no deposition on oxide; ELO, epitaxial lateral overgrowth; merging of epitaxial film fronts over oxide

flow rates, partial pressures and the chemical nature of the dielectric (oxide, nitride) play their role in SEG. Selective epitaxy can be done many times over, as long as there is a high-quality single crystalline seed area available.

Epitaxy requires crystal orientation information from the substrate, but once this information is registered, epitaxial growth can continue over amorphous or polycrystalline material. The epitaxial lateral overgrowth (ELO) technique incorporates patterned seed areas, oxide isolation and lateral overgrowth. One of the main problems in ELO is the point where the two growth fronts merge: defect density can be very high.

## 6.8 Exercises

1. Can a laboratory scale with 0.1 mg resolution be used for epilayer thickness measurements?
2. What are the resistivities of the substrates and epilayers in Figure 6.12?
3. Growth rates as a function of temperature are given below for SiH<sub>4</sub> epitaxy. Plot rate as a function of inverse temperature and check if there are two growth regimes!

|      | 700  | 750  | 800        | 850 | 900 |
|------|------|------|------------|-----|-----|
| 0.04 | 0.09 | 0.2  | 0.4        | 0.5 |     |
| 950  | 1000 | 1050 | 1100 °C    |     |     |
| 0.6  | 0.7  | 0.75 | 0.8 μm/min |     |     |

4. In Figure 6.7 the deposition rate is negative at high SiCl<sub>4</sub> fractions. What does this mean?
5. How abrupt (in nanometers) are transition widths in thick homoepitaxial silicon cases? What about SiGe epitaxy?
6. If a 200 mm wafer bows 20 μm due to an epitaxial layer 5 μm thick, what is the stress in the epilayer?

## Simulator exercises:

7. For an n+/n- structure (substrate 10<sup>18</sup> cm<sup>-3</sup>, epilayer 10<sup>15</sup> cm<sup>-3</sup>) calculate the transition width as a function of epitaxy temperature for an epilayer 4 μm thick.
8. A starting wafer doping level is 10<sup>15</sup> cm<sup>-3</sup> phosphorus. The epilayer is doped with boron doped at 10<sup>17</sup> cm<sup>-3</sup> concentration. Calculate the junction depth as a function of growth temperature.
9. If pnp-bipolar transistors are made, the buried layer has to be p-type. Calculate the boron updissusion for different epitaxy conditions when the buried layer doping is 10<sup>18</sup> cm<sup>-3</sup> and the epilayer doping is 10<sup>15</sup> cm<sup>-3</sup>.

## References and Related Reading

- Baliga, J.B. (1986) **Epitaxial Silicon Technology**, Academic Press.
- Beaucarne, G. *et al.* (2006) Epitaxial thin-film Si solar cells, *Thin Solid Films*, **511–512**, 533–542.
- Borschel, C. *et al.* (2009) Structure and defects of epitaxial Si (111) layers on Y<sub>2</sub>O<sub>3</sub> (111)/Si(111) support systems, *J. Vac. Sci. Technol.*, **B27**, 305–309.
- Crippa, D., D.R. Rode and M. Masi (2001) Silicon epitaxy, in **Semiconductors and semimetals**, Vol. 72, Academic Press.
- Herman, M.A., W. Ricter and H. Sitter (2004) **Epitaxy**, Springer.
- Jenkins, T. (1995) **Semiconductor Science**, Prentice Hall.
- Magnanini, R. *et al.* (2008) Investigation of GaAs/InGaP superlattices for quantum well solar cells, *Thin Solid Films*, **516**, 6734–6738.
- Meyerson, B.S. (1992) UHV/CVD growth of Si and Si:Ge alloys: chemistry, physics, and device applications, *Proc. IEEE*, **80**, 1592.
- Theurer, H. (1961) Epitaxial silicon films by the hydrogen reduction of SiCl<sub>4</sub>, *J. Electrochem. Soc.*, **108**, 649.
- Wu, Y.H. *et al.* (1999) The effect of native oxide on epitaxial SiGe from deposited amorphous Ge on Si, *Appl. Phys. Lett.*, **74**, 528.

# Advanced Thin Films

In this chapter we will discuss thin-film deposition processes in more detail, and explore the relationships between process parameters and the resulting film properties. Process pressure and temperature, and ion bombardment during deposition, play their roles in resulting film qualities: grain size and orientation, or amorphousness; interface sharpness; volume homogeneity and surface smoothness/roughness. We will encounter a wide variety of cases involving double layer, multilayer, alloyed, reacted and other thin films. Some of these are stable, some are reacted intentionally, and others exhibit unwanted interfacial processes. These complex thin-film stacks have applications in acoustics, magnetics, optics, electronics and practically every field of microtechnology.

## 7.1 General Features of Thin-Film Processes

Thin-film deposition involves thermal physics, fluid dynamics, plasma physics, gas phase chemistry, surface chemistry, solid state physics and materials science. We must deal with source materials (sputtering targets, precursor gases, electrolytes), we must address the transport of source material to the substrate (in high vacuum, low vacuum, at atmospheric pressure or in liquid), and we have to understand surface processes (adsorption, reaction, desorption, ion bombardment-induced effects). Characterization of films entails dozens of techniques ranging from optical to nuclear, electrical to mechanical. This multidisciplinarity leads to a great number of phenomena and models which must be taken into account, in both experimental work and simulation.

There are a few basic methods of source excitation and tens of different configurations of these. Thermal activation can be resistive, and electron, ion and laser beams can be used, too. Plasma sources range from simple DC diodes

to microwave, helical and inductive configurations. In the liquid phase the choices are less numerous, and electrochemical and chemical potential differences are the main driving forces.

Film deposition on the substrate surface is a sum of many factors. In the first approximation, deposition is independent of substrate. This distinguishes deposition from growth processes, like thermal oxidation, which is intimately coupled with the substrate. But surfaces do interact with deposition processes via available chemical bonds, contamination and crystallography. An important parameter is the sticking coefficient, or the probability that an impinging particle will remain on the surface. A high sticking coefficient means that the particle will come to rest at the point of impingement, and a low sticking coefficient means that only energetically favorable attached species will stick, and others will desorb. Sticking coefficients range from  $10^{-5}$ , typical of CVD, to approaching unity, in evaporation.

Even if no annealing is done immediately after film deposition, films will experience thermal treatments during subsequent processing. Thermal loads from these treatments can be considerable, and they affect many film properties, like grain size, resistivity and stress. Film surfaces and interfaces will be modified during these annealing steps by diffusion, dissolution or chemical reactions.

## 7.2 Film Growth and Structure

Atoms impinging on a surface attach to the surface either with chemical bonds ( $\approx 1 \text{ eV}$ , chemisorption) or by short range van der Waals forces ( $\approx 0.3\text{--}0.4 \text{ eV}$ , physisorption). Chemically bonded atoms tend to stay fixed on the surface, with only a small chance of desorption or diffusion. Physisorbed atoms are only loosely bound. This is essential in ALD: a monolayer is chemisorbed

but the temperature is selected to be so high that any additional physisorbed layer would not adhere.

Thin-film deposition is about phase transition from the vapor phase to solid phase. Atoms condense on a substrate. These adatoms (adsorbed atoms) are subject to desorption and surface diffusion. Some adatoms bond to each other, reducing the desorption probability. More atoms aggregate and some of the bigger clusters avoid desorption. These processes are depicted in Figure 7.1. These small nuclei are still mobile, and they grow by merging with other nuclei, but they can also incorporate atoms from the vapor phase. Nuclei grow in size to become islands, but remain separate, and more nuclei can form between the islands. Coalescence is driven by surface energy and surface area minimization. Island merging creates a continuous film. For sputtered and evaporated metal films this happens at about 10–20 nm film thickness. Films thinner than this are optically transparent but they can form an electrically conductive path (percolation). Such films have applications as permeable electrodes in gas sensors and as top metals in optical devices.

There are many modes of film growth. Island growth is also known as three-dimensional growth. It is common in metal deposition on insulators, where bonds between film atoms are stronger than bonds between film atoms and the substrate. Two-dimensional growth, also called layer-by-layer growth, takes place when the arriving film atoms bond more strongly with the substrate than with each other. It is possible to deposit a single atomic layer of, for example, nickel on mica. A third mode, called the Stranski–Krastanov mode, is a mixture of 2D and 3D modes. A fourth mode is columnar growth. These are pictured in Figure 7.2. When adatom mobility is close to zero, as in the room temperature deposition of tungsten, atoms stick where they land and grains grow upward. Columnar growth looks similar to island growth initially, but the difference is in coalescence: in the island mode there is enough energy for the film atoms to find energetically favorable sites (due to the high temperature in



**Figure 7.1** Surface processes in thin-film growth



**Figure 7.2** Thin-film growth modes: left, 2D (layer-by-layer) growth; middle, 3D (island) growth; right, columnar growth with voids between the grains

epitaxy and CVD, or due to ion bombardment in plasma deposition processes) and the islands will eventually coalesce, resulting in a uniform film. In columnar growth grains continue to grow upward, and voids between the grains remain. The surface will be rough, because grain height is determined by random nucleation.

If we measure the early stages of thin-film growth by surface-sensitive techniques, for example Auger electron spectroscopy or XPS (which probe the topmost nanometer only), we can distinguish 2D and 3D growth: in 2D growth mode the signal from the substrate quickly dies out because the whole surface becomes covered by the deposited layer. In 3D mode the substrate signal slowly decreases as the proportion of open substrate area is diminished.

Zone models explain the basic features of PVD film structure. The first question is: will the film be amorphous or polycrystalline? Silicon and other covalently bonding materials often end up amorphous. Many compounds (like TiN and Al<sub>2</sub>O<sub>3</sub>) and metal alloys (TiW, SiCr) with dissimilar-sized atoms similarly result in amorphous films. Elemental metal deposition usually results in polycrystalline films, including all the standard thin films in microtechnology (Al, Au, Cu, W, Ti, Cr, Pt).

The crystallinity of sputtered films is determined by a complex interaction between the substrate (its surface chemistry, surface structure and temperature) and the deposited film. In the zone model pressure and temperature are the main variables to explain film microstructure (temperatures are normalized to melting point temperatures,  $T/T_m$ , in K), see Figure 7.3. Zone 1 is small grained and porous, zone 2 has larger columnar grains and zone 3 exhibits still larger grains. The intermediate region is termed zone T (for transition).

Zone 1 is the region where the low momentum of impinging species is combined with slow chemical processes



**Figure 7.3** Zone model of sputtered thin-film microstructure. Reproduced from Thornton (1986) by permission of American Institute of Physics

due to low temperature: film atoms come to rest almost immediately, leading to columnar grains. Such a structure is under moderate tensile stress. The voids between the grains are nanometer sized, which leads to measurable density reduction and poor stability because atmospheric water and oxygen will be absorbed in the voids. Oxygen impurities in the deposition chamber can change the intrinsic stress from tensile to compressive and complicate the simple model described above.

At lower pressure ion bombardment becomes more important (fewer ion collisions and therefore more energetic bombardment). Ions densify the film by knocking out loosely bound atoms. A further increase in ion bombardment (at lower pressure or higher sputtering power) leads to the disappearance of voids and conversion to compressive stress. Higher temperature leads to enhanced surface diffusion, which will result in denser films as atoms can migrate and find energetically favorable sites to attach to.

Zone 2 occurs at  $T/T_m > 0.3$ , so surface diffusion is significant. Grains grow larger, and defects are eliminated. Zone 3 occurs at  $T/T_m > 0.5$ , where diffusion processes are very fast. Elimination of voids enhances diffusion. Films are annealed during deposition. Grains are more isotropic and films “lose memory” of the deposition process details.

Sputtered aluminum grain size is about  $0.5 \mu\text{m}$ , similar to the typical film thickness. In  $3 \mu\text{m}$  lines there are always many grains across the line, but in  $0.5 \mu\text{m}$  lines the situation changes dramatically: there are practically no three-grain boundaries and the grains are end to end, known as the bamboo structure. All processes

which depend on grain boundaries, like diffusion and electromigration, are strongly affected.

Film structure can change not only continuously as described above, but also abruptly. Tantalum films sputtered under different conditions can end up in either the body centered cubic (BCC) structure or as tetragonal  $\beta$ -Ta. The resistivity of BCC-Ta is about  $20 \mu\text{ohm}\cdot\text{cm}$  with a temperature coefficient of resistivity (TCR) of  $3800 \text{ ppm}/^\circ\text{C}$ . Values for  $\beta$ -Ta are about  $160 \mu\text{ohm}\cdot\text{cm}$  and  $178 \text{ ppm}/^\circ\text{C}$ , respectively (see Figure 5.1 for another tantalum deposition experiment). Phase transitions will also take place during annealing, for example  $\text{TiSi}_2$  has several phases, see Figure 7.16.

Grains in polycrystalline films can have any crystal orientation, but in practice films are often strongly textured: the distribution of grain orientations is along one or two main crystal planes. For example, aluminum films have usually (111) texture, that is (111) planes parallel to the wafer surface. For undoped LPCVD polysilicon, (110) orientation crystals dominate, but for in situ phosphorus-doped poly, (311) is the dominant orientation. Grain structure of LPCVD polycrystalline silicon is shown in Figure 7.4. Because grains of different orientation grow at slightly different rates, the surfaces of polycrystalline films are generally rough.

Texture is established during deposition, and it is not greatly affected by subsequent annealing steps below  $(2/3)T_m$ , even though grain size increases. Texture inheritance is common: subsequent films easily acquire the same texture as the underlying film. Thin seed layers can therefore be used to modify thick layers. In hard disks



**Figure 7.4** SEM of polycrystalline silicon film  $6 \mu\text{m}$  thick deposited by LPCVD on a  $1 \mu\text{m}$  oxide layer. Courtesy Lauri Sainiemi, Aalto University



**Figure 7.5** Hard disk structure: the Ta/Ru intermediate layer is used to induce crystal orientation in the CoCrPt:SiO<sub>2</sub> recording layer. Adapted from Piramanayagan (2007)

the Ta/Ru intermediate layer is deposited to induce [0002] orientation growth of the magnetic CoCrPt:SiO<sub>2</sub> recording layer, Figure 7.5.

### 7.3 Thin-Film Structure Characterization

PVD films, especially sputter-deposited films, can be modified by a number of deposition process parameters. System configuration and geometry come into play via target–substrate distance, base pressure, gas phase impurities, bias voltage. Pressure and power affect the momentum of impinging atoms and ions, and substrate temperature is important for desorption, diffusion and reactions.

Collimated sputtering is a technique where a mechanical grid is placed between the anode and cathode, and off-angle atoms do not contribute to the flux arriving at the wafer, but are deposited on the collimator walls. Collimated sputtering is good for bottom coverage (Figure 5.15). In Table 7.1 a collimated system is

**Table 7.1** Sputtered titanium nitride (TiN) film characterization: collimated vs. standard

| Film property                              | Analytical technique                                          | Collimated TiN                             | Standard TiN                           |
|--------------------------------------------|---------------------------------------------------------------|--------------------------------------------|----------------------------------------|
| Thickness                                  | RBS (density = 4.94 g/cm <sup>-3</sup> )<br>TEM cross-section | 81 nm<br>82 nm                             | 161 nm<br>178 nm                       |
| Sheet resistance                           | Four-point probe                                              | 13.7 ohm/sq                                | 7.4 ohm/sq                             |
| $R_s$ uniformity                           | Four-point probe                                              | 3.3%                                       | 5%                                     |
| Resistivity                                | $R_s$ by four-point probe<br>Thickness by TEM                 | 112 $\mu$ ohm-cm                           | 132 $\mu$ ohm-cm                       |
| Density                                    | Thickness by TEM and RBS<br>Density by RBS                    | 4.88 g/cm <sup>-3</sup><br>93% of bulk     | 4.47 g/cm <sup>-3</sup><br>86% of bulk |
| Stoichiometry (Ti/N)                       | RBS                                                           | 1.31                                       | 1.00                                   |
| Phase                                      | Glancing angle XRD                                            | TiN (38–1420)                              | TiN (38–1420)                          |
| (JCPDS card #)                             | Electron diffraction                                          | TiN (38–1420)                              | TiN (38–1420)                          |
| Preferred orientation                      | $\theta$ -2 $\theta$ XRD<br>Electron diffraction              | (220)<br>19.2 nm                           | (220)<br>18.3 nm                       |
| Net stress                                 | Wafer curvature                                               | 2.7 GPa (tensile)                          | 3.1 GPa (tensile)                      |
| Grain structure                            | Cross-section TEM<br>Plan view TEM                            | Columnar<br>2D equiaxial                   | Columnar<br>2D equiaxial               |
| Average grain size                         | TEM                                                           | 0.43 nm                                    | 1.23 nm                                |
| Average roughness                          | AFM                                                           | 8 nm                                       | 18.7 nm                                |
| Min/max roughness                          |                                                               |                                            |                                        |
| Specular reflection<br>(% of Si reference) | Scanning UV                                                   | 248 nm: 142%<br>365 nm: 55%<br>440 nm: 57% | 145%<br>95%<br>123%                    |
| Impurities<br>(at. %)                      | AES                                                           | O < 1%<br>C < 0.5%                         | O < 1%<br>C < 0.5%                     |

Source: Wang, S.-Q. and J. Schlueter (1996).

AFM:

Surface roughness

$S_q = 40 \text{ nm}$

$S_q = 18 \text{ nm}$

$S_q = 17 \text{ nm}$

$S_q = 16 \text{ nm}$

$S_q = 4 \text{ nm}$

TEM:

Size and shape of the grains



**Figure 7.6** Microstructure evolution of silicon films deposited by PECVD. Grain size measurement by TEM; surface roughness by AFM. Reproduced from Vallat-Sauvain *et al.* (2000) by permission of AIP

compared to a conventional system, and analyzed for a range of film parameters. Such extensive film characterization is done when a new process is being developed and, if adopted in production, a simple monitoring measurement like sheet resistance can be used.

The electrical characterization described in Chapter 2 and in Table 7.1 was at DC, but circuits that operate at gigahertz frequencies must be measured at proper frequencies. The same applies to dielectric films, too.

The main parameters in CVD processes are the reactant gases, flow rates, flow rate ratios of the reactants, temperature and pressure. In PECVD, RF power and RF frequency play important roles. In Figure 7.6 on PECVD silicon deposition, the effects of  $\text{SiH}_4 / (\text{SiH}_4 + \text{H}_2)$  flow ratio on crystal size were studied. High-frequency (70 MHz) PECVD was employed, and glass wafers were used as substrates at  $225^\circ\text{C}$ . Keeping all other deposition parameters constant, changes in gas ratio have resulted in enormous variations in grain size and surface roughness. In LPCVD polysilicon deposition using  $\text{SiH}_4$  as a source gas, a similar grain size variation can be seen as a function of temperature: at  $630^\circ\text{C}$  large grains (on the order of 100 nm) are formed, below  $600^\circ\text{C}$  the grain size is reduced and at  $570^\circ$  the film is amorphous. The effect of temperature on  $\text{MgF}_2$  grain structure can be seen in Figure 7.7: the lower the temperature, the smaller the grains.



**Figure 7.7** Grain structure of  $\text{MgF}_2$  by SEM as a function of deposition temperature: (a)  $250^\circ\text{C}$ ; (b)  $300^\circ\text{C}$ ; (c)  $350^\circ\text{C}$ ; (d)  $400^\circ\text{C}$ . Reproduced from Pilvi *et al.* (2008), Copyright 2008, American Chemical Society

Dielectric films are also measured for a number of properties (Table 7.2). Again, this kind of extensive characterization relates to the research phase of new materials. Boron nitride is a new material that has been studied because of its potential as an insulator in multilevel metallization: it has a lower dielectric constant than nitride

**Table 7.2** Film characterization: PECVD BN (boron nitride)

| PECVD conditions:                               | Process A                                          | Process B                                                    |
|-------------------------------------------------|----------------------------------------------------|--------------------------------------------------------------|
| gases                                           | B <sub>2</sub> H <sub>6</sub> (1%)/NH <sub>3</sub> | B <sub>3</sub> N <sub>3</sub> H <sub>6</sub> /N <sub>2</sub> |
| flow rates                                      | 1800 sccm/120 sccm                                 | 100 sccm/200 sccm                                            |
| RF power                                        | 500 W                                              | 200 W                                                        |
| pressure                                        | 660 Pa (= 5 torr)                                  | 400 Pa (= 3 torr)                                            |
| temperature                                     | 400 °C susceptor                                   | 300 °C susceptor                                             |
| Deposition rate                                 | 300 nm/min                                         | 370 nm/min                                                   |
| Uniformity                                      | <5% (3σ)                                           | 3% (3σ)                                                      |
| Refractive index                                | 1.746                                              | 1.732                                                        |
| Stress                                          | -400 MPa (compressive)                             | -150 MPa (compressive)                                       |
| Etch rate in RIE                                | 62 nm/min                                          | 28 nm/min                                                    |
| Etch rate H <sub>3</sub> PO <sub>4</sub> 167 °C | 1–11 nm/min                                        | —                                                            |
| Etch rate BHF                                   | 0.5 nm/min                                         | <1 nm/min                                                    |
| B/N ratio                                       | 1.02                                               | 1.02                                                         |
| Hydrogen content                                | <8 at. %                                           | <8 at. %                                                     |
| Density                                         | 1.89 g/cm <sup>3</sup>                             | 1.904 g/cm <sup>3</sup>                                      |
| Structure                                       | Amorphous                                          | Amorphous                                                    |
| Step coverage                                   | 60% (1 × 1 μm)                                     | 80% (0.5 × 0.5 μm)                                           |
| Optical band gap                                | 4.7 eV                                             | 4.9 eV                                                       |
| Dielectric constant                             | 3.8–5.7                                            | 3.8–5.7                                                      |
| Breakdown potential                             | 6–7 MV/m                                           | 6–8 MV/m                                                     |

Source: Cote, D.R. et al. (1995).

(3.8 vs. 6) and low etch and polish rates. It is not used in volume manufacturing. One special feature is the use of etch rate as a quality criterion. With dielectrics, thermal SiO<sub>2</sub> acts as a reference film which can always be used to eliminate etchant concentration or temperature effects.

Many of the measurements listed in Table 7.2 are often laborious and in production control ellipsometric or reflectometric thickness and refractive index measurements would probably be used.

IR spectroscopy measures molecular vibrations around 10 μm wavelength. This is indicative of chemical bonds, because IR vibrations are typically bond stretching and bending vibrations. IR spectroscopy is most often practiced using an interferometric measurement set-up known as FTIR, for Fourier transform IR. In silicon nitride thin films it is used to see the presence and qualitative abundance of Si–N, Si–H, N–H bonds (Figure 7.8).

## 7.4 Surfaces and Interfaces

The surface roughness of thin films varies considerably. In general, high-temperature deposition results in smoother films. Epitaxial films are of course very smooth, but many amorphous films can also be extremely smooth. There is a strong correlation between surface smoothness and volume homogeneity: thermal oxide, amorphous silicon and TEOS oxide are both smooth and homogeneous, whereas doped polysilicon and columnar tungsten are rough and



**Figure 7.8** FTIR analysis of silicon nitride films deposited by PECVD: changing nitrogen flow results in different proportion of Si–H, Si–N and N–H bonds. Reproduced from Biasotto *et al.* (2008) by permission of Elsevier

inhomogeneous. Volume inhomogeneity makes measurement of thin-film properties difficult. It is usual then to treat the film as if it were a stack of many layers, each with slightly different properties, for example interfacial mixed layer, bulk of film and surface layers modeled as three materials each with materials constants of their own.



**Figure 7.9** Possible interface structures

Thermodynamics gives hints of interface stability. A change in Gibbs free energy is given by

$$\Delta G = G_{\text{products}} - G_{\text{reactants}} \quad (7.1)$$

$\Delta G$  is positive for a stable pair of materials. For the reaction of titanium with silicon dioxide



the change in Gibbs free energy is  $\Delta G = G_{\text{TiO}_2} - G_{\text{SiO}_2} = (160 - 165) \text{ kcal} = -5 \text{ kcal}$ , indicative that the reaction can take place. Thermodynamics, however, is about initial and final states, not about rates: some thermodynamically favorable processes are so slow that practical devices can be made. But if thermodynamics forbids a reaction, it cannot proceed: the change in Gibbs free energy for the cobalt/silicon dioxide reaction is positive, and cobalt does not reduce oxide. This means that cobalt silicide formation, Equation 7.3, is very sensitive to the presence of native oxide, while the titanium–silicon reaction, Equation 7.4, can proceed in spite of a thin oxide layer:



Interface types also vary significantly. Abrupt interfaces (Figure 7.9a) are not only idealizations but encountered in epitaxy, and other methods, ALD, CVD, PVD, electrochemical, also produce almost ideally sharp interfaces. However, native oxides are very often encountered on interfaces (Figure 7.9b), but in many cases those about 1 nm films are broken or dissolved in subsequent annealing steps.

The case of silicon dioxide/copper (Figure 7.9c) shows copper diffusion into the oxide. A silicon/titanium pair will react and form silicide (Figure 7.9d). Many metals do form silicides: copper silicides form at very low temperature, 200–300 °C, nickel, cobalt and titanium at 400, 500 and 600 °C respectively. Tungsten, molybdenum and tantalum will also form silicides, not all of them simple

$\text{MeSi}_x$  compounds but complex mixtures of various silicides, for example  $\text{Me}_2\text{Si}_5$ ,  $\text{Me}_2\text{Si}_3$ ,  $\text{MeSi}_2$ ,  $\text{MeSi}$ .

Aluminum reacts with tungsten to form  $\text{Al}_{12}\text{W}$  and titanium forms  $\text{Al}_3\text{Ti}$ .

Aluminum does not form a silicide. Annealing at 425 °C will dissolve native oxide, ensuring good electrical contact. However, too much annealing will lead to pitting: silicon is soluble in aluminum (as shown in the Al–Si phase diagram, Figure 7.10) and an open volume is left behind as silicon atoms migrate into the aluminum. Aluminum, on the other hand, will diffuse to fill the space left by silicon dissolution. This leads to the case depicted in Figure 7.9e. These aluminum



**Figure 7.10** Aluminum/silicon phase diagram. Reproduced from Hansen and Anderko (1958) by permission of McGraw-Hill

spikes can be micrometers deep and extend beyond the pn junction. To prevent junction spiking, aluminum can be alloyed with silicon: a silicon concentration of 0.5% (wt %) will saturate aluminum at 425 °C, and 1% Si will prevent silicon dissolution at 500 °C. The other, more general solution is to implement a diffusion barrier, to be discussed shortly.

## 7.5 Adhesion

Adhesion is a major issue in thin-film technology. As a rule of thumb, poor adhesion is the norm and only special attention will lead to good adhesion. Some materials have poor adhesion due to their chemical nature: noble metals are noble because they do not react, therefore they do not form bonds across interfaces. Adhesion is also related to surface cleanliness: residues or dirt from previous steps will almost inevitably lead to poor adhesion. Deposition process variables do play a role: in sputtering energetic ions and atoms will knock out contamination and loosely bound film atoms, but in evaporation the arriving atoms do not have enough energy to displace weakly bonded atoms.

Adhesion layers are additional films with the role of improving adhesion and, to a first approximation, have no effect on device structure or operation. Adhesion layer films are selected on the basis of their bond forming abilities: titanium and chromium are the two most widely used materials, and other oxide forming metals also show good adhesion, like aluminum. The thickness of the adhesion layer is in the range of 10 nm because it has the role of a surface only. Adhesion layer and structural film are deposited immediately after each other in the same vacuum chamber: the freshly formed adhesion layer surface ensures cleanliness and thus eliminates one main factor of poor adhesion. Typical pairs of adhesion layer/noble metal include Ti/Pt and Cr/Au (and vice versa). Adhesion layers are also useful for near-noble refractory metals like tungsten.

The first adhesion test is the tape pull test: adhesive tape (standard office tape is commonly used) is attached to the thin film and then pulled off. If the film peels off with the tape, it has failed the adhesion test. More advanced tests use quantifiable pull force.

## 7.6 Two-Layer Films

Two layers of thin films can offer performance benefits compared to single film, as discussed in connection with Figure 5.19 and in connection with adhesion. There are many other reasons for adopting two-layer films. If very

thin films are made, depositing two layers reduces the possibility of defects: it would be improbable for two defects to coincide. This is utilized in making masks for deep etching of glass. Similarly, metal wires consisting of two different metals can survive even if one film is destroyed, by for example electromigration or corrosion (see Chapter 36). Stress compensation is often done by inserting a thin silicon dioxide film underneath silicon nitride. The tensile stress in the nitride is compensated by the compressive stress in the oxide. In optical MEMS the mirror membrane material is chosen on mechanical grounds, and the choice is often silicon or nitride. But in order to have high reflectance, the mirror can be coated by, for instance, gold or aluminum.

Bimorph cantilevers (of the type shown in Figure 5.18) can be used as temperature sensors: unequal CTE leads to cantilever bending. In IR sensors (bolometers and thermopiles) there is usually a sensor layer and the absorber layer is separate (with sometimes an extra aluminum layer underneath to reflect non-absorbed light). In electrowetting microfluidics the electrodes are covered by a hydrophobic thin film.

Barriers are additional layers between two materials. Their role is to separate adjacent layers. Many aspects of barriers are similar to adhesion layers: barriers are not needed for device operation as such, but their presence makes either the device fabrication process more robust or the resulting device more stable. Barriers are thin, like adhesion layers, with 10–100 nm as the typical barrier thickness.

Total barriers must prevent all fluxes through them, both atom diffusion and charge carrier transport. In flat-panel displays ion barriers are needed: sodium diffusion from glass sheet to transistors has to be prevented. When solar cells are processed on steel sheets, iron diffusion must be prevented. In flexible electronics, gas barriers are needed: oxygen diffusion through a polymer sheet into active devices must be prevented. The common solution is to deposit an inorganic film, typically oxide or nitride, by PECVD, ALD or spin coating. Barriers are needed on both sides of the polymer because its permeability easily allows oxygen diffusion through polymer sheets hundreds of micrometers thick.

In the case of metallization, current has to flow through the barrier, but metal mixing by diffusion must be prevented. Metallic barriers have relatively loose requirements for resistivity (the distance is less than 100 nm). Most barrier materials have resistivities around 100–500  $\mu\text{ohm}\cdot\text{cm}$ , one to two orders of magnitude higher than conductors. While resistivity is not a problem for barriers, contact resistivity must be low.



**Figure 7.11** Auger depth profile of Pt/Ti/SiN<sub>x</sub>/Si structure: (a) as deposited; (b) oxygen annealed at 600 °C: interdiffusion of films is almost complete. Oxygen and carbon accumulations on the surface in the as-deposited sample indicate cleaning problems. Reproduced from Kang *et al.* (1999) by permission of Institute of Pure and Applied Physics

The first diffusion barriers to be implemented were TiW films 100 nm thick between aluminum and silicon to prevent Al–Si junction spiking. Early experiments in a poor vacuum led to the incorporation of oxygen and nitrogen, which passivated grain boundaries without the researchers noticing the real effect behind the barrier performance. When the mechanism was elucidated, reactive sputtering of TiW in an argon/nitrogen atmosphere was adopted. It led to 10 nm grains and nitrogen incorporation at grain boundaries, both of which led to improved barrier performance. Amorphous films are preferable as barriers and a-WN is one candidate.

Copper metallization needs barriers between the copper and silicon to prevent a silicidation reaction (formation of for example Cu<sub>3</sub>Si). Tantalum nitride, TaN, is one possible choice. But a barrier is needed also between the copper and oxide because copper diffuses into the oxide. Silicon nitride can be used.

Adhesion layer and diffusion barrier stability can be checked by electrical and physical measurements. Sheet resistance increase is a quick and simple measurement. Copper resistivity is very low, ca. 2  $\mu\text{ohm}\cdot\text{cm}$ , and when the barrier fails, copper reacts to form silicides which are more resistive. They can be identified by X-ray diffraction but the resistance increase is indicative of silicide formation. Diode leakage at the pn junction is another quick electrical measurement.

Auger depth profiling can be used to see barrier reactions. Auger measurement is slow and destroys samples, but it can be done without any sample preparation. Usually an as-deposited sample is compared to an annealed

sample(s), and barrier failure is evidenced by intermixing of metal and silicon across the barrier. The accumulation of material at the interfaces, and atom distributions across the film, are helpful in understanding the reactions behind adhesion failure.

Note that the Auger analysis shown in Figure 7.11 does not indicate TiO<sub>2</sub> formation even though coexistence of the titanium and oxygen might suggest it: Auger is about atoms and not about compounds. XRD could show TiO<sub>2</sub> formation by the appearance of diffraction peaks identified as arising from TiO<sub>2</sub>.

## 7.7 Alloys and Doped Films

In addition to elemental metals, many alloys are routinely used in microtechnology: amorphous TiW (about 30 at. % titanium), SiCr and NiCr for resistors. Early integrated circuits used aluminum for metallization. Aluminum–silicon (1% Si) was adopted, and later Al–Si was replaced by Al–Si–Cu for improved electromigration resistance. Similarly, dielectrics can be doped and alloyed. Fluorine-doped oxide, SiO<sub>x</sub>F<sub>y</sub>, can be deposited by PECVD, but film instability limits the usable fluorine concentration to about 5% (by weight), because film stability deteriorates and the material becomes hygroscopic. Other materials deposited by PECVD include carbon-doped oxide SiO<sub>x</sub>C<sub>y</sub> and SiC<sub>x</sub>N<sub>y</sub>, which are used as intermetal dielectric layers in multilevel metallizations.

CVD oxide can be doped by adding phosphine (PH<sub>3</sub>) gas to the source gas flow. Phosphorus-doped CVD oxide, PSG (Phosphorus-doped Silica Glass), is a widely

used doped film. Phosphorus oxide is formed by CVD and intermixed with silicon dioxide:



Doped oxide films typically have about 5% dopant by weight. Higher doping levels lead to porous, hygroscopic material. The toxicity of PH<sub>3</sub> (and B<sub>2</sub>H<sub>6</sub> for BSG) needs to be accounted for, but CVD reactors use silane, which is a flammable gas, so that the basic designs of CVD reactors are suitable for dangerous gases. TMP (trimethyl phosphite) and TMB (trimethyl borate) are less toxic alternatives to hydrides.

Polycrystalline SiGe (polySiGe) can be used instead of polysilicon. Germane, GeH<sub>4</sub>, is mixed with silane, SiH<sub>4</sub>, and almost any ratio of silicon and germanium can be made. Most MEMS applications use Si<sub>50</sub>Ge<sub>50</sub> or similar. The main benefit of polySiGe is its lower stress annealing temperature, which makes it easier to implement MEMS structures together with ICs. SiGe can also be etched selectively against silicon, enabling free-standing silicon structures to be made.

Sputtering is suitable for alloy deposition. In alloy sputtering the flux is enriched in the component with higher sputtering yield (yields from alloys are even less accurately known than yields from elemental solids; elemental solid yields are used as approximations). The composition of the sputtered flux is given by

$$C_{\text{film}} = \frac{Y_a}{Y_b} \frac{X_a}{X_b} \quad (7.6)$$

where the  $X_i$  are the concentration proportions in the target (they add to unity:  $X_a + X_b = 1$ ). Because matter is conserved, the target is enriched in the other component

$$C_{\text{target}} = \frac{Y_b}{Y_a} \frac{X_a}{X_b} \quad (7.7)$$

A steady-state situation develops and composition remains unchanged. Sputtering in a reactive atmosphere, in argon/nitrogen or argon/oxygen mixtures, results in nitride or oxide films, or stuffed films with small amounts of reactive impurities at grain boundaries. Molybdenum sputtering in a high-nitrogen-content atmosphere results in Mo<sub>2</sub>N compound formation, but a low-nitrogen sputtering atmosphere results in film designated as Mo:N, nitrogen-stuffed molybdenum. It is very close to molybdenum in chemical composition, but the minuscule nitrogen has an important role: it prevents crystallization, resulting in amorphous films. Typical applications of reactive sputtering are TiN, Ta<sub>2</sub>O<sub>5</sub>, ZnO, AlN, TiW:N and WO<sub>3</sub>. Often reactively sputtered films

are non-stoichiometric, and an annealing step (e.g., in oxygen) is needed to improve film quality.

The introduction of small amounts of nitrogen or oxygen into an argon plasma does not appreciably change the properties of the plasma discharge or of the growing film, but after a critical partial pressure is reached, the target surface transforms into nitride or oxide, and the plasma discharge is established at another equilibrium. If reactive gas flow is then reduced, the target remains nitrided/oxidized, and return to initial conditions takes places at much lower partial pressures, that is reactive sputtering exhibits hysteresis.

## 7.8 Multilayer Films

The performance of simple elemental or compound films, with or without barriers or adhesion layers, is often not enough, and multilayer films are introduced to offer improvement.

For many generations (0.8–0.5–0.35–0.25 μm) IC metallization was done with a Ti/TiN/Al/TiN film stack. Titanium acts as an adhesion layer, TiN as a diffusion barrier, Al as a current-carrying film and the top TiN has the dual role of mechanically stiffening the structure and reducing reflectivity (important for lithography). Metallization reliability has been greatly improved by adopting such multilayer metallization schemes, but a price has been paid elsewhere: the etching of such multilayer structures is difficult.

Dielectric mirrors with  $\lambda/4$  layer thicknesses for high-reflectance surfaces are one example of multilayer films. Undoped polysilicon, oxide and nitride are the usual films. For visible wavelengths, layer thicknesses around 100 nm are typical. Similar  $\lambda/4$  structures are used in thin-film bulk acoustic resonators (TFBARs): multilayers of W:SiO<sub>2</sub>, with thicknesses about 1.5 μm, act as acoustic mirrors, see Figure 7.12.

In PECVD, oxynitride films of composition SiO<sub>x</sub>N<sub>y</sub> can be easily made. By tailoring the composition, the refractive index can be varied from 1.46 to 2, covering the full range of SiO<sub>2</sub> and Si<sub>3</sub>N<sub>4</sub> refractive indices. By sandwiching the SiO<sub>x</sub>N<sub>y</sub> film between two films of lower refractive index, it acts as a waveguide (Figure 7.13). Doping of oxide by for example phosphorus (PSG) or germanium can also be used to tailor the refractive index, but only over a limited range before other film properties change too much.

## 7.9 Selective Deposition

Both CVD and electrochemical processes can be used for selective deposition, with electroless copper and CVD



**Figure 7.12** Bulk film acoustic resonator (FBAR) structure on a glass wafer: a piezoelectric ZnO resonator is sandwiched between gold and aluminum electrodes. TiW, Ni and Mo are thin adhesion promotion layers. W and SiO<sub>2</sub> form a  $\lambda/4$  acoustic wavelength filter. Adapted from VTT Microelectronics Annual Research Review (2001)



**Figure 7.13** Refractive index SiO<sub>2</sub>/SiO<sub>x</sub>N<sub>y</sub>/SiO<sub>2</sub> waveguide:  $n_f$  1.46/1.52/1.46. Reproduced from Hilleringmann and Goser (1995) by permission of IEEE

tungsten being the most studied ones. The silicon surface reduction process allows selective CVD tungsten in contact holes:



This reaction is selective because SiO<sub>2</sub> does not reduce WF<sub>6</sub>. However, about 20 nm of silicon is consumed, and the reaction is self-limiting: WF<sub>6</sub> cannot diffuse through the growing tungsten layer. Tungsten deposition



**Figure 7.14** Selective deposition: problems with unequal hole depths and spurious nucleation

is continued by silane reduction of tungsten hexafluoride on tungsten according to



This reaction, however, is mass transport limited, and difficult to control. Additionally, it faces problems when contact holes of different depths have to be filled: some are underfilled, some are overfilled (Figure 7.14).

Plug fill can be achieved by continuing deposition in hydrogen reduction mode, Equation 5.7. There is always the problem of selectivity loss. It is usually connected with residues from the preceding process steps, for instance incomplete resist removal. Selective deposition processes are rare in volume manufacturing even though they sometimes offer enormous simplifications in process integration.

## 7.10 Reacted Films

A rather interesting class of conducting thin films is the silicides, compounds of silicon and metals, for example TiSi<sub>2</sub>, CoSi<sub>2</sub>, NiSi, WSi<sub>2</sub> and PtSi. Silicides combine the good properties of silicon and metals, such as high-temperature stability and low resistivity, with the lowest values of about 15  $\mu\text{ohm}\cdot\text{cm}$ . Silicide CTEs are typically  $15 \times 10^{-6}/^\circ\text{C}$ . Young's moduli for silicides are on the order of 100 GPa.

Silicides are formed by many methods, for instance by CVD or alloy sputtering. Tungsten silicide WSi<sub>2</sub> is made by CVD. Most often silicides are made by solid state reaction of metal film and silicon, for example Pt + Si  $\rightarrow$  PtSi. The interesting feature of reacted silicides is that they can be made without silicide etching, in a self-aligned



**Figure 7.15** Silicide formation by metal/silicon reaction: (a) metal sputtering on oxide pattern; (b) reaction at metal/silicon interface, no reaction on oxide; (c) selective etching of unreacted metal leaves silicide

mode, Figure 7.15. This offers possibilities in advanced MOS transistor metallization (to be discussed in Chapter 26). The desired pattern is defined in the oxide, and metal is deposited. Metal/silicon reaction takes place during annealing in those areas where metal and silicon are in contact, but the metal does not react with the oxide. The unreacted metal can be etched away to leave the silicide and oxide.

The surface of the resulting silicide is approximately at the level of the original silicon surface, and the silicide is

formed under the original surface. This volume expansion means that reacted silicides are under compressive stress. Various silicides and their formation methods are listed in Table 7.3.

Titanium silicide is formed by annealing in argon or nitrogen. Typical conditions are about 750 °C and 30 s in rapid thermal annealing (RTA) equipment (see Chapter 32 for more on RTA). A simple one-step anneal in argon, which would produce a predictable thickness of titanium silicide, is not usable because silicide grows over oxide laterally.

Two-step nitrogen annealing has been developed to ensure silicide formation without silidification over the oxide. In nitrogen annealing, however, there is a competing reaction taking place at the surface: titanium nitridation. First, annealing has to be optimized so that the silicon/titanium reaction (TiSi formation) at the interface is faster than the gas phase nitridation of titanium into TiN. TiN film is beneficial because it suppresses the lateral growth of silicide over the oxide. The first anneal results in C49 phase TiSi<sub>2</sub> which has fairly high resistivity, see Figure 7.16.



**Figure 7.16** TiSi<sub>2</sub> C49 to C54 phase transitions followed by sheet resistance measurements. Reproduced from Mann *et al.* (1995) by permission of IBM

**Table 7.3** Silicide properties

| Silicide          | Resistivity ( $\mu\text{ohm}\cdot\text{cm}$ ) | Formation                                                               | Selective etch                                   |
|-------------------|-----------------------------------------------|-------------------------------------------------------------------------|--------------------------------------------------|
| TiSi <sub>2</sub> | 15–20                                         | Ti/Si reaction at about 750 °C                                          | NH <sub>4</sub> OH:H <sub>2</sub> O <sub>2</sub> |
| TiSi <sub>2</sub> | 15–20                                         | CVD TiCl <sub>4</sub> /SiH <sub>2</sub> Cl <sub>2</sub> /H <sub>2</sub> | —                                                |
| CoSi <sub>2</sub> | 15–20                                         | Co/Si reaction at 500 °C                                                | HCl:H <sub>2</sub> O <sub>2</sub> 3:1            |
| NiSi              | 15–20                                         | Ni/Si reaction at 400 °C                                                | HNO <sub>3</sub>                                 |
| WSi <sub>2</sub>  | 30                                            | CVD WF <sub>6</sub> /SiH <sub>2</sub> Cl <sub>2</sub> at 400 °C         | —                                                |
| PtSi              | 30                                            | Pt/Si reaction                                                          | HCl:HNO <sub>3</sub> 3:1                         |

In the case of nitrogen annealing, we have to remove not only unreacted metallic titanium but also TiN, so we need to know the selectivity for both Ti:TiSi<sub>2</sub> and TiN:TiSi<sub>2</sub> pairs. The thickness of titanium cannot be simply calculated from the titanium, silicon and TiSi<sub>2</sub> densities because some titanium is consumed by the TiN formation reaction. TiSi<sub>2</sub> thickness is also reduced by the fact that selective etches are not infinitely selective: some TiSi<sub>2</sub> is lost during titanium etching. If titanium thickness is scaled down and the rest of the process is unchanged, TiSi<sub>2</sub> thickness will decrease more than predicted by the simple metal-to-silicide relation because surface-nitrided thickness is independent of titanium thickness.

The second anneal transforms silicide C49 into C54 phase which has a resistivity of about 15  $\mu\text{ohm}\cdot\text{cm}$ . This resistivity of C54 titanium silicide is lower than that of many thin-film metals. Anneal temperature must be at least 700 °C to effectuate the phase transformation but in practice 850 °C for 30 s is usually used, see Figure 7.16. Too high an annealing temperature leads to silicide agglomeration: the silicide balls up and the film breaks down.

## 7.11 Simulation of Deposition

Topography simulation (for deposition, etching and polishing) works on fluxes and surface processes: at each grid point the incoming flux (from the fluid phase) and surface reaction probability are evaluated (with return flux of reaction products in the case of etching/polishing, or non-sticking species in the case of deposition) to calculate the new surface height. In principle the generation of incoming species could be simulated (for instance, ion and radical production in plasma) but this is usually not integrated into a topography simulator; rather, it is

part of a reactor simulator. New surface points are calculated and those points are connected to represent the surface. Accuracy is increased by calculating new points between existing points when the points are far apart and, similarly, by eliminating points that become close to each other.

Deposition models define atom arrival angles, and various models are available in most simulators: fully directional, hemispherical, conical, etc. Etch models include isotropic and anisotropic models and user-definable mixtures of the two. Model selection is very much an empirical question, and the predictive power of topography simulation is diminished by this semiempirical tailoring of model parameters.

The input for a typical topography simulation includes:

- surface topography already made
- material to be deposited
- deposition model (angular distribution of depositing species)
- thickness/rate and time.

Adjustable parameters include surface diffusivity, which determines how much lateral movement an impinging species is allowed before it is “frozen” in the growing film.

The topography simulator SAMPLE 2D, developed at the University of California, Berkeley, has been used to obtain the profiles shown in Figure 7.17. The hemispherical deposition model is an approximation of sputter deposition. Trench widths are 1  $\mu\text{m}$  and 0.5  $\mu\text{m}$ , depth is 1  $\mu\text{m}$ . Wall angle either 90° or 81°. Film thickness is 0.5  $\mu\text{m}$  on planar top surface.

Note that step coverage over isolated lines is always the same irrespective of line aspect ratio: step coverage



**Figure 7.17** Simulation of deposition step coverage with SAMPLE 2D. Hemispherical deposition model corresponds to sputtering. Trench widths are 1  $\mu\text{m}$  and 0.5  $\mu\text{m}$ , depth is 1  $\mu\text{m}$ . Wall angle either 90° or 81°. Film thickness is 0.5  $\mu\text{m}$  on planar top surface.



**Figure 7.18** A 3D Monte Carlo simulation of aluminum deposition into a contact hole: (a) high rate deposition; (b) low rate deposition. Both depositions are at the same temperature. The simulation is 3D, but only a cut through the contact hole centerline is shown. Reproduced from Baumann and Gilmer (1995) by permission of IEEE

depends on atom arrival angles and, by definition, isolated lines have large unobstructed spaces next to them, therefore identical step coverage will result.

Monte Carlo (MC) and molecular dynamics (MD) simulations offer more realism, for example the prediction of step coverage based on relaxation (Figure 7.18). Calculations can be speeded up by treating matter as 100 Å cluster spheres instead of individual atoms. Clusters, and the thus the atoms, come to rest at stable positions, for example when touching three other spheres. The arrival of new material and rearrangements of already deposited films can be simulated simultaneously. Temperature and sticking coefficient are used as parameters for surface mobility.

A 2D simulation can overestimate bottom coverage by 40%, compared to a 3D one. This is intuitively easy to understand because 2D simulation treats recesses as infinitely long trenches, with very large acceptance angles along the trenches, whereas 3D simulation takes into account the real acceptance angle.

### 7.11.1 Scales in simulation

The fundamental simplification of many topography/thin-film simulators is the fact that surface-controlled reactions are assumed. On a microscopic scale this is true: material is being added to or removed from a surface, but on a macroscopic scale this is a gross simplification. Etching and deposition processes can be either surface reaction limited or mass transport process limited. Transport of reactants from gas flow to the surface (as in a CVD reactor) or removal of reaction products by convection (like removal of hydrogen bubbles that result from silicon etching) can be more critical to etching or deposition than the surface processes. Whether surface reaction or mass transport is the mechanism which determines the reaction rate

has to be studied for each process. If the reaction is transport limited, then the simulation should be able to model fluid dynamics at the reactor scale, in addition to surface processes at the micrometer scale.

## 7.12 Thickness Limits of Thin Films

The thinnest films are obviously one atomic layer thick, as evidenced by ALD and MBE deposited films. Very thin films are often discontinuous and the thickness required for continuous films is process and material dependent. One criterion is transparency, which translates to about 10–20 nm.

PVD and CVD techniques are suitable for films in the thickness range of 10–1000 nm. This is partly a practical limitation due to deposition rates, which are generally 1–100 nm/min. In many cases thicker films are desired, and PVD or CVD methods are too slow. In CVD silicon epitaxy, 100 µm layer thickness is feasible because epitaxial growth rates can be as high as 5 µm/min and the limit is economical, physical. For most polycrystalline and amorphous CVD and PVD films, however, stresses build up to unacceptable levels for thicker films, limiting thicknesses to a few micrometers. Electroplated films can be very thick, and through-wafer plating is fairly standard. Deposition rates can be micrometers/minute, even though they are usually much smaller for thinner films.

The read/write heads of hard disk drives involve multi-layer structures. Shown in Figure 7.19 is the film structure of a giant magnetoresistive (GMR) read/write head. It is only 39 nm thick, but there are nine layers, the thinnest



**Figure 7.19** Thin-film stack of a giant magnetoresistive (GMR) read head. Redrawn after Kools *et al.* (2000)

of them just 0.8 nm thick. This is no laboratory curiosity, but a volume-manufactured piece.

The minimum thickness/minimum period of multilayer structures depends on growth process characteristics, surface smoothness and the sharpness of interfaces. Interface abruptness depends on the reactor operating principle: if growth is dependent on gas flow in the reactor, the minimum thickness is determined by the gas residence time in the reactor (to be discussed in Chapter 33), which can be fractions of seconds or tens of seconds. Flow systems, like CVD, are thus not suitable for very thin layers. Beam systems, evaporation, sputtering and MBE, with shutters, are able to turn deposition off and on over a time scale of less than a second.

Periodic multilayers have been fabricated for X-ray and extreme ultraviolet (EUV) optics: Si/Mo and W/C and similar light element/heavy element mirrors can have reflectances of 70% at 13 nm wavelength which is being considered for future EUV lithography. Periodicities are on the order of nanometers, identical to X-ray wavelengths. Surface roughness should be limited to a fraction of film thickness, which translates to 0.1–0.2 nm. The number of layers can be 50 for example. Surface smoothness must not be compromised as more and more layers are added. A layer-by-layer growth mode is preferred, and amorphous films are preferred because of their smoothness. Thickness control is important, from layer to layer and day to day.

Additionally, reactions between the layers have to be prevented. Molybdenum and silicon form silicide,  $\text{MoSi}_2$ , and its formation must be prevented. Sputtering is a room temperature process, and silicide formation thermodynamically should not take place. However, sputtering is a high-energy process, and on an atomic level the local temperature may rise, and energetic bombardment will knock atoms forward, enhancing the mixing of layers. If silicide is formed, its volume is about 10% less than that of molybdenum and silicon combined, and this needs to be accounted for. Alternatively, silicide formation must be prevented. Boron carbide has been tested as a barrier. The structure is shown in Figure 7.20. Layer thicknesses were extrapolated from bulk sputtering rates, which introduces some inaccuracy. A reflection of 70% was achieved at 13.5 nm (wavelength chosen for future EUV lithography systems).

$\text{ZrO}_2/\text{HfO}_2$  multilayers have been used in order to improve leakage currents in deposited capacitor dielectrics. These polycrystalline multilayers have been termed nanolaminates to separate them from epitaxial superlattices. The hardness of the TiN/AlN multilayer, 1800 on the Knoop hardness scale, is practically the average of TiN and AlN hardnesses for thicker



**Figure 7.20** A single four-layer element of a periodic EUV multilayer mirror for 13.5 nm lithography. The four-layer structure is repeated 50 times, to a total thickness of about 350 nm. Adapted from Bajt (2002)

multilayers, but when periodicity is reduced below 10 nm, the hardness increases to 4000.

## 7.13 Exercises

- Which method would you use to measure the following properties of PECVD silicon nitride film: (a) deposition rate; (b) uniformity; (c) stress; (d) stoichiometry; (e) structure; (f) step coverage?
- What are the resistivities of the titanium silicide C49 and C54 phases (Figure 7.15)?
- The speed of sound in  $\text{ZnO}$  is 5700 m/s. What is the intended operating frequency for the TFBAR shown in Figure 7.11?
- A periodic lattice of W and C is used as a  $\lambda/4$  X-ray mirror. What layer thicknesses should be used for 100 eV X-rays?
- If 20 nm of nickel reacts with an overabundance of silicon, how thick a layer of  $\text{NiSi}$  will be formed? Densities: Si, 2.3 g/cm<sup>3</sup>; Ni, 8.9 g/cm<sup>3</sup>;  $\text{NiSi}$ , 7.2 g/cm<sup>3</sup>.
- $\text{CoSi}_2$  is formed by a cobalt thin-film reaction with silicon. What is the position of the  $\text{CoSi}_2$  surface relative to the original silicon surface? Densities: Co, 8.9 g/cm<sup>3</sup>;  $\text{CoSi}_2$ , 5.3 g/cm<sup>3</sup>.
- Oxygen is soluble in titanium up to 34 at. %. How thick must a silicon dioxide film be in order to be dissolved by a titanium film 50 nm thick? The density of titanium is 4.5 g/cm<sup>3</sup> and that of silicon dioxide 2.3 g/cm<sup>3</sup>.
- Draw a deposited film profile over given topography for:
  - sputtered aluminum, 300 nm thick
  - CVD TEOS, 0.25  $\mu\text{m}$  thick

- (c) PECVD oxide, 0.2  $\mu\text{m}$  thick  
 (d) LPCVD polysilicon, 100 nm thick.
- 
9.  $\text{TiAl}_3$  is formed in a reaction between aluminum and titanium films. What will happen to the volume of the metal line? Densities: Al, 2.7 g/cm<sup>3</sup>; Ti, 4.5 g/cm<sup>3</sup>;  $\text{TiAl}_3$ , 3.35 g/cm<sup>3</sup>.
10. How would you measure the following film properties:
- (a) crystal size in poly-SiGe
  - (b) carbon content in  $\text{SiO}_x\text{C}_y$
  - (c) stoichiometry of  $\text{TiSi}_2$
  - (d) surface roughness of polysilicon
  - (e) resistivity of electroplated copper?
- ## References and Related Reading
- Bajt, S. (2002) Improved reflectance and stability of Mo-Si multilayers, *Opt. Eng.*, **41**, 1797–1804.
- Baumann, H.F. and G.H. Gilmer (1995) 3D modeling of sputter and reflow processes for interconnect metals, International Electron Devices Meeting, p. 89.
- Biasotto, C. *et al.* (2008) Silicon nitride thin films deposited by electron cyclotron resonance plasma enhanced chemical vapor deposition for micromechanical applications, *Thin Solid Films*, **516**, 7777–7782.
- Chou, B.C.S. *et al.* (1997) Fabrication of low-stress dielectric thin-film for microsensor applications, *IEEE Electron Device Lett.*, **18**, 599.
- Cote, D.R. *et al.* (1995) Low-temperature CVD processes and dielectrics, *IBM J. Res. Dev.*, **39**, 437.
- Fang, W. and C.-Y. Lo (2000) On the thermal expansion coefficients of thin films, *Sens. Actuators*, **84**, 310.
- Hansen, M. and K. Anderko (1958) **Constitution of binary alloys**, 2nd edn, McGraw-Hill.
- Hilleringmann, U. and K. Goser (1995) Optoelectronic system integration on silicon: waveguides, photodetectors, and VLSI CMOS circuits on one chip, *IEEE Trans. Electron Devices*, **42**, 841.
- Kang, U. *et al.* (1999) Pt/Ti thin film adhesion on  $\text{SiN}_x/\text{Si}$  substrates, *Jpn. J. Appl. Phys.*, **38**, 4147.
- Kools, J.C.S. *et al.* (2000) Deposition technology for thin film magnetic recording heads reader fabrication, *Thin Solid Films*, **377–378**, 705–711.
- Mann, R.W. *et al.* (1995) Silicides and local interconnections for high-performance VLSI applications, *IBM J. Res. Dev.*, **39**, 403.
- Martinu, L. and D. Poitras (2000) Plasma deposition of optical films and coatings: a review, *J. Vac. Sci. Technol.*, **A18**, 2619–2645.
- Murarka, S.P. (1993) **Metallization, Theory and Practice for VLSI and ULSI**, Butterworth-Heinemann.
- Pilvi, T. *et al.* (2008) Atomic layer deposition of  $\text{MgF}_2$  thin films using  $\text{TaF}_5$  as a novel fluorine source, *Chem. Mater.*, **20**, 5023–5028.
- Piramanayagam, S.N. (2007) Perpendicular recording media for hard disk drives, *J. Appl. Phys.*, **102**, 01301.
- Raaijmakers, I.J. *et al.* (1990) Microstructure and barrier properties of reactively sputtered Ti-W nitride, *J. Electron. Mater.*, **19**, 1221.
- Rossnagel, S.M. *et al.* (1996) Thin, high atomic weight refractory film deposition for diffusion barrier, adhesion layer and seed layer applications, *J. Vac. Sci. Technol.*, **B14**, 1819.
- Thornton, J.A. (1986) The microstructure of sputter-deposited coatings, *J. Vac. Sci. Technol.*, **A4**, (6), 3059.
- Vallat-Sauvain, E *et al.* (2000) Evolution of microstructure in microcrystalline silicon prepared by very high frequency glow-discharge using hydrogen dilution, *J. Appl. Phys.*, **87**, 3137 ...
- Wang, S.-Q. and J. Schlüter (1996) Film property comparison of Ti/TiN deposited by collimated and uncollimated physical vapor deposition techniques, *J. Vac. Sci. Technol.*, **B14**, (3), 1837.
- Wang, S.-Q. *et al.* (1996) Step coverage comparison of Ti/TiN deposited by collimated and uncollimated physical vapor deposition techniques, *J. Vac. Sci. Technol.*, **B14**, (3), 1846.
- Wang, Y.Y. *et al.* (1998) Synthesis and characterization of highly textured polycrystalline AlN/TiN superlattice coatings, *J. Vac. Sci. Technol.*, **A16**, 3341.
- Xu, Y.P. *et al.* (1992) A study of sputter deposited silicon films, *J. Electron. Mater.*, **21**, 373.

# Pattern Generation

A pattern generation tool transcribes the chip design data into a physical structure. It must be able to expose single pixels to create all possible designs, and expose them fairly fast since designs can consist of millions of pixels. Sometimes these patterns are written directly on the wafer in question, and the method is called direct write, but more often these patterns are written on a glass plate to produce photomasks for optical lithography. UV illumination through a photomask will expose all the patterns simultaneously, enabling very fast patterning, once the mask has been written.

Direct write is akin to writing with a pen: it is easy to write short notes, and you can change your mind on the fly to create new motifs. However, if you want to write a longer story, it can certainly be done, but it will take time. And if you want to make copies of your work, each copy takes the same time to write as the first one. With photomasks the time to make the first pattern is the same as in direct writing, but the subsequent copies can be made very quickly. However, if you want to change something, you need to create another original pattern. So optical lithography is akin to a printing press, with the same strengths and weaknesses as printing, compared to handwriting.

## 8.1 Pattern Generators

The first pattern generator consisted of a mechanical stage, aperture blades and a UV lamp (Figure 8.1). The wafer is covered with photoresist, a layer of photosensitive polymer. Resists come in two flavors, like photographic films: positive and negative. With positive resists the exposed areas will become soluble in developer, while with negative resists the exposed areas are crosslinked and become insoluble. To create a pixel, aperture blades are sized and positioned, followed by the exposing flash. After mechanical movement of the stage, the aperture sizing



**Figure 8.1** Optomechanical pattern generator: stage movement, aperture blade and flash bulb expose the pattern pixel by pixel

operation and flashing are repeated, with an operating frequency of about 1 Hz. This method was employed in the early era of microfabrication when linewidths were above 10 µm.

Today, if simple and fairly large patterns are needed, laser printers can be used to write the patterns. They are, after all, designed to write arbitrary shapes. While office laser printers can produce lines in the 100 µm range, more advanced laser writers are used in the printed circuit board industry, enabling linewidths down to a few tens of micrometers.

Dedicated laser mask enable submicron linewidths to be written. Even smaller features can be exposed by focused electron or ion beams. The beam can be scanned very quickly, up to 500 MHz, but modern integrated circuits can consist of billions of pixels, and writing times can be days for advanced chip designs.

Electron and laser beam systems are the standard tools for pattern generation. We will first discuss issues relevant to both, such as how to break up a complex chip design

into a form that can be exposed pixelwise, and how to stitch together large patterns from very small pixels.

First, the design data has to be fractured to suit the beam writer. All patterns must be broken down to squares or rectangles. This generates enormous data files: even a fairly simple chip design can be gigabytes after fracturing. Integrated circuits are amenable to a Manhattan street plan of  $x$  and  $y$  coordinates, while for instance CDs require a spiraling “street plan.” Pattern generators exist specifically designed for both basic types. A fractured pattern using an  $xy$ -oriented machine is shown in Figure 8.2. Note that circular and wedge shapes are approximated. If the beam writer has a small address grid, and a small spot size, this approximation can be made increasingly accurate but at the expense of writing time. If the pixel edge is halved, then the number of pixels to be drawn is quadrupled, meaning that writing time is also fourfold.

In addition to spot size, it is important to consider the address grid of the mask writer. The address grid determines how small increments can be drawn. While a beam spot is for example 100 nm, an address grid can be for example 25 nm. It is determined by stage mechanics and the laser interferometer measurement system.

The simplest writing strategy is raster scan: the beam scans all over the plate and at each spot an exposure/no-exposure decision is made. This works fine for dense patterns, but if the design consists of a few patterns far apart, the raster scan wastes time. On the other hand, writing time is independent of design.

Vector scanning enables skipping of empty (non-exposed) spaces, thus faster writing, at the expense of system complexity: managing gigabytes of data at 100 MHz data rate is a formidable task. Position accuracy is also more difficult to realize than in a raster scan: rapid sweeping from field to field is not as accurate as step by step from pixel to neighboring pixel. Raster scan and vector scan are compared in Figure 8.3.

Basically a single shot can expose a single pixel, but in practice the minimum pixel is exposed by for example



**Figure 8.2** Circular shape reconstructed as rectangles ready for pixelwise writing



**Figure 8.3** Raster scan and vector scan: in raster scan every pixel is scanned while in vector scan empty spaces are skipped

5 × 5 shots. There are several reasons for this. First of all, edge acuity can be much better if multiple, overlapping shots are used. In the case of a single shot per pixel, there is the danger of statistical errors: a line would be discontinuous if a single pixel were exposed incorrectly.

The ultimate resolution of electron or laser beam writing is only applicable to a small area, for example a field of  $250 \times 250 \mu\text{m}^2$ . This area can be scanned electromagnetically (e-beam) or acousto-optically (laser beam). If a larger area needs to be drawn, additional movements must be introduced. The stage scan is a mechanical movement, controlled by an interferometer. Patterns at adjacent fields must be accurately stitched together at intersections. For instance, the stitching error of subfields can be made as small as 6 nm in the  $x$ -direction, but not in the  $y$ -direction, because the former depends on beam scanning but the latter on mechanical stage movement

Writing time can be limited by several factors which depend on pixel size ( $d$ ), total area ( $A$ ), required exposure dose (resist sensitivity,  $S$ ), beam current ( $I$ ), electronic scan frequency ( $f$ ), mechanical scan speed ( $v$ ), electronic scan length ( $l$ ) and mechanical scan length ( $L$ ). First of all, the area that needs to be exposed by the current in use gives time  $\tau_1$ :

$$\tau_1 = AS/I \quad (8.1)$$

Exposed pixel size  $d$  affects writing time via  $\tau_2$ : when the same area is broken up into smaller pixels, more shots are needed. If the beam writer’s electronic scan frequency can be increased,  $\tau_2$  can be decreased:

$$\tau_2 = \frac{A}{fd^2} \quad (8.2)$$

The wafer stage mechanical movement time must be considered for a complete system, which gives  $\tau_3$ :

$$\tau_3 = A/Lv \quad (8.3)$$

Assuming the following parameters

$$\begin{aligned} A &= 10 \text{ cm}^2 \\ f &= 100 \text{ MHz} \\ d &= 100 \text{ nm} \\ L &= 10 \text{ cm} \\ l &= 250 \mu\text{m} \\ v &= 10 \text{ cm/s} \end{aligned}$$

and calculating for three different resist sensitivities ( $1, 10$  and  $100 \mu\text{C}/\text{cm}^2$ ), we get  $\tau_1$  exposure times of  $100, 1000$  and  $10000 \text{ s}$ , respectively. Mechanical movements are not limiting ( $400 \text{ s}$ ), but a  $100 \text{ nm}$  pixel size at  $100 \text{ MHz}$  results in  $10000 \text{ s}$  for  $\tau_2$ . Therefore, resist choice is not very critical because  $\tau_2$  is the limiting time. A rough estimate for mask writing time in this case is 3 hours.

## 8.2 Electron Beam Lithography

In this section we will study issues specific to electron beam writing. Electron beam spots in the  $5 \text{ nm}$  range are available. This is not limited by the wavelength of electrons ( $\lambda = 8 \text{ pm}$  for  $25 \text{ kV}$ ) but rather by electron source size and electron optics aberrations and diffraction for highly collimated beams. Beam current cannot be increased indefinitely because of Coulomb repulsion. Microampere currents are already quite intense, and  $2 \mu\text{A}$  beam current will lead to spreading of  $100 \text{ nm}$  because of interaction aberration.

One of the distinguishing features of electrons is that electrons are light mass objects and when they hit photoresist with high energy (typically  $10\text{--}50 \text{ kV}$ ) they scatter. Even though the beam spot is very small, scattering broadens the beam inside the resist and the exposed area is much larger than the beam spot size (Figure 8.4).

As can be seen, thinner resist would be beneficial because the beam will traverse it with less scattering. However, thin resist is problematic in further processing because it does not block ion implants or withstand harsh etching conditions. Thin resists are also more likely to be defective, with pinholes, lowering yield. Higher beam energy will help: the traversed electrons scatter less if they are energetic. However, these high-energy electrons do not increase resist exposure directly in proportion to their energy, therefore high energy is not a straightforward way to improve resolution.

Electrons scatter both forward and backward. These high-energy electrons are, however, not the major component of resist exposure: most of the resist exposure comes from low-energy secondary electrons which have been created when the beam slows down. These electrons have energies less than  $400 \text{ eV}$  and mostly less than  $200 \text{ eV}$ .



**Figure 8.4** The effects of resist thickness and electron beam acceleration voltage on beam scattering and resolution. Note that almost all electrons traverse the resist and penetrate the wafer

The corresponding travel distances of such electrons in the resist are  $10 \text{ nm}$  and  $5 \text{ nm}$ , respectively. Therefore the minimum exposed size is limited to about  $10 \text{ nm}$ . But there are many other factors contributing to final feature size, for example the development of exposed patterns (the analogy to photography is obvious: it is not only the quality of the negative that affects print quality, but also all the other steps, e.g., development time, concentration and temperature, which make a contribution).

An approximation to effective beam diameter in resist is given by

$$d_{\text{eff}} (\text{nm}) = 0.9 \times \left( \frac{t}{V} \right)^{1.5} \quad (8.4)$$

where resist thickness  $t$  is in nm and voltage in kV. Ideal photoresist should remain unaffected by the beam until a threshold dose is delivered, and then be fully exposed. Such an idealized resist response is shown as the box-like curve in Figure 8.5 (negative resist is assumed, i.e., exposed resist determines the line). Below threshold dose no pattern is formed, and above it a line of constant width is produced. Real resists and lithographic processes have finite contrast. Because of scattering and natural variability, some parts of the resists are exposed and a narrow line is formed at very low dose, and subsequently wider lines as the dose increases.

Some electrons experience backscattering (large-angle scattering) with about micrometer range. Exposure dose



**Figure 8.5** Dose–linewidth relationship in e-beam exposure



**Figure 8.6** Proximity effect: electron backscattering at resist/substrate interface leads to increased exposure in the space between the lines. Dose thus depends on the neighboring structures

thus depends on the neighboring structures. This is known as the proximity effect, see Figure 8.6.

The proximity effect can be combated by biasing structures smaller or larger so that the final pattern is of the desired size and shape. This is, however, a formidable task because there are a myriad of possible combinations of shapes in microstructures. It also slows down writing considerably, because proximity correction means more complex shapes, as will be discussed in Chapter 11, in the context of optical proximity correction.

To determine how many electrons or photons are needed to expose a pixel, we need to understand the sensitivity of photoresists (yes, they are called photoresists also in the case of electron beam lithography). There are dozens of different formulations for e-beam resists, some of which are listed in Table 8.1 below. There is an unfortunate inverse relationship between the resist sensitivity (measured as charge per area needed to expose it) and the minimum linewidth.

This sensitivity–linewidth tradeoff can be gauged as follows. Let us assume 100 nm by 100 nm pixels and resist sensitivity of  $0.3 \mu\text{C}/\text{cm}^2$ . The number of electrons needed to expose the pixels is then 186. This is precious few electrons, and statistical variations are great. In order to write  $10 \times 10 \text{ nm}$  pixels, two electrons would suffice, but in practice less sensitive resist is used to average out random variation.

While electron beam lithography is the workhorse of nano- and microfabrication, it has many difficulties. The slow writing speed for large areas is a major drawback, but because there really are no competing technologies in the 100 nm feature size range, electron beam writing must be employed. There are also challenges in meeting all the criteria simultaneously: for example, pattern density affects local heating by electron-deposited energy, and the conductivity of the substrate affects charging, which

can lead to the deflection of electrons. Despite all these problems, e-beam lithography is widely practiced as a research tool for writing test structures and devices, and in the production of photomasks for optical lithography. There are also some device manufacturing applications, in devices where very small series are produced. In those cases the cost of a mask is not spread over many devices, and direct writing does make economical sense.

**Table 8.1** E-beam resists: sensitivity vs. feature size

| Resist | Sensitivity<br>( $\mu\text{C}/\text{cm}^2$ ) | Beam energy<br>(kV) | Minimum feature<br>size (nm) |
|--------|----------------------------------------------|---------------------|------------------------------|
| PMMA   | 300                                          | 50                  | 10                           |
| EBR-9  | 25                                           | 50                  | 200                          |
| AZ5206 | 6                                            | 10                  | 250                          |
| COP    | 0.3                                          | 10                  | 1000                         |

### 8.3 Laser Pattern Generators

Laser beam pattern generators work on similar principles as e-beam systems, but there are a few important differences which make laser beam writing faster and cheaper. First of all, laser writing is done in room ambient, while e-beam systems operate in vacuum. The costs of the vacuum vessel and pumps are eliminated, and temperature control is easier in room ambient than in vacuum. Delicate mechanical movements of the stage are difficult to implement in vacuum, when it is necessary to minimize particles simultaneously. The mechanics of a laser pattern generator are shown in Figure 8.7. Mechanical movements are measured by interferometric techniques, and stage positioning uses piezoelectric translators for achieving nanometer position resolution.

Because laser beams use visible and UV wavelengths, a wide variety of photoresists developed for optical lithography are available to laser beam writing. Laser beam systems can write for example  $0.6\text{ }\mu\text{m}$  minimum lines at  $1\text{ cm}^2/\text{min}$  and  $6\text{ }\mu\text{m}$  lines at  $100\text{ cm}^2/\text{min}$ . Laser pattern generators are used whenever their linewidth resolution is adequate, and only when lines smaller than laser resolution are needed are the slower, more expensive e-beam systems used. In a CMOS process some mask levels may be written by e-beam and some by laser systems. Laser writing completely dominates mask making for flat-panel displays and other large-area applications, which, usually, work with larger linewidths.

### 8.4 Photomask Fabrication

Instead of directly writing millions of pixels on a wafer, beam writers can be used to write photomasks for optical lithography. Photomasks are glass plates with chromium



**Figure 8.7** Mechanics of the laser mask writer stage



**Figure 8.8** A 7 inch photomask: a chromium layer 100 nm thick on a glass plate holds the desired patterns

layers (about 100 nm thick) on them (Figure 8.8). The beam writer exposes the resist on the mask plate, in identical fashion to writing patterns on a wafer (just replace the wafer in Figure 8.1 by a mask plate).

Optical lithography with photomasks is the dominant patterning technology because optical exposure is fast: illumination through a photomask exposes up to  $10^{10}$  pixels in less than 1 second. The enormous throughput advantage of optical exposure warrants the making of mask plates, which can be costly: a set of 15 plates (needed for  $1\text{ }\mu\text{m}$  CMOS processes) costs \$15 000; and an advanced 45 nm process requires \$1 million mask sets (\$30 000 dollars each for 35 plates).

Optical lithography can be done with reduction optical systems (to be discussed in Chapter 10), which means that the patterns on the mask are larger than final structures on the wafer. This is a great relief for mask makers: the 100 nm final size on a wafer corresponds to 400 nm on the mask when  $4\times$  reduction optics is used. This is a major benefit of optical lithography compared to X-ray lithography and nanoimprint (Figure 1.9) which require the mask pattern size to be the same as the final pattern size on the wafer.

Soda lime glass plates are used for larger linewidths ( $>3\text{ }\mu\text{m}$ ) and quartz (fused silica) is the material of choice for micron and submicron work. This is for both thermal and optical reasons: making smaller patterns calls for shorter wavelength exposure and quartz transmission in ultraviolet wavelengths is superior to soda lime (Figure 8.9). The mask plate heats up during exposure, and thermal expansion needs to be addressed. Quartz has a much smaller coefficient of thermal expansion



**Figure 8.9** Optical transmission of various glasses 1 mm thick: 1, ultrapure quartz; 2, common quartz; 3, very pure soda lime glass; 4, common soda lime. Reproduced from Scholze (1977), with kind permission of Springer

than soda lime, therefore it maintains its dimensions more accurately.

Photomasks with chrome-on-glass also go by the name of binary masks, because there is either transmission or blockage of light, but nothing else. In a phase shift mask (PSM), the phase of the light is manipulated while traversing the mask. PSMs will be discussed in Chapter 10. If the mask is mostly covered by chrome, with only a small percentage of transparent area, it is said to be a dark-field (DF) mask (as in Figure 8.8); if it is mostly transparent, with only a small percentage of chrome, it is designated a light-field (LF) mask, also known as a bright field (BF) mask.

The processes needed to fabricate masks is shown in the process flow below. In addition to pattern writing, there are many other process steps, especially etching, which will affect the final pattern quality. There are also many inspection, measurement and repair steps, which, in fact, make up a major part of mask price.

### Process flow for mask fabrication

1. Mask blank preparation  
deposition of chrome on quartz  
resist application
2. Pattern writing  
e-beam or laser
3. Pattern processing  
resist development  
chrome etching  
resist stripping

4. Metrology  
CD (critical dimension) control
  5. Inspection for pattern integrity  
defects (in chrome)  
pattern fidelity (shape and position)
  6. Cleaning  
particle removal
  7. Repair  
focused ion beam etching and/or deposition
  8. Final defect inspection
- Adapted from Skinner *et al.* (1997)

## 8.5 Photomask Inspection, Defects and Repair

Photomask fabrication requires, in addition to scanning beam equipment, a repertoire of inspection and repair equipment. Three basic control measurements for masks are linewidth, position and defects. Linewidth is a local measurement over a test structure pattern. With linewidths in the micrometer range, measurement should be able to discern about 10 nm. Pattern position is a global measurement and it is usually fixed to a mask writing tool, controlled by a stage interferometer, and measured to about 10 nm accuracy over a mask plate size of 10 cm.

Two basic inspection strategies are used: optical inspection combined with comparison to a known perfect mask plate (known as die-to-die) or comparison between design data and the finished mask plate (die-to-data). Defects on the mask are fatal because they will be reproduced on all the wafers. Defects can be classified into two broad categories of soft defects and hard defects. Soft defects are mainly particles or resist residues that can be cleaned away. Hard defects are permanent spots or scratches in chrome or in quartz.

Defects come in many guises, but from a repair point of view there are two grand classes of defects:

- missing chrome
- extra chrome.

Missing chrome calls for a deposition process. Usually a metallic layer is deposited (e.g., tungsten). Extra chrome calls for an etching process. These repair processes will be discussed in Chapter 24 on serial microprocessing.

The geometric/topological classification of defects (see Figure 8.10 below) is as follows:

- protrusion (extra chrome attached to a feature)
- intrusion (partial loss of chrome in a feature)



**Figure 8.10** Mask defects: extra chrome (protrusion, bridging, pinspot) and missing chrome (necking, intrusion and pinhole). If defect is very small, it may cosmetic only because it does not print in lithography. Redrawn after ref. Skinner

- bridge (chrome connecting two features)
- necking (discontinuity in a line)
- pinhole (hole in a chrome)
- pinspot (extra chrome on a light-field area).

From yield and reliability points of view not all defects are equal. Defect must be understood as a very broad term: anything that prints on the wafer or changes critical dimension by more than 10% counts as a defect. This can be a light transmission error (e.g., roughness in quartz), a pattern error, a stochastic scratch or an undulating line edge.

Defect size is important: not all defects are able to destroy the functionality of the chip. As a rule of thumb, defects greater than one-third of the minimum linewidth are prospective “killer defects.” Smaller ones will not print on the wafer in the lithography process.

Optical defects not related to written patterns include:

- transmission variability in glass (LF areas)
- transmission variability in chrome (DF areas).

Transmission defects are subtle and, even if detected, their repair is not straightforward. Phase shift mask making is very expensive partly because of difficulties in the inspection and repair or transmission defects.

## 8.6 Photomasks as Tools

Photomasks are used to make devices, but they also serve as tools for process and device engineers. As shown in



**Figure 8.11** Test structure for lithography and etching: the central line is surrounded by dark-field and light-field areas, and it is found as an isolated line as well as an array line. In an ideal case the linewidth should be independent of its neighborhood

Figure 1.4, there are not only device chips on the wafer, but test chips as well. Process engineers want to see the resolution of the optical lithography process, and this is checked by the linewidth test structures. Process robustness is tested by implementing it on photomask structures which span a range around the baseline fabrication process. For example, if the design linewidth is 3 µm, test structures may span the range 1–10 µm. The same applies for spaces between the lines. Linewidth is dependent on the neighboring structures, therefore test structures should include lines of different kinds: isolated, nested, dense and sparse, as shown in Figure 8.11.

Device engineers design different geometries of devices, for example rectangular and octagonal inductor coils, or straight and meandering resistors. For transistor parameter extraction a set of test transistors with dimensions of, say, 2, 3, 5, 10, 20 and 50 µm is used.

### 8.6.1 Photomask requirements

Because of high cost and long delivery times, it is important to specify masks correctly, and not to overpay for them. Not all lithography steps are equal: some are critical for device performance and the highest possible mask quality must be specified. In CMOS this applies to the gate and contact level masks. The metallization levels are usually less demanding, and the masks can be made to looser specifications. And there are levels with even looser requirements: bonding pads for wire bonding or fluidic ports are tens or hundreds of micrometers in size. They can be realized by even cheaper masks. Many packaging applications require compatibility with wafer processing, but make do with cheaper masks. As stated before, the various inspection and measurement steps take up a major part of photomask price. Mask buyers can



**Figure 8.12** Thick photoresist patterning: with chrome mask (a) and with plastic mask (b). Reproduced from Lee *et al.* (2001), with kind permission of Springer Science and Business Media

specify defects, and accept imperfect plates with some defects which have been classified as non-fatal.

In a university lab, mask costs can be brought down by adopting completely different technologies for different mask levels. The cost of laser-printed overhead transparencies is next to nothing, but they are suitable for structures in the size range of hundreds of micrometers. Polymer-based masks suffer from wear and tear and from dimensional instability. It also takes some effort to insert plastic masks into a mask aligner designed for chrome-on-glass plates.

In addition to linewidth, other issues need to be addressed in selecting mask technology. Sometimes large-linewidth structures require very smooth edges, for example some optical devices. And if rounded structures are needed, the number of polygons used to approximate

circular shapes becomes important. These issues are depicted in Figure 8.12, showing the difference between chrome masks and plastic masks. Additionally, plastic masks may look opaque to the eye but still leak light in the UV, rendering them unusable, especially for thick resists which require long exposure times.

## 8.7 Other Pattern Generation Methods

There are needs for masks at different linewidths and cost levels. There is the consideration between mask making and direct writing when production runs are small. Alternative pattern generation methods have been devised, but so far they have not become competitive with e-beam and laser beam methods for mainstream applications.

One alternative technology is based on SLM (Spatial Light Modulators) technology, namely arrays of micromirrors made by microfabrication. They can be used as “programmable masks” or “virtual masks” which are projected on a wafer. Applications of this approach are shown Figures 23.2 and 39.21, and the actual micromirrors are detailed out in Figures 29.25 and 39.21.

All the above methods are general purpose pattern generation methods: they can produce any shape the designer has imagined. There are also many methods to produce certain shapes very effectively. For instance, interference lithography (holographic lithography) is based on intersecting laser beams, and it is a very convenient way to produce regular arrays of lines and spaces, for example optical gratings. Di-block copolymers are systems that will phase segregate and end up, for example, with dot and hole patterns. Other shapes can be made, but each pattern needs a different block copolymer. Microbeads can form regular hexagonal arrays during liquid drying and have been used to make photonic crystals. Electrochemical etching of aluminum can result in a regular hexagonal lattice of holes. Some of these special techniques will be discussed in Chapter 23.

## 8.8 Exercises

1. How far will (a) a 10 keV e-beam penetrate into silicon and (b) a 50 keV beam into quartz?
2. What is the difference between writing masks and writing directly on silicon from the viewpoint of an electron beam writer?
3. What is the smallest possible feature size that can be written with a 50 keV electron beam?
4. What is the photomask writing time for a gigabit circuit with 1 000 000 000 contact holes of 90 nm diameter, when the incrementing rate is 500 MHz and the mask plate area 10 cm by 10 cm? The photomask is four times the final size.
5. If a 7 inch mask plate for 1 $\times$  lithography with 1  $\mu\text{m}$  linewidth is written by e-beam, what is the limiting process and what would be an estimate of the writing time?
6. How is electron beam system throughput affected if 5 $\times$  masks are drawn, instead of 1 $\times$  masks?
7. Calculate the number of quanta needed to expose resists: electron beam resist sensitivity is 10  $\mu\text{C}/\text{cm}^2$ , and optical resist sensitivity 10  $\text{mJ}/\text{cm}^2$ .
8. Use a laser printer to make simple line/space test structures with different resolutions (say 600 dpi and 1200 dpi) and check by microscope for linewidths, line edge roughness and reproducibility.

## References and Related Reading

- Allen, P.C. (2002) Laser scanning for semiconductor mask pattern generation, *Proc. IEEE*, **90**, 1653.
- Eynon, B. and B. Wu (2005) **Photomask fabrication technology**, McGraw-Hill.
- Hahmann, P. *et al.* (2007) High resolution variable-shaped beam direct write, *Microelectron. Eng.*, **84**, 774–778.
- Kawano, H. *et al.* (2003) Development of an electron-beam lithography system for high accuracy masks, *J. Vac. Sci. Technol.*, **B21**, 823–827.
- Lee, L.J. *et al.* (2001) Design and fabrication of CD-like microfluidic platforms for diagnostics: polymer-based micro-fabrication, *Biomed. Microdevices*, **3**, 339–351.
- McCord, M.A. and M.J. Rooks (1997) Electron beam lithography, in P. Rai-Choudhury (ed.) **Handbook of Microlithography, Micromachining and Microfabrication**, SPIE.
- Meyer, A.R., A.M. Clark and C.T. Culbertson (2006) The effect of photomask resolution on separation efficiency on micro-fabricated devices, *Lab Chip*, **6**, 1355–1361.
- Pease, F.R. (2005) Maskless lithography, *Microelectron. Eng.*, **78–79**, 381–392.
- Pugh, G. *et al.* (1998) Impact of high resolution lithography on IC mask design, Custom Integrated Circuits Conference, IEEE, p. 149.
- Rizvi, S. (ed.) (2005) **Handbook of Photomask Manufacturing Technology**, CRC Press.
- Scholze, H. (1977) **Glas-Natur, Struktur und Eigenschaften**, 2nd edn, Springer.
- Skinner, J.G. *et al.* (1997) Photomask fabrication procedures and limitations, in P. Rai-Choudhury (ed.) **Handbook of Microlithography, Micromachining and Microfabrication**, SPIE.
- Wu, B. (2006) Photomask plasma etching: a review, *J. Vac. Sci. Technol.*, **B24**, 1–15.

# 9

## Optical Lithography

Optical lithography, also known as UV lithography or photolithography, uses UV lamps or UV lasers to expose photosensitive film through photomasks. Chromium patterns on a photomask block light at selected areas, forming patterns in the photosensitive film (which is called photoresist for historical reasons). Once a photomask with millions of pixels has been written, photoresist can be exposed very quickly: typical exposure times are on the order of 1 second.

After a photoresist pattern is thus formed, various possibilities are open: for instance, the open areas of the underlying material can be etched away, in the case of silicon dioxide, by hydrofluoric acid (Figure 9.1). Photoresist is removed after etching. Wafer processing then continues with thin-film deposition, doping or plating steps, and new lithographic steps. The successive layers have to be aligned to each other, so that desired contacts are formed. Overlay of successive layers is a critical factor in lithography.

There are three rather different elements in the optical lithography process:

1. Optics: radiation generation, propagation, diffraction, interference.
2. Chemistry: photochemical reactions in the resist, development.
3. Mechanics: mask to wafer alignment and parallelism, focusing.

We will first go through the lithography process step by step as the wafer sees it. A more detailed discussion of the various steps then follows. All discussion in this chapter refers to  $1\times$  lithography: the patterns on the mask are transferred to the wafer in the same size as they are on the mask. This mode is called contact/proximity lithography. Projection lithography with reduction optics will be dealt with in the next chapter.

### 9.1 Lithography Process Flow

#### 9.1.1 Bake and prime

Practically all microfabrication processes start with some sort of surface preparation step (these will be discussed in detail in Chapter 12). Surface preparation “initializes” the wafer, and the starting conditions will then be independent of preceding process steps or wait time. In lithography, the first preparation step is a baking process. Adsorbed water is removed by baking the wafer at  $100\text{--}200^\circ\text{C}$ .

The next step, wafer priming, is also known as adhesion promotion. Hexamethyl disilazane vapor (HMDS,  $(\text{H}_3\text{C})_3\text{Si-NH-Si-(CH}_3)_3$ ) is applied at reduced pressure. As shown in Figure 9.2, the oxygen in the hydroxyl group at the wafer surface will form a bond with silicon atom in HMDS, and the amine group reacts with hydrogen, releasing ammonia.

The monomolecular silane layer makes the wafer slightly hydrophobic (contact angles  $60^\circ\text{--}70^\circ$  for example), which prevents water readsorption. It also ensures good wetting by photoresist. This is especially important for materials like metals, polysilicon and PSG, because resist adhesion to these materials is poor. Adhesion promotion is also a guarantee against cleanroom humidity variations and a precaution against processing delays, as HMDS primed water retains its water-free condition even for a couple of days.

#### 9.1.2 Spin coating

Spin coating is the standard resist application method. A few milliliters of resist is applied on a static or slowly rotating wafer. Acceleration to about 5000 rpm spreads the resist over the wafer, leaving a very uniform layer (Figure 9.3). Standard resist thicknesses are about  $1\,\mu\text{m}$  when contact/proximity lithography is used.



**Figure 9.1** Making oxide patterns: (a) oxide film deposition; (b) photoresist application (negative resist used); (c) UV exposure through a photomask; (d) development of resist image; (e) etching of oxide; (f) photoresist removal. Courtesy Esa Tuovinen, University of Helsinki



**Figure 9.2** HMDS adhesion promotion (priming)

### 9.1.3 Pre-exposure bake

The spin-coated resist still contains solvent, and a bake step is needed to drive out most of the solvent. This bake is, for example, 90 °C for 30 min in an oven or 90 °C

for 60 s on a hotplate. Care is needed to optimize the bake temperature, because too high a temperature will decompose the photoactive compound, leading to lower sensitivity to exposing radiation. Pre-exposure bake is also known as soft bake or post-apply bake.

### 9.1.4 Alignment

Next, the wafer and the photomask are inserted into a mask aligner. In alignment, the patterns already on the wafer are aligned to the new mask patterns (Figure 9.4).

In the case of the first mask level, no alignment is needed. However, it is useful to print zero-level alignment marks on the wafer using a specific mask, and etch those marks in silicon. This is especially necessary when the first process step does not leave anything visible on the



**Figure 9.3** Spin coating process



**Figure 9.4** Alignment operation: left, wafer with a resistor and alignment marks; middle, photomask with contact holes and alignment marks; right, after linear translation and rotation of the wafer, the alignment marks on the wafer and mask coincide, and consequently the resistor and contact holes too

wafer surface; for example, doping by ion implantation will only insert some dopant atoms inside silicon.

### 9.1.5 Exposure

The mask aligner is, in spite of its name, also an exposure tool. The simplest lithographic technique is contact lithography: the photomask and the resist-covered wafer are brought into intimate contact, and exposed. A mask aligner is shown in Figure 9.5: mercury lamp UV radiation is filtered and uniformly distributed over the wafer. Mechanical parts ensure wafer and mask distance and parallelism. The lithography section of a cleanroom is similar to a dark room in a photographic lab, except that yellow light is used instead of red light, because photoresists are sensitive below 450 nm only.

Resolution is determined mainly by pattern dimensions on the mask. Extremely small patterns can be made in theory by pressing the mask into intimate contact with the resist (called vacuum contact). This is, however, a source of defects when resist debris adheres to the mask.



**Figure 9.5** Exposure/mask aligner tool: lamp–mirror–shutter–filter–lens system, and mask and wafer. Courtesy Süss Microtech

### 9.1.6 Post-exposure bake

Some resists require post-exposure bake (PEB). It leads to the diffusion of photogenerated molecules. These

molecules are responsible for changing the resist solubility in the developer and this diffusion smoothes out optical interference effects. In so-called chemically amplified resists (CARs) the UV exposure only initiates the photopolymerization process, and a bake is absolutely needed to complete it. During the bake, the catalyst molecules generated by UV exposure react and produce the solubility difference between exposed and unexposed resist.

### 9.1.7 Development

In development the soluble parts of the resist are etched away. In positive resists the exposed parts turn to carboxylic acid, which is removed by an alkaline developer, typically 0.26 M TMAH (tetramethyl ammonium hydroxide). In negative resists the exposed parts have been crosslinked and rendered stable, and the unexposed parts are removed by a solvent developer. Just as in photography, it is the combined effects of exposure and development that determine the final outcome. One can compensate the other to a certain extent by tuning it: overexposure and underdevelopment, and vice versa, may save the day (but certainly not lead to optimal results). Rinsing and drying complete the development, again in analogy to photography.

### 9.1.8 Hard bake

Hard bake hardens the resist, which is useful in the subsequent plasma etching and ion implantation steps, which are energetic processes. It is also useful in wet processes like etching or electroplating because it improves resist adhesion. The hard bake temperature is limited by the resist glass transition temperature  $T_g$ : above  $T_g$  the resist will flow, and shallow sloped walls will result (Figure 18.29), while vertical or nearly vertical walls are practically always desired.

### 9.1.9 Inspection and metrology

After hard bake the results of the lithography process are checked. Even when linewidths are below optical microscopy resolution, it is useful as an initial check: for instance, resist adhesion loss and delamination and other gross errors can be seen. Linewidth specification is usually set as  $\pm 10\%$  of design value. Linewidth measurements by mechanical stylus, AFM or SEM form the basis of lithography process control. SEM micrographs of photoresist patterns can be seen in Figure 9.6. If resist patterns are found to be faulty, the photoresist is removed and the wafers resist coated and exposed again. The rework rate is a few percent in manufacturing. It is one of the few process steps where rework is possible.



**Figure 9.6** Linewidth test structure of positive photoresist in SEM micrograph. Reproduced from Roth *et al.* (1999)

## 9.2 Resist Chemistry

In this section we will discuss in more detail the working mechanisms of photoresists. Resists have three main components:

1. Base resin, which determines the mechanical and thermal properties.
2. Photoactive compound (PAC), which determines sensitivity to radiation.
3. Solvent, which controls viscosity.

The most common base resin for positive resists is phenolic novolak, which is soluble in alkaline developers. Diazonaphoquinine (DNQ) photoactive compound acts as an inhibitor, and unexposed resist is therefore nonsoluble in developer. Upon exposure, the DNQ reacts to form an acidic compound which makes the exposed resist soluble. As can be seen in Figure 9.7, water is needed for the reaction to take place. This is one reason for cleanroom humidity control: cleanroom air is the source of this water vapor. Constant humidity around 40% RH is typical of cleanrooms. Another concern is gas evolution: nitrogen gas can lead to bubble formation inside the resist, especially when working with thick resists.

Negative resists can become insoluble because of an increase in molecular weight, that is polymerization. The resist becomes crosslinked either via free radical or acid-catalyzed polymerization. Negative resists are usually developed by solvents. This is a bit of a problem because solvents can swell the crosslinked resist, reducing pattern fidelity.



**Figure 9.7** Diazonaphthoquinine (DNQ) novolak resist reaction upon UV exposure. Photoactive compound reacts to form carboxylic acid, which is soluble in the developer

The crosslinking feature which makes negative resists stable also makes photoresist removal difficult, an obvious dilemma. Many negative resists are stable enough to be used as permanent structural materials in microsystems, especially microfluidics. These will be discussed in Chapter 18.

### 9.3 Resist Application

Spin coating is a very widely used method for photoresist spinning and polymer deposition in general. Spinning is a simple process for viscous material deposition. Spinners, with typical speeds up to 10 000 rpm, are found in every microfabrication laboratory. The main parameters for film thickness control are viscosity, solvent evaporation rate and spin speed. Spin-coated film thicknesses range from 0.1  $\mu m$  up to 500  $\mu m$ , with standard photoresists usually around 1  $\mu m$ . Special features of thick resists will be discussed in Chapter 18 on polymer microprocessing.

Depending on the wafer size and desired film thickness, a drop of 1–10 ml (=  $cm^3$ ) is dispensed at the wafer center. Rapid acceleration to for example 1000 or 5000 rpm spreads the liquid toward the edges. Half of the solvent can evaporate during the first few seconds, so rapid acceleration is essential because viscosity increases upon drying, and radially non-uniform thickness will result from viscosity differences. Spin speed can be controlled to about  $\pm 1$  rpm; this is important because  $\pm 50$  rpm will result in 10% thickness differences.

Film thickness depends mainly on resist viscosity ( $\eta$ ) and spin speed ( $\omega$ ), according to

$$t \propto \sqrt{\frac{\eta}{\omega}} \quad (9.1)$$

Spin speed can be used to tailor resist thickness over one decade, for example 0.5–5  $\mu m$ , but beyond that a new resist formulation with a different viscosity must be used. Viscosity is dependent on resist solid content (which can vary from 20 to 80%) and temperature. The solvent evaporation rate depends on the ambient environment, and a closed spinner bowl with saturated solvent vapor and an adjustable exhaust from the spinner bowl can both be used to control evaporation.

Spin coating is very good in terms of film uniformity:  $\pm 5$  nm can be achieved for 1  $\mu m$  films, but for thicker films it is not so good, but for example 100  $\mu m \pm 5 \mu m$ . Because spin coating works on liquid materials, smoothness is excellent: RMS roughness of spin-coated films is comparable to the initial smoothness of a silicon wafer, for example 0.2 nm.

Turbulence (both from the spin process itself and from cleanroom airflows) and ambient humidity (which is affected by the exhaust from the spinner bowl and cleanroom environmental controls) affect evaporation rate and consequently film thickness. Turbulence sets in earlier in larger systems, and spin speeds for 300 mm wafers are limited to about 2000 rpm to avoid turbulence. Pinhole defects in spin-coated films are thickness dependent: thinner films are more defective. Pinholes can be caused by particles on wafers, but also by particles in the dispensed fluid, even though all chemicals in microfabrication have been filtered with submicron filters. Air bubbles formed during dispensing (caused by an unclean dispense tip, for example) can cause either pinholes or large bubbles, in the millimeter range.

Resist is expelled over the edge of the wafer during spinning. At the wafer edge, however, a balance is formed where drying increases viscosity and reduces flow, and finally surface tension becomes the dominant force, leading to the formation of a bulge. This bulge is known as edge bead. Edge bead height sets a minimum mask-to-wafer distance and often is so high that an optimum distance (gap) cannot be set. Edge bead removal (EBR) is a process where a directed solvent jet etches the resist away from the wafer edges.

#### 9.3.1 Resist profile over topography

Spin processing over severe topography is difficult: liquid-like film will fill grooves and crevasses, and highly non-uniform thickness results. On the other hand, this planarizing effect is sometimes used to advantage: spin coating fills the gaps and smoothes out topography. Spin-on-glass planarization works this way (see Figure 16.7).

There are other resist coating techniques: namely, spray coating, electrochemical coating and casting.



**Figure 9.8** Resist over topography: (a) spin coated; (b) spray coated; (c) cast; (d) laminated dry film

Electrochemical coating requires special resist formulations, and is applicable to conductive surfaces only. Spray is applicable to thin resists. Casting is suitable for thick resists only. These techniques are especially suited to applications where resist coverage is needed over severe topography. And if there are holes in the wafer, as in MEMS devices after through-wafer etching, spin coating is unsuitable for many reasons: the resist would spill into the holes, and the vacuum chuck would not be able to hold the wafer during spinning (unless a special design were used). Still another way is to use dry film resists: negative resists are available as sheets that can be laminated on a wafer. These different resist coating techniques over an etched cavity are shown in Figure 9.8.

## 9.4 Alignment and Overlay

Because microdevices are built up layer by layer, overlay of successive layers on top of previous layers is a paramount performance criterion of optical lithography align/exposure tools. Overlay refers to general pattern placement, and alignment refers to specific spots on the wafer, namely the alignment marks (or alignment keys or targets), which are used for the alignment procedure. Because alignment is limited to specific structures (usually on the wafer or chip edge), it is not a full guarantee of overlay elsewhere. Overlay is affected by lens aberrations, wafer chuck irregularities (equipment-related problems), mask pattern misplacement (mask fabrication problems), or distortions of the wafer itself, such as warping or site flatness. We will, however, use the term alignment because it is an easy operational concept. The term mask aligner nicely underlines the importance of alignment. A contact/proximity aligner that can print  $3\text{ }\mu\text{m}$  minimum lines is typically capable of  $1\text{ }\mu\text{m}$  alignment accuracy between levels.

Alignment needs to be evaluated over a long time: device fabrication processes take weeks or even months.

For example, temperature differences between different exposures will affect alignment because of thermal expansion of the wafer, the wafer stage and the photomask. The lenses in the optical path of the exposure tool are subject to constant UV flood, and they too need to be thermally stabilized.

Alignment needs to be discussed from two rather different points of view:

1. **Equipment view:** This is an optomechanical problem of finding alignment marks on the mask and on the wafer, and manipulating them to coincide.
2. **Device design view:** This is a design issue and it depends on how structures need to overlap or relate to each other, for instance metallization has to overlap contacts and a guard ring has to surround the active device.

Alignment could be done using the devices themselves, but this is impractical because of the micrometer dimensions and multiple identical structures. Therefore separate alignment marks are used. Alignment marks are much larger than device features because they exist only for alignment and have nothing to do with resolution. Alignment is usually done on a wafer level, with two alignment marks as far from each other as possible, to increase theta (rotational) resolution.

The alignment sequence determines which layers are aligned to each other. Layers are not usually aligned sequentially to the previous layer, but to some important previous layer. The capacitor top electrode is aligned to the bottom electrode, even though other steps might have been done in between.

## 9.5 Exposure

A mercury lamp provides strong spectral peaks at wavelengths of 436, 405 and 365 nm. Typical exposure energies for standard positive novolak resists is in the range of  $100\text{ mJ/cm}^2$ , which translates to exposure times of a second or a few seconds. Thicker resists require larger exposure doses, and the times can be even minutes.

Proximity lithography is a modification of contact lithography: a small gap, for example  $3\text{--}50\text{ }\mu\text{m}$  is left between the mask and the wafer (Figure 9.9). Resolution is now not simply dependent on feature size on mask, but diffraction effects have to be accounted for. Both contact and proximity lithography are done in one and same machine: the gap between the mask and the wafer is an adjustable parameter, with values from zero upward. Contact/proximity lithography systems are  $1\times$ : the image is the same size as the original.



**Figure 9.9** Contact lithography: mask and wafer are in intimate contact. Proximity lithography: a gap of typically 5–50 µm is instituted between resist-coated wafer and the photomask

## 9.6 Resist Profile

Perfectly vertical resist walls ( $90^\circ$ ) are difficult to make. Positive resists usually have a slightly positive slope,  $85^\circ$ – $89^\circ$ , negative resists have typically similar negative profile. This is a natural consequence of exposure light intensity through the mask (Figure 9.10). Due to diffraction, some light is exposing resist underneath the opaque parts of the mask. With positive resists this exposed part becomes soluble, and a positive slope results. With negative resists the diffracted light contributes to crosslinking reactions, making the exposed parts less soluble in developer, resulting in a negative (also called retrograde) profile.



**Figure 9.10** Diffraction at chrome pattern edges affects photoresist profile: positive resist (left) with exposed parts developed away leads to positive slope; negative resist (right) with exposed parts hardened leads to negative profile

5 µm and 1 µm resist thickness, minimum lines of 1.5 µm (and 1.5 µm spaces between them) can be obtained.

With thick MEMS resists minimum linewidths are clearly much larger. This is shown in Figure 9.11 where patterns of various sizes are printed in resist 7 µm thick. Four different exposure modes are used and the best results are achieved for vacuum contact between the mask and the resist-coated wafer. Hard contact results are also good, but for soft contact and 20 µm proximity the smallest 2.5 µm feature is not exposed, and the profile of the resist gets fairly shallow.

X-ray lithography is  $1\times$  proximity lithography, too. Wavelength is just much smaller, for example 4 nm, enabling much smaller minimum lines. However, making masks with very small dimensions is problematic.

## 9.7 Resolution

Making individual narrow lines is not a major problem in microlithography, but rather making closely spaced narrow lines. Line plus space, which is microlithographic resolution, is called pitch. An individual narrow line can be made even accidentally by for example overexposure (but the line shape will be far from ideal). Resolution, or the ability to separate two patterns, is the criterion for lithography capability. Proximity lithography minimum resolvable linewidth (half-pitch) is calculated from Fresnel diffraction and approximated by

$$\text{linewidth} \approx \sqrt{\lambda \times \left(g + \frac{d}{2}\right)} \quad (9.2)$$

where  $g$  is the gap and  $d$  the resist thickness. Typical values for wavelength  $\lambda$  of exposing radiation are 436, 405 or 365 nm when a mercury lamp is used as the light source. Gaps range from zero (called vacuum contact) to a few tens of micrometers. With a 405 nm wavelength, gap of

## 9.8 Process Latitude

Many lithography parameters change and drift over time, including resist thickness (which is affected by for example viscosity change due to temperature change), exposure energy (which changes as the lamp ages), developer concentration (which may increase due to water evaporation), and many more. The process window for lithography can be defined in many ways, but exposure dose–development time is useful (Figure 9.12). The process window is the set of



**Figure 9.11** The effect of mask-wafer gap: vacuum contact (topmost) results in good resist profiles, hard contact (2<sup>nd</sup> from top) in acceptable profile while soft contact and 20  $\mu\text{m}$  proximity gap (bottom) result in sloped walls for 5  $\mu\text{m}$  and 10  $\mu\text{m}$  features and incomplete exposure for 2.5  $\mu\text{m}$  feature. Positive resist 7  $\mu\text{m}$  thick. Courtesy Süss Microtech



**Figure 9.12** Process window for lithography process: exposure time vs. development time

exposure dose and development time combinations that results in linewidths which are within specification, typically  $\pm 10\%$ .

Underexposure leads to no pattern formation, and it is therefore a fatal processing error. Overexposure clears all

resist in open spaces, but it also clears resist where it should not, leading to patterns with widths different from design widths. Also, the resist sidewall profile is affected.

## 9.9 Basic Pattern Shapes

There are four basic shapes that have to be patterned: line, trench, hole and dot. More complex shapes can all be patched together from these elementary shapes, as pattern generators do. An opaque chromium line on a mask will end up as a line on the wafer if positive resist is used, but as a trench in the case of negative resist. A transparent opening in chromium will result in a trench with positive mask, and a line with negative resist.

Patterns come in two basic varieties: isolated and array (Figure 9.13). Lithography for these is different, and the ultimate lithographic resolution is also shape dependent. For example, stray light is a major issue for a light-field structures, whereas in dark-field patterns it is not so much of an issue. Dark-field patterns are also less affected by particles: most of them fall on chrome and will not affect patterns.

## 9.10 Lithography Practice

After lithography, various processes are possible, and all of them exhibit different requirements for resists in terms of optimum thickness and sidewall profile, chemical stability, thermal and mechanical specifications, etc. Resists



**Figure 9.13** Isolated vs. array features and light field vs. dark field



**Figure 9.14** Processing after lithography: (a) ion implantation (Chapter 15); (b) wet etching (Chapter 11); (c) molding (Chapter 18); (d) plasma etching (Chapter 11); (e) electroplating (Chapters 5 & 29); (f) lift-off (Chapter 23)

face a serious scaling trade-off: thickness has to be scaled down for better resolution, but etch resistance and implant blocking capability cannot be sacrificed. Also, making resist thinner by a factor of two increases the number of pinhole defects by a factor of 10 or even 100.

Other criteria for resist selection include the following:

- sensitivity (photospeed)
- contrast
- exposure latitude
- shelf life.

In a university setting photospeed is not important, but in manufacturing setting it certainly is. Contrast (to be discussed in the next chapter) is important when vertical walls are needed. Exposure latitude is a measure of linewidth constancy in spite of variations in exposure energy. Shelf life refers to the fact that decomposition of the photoactive compound during storage eventually makes resist useless.

Figure 9.14 shows six different applications for resist patterns. In wet processes (wet etching, electroplating) adhesion is important, while in vacuum processes (ion implantation, plasma etching, lift-off) vacuum compatibility is essential. In molding, the molded piece must be detached from the mold, and surface energy, surface smoothness and sidewall angle are important considerations. The resist requirements for the different applications are listed in Table 9.1.

And of course there is the possibility that the resist structure is the final structure. Such permanent structures have to be evaluated for mechanical and thermal stability, surface properties and chemical tolerance. Chapter 18 will deal with permanent polymer microstructures.

## 9.11 Photoresist Stripping

After photoresist has served its role as a protective layer, it must be removed. There are a number of methods to

**Table 9.1** Resist requirements for different applications

|                                                                                                                                                                                                                                                                                                                                                |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Ion implantation:                                                                                                                                                                                                                                                                                                                              |
| <ul style="list-style-type: none"> <li>resist thickness of 1 <math>\mu\text{m}</math> will stop B, P, As, Sb ions with &lt;200 keV energy</li> <li>beam current heats resist: cooling or current limitation are needed</li> <li>resist carbonizes under heavy doses (<math>&gt;10^{15} \text{ cm}^{-2}</math>), difficult to remove</li> </ul> |
| Wet etching:                                                                                                                                                                                                                                                                                                                                   |
| <ul style="list-style-type: none"> <li>resist adhesion is important, resist may peel off</li> <li>resist will not tolerate strong acids or alkaline etch solutions</li> <li>hot etch baths degrade resist fast</li> </ul>                                                                                                                      |
| Molding:                                                                                                                                                                                                                                                                                                                                       |
| <ul style="list-style-type: none"> <li>smooth surface</li> <li>non-negative profile</li> <li>minimize chemical reactions with polymers</li> </ul>                                                                                                                                                                                              |
| Plasma etching:                                                                                                                                                                                                                                                                                                                                |
| <ul style="list-style-type: none"> <li>resist will be etched in plasma: its size and shape will change</li> <li>resist will be damaged by plasma (both bombardment and thermal effects)</li> <li>removal of damaged resist is difficult</li> </ul>                                                                                             |
| Electroplating:                                                                                                                                                                                                                                                                                                                                |
| <ul style="list-style-type: none"> <li>plating solutions are often chemically aggressive</li> <li>adhesion is important</li> </ul>                                                                                                                                                                                                             |
| Lift-off:                                                                                                                                                                                                                                                                                                                                      |
| <ul style="list-style-type: none"> <li>thickness of the film needs to be less than resist thickness</li> <li>resist sidewall profile preferably retrograde</li> <li>deposition process <math>T &lt; 120^\circ\text{C}</math> because of resist thermal limitation</li> </ul>                                                                   |

accomplish this, listed in Table 9.2. The choice depends on the particular process step, the materials present on the wafer, resist nature and established laboratory practice (which may be determined by historical precedence, environmental concerns or other idiosyncratic factors).

Sulfuric acid is a strong oxidant, and therefore an effective remover of organic materials. The photoresist surface is carbonized during ion implantation but hot sulfuric acid/peroxide mixture can remove it. However, sulfuric acid cannot be used if the wafer is metallized because the acid will etch metals too. Acetone is a fairly mild remover,

**Table 9.2** Photoresist stripping

| Techniques        | Mechanism                            |
|-------------------|--------------------------------------|
| Oxygen plasma     | Oxidation in vacuum                  |
| Ozone discharge   | Oxidation under atmospheric pressure |
| Acetone           | Dissolution in liquid                |
| Ozonized water    | Bond breaking and dissolution        |
| Sulfuric acid     | Oxidation in liquid                  |
| Organic amines    | Oxidation and dissolution in liquid  |
| Hydrogen peroxide | Oxidation in liquid                  |

but it cannot be used if the resist has been damaged or transformed by plasma or ion bombardment. Other organic strippers are amine and phenolic based, and more effective than acetone. Oxygen plasma is the general purpose resist stripping technique. Ozone is similar to oxygen plasma: highly reactive  $\text{O}_3$  is generated in the reactor and reacts with organic materials. It is a common practice to use two-step resist stripping: plasma (dry) removal followed by wet removal for best results.

Ultrapure ozonized water UPW- $\text{O}_3$  (10–100 ppm ozone in DI water) is potentially a major cost reduction invention in stripping. Strip rates of 150 nm/min can be achieved, and utilization of ozone is very efficient even though a simple chemical reaction might suggest otherwise:



$\text{CH}_2$  can be used as a model molecule for photoresist. This calculation shows that 10.3 g of ozone is needed to remove 1 g of resist; that is, a batch of 25 wafers (200 mm diameter wafers) would need about 10–100 kg of ozonized water depending on pattern density. Fortunately much less is needed: ozone breaks up longer molecules, and the smaller molecules are water soluble.

## 9.12 Exercises

- What fraction of resist ends up on the wafer in spin coating?
- What is the ultimate resolution in optical contact lithography?
- What is the best possible resolution of X-ray contact/proximity lithography?
- A silicon wafer of 100 mm diameter has 1  $\mu\text{m}$  lines fabricated on it. The photomask is made of soda lime glass with a coefficient of thermal expansion (CTE) of 10 ppm ( $10 \times 10^{-6}/^\circ\text{C}$ ). How accurately must the temperature in the patterning process be controlled in order to keep distortions from thermal expansion over

- the 100 mm wafer below  $0.3 \mu\text{m}$ ? Silicon CTE is  $2.5 \times 10^{-6}/^\circ\text{C}$ .
5. Plot minimum feature size in proximity lithography as a function of resist thickness from  $1 \mu\text{m}$  to  $1000 \mu\text{m}$  thicknesses.
  6. If 5000 rpm spinning produces a resist  $1 \mu\text{m}$  thick, plot resist thickness at other spin speeds!
  7. How do light-field and dark-field photomasks differ with respect to particle contamination?
  8. If a wafer with a resist  $350 \mu\text{m}$  resist is baked on a hotplate which is  $0.1^\circ$  off-horizontal, what will be the resist non-uniformity due to gravitational flow?
  9. What alignments are needed in fabricating the device of Figure 5.19?

## References and Related Reading

- Chuang, Y.-J., F.-G. Tseng and W.-K. Lin (2002) Reduction of diffraction effect of UV exposure on SU-8 negative thick photoresist by air gap elimination, *Microsyst. Technol.*, **8**, 308–313.
- Cui, Z. (2005) **Micro-Nanofabrication**, Springer.
- Franssila, S. and S. Tuomikoski (2010) MEMS lithography, in V. Lindroos *et al.*, **Handbook of Silicon Based MEMS Materials and Technologies**, Elsevier.
- Mack, C. (2007) **Fundamental Principles of Optical Lithography**, John Wiley & Sons, Ltd.
- Moreau, W. (1988) **Semiconductor Microlithography**, Plenum Press.
- Pham, N.P. *et al.* (2008) Photoresist coating and patterning for through-silicon via technology, *J. Micromech. Microeng.*, **18**, 125008.
- Rai-Choudhury, P. (ed.) (1997) **Handbook of Microlithography, Micromachining and Microfabrication**, SPIE.
- Roth, S. *et al.* (1999) High aspect ratio UV photolithography for electroplated structures, *J. Micromech. Microeng.*, **9**, 105–108.
- Saha, S.C. *et al.* (2008) Tuning of resist slope with hard-baking parameters and release methods of extra hard photoresist for RF MEMS switches, *Sens. Actuators*, **A143**, 452–461.
- Shaw, J.M. *et al.* (1997) Negative photoresists for optical lithography, *IBM J. Res. Dev.*, **41**, 81.

# Advanced Lithography

This chapter concentrates on the making of narrow lines for modern integrated circuits. Linewidth scaling has been driven by CMOS scaling for four decades, and the trend continues. While in the 1980s it was thought that  $1\text{ }\mu\text{m}$  would be the limit of optical lithography, in 2010 lines of  $50\text{ nm}$  are manufactured in high volumes by UV exposure (microprocessor gates are  $20\text{ nm}$ , but these are not done directly by optics, as will be explained later in this chapter). This tremendous progress is based on a number of innovations in optics, optomechanics, resist chemistry and related processing advances.

From a manufacturing point of view, lithography performance can be pinpointed to a few key elements:

- **CD control:** How accurately is the critical dimension (linewidths) controlled?
- **Overlay/alignment:** This concerns the placement of patterns relative to previous layers.
- **Defectivity:** This applies to pattern fidelity itself, as well to added particles: there should be none.
- **Metrology:** Masks have to be verified for correctness and the resist patterns inspected after lithography; both tasks are becoming formidable because of simultaneous linewidth reduction and chip size increase.
- **Cost:** Because lithography is done many times (25 photomasks in DRAM, 35 in modern microprocessors) it is imperative to keep throughput and yield high.

## 10.1 Projection Optical Systems

While contact and proximity lithography discussed in the previous chapter relied on  $1\times$  reproduction of the mask pattern on the wafer, today  $4\times$  reduction optics is the norm. This is a great relief for mask makers as  $50\text{ nm}$  features are “only”  $200\text{ nm}$  on the mask. The main elements of a reduction optical system are depicted in Figure 10.1.



**Figure 10.1** Reduction projection optical lithography system

In projection optical systems the optical system II of Figure 10.1 is the key element: it provides an image of the mask on the wafer. To emphasize the difference from  $1\times$  masks, reduction masks are often called reticles.

Projection optics today is used for chipwise exposure: one chip is exposed, the wafer is moved to a new position, and another chip is exposed. This approach is termed step-and-repeat, and the systems are known as steppers (Figure 10.2). In spite of the fact that it is easier to expose a chip-sized area than full wafers, stepper lenses are very large and expensive. The lens of a stepper with for example an  $8\text{ cm}^2$  field (on a wafer) can weigh 500 kg and cost \$10 million (Step-and-repeat was an existing technique in the photomask industry: the original chip pattern was written once on a mask blank and the final  $1\times$  full wafer mask with hundreds of identical chips was made by copying the original pattern many times over to



**Figure 10.2** Scanning projection and reduction step-and-repeat. Reproduced from Mack (2007) by permission of John Wiley & Sons, Ltd

another mask blank.) Step-and-repeat lithography was the mainstay of IC fabrication in the 1980s and 1990s.

The alternative method is step-and-scan (with tools known as scanners), where the mask and wafer are moved in opposite directions and the exposure is done through a slit. Scanners expose smaller areas, in a sense, because only the narrow slit is exposed, making it easier and cheaper to fabricate optics for scanners. The field size can be made larger because of cheaper optics. And the field size is limited by optics only in the slit height direction; in the scan direction it is limited by stage mechanics. The benefits of cheaper optics are partially lost because the mechanics of wafer stage and mask holder movements must be delicately coordinated. IC fabrication relies mostly on scanners today.

It is certainly slower to expose chips than to expose full wafers. Steppers in the early 1980s exposed 25 wafers per hour (WPH) while the then dominant  $1\times$  projection aligners had throughputs of 100 WPH. Today steppers and scanners achieve up to 200 WPH. Assuming a  $40\times 20$  mm exposure field and 300 mm wafer size, this translates to a fraction of a second per chip. Exposure field size is important for DRAMs and microprocessors, which are large chips, and an integral number have to fit an exposure field. With smaller chips it is possible to reduce reticle costs by including many different designs on the same reticle.

Many other operations in addition to exposure can be done individually for each chip, enabling tighter process specifications. For example, focusing and alignment can be done for each chip, and wafer warping can be compensated for. Additionally, chipwise exposure enables the optimization of lithography in an economical fashion. In test runs all chips can be exposed differently, in order to find the optimum exposure dose and focus conditions, and to check process robustness. The focus-exposure



**Figure 10.3** Focus-exposure matrix: linewidth plots with different focus depth and exposure dose ( $\text{mJ}/\text{cm}^2$ ) combinations: (a) array lines; (b) isolated lines. Reproduced from Mack (2007), figure 10.14, by permission of John Wiley & Sons, Ltd

(FE) matrix results are shown in Figure 10.3. It shows linewidths for different focus settings with exposure dose as a parameter. As can be seen, it is difficult to optimize for both isolated and array lines simultaneously.

Alternatively, the FE matrix can be displayed as constant linewidths in focus-exposure dose coordinates. The process window can then be expressed as the maximum rectangle that fits within the linewidth specification (usually 10%), see Figure 10.4. For the isolated lines the process window is much smaller than for dense array lines.

It is possible to change the reticle between exposures and to have many different chips on one wafer in any proportion. The inclusion of different designs for rapid prototyping or test chips is thus flexible. Because the reticle never makes physical contact with the wafers, its lifetime is infinite in principle. In practice it is determined



**Figure 10.4** Process window in focus–exposure coordinates: all combinations inside the boxes result in acceptable linewidths from Mack (2007) by permission of John Wiley & Sons, Ltd

by product lifetime: many chips have such short lifecycles that a reticle set is often used only once, to make a lot of 25 wafers of 1000–10 000 chips depending on size.

Projection optics allows an extra trick to preserve reticle quality. A transparent film, called a pellicle, is framed in front of the reticle. Airborne particles will settle on the pellicle film, which is about 100 µm above the chrome pattern. Particles on the pellicle will be out of focus, and do not print on the wafer.

## 10.2 Resolution of Projection Optical Systems

The resolution of a projection optical system is approximated by the Rayleigh relation, Equation 10.1, and the depth of focus (DOF) is given by Equation 10.2:

$$\text{resolution} \approx \frac{k_1 \lambda}{\text{NA}} \quad (10.1)$$

$$\text{DOF} \approx \frac{k_2 \times \lambda}{\text{NA}^2} = \pm \frac{\lambda}{2 \times \text{NA}^2} \quad (10.2)$$

where NA is the numerical aperture. From geometrical optics we get  $k_1$  as 0.5, which means that minimum lines will be  $\lambda/2\text{NA}$ . One approach to better resolution (smaller linewidths) is by wavelength reduction. This strategy has been steadily used: from 436 nm (g-line from an Hg lamp), to 365 nm (i-line from an Hg lamp) to 248 nm (KrF laser) to 193 nm (ArF laser). All else being equal, this alone would result in a factor of two improvement in resolution and a factor of four improvement in device areal density. NA enhancement is another clear route

**Table 10.1** Linewidth scaling of CMOS

| Wavelength<br>(nm) | Aperture<br>NA | $k$ factor  | Linewidth<br>(µm) | DOF<br>(µm) |
|--------------------|----------------|-------------|-------------------|-------------|
| $\lambda = 436$    | NA = 0.38      | $k_1 = 0.8$ | 1                 | $\pm 1.5$   |
| $\lambda = 365$    | NA = 0.48      | $k_1 = 0.6$ | 0.5               | $\pm 0.8$   |
| $\lambda = 248$    | NA = 0.60      | $k_1 = 0.6$ | 0.25              | $\pm 0.35$  |
| $\lambda = 248$    | NA = 0.65      | $k_1 = 0.5$ | 0.18              | $\pm 0.30$  |

which has been used. In 20 years NA has increased from about 0.2 to 0.9, a factor of five improvement through better optical design and manufacturing.

Table 10.1 lists CMOS resolution trends assuming  $k_2 = 1$  but letting  $k_1$  evolve.

Resolution enhancement by NA increase has been paid for dearly on the focus side: DOF is becoming very small indeed. DOF (Equation 10.2) is an optical concept but resist chemistry and resist profile specifications (which depend on subsequent process steps) must also be considered. Besides optical DOF, other factors must be accounted for: the wafer is not flat, neither is the wafer chuck, and stepper focus mechanisms are not perfect. All these contribute, say, 0.1–0.2 µm errors in focus. Previous etching and deposition steps can easily create topographic variations on the order of half a micrometer, so planarization is critical for lithography (see Chapter 16 on chemical mechanical polishing).

A 4× reduction makes mask making much easier. Errors in both resist image on the mask and the etched chrome image on the mask are reduced, leading to tighter linewidth tolerances on the wafer. Mask writer placement error is also reduced, improving the overlay between two layers. The more complicated optics of reduction systems (in contact printing there is no imaging optics) introduced some distortion but this is a minor price to be paid.

## 10.3 Resists

In Chapter 9 we treated resists as if they were digital on/off materials which either react to exposure or not. Now we are dealing with more realistic cases: resists have exposure threshold energy, finite contrast and finite selectivity in developers. Resists are also optical materials and part of an optical system with reflections, interference and absorption. All these aspects become more pronounced when resists go over topography; patterning on a planar surface is fairly straightforward.

The calculation of exposure uses normalized remaining inhibitor concentration  $M(x, t)$ : it describes the fraction of inhibitor left after exposure for a certain time in a

certain position inside the resist. Optical absorption  $\alpha$  in photoresist is given by

$$\alpha = AM(x, t) + B \quad (10.3)$$

where  $A$  is exposure-dependent and  $B$  exposure-independent absorption.  $A$  and  $B$  are known as Dill parameters, and their values for novolak resists are in the range of  $0.4\text{--}1/\mu\text{m}$  for  $A$  and  $0.01\text{--}0.1/\mu\text{m}$  for  $B$ . A decrease of inhibitor concentration depends not only on light intensity  $I(x, t)$ , but also on sensitivity to exposing radiation  $C$  and of course to inhibitor concentration  $M$ . Time-dependent inhibitor concentration is given by

$$\frac{\partial M}{\partial t} = -I(x, t)M(x, t)C \quad (10.4)$$

The sensitivity parameter  $C$  is also known as Dill  $C$  and its value for novolak resists is on the order of  $0.01\text{ cm}^2/\text{mJ}$ .  $A$ ,  $B$  and  $C$  are all of course wavelength dependent. Analytical solutions to resist exposure are very difficult and simulation is used extensively.

Resist sensitivity can be tailored for different wavelengths (or for electrons, ions or X-rays). Sensitivity is important for productivity. With typical exposure energies on the order of  $100\text{--}500\text{ mJ/cm}^2$  for DNQ positive resists, exposure times for standard resists  $1\mu\text{m}$  thick are on the order of  $1\text{ s}$  with  $500\text{ W}$  lamps.

The first DUV lasers had intensities that were too low for practical throughputs and this problem led to the development of high-sensitivity chemically amplified resists (CARs) in the 1980s. CAR works in two steps: photoacid generator (PAG) molecules decompose upon photon impact and these decomposition products catalyze more PAG decomposition so that a single photon can lead to 100 decomposition reactions. In the second step, the post-exposure bake, the photoreaction products diffuse (a few nanometers or a few tens of nanometers) and react, and the reaction products are responsible for the solubility difference between exposed and unexposed resist. The coefficient Dill  $A$  is zero for CARs, because the exposure itself does not change the resist greatly.

Because the reaction is catalytic, the exposure dose is very small and system throughput is high (the exposure time is just one factor in throughput: the stepper has to move between exposures, align and focus, and all these add up in throughput calculations). CARs need only  $10\text{--}50\text{ mJ/cm}^2$  exposure doses, one-tenth of novolak resists. However, the very fact that the reaction is catalytic poses a danger: if the reaction is quenched, and multiplication stops, the resist is not exposed. This can happen due to airborne contaminants which react with the resist. Ammonia is one prime culprit, and it cannot be

completely eliminated from cleanroom air because it is such an essential cleaning chemical (see Chapter 12) and it is released by the HMDS priming process (Equation 9.2). The two-step nature makes CAR processing time sensitive: no delays are allowed between exposure and bake.

### 10.3.1 Contrast

Photoresist contrast is important for both resolution and profile. A sigmoid (nonlinear) response function is essential for patterning. Optical wavefronts after the mask are not ideal square waves but rather attenuated sine waves, and a linear response as a function of exposure dose is rather useless because photoresist patterns would be smoothly curving bumps, not clearly defined rectangular shapes.

Contrast ( $\gamma$ ) is calculated for positive and negative resists as the slope of dose required to clear the resist completely, Equation 10.5, which is graphically shown in Figure 10.5:

$$\gamma_p = \frac{1}{\log\left(\frac{d_c}{d_0}\right)} \quad \gamma_n = \frac{1}{\log\left(\frac{d_0}{d_i}\right)} \quad (10.5)$$

where  $d_c$  is dose to clear all resist and  $d_0$  is extrapolated dose at the kink in the contrast curve. For negative resist  $d_0$  and  $d_i$  are defined analogously. Typical contrasts are 2–5 for novolak-based positive resist and 5–10 for deep-UV CARs. High contrast is beneficial for the resist profile: on the one hand it is easy to achieve vertical walls, while on the other hand high contrast resist is easily overexposed, or saturated.

Resist thickness is limited by its mechanical stability: resist patterns that are too tall will collapse during drying due to capillary forces. This same phenomenon is seen in surface MEMS: short stubby ones survive better than long slender ones (Equation 29.8). Resist thickness is constantly reduced, and resist aspect ratios are 2–3:1, which translates to resists  $300\text{ nm}$  thick for  $90\text{ nm}$  technology and  $130\text{ nm}$  thick for  $45\text{ nm}$  node.

## 10.4 Thin-Film Optics in Resists

Photoresist is a part of an optical system involving the illumination light source, the lenses and the photomask, and we have to include the substrate also, because some light reaching the substrate through the resist will be reflected back, contributing to pattern formation.

Photoresist thickness determines the optical path length for incoming and outgoing rays. Constructive and destructive interference inside the photoresist lead to intensity



**Figure 10.5** Resist contrast plots on a thickness–exposure dose axes for infinite contrast resist and real resists: left, positive resist; right, negative resist



**Figure 10.6** Reflections at air/resist and resist/substrate interfaces result in interference pattern of standing waves. Reproduced from Peterson *et al.* (1996) by permission of Henley Publishing



**Figure 10.7** The effect of resist thickness variation on linewidth. Reproduced from Mack (2007) by permission of John Wiley & Sons, Ltd

The effect of resist thickness variation of linewidth is shown in Figure 10.7. Minor variation of 50 nm (5% of resist thickness) can lead to 20% linewidth variation.

Post-exposure bake (PEB), which enhances the diffusion of photoproducts, will make standing wave effects smaller. A diffusion distance of about half the period will smooth out standing waves. However, when lines are very narrow, PEB cannot be used: the diffusion distance must be considerably smaller than the linewidth, otherwise it will affect the line shape too much.

variations in the vertical direction through the resist. This is seen as standing wave patterns in the developed resist (Figure 10.6). In the extreme case, the parts that received least light (in positive resist) will not be developed by a developer that has high selectivity between exposed and unexposed parts (high contrast developer).

Thin-film interference in the resist leads to thickness-dependent exposure doses. Depending on resist thickness, the total dose needed to expose the resist changes. If destructive interference takes place in the resist top surface, almost all of the illumination energy is absorbed in the resist, whereas in the case of constructive interference at the top surface, only half of the energy stays inside the resist. Maxima and minima alternate at  $\lambda/(4n)$  intervals; for example, for  $\lambda = 365$  nm exposure and 1.6 index of refraction for typical photoresist, this interval is 56 nm.

On a planar surface this problem can easily be solved by better control of photoresist thickness (spinning process), but on a structured surface there is no general solution to the variable resist thickness problem.

Swing ratio is a measure of variation introduced by thin-film optical effects. It is determined as exposure–dose variation (max – min) divided by mean value. It can be defined similarly for the linewidth. It is analogous to a lossy Fabry-Pérot interferometer and can be modeled by the following equation:

$$S = 4 e^{(-\alpha D)} \sqrt{R_1 R_2} \quad (10.6)$$

where  $R_1$  is reflectivity at the air/resist interface,  $R_2$  is reflectivity at the resist/substrate interface,  $\alpha$  is the resist absorption coefficient and  $D$  is resist thickness.

Obviously there are four ways to minimize the swing ratio. The first strategy is to minimize  $R_1$ , which translates to a top antireflective coating (TARC). Light traversing the TARC twice will interfere destructively and minimize reflections if TARC thickness matches the  $\lambda/(4n)$  condition. The TARC refractive index should be given by

$$n_{\text{TARC}} = \sqrt{n_{\text{resist}} n_{\text{air}}} \quad (10.7)$$

With the resist  $n$  typically around 1.65, the TARC refractive index should be about 1.3. TARC thickness would then be about 70 nm.

Photoresist-like spinning is a favorite method for coating TARC, and the material is very much photoresist-like (non-absorbing, however); it will be removed by the developer. Added process complexity is small. TARC is insensitive to substrate material, therefore it is a fairly general method to reduce reflections and swing. If, however, TARC is deposited over steps in a way similar to resist, TARC thickness will be variable, and its effectiveness reduced. There are not many materials with  $n = 1.3$ , so choices are not abundant.

The reduction of  $R_2$  involves bottom antireflective coatings (BARCs). The role of BARCs on absorbing metal surfaces is to reduce reflectivity, and on transparent surfaces (like oxides) to reduce thickness effects. BARCs work by index matching, just like TARC's, but also by absorption: absorbed light will not re-enter the resist. BARC thicknesses are similar to TARC's, in the range of 20–70 nm, but the materials and processes are different. BARCs must tolerate developers because, if they did not, they would undercut the resist patterns. BARCs are therefore patterned by dry etching. Spin-on polymer-based BARCs do exist, but inorganic BARCs that will be left as permanent parts of the finished devices are also used. Titanium nitride TiN is a BARC for aluminum lithography, but it is deposited in the same process as the aluminum, not in conjunction with

resist processing. Oxides and nitrides can also be used as BARCs. It is difficult to remove them selectively, and most often they remain as parts of finished devices. Inorganic BARCs can act as hard masks for etching; the resist is used as a mask for BARC etching, and BARC is then used as a mask for film etching.

The absorption strategy involves resist tailoring. The standard  $\alpha$  are around  $0.2\text{--}1/\mu\text{m}$ . Adding dyes to increase absorptivity  $\alpha$  to for example  $2/\mu\text{m}$  means that all radiation will be absorbed in the top resist layer, and the bottom part will not be exposed. So there is an optimum between swing ratio reduction and resist profile. Top surface imaging (TSI), overcomes the absorption dilemma by using very thin and very absorbing resists, which are not sensitive to profile variation like standard resists.

The fourth possible strategy, resist thickness increase, is at odds with resolution: if we wish to print narrow lines, thinner resists are better. Scaling to smaller linewidths with this strategy is therefore not an option at all.

## 10.5 Lithography Over Steps

The viscous flow of photoresist over steps leads inevitably to uneven resist thickness (Figure 10.8) and linewidth change at step edges. Because spin coating results in variable resist thickness over steps, the linewidth will be dependent on underlying steps via resist thickness changes.

On non-planar surfaces the effect of structures from previous steps causes some problems. Reflections from underlying metal lines can cause resist exposure in unwanted places. This is called reflective notching and is depicted in Figure 10.9.



**Figure 10.8** Thickness variation of spin-coated resist over topographic features



**Figure 10.9** Reflective notching: left, top view of distorted resist lines; right, cross-sectional view showing how the underlying metal line reflects incoming light into the resist sidewall

## 10.6 Optical Extensions of Optical Lithography

Over the years many alternatives have been proposed as replacements for optical lithography, among them X-ray lithography, e-beam projection, ion beam projection and e-beam direct write. Optical lithography has been able to prosper because a constant flux of improvements has been forthcoming. For instance, there is immersion, an old technique that has been known to microscopists since the nineteenth century. NA (Equations 10.1 and 10.2) can be increased by replacing air (refractive index 1.00) with a higher refractive index fluid, usually water ( $n = 1.35$ ). Immersion has been used in the production of 65 nm and more advanced chips.

Optical waves have four basic properties: amplitude, phase, direction and polarization. Engineering these has resulted in many new ways to improve optical lithography performance. Phase shift masks (PSMs), off-axis illumination (OAI), optical proximity correction

(OPC) and subresolution assist features (SRAFs) are collectively known as resolution enhancement technologies (RETs).

### 10.6.1 Phase shift masks (PSM)

Normal masks are called binary masks because there are two possibilities for light: transmission or blockage. In phase shift masks (PSMs) there is a third possibility: transmission with phase shift. Shifters are semitransparent structures that produce a  $180^\circ$  phase shift in the transmitted light. Light along the shifted path will be out of phase with the light going through the unshifted part, and the amplitude will go through zero. Intensity, which is amplitude squared, will be much steeper compared to a binary mask, which improves both resolution and edge contrast.

The phase shift for light traveling in air for a distance  $L$  is  $\Phi = 2\pi L/\lambda$ , and for light traveling in the phase shifter material with index of refraction  $n$ ,  $\Phi = 2\pi nL/\lambda$ . For a  $180^\circ$  phase shift,  $\Delta\Phi = 180^\circ$ , the condition for shifter thickness is given by

$$L = \frac{\lambda}{2(n - 1)} \quad (10.8)$$

In alternating PSM (Figure 10.10) a shifter is either etched or deposited for every second feature, which limits PSM applications to regular arrays. The rim shifter (see Figure 10.11) utilizes undercut and can be applied to any pattern shape and size.

Rim PSM fabrication makes use of ingenious self-alignment with back-side illumination (Figure 10.11): an ordinary binary mask is fabricated first, with chrome patterns on a quartz plate. Shifter material is then deposited all over the plate, and photoresist is spun. The structure is then exposed from the opposite side of the



**Figure 10.10** Binary mask (left) and alternating phase shift mask (right) compared: amplitude goes through zero for PSM, and intensity (= amplitude squared) is steep



**Figure 10.11** Two schemes for fabrication of rim PSMs: double exposure self-aligned on the left; standard single exposure on the right side. Both processes result in identical mask plates

mask plate and the chrome acts as a self-aligned mask for the shifters. The shifters are then etched, followed by chrome undercutting in a second etching step.

### Process flow for PSM fabrication by single exposure

- Chrome deposition
- Shifter deposition
- Photoresist application
- Pattern generation
- Shifter etching
- Chrome etching and underetching
- Photoresist stripping

Chrome undercutting in both methods results in exactly the same degree of dimensional control. The difference is in mask inspection and repair: in the self-aligned method, the chrome pattern can be inspected and repaired before shifter fabrication. Lack of inspection and repair for PSMs has been one main factor holding back their adoption. PSM has been used in production since the 180 nm generation. As shown in Figure 10.12, it is possible to print 100 nm lines with 193 nm wavelength.



**Figure 10.12** PSM: 100 nm lines printed with 193 nm light source. Reproduced from Fritze *et al.* (2003) by permission of IEEE

### 10.6.2 Off-axis illumination (OAI)

Normally the illuminating light hits the mask perpendicularly but in off-axis illumination (OAI) the light hits the mask tilted (Figure 10.13). This has the effect of shifting the diffraction pattern so that only the undiffracted zeroth order and the first diffracted order are transmitted (and two components are needed to form an image). While in normal illumination the zeroth order is at the middle of the aperture, and the first orders at the edges, giving pitch as  $\lambda/NA$ , in OAI the pitch is halved because the zeroth and first order are twice that far apart. Theoretically OAI can therefore provide a pitch that is half the standard illumination pitch. Various pupil designs for OAI are shown in Figure 10.14. One of the drawbacks of OAI is that parallel and perpendicular lines will be different, and in fact each mask should have a pupil optimized for it.



**Figure 10.13** Off-axis illumination (OAI). Reproduced from Mack (2007), by permission of John Wiley & Sons, Ltd



**Figure 10.14** Conventional, annular and quadrupole apertures



**Figure 10.15** Standard mask (top left) and its resulting resist pattern (top right); optical proximity corrected mask (bottom left) and its resist pattern (bottom right)

### 10.6.3 Optical proximity correction (OPC)

Isolated and dense lines print differently (Figure 10.3) because diffraction in arrays is different from that of an isolated structure. This effect becomes more significant when feature sizes are scaled down. It can be combated by designing the structures to take into account the neighboring structures. This is an inverse problem: we know the desired end result, and we have to work backward from the correct resist pattern to a mask pattern which will give us that. In Figure 10.15 this is represented by arrows pointing in both directions: the mask on the left produces the pattern on the right, or, alternatively, to get the pattern on the right, we have to design the mask to be as shown on the left.

### 10.6.4 Subresolution assist features (SRAFs)

Subresolution assist features (SRAFs) are lines that are too small to be printed in the lithography process, but they will still affect the diffraction patterns, enabling better control of feature shapes (Figure 10.16). SRAFs around an



**Figure 10.16** Subresolution assist features (SRAFs)

isolated line make it look more like an array line, making the difference between the two smaller, thus improving process latitude. On the negative side, it takes a lot more shots from the pattern generator to print all those extra small pixels.

## 10.7 Non-Optical Extension of Optical Lithography

Because optical lithography tools are expensive, there is a constant need to find ways to produce narrow lines in a more cost-effective way. One such technique is resist trimming. The minimum resist line is first produced by optical lithography, and isotropic plasma etching of the photoresist is then performed (Figure 10.17). The resist line gets narrower and thinner. This method is most suitable when narrow lines can be used as a starting point. In a university lab, a 1 µm minimum line can be



**Figure 10.17** Resist trimming: resist lines made narrower by isotropic etching of the resist in oxygen plasma. In both cases the resolution (line + space) remains constant

narrowed to 200 nm by 400 nm lateral narrowing on both sides (resulting in a resist 600 nm thick). In modern IC fabrication this technique is employed to narrow down critical individual lines: the microprocessors gates. Lines of 50 nm original width can be narrowed down to 20 nm: that is, 15 nm horizontal narrowing from both sides. Pitch is not affected, and device packing density remains unchanged, but transistor speed improves because the transit time from the source to drain becomes smaller, see Equation 26.1.

## 10.8 Lithography Simulation

Lithographic pattern formation starts with the designer's layout file which is turned into a physical mask plate in a mask shop. This mask is inserted into the exposure tool, where it modifies illumination from the light source. After complex photochemistry steps in the photoresist, development creates patterns in the resist. This information flow (Figure 10.18) has many points where errors can occur and where dimensions are not accurately transferred. Some of these are data errors related to formats used in drawing and mask writing, and some are physical and chemical errors related to both mask writing and exposure resolution and to etching tolerances.

Note that the mask writing process has a similar information flow and similar error sources: the mask



**Figure 10.18** Lithography information flow. Adapted from Brunner (1997)



**Figure 10.19** Modules of lithography simulation. Redrawn after Neureuther and Mack (1997)

writer has finite resolution; the photoresist used in mask writing is similar to resists used in optical lithography; and chrome etching has its non-idealities just like any other etching process.

Lithography simulation is a self-contained specialty within microfabrication simulations. It is partly physical simulation (optical modeling) and partly semiempirical (development simulation). Lithography simulators have three basic functions as shown in Figure 10.19. The first module is optical modeling, the second is photochemical, time-dependent, diffusion modeling, and the third module is an etch simulator specifically built for resists. The development of novolak resist in alkaline developer is an etching reaction, and it uses models similar to etching, but because its application field is very specific, higher accuracy is possible. These steps have been modeled with good success even though an understanding of many of the basic mechanisms in resist exposure and development is yet to be revealed.

SAMPLE 2D is a lithography and topography simulator containing a concise optical lithography model (Figure 10.20). Lithography simulation input parameters include light source data like wavelength, exposure dose, NA and coherence; resist thickness and Dill parameters  $A$ ,  $B$  and  $C$ ; wafer and resist refractive indices; and development rate parameters. SAMPLE 2D is able to predict resist profiles with standing waves. PROLITH is an advanced lithography simulator especially for deep-UV lithography.

## 10.9 Lithography Triangles

Optical lithography has been amazingly persistent, and it has been scaled down more than anybody could ever imagine. Contenders for optical lithography will be discussed in Chapter 38. Some of them excel in certain aspects, but optical lithography has all the elements it takes: light sources, resist, masks, metrology, design software,



**Figure 10.20** SAMPLE 2D simulation of resist exposure and development; nominal linewidth is  $1.0\text{ }\mu\text{m}$  (only right hand side is shown because the structure is symmetric): (a) exposure dose  $100\text{ mJ}/\text{cm}^2$ , development time  $65\text{ s}$ ; (b) dose  $80\text{ mJ}/\text{cm}^2$ , development  $75\text{ s}$ , leads to sloped profile; and (c) dose  $70\text{ mJ}/\text{cm}^2$ , development  $70\text{ s}$ , leads to incomplete development. In (d) conditions are identical to (c) but resist thickness is only  $0.5\text{ }\mu\text{m}$

etc. It takes time to build an infrastructure for a new lithography technology.

Critical dimension, CD, is basically a 2D concept. However, the resist sidewall is seldom  $90^\circ$ , so CD should



**Figure 10.21** Magic triangles in lithography. Adapted from Zell (2006)

really be able to describe the shape of a trapezoidal object. Real resist lines are also rough to some extent, and line edge roughness (LER) is used to describe this.

Fantastically narrow lines are printed optically but there are tradeoffs. These are shown by two “magic triangles” in Figure 10.21. Magic triangles tie together properties that are difficult to optimize simultaneously.

## 10.10 Exercises

1. Produce a graphical presentation of projection lithography resolution vs. depth of focus!
2. Estimate the contrasts of resists in Figure 10.3.
3. If subresolution assist features are half the feature size, how is e-beam mask writing time affected?
4. How much will the swing ratio be reduced if the top antireflection coating can reduce air/resist reflections by 20%? How much will the swing ratio be reduced if absorbance increases from  $0.5$  to  $1/\mu\text{m}$ ?
5. Calculate some good and bad resist thicknesses for novolak resist at  $365\text{ nm}$  exposure!
6. What is the resist thickness and linewidth in Figure 10.6?
7. How does resist trimming work if the original linewidth is  $0.5\text{ }\mu\text{m}$  and resist thickness  $1\text{ }\mu\text{m}$ ? What if the original linewidth is  $100\text{ nm}$  and resist thickness  $150\text{ nm}$ ?
8. How thick are phase shifters if a  $193\text{ nm}$  laser is used for exposure?

## References and Related Reading

- Ausschnitt, C. P. *et al.* (1997) Advanced DUV photolithography in a pilot line environment, *IBM J. Res. Dev.*, **41**, 21.
- Brunner, T. (1997) Pushing the limits of lithography for IC production, International Electron Devices Meeting, p. 9.
- Fritze, M. *et al.* (2003) Enhanced resolution for future fabrication, *IEEE Circuits Devices Mag.*, **1**, 43.
- Gu, A. and A. Zakhor (2008) Optical proximity correction with linear regression, *IEEE Trans. Semicond. Manuf.*, **21**, 263–271.
- Holmes, S. J. *et al.* (1997) Manufacturing with DUV lithography, *IBM J. Res. Dev.*, **41**, 7.
- Lin, B. J. (2006) The ending of optical lithography and the prospects of its successors, *Microelectron. Eng.*, **83**, 604–613.
- Mack, C. (2007) **Fundamental Principles of Optical Lithography**, John Wiley & Sons, Ltd.
- McCallum, M., M. Kameyama and S. Owa (2006) Practical development and implementation of  $193\text{nm}$  immersion lithography, *Microelectron. Eng.*, **83**, 640–642.

- Neureuther, A. R. and C. A. Mack (1997) Optical lithography modeling, in P. Rai-Choudhury (ed.) **Handbook of Microlithography, Micromachining and Microfabrication**, SPIE.
- Peace, R. F. and S. Y. Chou (2008) Lithography and other patterning techniques for future electronics, *Proc. IEEE*, **96**, 248–270.
- Peterson, B. *et al.* (1996) Approaches to reducing edge roughness and substrate poisoning of ESCAP photoresists, *Semicond. Fabtech*, **8**, 183.
- Rai-Choudhury, P. (ed.) (1997) **Handbook of Microlithography, Micromachining and Microfabrication**, SPIE.
- Schellenberg, F. M. (2005) A history of resolution enhancement technology, *Opt. Rev.*, **12**, 83–89.
- Zell, Th. (2006) Present and future of 193nm lithography, *Microelectron. Eng.*, **83**, 624–633.

# 11

## Etching

The pattern transfer process consists of two steps: lithographic resist patterning and the subsequent etching of the underlying material. Resist protects the areas where the material needs to remain, and open areas are etched.

In the etching process material is chemically and/or physically attacked and eroded in the unprotected areas. Some materials are spontaneously chemically etched, like silicon by fluorine, aluminum by chlorine, or oxide by hydrofluoric acid. Sometimes physical processes are needed to assist in etching, like ion bombardment in chlorine etching of aluminum oxide and fluorine etching of silicon dioxide. In those cases the resulting etch rate is a synergistic sum of both chemical and physical processes.

All materials can be etched by energetic ions, and this applies to the resist mask and the underlying film, too. But in practical processes it is important to achieve selectivity: that is, high etch rate ratio between two materials. In the ideal case etching would stop when film clears, but in practice some underlying material loss is almost inevitable (Figure 11.1). Resist is also consumed in the process, and the sidewall of etched structure is not necessarily perfectly vertical. It calls for engineering judgment

to decide which degree of profile control and selectivity are acceptable.

In lithography rework is easy: if the resist pattern is found to be faulty in inspection, the resist is stripped and lithography repeated, but once the pattern has been transferred into solid material by etching, rework is much more difficult, and usually impossible.

### 11.1 Etch Mechanisms

Etching is often divided into two classes, wet etching and plasma etching. Wet etching equipment consists of a heated quartz tank or bath plasma etching equipment consists of a vacuum chamber with a RF generator and a gas system.

The basic reactions in etching are given by

Wet etching:



Plasma etching:



There are several processes that must take place for etching to proceed:

- transport of etchants to surface (flow and diffusion)
- surface processes (adsorption, reaction, desorption)
- removal of product species (diffusion and flow).

These basic steps are the same for both wet etching and plasma etching, but clearly diffusion in the gas phase is



**Figure 11.1** Etching with photoresist mask: thin film has been etched, with some etching of the underlying material and resist, too. There is some undercutting and the profile is not perfectly vertical

more rapid than in the liquid phase. In plasmas there are many possibilities to provide extra energy to the process, in the form of accelerated ions. The chemical component of etching can be enhanced by heating, in both wet and plasma etching.

A comparison of the basic processes of plasma etching in Figure 11.2 to those of CVD (Figure 5.6) reveals obvious similarities. Gases are fed in by forced convection, some species diffuse through the boundary layer to the surface and react there, desorb and diffuse back to the main flow stream, and get pumped away. Note that utilization of source gases is never perfect: some source gas molecules “fly by” the wafer in the main flow. A very small fraction of molecules are ionized, and these ions enhance surface reactions. Plasma etcher can be run as a deposition tool, and PECVD reactor as an etcher, which is beneficial for chamber cleaning.

If etching is too slow, any of the above steps can be the cause of the problem: inadequate inflow of etchant, slow diffusion through a thick boundary layer, slow chemical reaction on the surface, low volatility (or solubility) of products. Redeposition of products on the surface can also take place, and sometimes byproducts, like hydrogen gas evolution (as in Equation 11.1), are so strong that they prevent etchant from reaching the surface.

Etch rates are typically 100–1000 nm/min, for both wet and plasma processes. The lower limit comes from

manufacturing economics and the upper limit from resist degradation, thermal runout and damage considerations. Silicon etching is exceptional: rates up to 20 µm/min are available in both wet etching ( $\text{HF:HNO}_3$ ) and in plasma etching (DRIE) in  $\text{SF}_6$ .

### 11.1.1 Footnote on terminology

The term dry etching, as an opposite to wet etching, is often used as a synonym for plasma etching, but there are dry methods which do not involve plasmas, e.g.  $\text{XeF}_2$  etching of silicon. We use the terms plasma etching and RIE interchangeably. RIE, Reactive Ion Etching is a misnomer: ions in RIE are in a minor role, most etching is done by excited neutrals. Ion etching, also known as ion milling, is a completely different kind of an etching process, to be discussed in Section 11.11.

## 11.2 Etching Profiles

The isotropic etching front proceeds as a spherical wave from all points open to the etchant. Because the etch front proceeds under the mask (resulting in undercut), isotropic etching cannot be used to make fine features (Figure 11.3). In fully isotropic etching lateral extent (undercutting) is identical to vertical etched depth.



**Figure 11.2** Basic processes in plasma etching: forced flow, boundary layer diffusion and surface processes enhanced by ion bombardment



**Figure 11.3** Undercutting in isotropic etching: wide lines are narrowed but narrow lines are completely undercut and released

Etch bias, defined as the difference between etched feature size and mask size, is then twice the etched depth.

Isotropic profile is the most commonly encountered etch profile. Most wet etchants result in an isotropic profile, and it is encountered also in plasma and dry etching. In plasma etching, the degree of isotropy can be controlled by the etching parameters, from fully isotropic to fully anisotropic (which may not be easy!).

Undercut can be compensated by making the initial mask feature larger than the desired width, for light-field structures, and smaller for dark-field structures. This approach works quite well for isolated structures, but in dense arrays its utility is limited.

Wet etching profiles are seldom perfectly isotropic, and both deep slopes and gently sloping sidewall profiles

are possible. The main parameters affecting the slope are the same as those governing the other main features of etching: etchant concentration and temperature. Silicon dioxide etching can produce steep slopes in  $\text{NH}_4\text{F:HF}$  at a ratio of 7:1 at  $25^\circ\text{C}$ , but a ratio of 30:1 at  $55^\circ\text{C}$  leads to a gentle slope. Gentle slopes may be desirable for step coverage in subsequent deposition steps.  $\text{NH}_4\text{F:HF}$  mixture is known as buffered HF (BHF) because the consumption of fluoride ions by the etching reaction is compensated, buffered, by ionization of  $\text{NH}_4\text{F}$  into ammonia and fluoride.

Undercutting is sometimes desirable and even necessary. In order to fabricate free-standing beams and plates, isotropic etching is a must: the beams are released when the underlying material is completely removed (Figure 11.3). Such released beams and plates are essential building blocks in surface micromechanics, for instance the bolometer of Figure 2.1 has been released by isotropic undercutting.

Anisotropic etching results in a vertical or almost vertical profile, which is suitable for fine structure fabrication. Isotropic and anisotropic profiles are compared in Figure 11.4. In plasma etching the profile can be tailored by process conditions: pressure, power, temperature and gas flows. Bombardment supplies energy to horizontal surfaces and drives vertical etching. Sidewalls do not experience ion bombardment, and therefore the etch rate in the lateral direction is reduced. Low-pressure operation favors anisotropy because bombardment is more directional, but low-pressure operation requires either a bigger pump or reduced flow rate, in which case the rate is lower. Other effects of ion bombardment include ion-induced desorption, ion-induced damage and ion-activated chemical reactions. Increased ion energy



**Figure 11.4** Isotropic (left) and anisotropic etch profiles (right)



**Figure 11.5** Mechanism of anisotropy: all surfaces are passivated by a thin film, but directional ion bombardment will clear films from a horizontal surface while leaving passivation film on the sidewalls, enabling etching to proceed vertically only

can be helpful for increasing vertical etch rate, but the masking material is also etched faster if power is high.

The other main mechanism for anisotropy is sidewall passivation. Passivating films, for example  $(CF_2)_n$ -type fluorocarbon films (from  $CHF_3$  or  $C_4F_8$  gases), deposit on all surfaces, but ion bombardment removes them from horizontal surfaces, and sidewalls remain passivated (Figure 11.5). Operation at cryogenic temperatures (e.g.,  $-120^\circ C$ ) leads to reduced chemical reaction rates according to the Arrhenius equation. Suppression of spontaneous reactions means that only ion bombardment-driven reactions take place, and they are anisotropic. Cryogenic etching of silicon in  $SF_6/O_2$  etching of silicon results in  $SiO_xF_y$  reaction products that are not very volatile, and they will redeposit on the wafer. And just as with fluoropolymer films, ion bombardment will remove this redeposited film from horizontal surfaces, leaving the sidewalls passivated.

### 11.3 Anisotropic Wet Etching

Isotropy, or homogeneity of space in all directions, is sometimes useful as we can neglect directions. Anisotropic processes are spatially directional, but there are two completely different usages of the term anisotropic etching: anisotropic wet etching and anisotropic plasma etching. Anisotropic wet etching could be termed crystal plane anisotropy, because the different silicon crystal planes have different etch rates.

Potassium hydroxide (KOH) is the prototypical anisotropic wet etchant for silicon. In KOH silicon (100) crystal planes are etched 200 times faster than (111) planes. KOH etching at about  $80^\circ C$  cannot be done with photoresist mask. Instead, resist is used to etch silicon dioxide or silicon nitride, and after resist stripping silicon etching is done in KOH with oxide or



**Figure 11.6** Anisotropic crystal plane-dependent wet etching of silicon: top view and cross-sectional view of a membrane and a V-groove etched on (100) silicon wafer

nitride mask. Initially the shape is determined by the fast etching crystal planes (100) and (110) but in the end etching terminates when only slow etching (111) planes are present. When the slow etching (111) planes meet, etching will terminate. The etched depth is determined by the mask opening and the angle between the (111) and (100) planes,  $54.7^\circ$ . This angle is very characteristic of silicon MEMS structures. Some simple geometries are depicted in Figure 11.6, namely V-grooves and membranes. Note that it is important that the structures are aligned to the crystal planes (indicated by the wafer flat).

A square-shaped mask pattern will lead to a square-shaped well, or, if thought of differently, to a silicon membrane that can be used in pressure sensors as the active bending element. If etching is continued until silicon is completely removed, a nitride membrane results. If the mask opening is small, the slow etching (111) planes will meet, and an inverse pyramid shape is formed (as seen in Figure 11.4a).

Because anisotropic wet etching is controlled by the crystal planes, etching can be continued extendedly without loss of control. Through-wafer etching is possible, enabling, among others, microrocket thrusters (Figure 11.7). The thruster is made of two identical wafers which have been bonded together. The variety of shapes that can be made by crystal plane-dependent etching is astonishingly large, as will be seen in Chapters 20 and 30.

### 11.4 Wet Etching

Wet etching mechanisms fall into two major categories: metal etching by electron transfer, Equation 11.3, and insulator etching by acid–base reaction, Equation 11.4:





**Figure 11.7** Anisotropic wet etching of silicon: two identical silicon wafers bonded together to form a microrocket fuel chamber and inlet and exit nozzles (with diffused resistor heater). Adapted from Mukerjee *et al.* (2000)

The rates of wet chemical reactions can have two behaviors which we encountered earlier in the CVD reaction mechanisms:

1. The surface reaction is slow and it determines the rate.
2. The surface reaction is fast and the rate is determined by etchant availability (transport of reactant by convection and diffusion).

Surface reaction-limited processes exhibit activation energies of 30–90 kJ/mol. The rate increases with increasing etchant concentration and is insensitive to stirring. Different crystal planes can have different surface reaction rates and this leads to anisotropy, for example in KOH and TMAH etching of silicon. Aluminum etching in  $H_3PO_4$  is also surface reaction limited:  $Al_2O_3$  dissolution is the rate determining step, with 54 kJ/mol activation energy.

Transport-controlled reactions are characterized by activation energies of 4–25 kJ/mol. Their rate increases with agitation and stirring because more reactant is being brought within the vicinity of the surface. Furthermore, all crystal planes etch at the same rate, which is natural. Silicon etching in HF:HNO<sub>3</sub> mixture is limited by HF diffusion through the product layer. The activation energy is 17 kJ/mol.

Wet etching processes are available for most materials, with diamond and GaN being the most notable exceptions. Table 11.1 lists a number of etch processes for photoresist masked etching. As a general rule, lower temperature and more dilute solutions can be tried if the photoresist mask does not survive the etching conditions. It must

**Table 11.1** Wet etchants for photoresist masked etching (mostly room temperature)

|                  |                                                                                                                                       |
|------------------|---------------------------------------------------------------------------------------------------------------------------------------|
| SiO <sub>2</sub> | NH <sub>4</sub> F:HF (7:1), <sup>a</sup> 35 °C                                                                                        |
| SiO <sub>2</sub> | NH <sub>4</sub> F: CH <sub>3</sub> COOH:C <sub>2</sub> H <sub>6</sub> O <sub>2</sub> (ethylene glycol): H <sub>2</sub> O (14:32:4:50) |
| Poly-Si          | HF:HNO <sub>3</sub> :H <sub>2</sub> O (6:10:40)                                                                                       |
| Al               | H <sub>3</sub> PO <sub>4</sub> :HNO <sub>3</sub> :H <sub>2</sub> O (80:4:16), water can be changed to acetic acid                     |
| Mo               | H <sub>3</sub> PO <sub>4</sub> :HNO <sub>3</sub> :H <sub>2</sub> O (80:4:16)                                                          |
| W, TiW           | H <sub>2</sub> O <sub>2</sub> :H <sub>2</sub> O (1:1)                                                                                 |
| Cr               | Ce(NH <sub>4</sub> )NO <sub>3</sub> :HNO <sub>3</sub> :H <sub>2</sub> O (1:1:1)                                                       |
| Cu               | HNO <sub>3</sub> :H <sub>2</sub> O (1:1)                                                                                              |
| Ni               | HNO <sub>3</sub> :CH <sub>3</sub> COOH:H <sub>2</sub> SO <sub>4</sub> (5:5:2)                                                         |
| Ti               | HF:H <sub>2</sub> O <sub>2</sub> (1:1)                                                                                                |
| Au               | KI:I <sub>2</sub> :H <sub>2</sub> O, KCN:H <sub>2</sub> O                                                                             |
| Pt, Au           | HNO <sub>3</sub> :HCl (1:3), “aqua regia,” H <sub>2</sub> O dilution may be used                                                      |

<sup>a</sup>Called BHF, for buffered HF, and also known as BOE, for buffered oxide etchant.

be remembered that other factors will change too, for example sidewall profile.

Oxide etch rate goes down linearly with decreasing HF concentration. However, aluminum etch rate goes up when HF concentration decreases: 49% HF etches aluminum at 38 nm/min, but 10:1 diluted HF results in a 320 nm/min rate. This is because water has an active role in aluminum surface oxidation. Buffering agents and other additives can dramatically change etch rates and selectivities, as shown in Table 29.2. Wet etch processes for non-masked etching are listed in Table 11.2. These processes remove films without making any patterns.

Wet etching is an indispensable tool in defect analysis: microstructural defects like stacking faults and pinholes can be made visible by wet etching. Sirtl, Secco, Wright, Dash and Sailor are etchants for delineating defects. In reverse engineering and failure analysis thin films are removed selectively by isotropic etching (wet or dry) to reveal layer by layer the wanted structures.

**Table 11.2** Wet etchants for unmasked etching

|                                |                                                                                  |
|--------------------------------|----------------------------------------------------------------------------------|
| SiO <sub>2</sub>               | DHF, dilute HF (1%), for removing native oxide (about 10 nm/min)                 |
| SiO <sub>2</sub> , PSG         | HF (49%) sacrificial layer removal (>1 μm/min)                                   |
| Si <sub>3</sub> N <sub>4</sub> | HF (49%) layer removal (4 nm/min)                                                |
| Si <sub>3</sub> N <sub>4</sub> | H <sub>3</sub> PO <sub>4</sub> (concentrated, 180 °C)                            |
| Si                             | HF:HNO <sub>3</sub> :H <sub>2</sub> O (6:10:40) (thinning etch, up to 20 μm/min) |
| Cu                             | HCl                                                                              |

Wet etching processes are easy in theory, but in practice wet etching is difficult:

1. Reaction products may affect the etching reaction, for example hydrogen evolves when silicon is etched by hydroxide (KOH, for instance) and this hydrogen can prevent etchant from reaching the surface.
2. Etching products catalyze or inhibit the reaction (HF:HNO<sub>3</sub> etching of silicon).
3. The etching reaction is sensitive to stirring (mass or heat transport limited).
4. The etching reaction is exothermic and temperature rises during etching (for these reactions, stirring decreases the etch rate because it removes heat).
5. Evaporation of etchant leads to concentration changes during etching.

#### 11.4.1 Wet etching tools

Wet etching comes in three major variants: tank immersion (bath) (Figure 11.8), spray tool and single wafer processor. The tank is for example a quartz vessel with heating and temperature control. It is filled with water and chemicals and the wafers are immersed in liquid for a required time and then transferred to similar tanks for rinsing. Spray tools handle a cassette (or cassettes) but, instead of immersion, liquid is sprayed from stationary nozzles on rotating wafer cassette(s). After the first spraying the process continues with either another chemical or water spray and nitrogen drying, in the same vessel. Fresh mixing of chemicals and lower liquid volumes are spray tool advantages over tanks. Single wafer tools are akin to photoresist spinners, and in a sense they are spray tools,



**Figure 11.8** Wet etching tank. Courtesy VTT

too. However, processing acts on a single wafer at a time, and works on the wafer topside only.

Heating wet etching tanks uniformly is no easy task because highly reactive and corrosive chemicals are used at high temperatures (e.g., 180 °C boiling nitric acid to etch nitride). Temperature uniformity depends on flow patterns in the tank. This is not trivial because stirring can enhance reactant supply, reaction product removal, or heat removal from an exothermic reaction.

The materials of the tanks and heaters must be compatible with the process, in chemical, thermal and mechanical respects. Teflon and quartz are often used in the most demanding applications, but both are expensive materials and difficult to machine. Polypropylene is used for less critical applications, while stainless steel is the material for solvent tanks.

## 11.5 Plasma Etching (RIE)

Vertical walls and highly accurate reproduction of photoresist dimensions translate to closely spaced structures. High packing density of devices is possible by anisotropic plasma etching. When undercut becomes significant relative to linewidth, wet etching faces serious problems. In IC fabrication this led to the adoption of plasma etching at about 3 μm linewidths. With anisotropy, that is vertical sidewalls, undercut compensation schemes became unnecessary, and all the resolving power of lithography tools could be used to increase device packing density.

Plasma etching is done in a vacuum chamber by reactive gases excited by RF fields (Figure 11.9). Both excited and ionized species are important for plasma etching. Excited molecules like CF<sub>4</sub>\* are very reactive, and ionic species like CF<sub>3</sub><sup>+</sup> are accelerated by the RF field and impart energy directionally to the surface. Plasma etching is thus a combination of chemical (reactive) and physical (bombardment) processes.

Deep reactive ion etching (DRIE) is an extension of RIE to make deep structures with high etch rate. DRIE reactors have two power generators, one for generating a high-density plasma ( $10^{12}$ – $13$  ions/cm<sup>3</sup>) and another for biasing the wafer electrode, see Figure 33.2. This arrangement enables high powers to be used for active species generation, but independent control of ion bombardment via the second power source. DRIE will be discussed in detail in Chapter 21.

#### 11.5.1 Plasma etch chemistries

In a plasma discharge a number of different mechanisms for gas phase reactions are operative. The discharge generates both ions and excited neutrals, and both are important



**Figure 11.9** Plasma etching system (RIE): gases are introduced through the top electrode; wafers are on the powered bottom electrode

for etching, that is



The most abundant species in the plasma reactor is the source gas. Etch reaction products are the next most abundant, and they may represent a few percent or 10% of all moieties. Excited neutrals may be present at a few percent, but ions are just a very minor component, for example one in 100 000. They are, however, often important for the mechanism.

Plasma etching is based on reaction product volatility. Silicon is easily etched by halogens: the fluorides ( $\text{SiF}_4$ ), chlorides ( $\text{SiCl}_4$ ) and bromides ( $\text{SiBr}_4$ ) of silicon are volatile at room temperature at millitorr pressures. No ion bombardment is needed for etching because the reactions are thermodynamically favored and the role of ion bombardment is used to induce directionality. Silicon nitride ( $\text{Si}_3\text{N}_4$ ) is etched by fluorine, producing  $\text{SiF}_4$  and  $\text{NF}_3$ . Aluminum is spontaneously etched by  $\text{Cl}_2$ , but the surface of aluminum is always protected by a native aluminum oxide layer a few nanometers thick, and aluminum etching can only commence after this oxide has been removed. Ion bombardment is essential for aluminum oxide etching.

**Table 11.3** Typical etch gases

| Fluorine               | Chlorine        | Bromine      | Stabilizers  | Scavengers   |
|------------------------|-----------------|--------------|--------------|--------------|
| $\text{CF}_4$          | $\text{Cl}_2$   | $\text{HBr}$ | $\text{He}$  | $\text{O}_2$ |
| $\text{SF}_6$          | $\text{BCl}_3$  |              | $\text{Ar}$  |              |
| $\text{CHF}_3$         | $\text{SiCl}_4$ |              | $\text{N}_2$ |              |
| $\text{NF}_3$          | $\text{CHCl}_3$ |              |              |              |
| $\text{C}_2\text{F}_6$ |                 |              |              |              |
| $\text{C}_4\text{F}_8$ |                 |              |              |              |
| $\text{XeF}_2$         |                 |              |              |              |

Table 11.3 lists typical plasma etch gases. These include not only the etchant gases but also scavengers and stabilizers.  $\text{SF}_6$ - and  $\text{CF}_4$ -based processes have typically 10–40% oxygen added to them. Oxygen has several roles: it reacts with  $\text{SF}_n$  and  $\text{CF}_n$  fragments, and keeps fluorine concentration high by preventing fluorine recombination with the fragments. Oxygen reacts with resist, which contributes to sidewall polymer formation and therefore improved anisotropy. Oxygen also increases the resist etch rate, so that selectivity can in fact be worse even though the silicon etch rate is enhanced.

### 11.5.2 Plasma etch mechanisms

Chemical bonds need to be broken for etching to take place. Bond energies, therefore, give indications of possible etching reactions. Reactions that lead to bonds stronger than the Si–Si bond will etch silicon, and if the products have stronger bonds than Si–O, silicon dioxide will be etched (Table 11.4). These simple predictions are experimentally confirmed: fluorine, chlorine and bromine will etch silicon because silicon–halogen bonds are stronger than silicon–silicon bonds. Only the Si–F bond is stronger than the Si–O bond and therefore only fluorine is predicted to etch oxide. But because of ion bombardment, oxide is slightly etched also in chlorine and bromine plasmas, but to a much lesser extent than in fluorine plasmas.

In practice, the volatility of reaction products (i.e., high vapor pressure) is used as a criterion for etchant selection. Boiling points of reaction products can be used to estimate volatility (Tables 11.5 and 11.6). (Note that boiling points are usually for 1 atm pressure, not for reduced pressures, but they are very useful as quick estimates for volatility.)

**Table 11.4** Bond energies (kJ/mol)

|       |      |       |     |
|-------|------|-------|-----|
| C–O   | 1080 | Si–F  | 550 |
| Si–O  | 470  | Si–Cl | 403 |
| Si–Si | 227  | Si–Br | 370 |

**Table 11.5** Etch product boiling points ( $T_{bp}$ , °C; d = decomposition, s = sublimation)

|                   |      |                     |      |                   |      |
|-------------------|------|---------------------|------|-------------------|------|
| SiF <sub>4</sub>  | -90  | SiCl <sub>4</sub>   | -70  | CO <sub>2</sub>   | -56  |
| NF <sub>3</sub>   | -206 | AlCl <sub>3</sub>   | 190  | PH <sub>3</sub>   | -133 |
| WF <sub>6</sub>   | 2.5  | GaCl <sub>3</sub>   | 78   | AsH <sub>3</sub>  | -116 |
| WOF <sub>4</sub>  | 110  | TiCl <sub>4</sub>   | -25  |                   |      |
| TaF <sub>5</sub>  | 96.8 | WOCl <sub>4</sub>   | 211  | SiBr <sub>2</sub> | 5.4  |
| MoF <sub>6</sub>  | 17.5 | WCl <sub>6</sub>    | 275  |                   |      |
| MoOF <sub>4</sub> | 98   | InCl <sub>2</sub>   | 235  |                   |      |
| NbF <sub>5</sub>  | 72   | MoCl <sub>5</sub>   | 194  |                   |      |
| GaF <sub>3</sub>  | 800s | PtCl <sub>4</sub>   | 370d |                   |      |
|                   |      | PbCl <sub>4</sub>   | -15  |                   |      |
|                   |      | Cr(CO) <sub>6</sub> | 110d |                   |      |

**Table 11.6** Non-etchable reaction products ( $T_{bp}$ , °C; d = decomposition, s = sublimation)

|                   |       |                  |      |
|-------------------|-------|------------------|------|
| CuCl <sub>2</sub> | 620   | TiF <sub>4</sub> | >400 |
| CuF <sub>2</sub>  | 950d  | PbF <sub>2</sub> | 855  |
| CrCl <sub>2</sub> | 824   | CrF <sub>2</sub> | 1100 |
| AlF <sub>3</sub>  | 1290s | TiF <sub>3</sub> | 1200 |

Reaction products like WOF<sub>4</sub> (from CF<sub>4</sub> and O<sub>2</sub> etching of tungsten) and AlCl<sub>3</sub> (Cl<sub>2</sub> etching of aluminum) have boiling points around 200 °C and are volatile enough for practical etching, but AlF<sub>3</sub> or CrF<sub>2</sub> have boiling points about 1000 °C and therefore fluorine is not a suitable etchant for these materials. Ion bombardment enhances the removal of material and can be used to drive reactions which might otherwise not be suitable for etching. Such reactions are, however, prone to residues. Gallium fluoride is involatile but chlorides are volatile, and therefore GaAs and GaN etching uses chlorine.

The exact plasma etch mechanisms remain unknown in many cases. It has been shown that damaged single crystal tungsten is etched much faster than perfect crystal. Silicon etch rate has been shown to be synergistic, with both ion bombardment and chemical components: etching with argon ion bombardment alone results in low rate, and similarly for XeF<sub>2</sub> gas etching, but for simultaneous Ar<sup>+</sup>/XeF<sub>2</sub> process the etch rate increases by a factor of 10–100.

## 11.6 Isotropic Dry Etching

There are plasma-less dry etching methods: XeF<sub>2</sub> and ClF<sub>3</sub> gases will spontaneously break down and release free fluorine, which aggressively etches silicon. XeF<sub>2</sub> is a crystalline solid which sublimes to produce XeF<sub>2</sub> vapor with a vapor pressure of a few torr. It is very reactive with

water and forms hydrofluoric acid (HF) which presents hazards to operators.

Another dry etching process without plasmas is oxide etching with HF vapor. Because it is based on diffusion of vapors, there is no directionality and the resulting profile is isotropic. This process, too, requires extensive safety precautions because of HF. Isotropic dry etching is very desirable in MEMS, as will be discussed in Section 29.6. The great benefit of dry etching compared to wet etching is the elimination of the drying step and the surface tension-caused mechanical distortions resulting from capillary forces.

## 11.7 Etch Masks

### 11.7.1 Resist selectivity

Usually a vertical resist sidewall is desirable and necessary for the best linewidth control in plasma etching. Most often the resist is, however, slightly sloped, for example 86° or 88° (positive slope), or even negatively sloped (retrograde). Fluorine plasmas are “milder” than chlorine plasmas, and resist selectivity is better, for example 5:1 or 10:1, while in chlorine plasmas 3:1 or 5:1 is more typical. In severe cases 1:1 is seen. This then means that only rather thin layers can be etched.

In wet etching resist selectivity is often infinite in practice, but adhesion of resist becomes an issue. This is dependent on priming, feature size, resist thickness and the chemical character of the resist. Some resists are specifically tailored for wet processing (etching and/or electroplating). Generally, thicker resists are mechanically more stable. If there is any loss of adhesion between the resist and substrate, etching will easily penetrate along the interface, leading to complete resist delamination, or just to a sloped sidewall, which may be beneficial in subsequent deposition steps.

### 11.7.2 Etching with a hard mask

Many wet and dry etching processes utilize hard masks because resists are simply not tolerant enough under harsh etch conditions. Harsh can mean aggressive chlorine plasmas, very long etch times or hot acids or bases like 80 °C 25% KOH for silicon anisotropic etching. In deep submicron processes resist thickness has to be scaled down for maximum lithographic resolution, but these thin resists are not always suitable as etch masks.

Photoresists are materials that combine photoactivity and mechanical/thermal/chemical stability; obviously photoactivity is the property that cannot be sacrificed. In order to find optimum materials as etch masks, the

concept of hard mask has been devised. The hard mask material is etched with photoresist mask, photoresist is then stripped and the etch process is performed using the hard mask only. The hard mask material can be optimized to suit the application, irrespective of photoresist.

In silicon etching in KOH, silicon dioxide or silicon nitride hard masks are standard materials. When glass wafers (or thick oxides) are etched, nickel, chromium, polysilicon and amorphous silicon are suitable masking materials for concentrated HF (49%). Silicon carbide (PECVD SiC) and tantalum pentoxide ( $Ta_2O_5$ ) are excellent hard masks for many wet and dry etching processes.

In DRIE etched depths can be  $500\mu m$  (through the wafer) and resists cannot be used as masks. Many materials are used as hard masks, such as silicon dioxide and aluminum. Aluminum nitride (AlN) and aluminum oxide ( $Al_2O_3$ ) are very resistant in many plasmas, but can be easily wet etched by KOH and even dilute NaOH photoresist developer. This fact can sometimes make processing much faster and easier, compared to other hard masks that are very stable materials (which is why they were chosen in the first place).

RIE processes that use  $Cl_2$  chemistry use metals like chromium or nickel as etch masks. However, the use of metal masks poses a problem in plasma etching. Even though the mask is chemically inert, it is always etched somewhat under ion bombardment. Redeposition of these non-volatile ion-etched species on the surfaces leads to non-etchable spots. This is called micromasking. In the case of perfect anisotropy micromasking leads to the formation of dense forest of pillars or grass.

Polysilicon gate etching can be done with an oxide hard mask. Because poly etching is highly selective against gate oxide, it is also highly selective against oxide hard mask, therefore a very thin oxide hard mask is enough, and very thin photoresist can be used to etch this hard mask. Elimination of carbon (i.e., elimination of photoresist) from the reaction brings about a major selectivity improvement: the selectivity between poly and oxide can be as high as 300:1 compared to 30:1 with resist mask, while keeping plasma parameters (RF power, pressure and gas flows) constant. In the presence of carbon CO is formed because it is energetically favorable (Table 11.4) and the source of carbon for CO formation is the photoresist, hence the lower oxide selectivity. In the absence of carbon/resist, no CO is formed.

## 11.8 Non-Masked Etching

Plasma etching replaced wet etching because of less undercut and better linewidth control. But this argument



**Figure 11.10** Etchback: (a) the trench wall is coated with a thin Ti/TiN layer and filled with CVD tungsten; (b) tungsten etchback will result in a planar surface and filled vertical plug

applies to patterning etching only: there are plenty of applications where etching is not used for making patterns, therefore there is neither photoresist nor hard mask.

Etchback is a process where extra material is removed, with the whole wafer exposed to plasma. In tungsten etchback, tungsten is removed from planar areas, leaving only the contact/via plugs filled (Figure 11.10). Etchback is used in multilevel metallization to make tungsten plugs (see Chapter 28 for more on this).

Non-masked wet etching removes for instance diffusion mask oxide after diffusion, and similarly, mask nitrides are removed in  $H_3PO_4$  or concentrated HF (which would immediately attack resist). Dilute HF removes native oxides. High rate maskless silicon etching is needed in wafer thinning.

## 11.9 Multistep and Multilayer Etching

Etching a single layer structure can be accomplished in a single step, but multistep etching can be used for improved process control. In polysilicon gate etching a three-step process is typical. The first step removes native oxide in a short, non-selective process. Bulk etching is then done, but before the end point is reached, the process is switched to a low-rate, high-selectivity recipe. Note that the underlying oxide loss is a sum of four different factors: (1) polysilicon film (non)uniformity; (2) polysilicon etch process (non)uniformity; (3) poly:oxide selectivity; and (4) overetch time.

### 11.9.1 Multilayer etching

Thin-film functionalities are enhanced by multilayer structures. This is bad news for etch engineers, because there is no guarantee that the materials behave at all similarly in etching.

It seldom happens that both (or all) layers can be etched with the same process parameters, and it may well be



**Figure 11.11** Double layer plasma etching: simultaneous optimization of both films is not easy – photoresist is still in place

that completely different etch chemistries must be used. In two-step double layer etching an end point signal must be obtained so that etching can be stopped, or else etch chemistry must provide high selectivity. High selectivity, however, is not always beneficial: if TiN on top of aluminum is etched in fluorine plasma, etching will definitely stop once the underlying aluminum is met, but the aluminum surface will turn to AlF<sub>3</sub>, which is a very stable material, and initiation of the aluminum etch step is endangered. Etching of polycide gate (WSi<sub>2</sub>/poly) is shown in Figure 11.11 with potential profile issues.

### Process flow for WSi<sub>2</sub> /polysilicon (polycide) etching

1. WSi<sub>2</sub> etching: Cl<sub>2</sub>/He/O<sub>2</sub> for WSi<sub>2</sub>
2. Poly etching: Cl<sub>2</sub>/HBr for poly
3. Poly end point step: HBr/He/O<sub>2</sub> for etching last 20 nm of poly
4. Overetch step: HBr/He/O<sub>2</sub> optimized for high oxide selectivity

Etching of the bottom layer has all the usual requirements about rate, selectivity and profile, and the extra requirement of not etching the top layer. Of course the acceptable degree of undercut in either of the layers calls for engineering judgment. Problems with film stacks which require different etch chemistries (chlorine vs. fluorine) has led to multichamber etch reactors, with each chamber reserved for one material and/or specific etch chemistry. This will be discussed in Chapter 31.

## 11.10 Etch Processes for Common Materials

### 11.10.1 Silicon

Fluorine, chlorine and bromine processes are standard for silicon etching, resulting in reaction products SiF<sub>4</sub>, SiCl<sub>4</sub>

and SiBr<sub>4</sub>, respectively. Single crystal silicon, polysilicon and amorphous silicon can all be etched with similar etch processes, which is not necessarily true for wet etching.

Fluorine processes are safer to use, but seldom fully anisotropic. Chlorine processes which have less spontaneous chemical etching and more ion enhancement result in vertical sidewalls, and the same applies to bromine processes. These two gases are, however, highly toxic, and the equipment for Cl<sub>2</sub> or HBr etching must be equipped with loadlocks. Loadlocks complicate system operation but simultaneously improve reproducibility since the reaction chamber is not exposed to room air and humidity.

Silicon isotropic wet etching is often done in HF:HNO<sub>3</sub>:CH<sub>3</sub>COOH (sometimes water is used instead of acetic acid). Depending on the ratio of nitric acid to hydrofluoric acid, the etch rate can be modified from a few hundred nanometers per minute to tens of micrometers per minute. A common mixture is 1:3:8 which etches silicon at about 3 μm/min at room temperature. Hydroxyl ions (OH<sup>-</sup>) form according to



Hydroxyl ions react with silicon to form silicon dioxide, Equation 11.6. Holes (h<sup>+</sup>) are needed in the reaction to balance charges. HF finally etches the oxide. The overall reaction is given by Equation 11.7:



The reaction is problematic for many reasons. First of all, it is autocatalytic: nitric acid (HNO<sub>3</sub>) produces nitrous acid (HNO<sub>2</sub>) which reacts with nitric acid to produce more nitrous acid. The reaction is also very exothermic, releasing a lot of heat which accelerates the reaction. If the etchant is stirred, heat is carried away, reducing the etch rate. Because the reaction needs holes, its rate depends on silicon doping level. The rate is high for highly doped silicon but decreases rapidly when doping concentration falls below 10<sup>17</sup> cm<sup>-3</sup>. This has applications in MEMS, like etching highly doped epitaxial layers, see Figures 22.7 and 30.7.

### 11.10.2 Silicon dioxide

Silicon dioxide plasma etching is driven by ion bombardment. Isotropic plasma etching of oxide is therefore difficult, but a high enough radical concentration will result in reasonable isotropic etch rates. These processes are, however, not selective against silicon. Any fluorine-containing

gas can be used as an etchant for oxide,  $\text{CF}_4$  or  $\text{SF}_6$  for example. But both gases etch silicon too, and they are suitable for non-selective etching only.

$\text{CHF}_3$ ,  $\text{C}_2\text{F}_6$  and  $\text{C}_4\text{F}_8$  are used as oxide etch gases when selectivity against silicon is required. They provide fluorine and carbon for etching ( $\text{SiF}_4$ ,  $\text{CO}_2$  etch products) and  $\text{CF}_2^*$  radicals which are polymer precursors. Polymerization takes place on silicon surfaces, whereas on oxide surfaces ( $\text{CF}_2$ )<sub>n</sub> polymerization does not take place due to oxygen supply: ion bombardment-induced reactions on oxide result in  $\text{CO}_2$  formation.

A three-step oxide etch process consists of a bulk etching step, an end point step which is highly selective (and polymerizing), followed by a low-power step that removes polymeric residues: a few extra nanometers of silicon are lost in the low-power etch step but the wafer cleaning that follows will be much easier.

Wet etching of oxide with HF-based etchants offers high selectivity against silicon, and if the only role of oxide etching is to remove oxide, not to make patterns, it is often advisable to use wet etching, while plasma etching excels in pattern etching. A major problem in wet etching of oxides is that different CVD oxides etch at widely different rates. Therefore thermal oxide etch rate is usually given as a reference.

### 11.10.3 Silicon nitride

Nitride etching has aspects of both silicon and oxide etching.  $\text{SF}_6$ - and  $\text{CF}_4$ -based plasma processes etch nitride quickly and isotropically, and without selectivity against silicon. They do, however, show some selectivity against oxide, for instance 2:1.  $\text{CHF}_3$ -based processes, on the other hand, etch nitride and provide selectivity against silicon. In fact,  $\text{CHF}_3$  and  $\text{C}_4\text{F}_8$ -based oxide etch processes usually perform well as nitride etch processes too, and result in anisotropic profiles, unlike  $\text{SF}_6$ - and  $\text{CF}_4$ -based processes.

The wet etchant for  $\text{Si}_3\text{N}_4$  is boiling concentrated phosphoric acid. Photoresist cannot tolerate such etching conditions. Instead, oxide is used as an etch mask: CVD oxide is deposited on top of nitride, and oxide is patterned by photoresist and etched with HF. After resist stripping oxide acts as a mask for nitride etching. When  $\text{CF}_4$  plasma was found to etch nitride, manufacturers were willing to invest in plasma etching, even though it was immature technology and not very production worthy, just because the wet etching process was so difficult.

### 11.10.4 Aluminum

Aluminum has a native oxide  $\text{Al}_2\text{O}_3$  which is very difficult to etch. Chlorine and chlorine-containing gases are

used, with  $\text{AlCl}_3$  as the main etch product. Multistep etching is needed to etch aluminum: in the first few seconds high power is used to ion-etch native  $\text{Al}_2\text{O}_3$  away; power is then reduced to etch the bulk of aluminum. Aluminum is spontaneously etched in  $\text{Cl}_2$ , and a polymerizing agent is needed to passivate the sidewalls for anisotropic profile;  $\text{CHCl}_3$  and  $\text{CH}_4$  are often used. In some low-pressure reactors  $\text{Cl}_2/\text{BCl}_3$  gases without polymer-forming gases will result in clean, anisotropic profiles. Nitrogen or argon is often added to stabilize the plasma and to improve photoresist selectivity.

If linewidths are large, phosphoric acid-based wet etching is very simple. Above 3  $\mu\text{m}$  linewidths  $\text{H}_3\text{PO}_4$  etchant will perform nicely, especially if the film is so thin that undercutting can be neglected.

### 11.10.5 Copper

Copper is not plasma etched in current microfabrication processes. It is a difficult material to etch because neither fluorides ( $\text{CuF}_2$ ) nor chlorides ( $\text{CuCl}_2$ ) are volatile at room temperature. Increased temperature will help, but even at 100–200 °C the rate is still low and photoresist is severely attacked. If aluminum is alloyed with copper (to improve electromigration resistance), aluminum etching will be difficult for the same reason. Al–0.5% Cu is still fairly easy to etch but Al–4% Cu leaves a residue of copper chlorides that is difficult to remove.

Wet etching of copper can be done by a number of acidic solutions. The criteria are really resist stability, metal thickness and other factors. Copper patterns can be made by two other techniques instead of etching: electroplating into a resist mold (Figures 5.11 and 5.12), or by depositing in a groove and polishing away the excess copper (Figure 16.1).

### 11.10.6 Refractory metals and silicides

Tungsten etching is similar to silicon in many respects. In fluorine plasmas the reaction product is  $\text{WF}_6$ ; in oxygen–halogen plasmas,  $\text{WO}_4$  or  $\text{WOCl}_4$ . Tungsten hexafluoride has a boiling point of 17 °C and an isotropic etching profile easily results. Oxyfluorides and oxychlorides are less volatile and ion bombardment is needed to remove them completely, which translates to better anisotropy. Molybdenum, too, is etched by both chlorine and fluorine plasmas, with or without oxygen. For titanium etching, chlorine etching is preferred, but fluorine etching is possible, and for TiW (30 at. % Ti)  $\text{SF}_6$  is a typical choice. Tantalum and niobium are etched similarly. Silicides  $\text{WSi}_2$ ,  $\text{MoSi}_2$  and  $\text{TaSi}_2$  are etched with both fluorine and chlorine-based processes which

are similar to silicon and refractory metal etch processes, for example SF<sub>6</sub>/O<sub>2</sub> for tungsten silicide or Cl<sub>2</sub>/O<sub>2</sub> for MoSi<sub>2</sub>.

Refractory metals are wet etched by hydrogen peroxide-based solutions. These are often harsh on photoresists, and therefore etching must be done at room temperature and in dilute solution.

## 11.11 Ion Beam Etching

Accelerated ions will etch all materials. This technology is known as ion beam etching (ion milling). It is, however, difficult to find suitable non-eroding masking materials: if all materials can be etched by ion bombardment, this applies to masking materials as well. Typical ion milling rates by argon ions are about 20 nm/min for Si, SiO<sub>2</sub>, SiC, Si<sub>3</sub>N<sub>4</sub> and resist, and 20–100 nm/min for metals. Ion milling is therefore much slower than plasma etching. One of the benefits of ion milling is inclined etching: the wafer can be tilted relative to the ion beam, and inclined structures can be made. Many solid state laser and magnetic materials (of the type Gd<sub>3</sub>Ga<sub>5</sub>O<sub>12</sub>, gadolinium gallium garnet) are etched by ion beam etching.

## 11.12 Etch Process Characteristics

### 11.12.1 Linewidth and profile

Linewidth is also known as CD, for critical dimension, in the IC industry. Linewidth measurement checks any deviation from design values. A deviation of 10% is acceptable for digital devices, but the error budget has to be divided between lithography and etching.

The sidewall profile of the finished feature has important implications for subsequent process steps: the step coverage of the next deposition process depends on it. The profile can be measured from top view optical or SEM measurements, but destructive cross-sectional SEM pictures are considered the ultimate profiles.

Line edges are seldom abrupt, and judgment must be used to locate a line edge properly. Real lines do not have perfectly vertical sidewalls but sloped, or even retrograde, walls (Figure 11.12), with edge roughness which can be a significant fraction of the linewidth for narrow lines. Multiple scans must be made to average over edge roughness. Substrate and film roughness add noise to stylus measurements, and, for soft materials, stylus penetration can be a



**Figure 11.12** Line profiles: left, ideal vertical wall; middle, retrograde wall; right, positively sloped wall with rough edge

problem. Linewidth can also be measured electrically, as was discussed in Chapter 2.

### 11.12.2 Selectivity

Selectivity is a measure of etch rate ratios (ERRs). It can be defined between the film and substrate and between the film and masking material. Selectivities range from 1:1 to 100:1 in typical plasma etching processes. Resist selectivities range from 1:1 to 10:1 in plasma etching (with 100:1 possible). In wet etching resist selectivity is often so good that it does not need to be considered, but resist adhesion loss and peel-off are severe limitations. When polymeric films are etched, selectivity and photoresist stripping are problematic: resist is polymeric material, too, and selectivity between two similar materials is difficult to achieve.

Etch stop is the term used for etching processes where the selectivity is so high that etching essentially stops when the underlying material is reached. This will be discussed in more detail in Chapter 20 because it has important implications in bulk MEMS.

### 11.12.3 Etch rate and etch time

Etch rate should be determined by plotting etched depth against etch time for a number of data points, and taking the slope as the rate. There are many mechanisms which make etch rate determination less straightforward than expected. There may be etch initiation lag: for example, a thin native oxide layer will prevent etching initially. This is especially true for silicon and aluminum. Then there are etch slowdown effects: etch rate in deep narrow structures is not identical to flat open areas, and as etching continues and the hole becomes deeper, etch rate will slow down. This issue is covered in more detail in Chapter 21.

Etch time seems like a simple concept: film thickness divided by etch rate. Add corrections for both film thickness and etch process non-uniformity, say 5% for each, and etch time results. Overetch required to clear film on



**Figure 11.13** Spacer formation: top, thin-film deposition over a step; bottom, anisotropic etching to clear film from horizontal surfaces leaves spacers at feature edges

planar samples amounts then to 10–20% depending on process details.

But when the films to be etched run over topography, the situation changes dramatically. Film thickness at the edge of a step will be the sum of the film thickness and step height. If anisotropic etching is stopped at the end point calculated from planar film thickness, residue equal to original step height remains at the edge (Figure 11.13). Long overetch will eventually remove this residue but this puts a high demand on etch selectivity between the two materials.

Sometimes it is desirable to leave this residue in place, and utilize it in the fabrication process. It is then termed spacer. Spacers have important applications in both MOS and bipolar transistor fabrication, as will be seen in Chapters 26 and 27. Note that it is essential for spacer formation that etching is anisotropic: in isotropic etching sideways etching would remove the material at the step edge.

If the steps are made of conducting material and the spacer is dielectric (e.g. CVD oxide), the spacer can be left in place. But if the steps are made of dielectric material and conductive lines go over them, the conductive spacers will short all the lines. Extended overetch is then needed to ensure complete spacer removal.

#### 11.12.4 Spacer lithography

The concept of spacer lithography (also known as sidewall lithography and double patterning) was originally



**Figure 11.14** Spacer lithography: after the formation of spacers the original pattern is etched away, and the spacers act as etch masks for the next etching step

introduced in 1984 when the end of optical lithography was speculated. It was used to fabricate MOS ICs in an industrial setting, and now it is again being considered for extending the life of optical lithography.

In spacer lithography the sidewalls form the final structure; the initial structure will be selectively etched away after spacer formation (Figure 11.14). The width of the structure is determined by the film thickness, which is easily controlled even in the tens of nanometers range. Of course additional lithography steps are needed to complete the patterns, but they are not critical lithography steps.

## 11.13 Selecting Etch Processes

When narrow lines need to be made, the obvious choice is plasma etching. But in many applications the choice of wet vs. plasma etching is a question of convenience: certain equipment or an etch bath is available or some suitable masking material is handy. When spherical etch profiles are required, or when undercutting is needed, isotropic etching must be used. Isotropic etching of silicon can easily be done by  $SF_6$  plasmas, and isotropic etching of silicon dioxide or glass by HF, both at fairly high rates, even tens of microns per minute.

The microbolometer of Figure 11.15 requires two lithography and three etching steps in its fabrication. The first lithography step defines the resistor pattern; anisotropic plasma etching defines the resistor. The second lithography step defines openings in the oxide, and the same photoresist serves as a mask for both anisotropic etching of oxide in  $CHF_3$  plasma and isotropic



**Figure 11.15** Bolometer fabrication process: left, resistor lithography and etching; right, second lithography, oxide etching and silicon isotropic etching

silicon etching in  $\text{SF}_6$  plasma. A SEM micrograph of the finished device is shown in Figure 11.16.

## 11.14 Exercises

- What would you use as plasma etch gases and etch masks for etching the following materials: diamond
  - TiN
  - SiC
  - InP
  - GaN
  - GaAs
  - $\text{PbZrTiO}_3$
  - BCB (benzo cyclobutadiene)?



**Figure 11.16** Spiral antenna microbolometer: silicon is isotropically etched to release the narrow resistor. SEM courtesy Leif Grönberg, VTT

- Silicon is etched in plasma according to the reaction  $\text{Si (s)} + 2 \text{Cl}_2 (\text{g}) \Rightarrow \text{SiCl}_4 (\text{g})$ . What is the theoretical maximum etch rate of a 200 mm diameter silicon wafer when chlorine flow is 100 sccm (standard cubic centimeters per minute)?
- Polysilicon etched depth in chlorine plasma is given in the table below. Determine the etch rate.

| Time<br>(s) | Depth<br>(nm) |
|-------------|---------------|
| 20          | 70            |
| 40          | 170           |
| 60          | 320           |
| 80          | 415           |

- The etch rate of <100>-silicon in 20% TMAH is given below. What is the activation energy?

| Temperature<br>(°C) | Rate<br>(μm/h) |
|---------------------|----------------|
| 60                  | 29             |
| 70                  | 36             |
| 80                  | 62             |
| 90                  | 87             |

- How much underlying oxide is lost when a tungsten film 500 nm thick is etched from a sample that has 300 nm steps on it? Tungsten:oxide selectivity is 10:1.

6. How much dimensional error does chromium wet etching introduce to (a) 1× photomasks; (b) 5× reticles?
7. What problems will be met in trying to compensate wet etching undercut by drawing lines on the mask broader?
8. If oxide film thickness is 500 nm, oxide non-uniformity  $\pm 5\%$  and oxide etching non-uniformity similarly  $\pm 5\%$ , plot the underlying silicon loss as a function of oxide:silicon etch selectivity!
9. Draw cross-sections of silicon etching with oxide hard mask for the following cases:
  - (a) KOH etching
  - (b) Cl<sub>1</sub> etching
  - (c) HF:HNO<sub>3</sub> etching
  - (d) TMAH etching
  - (e) SF<sub>6</sub> etching
  - (f) H<sub>3</sub>PO<sub>4</sub> etching.
10. How can the following structures be etched?



11. What is the difference in making inside vs. outside spacers by anisotropic etching?

## References and Related Reading

- Arana, L. R. *et al.* (2007) Isotropic etching of silicon in fluorine gas for MEMS micromachining, *J. Micromech. Microeng.*, **17**, 384–392.
- Armacost, M. *et al.* (1999) Plasma-etching processes for ULSI semiconductor circuits, *IBM J. Res. Dev.*, **43**, 39.
- Bell, F. H. and O. Joubert (1996) Polysilicon gate etching in high density plasmas, *J. Vac. Sci. Technol.*, **B14**, 3473.
- Clawson, A. R. (2001) Guide to references on III-V semiconductor chemical etching, *Mater. Sci. Eng.*, **31**, 1–438.
- Kiihamäki, J. *et al.* (2000) Depth and profile control in plasma etched MEMS structures, *Sens. Actuators*, **82**, 234–238.
- Lang, W (1996) Silicon microstructuring technology, *Mater. Sci. Eng.*, **R17**, 1–55.
- Mukerjee., E. V. *et al.* (2000) Vaporizing liquid microthruster, *Sens. Actuators*, **83**, 231–236.
- Oehrlein, G. S. and J. F. Rembetski (1992) Plasma-based dry etching techniques in the silicon integrated circuit technology, *IBM J. Res. Dev.*, **36**, 140.
- Sainiemi, L. and S. Franssila (2008) RIE, in D. Li (ed.) **Encyclopedia of Micro and Nanofluidics**, Springer.
- Walker, P. and W. H. Tarn (eds) (1991) **Handbook of Metal Etchants**, CRC Press.
- Williams, K. R. and R. S. Muller (1996) Etch rates for micromachining processes, *J. Microelectromech. Syst.*, **5**, 256–269.
- Williams, K. R. and R. S. Muller (2003) Etch rates for micromachining processes II, *J. Microelectromech. Syst.*, **12**, 761–778.

# 12

## Wafer Cleaning and Surface Preparation

Microfabrication takes place under highly controlled conditions: all materials for cleanroom construction, processing equipment and wafer handling tools are carefully selected to eliminate particles and to reduce atomic contamination. Water, gases and chemicals are purified and filtered. These are, however, passive precautions, and active wafer cleaning and surface conditioning must be undertaken before practically every major process step. These preparatory steps can account for up to 30% of all process steps.

Wafer cleaning is about removing particles and unwanted atoms and films, but it is also about leaving the surface in a known and controlled condition. This means damage removal, surface chemistry tailoring and proper hydrophobicity/hydrophilicity. As shown in Figure 12.1, ammonia/peroxide cleaning will render a silicon surface oxidized and hydrophilic while HF treatment results in a hydrogen-terminated hydrophobic surface.

In some cases surface preparation involves intentionally depositing films on the surface rather than removing material, as the term “cleaning” would hint. Many cleaning chemistries result in thin-film deposition, for example



**Figure 12.1** Surfaces treatments: (a) hydrophilic silicon surface after ammonia/peroxide cleaning attracts water; (b) hydrophobic silicon surface after HF cleaning repels water. Reproduced from Hattori (1998)

sulfuric and nitric acids will oxidize silicon surfaces, resulting in films 1 or 2 nanometers thick. The next process step after cleaning should commence immediately, before particle deposition and surface chemical reactions take place.

### 12.1 Classes of Contamination

Air cleanliness in an advanced cleanroom is so good that airborne particles are no longer the main contamination source. The human contribution has also been reduced significantly with correct gowning and working procedures, or by factory automation. These matters are dealt in more detail in Chapter 35 on cleanrooms.

The main sources of contamination are the fabrication processes themselves: films flaking from deposition chamber walls, vapors rising from wet benches, resist debris on wafer edges. Because of device size downscaling, contamination becomes evermore critical. Finer patterns demand control of finer particles, ultrathin films cannot tolerate unwanted atoms, and low leakage currents necessitate low metal contamination levels. The purity of starting materials is important: liquid chemicals can have impurity concentrations measured in parts per trillion (ppt) and sputtering target purities are for example 99.999%. Similar “5Nine” purities are typical for many process gases, but some applications need 99.99999% (7N) purity.

Contamination comes in various forms, which have different sources, effects on device and cleaning methods. The main classes of contamination are:

- particles
- metals
- organics
- native oxide.

Adsorbed water could be called contamination, too. Baking at elevated temperature removes water, and baking is

a standard preparation step in many processes. Water can be very persistent, and not all water is removed before a 400 °C bake.

Particles greater than about a third of minimum linewidth are potential causes of fatal flaws, for example shorts between conductors and even smaller particles can cause pinholes during film growth. The minimum dimension is really the thickness of the thinnest film, not the narrowest line. Particles are also a major concern in wafer bonding (the subject of Chapter 17).

Metal contamination cannot be avoided as long as machine parts are made of metals, so it has to be controlled by cleaning. Metal contamination on surfaces can spread into the silicon bulk, and metal precipitates in the bulk act as charge carrier traps. If the metals segregate into the oxide during oxidation, they can prevent, retard or degrade oxide film growth, and result in poor-quality oxides.

Organics often work through prevention of the cleaning process. This has to do with the generally hydrophobic nature of organics and repellency of water-based cleaning solutions. Organic materials in cleanrooms include photoresists, HMDS, various solvents (isopropyl alcohol (IPA), acetone, MEK, PGMEA), wafer boxes and chemical containers.

Native oxide films grow readily on silicon, aluminum and titanium, for instance. Growth is not instantaneous, however, and proper surface finishing can protect the surfaces for extended periods of time. Hydrophobic (HF-last) cleaning chemistry results in hydrogen-terminated surfaces (Figure 12.1) that can survive a week without native oxide forming, if the oxygen concentration in the ambient is low enough (e.g., <0.1 ppm). In normal cleanroom air, 0.5 nm native oxide film will grow in a few hours.

Native oxides degrade contacts, prevent epitaxial growth, cause defects at the substrate/epi interface and prevent solid state reactions like silicide formation. HF etching is a typical last step before oxidation and also before epitaxy. However, in epitaxy an additional in situ cleaning just prior to deposition is also done (Figure 34.7).

## 12.2 Chemical Wet Cleaning

Acid, base and solvent wet cleaning are the main methods of cleaning. Chemical wet cleaning is a simple, yet powerful method with high throughput. Wet benches are cheap and reliable tools but chemical consumption can become very high because the chemicals need to be changed regularly, several times a day, otherwise there is a risk of redeposition of contaminants from the cleaning solutions back to the wafers. Obviously, very high-purity chemicals must be used, otherwise the cleaning baths would become contamination sources themselves.

Practical cleaning processes use many different chemicals in sequence, each designed to attack different contamination: in a classic RCA clean the first ammonia/peroxide clean removes particles and organics, the hydrogen chloride/peroxide step is effective against metallic contamination and the HF step removes any oxide. Table 12.1 presents the most common wet cleaning chemicals. The water in cleaning solutions is de-ionized water (DI water), also known as ultrapure water (UPW).

The two main mechanisms for wet cleaning are:

1. Dissolution/decomposition.
2. Etching.

They have a very important distinction as far as surface roughness is concerned: etching processes tend to make surfaces rougher. This effect can be seen in subsequent process steps: LPCVD polysilicon deposited on thermal oxide coming directly from a furnace has larger grain size than poly that is deposited on freshly cleaned oxide. Increased surface roughness from peroxide cleaning solution leads to a larger number of nuclei, therefore smaller grain size. If the cleaned wafer is annealed, the surface will become smoother, and subsequently grain size will be larger.

Ammonia/peroxide solution works by oxidizing the silicon surface, and subsequently etching the oxide away:



Silicon etch rate in ammonia/peroxide is about 0.5 nm/min and a typical 10 min clean thus results in about 5 nm silicon etching. This leads to undercutting and removal of the particles. For modern CMOS fabrication, however, 5 nm silicon loss is unacceptably large.

The compositions given above are the traditional ones, and recently there has been a trend toward more dilute and lower temperature cleaning solutions. For example, RCA-1 clean of composition 0.1:1:5 is also used, and even ammonia water without peroxide has been used. Similarly, dilute HCl clean can replace RCA-2 in some applications.

**Table 12.1** Wet cleaning solutions

| Name/alias                                                                         | Chemical composition                                                       | Temp./time         |
|------------------------------------------------------------------------------------|----------------------------------------------------------------------------|--------------------|
| RCA-1 (SC-1, APM ammonia-peroxide)                                                 | NH <sub>4</sub> OH:H <sub>2</sub> O <sub>2</sub> :H <sub>2</sub> O (1:1:5) | 70–85 °C/10–20 min |
| RCA-2 (SC-2, HPM, hydrochloric acid-peroxide)                                      | HCl:H <sub>2</sub> O <sub>2</sub> :H <sub>2</sub> O (1:1:6)                | 70–85 °C/10–20 min |
| SPM sulfuric acid peroxide mixture, Piranha sulfuric peroxide mixture, or. Piranha | H <sub>2</sub> SO <sub>4</sub> :H <sub>2</sub> O <sub>2</sub> (4:1)        | 120 °C/10–20 min   |
| DHF (dilute HF)                                                                    | HF:H <sub>2</sub> O (1:100–1000)                                           | room temp./30 s    |
| Standard chemicals come in the following concentrations:                           |                                                                            |                    |
| HCl 37%                                                                            |                                                                            |                    |
| H <sub>2</sub> SO <sub>4</sub> 96%                                                 |                                                                            |                    |
| H <sub>2</sub> O <sub>2</sub> 30%                                                  |                                                                            |                    |
| NH <sub>4</sub> OH 29%                                                             |                                                                            |                    |
| HF 49%                                                                             |                                                                            |                    |

**Preparation:** The order of mixing and heating is important. Water is the first component, acids are added later, to prevent excessive heat generation. Hydrogen peroxide is added to a heated solution just prior to use, to minimize thermal decomposition.

**Bath life:** RCA clean is often used just once. Alternatively, peroxide can be added to prolong bath life (dubbed “spiking”). SPM bath life is similarly elongated by adding more peroxide.

**Disposal:** HF requires a separate disposal system because its health effects are different from other mineral acids, which may all be collected in the same container (volume and disposal rate limited). Sometimes acids which contain heavy metals must be collected separately (e.g., titanium or cobalt containing salicide etchants).

Cleaning bare silicon wafers is straightforward and the results can be measured at ease. Complexity increases when more materials are present: the cleaning method has to work on all these materials with similar efficiency. Measurements, too, become more difficult because, for example, optically transparent and opaque materials reflect laser light differently, and minimum detectable particle size gets bigger. Various limitations kick in when the first metal is deposited: acid cleans are then forbidden because acids dissolve and corrode metals.

With deep trenches and other highly 3D structures wetting ability becomes critical, and drying becomes more difficult too. The use of surfactants can improve wetting but they introduce problems of their own, namely possible organic residues. The measurement of surface properties (sidewall properties, really) in deep narrow trenches is a formidable task, even in the research phase where time and effort can be used to measure a few data points. Monitoring in a manufacturing environment is even more difficult because measurement speed becomes an issue.

Chemical consumption in wet benches is a major environmental concern. With larger wafer sizes, larger tanks have to be used, with increasing volumes of expensive high-purity liquids, which are dangerous to handle and which have to be disposed of under controlled conditions. CMOS fabrication on 200 mm wafers consumes thousands of liters of ultrapure water and tens of kilograms of liquid chemicals are required. Hundreds of liters of acid waste are produced. Rinse water can be recycled, and acid recovery and reuse are also common practices. Spin

processing (Figure 12.2) is more economical in chemical consumption.

## 12.3 Physical Wet Cleaning

Physical cleaning is mostly about particle removal. But the difference between chemical and physical wet cleaning is not clear cut: often chemical cleaning is enhanced by the application of physical forces, for example ultrasonic energy or spraying. Three methods of physical wet removal of particles are widely used:

1. Brush scrubbing.
2. Jet scrubbing/nozzle.
3. Ultrasonic/megasonic.

In brush scrubbing, nylon or PVA brushes move water close to the wafer surface and induce flow that brushes away the particles. This is especially effective when lots of particles or large particles are present on the wafer. Therefore, brush scrubbing is very useful in removing particles after wafer scribing or chemical mechanical polishing (Chapter 16).

In jet scrubbing, high-pressure water is sprayed on the wafer. The removal mechanism is similar to brush scrubbing. Increasing pressure improves cleaning efficiency, but electrostatic charging can damage thin films.

In sonic cleaning, shock waves supply sound energy that helps in particle removal. Ultrasonic agitation



**Figure 12.2** Wet cleaning: spin processor. Courtesy VTT

(20–40 kHz) is also beneficial in the wet removal of photoresist. However, cavitation (exploding bubbles) may damage the wafers. Above 1 MHz this is not an issue, and the method is termed “megasonics.” However, the removal efficiency is not very good for particles smaller than 300 nm.

## 12.4 Rinsing and Drying

Rinsing in DI water and drying must be considered as essential parts of any wet cleaning process. As a general strategy the wafer should be kept wet all along the cleaning process and minimize the number of times when wafers are drawn from liquid to air. When drying is required, there are a number of methods available: spinning, nitrogen blowing, vapor drying, lamp drying, vacuum drying; dry wafers can also emerge by slow removal from hot DI water. Spinning techniques are prone to charging and particle adherence, which are inherent in high-speed

spinning equipment. Various isopropyl alcohol (IPA) drying methods rely on the low surface tension and good wettability of IPA. In Marangoni drying, the wafer is drawn from water into an IPA–nitrogen atmosphere, and water is pulled back, leaving a dry surface. IPA drying methods must be considered for chemical consumption, hot vapors and solvent accumulation.

## 12.5 Dry Cleaning

Gases, vapors and plasmas can be used to clean wafers (Table 12.2). Because it is easy to integrate process modules with similar pressure and temperature regimes, it is attractive to integrate dry cleaning into a plasma and vacuum tool, for example plasma etchers, sputters, PECVD, RTP and single wafer epitaxial reactors.

Compared to wet cleaning, dry cleaning has two very advantageous features: there are no surface tension effects in small structures and no drying is necessary.

UV ozone has been tried for organics removal, UV Cl<sub>2</sub> for metal removal and HF vapor for native oxides. Argon and H<sub>2</sub> plasmas have also been utilized, in sputtering systems, to improve contact by etching oxide just prior to metal deposition. Dry cleaning has a central role in epitaxial systems where the utmost surface cleanliness is mandatory. Thin oxides can be desorbed by a hydrogen bake. The exact temperatures depend on surface termination: hydrogen-terminated surfaces can be baked at temperatures as low as 700 °C to reveal a perfect surface for epitaxy. To date, however, dry cleaning has remained a special method, especially because it is difficult to remove particle contamination with dry methods.

## 12.6 Particle Removal

Particle contamination is dangerous in lithography, but lithography is rather insensitive to metal ion contamination. Deposition processes are sensitive to small particles that can “grow” in size during deposition: the film encapsulates the particle. This may eliminate the particle as an

**Table 12.2** Dry cleaning agents

|          |                                              |
|----------|----------------------------------------------|
| Vapors   | Anhydrous HF                                 |
| Gases    | H <sub>2</sub> , HCl                         |
| Ions     | Ar+                                          |
| Atoms    | Si                                           |
| Photons  | UV (with Cl <sub>2</sub> or O <sub>3</sub> ) |
| Plasmas  | CF <sub>4</sub>                              |
| Aerosols | CO <sub>2</sub>                              |

**Table 12.3** Sources of particles

- Chemical reactions in deposition and etching
- Moving parts in tools: robot arms, valves, doors
- Static parts: wafer holders, cassettes, O-rings
- Vacuum: pumping, venting, condensation
- Gases, chemicals, water
- Cleanroom personnel

electrical or chemical contaminant but it will be a problem in lithography and bonding.

Fabrication processes themselves are major sources of particles. Table 12.3 lists some materials and mechanisms which contribute to particle contamination.

An electrical double layer forms on the particle surface in an ionic solution. The zeta potential is the potential at the fixed charge layer/diffuse charge layer interface (typically a few nanometers from the surface). It can be negative or positive, leading to either attraction or repulsion of particles toward the surface, depending on the wafer surface charge. The zeta potential of the particle is independent of particle size but depends on the electrolyte pH: in acidic conditions the zeta potential is typically positive, and in alkaline solution it is negative, as shown in Figure 12.3. The wafer surface will be oxidized in RCA-1, and it is negatively charged (Figure 12.1). When etching undercuts the particle, like charges repel each other and redeposition on the wafer is unlikely.

Particle removal efficiency in RCA depends on etch undercutting. As shown in Figure 12.4, it is really the etched depth that matters: all the different ammonia/peroxide compositions result in the same particle removal efficiency when the same amount of silicon is etched.

Cleaning solution composition and time have to be optimized with respect to both cleaning efficiency and



**Figure 12.3** Zeta potential: pH influences particle adhesion and removal (PSL = polystyrene latex). Reproduced from Hattori (1998)



**Figure 12.4** Etching as a method for particle removal: about 4 nm undercut etch is enough to remove most particles. Ammonia dilution as a parameter. Reproduced from Hattori (1998)

roughness increase. Decomposition of cleaning solutions and impurities can also catalyse surface reactions leading to increased roughness.

### 12.6.1 Wafer particle measurements

Particle measurements on wafers down to 100 nm size range can be performed by laser scattering equipment (Figure 2.6). A laser illuminates the wafer surface, and forward scattered (Mie scattering) and reflected light are measured by a (hemispherical) detector.

Scatterometric particle sizes are calibrated against contamination standards which have polystyrene latex spheres (PSLs) of certified sizes on them. These PSLs are nearly spherical, have tight size distribution and have a known refractive index of about 1.6. The number of particles is better calibrated against etched features with known light scattering properties and known positions on the wafer. Such standards can be cleaned and reused, whereas contamination standards cannot.

Because real particles are not spheres with known optical constants, particle sizes cannot strictly be measured by light scattering. Latex sphere equivalent (LSE) size should be reported. Mirror polished unpatterned wafers are good for basic studies, but real wafers present a number of problems. Because forward scattered light is reflected by the wafer before reaching the detector, thin films on the wafer must be taken into account. On the oxide, particle calibration needs to be done for each film thickness. On metallized wafers, surface roughness leads to a decreased signal-to-noise ratio, therefore small particles cannot be detected. Correlating a scattering

event to a physical particle is usually difficult, even though scatterometry produces a map of the wafer. If particles can be seen in a SEM, chemical identification is possible by either EDX or EMPA analysis. This can be important for particle source identification.

Most of the discussion above has centered on front-side cleaning and surface preparation, but wafer back-side cleanliness is becoming increasingly important, too. Big particles on the back side prevent wafer leveling for lithography, and make thermal contact to chucks poor.

## 12.7 Organics Removal

There are many sources of organic contamination in the cleanroom. Table 12.4 lists some of the most usual ones.

RCA-1 ammonia-peroxide solution is very good at removing organics because of the oxidizing power of peroxide. If a lot of organics need to be removed, sulphuric acid-peroxide (SPM) is used (as in photoresist stripping !). SPM cleaning leaves difficult-to-remove sulfur residues, and the RCA-1 step is often carried out immediately after SPM to turn sulfides into soluble sulfates. Organic residues often lead to surface roughening. Organic film protects part of the surface for some time during cleaning, while etching proceeds elsewhere.

Because sulfuric acid constitutes an environmental concern, and a safety hazard, other candidates have been sought for organics removal. Ozonated DI water with 10–100 ppm ozone has proven to be very effective. Furthermore, it is a room temperature process, as opposed to 120 °C SPM. The ultimate cleaning method for organic contamination is thermal oxidation: no organic compound can tolerate 1000 °C in an oxygen atmosphere. This provides a reference surface for analytical methods but of course is not a practical cleaning process.

### 12.7.1 Measurement of organic contamination

Organic contamination can be conveniently measured by FTIR (Fourier Transform Infrared Spectroscopy), which

**Table 12.4** Sources of organic contamination

- Liquid chemicals and vapors used in fabrication processes: HMDS, IPA, acetone
- Gases, for example according to the reaction  $n \text{ CF}_4 \Rightarrow (\text{CF}_2)_n + 2n \text{ F}^*$
- Organic films (resist, spin-on polymers)
- Wafer holders and boxes
- Vacuum systems: pump oils, O-rings
- Cleanroom materials: sealants

identifies not only elements but also chemical bonds. XPS can also identify chemical bonds, which is often important in understanding the origin of the contamination.

Molecular surface contamination can be measured by thermal desorption spectroscopy (TDS). TDS consists of a furnace connected to a mass spectrometer, and desorption of contaminants is monitored as a function of furnace temperature. Silicon surface condition has also been clarified by TDS: at 340 °C, water desorbs; at 400 °C, the hydrogen-terminated silicon surface undergoes the reaction  $\text{SiH}_2 \Rightarrow \text{SiH} + 1/2 \text{ H}_2$ ; and at 500 °C,  $\text{SiH} \Rightarrow \text{Si} + 1/2 \text{ H}_2$ .

## 12.8 Metal Removal

There are numerous sources of metals, even though other materials like silicon, Teflon, SiC and quartz are extensively used in making process equipment and wafer handling tools. Table 12.5 lists some common sources of unwanted metals.

### 12.8.1 Device effects of metal contamination

Metal contaminants degrade the performance of electronic devices in various ways depending on their chemical and physical nature, that is, reactivity with silicon and silicon dioxide, and diffusion. Harmfulness of metal atoms comes from the fact that they have their energy levels deep in the forbidden energy gap of silicon.

Non-electronic devices are less sensitive to metal contamination, but metals cannot be completely ignored: metal contamination causes stacking faults in oxidation, and metals can catalyze peroxide decomposition, which leads to reduced particle cleaning efficiency in RCA-1.

### 12.8.2 Metal removal

The acidic solutions  $\text{HCl}-\text{H}_2\text{O}_2$  and  $\text{H}_2\text{SO}_4-\text{H}_2\text{O}_2$  are the main methods for metal removal. Dilute HF, which removes a thin oxide layer, will additionally remove some metallic contaminants. Ammonia solutions

**Table 12.5** Sources of metal contamination

- Tool materials (shutter blades, collimators, chucks)
- System components (pipes, valves)
- Wafer handling (tweezers, robot arms, wafer holders)
- Chemicals (some resist developers contain NaOH as an essential constituent)
- Human contribution (sodium from sweat, heavy metals from cosmetics)

(RCA-1) can also form complexes with metals and remove some metals.

The cleaning efficiencies of HCl–H<sub>2</sub>O<sub>2</sub> and HF are very different, though. Both can reduce Fe and Ni levels below the detection limit but HF is much more effective in removing Al, and HCl–H<sub>2</sub>O<sub>2</sub> in removing Cu. Dilute HF needs to be specified because various workers use different concentrations. For aluminum removal 0.1% HF is sufficient, but below that amount the removal efficiency rapidly deteriorates. HCl concentration in HCl–H<sub>2</sub>O<sub>2</sub> has to be at least 5% for it to remove iron.

The wet chemicals themselves contain metallic impurities, and at the 10 ppb level their deposition on the wafer surface is of some concern. For example, iron at 10 ppb in RCA-1 solution results in a surface concentration of  $2 \times 10^{12}$  atoms/cm<sup>2</sup>. The use of higher purity chemicals helps to reduce the effect but it is an expensive way. If a RCA-1 bath is used several times, contamination from previous batches remains in the solution. RCA-1 must be accompanied by a clean that removes metals efficiently.

Newer cleaning solutions include HF:H<sub>2</sub>O<sub>2</sub> which has both oxidizing and metal removal capabilities. It can be used at room temperature as opposed to 70 °C typical of RCA cleans. HF:H<sub>2</sub>O<sub>2</sub> seems to increase surface roughness, so cleaning time needs to be optimized.

### 12.8.3 Measurement of metallic contamination

Metal surface concentrations range from  $10^{10}$  to  $10^{14}$  atoms/cm<sup>2</sup> depending on general contamination control strategies and particular process steps. Total reflection X-ray fluorescence (TXRF) uses a grazing incident angle to probe only the very top layer of the wafer, at nanometer thickness. It is most sensitive for medium-mass atoms and less sensitive toward both ends of the mass range. The detection limit of TXRF is about  $10^{10}$  atoms/cm<sup>2</sup>. TXRF is a non-destructive method that can be done on whole wafers, and it can be used to generate maps of contamination. This is sometimes very useful for determining the source of contamination.

In VPD (Vapor Phase Decomposition) and WSA (Wafer Surface Analysis) methods, surface impurities are first collected in the growing thermal oxide, which is then dissolved in HF. This concentrate is analyzed by graphite furnace atomic absorption spectroscopy (GFAAS), which is sensitive to picogram quantities, or detection limits of  $10^9$ – $10^{10}$ /cm<sup>2</sup>, or by an inductively coupled plasma mass spectrometer (ICP-MS) which has lower detection limits by an order of magnitude.

Metallic contaminants can be measured by their effects on charge carriers. Minority lifetime will be degraded by contamination. Surface photovoltage (SPV) and microwave photoconductivity decay ( $\mu$ PC) methods provide

this information. Quantitative iron determination is possible in p-type silicon by the SPV method because of a special feature of Fe–B pairs: at 200 °C they dissociate, and the interstitial Fe<sup>+</sup> is a much more effective recombination center.

## 12.9 Contact Angle

Removal of native oxide is easily seen by the behavior of water on the wafer surface: oxide is hydrophilic and water spreads evenly, but silicon is hydrophobic and water droplets form. Quantitative measurement of contact angle is used to characterize surface hydrophilicity/hydrophobicity. Hydrophilic surfaces have contact angles (CAs) < 90°, hydrophobic surfaces > 90° (Figure 12.5). In practice, distinct droplets form on surfaces with contact angles of 50°–90° while uniform wetting takes place when CA < 10°.

It is important that cleaning solutions are matched to surface properties: aqueous solutions will spread and dissolve contamination on hydrophilic surfaces, but their effect is erratic on hydrophobic surfaces. For example, organic residues from photoresist may render the surface hydrophobic, and two-step cleaning is required, the first step to remove organic contamination and the second step to remove particles, for instance. Ammonia/peroxide mixture is the standard solution for making silicon surfaces hydrophilic.

HF-last clean results in CA ~ 70°, and water droplets form on silicon. Water sometimes remains on the wafer after rinsing, resulting in watermarks during drying. These can be minimized by tailoring the contact angle to either high or low values.

Hydrophobic surfaces with CA > 90° are typical of polymeric surfaces: methyl-terminated surfaces (like PDMS) have CA ~ 110° and fluoropolymers (like Teflon) CA ~ 120°.

High contact angles indicate low surface energies and vice versa. Self-assembled monolayers (SAMs) (Figure 5.14) follow the general rule that the longer the carbon



**Figure 12.5** Examples of contact angles of water droplets: (a) on a hydrophilic surface, 20°; (b) on a hydrophobic surface, 95°; (c) on an ultrahydrophobic surface, 150°

chain in the polymer, the more hydrophobic the surface. The maximum contact angle possible on planar surfaces is about  $120^\circ$ . If higher contact angles are desired (so-called super or ultrahydrophobic surfaces), micro- and nanostructures coated by fluoropolymers can be used. The water droplets then rest on top of nanostructures like a fakir on a bed of nails. Contact angles approaching  $180^\circ$  can be achieved.

When contact angles are discussed, water contact angles are most often quoted (Table 12.6). It is important to remember that other liquids have other contact angles, and even salt concentration in water can drastically change contact angle: biological buffer solution on silicon has a contact angle of  $35^\circ$ , while DI water has  $70^\circ$ . Similarly, a surface coated with fluoropolymer DDMS is highly hydrophobic, water contact angle  $103^\circ$ , but the hexadecane contact angle is  $38^\circ$ . Contact angle can change during device operation due to adsorption of material on the surface: for example, protein adsorption can turn a hydrophobic surface into very hydrophilic one.

The values of contact angles depend on surface treatment details and the values in Table 12.6 are to be used as rough guides only. First of all, contact angle measurements are accurate only  $\pm 2\%$ . Surface roughness can also change contact angle.

What is more, contact angles change as time goes by: native oxide formation on silicon will mean that a freshly

HF-dipped wafer has a  $70^\circ$  contact angle, but once 1 nm of native oxide has been formed, the contact angle is reduced. Oxygen plasma modified polymer surfaces become hydrophilic but revert back to their original, more hydrophobic, structure. The timescale of this is very much dependent on materials and processes: sometimes it takes less than an hour or it can take months.

## 12.10 Surface Preparation

Conditioning the surface to a known state is of paramount importance. HMDS priming before lithography is a prototypical example: it leaves the wafer slightly hydrophobic due to the methyl-terminated surface (Figure 9.2). Resists are polymers, and a slightly hydrophobic surface makes good contact with resists. HMDS prevents atmospheric moisture from condensing on the wafer, improving adhesion. This equalizes wafers: irrespective of underlying film, and irrespective of storage time, the resist will see the same surface.

Sometimes surface roughening is the goal, for instance if it is necessary to prevent sticking (to be discussed in more detail in Chapter 29). Many etching processes lead to roughening, like RCA-1, and even minor roughening can prevent bonding. Elimination of particles is essential in wafer bonding (Chapter 17). Surface smoothness is also critical. RCA-1 clean is a compromise because there is some roughening, but good particle removal and hydrophilic surface finish.

Plasmas are used as a cleaning method, but it is more generally to be considered a surface modification method. Plasmas can etch surfaces, and this cleans the surface. Plasmas are energetic, and as a result the surface has broken bonds which tend to be reactive. This is advantageous in many applications: bonding benefits from readily available bonds, and the same applies to adhesion.

Oxygen plasma can be used to grow oxides on silicon and aluminum. These oxides are only 1 or 2 nanometers thick, but they can protect the underlying material. For instance, the  $\text{AlCl}_3$  etch product from  $\text{Cl}_2$  plasma etching of aluminum reacts with atmospheric water vapor to produce  $\text{HCl}$ , which corrodes aluminum. Therefore oxygen plasma treatment after aluminum plasma etching will stabilize the surface.

Sometimes the treatment only modifies the outermost atomic layer, replacing some atoms with other, more suitable, atoms or molecules. Polymer PDMS surface is methyl terminated (Figure 12.6) but oxygen plasma treatment will result in some methyl groups being replaced by hydroxyl groups. Hydroxyl groups are

**Table 12.6** Water contact angles for various surfaces and treatments

|                                  |        |
|----------------------------------|--------|
| Ammonia/peroxide cleaned silicon | 5°     |
| Oxygen plasma treated SU-8       | 5°–40° |
| Sulfuric acid cleaned silicon    | 10°    |
| RCA-1 + RCA-2 cleaned silicon    | 10°    |
| KOH etched silicon               | 25°    |
| Thermal oxide                    | 45°    |
| Native oxide                     | 45°    |
| Oxygen plasma treated PDMS       | 50°    |
| HMDS coated silicon              | 60°    |
| HF dipped silicon                | 70°    |
| Polyimide                        | 75°    |
| Native SU-8                      | 80°    |
| Native polystyrene               | 90°    |
| Native PDMS                      | 108°   |
| ECT (eicosanethiol)              | 110°   |
| Fluoropolymer                    | 120°   |
| Microstructure + PDMS            | 150°   |
| Nanostructure + fluoropolymer    | 170°   |

Note that all the values are approximate and depend on surface treatment details and duration, and on time delay.



**Figure 12.6** Native PDMS surface with methyl groups (top); oxidized hydrophilic PDMS surface with hydroxyl and methyl groups (bottom)

much more reactive (which is important in bonding) and are hydrophilic, while the methyl-terminated surface is hydrophobic.

Not only plasmas but also gases can be used for surface conditioning: hot HCl gas etches silicon, revealing undamaged silicon beneath the original surface (Equation 6.3). Thermal oxidation (Chapter 13) can also be used as a cleaning method: oxidation and oxide etching in HF will reveal a new silicon surface that is very clean.

CVD processes can also be used to deposit thin protective layers. A copper surface can be stabilized by short exposure to silane and subsequently to ammonia, forming a  $\text{CuSi}_x\text{N}_y$  layer. This layer will prevent electromigration at the copper surface (see Chapter 36 for more on reliability).

Deposition of self-assembled monolayers (SAMs) is used extensively to modify surfaces (Figure 5.14): by suitably selecting the terminating group of the SAM, a wide range of contact angles can be obtained. SAMs can be used as adhesion layers too: gold as a noble metal has poor adhesion, but gold and sulfur form strong bonds, therefore SAM with sulfur termination will improve gold adherence (Figure 17.10).

There are also equipment solutions to surface cleanliness: if processes are performed immediately after each other, or if the wafers are kept under a vacuum or nitrogen, there is no time and no chance for contamination and particles to deposit on the wafers. This will be discussed in Chapter 31.

Wettability and cleaning are surface issues: in SAMs the effects are due to a single molecular layer only. In many other cases it is also only the very surface that matters. In Figure 12.7 silicon nanoglass is shown. On the left hand side it is coated by  $\sim 1$  nm thick silicon dioxide and the contact angle is close to zero; on the right hand



**Figure 12.7** Nanometer thick layers of oxide (left) and fluoropolymer (right) drastically change contact angle. Reproduced from Jokinen *et al.* (2008) by permission of Wiley-VCH

side a few nanometers of fluoropolymer is deposited, and the contact angle is  $\sim 170^\circ$ .

## 12.11 Exercises

1. Translate surface iron contamination of  $10^{10}$  atoms/cm<sup>2</sup> into a number of monolayers!
2. If there is one monolayer coverage of organic contamination on a wafer, how much of that is counted as carbon atoms/cm<sup>2</sup>? Select one common organic chemical used in microfabrication as an example.
3. How can the structure of Figure 12.7 be made?
4. Calculate the chemical and DI water consumption over 24 hours for the following cleaning cycle  
SPM  
DIW rinse  
RCA-1  
DIW rinse  
DHF  
DIW rinse  
RCA-2  
DIW rinse 1  
DIW rinse 2  
when a tank for 25 wafers of 200 mm diameter is used. Assume a 4 h changing interval for RCA cleans and 24 h bath life for SPM and DHF.
5. What happens to particle contamination in (a) wet etching and (b) plasma etching?
6. If an Olympic swimming pool is full of UPW, how many droplets of sweat can be dissolved before  $\text{Na}^+$  and  $\text{Cl}^-$  exceed the specification level of 0.01 ppb?

- Make rough estimates of sweat salt concentration based on your own experiences.
7. Which cleaning chemicals are allowed and which are forbidden for the following materials, and why:
- aluminum
  - gold
  - copper
  - aluminum oxide
  - polyimide
  - silicon nitride?

## References and Related Reading

- Grannemann, E (1994) Film interface control in integrated processing systems, *J. Vac. Sci. Technol.*, **12**, 2741.
- Hattori, T. (ed.) (1998) **Ultraclean Surface Processing of Silicon Wafers**, Springer.
- Jokinen, V., L. Sainiemi and S. Franssila (2008) Complex droplets on chemically modified silicon nanograss, *Adv. Mater.*, **20**, 3453–3456.
- Kern, W. (1990) The evolution of silicon wafer cleaning technology, *J. Electrochem. Soc.*, **137**, 1887.
- Kinoshita, K. and T. Hara (2008) A cleaning process for fine particles of silicon nitride, *J. Electrochem. Soc.*, **155**, H642–H647.
- Kitajima, H. and Y. Shiramizu (1997) Requirements for contamination control in the gigabit era, *IEEE Trans. Semicond. Manuf.*, **10**, 267.
- Li, X.-M., D. Reinhoudt and M. Crego-Calama (2007) What do we need for a superhydrophobic surface? A review on the recent progress in the preparation of superhydrophobic surfaces, *Chem. Soc. Rev.*, **36**, 1350–1368.
- Middleman, S. and A. K. Hochberg (1993) **Process Engineering Analysis in Semiconductor Device Fabrication**, McGraw-Hill.
- Ohmi, T. *et al.* (1992) Dependence of thin-oxide film quality on surface microroughness, *IEEE Trans. Electron Devices*, **39**, 537.
- Okorn-Schmidt, H. (1999) Characterization of silicon surface preparation processes for advanced gate dielectrics, *IBM J. Res. Dev.*, **43**, 351.
- Reinhardt, K. A. and W. Kern (2007) **Handbook of Silicon Wafer Cleaning Technology**, 2nd edn, William Andrew.
- Schroder, D. K. (1998) **Semiconductor Material and Device Characterization**, 2nd edn, John Wiley & Sons, Inc.
- Zhang, F. *et al.* (2000) The removal of deformed submicron particles from silicon wafers by spin rinse and megasonics, *J. Electron. Mater.*, **29**, 199.

# 13

## Thermal Oxidation

Silicon dioxide,  $\text{SiO}_2$ , is probably a more important material in silicon technology than silicon itself: whereas GaAs and Ge have higher electron mobilities than silicon, and potentially faster transistors, they do not have native oxides which protect their surfaces, and neither do stable, thick oxides exist. Silicon dioxide has functions as passivation layers, capacitor dielectrics and electric isolation layers in finished devices. Thin oxides (1–20 nm) are used for example as capacitor dielectrics (including the CMOS transistor) and as tunnel electrodes in flash memories and chemical sensors, while thick oxides (100–1000 nm) serve as diffusion and etch masks and as electrical isolation layers. Thermal oxidation is often done repeatedly during device processing: for example, the first oxide may be used as an etch mask, and it is removed after etching, but another thermal oxidation is done immediately. This new oxide is again of high quality.

### 13.1 Thermal Oxidation Process

Silicon is easily oxidized: a native oxide of nanometer thickness grows on a silicon surface in a couple of hours or days, depending on the surface conditions, and similar thin oxides form easily in oxygen plasma or in oxidizing wet treatment. These oxides are, however, limited in their thickness and they are not stoichiometric  $\text{SiO}_2$ . Two basic schemes are used in thermal oxidation: wet and dry oxidation, respectively



Thermal oxidation is a slow process: 1 h of dry oxidation at 900 °C produces oxide about 30 nm thick and 1 h of wet oxidation about 130 nm. Exact values are dependent on silicon crystal orientation: the oxidation rate of <111>

is somewhat higher than that of <100> silicon; highly doped silicon oxidizes faster than lightly doped material.

Thin gate oxides, flash-memory tunnel oxides and DRAM capacitor oxides are grown in dry oxygen at 850–950 °C. In addition to better thickness control due to the lower rate, dry oxides exhibit lower interface charges and trap states which are important for transistor operation. Thin oxides also have many auxiliary and sacrificial roles: a thin oxide under nitride relieves stresses caused by the nitride film. Thicker oxides are used for device isolation and as masking layers for ion implantation, diffusion and etching steps. They are usually 100–1000 nm thick and grown by wet oxidation. Typical process temperatures are 950–1100 °C, and process times are hours or tens of hours.

Even though silicon is easily oxidized, oxidation furnaces are by no means cheap or simple equipment. They involve high-temperature parts which must be non-contaminating and which must tolerate oxidizing conditions and hot cleaning gases. Exacting temperature control and fine mechanics are also required. In Figure 13.1 a batch of silicon wafers is being loaded into a horizontal furnace.

When silicon and oxygen react to form  $\text{SiO}_2$ , the resulting oxide is roughly twice the volume of silicon it replaces: for a  $\text{SiO}_2$  layer of thickness  $D$ , silicon thickness consumed is  $0.45D$  as can be calculated from molar volumes:

density of silicon is 2.3 g/cm<sup>3</sup>

mass 28 g/mol molar volume 12.17 cm<sup>3</sup>

density of  $\text{SiO}_2$  is 2.2 g/cm<sup>3</sup>

mass 60 g/mol molar volume 27.27 cm<sup>3</sup>

The original silicon surface is somewhat below the oxide midpoint. This volume increase leads to restrictions in



**Figure 13.1** Wafers being loaded into a horizontal oxidation furnace. Courtesy VTT

the oxidation of structured surfaces, because stresses can become excessively large in the corners of the structures. On the other hand, the fact that oxidation consumes silicon can be used as a cleaning method: thin oxide is grown and immediately removed by HF etching, to reveal a perfect silicon surface.

Oxide thickness is usually measured by optical methods: either by ellipsometry or by reflectometry. Thermal oxides can be grown with very tight specifications, for an oxide 10 nm thick the uniformity is 1%, or equal to one atomic diameter. For thermal oxides a refractive index value of  $n = 1.46$  is usually used, but for oxides thinner than 10 nm this is not really accurate. A quick and easy way to gauge oxide thickness, especially for intermediate thicknesses of 50–500 nm, is by its color. An oxide color chart can be found in Appendix C.

Various electrical measurements are also used: breakdown voltage is one of many. High-quality silicon dioxide can sustain 10 MV/cm, even 12 MV/cm. Oxide defects and electrical quality are closely connected; this topic will be discussed further in Chapter 36.

## 13.2 Deal–Grove Oxidation Model

A model for oxide growth has been developed by Deal and Grove. It is a phenomenological macroscopic model which does not assume anything about the atomistic mechanisms of oxidation. Oxygen diffusion through the



**Figure 13.2** Model of thermal oxidation: oxygen diffuses through  $\text{SiO}_2$  film and reacts at the  $\text{SiO}_2/\text{Si}$  interface. Concentration of oxygen inside oxide decreases linearly

growing oxide and chemical reaction at silicon/oxide interface are modeled by the classical Fick diffusion equation and chemical rate equation. Model geometry and boundary conditions are shown in Figure 13.2.

Oxidation is modeled as if the boundaries were stationary (which is a reasonable assumption because oxidation is slow). The diffusion equation for oxygen is

$$\frac{dC}{dt} = 0 = D \frac{d^2C}{dz^2} \quad (13.3)$$

where  $C$  is oxygen molar concentration ( $\text{mol}/\text{m}^3$ ), subject to boundary conditions

$$C = C_s \quad z = 0 \quad \text{at the } \text{SiO}_2 \text{ surface} \quad (13.4)$$

$$-D \frac{dc}{dz} = R \quad z = Z \quad \text{at the } \text{SiO}_2/\text{Si} \text{ interface} \quad (13.5)$$

where  $R$  is the reaction rate at the interface (in units of  $\text{mol}/\text{m}^2\text{-s}$ ).

The latter equation specifies that all oxygen reaching the interface will react there to form oxide: there will be no build-up of unreacted oxygen inside oxide or silicon.

For a reaction like  $\text{Si}(\text{s}) + \text{O}_2(\text{g}) \Rightarrow \text{SiO}_2(\text{s})$  the rate is assumed to be first order, that is  $R = kC$ , directly related to the concentration of reactive species,  $C$ , and characterized by a rate constant  $k$ . We can then rewrite the second boundary condition as

$$-D \frac{dc}{dz} = kC \quad \text{at } z = Z \quad (13.6)$$

A solution that satisfies these conditions is

$$C = C_s - \left( \frac{kC_s}{kZ + D} \right) \quad (13.7)$$

The rate (at the  $\text{SiO}_2/\text{Si}$  interface  $z = Z$ ) is then

$$R = kC(Z) = \frac{kDC_s}{kZ + D} \quad (13.8)$$

To calculate the thickness growth rate, we must convert molar concentration to volume through density:

$$RM_{\text{SiO}_2} = \rho_{\text{SiO}_2} \frac{dZ}{dt} \quad (13.9)$$

where the molar volume of  $\text{SiO}_2$  is  $v = M_{\text{SiO}_2}/r_{\text{SiO}_2}$  ( $60 \text{ g/mol} \div 2.2 \text{ g/cm}^3 = 27.3 \text{ cm}^3/\text{mol}$ ).

When we solve for  $Z(t)$  from the rate equation, we get

$$\frac{dZ}{dt} = \frac{kDC_sv}{kZ + D} \text{ subject to } Z = 0 \text{ at } t = 0 \quad (13.10)$$

This leads to the oxide thickness equation

$$t = \frac{Z}{KC_sv} + \frac{Z^2}{2DC_sv} \quad (13.11)$$

When thin oxides are considered, we can ignore the second term, and the rate is then simply

$$Z = kC_st \quad (13.12)$$

or growth is linear in time and linearly related to rate constant  $k$ .

For thick oxides, we can ignore the first term and get

$$Z = \sqrt{2DC_sv t} \quad (13.13)$$

or growth is parabolic, related to diffusion length  $\sqrt{Dt}$ . The thicknesses of wet and dry thermal oxides obtained by a 1D simulator are shown in Figure 13.3.

The Deal–Grove model thus predicts linear oxidation rate initially, followed by a parabolic behavior for thicker oxides. The linear regime covers the initial stages of oxidation, with some success. The model works much better for thick oxides, and theory and experiment agree that doubling oxide thickness requires quadrupling oxidation time in the parabolic regime (this can be used as a quick estimate for oxidation time once one process is known and fixed). Alternatively, if thickness and time are known for one combination, others can be easily calculated from Equation 13.13: a 200 min dry oxidation results in oxide 165 nm thick, then 50 min corresponds to about 80 nm. It should be noted that the demarcation between the linear and parabolic regimes is not clear cut, and this introduces uncertainty into the simple calculation.

Dry oxidation is slower than wet oxidation even though diffusion of oxygen molecules through silicon



**Figure 13.3** Oxidation of  $<100>$  silicon at temperatures between 850 and 1050 °C, wet and dry. Maximum practical oxide thickness is 1 or 2 micrometers, because of the decrease in parabolic oxidation rate

dioxide is faster than diffusion of water molecules. The explanation lies in the water solubility of silicon dioxide: it is four orders of magnitude larger than oxygen solubility, and therefore the concentration of the oxidant in oxide is much greater.

Generally, when thin oxides are required, a lower temperature is used. It is not, however, possible to reduce temperature without affecting oxide quality. Therefore one solution is to reduce the partial pressure of oxygen: for instance, in the experiment of Figure 13.4 the gas mixture is 10% oxygen, 90% nitrogen. If thick oxides are needed, higher temperature and higher partial pressure of oxygen are used. HIPOX, for high-pressure oxidation, can reduce oxidation time significantly.

Oxidation rate depends slightly on silicon crystal orientation. Silicon of  $<111>$  orientation oxidizes faster than  $<100>$  (and all other orientations fall between these two). This is speculated to arise from the number of silicon atoms and bonds at the oxidation front. For thick oxides, diffusion of oxygen through oxide determines the rate, and



**Figure 13.4** Initial oxidation of (100) and (111) silicon at 850 °C, in 10% O<sub>2</sub> + 90% N<sub>2</sub> atmosphere. Reproduced from Niskanen *et al.* (2009), by permission of Elsevier

clearly this cannot depend on silicon crystal orientation, but in the linear regime the rate constant  $k$  is different for <111> and <100>. This is seen in Figure 13.4: <111> oxidation is faster than <100>. Note also that a thin oxide is present even at zero oxidation time. It is difficult to eliminate oxygen altogether: rinsing water has dissolved oxygen in it, and wafers are loaded into the furnace under atmospheric conditions. These topics will be discussed again in conjunction with CMOS gate oxidation in Chapter 26. The initial stages of oxidation are surprisingly poorly known, in spite of decades of research and substantial manufacturing experience. Most models cannot explain the very first nanometers of oxidation.

### 13.3 Oxidation of Polysilicon

Polysilicon oxidation presents a number of complications compared to single crystal silicon oxidation. Polysilicon consists of grains of many orientations which have different oxidation rates. Polysilicon grains are most often of (110) orientation and the oxidation rate of undoped poly is somewhere between the (100) and (111) silicon oxidation rates. In polycrystalline materials there are two different diffusion paths, through the bulk and along grain boundaries, requiring more advanced models. Grains also grow during oxidation, further complicating analysis.

The polysilicon surface is rough (Figure 7.4) and oxide quality will be inferior to single crystal oxides. Roughness and different oxidation rates of different grains lead to non-uniform poly oxide thickness (Figure 13.5). While



**Figure 13.5** Polysilicon is rough and consists of grains of different orientations, which oxidize at slightly different rates, leading to rough oxide with non-uniform thickness



**Figure 13.6** Flash memory cell: interpoly oxide between floating and control gates is limited to about 10 nm because poly oxides are of inferior quality

single crystal oxides can tolerate 10–12 MV/cm electric fields, poly oxides break at 3–5 MV/cm.

Poly oxides are intensively used in EPROM/flash memories (Figure 13.6). Tunnel oxide is clearly thermally oxidized single crystal silicon, but the floating gate (FG) is poly and interpoly oxide between the floating gate and control gate (CG) is therefore thermally oxidized polysilicon. The fact that it is difficult to scale down poly oxides is partly limiting flash memory scaling.

### 13.4 Oxide Structure

Thermally grown silicon dioxide is glassy and exhibits only short-range order, in contrast to quartz, which is crystalline SiO<sub>2</sub>. The basic unit of silica structure is SiO<sub>4</sub>, as shown in Figure 13.7.

In a perfect arrangement, such as crystalline quartz, all oxygen atoms bond to two silicon atoms (oxygen has valence 2, silicon valence 4) but at the silicon/oxide interface some bonds are not made, leaving unbonded charged oxygen atoms (Figure 13.8), making oxide less stable than quartz. This is also reflected in their



**Figure 13.7** Basic structure of silica: a silicon atom tetrahedrally bonds to four oxygen atoms



**Figure 13.8** The structure of the silicon/silicon dioxide interface: single crystalline silicon and amorphous oxide. There are dangling bonds (not having their full valence) and some have hydrogen atoms bonded to them

properties: quartz density is  $2.65 \text{ g/cm}^3$ , silicon oxide density  $2.2 \text{ g/cm}^3$ ; Young's modulus is  $107 \text{ GPa}$  for quartz and  $87 \text{ GPa}$  for oxide.

Incompletely oxidized silicon atoms are positively charged, and this is known as fixed oxide charge,  $Q_f$ . It is located in the oxide in the first few nanometers above the silicon surface. There is also interface trapped charge  $Q_{it}$  which can be either positive or negative. The defect density at the silicon/oxide interface can be made very low by proper cleaning and well-controlled oxidation. Both  $Q_f$  and  $Q_{it}$  densities are  $10^9\text{--}10^{11}/\text{cm}^2$ , which should be compared to the silicon surface density of  $10^{15} \text{ atoms/cm}^2$ . Thus only one atom in  $10^4$  or one in  $10^6$  has incomplete bonds.

Thermal oxidation is often complemented by a post-oxidation anneal (POA) in nitrogen (in the very same furnace; oxygen gas is switched to nitrogen). This step densifies the film and anneals out some defects. It also causes diffusion of dopants and it has to be included in calculation the doping profiles. Hydrogen anneal is used to passivate dangling bonds: hydrogen attaches to the free valence of the silicon and eliminates further charge trapping. However, high electric fields can easily accelerate electrons to such energies that hydrogen atoms become mobile during device operation.

If post-oxidation anneal is done in ammonia ( $\text{NH}_3$ ) or nitrous oxide ( $\text{N}_2\text{O}$ ), surface nitridation will take place (while nitrogen anneal is an inert anneal). Nitridation results are very much dependent on anneal details, but the whole oxide is not turned into nitride: only the surface layer is affected. Nitridation, too, has many effects: the dielectric constant goes up (nitride  $\epsilon_r \approx 7$  vs.  $\epsilon_r \approx 4$  for oxide) which increases capacitance; it also improves oxide tolerance to damage caused by hot electrons injected into the oxide. Nitrided oxide reduces boron diffusion through it, which is extremely important for thin CMOS gate oxides. Ammonia anneal also introduces hydrogen. Oxidation of nitrided oxide is a way to reduce hydrogen concentration, and the resulting film is known as ONO, for oxidized nitrided oxide.

Dopant atoms have a major impact on oxidation: high doping level will increase oxidation rate even by 200% for thin oxides at low temperatures. High phosphorus concentration in silicon is associated with high concentration of vacancies, and because the volume of silicon dioxide is larger than the volume of silicon it replaces, oxidation consumes vacancies. Highly doped phosphorus therefore provides plenty of vacancies for oxidation to proceed. Boron behaves differently: it is incorporated into the growing oxide, weakening its bond structure and thus enabling faster diffusion through it.

In doped polysilicon, dopants precipitate at grain boundaries. Boron doping leads to minor oxidation rate enhancement, and phosphorus doping to clearly increased oxidation rate, via increased vacancy concentration, just as in the case of single crystal material.

## 13.5 Local Oxidation of Silicon

When local oxidation of silicon is needed, a silicon nitride mask is used. Nitride will prevent oxygen diffusion, and areas under the nitride will not be oxidized. This is known as LOCOS, for local oxidation of silicon. LOCOS is pictured in Figure 13.9.



**Figure 13.9** LOCOS: top, before oxidation, thin-pad oxide and patterned nitride; bottom, after oxidation, no oxidation under nitride but “bird’s beak” at nitride edge. Note that about half of the oxide will be below the original silicon surface



**Figure 13.10** LOCOS for a sawtooth structure in (100) silicon. Nitride first acts as a silicon etching mask and then as an oxidation mask. Second KOH etching and HF oxide removal result in sawtooth relief. Reproduced from Ribbing *et al.* (2003), copyright IOP

### LOCOS process flow

- Thermal oxidation (pad oxide)
- LPCVD nitride deposition
- Lithography
- Nitride etching
- Photoresist strip
- Cleaning
- Wet oxidation

LOCOS variables are pad oxide thickness (10–50 nm), LPCVD nitride thickness (100–200 nm) and oxidation temperature. Pad oxide serves as a stress-relief layer, and it diminishes the stress-induced dislocations that thick nitride exerts in silicon. Nitride acts as a diffusion barrier for oxygen diffusion and as a mechanical stiffener: the thicker the nitride, the smaller the oxide growth under the mask. This lateral extension is known as bird’s beak, because of its visual appearance. Thinner pad oxide would help to minimize bird’s beak but at the expense of silicon damage from nitride stress.

The LOCOS oxide surface will be above the original silicon surface. If we desire to have the oxide surface level with the original surface, we can start the LOCOS process by etching into silicon, with the silicon etched depth approximately half the desired oxide thickness, which then will result in approximately equal surface height for oxide and silicon.

LOCOS isolation has been used for 30 years because of its simplicity. It has been scaled to much smaller

linewidths than anybody thought possible. Numerous modifications have been tried, but most have failed because the added process complexity has not offered enough improvement in isolation. Today, transistor isolation is based on shallow trench isolation (STI), which uses RIE of silicon and CVD filling of the trenches. This will be discussed in Chapter 26.

The fact that nitride acts as an oxidation mask can be combined with other process steps, for example nitride also acts as an etch mask for KOH etching of silicon. In Figure 13.10 a LOCOS-type KOH etching process is shown for wafers of (100) orientation. KOH etching is done using a nitride mask, resulting in inverse pyramid shapes and trenches. The wafer is cleaned and the newly formed silicon (111) sidewalls are oxidized. Nitride is then etched away, and a second silicon KOH etching step is performed. This leads to “period doubling” of trenches.

### 13.6 Stress and Pattern Effects in Oxidation

Oxide volume is greater than the volume of the silicon it replaces. Oxides are therefore under compressive stresses, and this causes a number of pattern-dependent phenomena which can be either beneficial or disadvantageous. Typical stress values are on the order of 300 MPa. Somewhere between 975 and 1000 °C oxide exhibits viscous flow. Oxidation above that temperature will result in reduced stress and wafer bow. Below that temperature, oxide needs to be treated as an elastic material with appropriate elastic



**Figure 13.11** SOI structure (left) after plasma etching. After low-temperature thermal oxidation (right) unoxidized silicon remains because stresses build up. Redrawn from Heidemayer *et al.* (2000)



**Figure 13.12** Cross-section of an oxidized silicon step with oxide thinning at both the convex (top) and concave (bottom) corner. Reproduced from Minh and Ono (1999) by permission of AIP

constants. Scaling of LOCOS to smaller linewidths meets an inevitable limit at submicron dimensions: stresses in the growing oxide prevent full oxidation of narrow gaps.

Thermal oxidation of small silicon wires (Figure 13.11) shows a self-limiting effect due to high stresses and this has been utilized in making nanostructures. This is illustrated in the SOI nanowire process.

### Process flow for silicon nanowires

- SOI wafer with device silicon 21 nm thick
- Lithography
- Silicon etching
- Photoresist stripping and wafer cleaning
- Thermal oxidation

Thermal oxidation proceeds for a while, but then a self-limiting effect sets in: a critical stress, which stops oxidation, is about 2.6 GPa at 850 °C. After the self-limiting oxide thickness has been grown, no further oxidation takes place. If oxidation is carried out at higher temperature, say 1000 °C, this stress can be overcome, and the whole structure will be oxidized.

Stresses are also responsible for non-uniform oxidation in convex and concave corners as shown in Figure 13.12. Uneven oxide thickness causes problems for reliability because electric field strength is different in the corners and planar areas. Etched trenches have concave corners, and therefore thermal oxidation leads to non-uniform thickness which may be detrimental to for example DRAM trench capacitors, because the capacitor dielectric has a weak spot at the corner. Etch processes can be tailored to some extent for smoother bottom profiles, but this is a limited option because the top corner needs rounding, too. Oxide and nitride can be deposited by conformal CVD, but in very deep trenches the conformality may not be adequate. Sacrificial thermal oxidation can be used to

smooth corners. Second thermal oxidation then provides the actual thin dielectric film, which serves for example as DRAM capacitor dielectric. Anisotropic wet etching of silicon produces V-grooves, and oxide at the apex of the V-groove is highly stressed. This oxide has a much higher wet etch rate in HF than ordinary oxide. This has been employed to make nanoscopic holes, as shown in Figure 13.13.

#### 13.6.1 Oxidation sharpening

Sharp tips are used as AFM probes and as field emitters in vacuum microelectronic devices, for high resolution in the former application and for low operating voltage in



**Figure 13.13** Oxide thinning at the apex is used as a method to fabricate nanoscopic holes: the apex can be etched open while leaving oxide elsewhere because oxide is thin and stressed at the apex. Reproduced from Minh and Ono (1999) by permission of AIP



**Figure 13.14** Silicon tip fabrication: left, isotropic silicon etching with an oxide mask; middle, thermal oxidation; right, silicon tip recovery by HF etching

the latter. Such tips can be fabricated by isotropic etching, but the final part of tip release is difficult: the mask will fall off. Thermal oxidation can help as shown in Figure 13.14: after initial isotropic (or KOH anisotropic) etching, the final sharpening takes place during oxidation. Mask removal is done by isotropic etching, but this is a non-critical, non-patterning etch. Thermal oxidation process control is also much tighter than shape control in an etch process.

## 13.7 Simulation of Oxidation

Oxidation simulation, together with diffusion simulation, is the backbone of all process integration simulators. Thermal oxidation is well understood and can be accurately modeled. However, the atomistic mechanisms of thin oxides (and early stages of oxidation in general) are still under intensive study.

Oxidation simulation requires as input:

- wafer orientation  $<100>/<111>$
- doping level
- temperature
- time
- oxidizing ambient wet/dry.

Additional model parameters such as oxygen partial pressure (1 atm as default), high concentration effects and viscous/elastic models can be used instead of default models.

The Deal–Grove model (Equation 13.1) is the default model for wet oxidation and for thick oxides in general. It is not, however, applicable to thin dry oxides. A power-law model from Nicollian and Brews (2002) can be used for this regime. Oxidation is modeled as

$$x_{\text{ox}} = a \left( \frac{t}{t_0} \right)^b \quad (13.14)$$

Simulators produce results that are accurate within experimental error for 1D oxidation. Additionally, simulators can account for segregation, the distribution of dopants at the oxide/silicon interface.

### 13.7.1 Segregation

Dopants which are initially in the silicon are redistributed between silicon and the growing oxide during oxide growth. As shown in Figure 13.15, boron is depleted and phosphorus accumulated in the silicon side of the interface. Segregation has a major effect on device properties: if the dopant is mostly incorporated in the oxide and depleted in the silicon near the interface, inversion of silicon doping type may occur. Segregation proceeds as long as the chemical potentials of the dopants differ in the oxide and silicon. The equilibrium segregation coefficient,  $m$ , is defined as the ratio of dopant in silicon to that in oxide.

Metal atoms experience segregation just like the dopants: for example, Al and Ca are segregated preferentially into the oxide (and cause oxide quality problems) whereas Ni and Cu diffuse into the bulk (and cause defects which act as lifetime killers).

## 13.8 Thermal Oxides vs. other Oxides

Thermal oxidation of silicon produces high-quality oxide (in terms of uniformity, density, electric breakdown field, etc.) but thermal oxidation is a high-temperature process. CVD oxides are sometimes indispensable because they allow lower process temperatures, but CVD oxides have, however, lower breakthrough voltage, higher wet etch rate in HF and rougher surfaces than thermal oxides. Sometimes CVD oxides have to be used because thermal oxides cannot be used. One such situation concerns very thick oxides: thermal oxidation of micrometer layers takes hours or days while CVD oxides of identical thicknesses can be done in a matter of minutes or an hour, depending on the particular equipment.

Very few materials can tolerate oxidizing ambients at about  $1000^{\circ}\text{C}$ . Metals cannot usually withstand such conditions. Tantalum is an exception: thermal oxidation of tantalum can produce device-quality  $\text{Ta}_2\text{O}_5$  thin films in a standard oxidation furnace (for reasons of contamination, a separate oxidation tube needs to be reserved for such an



**Figure 13.15** Segregation of dopant at silicon/oxide interface during wet oxidation (1000 °C, 60 min): (a) boron-doped wafer shows dopant loss at interface; (b) phosphorus-doped wafer shows accumulation of dopant at the interface. Substrate resistivity is 10 ohm-cm in both cases

exotic process). Silicides will thermally oxidize to form SiO<sub>2</sub>, with the exception of TiSi<sub>2</sub>, which will turn into TiO<sub>2</sub>. Tungsten polycide gates (WSi<sub>2</sub>/poly) can be oxidized similarly to polysilicon gates. Making the silicide silicon rich, WSi<sub>2.2</sub>, will ensure proper oxidation. Silicon carbide, SiC, can be oxidized to produce SiO<sub>2</sub> with standard silicon oxidation processes but the rate is very low compared to silicon oxidation.

In addition to thermal oxidation, there are other oxidation processes in microfabrication, namely chemical oxidation in acidic solutions, as discussed in the previous chapter; for example, nitric acid will oxidize silicon and form a SiO<sub>2</sub> layer 1 nm thick. This has been used as a depth profiling method: after each oxidation cycle oxide is removed and the surface analyzed, and with continued oxidation and HF etching cycles depth information is obtained, for example on dopant distribution. Anodic oxidation of silicon produces silicon dioxide, and similarly aluminum is anodically oxidized in acidic solutions (to alumina, Al<sub>2</sub>O<sub>3</sub>) which can be used for example as a tunnel dielectric in chemical sensors. Plasma is often used to grow thin oxides, for example aluminum oxide formed by

oxygen plasma treatment is 2 nm thick and useful as a tunnel oxide in Josephson junction superconducting devices. Similar plasma oxide is used as a passivation layer after chlorine plasma etching of aluminum.

### 13.9 Exercises

1. A wet oxidation of 250 minutes results in oxide 1 μm thick. How long will it take to grow 10 μm oxide under the same conditions? How long will it take to grow oxide 0.1 μm thick?
2. Holes are etched in thermal oxide 1 μm thick. The wafer is then given 1 hour of wet oxidation at 1000 °C. All oxide is then etched away. What is the resulting step height in silicon?
3. Recessed LOCOS has the final oxide surface at the same level as the original silicon surface. Its fabrication starts by etching through nitride and oxide into silicon as shown below. Calculate the required etched depth and choose proper oxidation conditions which will give the final oxide thickness. Draw a cross-sectional figure of the resulting structure.



4. Explain where thermal oxide will grow in the structure shown below.



5. Draw the following structure after thermal oxidation. Assume 500 nm height for the oxide islands and new oxide 500 nm thick.



6. What difference do the following materials have in 1000 °C dry oxidation:  
 (a) lightly doped single crystalline silicon  
 (b) highly phosphorus-doped single crystal silicon  
 (c) phosphorus-doped polysilicon  
 (d) undoped amorphous silicon  
 (e) lightly doped epitaxial silicon?

## Simulation exercises:

7. The Deal–Grove oxidation model is not valid for thin oxides. Experimental data for dry oxidation is shown below. Check how your simulator works for thin oxides!

| Time (min) | 850 °C | 1000 °C |
|------------|--------|---------|
| 20         | 6 nm   | 26 nm   |
| 40         | 8 nm   | 42 nm   |
| 60         | 11 nm  | 56 nm   |
| 80         | 13 nm  | 68 nm   |

Data from Massoud *et al.* (1985a).

8. Phosphorus-doped polysilicon (20–80 ohm/sq) oxidation produces 50 nm oxide in 30 min of dry oxidation at 1000 °C. At 900 °C dry oxidation results in 10 nm oxide. How do these values compare to single crystal silicon oxidation?

9. High-pressure oxidation (HIPOX) increases oxidation rates. Data for dry oxidation at 900 °C is as follows.

| Pressure (atm) | Time (min) | Thickness (nm) |
|----------------|------------|----------------|
| 10             | 30         | 40             |
| 10             | 60         | 65             |
| 10             | 120        | 100            |
| 20             | 30         | 55             |
| 20             | 60         | 100            |
| 20             | 120        | 180            |

Data from Lie *et al.* (1982).

How does your simulator handle HIPOX oxides?

10. What is the segregation behavior of n-type dopants As, P, Sb?

## References and Related Reading

- Aït-Kaci, A. *et al.* (2002) Characterization of sub-micrometre silicon films (Si-LPCVD) heavily in situ boron-doped and submitted to treatments of dry oxidation, *Semicond. Sci. Technol.*, **17**, 983–992.
- Chabal, Y. J. (ed) (2001) **Fundamental Aspects of Silicon Oxidation**, Springer.
- Green, M. L. *et al.* (1999) Understanding the limits of ultra-thin SiO<sub>2</sub> and Si-O-N gate dielectrics for sub-50 nm CMOS, *Microelectron. Eng.*, **48**, 25.
- Heidemayer, H. *et al.* (2000) Self-limiting and pattern dependent oxidation of silicon dots fabricated on silicon-on-insulator material, *J. Appl. Phys.*, **87**, 4580.
- Kameda, N. *et al.* (2007) High quality gate dielectric film on poly-silicon grown at room temperature using UV light excited ozone, *J. Electrochem. Soc.*, **154**, H769–H772.
- Krzeminski, C. *et al.* (2007) Silicon dry oxidation kinetics at low temperature in the nanometric range: modeling and experiment, *J. Appl. Phys.*, **101**, 064908.
- Lie, L. N., R. R. Razouk and B. E. Deal (1982) High pressure oxidation of silicon in dry oxygen, *J. Electrochem. Soc.*, **129**, 2828.
- Massoud, H. Z., J. D. Plummer and E. A. Irene (1985a) Thermal oxidation of silicon in dry oxygen growth-rate enhancement in the thin regime, Part I, *J. Electrochem. Soc.*, **132**, 2685.
- Massoud, H. Z., J. D. Plummer and E. A. Irene (1985b) Thermal oxidation of silicon in dry oxygen: growth-rate enhancement in the thin regime, Part II, *J. Electrochem. Soc.*, **132**, 2693.

- Minh, P. N. and T. Ono (1999) Non-uniform silicon oxidation and application for the fabrication of aperture for near-field scanning optical microscopy, *Appl. Phys. Lett.*, **75**, 4076.
- Nicollian, E. H. and J. R. Brews (2002) **MOS Physics and Technology**, John Wiley & Sons, Inc.
- Niskanen, A. et al. (2009) Ultrathin tunnel insulator films on silicon for electrochemiluminescence studies, *Thin Solid Films*, **517**, 5779–5782.
- Plummer, J. D., M. D. Deal and P. B. Griffin (2000) **Silicon VLSI Technology**, Prentice Hall.
- Ribbing, C., Cederström B. and Lundqvist M. (2003) Micro-fabrication of saw-tooth refractive x-ray lenses in low-Z materials, *J. Micromech. Microeng.*, **13**, 714–720.
- Shimidzu, H. (1997) Behavior of metal-induced oxide charge during thermal oxidation in silicon wafers, *J. Electrochem. Soc.*, **144**, 4335.
- Suryanarayana, P. et al. (1989) Electrical properties of thermal oxides grown over doped polysilicon thin films, *J. Vac. Sci. Technol.*, **B7**, 599.
- Vollkopf, A. et al. (2001) Technology to reduce the aperture size of microfabricated silicon dioxide aperture tips, *J. Electrochem. Soc.*, **148**, G587.
- Watanabe, T., K. Tatsumura and I. Ohdomari (2006) New linear-parabolic rate equation for thermal oxidation of silicon, *Phys. Rev. Lett.*, **96**, 196102.
- Zhang, C. and K. Najafi (2004) Fabrication of thick silicon dioxide layers for thermal isolation, *J. Micromech. Microeng.*, **14**, 769–774.

# 14

## Diffusion

The power of silicon microelectronics technology stems from the ability to tailor dopant concentrations over eight orders of magnitude by introducing suitable n- or p-type dopants into the silicon. The upper limit is set by the solid solubility of the dopants (about  $10^{21} \text{ cm}^{-3}$ ), the lower limit (about  $10^{12} \text{ cm}^{-3}$ ) by impurities which result from the silicon crystal growth. This enables a wealth of microstructures and devices, witnessed by the multiplicity of diode, transistor, thyristor and other semiconductor device designs. In silicon IC technology dopant diffusion is such a key step that the country of origin of semiconductor devices is defined as the country where diffusions were made.

Dopants can be introduced into silicon by five different methods:

- during crystal growth
- by neutron transmutation doping (NTD)
- during epitaxy
- by diffusion
- by ion implantation.

The first two techniques are applied to whole ingots, and epitaxy results in a uniformly doped layer all over the wafer. Diffusion and ion implantation are techniques to locally vary the dopant concentration, and they are the topics of this and the following chapter.

### 14.1 Diffusion Process

Diffusion is the movement of atoms along concentration gradients. Atoms from high-concentration areas move to areas of lower concentration (and if we wait long enough, there will be no concentration gradients). In microtechnology diffusion is a technique to introduce

and drive boron, phosphorus and other dopant atoms into the silicon lattice.

Thermal diffusion is a high-temperature process: diffusion temperatures for the common dopants are in the range of 900–1200 °C. Diffusion furnaces are identical to oxidation furnaces, and diffusion is a batch process where long process times are compensated by a huge loads, 100 or even 200 wafers, in a batch.

Thermal diffusion can be done from the gas phase. In gas phase doping the wafers are put in a furnace and a suitable doping gas,  $\text{POCl}_3$  for phosphorus doping, or  $\text{BBr}_3$  for boron doping, is introduced. The wafers are exposed to dopant atom vapors and doped (Figure 14.1). The alternative technique is diffusion from doped thin films. For example, boron-doped polysilicon, phosphorus-doped silica glass (PSG) or doped spin-on glass is deposited on the wafer, which is then put into a furnace. Dopants from the doped film diffuse into the silicon. The junction depth ( $x_j$ ) is the depth where diffused dopant concentration equals substrate dopant concentration. Solid solubilities of dopants in silicon are shown in Figure 14.2. If the dopant concentration exceeds the solubility limit, the dopant will precipitate, and it does not contribute to electrical conductivity.

If only certain areas need be doped, oxide is used as a diffusion mask. Even though the dopants do not diffuse through the oxide, they do modify it to the extent that diffusion mask oxides are practically always etched away after diffusion and new oxide is grown. Oxide mask thicknesses are typically around 500 nm, but might have to be thicker if it is absolutely required that no dopants penetrate the mask. This might be the case if very high-resistivity starting wafers are used: their initial doping level is very low, and a very small amount of dopant will easily counterdope them.

Typical diffusion process at 1000 °C for a few hours will result in roughly 1 μm diffusion depth. Much longer



**Figure 14.1** Thermal diffusion: (a) gas phase diffusion with oxide mask; (b) diffusion from doped thin film with oxide mask; (c) dopant profile and junction depth  $x_j$



**Figure 14.2** Solid solubilities of the most important dopants and impurities in silicon technology

diffusions are also common: CMOS well diffusion could be 16 hours at  $1150^\circ\text{C}$ , for  $5\text{ }\mu\text{m}$  depth.

Diffusion is often carried out in two steps: pre-deposition (or pre-diffusion) and drive-in. In pre-deposition a known and limited number of dopants are

introduced on the wafer, and during drive-in they will diffuse deeper. In gas phase diffusion this means that  $\text{POCl}_3$  gas is switched off during drive-in, and nitrogen or oxygen introduced. Ion implantation, where ionized dopants are accelerated into silicon, can be considered as a pre-deposition step for diffusion. Diffusion is therefore the general term for doping processes, irrespective of the actual mechanism of dopant introduction.

Doping can be performed many times over, and silicon doping type may change from p- to n-type and back again, depending on the process sequence. The device shown in Figure 14.3, a UV photodiode, is made in a modified npn-bipolar process. UV photons are absorbed in the top p<sup>+</sup> diffusion layer. The process flow will only show the diffusion aspects of the device.



**Figure 14.3** UV photodiode with shallow p<sup>+</sup> anode diffusion. The structure is based on an npn-bipolar transistor. Reproduced from Zimmermann (1999) by permission of Springer. Note that lateral and vertical dimensions in the figure are not true: diffusion is isotropic and for example 5  $\mu\text{m}$  vertical diffusion leads to 5  $\mu\text{m}$  lateral (sideways) diffusion on both sides!



**Figure 14.4** UV photodiode doping profile along the anode of Figure 14.3

### Process flow for UV photodiode (lithography, etch and oxidation steps omitted)

- p-type substrate wafer
- n<sup>+</sup> buried layer diffusion
- n epitaxial layer deposition
- p<sup>+</sup> substrate contact diffusion
- n<sup>+</sup> diffusion to contact buried layer
- p<sup>+</sup> base contact enhancement diffusion (under  $A_{IR}$ )
- p base diffusion
- n<sup>+</sup> cathode diffusion
- p<sup>+</sup> anode diffusion

The area directly underneath the anode changes its doping type three times: it is originally n-type epitayer, doped by PH<sub>3</sub> gas during epitaxy. Base diffusion changes it to p-type when the boron concentration exceeds the phosphorus concentration in the epitayer; n-cathode diffusion turns it back to n-type because the phosphorus concentration is higher than the boron concentration; and finally the surface anode diffusion with the highest boron concentration of all results in p<sup>+</sup> silicon. Figure 14.4 shows the dopant depth profile underneath the anode of Figure 14.3.

## 14.2 Diffusion Mechanisms

Fairly simple mathematical models can describe concentration profiles in solids, but at the atomistic level diffusion remains to be fully explained. This has consequences for simulators, because mechanisms are not fully known, therefore modeling remains inaccurate.

Dopant atoms move with the help of point defects: they jump to vacancies and interstitials. Substitutional dopants are fairly stable without point defects. Vacancies are always present through thermal equilibrium processes:



**Figure 14.5** Diffusion mechanisms: left, interstitial; middle, substitutional/vacancy; right, interstitialcy

vacancies are thermodynamic defects, and their nature is different from for example dislocations and stacking faults, which are “frozen.” Vacancies as a fraction of all sites can be estimated by

$$f = e^{(-E_a/kT)} \quad (14.1)$$

For 1 eV activation energy it gives about 0.01% vacant sites at 1000 °C (1273 K).

Here we outline some mechanisms for diffusion (Figure 14.5). In interstitial diffusion atoms jump from one interstitial site to another interstitial site which is always available. This is the diffusion mechanism for small atoms, like sodium and lithium. Substitutional/vacancy diffusion necessitates that an empty lattice site is available next to the diffusing atom. At high temperatures substitutional sites are thermally created. Antimony and arsenic demonstrate substitutional mechanisms. The interstitialcy mechanism is related to the substitutional mechanism: self-interstitial atoms move to lattice sites, and knock dopants out to interstitial sites, and from there they move to lattice sites. Boron and phosphorous are expected to diffuse via interstitialcy mechanism, but there are still some open questions even in diffusion of the most widely used dopants.

The substitutional and interstitialcy mechanisms with activation energies of about 3.5–4 eV are the most important for doping in silicon technology. Boron, phosphorus and arsenic as well as antimony, indium and gallium all have activation energies in this range. Therefore, doping by diffusion must take place at high temperature.

Many metallic impurities diffuse with the interstitial mechanism ( $E_a$  1–1.5 eV), and nickel for instance can diffuse through the wafer (500 μm) at 750 °C in one hour.

## 14.3 Doping of Polysilicon

Polysilicon can be doped almost like single crystal silicon. Again, the difference arises from the grain boundaries. Grain boundaries attract and lock dopants, and a proportionately smaller concentration of dopant ends up in the grains. Therefore polysilicon resistivity is never as low



**Figure 14.6** Silicon doping. Single crystal silicon resistivity is lower than poly for same doping concentration. True poly ( $610^{\circ}\text{C}$ ) and a-Si ( $560^{\circ}\text{C}$ ) resistivities also differ. From French (2002) by permission of Elsevier

as that of single crystal (Figure 14.3). This is true for in situ doping during poly deposition at all temperatures, as well as diffusion doping and ion implantation. Another feature seen in Figure 14.6 is the effect of poly deposition temperature: at  $570^{\circ}\text{C}$  “poly” is amorphous, and at  $610^{\circ}\text{C}$  poly is truly polycrystalline. They crystallize differently and end up with different grain sizes, hence different resistivities. The lowest possible n-type poly doping is achieved with phosphorus, resulting in  $400\mu\text{ohm}\cdot\text{cm}$  resistivity, while for boron-doped poly the low limit is about  $2000\mu\text{ohm}\cdot\text{cm}$ .

## 14.4 Doping Profiles in Diffusion

Concentration-dependent diffusion flux is described by Fick's first law:

$$j = -D \left( \frac{\partial N}{\partial x} \right) \quad (14.2)$$

where  $D$  is the diffusion coefficient ( $\text{cm}^2/\text{s}$ ) and  $N$  is concentration (in  $\text{cm}^{-3}$ ). The unit of flux is atoms/ $\text{s}\cdot\text{cm}^2$ .

Diffusion coefficients can be presented by

$$D = D_0 e^{(-E_a/kT)} \quad (14.3)$$

where  $D_0$  is the frequency factor (related to lattice vibrations,  $10^{13} - 10^{14}\text{ Hz}$ ),  $E_a$  is the activation energy (related

**Table 14.1**  $D_0$  and  $E_a$  values for boron and phosphorus

|                                  | Boron | Phosphorus |
|----------------------------------|-------|------------|
| $D_0$ ( $\text{cm}^2/\text{s}$ ) | 0.76  | 3.85       |
| $E_a$ (eV)                       | 3.46  | 3.66       |

to the energy barrier that the dopant must overcome),  $k$  is Boltzmann's constant,  $k = 1.38 \times 10^{-23} \text{ J/K}$  or  $8.62 \times 10^{-5} \text{ eV/K}$ , and  $T$  is temperature in kelvin.

Values of  $D_0$  and  $E_a$  for boron and phosphorus are given in Table 14.1. From those values the boron diffusion coefficient  $D$  can be calculated as  $4 \times 10^{-15} \text{ cm}^2/\text{s}$  at  $950^{\circ}\text{C}$  and as  $4.7 \times 10^{-14} \text{ cm}^2/\text{s}$  at  $1050^{\circ}\text{C}$ .

The characteristic diffusion length is given by

$$x \approx 2\sqrt{Dt} \quad (14.4)$$

so that  $1050^{\circ}\text{C}$  boron diffusion for 1 h corresponds to roughly  $0.26 \mu\text{m}$  diffusion depth. This distance is a characteristic length scale only: diffusion profiles are gently sloping and there is no clear cut-off depth.

The sheet resistance of doped layers in the general case is given by Equation 14.5a, and is approximated for a box profile (assuming constant dopant level up to junction depth, and then zero dopant) by Equation 14.5b. The box profile can be quite a reasonable assumption: for example, the arsenic profile of Figure 2.12 is fairly box-like.

$$\frac{1}{R_s} = \int_0^{x_j} q\mu(N(x) - N_b) \, dx \quad (14.5a)$$

$$\frac{1}{R_s} = q\mu x_j N(x) \quad (14.5b)$$

where  $q$  is elementary charge,  $\mu$  the mobility,  $N(x)$  the dopant concentration,  $N_b$  the background concentration and  $x_j$  the junction depth. Mobilities of n- and p-type silicon are about  $1400 \text{ cm}^2/\text{V}\cdot\text{s}$  and  $500 \text{ cm}^2/\text{V}\cdot\text{s}$ , respectively, at low concentrations ( $<10^{15} \text{ cm}^{-3}$ ), and about  $50 \text{ cm}^2/\text{V}\cdot\text{s}$  at high concentrations ( $>10^{19} \text{ cm}^{-3}$ ), irrespective of dopant. Sheet resistances of doped areas are typically from  $10 \Omega/\text{sq}$  to  $10 \text{k}\Omega/\text{sq}$ .

### 14.4.1 Infinite dopant supply (constant surface concentration of dopant)

Infinite dopant supply corresponds to gas phase doping where new dopant is constantly being injected into the diffusion tube. A heavily doped thin film (doped polysilicon

or PSG or BSG CVD oxide) can be approximated as an infinite source when diffusion times and temperatures are moderate. The concentration profile of the dopant in silicon is given by the complementary error function (erfc):

$$N(x, t) = N_0 \operatorname{erfc} \left( \frac{x}{4Dt} \right) \quad (14.6)$$

where  $N_0$  is the dopant concentration ( $1/\text{cm}^3$ ) in the surface layer,  $x$  the depth (cm),  $t$  the time (s) and  $D$  the diffusion constant at given temperature ( $\text{cm}^2/\text{s}$ ). Longer doping times will lead to deeper diffusions but the surface concentration is unchanged.

#### 14.4.2 Limited dopant supply (constant dopant amount)

Limited dopant supply describes the case of pre-deposition: a small amount of dopants is introduced in the first step. In the second step they diffuse deeper but because no new dopants are available, peak concentration at the surface will get lower as time goes by. Ion implantation, to be discussed in the next chapter, can be used to introduce a controlled number of dopants into silicon. The concentration profile of limited source diffusion is Gaussian:

$$N(x, t) = \frac{Q_0}{\sqrt{\pi Dt}} \exp \left( -\frac{x^2}{4Dt} \right) \quad (14.7)$$

where  $Q_0$  is the total amount of dopant on the surface ( $1/\text{cm}^2$ ). The junction depth  $x_j$  is given by

$$x_j = \sqrt{4Dt} \times \ln \left( \frac{Q_0}{C_{\text{subs}} \sqrt{\pi Dt}} \right) \quad (14.8)$$

Equation 14.8 cannot be solved in analytical form for diffusion time. An approximate solution for diffusion time can be obtained by a graphical solution: calculate  $x_j$  for a few diffusion times, plot the results, and estimate the junction depth from the graph. Simulators should be used for more accurate estimates.

#### 14.4.3 Diffusion profile measurement

Diffusion profiles are measured either physically or electrically. The standard physical measurement is secondary ion mass spectrometry (SIMS). The dynamic range of SIMS is up to eight orders of magnitude, that is dopant concentrations down to  $10^{14} \text{ cm}^{-3}$  can be detected (silicon atom density is  $5 \times 10^{22} \text{ cm}^{-3}$ ). Spreading resistance profiling (SRP) measurement (Figure 2.10) measures resistance with probes at the surface, and then beveling or anodic oxidation is done in order to have access to dopants

deeper inside the silicon. SRP data needs some major calculations before dopant profiles are obtained. Both SIMS and SRP are sample destructive methods. Simple four-point probe measurement gives sheet resistance, which is good for monitoring diffusion.

## 14.5 Diffusion Applications

Gas phase doping by  $\text{POCl}_3$  gas for n-type and  $\text{BBr}_3$  gas for p-type was used in the early years of semiconductor manufacturing for steps where a high degree of control was required, for example bipolar base diffusion. Solid source doping was used when high dopant concentration (near or at the solid solubility limit) was required, for example in bipolar emitters and MOS source/drain. Solid source doping has the problem that it is often very difficult to remove the dopant source material after diffusion, and residues may be left.

Resistors can be made easily if diffusion is part of the process. Polysilicon, with its higher resistivity, is better suited for making high-resistance resistors.

Bipolar devices and power transistors (Figure 1.14) rely on diffusion as the key element in their fabrication. As shown for MOS-controlled thyristor (Figure 14.7), many diffusions, some of them very deep, are needed.



**Figure 14.7** MOS-controlled thyristor. Reproduced from Sankara Narayanan *et al.* (2004), copyright © 2004 by permission of Elsevier

There are concentration and electric field effects which make actual device diffusions more complex than the simple Fick models predict. In the emitter push effect phosphorus diffusion enhances boron diffusion (see Figure 14.8). Boron diffusion alone would result in a profile predicted by simple theory, but boron diffusion under a phosphorus-doped region is much faster. This is explained by self-interstitial generation in the phosphorus diffusion process, and these interstitials enhance boron diffusion.

In oxidation-enhanced diffusion (OED) the vacancies generated by volume changes associated with thermal oxidation lead to enhanced diffusion underneath the oxide. This is pictured in Figure 14.9. Simulators can handle the emitter push effect, OED and high dopant concentration effects among other subtleties. We will see in the



**Figure 14.8** Emitter push effect: (a) unimpeded boron diffusion; (b) phosphorus diffusion pushes boron diffusion deeper under the same conditions



**Figure 14.9** Oxidation-enhanced diffusion (OED): vacancy injection during oxidation enhances dopant diffusion under oxide. Reproduced from Taniguchi *et al.* (1980) by permission of Electrochemical Society Inc.

next chapter that ion implantation produces vacancies, and these vacancies will similarly enhance diffusion.

## 14.6 Simulation of Diffusion

All high-temperature process steps contribute to diffusion, therefore diffusion is the omnipresent process to be simulated in the front-end of the process. There can easily be tens of steps that contribute to dopant profiles. Segregation effects during oxidation and dopant outdiffusion from free surfaces add to the computational and modeling loads.

The simulation of phosphorus diffusion needs to consider at least five species:

- phosphorus (P)
- vacancies (v)
- interstitials (i)
- phosphorus–vacancy pairs ( $P-v$ )
- phosphorus–interstitial pairs ( $P-i$ ).

Vacancies and interstitials are not permanent species like phosphorus atoms, and we must account for the annihilation of point defects, for example vacancy meets interstitial and annihilation takes place ( $v + i = \text{nil}$ ). Point defects can also form pairs like  $v-v$ . To make the situation even more difficult to analyze, many of the species are charged: diffusion models have to account for equilibrium processes like  $P^- + v^0 \rightleftharpoons Pv^-$  (charged phosphorus–vacancy pair) or  $P^- + i^0 \rightleftharpoons Pi^-$ . Clustering and precipitation of dopants lead to inactivation. These phenomena are especially important when concentrations are near the solid solubility limit.

A standard simulator requires as inputs for diffusion simulation:

- wafer orientation  $<100>/<111>$
- wafer doping level/resistivity
- dopant type
- concentration of dopant (gas phase/solid phase/implanted)
- temperature
- ambient (oxidizing/inert/reducing).

The doping profiles of Figure 14.10 have been calculated with the 1D simulator ICECREM.

Longer diffusion time of course leads to deeper diffusion, but the two cases of limited and infinite supply have differences. In the case of limited dopant supply the surface concentration decreases the longer the diffusion time. The area under the depth-concentration curve is constant: the number of dopant atoms is constant. In the infinite supply case the surface concentration remains constant: there is an ample supply of new dopant atoms.



**Figure 14.10** Diffusion at 1000 °C, for 100, 200 and 300 min in inert atmosphere for boron-doped substrate: (a) diffusion from a limited source, implanted dose  $10^{13} \text{ cm}^{-2}$ ; (b) diffusion from phosphorus-doped oxide film (with  $10^{20} \text{ cm}^{-3}$  phosphorus concentration, approximating an infinite source)

## 14.7 Diffusion at Large

Diffusion is of course a much larger concept than described above: it includes many processes where concentration gradients are reduced by atom movements; for example, in microfluidics two parallel liquid flows will mix due to diffusion, and hydrogen in a fuel cell is spread uniformly by a gas diffusion layer, an intricate web of nanotubes or microfibers.

Diffusion is inevitable in all high-temperature steps, but it can be minimized by minimizing process time. In rapid thermal annealing (RTA; or RTP for Rapid Thermal Processing) wafers are heated rapidly by powerful lamps, and  $2\sqrt{Dt}$  is brought down by annealing for very short times at high temperatures: a furnace diffusion of 950 °C for 30 min, might be replaced by a RTA process of 1050 °C for 10 s.

Many processes in addition to diffusion are temperature dependent: grain growth, interfacial chemical reactions, surface oxidation, etc., all experience higher rates

at higher temperatures. Also, unwanted processes like impurity diffusion are accelerated. Metallic impurities have diffusion coefficients that lead to 500 μm diffusion distances during typical heat treatments. This means that back-side contamination can reach the wafer front, and destroy device functionality there. Therefore wafer cleaning before any high-temperature step is very important.

In solar cell fabrication, cost reduction pressures are high. Diffusion steps have been simplified and shortened. P-type diffusion can be done by spin coating phosphoric acid on the wafer, and then performing 950 °C diffusion for 15 min. This resembles in some sense the result of incomplete cleaning: whatever is left on the surface will be driven inside the silicon by subsequent annealing.

## 14.8 Exercises

- What is the diffusion time required to form a pn junction at 1 μm depth at 1000 °C when boron pre-deposition is  $10^{14} \text{ cm}^{-2}$  and a phosphorus-doped wafer ( $10^{15} \text{ cm}^{-3}$ ) is used.

2. Polysilicon sheet resistance is 50 ohm/sq. What is the polysilicon thickness?
3. What is the sheet resistance of diffusion after the anneal shown in Figure 2.12?
4. If deep n-type diffusions are needed, which n-type dopant should be used?
5. How far will metallic impurities diffuse during thermal oxidation?
6. Explain the order of the fabrication steps for the spreading resistance thermometer shown below.



7. Explain step by step the fabrication process for the triple diffused bipolar transistor shown below.



8. Explain the order of diffusion steps in the fabrication of the MOS-controlled thyristor of Figure 14.7.

### Simulator exercises:

9. Boron-doped oxide film (200 nm thick, concentration  $10^{21} \text{ cm}^{-3}$ ) is deposited on a phosphorus-doped wafer

- ( $10^{15} \text{ cm}^{-3}$  phosphorus concentration). What is the junction depth doping after a 300 min, 1100 °C diffusion step?
10. What is the magnitude of the emitter push effect?
11. What is the magnitude of OED? Run some simulations to find which process parameters are important.

### References and Related Reading

- Baliga, J.B. (2001) The future of power semiconductor device technology, *Proc. IEEE*, **89**, 822 (special issue on power electronics technology).
- French, P.J. (2002) Polysilicon: a versatile material for microsystems, *Sens. Actuators*, **A99**, 3–12.
- Ghandhi, S.K. (1994) **VLSI Fabrication Principles**, 2nd edn, John Wiley & Sons, Inc.
- Kim, D.S. et al. (2006) Development of a phosphorus spray diffusion system for low-cost silicon solar cells, *J. Electrochem. Soc.*, **153**, A1391–A1396.
- MRS Bulletin (2000) Defects and diffusion in silicon technology, *MRS Bull.*, **25** (June), special issue.
- Naganawa, M. et al. (2008) Accurate determination of the intrinsic diffusivities of boron, phosphorus, and arsenic in silicon: the influence of SiO<sub>2</sub> films, *Jpn. J. Appl. Phys.*, **47**, 6205–6207.
- Plummer, J.D., M.D. Deal and P.B. Griffin (2000) **Silicon VLSI Technology**, Prentice Hall.
- Sankara Narayanan, E.M. et al. (2004) Progress in MOS-controlled bipolar devices and edge termination technologies, *Microelectron. J.*, **35**, 235–248.
- Taniguchi, K. et al. (1980) Oxidation enhanced diffusion of boron and phosphorus in (100) silicon, *J. Electrochem. Soc.*, **127**, 2243.
- Zimmermann, H. (1999) **Integrated Silicon Optoelectronics**, Springer, p. 36.

# Ion Implantation

Ion implantation (I/I) is a process where accelerated ions hit the silicon wafer, penetrate into silicon, slow down by collisional, stochastic processes, and come to rest within femtoseconds. Implantation today is the main method of introducing dopants into silicon, and in CMOS fabrication, it has replaced thermal diffusion almost completely.

Typical ion implantation energies are 10–200 keV, and doses are  $10^{11}$  to  $10^{16}$  ions/cm $^{-2}$ . Because implantation depths are of the order of hundreds of nanometers, the corresponding concentrations are ca.  $10^{15}$  cm $^{-3}$  to  $10^{20}$  cm $^{-3}$ . In 1 μm CMOS technology source/drain diffusions are made by  $5 \times 10^{15}$  cm $^{-2}$  ion implant doses, and the depth is ca. 200 nm, which translates to ca. 25 Ohm/sq. For more advanced CMOS technologies S/D sheet resistances are rapidly increasing because S/D depths are scaled down.

Lateral confinement of implanted dopants is better than in diffusion: sideways spreading under the mask is considerably less, as a rule of thumb, it is one-third of vertical range. This is especially important in advanced CMOS where extremely small dimensions are fabricated. In thermal diffusion the depth and the lateral extension are identical in the first approximation.

Photoresist can mask ion implantation, an obvious advantage over thermal diffusion which requires an oxide mask. However, implantation is always connected with a high temperature anneal step. Implantation damages the silicon crystal, and in order to recover defect-free single crystalline state, this damage has to be annealed away. Second, dopants must be activated, that is they have to be in substitutional lattice sites. Activation of dopants and damage removal both take place simultaneously, but the process cannot always be optimized for both, as will be discussed in Chapter 26.

## 15.1 The Implantation Process

Ion implantation is depicted in Figure 15.1a. Ions penetrate into silicon and into the mask, too. The mask has to be thick enough so that it will block ions. Photoresist, oxide, nitride and polysilicon are typically used as mask materials. The higher the implantation energy, the deeper the ions will penetrate, and the lighter the ion, the deeper it will go. The range of ions in matter ( $R_p$ ) is statistically distributed and rather broad. This is described by straggle,  $\Delta R$ , in Figure 15.1b.

Implanted ions scatter stochastically, traveling a distance  $R$  (range). However, we are more interested in projected range,  $R_p$ , the depth ions reach underneath the silicon surface (Figure 15.2). Straggle  $\Delta R$  is the deviation in range, the width of the depth distribution. As a rule of thumb, the concentration at  $R_p \pm 2\Delta R$  is 10% peak concentration, and at  $R_p \pm 3\Delta R$  it is 1% of peak concentration. Also of interest is lateral straggle,  $R_L$ , or the deviation from the incident direction.

Ions are decelerated in the lattice by nuclear and electronic stopping, that is by collisions with atomic nuclei of atomic number  $Z$  and mass  $M$ , and by collisions with the electronic cloud, respectively. Under a number of simplifying assumptions (about the nature of material, interaction potentials, energy independence of various variables, etc.) the Linhard solution to nuclear stopping for projectile ( $M_1, Z_1$ ) hitting a wafer of ( $M_2, Z_2$ ) is

$$S_n = 2.8 \times 10^{-15} \frac{Z_1 Z_2}{Z} \frac{M_1}{(M_1 + M_2)} \text{ eV cm}^2 \quad (15.1)$$

where  $Z$  is the reduced atomic number,  $Z = (Z_1^{2/3} + Z_2^{2/3})^{3/2}$ . Nuclear energy loss is independent of ion energy in this approximation. Electronic stopping is



**Figure 15.1** (a) Implantation: mask layer blocks selected areas; (b) dopant concentration profile inside silicon, with projected range  $R_p$  and straggle  $\Delta R$

**Table 15.1** Energy loss of implanted ions in silicon

Nuclear energy loss in silicon (independent of energy) in keV/ $\mu\text{m}$ :

|            |      |
|------------|------|
| Boron      | 92   |
| Phosphorus | 447  |
| Arsenic    | 1160 |

Electronic energy loss in silicon in keV/ $\mu\text{m}$ :

| $E$ (keV) | Boron | Phosphorus | Arsenic |
|-----------|-------|------------|---------|
| 10        | 65    | 88         | 90      |
| 50        | 145   | 196        | 200     |
| 100       | 205   | 277        | 283     |
| 200       | 290   | 391        | 401     |



**Figure 15.2** Key concepts for implanted ions:  $R$ : range is the length of ion travel;  $R_p$  is the projected range, and  $R_L$  lateral straggle

proportional to the square root of energy:

$$S_e = 3.3 \times 10^{-17} (Z_1 + Z_2) \sqrt{\frac{E}{M_1}} \text{ eV cm}^2 \quad (15.2)$$

Total energy loss is calculated as

$$\frac{dE}{dx} = -(S_n + S_e)N \quad (15.3)$$

where  $N$  is silicon atom density,  $5 \times 10^{22} \text{ cm}^{-3}$ .

Table 15.1 lists the nuclear and electronic energy losses in silicon. Equation 15.3 can now be used to estimate ranges. As a rough guide, the projected range is 90% of the range.

The combined energy loss from nuclear and electronic stopping for 100 keV phosphorus is thus 724 keV/ $\mu\text{m}$ . Equation 15.3 then gives a range of about 140 nm and a projected range of about 126 nm. Boron ions are light and they penetrate deep into silicon, with 100–500 nm as typical implanted depth, while heavy arsenic ions penetrate only 10–50 nm when standard implant energies of 10–200 keV are used.

Figure 15.3 shows dopant concentration profiles: the high peak concentration is usually below the surface (while in diffusion it is always at the surface) and there is a long tail deep into the silicon. Note that not only does higher energy result in deeper projected range, but also the peak concentration will be lower. Thinking visually of Figure 15.3, the areas under the concentration vs. depth curves must be identical for identical doses: the same numbers of dopant atoms are just distributed differently.

Masking layers for ion implantation have to be substantially thicker than projected ranges, to ensure that the ions in the tail of distribution do not penetrate the mask. Mask thickness is especially important when working with high-resistivity substrates: it does not take many dopant atoms to change dopant type.

Dose control in ion implantation is superior to thermal diffusion. Another benefit of ion implantation compared to thermal diffusion is doping through oxide. When energy is high enough, a sizable proportion of ions penetrate through the oxide and dope silicon, as shown in Figure 15.4. Both 50 keV and 150 keV boron ions will penetrate through the oxide into silicon. However, 50 keV arsenic ions are completely stopped and only a very small dose of 150 keV arsenic ions will reach silicon.

### 15.1.1 Measurements for implantation

Implanted wafers can be measured by a four-point probe for sheet resistance. It is a natural control measurement for doping. It is, however, a fairly slow feedback loop because the wafer has to be cleaned and annealed before 4PP measurement. Sheet resistance measurement sees only electrically active dopants, and annealing is therefore not just an auxiliary step for measurement but an essential part of ion implantation doping. What is more, the wafer



**Figure 15.3** (a) The 50 keV implantation of arsenic, phosphorus and boron: the lighter ions will penetrate deeper.  
(b) Phosphorus implantation with 50, 100 and 150 keV energies



**Figure 15.4** Implantation through oxide: 250 nm oxide will block 50 keV arsenic ions and most of 150 keV arsenic, too, but both 50 keV and 150 keV boron ions penetrate the oxide and dope silicon underneath

has to be discarded after 4PP measurement because it is a contact measurement and likely to cause contamination.

Alternatively, the dose can be monitored by modulated photoreflectance (also known as the thermal wave method). A modulated laser beam heats the wafer and

the periodic heat waves are monitored by another, small-power laser. Dissipation lengths are correlated to implant damage and hence to dose. This is a fast, non-contact measurement which needs no wafer preparation and can be done even on photoresist patterned wafers.

If the dopant depth distribution needs to be measured, the wafer destructive methods of spreading resistance profiling (SRP) and SIMS are available (Chapter 2). Both measurements are long and tedious and used mostly in the research phase of process development. Point defects created by implantation cannot be seen by physical analysis, but extended defects like dislocations can be seen by a TEM. Amorphization can be measured by a TEM or by XRD.

## 15.2 Implant Applications

Ion implantation emerged in the 1970s when accurate low-dose doping was required for CMOS threshold voltage control. Implantation excelled over thermal diffusion for this, and it opened up the way for implantation to start penetrating the doping business. Whenever shallow-doped structures are needed, ion implantation is used. This applies to piezoresistors on AFM cantilevers as well as advanced CMOS source/drain diffusions.

Actually, diffusion is never displaced by implantation because some diffusion takes place during implant activation anneal. And in many applications, like CMOS

wells, implantation and thermal diffusion are used in tandem: the accurate dose control of implantation is used to perform pre-diffusion, which determines the number of dopants, and long thermal drive-in, which determines the final dopant profile. This is necessary because implantation depths are shallow ( $<1\text{ }\mu\text{m}$ ) and wells used to be rather deep, in the  $5\text{ }\mu\text{m}$  range.

Self-aligned polysilicon gate was one of the great innovations of CMOS fabrication in the 1970s. Previously, source and drain (S/D) were formed by thermal diffusion, using an oxide mask. After oxide mask removal and new thermal oxidation, aluminum gate was deposited and aligned on S/D, leading to inevitable alignment errors, as shown in the left column of Figure 15.5. In the self-aligned gate process (right column of Figure 15.5) the polysilicon gate is patterned and acts as a mask for S/D doping. Suitably selecting the implant energy so that the thin oxide is penetrated but the thick poly is not, the channel area under the gate is not doped, and the S/D areas are automatically aligned with the poly gate. This reduces overlap capacitances and resistances. Aluminum cannot be used in the self-aligned gate process because the annealing step after implantation requires temperatures about  $1000\text{ }^\circ\text{C}$ .

Doping polysilicon is one of the applications where implantation offers multiple choices. Most manufacturers prefer to deposit undoped LPCVD poly because of better process control. Traditionally, poly doping was by



**Figure 15.5** Non-self-aligned aluminum gate MOS process (left) with thermally diffused S/D vs. ion-implanted, self-aligned, polysilicon gate MOS process (right)

$\text{POCl}_3$  gas phase doping, resulting in heavily doped n-type polysilicon. This is an application where thermal diffusion excels: namely, high doping concentration (e.g.,  $10^{20}\text{ cm}^{-3}$ ) blanket deposition. In the self-aligned process the poly becomes phosphorus doped during NMOS S/D implantation, and no separate poly doping step is needed. Similarly, when PMOS S/D are implanted by boron, PMOS gate poly becomes p-type as well. Of course such a dual gate structure could be done by two lithography and two doping steps before poly etching, but the self-aligned process eliminates two lithography steps. In another approach,  $n^+$  doped LPCVD poly is deposited, but lithography is done next, and PMOS areas are implanted by a heavy boron dose that is so high that dopant type inversion  $n^+ \rightarrow p^+$  occurs.

Photoresist masking works nicely for implants: it is easy to spin-coat thick enough resists to block ions. There are, however, a few issues to tackle. Accelerated ions break bonds due to their high energy. This can lead to resist carbonization, especially if high doses are used. The wafers also heat up during implantation because accelerated ions carry a lot of energy. This heating will further bake the resist and change its structure. Both of these processes make stripping of implanted resist difficult.

Channeling is a phenomenon where ions are channeled between silicon crystal planes, rather like light in optical fibers. This effect is more pronounced for light ions, and especially for  $<100>$  crystal orientation which has a very open structure (Figure 4.6). In order to avoid channeling, many implanters tilt the wafers by  $7^\circ$ , so that the ions hit the wafer obliquely, avoiding channeling. However, this leads to device asymmetry (Figure 15.6). Lateral diffusion during damage removal anneal is becoming very small in advanced devices, and this makes the problem more serious than before.

Another way to reduce channeling is to implant through thin oxide (e.g.,  $10\text{ nm}$ ). Thin oxide randomizes incoming ions, reducing channeling. These oxides serve another



**Figure 15.6** Doping asymmetry due to tilted implantation

function, too: implantation is a high-energy process, and accelerated ions can sputter metal atoms from implanter hardware. Thin oxide prevents these metal atoms from penetrating the silicon. In the post-implantation clean, this thin oxide and the metals on it can easily be removed by a HF dip.

### 15.3 Implant Damage and Damage Annealing

Nuclear stopping displaces atoms from the silicon lattice: a 100 keV arsenic ion displaces about 2000 silicon atoms along its trajectory. Damage creation depends on:

- implant species (heavy ions produce more damage)
- energy (more energy, more damage)
- dose (above about  $10^{14} \text{ cm}^{-2}$  extended damage set in)
- current (high current leads to overlapping collision cascades).

At low doses (below  $10^{14} \text{ cm}^{-2}$ ) the predominant damage type is point defects such as vacancies and interstitials, or clusters of point defects. At high doses extended defects are created. Dislocation loops are created in the crystalline silicon just next to the amorphous/crystalline interface. These are known as end-of-range (EOR) defects. If the concentration of dopants is above the solid solubility limit, dopants precipitate.

Boron does not cause appreciable amorphization, irrespective of dose, because it is a light-mass ion. High-dose phosphorus and arsenic implants can amorphize silicon, but if amorphization is needed without doping, germanium, a group IV element isoelectronic with silicon, can be used. The critical dose for amorphization is about  $10^{14} \text{ cm}^{-2}$ . Note that channeling is therefore not an issue for high implant doses because the amorphized surface layer will block the crystal channels.

The annealing temperature must be so high that atoms move. But contrary to thermal diffusion, atoms do not need to move significant distances. The dopant atoms could in theory find a suitable lattice site within an atomic distance. This is not possible in practice, though, because the defects created by implantation enhance diffusion.

Activation refers to dopant atoms being electrically active after annealing. They then occupy lattice sites in the crystal and act as donors or acceptors. A high concentration of active dopants is needed for low resistance. Dopant atoms above the solid solubility limit form precipitates or are found as interstitial atoms, and they do not contribute to electrical conductivity. The interplay



**Figure 15.7** Implantation–diffusion interaction matrix. Redrawn from Jones (1993)

between damage and doping is shown graphically in Figure 15.7.

### 15.4 Tools for Ion Implantation

Ion implantation acceleration voltages used to range from 10 to 200 kV but low-energy implanters (1 kV minimum) and high-energy implanters (max. 3 MV) exist. Low-energy implants are needed to fabricate shallow S/D junctions (below 100 nm) in deep submicron CMOS. High-energy implanters (HEIs) implant deep into silicon, 1 micrometer or even deeper. The ability to fabricate retrograde profiles, that is to have low concentration at the surface and high concentration deep down, exactly the opposite to thermal diffusion, offers some interesting possibilities, for example as a replacement for buried layers and epitaxy.

Medium-current implanters (MCIs) are 1  $\mu\text{A}$  to 5 mA, 20–200 keV, single wafer machines. High-current implanters (HCIs) are batch machines, with for example 13 wafers on a rotating wheel.

The minimum energy in a HCI is 80 keV. Extraction beam current scales as  $V^{3/2}$  which explains why a low-voltage HCI is not practical. This scaling means difficulties for low-energy, high-dose implantations which are needed for advanced CMOS S/D implants.

Implant currents can be anything from 1  $\mu\text{A}$  to 30 mA, and with doses from  $10^{11}$  to  $10^{16} \text{ cm}^{-2}$  implantation times are seconds or minutes. Beam currents are limited if photoresist is used as a mask: currents that are too high will damage the resist, and removal of the resist becomes difficult. Cooled wafer stations can be used to minimize resist damage. Implanter throughput can be up to 500 wafers per hour for low-dose applications.

Scaling down ion energy involves a number of techniques. One of the oldest is to implant molecular ions instead of simple ions:  $\text{BF}_2^+$  has a mass of 49 vs. 11 for  $\text{B}^+$ , and its range is about one-fifth the boron range to a first approximation. Carborane molecular ions ( $\text{C}_2\text{B}_{10}\text{H}_{12}^+$ ) provide 10 times more boron ions

per molecule, and energy can be high, yet range will be small. Replacement of boron ions with  $\text{BF}_2^+$  or carborane is not straightforward, however, because the behavior of fluorine or carbon during annealing and further processing needs to be assessed. True low-energy implanters must accept the fact that only low beam current is available. In the limit of 1 kV, the sputtering of surface atoms becomes important: because low implant energy equals low penetration depth, every atom layer removed from the surface will affect the final implant profile.

#### 15.4.1 Implanter design and operation

Implantation requires ions, and these are generated in ion sources which are plasma discharges. Dopants have been vaporized or be in a gaseous state before ionization. Dopant gases include  $\text{PH}_3$ ,  $\text{AsH}_3$ ,  $\text{BF}_3$ , but evaporation of solids in a furnace can also be used, and almost all elements in the periodic table can be implanted. However, the efficiency of solid sources is low and switching between ions slow. Ions are extracted from the source by voltage and enter the selection magnet (Figure 15.8).

Ion selection is based on a mass spectrometric separation according to radius of curvature  $r$  in a magnetic field  $\mathbf{B}$  balanced by the centrifugal force:

$$|\mathbf{F}| = |q(\mathbf{v} \times \mathbf{B})| = m|\mathbf{v}|^2/r = qV \quad (15.4)$$

where  $m$  is mass and  $q$  is charge, which can be solved for  $B = \sqrt{(2mV/qr^2)}$ . By adjusting the magnetic field of the selection magnet, the ion of the desired mass is selected. The magnet selection can be fooled by similar ion masses, termed mass contamination. Doubly charged molybdenum ions  $\text{Mo}^{+2}$  can pass along with  $\text{BF}_2^+$  ions (molybde-

nium is a common construction material for vacuum equipment). The  $^{11}\text{BFH}^+$  ion behaves like  $^{31}\text{P}^+$  for the selection magnet. This situation might emerge when  $\text{PH}_3$  gas is used after  $\text{BF}_3$  gas and some residual gas remains in the ion source. Energy purity refers to the spread of ion energies in the beam and consequently their range in silicon.

The acceleration tube must be kept under high vacuum in order to steer the beam toward the wafer in a collisionless fashion. After acceleration either electromagnetic or mechanical scanning spreads the beam over the wafer. Implantation is an inherently slow process because of the scanning nature of operation. Alternative implantation techniques which work in parallel mode have been devised: plasma immersion ion implantation (PIII) is a process where the wafer is immersed in plasma, and biased. Very high dose rates are possible, but energy purity is sacrificed because the selection magnet has been eliminated from the system. PIII may have uses in large-area applications like flat-panel displays because of its high throughput.

Wafers will be charged when ions are implanted. Current flows from the beam to the wafer holder, and it passes any oxides on its way. Also, beam non-uniformity between wafer centre and edge can cause lateral currents. Charging is compensated by flooding: electron gun-generated electrons hit the wafer and neutralize charges. This approach is prone to overcompensation and problems with electron charging. Plasma discharge, which produces an ion density an order of magnitude higher than the beam, is used in neutralization. Charge neutrality is inherent in the plasma system.

Implant dose is monitored during implantation by Faraday cup current measurement.



**Figure 15.8** The main elements of an implanter. Adapted from Current (1996) by permission of AIP

### 15.4.2 Safety aspects

Ion implanters pose a number of safety issues that have to be tackled. The obvious one is the high voltage that is present inside the machines. The second issue is X-rays, which are produced as ions decelerate. Lead radiation protection is routinely used around the parts where X-rays are generated. If hydrogen is implanted, as in the Smart-cut process (Chapter 22), nuclear reactions are possible at fairly low energies of 150 keV and gamma rays are then generated.

Implant gases AsH<sub>3</sub>, PH<sub>3</sub> and BF<sub>3</sub> are extremely toxic. Hydride detectors are placed inside the system to sniff for leaks. The operation and maintenance of an implanter are therefore for highly trained staff only. More safety issue discussion can be found in connection with cleanrooms in Chapter 35.

## 15.5 Ion Implantation Simulation

Implantation simulation must make a critical first choice on how to treat matter: amorphous matter is easy to model, but silicon really is single crystalline. Many simulators use single crystal silicon material parameters, but ignore the actual crystal structure.

Monte Carlo (MC) simulation offers many advantages over semianalytical implantation simulations because it can truly take silicon crystal structure into account. MC simulations not only can predict ranges and straggles, but enable physically based damage prediction, including amorphization. MC simulations are, of course, much more computationally intensive than semianalytic ones. SRIM (Simulation of Ranges of Ions in Matter) is one widely used MC simulator for implantation and other ion beam processes.

The input for a prototypical semianalytical implantation simulation includes:

- wafer orientation
- dopant concentration
- ion species
- energy
- dose.

The accuracy of simulation is very good in the surface and peak concentration regimes, but worse in the tail of the distribution (Figure 15.9). This is partly due to ion channeling, which is not readily implemented in semianalytical simulators. For heavier elements discrepancies can come from the amorphization treatment: single crystal may be used initially but, as the dose increases, the simulator adopts amorphous silicon material parameters for further



**Figure 15.9** Boron implantation into silicon, 20 keV,  $10^{15} \text{ cm}^{-2}$ . SIMS measured data shown by small diamonds, ICECREM simulation by large squares. The discrepancy in the tail results partly from ion channeling and partly from model deficiencies. SIMS data courtesy Jari Likonen, VTT

calculations. This approach is depicted in Figure 15.10 for increasing phosphorus doses.

## 15.6 Implantation Further

Implantation is used everywhere where doping is needed, from resistor fabrication (Chapters 25 and 30) to CMOS (Chapter 26) to advanced bipolar doping (Chapter 27). Implantation enables heavily doped shallow junctions for MOS S/D. Implantation is also a key technology in SOI wafer fabrication, with both oxygen and hydrogen implantation serving special roles, as will be described in Chapter 22. The thermal processes for annealing implant damage will be presented in Chapter 32.

## 15.7 Exercises

1. What will be the implant time for a wafer of 200 mm diameter when arsenic ions are implanted with a dose of  $10^{15} \text{ cm}^{-2}$  and implant current of  $100 \mu\text{A}$ ?
2. How much will wafer temperature rise during the implantation of arsenic ions of energy 100 keV and dose  $10^{15} \text{ cm}^{-2}$  with a current of 1 mA on a 200 m wafer? Make simplifying assumptions as needed.
3. Under the conditions of  $10^{15} \text{ cm}^{-2}$  phosphorus implant dose, 200 mm wafer size, PH<sub>3</sub> bottle volume 3 liters



**Figure 15.10** Different phosphorus doses compared:  $10^{12} \text{ cm}^{-2}$ ,  $10^{14} \text{ cm}^{-2}$  and  $10^{16} \text{ cm}^{-2}$  at 200 keV. The shape is different for  $10^{16} \text{ cm}^{-2}$  because it is above the amorphization limit, and different stopping parameters are applied for the amorphized region

- (STP), how many wafers can be implanted? If the ion current is 1 mA, what is the interval for bottle changing?
4. What is the range of 20 keV  $^{11}\text{B}^+$ ,  $^{49}\text{BF}_2^+$  and carborene  $\text{C}_2\text{B}_{10}\text{H}_{12}^+$  ions?
  5. What needs to be considered if boron implantation is replaced by  $\text{BF}_2^+$  implantation?
  6. What is the range of 100 keV germanium implantation?
  7. How deep will boron, phosphorus and arsenic of 100 keV penetrate into a photoresist mask?
  8. Translate implant dose  $10^{16} \text{ cm}^{-2}$  into film thickness assuming the thin film is made of implanted atoms only.

### Simulator exercises:

9. How thick an oxide layer is needed to mask boron implantation? Present your results as a function of boron energy.

10. Check by simulator the range of 100 keV phosphorus ions and compare it to the simple estimate discussed in the text.

### References and Related Reading

- Brodie, I. and J.J. Muray (1982) **The Physics of Microfabrication**, Springer.
- Chanson, E. *et al.* (1997) Ion beams in silicon processing and characterization, *J. Appl. Phys.*, **81**, 6513–6561.
- Current, M. (1996) Ion implantation for silicon device manufacturing: a vacuum perspective, *J. Vac. Sci. Technol.*, **A14**, 1115.
- Fair, R.B. (1996) Conventional and rapid thermal processes, in C.Y. Cheng and S.M. Sze, **ULSI Technology**, McGraw-Hill.
- Jones, K.S. (1993) Extended defects from ion implantation and annealing, in R.B. Fair, **Rapid Thermal Processing: Science and Technology**, Academic Press.
- Kawaguchi, M.N., J.S. Papanu and E.G. Pavel (2006) Low temperature, ion-enhanced, implanted photoresist removal, *J. Vac. Sci. Technol.*, **B24**, 651–656.
- LeCoeur, F. *et al.* (2000) Ion implantation by plasma immersion: interest, limitations and perspectives, *Surf. Coating Technol.*, **125**, 71.
- Mok, K.R.C. *et al.* (2005) Ion-beam amorphization of semiconductors: a physical model based on the amorphous pocket population, *J. Appl. Phys.*, **98**, 046104.
- Pelaz, L. *et al.* (2010) Simulation of pn-junctions: present and future challenges for technologies beyond 32 nm, *J. Vac. Sci. Technol.*, **B28**, p. C1A1
- Pelletier, J. and A. Anders (2005) Plasma-based ion implantation and deposition: a review of physics, technology, and applications, *IEEE Trans. Plasma Sci.*, **33**, 1944–1959.
- Rubin, L. and J. Poate (June/July 2003) Ion implantation in silicon technology, *The Industrial Physicist*, p. 12
- Suzuki, K. (2009) Extended Lindhard–Scharf–Schiott theory for ion implantation profiles expressed with Pearson function, *Jpn. J. Appl. Phys.*, **48**, 046510.
- White, N.R. (1996) Moore's law: implications for ion implant equipment – an equipment designer's perspective, Proceedings of the 11th International Conference on Ion Implantation Technology, Austin, TX, p. 355.
- Zechner, C. and V. Moroz (2008) Simulation of doping profile formation: historical evolution, and present strengths and weaknesses, *J. Vac. Sci. Technol.*, **B26**, 273–280.

# CMP: Chemical–Mechanical Polishing

Polishing is a key technology in silicon wafer manufacturing where final polishing yields wafers with RMS roughness as small as 0.1 nm, but it emerged elsewhere in microfabrication only in the late 1980's. It has several different uses:

- polishing of deposited films
- planarization of topography (step height reduction)
- removal of hard-to-etch materials

Polishing in microfabrication is a descendant of glass polishing which is an established technology since 400 years. Abrasive particles are dispersed in a suitable liquid to create a slurry, which is fed in between a polishing pad and the piece to be polished. In the case of a blanket wafer surface irregularities are smoothed out.

In the case of a wafer with topography, the elevated structures are preferentially removed since the local pressure is highest there. This leads to planarization. Both mechanical force and chemical etching are needed for high rate polishing, and the technology has been named CMP: chemical–mechanical polishing.

Grinding may look similar to CMP, but the two are quite different. In grinding abrasive particles of 1–100  $\mu\text{m}$  size are mounted in resin, and micrometer-sized chunks of material are removed by crack propagation and brittle fracture. Grinding is fast, but also very coarse: the substrate is damaged due to mechanical forces acting on the microstructures. This subsurface damage extends even 20  $\mu\text{m}$  deep. Grinding is used when hundreds of micrometers need to be removed, as in wafer thinning. CMP is used to remove micrometers only. In CMP abrasive particles of 10–300 nm are dispersed in a slurry. The mechanism is different from grinding, as CMP works in the atomic regime. Atomic bonds are weakened or broken, and removal is based on chemical reactions

between the slurry and the surface and the mechanical effect of the abrasive particles. Surface roughness after CMP is of the order of nanometers, orders of magnitude better than with grinding.

The first application of CMP in microfabrication was in multilevel metallization: oxide and tungsten were polished in order to planarize wafer topography, easing the next lithography step. As shown in Figure 16.1, two sequences result in identical final structures.

Copper cannot be plasma etched, but copper polishing is reasonably easy. Oxide is plasma etched, and copper is deposited into the grooves, with extra copper removed by CMP. Copper minimum linewidth is determined by oxide plasma etching, and no copper etching is needed. The difference between copper and tungsten is that tungsten can be plasma etched (Figure 11.10).



**Figure 16.1** Metal deposition, metal etching, oxide deposition, oxide polishing (left); oxide deposition, oxide etching, metal deposition, metal polishing (right)

## 16.1 CMP Process and Tool

The CMP tool consists of a solid, extremely flat platen, on which the polishing pad is glued (Figure 16.2). The wafer chuck, which holds the wafer upside down, is situated on a spindle. A slurry introduction mechanism feeds the slurry onto the pad. Both platen and spindle are rotated, and the linear velocity (used in Preston's equation) is the sum of two velocities.

There are four major elements in a CMP process:

- topography
- materials
- polishing pad
- slurry.

The machine parameters that can be used to control them are listed in Table 16.1.

Local polishing pressure is down force divided by contact area. For a flat wafer the pressure is low because load is evenly distributed over the whole geometrical area, but on a structured wafer the effective contact area is only a fraction of wafer area, and local pressure is much higher (Figure 16.3). The polishing rate is thus not constant: when the contact area is small, local pressure is high and the polishing rate is high. As polishing continues, the steps are reduced and contact area increases, leading to a decrease in polishing rate.



**Figure 16.2** Schematic structure of rotary CMP equipment. Wafer is held face down in the spindle chuck

**Table 16.1** CMP tool parameters and process responses

|                               |                 |
|-------------------------------|-----------------|
| Platen rotation               | 10–100 rpm      |
| Velocity                      | 10–100 cm/s     |
| Down force (applied pressure) | 10–50 kPa       |
| Slurry supply rate            | 50–500 ml/min   |
| Polish rate                   | 100–1000 nm/min |
| Selectivity                   | 1:1 to 100:1    |
| Uniformity across wafer       | 10%             |
| Wafer-to-wafer repeatability  | 10%             |



**Figure 16.3** Close-up of CMP set-up: the wafer, upside down, is pressed against the pad with slurry in between. Pad asperities make contact with the wafer

Structure height obviously affects CMP, but pattern density is also important because it determines the effective contact area: denser patterns are polished at a lower rate due to lower pressure. Polishing a single material is easier than polishing stacks of materials, or structures with different materials present simultaneously. The mechanical properties of the wafer itself must also be considered: if it is bowed, pressure will be different at the center and the edges, leading to non-uniform polishing. Pressure can be applied through the chuck to the wafer back side: this will equalize center–edge differences and compensate for wafer bow.

The pad should be rigid so that it uniformly polishes the wafer. However, such a rigid pad would have to be aligned and kept in alignment with the wafer surface at all times. Therefore pads are often stacks of hard and soft materials which conform to the wafer topography to some extent. Pads are porous polymeric materials (with 30–50 µm pore size) which are consumed in the process and must be reconditioned regularly. Polyurethane is commonly used for pads. Pads are very much proprietary, and are usually referred to by their trade names, rather than by any chemical or physical property.

Slurries incorporate both mechanical elements via abrasive particle size and hardness, and chemical effects via reactivity and pH of the fluid. Typical abrasive particles are silica ( $\text{SiO}_2$ ), alumina ( $\text{Al}_2\text{O}_3$ ) and ceria

(cerium oxide,  $\text{CeO}_2$ ). The abrasive particle size distribution is related to smoothness: monodispersed slurry leads to smoother surfaces.

The fluids in CMP are alkaline and acidic solutions, and the abrasive particle concentration can be for example a few percent. Copper is polished by 2%  $\text{NH}_4\text{OH}$  with 2.5 wt % alumina particles.

Slurries are a cause of concern for post-CMP: particles must be cleaned away after polishing. Like pads, slurries are often proprietary, and the information given is often restricted to pH value, base liquid (for instance,  $\text{NH}_4\text{OH}$  based) and abrasive particle size. Slurries can be buffered against consumption in the process (cf. etching in buffered HF). At the end of CMP a soft polishing step is often done: no slurry is used, just water. This step does not remove solid material but it is effective in washing away abrasive particles and corrosive chemicals.

Pad type, compressibility, hardness and elastic modulus, conditioning, pore size and ageing can be considered variables, too. Because there is a chemical component in CMP, temperature will have an effect on polishing results. CMP process factors resemble those encountered in etching: overpolishing (cf. overetching), selectivity and pattern density effects (see Chapter 21 for etch-related pattern density effects).

Plasma etching and CMP resemble each other also in the sense that both depend on the interaction between chemical and physical processes: in etching ion bombardment removes reaction products from the surface; in CMP mechanical abrasion removes surface layers which have been modified chemically, for instance by oxidative slurries.

Polish rate can be limited by the transport of reactants, or by surface processes, just like in etching or CVD. This can be found out by varying the input variables: if the rate is unaffected by a change in a variable, it cannot be the rate controlling factor. Another similarity to etching is pattern dependency: small pattern density leads to higher rates. Pattern size effect is, however, the opposite: in CMP small patterns are polished faster, but in etching small patterns will be etched slower than large ones.

## 16.2 Mechanics of CMP

There are three modes in polishing, depending on the degree of contact between the pad and the wafer, Figure 16.4. In direct contact (boundary lubrication) mode the pad makes contact with the wafer, resulting in high and constant friction because there is no lubrication from the slurry. Polish rate is very high. In rolling contact mode (mixed lubrication mode) slurry particles occasionally roll on the wafer surface. In non-contact



**Figure 16.4** Stribeck diagram of CMP: three different lubrication modes

mode (hydrodynamic lubrication mode) slurry particles are accelerated hydrodynamically and impart energy to the wafer surface, weakening the surface so that chemical attack can occur. Hydrodynamic lubrication takes place at high velocities where the load is borne by the fluid, and the system is well lubricated. The friction force between the pad and the wafer is very different in these modes and it is classified in a Stribeck diagram.

The penetration depth of the abrasive particles into the substrate is very small indeed: this is the reason for smooth surfaces with no visible grooves or scratches. The penetration depth is given by

$$R_s = \frac{3}{4} d \times \sqrt[3]{\left(\frac{P}{2kE}\right)^2} \quad (16.1)$$

where  $d$  is the abrasive particle diameter (for example, 100 nm),  $k$  is the filling factor of abrasive particles (for instance, 50%),  $P$  is local pressure (not down force, which is 10–50 kPa) and  $E$  is Young's modulus of the surface being polished. Penetration depths are on the order of a nanometer, which is similar to surface roughness after polishing, as would be expected. Increasing pressure will lead to deeper penetration but also to higher removal rate. Sometimes the abrasive particles agglomerate into huge chunks, which leads to much larger penetration depths, and microscratches tens of nanometers deep will result.

### 16.2.1 Preston model

Polish rates were measured experimentally by Preston (in 1927) and found to obey

$$R = \frac{\Delta H}{\Delta t} = K_p \times P \times \frac{\Delta s}{\Delta t} \quad (16.2)$$

where  $\Delta H$  is the change in the height of the surface,  $P$  the pad pressure,  $K_p$  the Preston coefficient and  $(\Delta s/\Delta t)$  the linear velocity of the pad relative to the wafer.



**Figure 16.5** Copper polish rate as a function of velocity (15 kPa pressure). Reproduced from Steigerwald *et al.* (1997) by permission of John Wiley & Sons, Ltd

Experimental results (Figure 16.5) show a fairly good fit to Preston's equation, especially in the low-pressure/low-velocity regime, that is in the direct contact mode.

The Preston coefficient is related to the elastic properties of the material, and it can be approximated by

$$K_p = \frac{1}{2E} \quad (16.3)$$

where  $E$  is Young's modulus. With Young's moduli in the range of 100 GPa for many inorganic and metallic solids, values of  $K_p$  are on the order of  $10^{-11} \text{ Pa}^{-1}$ . Applied pressures are on the order of 10 kPa and velocities on the order of 0.10 m/s, which lead to polish rates on the order of 10 nm/s or 600 nm/min, which is the correct order of magnitude. This estimate is, however, not accurate enough to be of predictive use. But it does explain many basic features of polishing, for instance the fact that tungsten is polished at a lower rate than oxide.

### 16.3 Chemistry of CMP

In CMP there are two components: in addition to the mechanical pressure, chemical modifications and etching take place. For instance, a tungsten surface is turned into tungsten oxide according to the reaction



Tungsten oxide has two important roles: it is a protective layer, and in the valleys it protects tungsten from further chemical attack. But it is mechanically weaker and more brittle material than tungsten, and in the high points it can

be removed by mechanical abrasion. The same mechanism is at work in copper polishing:  $\text{Cu}_2\text{O}$  is removed by mechanical action while copper is not. For hard materials like tungsten and tantalum the mechanical effects are usually important, whereas for soft materials like aluminum and polymers the chemical effects often dominate.

When  $\text{WO}_3$  is removed by polishing, the underlying metal is etched according to



Corresponding reactions in copper polishing are



Copper polishing is carried out with slurries like  $\text{Fe}(\text{NO}_3)_3$  and  $\text{H}_2\text{O}_2$ . Hydrogen peroxide oxidizes copper, which enhances the removal rate. Typical rates are 100–1000 nm/min, selectivity to oxide ranges from 40:1 to 200:1 and residual step height 100–300 nm. Copper polishing uniformities can be 10–15%, which is among the worst uniformities of any microfabrication process.

Aluminum polishing can be done in acidic solutions, for instance phosphoric acid (pH about 3–4) with alumina abrasive. Aluminum CMP proceeds by aluminum oxidation and mechanical removal of the oxide, not unlike copper and tungsten polishing. Selectivity to oxide can be 100:1.

Oxide polishing slurries are ammonia or KOH based, for instance 1–2%  $\text{NH}_4\text{OH}$  in DI water, with up to 30% silica abrasives of 50–100 nm. Oxide polishing slurries are mildly alkaline, with pH values of about 11. The oxide polishing mechanism depends on surface modification of the oxide: leaching of oxide by the slurry softens the top layer, and the mechanical abrasion rate goes up.

CMP slurries do etch without mechanical polishing, just like fluorine will etch silicon without plasma, but in both cases it is the interaction between chemical and mechanical processes that leads to the desired total process: slurry etch rates of 10 nm/min are typical, but CMP removal rates of 500 nm/min are standard.

### 16.4 Non-Idealities in CMP

CMP is an interplay between many process factors. Pressure, velocity, slurry composition, etc., can be varied for optimization, but device design cannot usually be changed

(even though sometimes dummy patterns are made in order to eliminate pattern density effects). Polish stop layers add process complexity too, but improved process control can balance the cost. Polish selectivities are not dissimilar from etch selectivities: they range from 1:1 to 200:1. Copper to tantalum selectivities are so high that measurements are difficult. Oxide to nitride selectivities can be 50:1, and this is useful in shallow trench isolation, which will be discussed in Chapter 26.

Because of finite selectivity some underlying layer loss is unavoidable. This is termed erosion and it is pictured in Figure 16.6. Another non-ideality is the dishing. It is caused by two factors: the pad conforms to some extent to the structures on the wafer, and softer material is polished faster than the surrounding hard material. Recess etching is a chemical effect. Recess in CMP can be as low as a few tens of nanometers, and in this respect CMP is superior to etchback.

Copper dishing is strongly feature size dependent, but rather insensitive to pattern density. Oxide erosion, on the other hand, is strongly pattern density dependent, but feature size independent.

On the practical side, slurry cost is a major problem. Slurries are consumables with very low utilization: in some processes it is estimated that only 2% of slurry actually participates in the process, the rest being swept away by platen rotation. Various solutions to this problem are being investigated: structured pads with grooves and channels of various shapes retain the slurry better, and also result in more uniform slurry distribution, leading to better uniformity. Another solution is to use a fixed abrasive: the abrasive particles are attached to the pad, and the slurry is replaced by particle-free chemicals.

Temperature is not constant during CMP: friction leads easily to a 10 °C temperature rise which is detrimental to reproducibility and uniformity. Rates of chemical reactions go up as expected, and this temperature

rise can easily double the removal rate. Pad hardness decreases as temperature goes up, which leads to more asperities in contact with the wafer and reduced local contact pressure. This effect, is, however, not significant compared to chemical rate increase.

## 16.5 Monitoring CMP Processes

Top view microscopy, either optical or with a SEM, can be used for gross checking of CMP. Stains from slurry residues, scratches, layer peeling and other coarse problems can be identified. Scanning probe methods, mechanical stylus and the AFM are widely used to study nanoscale phenomena. Submicron resolution is needed because many CMP effects are strongly feature size dependent. Many optical, electrochemical, mechanical, thermal and acoustic methods are being developed to monitor CMP in real time.

### 16.5.1 Post-CMP cleaning

The introduction of CMP was obviously resisted by many people because the very idea of bringing myriads of particles intentionally onto the wafer was against all accepted cleanroom and manufacturing policies. Post-CMP cleaning was, and remains, a topic of paramount importance. Brush cleaning and other physical cleaning techniques are good for removing large particles, but the smaller particles pose problems. RCA-1 cleaning is efficient in particle removal, but it is of limited use on metallized wafers. In addition to the particle problem, there is metal contamination: potassium hydroxide is a common slurry liquid, and copper residues may be embedded in PSG, which is a soft material. HF etching can remove a thin top layer of PSG and reduce the amount of copper. In order to minimize the spread of particle and chemical contamination, the CMP section is usually separated from the rest of the cleanroom, and DI water is drained immediately after use, even though used DI water is normally recycled.



**Figure 16.6** (a) Ideal CMP result; (b) erosion and dishing; (c) plug recess (chemical attack)

## 16.6 Applications of CMP

From a single application in multilevel metallization, CMP has expanded into a major technology which is used not only in IC fabrication but also in optics, MEMS, superconducting devices, micromagnetics and other areas. Polishing processes have been developed for tens of new materials, including but not limited to oxides of all sorts, noble metals, III–V semiconductors, Ge, SiGe and SiC, and polymers.

Conformal deposition processes replicate underlying topography dutifully and step height is unchanged after



**Figure 16.7** Planarization: (a) thin conformal deposition, no planarization; (b) thick conformal deposition, gap filling but no planarization, (c) local planarization by spin-on film; (d) global planarization by CMP of thick conformal deposition

conformal deposition, as shown in Figure 16.7a. Thick conformal deposition completely fills small gaps between lines but there is no step height change in large spaces, Figure 16.7b. Spin-on dielectrics flow over topography, resulting in smoothing but incomplete planarization, Figure 16.7c. CMP of thick conformal film is the closest we can get to global planarity, Figure 16.7d.

Planarization length (Figure 16.8) is defined by

$$R = \frac{h}{\tan \Theta} \quad (16.8)$$

Planarization lengths are in the range of micrometers or tens of micrometers in the maximum.

Polishing rate and planarization rate are two different concepts. Polishing rate is applicable to one material. Planarization rate is the rate of decrease in step height: the high peaks are polished, which decreases step height, but some material is removed from the valleys too, which decreases planarization rate. Toward the end of the process planarization rate drops to zero, even though the overall polishing rate is still finite.



**Figure 16.8** Planarization relaxation distance  $R$

Selectivity in CMP bears a close resemblance to etching: we need to know the polishing rates of the top and bottom films in order to calculate, for instance, substrate loss during overpolishing. As in etching, it is sometimes beneficial to have the same 1:1 selectivity between films, but most often it is desirable to remove one film relatively rapidly, and to have high selectivity against the bottom film, which can then be processed in a separate step.

Oxide polishing is the oldest and most widely practiced CMP process. Its main application is planarization in multilevel metallization in advanced ICs, where it provides a planar surface which makes subsequent lithography and deposition steps easy. One problem with oxide polishing is the lack of an end point: there is no clear end for polishing. This is called blind polishing. The opposite is stopped polishing, where for instance a nitride layer acts as a polish stop (cf. etch stop layer) but selectivities are not necessarily very high.

Tungsten polishing is another CMP process that was adopted rapidly. Contact holes and via holes are filled by CVD tungsten, which is then removed from planar areas, leaving just the contact plug filled with metal (Figure 16.1). The same structure can of course be obtained by tungsten etchback, and the first implementations of the tungsten plug process did use etchback. CMP has proven to be better than etchback with respect to plug loss: at the etching end point the etchable area decreases dramatically, and the etchant will attack the tungsten in the plug, leading to severe plug recess. CMP is much better in this respect, but naturally process optimization with either technology can bring about improvements.

CMP is used whenever global planarity is required. In addition to multilevel metallization for ICs, other applications have sprung up. In superconducting quantum interference devices (SQUIDs) CMP planarization of



**Figure 16.9** The infrared wavelength selective photonic lattice has been made with the help of CMP: oxide deposition, oxide trench etching, polysilicon LPCVD and polysilicon CMP have been repeated five times to create the lattice. As the last step all oxide has been etched away in HF. Reproduced from Lin *et al.* (1998) by permission of Nature

PECVD oxide is performed before metallization to eliminate step coverage problems and conductor cross-section variation to ensure high and constant current density, up to  $10^7 \text{ A/m}^2$ , an order of magnitude higher than in IC metallization.

Photonic crystals (photonic band gap materials) are artificial lattices where electromagnetic wave propagation is selectively restricted due to forbidden energy levels. There are many ways to fabricate photonic lattices and CMP is just one approach. Grooves are etched in the oxide and filled by CVD polysilicon. Poly is polished and another CVD oxide is deposited. Lithography and oxide etching are followed by another cycle of poly CVD

and CMP and the process is continued until the desired number of layers has been made. Oxide is finally etched away to create the air gaps. The resulting structure is shown schematically and in the SEM micrograph in Figure 16.9.

Wafer bonding (Chapter 17) depends on surface smoothness (and flatness). Bonding two silicon wafers with 0.1 nm RMS roughness is easy, but bonding two CVD oxide layers with 3 nm RMS roughness is impossible. CMP can bring smoothness of deposited films to a level comparable to silicon wafers (Figure 16.10), enabling bonding of many new materials. Polishing is also used in layer transfer applications (Chapter 22):



**Figure 16.10** AFM scans of PECVD oxide: (a) as-deposited film peak-to-valley height is 26 nm, with RMS roughness of 3.3 nm; (b) after CMP peak-to-valley height is 2 nm and RMS roughness is 0.2 nm. Courtesy Kimmo Henttinen, VTT

thin layers of semiconductor are cut from the substrate and bonded to another wafer. The cut surface is fairly smooth but polishing is needed to put it on par with silicon wafers.

Polycrystalline films are generally rough, and thicker films rougher than thinner. One application of CMP is copper polishing in GMR magnetic head fabrication: thin films a nanometer thick need to be deposited on an electroplated copper layer a micrometer thick. The roughness of plated copper can be 100 nm, and this needs to be brought to nanometer level. CMP of copper can achieve this.

## 16.7 CMP as a Whole

CMP is a relatively young microfabrication technique, and one with very complex phenomenology. As shown in Figure 16.11, there are issues related to chip design (density of patterns, pitch, die size), basic mechanical issues like material hardness as well as practical mechanical issues like platen rotation. Materials issues are also very much involved in the choice and processing of dielectric thin films, like film composition and stress, and gap fill, which is also a design issue.

## 16.8 Exercises

- What is the Preston coefficient for copper on theoretical grounds? What is the experimental value of the Preston coefficient? Use data from Figure 16.5.
- How do the polish rates of tungsten, silicon dioxide and polymers compare?
- How do polish rate and planarization rate measurements differ from each other?
- If a titanium layer 20 nm thick is used as a polish stop underneath tungsten 500 nm thick, and film thickness non-uniformities are  $\pm 5\%$  and CMP non-uniformity is  $\pm 10\%$ , what must polish selectivity be?
- Work out a step-by-step fabrication process for the photonic crystal shown in Figure 16.9. Include film thicknesses, too.

## References and Related Reading

- Hernandez, J. *et al.* (1999) Chemical mechanical polishing of Al and  $\text{SiO}_2$  thin films: the role of consumables, *J. Electrochem. Soc.*, **146**, 4647.
- Jindal, A. *et al.* (2003) Chemical mechanical polishing of dielectric films using mixed abrasive slurries, *J. Electrochem. Soc.*, **150**, G314.



**Figure 16.11** Fishbone diagram of CMP planarization. Reproduced from Rao (1993) with the permission of The McGraw-Hill Group of Companies

- Lin, S. Y., *et al.* (1998) A three-dimensional photonic crystal operating at infrared wavelengths, *Nature*, **394**, 251.
- Rao, G. P. (1993) **Multilevel Interconnect Technology**, McGraw-Hill.
- Steigerwald, J. M., S. P. Murarka and R. J. Gutman (1997) **Chemical Mechanical Planarization of Microelectronic Materials**, John Wiley & Sons, Inc.
- Stine, B. E. *et al.* (1998) Rapid characterization and modeling of pattern-dependent variation in chemical-mechanical polishing, *IEEE Trans. Semicond. Manuf.*, **11**, 129.
- Suni, I. I. and B. Du (2005) Cu planarization for ULSI processing by electrochemical methods: a review, *IEEE Trans. Semicond. Manuf.*, **18**, 341–349.
- Tas, D. K. *et al.* (2005) Online end point detection in CMP using SPRT of wavelet decomposed sensor data, *IEEE Trans. Semicond. Manuf.*, **18**, 440–447.
- Wrschka, P. *et al.* (2000) Chemical mechanical planarization of copper damascene structures, *J. Electrochem. Soc.*, **147**, 706.
- Yasseen, A. A. *et al.* (1997) Chemical-mechanical polishing for polysilicon surface micromachining, *J. Electrochem. Soc.*, **144**, 236.
- Zantye, P. B., A. Kumar and A. K. Sikder (2004) Chemical mechanical planarization for microelectronics applications, *Mater. Sci. Eng.*, **R45**, 89–220.
- Zhang, F. *et al.* (1999) Particle adhesion and removal in chemical mechanical polishing and post-CMP cleaning, *J. Electrochem. Soc.*, **146**, 2665.
- Zhong, Z. W., Z. F. Wang and B. M. P. Zirajutheen (2005) Chemical mechanical polishing of polycarbonate and poly methyl methacrylate substrates, *Microelectron. Eng.*, **81**, 117–124.

# Bonding

Bonding mates two wafers. This can be done at wafer manufacturer, to create more versatile starting wafers. It can be done as part of wafer processing, just like any other process step, and there is no need to limit it to two wafers: bonding can be continued with more and more wafers added. It can also be done at the end of the process to encapsulate the finished structures. Bonding is also essential in layer transfer: thin slices of material can be detached from substrate wafers and bonded to other wafers.

Bonding together two wafers with different properties (crystal orientations, doping levels, doping types), or wafers of two different materials, opens up new possibilities for the device engineer. Bonding dissimilar wafers is, however, difficult because lattice mismatches and thermal expansion differences create stresses and surface chemistries that are not always suitable for bond formation.

Bonding of structured wafers is used to create channels, cavities and gaps, for example for microfluidic channels, capacitive pressure sensors and RF switches. The gaps formed by bonding can be 500 µm, wafer thickness, as in Figure 17.1, or as small as submicron, as in Figure 17.2, and anything in between. Gases trapped inside closed

cavities need careful consideration because bonding temperatures can be quite high, up to 1200 °C.

There are other uses of the bonding technologies in microtechnology: wire bonding is about attaching leads to finished chips, and flip-chip bonding is about attaching fully processed chips to suitable substrates and carriers. Some of the techniques are common with wafer bonding, like thermocompression bonding, but in this chapter we will concentrate on wafer-level bonding applications only.

## 17.1 Bonding Basics

Two wafers can be joined by a number of methods, but two main classes can be distinguished:

- direct bonding (also known as fusion bonding or thermal bonding)
- bonding with intermediate layers (glasses, metals, adhesives).

At least theoretically, a wafer of any material can be bonded at room temperature to another wafer of any material via van der Waals intermolecular forces. This bonding requires that the bonding surfaces are sufficiently smooth, flat, clean and the surface chemistry is suitable for bond formation. A strong bond can then develop across the bonding interface. Two flat and smooth wafers with 0.3 nm RMS roughness do not actually have a very large contact area with each other: 0.3 nm RMS roughness translates to perhaps 3 nm peak-to-peak heights, and the weak van der Waals forces only work at distances of 0.5 nm.

In order to bring the wafers into intimate contact, several strategies are available: one of the wafers can conform to the other, and this can be done by heating. It is easy for polymers, but requires about 500–600 °C for glasses, while silicon wafers require about 1000 °C before viscous



**Figure 17.1** Wet etched silicon microreactor bonded to a glass wafer. Heater resistor on nitride membrane, catalyst metal inside channel



**Figure 17.2** Narrow gap bonding: left, tunable aluminum plate capacitor by silicon–glass bonding, adapted from Etxeberria and Gracia (2007); right, capacitive accelerometer, bonding of a silicon wafer between two glass wafers

flow takes place. A second alternative is wetting: suitable treatment will allow one material to cover the surface, for example solder flow upon heating or solvent treatment of a polymer wafer. A third method is to apply external force to ensure intimate contact.

When dissimilar materials with different thermal expansion coefficients are bonded, high stresses can emerge. This prevents many bonding processes. In anodic bonding silicon and glass CTEs have to be similar, and this can be done for some glasses but not for all of them (more on glass properties in Chapter 19). Quartz-to-silicon bonding remains much more difficult because there is no possibility to tailor the quartz CTE.

Direct bonding usually involves two identical wafers: silicon to silicon, glass to glass, PMMA to PMMA, etc. It results in identical chemical bonds at the interface as inside the material itself, and if breakage occurs, it takes place inside the wafers, not at the bond interface. The bond strength is then the same as the tensile strength of the material. The bonded wafers can be processed further as if they were one wafer.

If the temperatures involved are low enough, CTE-induced problems are minimal and dissimilar wafers can be bonded. There is constant progress toward lower and lower bonding temperatures, that is for lower temperatures without sacrificing bond strength. Direct bonding is preferred, because all the additional materials at the interfaces increase the possibilities for unwanted reactions, during either processing or device use.

Silicon–glass bonding is a special case of the direct bonding process. Usually different materials cannot be direct bonded. Silicon and glass are bonded permanently and hermetically by the application of elevated temperature and high voltage in a process called anodic bonding.

The applied voltage creates a strong electrostatic field which pulls the wafers together. The sodium and oxygen ions in the glass become mobile at about 300–500 °C, and the mobile oxygen ions move toward the silicon interface, where they react with silicon and form strong Si–O bonds.

Indirect bonding uses three classes of materials as “glues”: metals, glasses and polymers. Depending on the intermediate material, the force and temperature needed to effectuate bonding are very different: with polymer adhesives mild pressure and temperatures around 100–200 °C are sufficient, but with metal intermediates pressures of several megapascals and temperatures of 300–400 °C are typical. Glass frit bonding requires similar temperatures but less pressure.

Adhesive bonding can be used to bond anything to anything. Gluing is a form of adhesive bonding but in microfabrication more advanced methods are used. Polymer films (like epoxy SU-8) are spin coated on a wafer, patterned lithographically. This way the thickness of the adhesive is well controlled and narrow lines can be made. Thermal curing and UV curing can both be used.

Self-adhesive bonding is found with the polymer PDMS. It is an elastomer, and its softness allows intimate contact. The bond strength is small, and the bond is easily debonded, and bonded again, as long as the surfaces remain clean. PDMS is much used to create microfluidic channels. Either planar PDMS is attached as a roof to channels etched in silicon or glass, or structured PDMS is bonded to a planar silicon or glass wafer. Such bonding is simple because no alignment is needed. The great benefit of this process is that it is a room temperature process. This is important if, for instance, proteins are handled: they denature quickly

**Table 17.1** Bonding process steps

|                                   |
|-----------------------------------|
| Particle removal                  |
| Surface chemistry modification    |
| Vacuum pumping (optional)         |
| Wafer alignment (optional)        |
| Room temperature joining          |
| Application of force/heat/voltage |
| Wafer thinning (optional)         |

**Figure 17.3** Silicon-on-insulator (SOI) wafer fabrication by bonding an oxidized wafer to a bare silicon wafer: left, surface preparation; middle, room temperature joining and annealing for bond improvement; right, top wafer thinning

above 40 °C. Bonding of PDMS to glass and to PDMS is much practiced in microfluidics, as will be discussed in Chapter 18.

The basic requirements for good wafer bonding are as follows:

- The materials being bonded form chemical bonds across their interface.
- High stresses are avoided.
- No interface bubbles develop.

The driving force for bonding can be temperature, pressure, electric field or a combination of these. Irrespective of the details of various bonding processes, the general outline is the same. The basic steps of bonding are listed in Table 17.1.

If closed cavities are formed, the behavior of gases needs to be analyzed. Gases are trapped inside cavities when wafers are mated, but additional gases are released from reactions (especially during high-temperature bonding) and outgassing from cavity materials contributes to the final cavity atmosphere, too. To a first approximation these gases are treated by the ideal gas law, for example to calculate cavity pressure during annealing.

## 17.2 Fusion Bonding Blanket Silicon Wafers

When bonding is done to produce more advanced starting wafers, it is usually about blanket wafers (wafers with prefabricated cavities will be discussed in Chapter 30).

Silicon direct bonding to an oxidized silicon wafer results in strong silicon–oxygen bonds. The resulting material now has an oxide layer in between two silicon wafers, and this property is very useful: it is the basis of silicon-on-insulator (SOI) wafers. The basic steps of SOI wafer formation by bonding are shown in Figure 17.3. This type of SOI is known as BESOI, for bond etchback SOI, or simply as BSOI. There are other techniques for

SOI formation, to be discussed later on in this chapter and in Chapter 22. The SOI top layer, known as the device wafer, is electrically insulated from the bottom wafer, known as the handle wafer. This opens up many possibilities to optimize device properties in the device layer, and separate device and handle processing from each other, because the oxide acts as electrical insulation and as an etch stop layer.

Fusion bonding, like all bonding processes, begins with a cleaning step. RCA-1 cleaning with ammonia/peroxide mixture takes care of two requirements at the same time: it is effective in particle removal and it leaves the surface in a hydrophilic condition, with silanol groups (Si–OH). RCA-1 cleaned surfaces are smooth, and roughness remains smaller than 0.5 nm. CVD oxides are usually not smooth enough for bonding, and CMP must be done to achieve surface roughness below the 1 nm required for successful bonding (see Figure 16.10).

Surface energy ( $\gamma$ ) is the energy required to break a bond and to create two new surfaces. It can be estimated from bond strengths ( $E_{\text{bond}}$ ) and bond densities ( $d_{\text{bond}}$ ):

$$\gamma = \frac{1}{2} E_{\text{bond}} d_{\text{bond}} \quad (17.1)$$

The factor 1/2 comes from the fact that when a bond is broken, two surfaces are created. Two wafers in close contact are bonded via water molecules, Figure 17.4. We can get an estimate for surface energies from silicon atom surface density, about  $10^{15} \text{ cm}^{-2}$ , and hydrogen bond energies, 25–40 kJ/mol, which translate to about 200–350 mJ/m<sup>2</sup>. Measured values for room temperature hydrogen-bonded silicon wafers are between 50 and 80 mJ/m<sup>2</sup>. This indicates that less than maybe 20–30% of area is in contact with hydrogen bonds. This is understandable because the wafer surfaces are neither perfectly flat nor smooth but have local roughness and waviness, and hydrogen bonds have short range. The saturation value of surface energy after mild thermal treatment or extended time has been measured to be about 250 mJ/m<sup>2</sup>, in very good agreement with the simple bond strength calculation.



**Figure 17.4** Bonding of hydrophilic silicon surfaces via intermediate water molecules. Reproduced from Tong and Gösele (1999) by permission of John Wiley & Sons, Inc.



**Figure 17.5** Water removal and siloxane bond formation. Reproduced from Tong and Gösele (1999) by permission of John Wiley & Sons, Inc.

The reaction that takes place during annealing (for silicon–silicon fusion bonding is typically about 1000 °C) is siloxane bond (Si–O–Si) formation (Figure 17.5)



Siloxane bonds are much stronger than hydrogen bonds, and measured surface energies are about 1300 mJ/m<sup>2</sup>. This surface energy is almost constant from 150 to 800 °C.

However, surface energies calculated from Si–O bond energies (4.5 eV/bond or 430 kJ/mol) translate to about 3000 mJ/m<sup>2</sup>. This discrepancy is due to the fact that the surfaces are not yet fully bonded but have some areas which bond via hydrogen bonds only, and somewhere

above 1000 °C thermal oxide becomes viscous and flows, which increases contact area and leads to higher surface energy. Surface energies of 3000 mJ/m<sup>2</sup> are not encountered in experiments, however, because wafer breakage will take place inside the silicon because Si–Si bonds are weaker than the Si–O bonds.

Silicon-to-silicon bonding without oxides is possible. Hydrogen bonds are involved, again. Surface treatment with HF leaves the wafers hydrophobic with either H or F termination. Because HF does not remove particles like ammonia/peroxide solution, hydrophobic bonding is inferior to hydrophilic bonding. Silicon-to-silicon bonding without oxides can yield abrupt pn junctions when p- and n-type wafers are bonded.

### 17.2.1 Low-temperature direct bonding

Low-temperature bonding is needed for instance when there are structures on the wafer, and these structures do not tolerate standard fusion bonding temperatures, or because two dissimilar materials are being bonded and low temperature will keep CTE problems at bay.

Plasma activation is the main method to allow low-temperature bonding. It is not fully clear why plasma surface treatment is so beneficial. There are a number of processes that may contribute: ion bombardment breaks bonds at the surface, leaving surface atoms in reactive states; UV from plasma discharge also breaks Si–O and Si–H bonds; the surface is oxidized in oxygen plasma, but the oxide is porous and diffusion through it beneficial for water removal, Equation 17.2.

Plasma activation will enable bonding at 400 °C, which is low enough for fusion bonding to compete with anodic bonding. Bonding has been done at 200 °C, and even room temperature has been reported, but no such process has become standard procedure. Low-temperature bonding is even more sensitive to surface roughness than standard fusion bonding, and wafers with 0.1 nm RMS roughness are used.

## 17.3 Anodic Bonding

Anodic bonding of silicon to glass (also known as field-assisted thermal bonding, FATB) is the oldest bonding technique in microfabrication. It has many features which make it easy: glass is a soft material which will conform at 400–500 °C bonding temperatures, sealing structures and irregularities of up to 50 nm hermetically. Native oxides, and thin grown or deposited oxides, do not prevent bonding. Anodic bonding can be visually checked through the glass side: bonded surfaces look black, and non-bonding areas are lighter.

Not all glasses are amenable to anodic bonding (see Chapter 19 for a discussion on glass properties). Thermal mismatch between silicon and glass needs to be considered at two temperatures: the bonding temperature and room temperature/operating temperature of the device. Glasses have higher coefficients of thermal expansion than silicon, but a match at two temperatures is approximately met with glasses like Schott 8339 and 8329 and Corning 7070 and 7740 (Pyrex). The CTE of 7740 is almost constant at  $3.3 \times 10^{-6}/^{\circ}\text{C}$  from room temperature to  $450^{\circ}\text{C}$ , and that of silicon increases from  $2.5 \times 10^{-6}$  to  $4 \times 10^{-6}/^{\circ}\text{C}$ .

When glass is heated to about  $400^{\circ}\text{C}$ , sodium oxide ( $\text{Na}_2\text{O}$ ) decomposes into sodium and oxygen ions. The bonding process uses 300 to 1000 V applied to the glass wafer (Figure 17.6). Sodium ions ( $\text{Na}^+$ ) move toward the glass top surface and oxygen ions ( $\text{O}^{2-}$ ) toward the silicon wafer. This will create a depletion layer and electrostatic force pulls the glass and silicon wafer together. The resulting electrostatic forces are very strong: if the thickness of the depletion region is  $1\ \mu\text{m}$ , the field is on the order of  $500\ \text{MV/m}$  ( $E = 500\ \text{V}/1\ \mu\text{m}$ ) and the electrostatic force is proportional to  $E^2$ .

Oxygen ions react at the glass/silicon interface according to



and sodium ions are neutralized at the cathode. If higher temperatures are used, more sodium atoms are ionized and participate in the process, and will diffuse faster, so the depletion width is greater, leading to stronger bonds.



**Figure 17.6** Anodic bonding: at about  $350^{\circ}\text{C}$  glass ionizes partly, and the voltage of about 400 V drives oxygen ions toward the glass/silicon interface, creating a depletion region and a strong electrostatic field which pulls the wafers together

Bonding initiation is by applying pressure at the wafer center, but if bonding is done in vacuum, it is possible to bond without an initiation point. Typical bonding times are tens of minutes. It consists of alignment, chamber evacuation, temperature ramping, actual bonding and cooling. Total time is for example 30–60 minutes. This is fairly long for a single wafer operation, and special wafer holders have been designed so that wafer loading and unloading can be done while another wafer is being bonded.

There is usually the option for trading bonding temperature, voltage and time for each other. Microscope glass slides have been anodically bonded to silicon at  $150^{\circ}\text{C}$  (typically microscope slides have CTEs of 7–9 ppm/ $^{\circ}\text{C}$ ). One hour bonding was done, which is quite long, using a fairly high voltage of 500 V.

Anodic bonding leads to a hermetic bond seam. Vacuum cavities, or others with controlled atmosphere, can be sealed. A “collar” around the cavity is necessary, but there are no standardized design rules for this. The size of bondable area can be very small. For instance, anodic bonding has been used to make pillar frits: silicon pillars  $5\ \mu\text{m}$  in diameter (Figure 21.9) can be bonded to glass.

Oxidizable metal films like aluminum can be sealed between glass and silicon if the films are thin enough (<300 nm). Metals like gold or chromium will prevent bond formation because either they do not oxidize (like gold) or their oxides are conductive ( $\text{CrO}_2$ ). Signal lines out of a bonded structure can be done by diffused lines in the silicon wafer. The resistance of diffused wires will be high, but the surface remain planar, and these wires tolerate all possible processing steps. This method is also suitable for fusion bonded wafers.

Anodic bonding of multilayer structures is also possible: glass/silicon/glass systems (Figure 17.2, right) can be made in a single bonding step. Heating uniformity is important, and double side heating is usually employed. Contacting the middle wafer electrically requires special jigs.

## 17.4 Metallic Bonding

Thermocompression bonding (TCB) applies pressure and heat simultaneously to the samples. This is the standard wire bonding technique for attaching gold or aluminum leads to ICs. Gold is suitable because it is a noble metal: there are no gold oxides on the surface preventing TCB, and the softness of gold (low yield point) is also advantageous. Typical pressures and temperatures for wafer-level TCB are in the range of 1–10 MPa at  $300$ – $400^{\circ}\text{C}$ . Bonding times are then minutes or tens of minutes. An example of gold–gold bonding can be seen in Figure 30.8:

the two wafers forming the microphone have been bonded together by 300 °C, 15 min Au–Au bonding. Wafer-level TCB is difficult because uniform pressure is needed over a large area.

Another metallic bonding method is eutectic bonding. It is based on the fact that eutectic alloys have low melting points: for example, Au–Si eutectic alloy (19 at. % Si) melts at 363 °C even though the pure gold and silicon melting points are 1063 and 1421 °C, respectively. When gold dissolves into silicon, the eutectic point shifts to higher temperature, stopping the reaction. If silicon wafers with gold thin films are bonded, there is an ample supply of silicon, but if thin films are used, their thicknesses need to be designed so that eutectic composition is achieved. Any oxides on silicon will have to be carefully removed, otherwise Au–Si alloying will not take place.

There are tens of technologically important eutectic compositions that have been tested for bonding and many are useful in microsystem bonding. Aluminum–germanium and gold–tin systems are common. While Au–Si and Au–Sn are easy and well established, gold is a contamination source in silicon electronics. And even if there were no electronics on the wafers, gold would spread around in process tools, and therefore lab policy may prevent gold being used in equipment.

## 17.5 Adhesive Bonding

Adhesive bonding with a polymeric intermediate layer offers several advantages for bonding:

- temperatures from 20 °C to a few hundred degrees
- tolerant to (some) particle contamination
- bonding of structured wafers
- low-cost simple process.

Many negative resists have good bonding capabilities: SU-8, polyimides and benzocyclobutene have all been successfully used for adhesive bonding. Because polymers are soft materials they conform to particles, and voids are not a similar problem as with stiffer materials like silicon. The main problem with adhesive bonding is limited long-term stability and limited thermal range, with 100–400 °C maximum depending on polymer. Because of the low temperatures and benign processes, CMOS wafers can be used as substrates. A mirror array with individually addressable pixel elements steered by electronics in the bottom wafer is shown in Figure 17.7.

Prototypical steps in adhesive bonding are as follows:

1. Surface cleaning (optional adhesion promoter application).



**Figure 17.7** Aluminum mirror on nitride membrane is addressed pixel-wise by electronics in the bottom wafer. Photoresist serves the roles of both spacer and adhesive. Reproduced from Sakarya *et al.* (2002) by permission of Elsevier

2. Spin coating of polymer.
3. Initial curing (solvent bake).
4. Evacuating vacuum and joining the wafers.
5. Final curing of the polymer: pressure/heat/UV.

Initial curing is important to remove excess volatile material and thus reduce interface bubbles later on. The final curing temperature has to be above the glass transition temperature  $T_g$  of the polymer, otherwise no bonding will take place. Adhesive bonding, and bonding of polymeric materials in general, will be further discussed in Chapter 18.

## 17.6 Layer Transfer and Temporary Bonding

In layer transfer methods thin slices of material are cut from a donor wafer and bonded to a handle wafer. This cutting can be accomplished by weakening the wafer with ion-implanted hydrogen and then mechanically rupturing the top layer. SOI wafer fabrication by an ion cutting method (Smart-cut) will be discussed in more detail in Chapter 22.

Layer transfer can also be based on thin films. As an example of transferring a thin film, microchannel fabrication by transfer bonding of the photopolymer Ormocer is discussed. Ormocer film is spin coated on a polymer sheet (e.g., PDMS or PET) and bonded to a patterned Ormocer channel wafer. UV exposure through the transparent polymer sheet causes a reaction at the photopolymer interface. If the polymer sheet is inert, the bond strength between the sheet and the photopolymer is less than the bond strength between two photopolymer layers, and the carrier polymer sheet can be peeled away, leaving a structure made completely out of photopolymer. The process is shown in Figure 17.8.

As in all bonding, surface preparation is the key: the receiving surface must be able to form chemical bonds with the transferred layer. In the case of gold



**Figure 17.8** Channel formation by transfer bonding: photopolymer on a polymer sheet is bonded to a channel wafer and detached after curing the photopolymer by UV light



**Figure 17.9** Transfer of gold film from a PDMS stamp: the silicon surface is functionalized by thiol groups, and gold–sulphur bonds are stronger than gold–PDMS bonds

transfer, Figure 17.9, the receiving surface is coated by a thiol–SAM: a monomolecular layer with a sulfur atom at the end of the carbon chain, because gold forms a strong bond with sulfur (Figure 17.10) and this bond is stronger than the van der Waals hydrogen bonds which hold gold to PDMS.

Temporary bonding is used to assist handling very thin wafers, or to make very thin wafers: the wafer to be thinned is bonded to a carrier, thinned down and possibly processed further, while being bonded to the carrier. Then, at a suitable later stage, the temporary bond is detached, and the thin wafer may be bonded to its final carrier, as shown in Figure 17.11. This is common in 3D packaging applications. The final carrier is chosen for example because it has high thermal conductivity (used for laser heat sinks) or because it is a dielectric and eliminates parasitic capacitances (as in RF devices).

Bonding and layer transfer have many applications. In power transistors heating is a major issue, as is insulation voltage. While silicon is a good thermal conductor, it is a semiconductor and leakage currents will increase with temperature. Glass would solve this issue, but it is a thermal insulator. A solution is to deposit a thermally conducting but electrically insulating interlayer before bonding the power transistor to the glass. Aluminum nitride can be used (Figure 17.12). For this application



**Figure 17.10** Gold–thiol bonds



**Figure 17.11** Temporary bonding: device wafer and interim carrier bonded by adhesive; device wafer thinned by grinding/etching/polishing; adhesive bonding to final carrier; detachment of interim carrier



**Figure 17.12** Bipolar transistor with AlN heat spreader layer transferred to glass, with silicon wafer removed. Reproduced from La Spina *et al.* (2008), copyright © 2008 by permission of Elsevier

the AlN deposition process has to be optimized so that thermal conductivity is maximized while the piezoelectric effect is minimized.

Temporary bonding is a key element in 3D integration: fully processed wafers are joined together temporarily, thinned and metallized, for greater functionality. More examples will be shown in Chapter 39. Flexible, wearable electronics have been demonstrated by laminating thinned silicon chips on metallized polymers. One example is shown in Figure 39.13.

## 17.7 Bonding of Structured Wafers

Blanket wafer bonding requires smoothness and flatness, and if one (or both) of the wafers have been structured, they may be compromised from a bonding point of view. If etching has been done to make microchannels (Figure 17.1), the etch mask has to be removed somehow. If this removal process, for example wet or plasma etching, introduces microscale roughness, bonding properties may be compromised. On a wafer scale, metal deposition may introduce stresses which will bend the wafer, making bonding difficult or even impossible.

### 17.7.1 Bond alignment

Attaching a blanket capping wafer to the top of a structured wafer is easy. If both wafers have structures they need to be aligned to each other. Perfect alignment is as difficult as in lithography, and processes must be designed to work with less than perfect alignment (Figure 17.13).

Glass and PDMS are very good with respect to bond alignment: transparency makes optical alignment straightforward. Anodic bonding alignment resembles standard lithography: the glass wafer with its metal patterns can be aligned to the bottom silicon wafer (photomasks are glass plates with metal patterns).

In the microturbine (Figure 1.17) five silicon wafers have been bonded together with about  $1\text{ }\mu\text{m}$  alignment to create air and fuel channels, rotors and stators. Bonding of structured silicon wafers requires a double-sided alignment tool. Alignment marks on the first wafer are registered, the second wafer is aligned to those marks and



**Figure 17.13** Bond alignment: none, perfect and misaligned

wafers are then brought into contact. Alternatively, infrared through-wafer alignment can be used with silicon, because it is transparent in the infrared. The critical step is to maintain the alignment while the wafers are transferred to the bonding equipment. This is accomplished by a special fixture that fits both the aligner and the bonder, therefore wafers need not be handled after alignment.

### 17.7.2 Gas entrapment

Silicon and glass are impermeable to gases, therefore the pressure inside a cavity needs to be understood. This is especially important when thin membranes are made, because they will deflect due to any pressure difference between the cavity and the outside world (Figure 17.14). With polymer devices the issues are quite different and mostly concerned with keeping gases outside the cavity, that is hermeticity of the bond. In both cases gases desorb from the structures themselves during bond anneal. Polymers are more prone to outgassing, but then, on the other hand, polymer bonding temperatures are much lower.

When wafers with cavities are joined together, three possibilities for the cavity atmosphere are commonly encountered: air, nitrogen and vacuum. When vacuum-bonded wafers are cooled down and taken to atmospheric pressure, the membrane will be deflected down. In fusion bonding oxygen has reacted with silicon and formed oxide (this is dry oxidation, Equation 13.2). Nitrogen remains, resulting in a pressure of about 0.8 atm, since air is roughly 80% nitrogen, 20% oxygen. Joining the wafers under a nitrogen atmosphere will result in equal pressure after bond annealing. The ideal gas law is a good approximation for gas pressures inside cavities.

The pressure inside microcavities can be calculated from membrane bending and Equation 17.4, which gives circular membrane centerpoint deflection ( $w$ ) under applied pressure:

$$w = 0.662a \times \sqrt[3]{\frac{\Delta P \times a}{Et}} \quad (17.4)$$



**Figure 17.14** Deflection of membrane due to vacuum cavity: atmospheric pressure bends the membrane. Note that the design on the right retains parallelism of the gap

where  $a$  is membrane radius,  $t$  the thickness and  $E$  Young's modulus. Alternatively, the chips can be placed in a vacuum chamber which is pumped down. The flat-membrane condition is equated to gas pressure inside the cavity.

However, gases are not just trapped inside cavities and desorbed from walls – some gases are generated in the bonding process itself. The direct bonding reaction products of silicon are hydrogen and water. These gases behave differently: hydrogen dissolves into silicon dioxide, and if the oxide is thicker than 50 nm it can absorb all the hydrogen, but in the case of thin oxide, hydrogen bubbles may evolve. Water oxidizes silicon (wet oxidation !), because the temperature of bond annealing is similar to the oxidation temperatures.

If CVD oxide is used in bonding, it will release gases when annealed: remember that hydrogen is always embedded in CVD oxides. These oxides have to be annealed above the bonding temperature before bonding. This is even more pronounced when polymeric materials are present: polymers can absorb quite considerable amounts of water, and this will be released upon heating.

In anodic bonding oxygen is diffusing toward the interface, and oxygen gas accumulates in the cavity. Bonding pressure needs some attention when anodic bonding is done on wafers with cavities. At millitorr pressures a glow discharge can be initiated in the cavity. Therefore either a good vacuum or atmospheric pressure is desirable.

### 17.7.3 Mechanical strength

Even when bonding is successful from the surface chemistry/bond strength points of view, there may be problems: it is not guaranteed that the cover wafer is strong enough for the span intended (Figure 17.15). Membranes will deflect because of a pressure difference, but this is elastic, and when pressures are equalized, flat membranes will result. But with thin membranes at elevated temperatures there is the danger (and possibility) of irreversible plastic deformation. First of all, thin membranes are mechanically not very rigid, and, second, yield stresses are much

reduced at elevated temperatures. See Figure 19.15 for a discussion on irreversible membrane deflection.

Span and gap are related: if narrow gaps with long spans need to be made, bending of the wafers can result in closure of the gap. This can be circumvented by using stiffer materials (higher Young's modulus) or thicker wafers. The design also affects strength: circles are sturdier than long lines, for instance. The ratio of span to gap can be for instance 1000:1 (millimeter span for micrometer gap) for anodically bonded sturdy designs.

## 17.8 Bond Quality Measurements

Cleanliness is paramount in wafer bonding: particles at the bond interface will prevent bonding locally. Anodic bonding can easily be observed through the glass side, but if the wafers are not transparent, infrared optical measurement through the wafer is possible. For silicon this translates to 1.1  $\mu\text{m}$  wavelength and above. The height of voids can be inferred from interferometric rings, with  $\lambda/4$  as the minimum detectable height, or about 0.28  $\mu\text{m}$  for silicon.

Acoustic microscopy (SAM, Scanning Acoustic Microscopy) can be used to check voids in the finished wafer stack non-destructively. The wafer is immersed in water and high-frequency ultrasound is aimed at it. Theoretical resolution is down to micrometers, but full wafer scanning is often desirable, and scanning time can be traded for resolution so for example a 1 min scan with 100  $\mu\text{m}$  resolution is done.

Bond strength can be measured by a simple crack opening method: a razor blade is inserted between the wafers and the crack opening length is measured. Bond strength  $\gamma$  is given by

$$\gamma = \frac{3Ed^3t_b^2}{32L^4} \quad (17.5)$$

where  $E$  is Young's modulus,  $d$  the wafer thickness,  $t_b$  the blade thickness and  $L$  the crack length. This method is quick and easy with no wafer preparation, but because of the  $L^4$  dependence on crack length, any uncertainties in crack length determination will lead to erratic results.

Debonding the wafers and visual or microscopic examination reveal bond interface quality. Bond strength can also be checked by pull tests. In a pull test the two wafers are torn apart, and the debonding force is measured. This is better than the crack method because it measures bond strength all over the wafer, not just at the edge (where the bond might be of inferior quality). The problem with a bond test is that the bond may be stronger than the attachment of the pulling set-up, and only a lower limit is obtained.



**Figure 17.15** Sagging over a long span

Anodic bonding bond strengths are 20–30 MPa, similar to the best polymer-to-polymer bonds. The usual polymer bonds are 1–10 MPa and PDMS self-adhesive bonds about 0.5 MPa. Bond tightness can be measured by gas leakage. When patterned and etched silicon wafers have been direct bonded to form cavities, etched depths of 6 nm can be sealed gas-tight, but 9 nm grooves will result in leakage. Higher anneal temperature will seal slightly better. Anodic bonding is much more flexible: even 50 nm grooves can be sealed in a gas-tight manner.

## 17.9 Bonding for Packaging

Bonding is also a typical method to create zero-level packages for MEMS devices. A cavity with controlled ambient is needed for several reasons: damping of mechanical resonators depends on pressure; clogging of movable structures by particles must be prevented; and surfaces have to be protected from atmospheric water vapor condensation. With all the sensitive, delicate micromechanical parts covered by a capping wafer, subsequent operations of dicing, encapsulation, mounting, etc., can be generic, whereas packaging of unprotected chips with vibrating beams or bending plates would have to be developed for each and every design separately. The alternative method for cavity formation is deposition. This will be discussed in Chapter 30. Deposition avoids

one of the main drawbacks of bonding, which is the extra cost of an additional wafer.

Glass-frit bonding is an old established technique for fairly low-temperature hermetic bonding. Hermeticity equals low leakage and long-term stability of the cavity atmosphere. The glass frit consists of a powder of low-melting-point lead glass and polymeric binder. It is screen printed on a wafer (making it amenable to large linewidths only) and baked to remove binder. The glass frit is then pressed against the wafer to be bonded and heated to about 400 °C. The glass frit is quite thick, for example 15 µm, which makes it insensitive to particles and microstructures. It is easy to have metallization run



**Figure 17.16** MEMS resonator enclosed by a silicon cap wafer and glass-frit bonding: metallization runs under the glass frit from a vacuum cavity. Glass frit conforms to some extent to height differences

**Table 17.2** Bonding techniques compared

| Method                  | Advantages                 | Disadvantages                       |
|-------------------------|----------------------------|-------------------------------------|
| Direct Si bonding       | Strong bond                | Smooth and flat surface needed      |
| Low- <i>T</i> Si direct | Hermetic sealing           | Very high temperature >1000 °C      |
| Fusion, glass           | Hermetic at low <i>T</i>   | Very flat and smooth surface needed |
| Fusion, polymer         | CMOS compatible            | Sensitive to surface preparation    |
| Anodic                  | Strong bond                | 650 °C needed for Pyrex             |
|                         | Hermetic                   | Wafer deformation a problem         |
|                         | Seals small irregularities | Depends on <i>T<sub>g</sub></i>     |
| Thermocompression       | Non-flat surface OK        | Flow of material may block cavities |
|                         | Hermetic                   | High voltage and discharge danger   |
| Eutectic                | Hermetic                   | Sodium accumulation on cathode      |
| Glass frit              | Strong bond                | Requires CTE matched glass          |
|                         | Hermetic                   | High forces needed                  |
| Adhesive                | Covers metallization       | Difficult on wafer level            |
|                         | Versatile                  | Flat surface needed                 |
|                         | Covers metallization       | Requires special metals             |
|                         | Insensitive to particles   | Large-area bond seam                |
|                         |                            | Lead may be contamination source    |
|                         |                            | Non-hermetic                        |
|                         |                            | Low-temperature uses only           |
|                         |                            | Reactions with fluids               |

under the glass frit, thus enabling electrical contacts to hermetic cavities (Figure 17.16). The poor thickness control prevents the use of glass-frit bonding in applications where bonding defines device-critical vertical dimensions. Glass-frit bonding is utilized in many bulk micromechanical applications like pressure sensors and accelerometers.

Adhesive bonding can be used in much the same way as glass-frit bonding; for example, structures resembling exactly Figure 17.15 can be made by adhesive bonding. However, the polymer seal is not hermetic, and the long-term reliability of adhesively bonded structures has to be assessed for each application. All polymers are permeable to some degree, even though Teflon-like polymers have permeabilities 10 000 times less than that of PDMS, but metals have permeabilities another 10 000 times less than the densest polymers.

## 17.10 Bonding at Large

Because there are so many different bonding techniques available, some bonding solution is almost invariably applicable. All methods have their pros and cons, and they depend on device requirements. Table 17.2 lists some important issues of major bonding techniques.

## 17.11 Exercises

1. Which materials can be present on a wafer undergoing silicon–silicon fusion bonding?
2. Calculate the gas pressure inside an anodically bonded cavity when bonding has been done at ambient pressure and 400 °C temperature!
3. Outline the fabrication sequence for the tunable capacitor of Figure 17.2! Which process steps contribute to gap size?
4. Design a fabrication process for the accelerometer of Figure 17.2 (right).
5. What is the resolution of a 160 MHz acoustic measurement of voids?
6. Which measurements could reveal the role of sodium ion depletion in anodic bonding?
7. How much will a silicon membrane 0.5 µm thick and 1 mm in diameter bend due to 0.2 atm pressure difference? What about a 5 µm membrane?

8. Calculate the bond strength from Equation 17.5 for two silicon wafers when a blade of 50 µm has opened a crack of 8 mm.

## References and Related Reading

- Berthold, A. *et al.* (2000) Glass-to-glass anodic bonding with standard IC technology thin films as intermediate layers, *Sens. Actuators*, **82**, 224.
- Christiansen, S. H., R. Singh and U. Gösele (2006) Wafer direct bonding: from advanced substrate engineering to future applications in micro/nanoelectronics, *Proc. IEEE*, **94**, 2060–2106.
- Dziuban, J. A. (2006) **Bonding in Microsystem Technology**, Springer.
- Etxeberria, J. A. and F. J. Gracia (2007) Tunable MEMS volume capacitors for high voltage applications, *Microelectron. Eng.*, **84**, 1393–1397.
- Henttinen, K. *et al.* (2000) Mechanically induced Si layer transfer in hydrogen-implanted Si wafers, *Appl. Phys. Lett.*, **76**, 2370.
- Huff, M. A. *et al.* (1993) Design of sealed cavity microstructures formed by silicon wafer bonding, *J. Microelectromech. Syst.*, **2**, 74.
- Jourdain, A. *et al.* (2002) Investigation of the hermeticity of BCB-sealed cavities for housing (RF-)MEMS devices, *Proceedings of IEEE MEMS 2002*, p. 677.
- La Spina, L. *et al.* (2008) Aluminum nitride for heatspreading in RF IC's, *Solid-State Electron.*, **52**, 1359–1363.
- Lee, B. *et al.* (2003) A study on wafer level vacuum packaging for MEMS devices, *J. Micromech. Microeng.*, **13**, 663.
- Mack, S. *et al.* (1997) Analysis of bonding-related gas enclosure in micromachined cavities sealed by silicon wafer bonding, *J. Electrochem. Soc.*, **144**, 1106.
- Niklaus, F. *et al.* (2006) Adhesive wafer bonding, *J. Appl. Phys.*, **99**, 031101.
- Sakarya, S. *et al.* (2002) Technology of reflective membranes for spatial light modulators, *Sens. Actuators*, **A97–98**, 468.
- Tong, Q.-Y. and U. Gösele (1999) **Semiconductor Wafer Bonding**, John Wiley & Sons, Inc.
- Tsau, C. T., S. M. Spearing and M. A. Schmidt (2002) Fabrication of wafer-level thermocompression bonds, *J. Microelectromech. Syst.*, **11**, 641–647.
- Vallin, Ü, K. Jonsson and U. Lindberg (2005) Adhesion quantification methods for wafer bonding, *Mater. Sci. Eng.* **R50**, 109–165.

# 18

## Polymer Microprocessing

Polymer microprocessing employs a wide variety of methods, some of them scaled-down versions of macroscopic polymer processing, like injection molding and casting, and some borrowed from microfabrication and applied to novel materials like CVD of parylene, plasma deposition of Teflon and lithography of epoxy. Polymers are essential in optical lithography and newer patterning methods like embossing and nanoimprint lithography (NIL) also make patterns in polymers. Most of the time photoresist or NIL polymer is stripped away after it has served as etch, implant or plating mask, but it is also possible to leave the polymer structure as a permanent part of the finished device. Such structures are central in this chapter. In Figure 18.1a epoxy pillars 300 µm tall, 30 µm in diameter made by optical lithography are shown, and in Figure 18.1b imprinted epoxy grating with 60 nm feature size is shown.

Polymers are soft materials (have small Young's modulus) and this property is utilized when flexible structures or sensitive mechanical elements are needed. Polymers can have exceptional combinations of properties, like optical transparency and elasticity (PDMS); or a refractive index of 1.34 and low loss tangent in RF (amorphous fluoropolymer); or low thermal expansion and excellent solvent tolerance (polyimides). But then some other properties might be disadvantageous: water absorption is high and solvent tolerance often poor. Because of the wide range of polymers existing, it is often possible to find a material with a suitable mix of properties. Polymers are cheap, which is important when disposable devices are made. The main drawback of polymers is their limited stability (mechanical softness, limited thermal range, porosity) and this applies to both device processing and operation.

Polymer processing methods can be divided into two gross categories: direct fabrication and replication. Direct fabrication results in the final structure, as in lithographic patterning. Replication methods rely on a master or

mold, which is used to force the polymer into the desired shape. These two methods can result in exactly the same final structures, as shown in Figure 1.9. There are many different implementations, and the terminology is somewhat diffuse. Embossing and imprinting both use a 3D stamp to force the structure into softened polymer, while molding (also spelled moulding) and casting both describe processes where polymer is flowing (because either it is molten or the monomer is fluid) and filling a master/mold. This difference between imprinting and molding is depicted in Figure 18.2.

Various hybrid methods exist, like UV embossing, where liquid photopolymer is stamped and cured by UV radiation. Molding techniques can be applied not only to polymers, but also to inorganic materials, like steel powder. These will be discussed in Chapter 23 on special processes.

### 18.1 Polymer Materials

Polymer molecules are long chains of repeating monomer units. In polyethylene (PE) the  $C_2H_4$  monomer unit is repeated thousands of times, resulting in molecular weights (MWs) of for example 30 000 or 300 000. Polymerization processes do not produce exact chain lengths but result usually in rather wide distributions, and in addition to chemical formula, it is important to know MW. Many polymer properties are a function of MW, for example viscosity, which is important in micromolding and stamping processes.

The bonds between carbon atoms in the chain are strong, but the chains interact only via weak hydrogen bonds. Polymers are held together also by mechanical entanglement of the chains. Because of the weak hydrogen bonds, polyethylene easily softens when heated: the polymer chains slide past each other.



**Figure 18.1** (a) fluidic sieve with  $300\text{ }\mu\text{m}$  high,  $30\text{ }\mu\text{m}$  diameter pillars made by photolithography, courtesy Santeri Tuomikoski, Aalto University; (b) optical grating,  $60\text{ nm}$  linewidth made by nanoimprint lithography, reproduced from Martinsen (2008) by permission of Elsevier. Both structures are made of SU-8 polymer



**Figure 18.2** Molding (left): material flow into mould master. Imprinting/embossing (right): the stamp is pressed against solid (but softened) polymer

In polyvinyl chloride (PVC) the repeating unit is  $\text{CHCl}-\text{CH}_2$ , and in polytetrafluoroethylene (PTFE)  $\text{C}_2\text{F}_4$  units are repeated. Parylene-C has the repeating unit consisting of a benzene ring with chlorine. Such aromatic benzene rings render polymers thermally stable. BCB (benzocyclobutadiene) is another aromatic thermally stable polymer widely used in microsystems. Epoxy SU-8 consists of aromatic rings and eight epoxy groups which will result in tight crosslinking upon heating. The bonds are strong covalent bonds which are not weakened by heating, therefore cross-linked SU-8 does not flow. The chemical formulas of common microfabrication polymers are shown in Figure 18.3.

Polymers are available in many formats: they are available as sheets and wafers, with thicknesses ranging from a few micrometers to millimeters. They serve as substrates for processing. Some are available as solutions and can be applied by spin coating, while others can be synthesized from monomers in liquid phase, vacuum and

plasma processes, and applied as thin films not unlike sputtered or PECVD films. Some polymers are available in many formats: polymethylmethacrylate (PMMA) is used in wafer format for hot embossing, and as spin-coated thin film it serves as electron beam resist. More commonly PMMA is known as Plexiglas. Similarly, polyimides are used as substrates  $50\text{ }\mu\text{m}$  thick for flexible electronics under the trade name Kapton, and as high thermal stability spin-coated negative resists, as well as spin-coated non-photoactive films.

Silicon is also present in many polymers, and because of its chemical similarity to carbon (both are column IV elements that form four bonds), there are interesting silicon-based polymers, for example siloxanes. Poly(dimethyl)siloxane (PDMS) has the basic repeating unit of  $-(\text{O}-\text{Si}(\text{CH}_3)_2-\text{O}-)$ . Its backbone consists of silicon and oxygen, but the side chains are organic groups. Similar polymers with different organic groups have been synthesized with interesting properties ranging between polymers and  $\text{SiO}_2$ . These materials can sometimes combine the best of both worlds: for example, low dielectric constant and improved thermal stability. One such hybrid polymer is Ormocer (for organically modified ceramic) which can be processed like negative resist, but which results in oxide-like material, for instance a low contact angle compared to polymers, and much higher temperature stability than most polymers.

Polymers can be divided into two major classes: thermoplasts and thermosets (or duroplastics). Thermoplastic molecules have only weak bonds between the polymer chains and, when heated, the polymer flows (like polyethylene). Thermoplasts can be repeatedly heated and cooled, with no permanent change. Many common



**Figure 18.3** Polymer chemical structures. Left column: PE, PDMS, PFTE, parylene. Right column: polyimide, SU-8 epoxy

polymers are thermoplasts: PE, polycarbonate (PC), PMMA, COC (Cyclic Olefin Copolymer), PET, PEEK, PVC and PS.

Thermosets are polymers that react upon heating and form strong covalent bonds between the chains. Thermosets will flow for a while when heated for the first time, but then crosslinking and hardening take place. The resulting crosslinked material will not flow even above  $T_g$  because of the rigidity of their structure. Thermosets include photoresists, epoxies (like SU-8), PTFE, polyimides, polyesters, parylene and BCB.

Elastomers form a third group. In these materials the bonds between the chains are flexible and the material returns to its shape even after 1000% elongation. Rubbers are typical elastomers. In microfabrication PDMS and Viton are common elastomers.

There are other classifications, too. Some polymers are amorphous. They are glassy, in the same sense as glasses, they exhibit no long-range order and they are optically transparent. Amorphous polymers include PMMA, PC, COC, PI, PDMS and PVC. Semicrystalline polymers

show local crystallinity in the midst of amorphous areas. Sometimes they are called crystalline polymers for short, but there are really no crystalline polymers. In semicrystalline polymers the crystal/amorphous interfaces diffract and reflect light, and the resulting material is opaque. Semicrystalline polymers include PP, PET, PEEK, PVDF and PTFE.

Polymers are often called by their trade names, increasing confusion. Teflon is PTFE, Topas and Zeonor are COC, PMMA is Plexiglass and PC Lexan. Kapton is polyimide and Mylar is polyester. Viton and CYTOP are fluoropolymers, AZ resists are novolak resins and Sylgard 184 is PDMS.

Polymer properties vary a lot, and Table 18.1 gives just a glimpse of typical values. It indicates the major properties of polymers in general. Table 18.2 lists in a little more detail the properties of common microfabrication polymers. In both tables the values do not cater for all blends and varieties; for instance, imides are a large class of polymers with wide-ranging properties. Temperature limits are partly set by the glass transition temperature  $T_g$ ,

**Table 18.1** Typical polymer properties

|                              | Value                 | Comments                                                       |
|------------------------------|-----------------------|----------------------------------------------------------------|
| Density (kg/m <sup>3</sup> ) | ~1000                 | Teflon ~2000                                                   |
| Dielectric constant          | 2–4                   | SiO <sub>2</sub> , $\epsilon \sim 4$                           |
| Volume resistivity (ohm-cm)  | $10^{14}$ – $10^{17}$ | Similar to SiO <sub>2</sub> and Si <sub>3</sub> N <sub>4</sub> |
| Index of refraction          | 1.35–1.65             | Resists ~1.6                                                   |
| Dielectric strength (MV/cm)  | 2–3                   | SiO <sub>2</sub> ~10                                           |
| Young's modulus (GPa)        | 1–5                   | PDMS ~0.01, SiO <sub>2</sub> ~76                               |
| Tensile strength (MPa)       | 10–100                | Imides even higher                                             |
| Residual stress (MPa)        | 10–30                 | PDMS very low                                                  |
| Water contact angle (deg)    | 70–100                | Imides 50°, PTFE 114°                                          |

and partly by thermal degradation temperature, or melting. Additional properties and selection criteria will be discussed below.

It must stressed that different manufacturers offer polymers of the same basic formula with different properties, and any values in this chapter are indicative of major trends only.

Note that MW and polymer processing history (baking time and temperature, and hence crosslinking density) have considerable effects on many of the above parameters.

Other properties which may be of importance in certain applications include the dielectric loss tangent, which ranges from 0.0002 for PTFE to 0.04 for PMMA. Shrinkage upon crosslinking is a source of stresses, and it can be considerable (5–10%).

Polymer selection involves further considerations: chemical stability against acids, bases and solvents, for example. PDMS has excellent chemical resistance to weak acids and alkaline solutions but it is unsuitable

for most organic solvents. SU-8, parylene, PTFE and polyimides are very resistant to most solvents, acids and bases. PTFE is exceptional in its chemical stability and low water absorption (<0.01%), while for PDMS it is 0.1% and for PMMA and polyimides a few percent.

Polymers are generally electrical and thermal insulators, and can serve many of the same functions as silicon dioxide and silicon nitride: as intermetal insulators, gate dielectrics, passivation films and free-standing membranes. Semiconducting polymers like polythiophene vinylene (PVT) and PEDOT can be used in transistor channels and conducting polymer polyanilin (PANI) has been used for “metallization” of transistors. The resistivity of a conducting polymer can be as low as 10 000  $\mu$ ohm-cm, a thousand times higher than metals, but only a hundred times the resistivity of polysilicon. As will be discussed in Chapter 26, polymers can serve every function needed in a transistor (Figure 26.22). The main advantages of polymer electronics are flexibility (Figure 18.4) and low-temperature processing, but their performance does not match silicon devices, and polymer transistors are geared toward very different applications like embedded electronics, enabling cheap intelligence in products.

## 18.2 Polymer Thermal Properties

Because polymer MWs have wide distributions, polymer properties change gradually, not abruptly. The glass transition temperature ( $T_g$ ) is an important parameter. Below  $T_g$  polymers are brittle and glass-like, but above  $T_g$  thermoplasts become soft (viscosity decreases) and they will flow and can be molded. Baking above  $T_g$  is also a stress reduction method: above  $T_g$  the polymer will relax stresses because it becomes softer and is able to accommodate stresses. Baking photoresist above  $T_g$  will mean that it will flow and end up as a hemispherical dot (see Figure 18.30 below). While a non-vertical resist sidewall profile is generally regarded as a processing

**Table 18.2** Properties of common microfabrication polymers

|           | $T_g$ (°C) | $T_{deg}$ (°C) | CTE (ppm/°C) | Thermal conductivity (W/K-m) | UV transparency |
|-----------|------------|----------------|--------------|------------------------------|-----------------|
| PMMA      | 100        | 200            | 70           | 0.2                          | Opaque          |
| PC        | 150        | 230            | 65           | 0.2                          | >350 nm         |
| PDMS      | -130       | 400            | 300          | 0.15                         | >240 nm         |
| SU-8      | 240        | 340            | 100          | 0.2                          | >350 nm         |
| Polyimide | 400        | 620            | 3–50         | 0.2                          | Opaque          |
| Parylene  | 150        | 290            | 35           | 0.1                          | >300 nm         |
| Teflon    | 130        | 330            | 100          | 0.1                          | opaque          |



**Figure 18.4** Flexible glucose sensor. Reproduced from Kudo *et al.* (2006), copyright © 2006 by permission of Elsevier

problem, these dots can be used for microlenses. Elastomers are weakly crosslinked materials with values of  $T_g$  below room temperature, and therefore they are soft and easily deformed by pressure, but when pressure is removed, they return to their original shape. PDMS is the most widely used elastomer in microfabrication, with  $T_g$  of  $-130^\circ\text{C}$ .

Temperature is a key variable in polymer microprocessing because polymer viscosity depends strongly on temperature. The different temperature regimes of a polymer are shown in Figure 18.5. Viscosity, which describes the mechanical response to a shearing force, is plotted vs. temperature. The first major transition occurs at  $T_g$ . Polymer becomes viscous, and can be embossed/imprinted, even though embossing is usually done at higher temperature, around the flow transition, in order to speed up the process. At higher temperatures the polymer melts; this is the regime of injection molding. Not all polymers flow and melt: some will decompose when heated.

Polymer dimensional stability is poor compared to inorganic materials. Thermal expansion is huge, typically  $50\text{--}100\text{ ppm}/^\circ\text{C}$ , compared to  $3\text{ ppm}/^\circ\text{C}$  for silicon and  $23\text{ ppm}/^\circ\text{C}$  for aluminum. This leads to many problems: even mild temperatures, like the  $120^\circ\text{C}$  encountered in lithography bake processes, can cause large thermal stresses. And the shrinkage of polymer after it has been molded at elevated temperatures has to be accounted for. In CDs and DVDs these two factors are somewhat easier than in microfabrication in general, because the molten polymer is fed in from the middle, and shrinkage is radially symmetric.



**Figure 18.5** Thermoplastic polymer viscosity vs. temperature diagram. Adapted from Schift (2008)

Polymers are low-density materials and always porous to some extent. Water vapor absorption varies a lot: for some fluoropolymers the coefficient of permeability is  $10^{-15}\text{ mol}\cdot\text{m}/\text{m}^2\cdot\text{s}\cdot\text{Pa}$ , for polyimide two orders magnitudes more, and yet another two orders of magnitude more for elastomers. So while polymer packaging of microsystems is attractive because of ease, it is not applicable to cases where hermetic sealing is needed. Permeability is sometimes useful: in cell culture devices oxygen can diffuse through the PDMS roof, and no separate fluidic channels are needed to provide oxygen. Metallization of polymers is difficult because desorption of water from pores will poison the vacuum, and this water vapor will oxidize metals, resulting in increased resistivity.

Polymers can also be filled with other materials, to modify their properties. Microfabrication makes no exception: for electrical conductivity, carbon black has traditionally been used, and carbon nanotubes and silver nanoparticles are now used in microtechnology. Glass spheres and glass fibers have been used in the macro world, and the same applies in the micro world. Young's modulus can be tailored by an order of magnitude by fillers. Magnetic nanoparticles can be used to make polymers react to magnetic fields and it is also possible to retain photoactivity when the nanoparticle size is small relative to exposure wavelengths (and their density low enough).

### 18.3 Thick-Resist Lithography

Thick can mean very different thicknesses to different people. In IC fabrication resist thickness is from a hundred nanometers to a few micrometers. In MEMS and thin-film magnetic recording head (TFH) fabrication, thick can

mean anything from 5 to 200  $\mu\text{m}$ , and in X-ray lithography thick extends into the millimeter regime. As with thin resists, a number of sometimes conflicting requirements are imposed on thick resists, such as resolution, sidewall profile, sensitivity (photospeed), thermal stability, adhesion and shelf life.

Thick-resist processing has a few extra factors that need attention, compared to standard resists. Rapid solvent evaporation has to be prevented because rapid and large shrinkage leads to defective and non-uniform films as the surface layer dries and encapsulates the material beneath. One solution is a closed spinner bowl which creates a saturated solvent vapor atmosphere. This buys extra time to ensure uniform resist spreading before viscosity increases so much that flow stops. Solvent evaporates during final spinning to some extent, but for thick resists it is advantageous to perform an additional slow spinning step at the end, to further dry the resist.

With thick resists the edge bead is sizable, and it will prevent contact exposure because the mask touches the edge bead at wafer edges. Edge bead removal is beneficial also for cleanliness: the extra resist material can flake off and cause particle problems.

Soft bake can be done in a convection oven, on a hotplate or by infrared heaters. Solvent removal is diffusion limited and long baking times are required for thick resists; for 100  $\mu\text{m}$  thick resists, bake times are about an hour. While oven bakes are usually done at one temperature, hotplates allow easy temperature ramping. During the bake the wafers should be perfectly horizontal because viscosity is smaller at elevated temperatures and therefore resist can easily flow. A minor bevel can cause major resist thickness non-uniformity across the wafer. Positive novolak resists need an additional rehydration period after baking: water is an essential compound in exposure (Figure 9.7).

In novolak-DNQ resists, exposed DNQ still absorbs some light, and thick layers of such resist will need very high exposure doses, that is long exposure times. The practical thickness limit of novolak-DNQ positive resists is 50–100  $\mu\text{m}$ , which translates to exposures of tens of minutes. In negative resists based on a photoacid generator, the reactive acid molecule is produced throughout the volume of the resist, and absorbance is not affected by the exposure. Therefore even layers a millimeter thick can be exposed in reasonable times. All polymers absorb to some extent below 350 nm, and in order to avoid absorbance by the base resin, wavelengths below 350 nm should be filtered out. Exposure is supposed to interact with the photoactive compound, not with the base resin.

It is usually not the exposure but the mechanical strength of the final resist structures that limits the

thicknesses of negative resist structures. This involves also the properties of liquids used in development and rinsing and drying. Capillary forces can be so strong that drying water can pull neighboring resist lines into contact. This will be discussed in more detail in Chapter 29, because such drying-induced stiction problems are central in surface micromechanics.

In order to make deep trenches or high pillars, collimated light is needed: if incoming light is not collimated, sizable exposure will take place underneath the mask. This same reason explains why thick resists are not exposed by projection mask aligners.

Thick-resist development is normally done by immersion. Development is diffusion limited and development time increases as a function of thickness. Development is also slower in narrow grooves compared to large open areas. Development times are tens of minutes.

For negative photoresists post-exposure bake (PEB) is required to finish the crosslinking reaction. For positive novolak resists PEB is not normally required. It is important that the temperature is high enough to complete crosslinking, but not too high because crosslinking creates stresses (which can only be controlled by crosslinking density). Stresses also appear because of thermal expansion coefficient differences between the substrate and resist.

Even though resists are soft materials and thin relative to wafer thickness, visible wafer bowing can result from resist thermal stresses.

Polymer structures of high aspect ratio in for example SU-8 resist (Figure 18.1a) resemble DRIE structures in appearance (Figure 21.9). Both technologies are capable of 10:1 aspect ratios easily, 40:1 ratios with effort and 100:1 ratios have been reported in special cases. In both SU-8 and DRIE the sidewall profile is seldom 90°, and an acceptable sidewall angle is a question of engineering judgment. For example, in mirror and interferometer applications, or with parallel-plate capacitors, perfect verticality is desired, while in fluidics or mechanical applications deviations can be tolerated to some extent. The smoothness of resist sidewalls can be as low as 10 nm, which is better than that of DRIE sidewalls.

Multilayer structures can be made by repeated exposure–bake–spin coat–expose–bake cycles, leaving the development step till the very end. This is shown in Figure 18.6 both schematically and in the SEM micrograph. The structure is used in tissue engineering.

### 18.3.1 Thick resists as structural materials

There are many microfluidic applications for thick-resist structures. The choice of microfluidic channel wall



**Figure 18.6** Three-level SU-8 structure: three spin coatings and three exposures with different masks, and one development at the end. Reproduced from Mata *et al.* (2006) by permission of IOP



**Figure 18.7** SU-8 microvalve (left); SU-8 pneumatic actuator (right). Reproduced from Seidemann *et al.* (2002), copyright © 2002 by permission of Elsevier

material is mostly about surface chemistry, surface charge for electro-osmotic flow, adsorption of analytes and contact angle for wetting and capillarity. Transparency and lack of autofluorescence for optical detection, integration of electrodes for conductivity detection and bondability are also important.

Polymers are suitable for mechanical devices, too. Spring constants can be smaller by making long, slender beams, and this is certainly done in MEMS, but the alternative is to use softer materials, like polymers (see Equation 29.2). In Figure 18.7 mechanical elements

made of SU-8 epoxy polymer are shown for a passive microfluidic valve and pneumatic actuators. The SU-8 structures are free to move because a copper thin film underneath the polymer has been etched away.

## 18.4 Molding Techniques

### 18.4.1 Replica molding in PDMS

Molding of PDMS is easy and therefore widely used. A simple SU-8 process can be used as a mold master. PDMS

prepolymer is mixed with a crosslinking agent (e.g., in a 10:1 ratio) and degassed. It is poured over the mold master and degassed again to ensure bubble removal. Curing can be anything from 65 °C for 10 hours to 80 °C for 2 hours, depending on process details. PDMS will demold easily because of its inertness and flexibility (it is an elastomer).

PDMS devices are not only easy to fabricate, but also good for aqueous fluids; they are optically transparent down to 250 nm and mechanically flexible. A simple PDMS device is shown in Figure 18.8. The PDMS piece is bonded to a flat glass wafer, forming a fluidic actuator/valve. However, because the PDMS CTE is about 300 ppm/°C, PDMS is not suitable for applications that require accurate pattern positioning.

One of the great advantages of PDMS is its self-adhesiveness: a clean PDMS piece will adhere to a clean surface spontaneously, and this bond is strong enough for many applications. It is great for R&D but long-term stability of the bond is not good. If a stronger bond is



**Figure 18.8** PDMS vertical input piston induces lateral deformation in output piston, closing a microchannel. Adapted from Lee *et al.* (2007)



**Figure 18.9** Optofluidic chip: SU-8 master and its PDMS cast replica. Microlenses focus light onto a microfluidic channel. Adapted from Seo and Lee (2004)

needed, oxygen plasma activation of PDMS can be used (more on polymer bonding later in this chapter).

An optofluidic chip is shown in Figure 18.9: microlenses focus light onto the microfluidic channel, for improved detection. The whole device is made of PDMS and bonded to a planar glass wafer. The lenses come “free of charge”: they are formed in the same molding process as the channels.

More complex mold masters enable more complex shapes. Figure 18.10 shows a hemispherical SU-8 mold master. It is made by standard lithography, but after development SU-8 is baked above its glass transition



**Figure 18.10** PDMS microlenses by resist flow, stamping and transfer bonding. Adapted from Chen *et al.* (2005)



**Figure 18.11** Microfluidic chip with three PDMS layers: pneumatic pressure in control channel deforms the thin PDMS membrane, closing the active liquid channel. Fluidic inlets are made by piercing. Adapted from Lin and Su (2008)

temperature, which will make it flow. Figure 18.11 shows a fluidic system: two SU-8 masters are used, one with the standard SU-8 process and the other by resist flow. Three PDMS steps are required: two casting steps and one spinning step for thin PDMS membrane formation. The actuator channel does not need to close completely, but the analyte channel with its hemispherical shape will be fully closed.

#### 18.4.2 Micromolding in capillaries (MIMIC)

Micromolding in capillaries takes advantage of microfabricated mold masters: liquid precursor fills microfluidic channels in the master by capillary forces. The precursor is then cured, and the master is removed. This method is applicable to passive structures (like Figure 18.12) and active polymer transistor components alike. Not only polymers but other materials that can be made soluble can be processed, for example nanoparticles in suspension, metal powders, etc. Micromolding is also applicable to multiple layer thicknesses in a single molding step.

#### 18.4.3 Injection molding

Injection molding set-up is pictured in Figure 18.13. Injection molding is applied for micrometer dimensions in mass manufacturing: molten plastic is injected into a mold insert to fabricate CDs, DVDs and Blu-ray discs (with 1, 0.6 and 0.3 µm feature sizes, respectively). However,



**Figure 18.12** Polyurethane net by micromolding in capillaries. Reproduced from Kim *et al.* (1996), copyright 1996 American Chemical Society



**Figure 18.13** Injection molding: high-pressure injection of molten polymer into a mold

from a general microfabrication point of view, these are easy applications because aspect ratios are about 0.2 only, pattern density is quite uniform and pattern sizes are quite similar. Because the molds are expensive (they have to tolerate molten polymers, often around 300 °C), long production series are compulsory to bring the cost down.

The main parameters of injection molding are the temperature of the injected polymer, mold temperature, injection speed and pressure, and holding time and pressure. Micro injection molding requires some modifications relative to large-scale systems: for instance, materials usage needs to be rethought, otherwise only a fraction of polymer ends up in the finished parts. In mold fabrication technologies, materials, dimensions and surface roughness need to be assessed. Polymer properties must be understood, for example viscosity changes rapidly in micromolds because the large surface-to-volume ratio leads to rapid cooling of the polymer melt.

Compared to all other replication methods, injection molding is the fastest: it takes only a few seconds per piece. Injection molding is used in microfluidics for

disposable devices where cost minimization is essential, but some very complex injection-molded microfluidics applications exist, too.

## 18.5 Hot Embossing

Hot embossing involves pressing a master against a polymer at a temperature around  $T_f$ , typically 50–100 °C above the polymer glass transition temperature  $T_g$ , see Figure 18.14. Popular hot embossing polymer PMMA has  $T_g \sim 100$  °C and polycarbonate (PC) has  $T_g \sim 150$  °C. The embossing force is on the order of 100–1000 kPa and hold time is on the order of minutes. De-embossing takes place after cooling below the glass transition temperature, therefore the embossed shape is retained. In theory the features are true replicas of the master, and any size patterns can be made, from nanometer size to macroscopic (but nanometer master fabrication is very expensive!). Note that in order to emboss large areas, uniform pressure needs to be applied. In addition to precision mechanics, a soft compliance layer helps Figure 18.14.

The hot embossing process is visualized in the time-temperature-pressure graph in Figure 18.15. The process starts by heating the wafer (and the master) to  $T_2$ , which is for example 80 °C above the polymer  $T_g$ . Pressure of 100–1000 kPa is then applied to press the master against the softened polymer. Polymer will flow and fill the cavities in the master, forming an inverse replica of the master. After a few minutes the temperature is ramped down, and



**Figure 18.15** Pressure–temperature diagram of hot embossing: heat to  $T_2$  ( $50\text{--}100$  °C  $> T_g$ ) apply pressure (100–1000 kPa), cool down to  $T_1$  (e.g., 20 °C below  $T_g$ ), and remove pressure

below  $T_g$  (at  $T_1$ , which is 80 °C for PMMA) force is removed. The polymer has solidified into the form of the master, and because the temperature is now below  $T_g$ , it will retain its shape upon de-embossing.

Hot embossing has three major issues: filling of structures by polymer, reproduction fidelity and de-embossing. Polymeric materials have CTEs on the order of 50–100 ppm/°C, whereas silicon has a CTE of 2.6 ppm/°C and nickel, a typical electroplated master material, 13 ppm/°C. Thermal cycling is mandatory in hot embossing but it should be minimized around  $T_g$  to avoid thermal mismatch cracking.

Hot embossing results in microparts with low internal stresses because stresses have time to relax in the slow process. Its slowness is compensated in R&D by the simplicity of the experimental set-up: changing masters and materials is quick and easy, and the various technologies for master fabrication enable cheap masters. Hot embossing is applicable to large areas, with 200 mm wafer size embossing tools available. The slowness is inherent in hot embossing because the polymer and master are heated by conduction, and heating is an integral part of the process and not easily decoupled from other process parameters.

The thickness of hot embossed structures can be varied enormously, from nanometers to millimeters (Figure 18.16). There is no resolution limit, and



**Figure 18.14** Thermal imprinter/hot embossing tool: soft compliance layer assures conformal large-area contact between master and substrate



**Figure 18.16** An embossed optical bench 400  $\mu\text{m}$  high.  
Courtesy Mathias Worgull, Forschungszentrum Karlsruhe

embossing can replicate structures down to 10 nm size, making the master become the limiting factor. Aspect ratios of embossed structures can be as high as 20:1 and up to 50:1 when special release coatings have been applied. Making needle-like structures by embossing is difficult, however, because filling such long needles by polymer is often inadequate.

## 18.6 Nanoimprint Lithography

Nanoimprint lithography (NIL) is hot embossing applied to nanofabrication. The smallest imprinted features have been 5 nm wide. This indicates that any nanoscopic irregularities in hot embossing masters will be replicated. This was in fact observed long ago, but NIL only emerged in the late 1990s. In lithography application, the removal of a residual layer at the bottom of features (Figure 1.9) becomes a necessity, and the thickness contrast  $h_r/h_f$  must be high enough in order to reasonably make patterns.

At the macroscopic level the NIL process depends critically on stamp contact with the substrate. Using a soft compliance layer between the stamp and the upper platen, small deviations from planarity can be compensated. Thermal NIL works on 200 mm wafers. On a microscopic level air evacuation from cavities is important, and a vacuum can help in this. The squeeze flow of polymer is obviously very important. Polymer viscosities are on the order of  $10^3$ – $10^7$  Pa·s and imprinting pressures around  $1000 \text{ N/cm}^2$ . Temperature and pressure can be traded for each other, and low pressure can be compensated by longer process time. But as discussed above, structure size affects the time

requirement, and therefore time is not a completely free variable.

NIL is a very simple process for making nanostructures: if master fabrication can be subcontracted, the NIL equipment price is only a fraction of the optical lithography tool capable of identical linewidths: from \$100 000 for a NIL lab system, to \$1–3 million for an advanced system. There is a twist, however: in optical lithography tools alignment is a standard feature, while the cheaper NIL systems offer no alignment or only rudimentary alignment. And if alignment accuracy is 500 nm and resolution 50 nm, most the high-resolution capability is left unused. But if a single layer pattern is needed, imprint lithography is very cost effective. Hard disks have been suggested as an application and optical devices like gratings can also be done without alignment.

### 18.6.1 Imprinting theory

An idealized description of the embossing/imprinting process is shown in Figure 18.17. A downward force  $F$  presses the stamp against a polymer with viscosity  $\eta$ . Polymer height  $h$  changes over time as  $h(t)$ . Stefan's law, Equation 18.1, governs the flow of polymer between solid parallel plates. That is,

$$F = \frac{3\pi R^4}{2h(t)^3} \frac{dh}{dt} \eta \quad (18.1)$$

Interpretation of Stefan's equation tells us that achieving a thin residual layer is exceedingly difficult because



**Figure 18.17** Factors contributing to Stefan's law: viscosity, force, stamp protrusion size, residual thickness

of the  $1/h^3$  term. This is a critical difference between traditional hot embossing and NIL is in layer thicknesses. In embossing the polymer sheet is for example 2 mm thick, and 50  $\mu\text{m}$  structures are embossed. Residual thickness is 1950  $\mu\text{m}$ . In NIL original polymer thickness is for example 300 nm and target residual thickness 50 nm. In Stefan's equation  $h$  is large and  $dh/dt$  is small.

Another observation is that in fact it is easier to make small structures than large ones, because the required force increases as  $R^4$ . Low viscosity is beneficial because then lower force can be used, therefore elevated temperature is practically always used, even though cold embossing does exist. A further point to note is that structure filling is nonlinear in time: initially polymer flow will fill the cavities in the master fast, but filling becomes slower as time goes on.

A fundamental principle of mass conservation illustrates why the imprinting of uneven areal density structures is difficult. In Equation 18.2 mass conservation is used to calculate final heights. The volume of polymer is unchanged (nothing disappears during imprinting) and it is only surface heights that change. The situation is pictured in Figure 18.18. The residual thickness  $h_f$  depends on the areal density ( $A_0/A_f$ ) of patterns:

$$A_0 h_0 = A_f h_f + (A_0 - A_f) h_e \quad (18.2)$$

### 18.6.2 Practical NIL

Several non-idealities make NIL more difficult. Non-uniform pattern height results when a rigid stamp (or low imprint pressure) does not allow large patterns to form, while nanostructures are rapidly filled. A submicrometer grating is imprinted in a fraction of a second, while large areas can take minutes. In the case of high imprinting force the stamp may bend and the smaller features penetrate deeper into the polymer, resulting in thicker



**Figure 18.18** Mass is conserved in imprinting, and residual thickness  $h_f$  and final height  $h_e$  can be calculated once the embossed area is known

residual layers for the large structures. Incomplete nanopattern filling can also result because of the volume constancy requirement: large and small areas side by side both affect pattern height.

There are many variants of NIL. In UV-NIL, photoactive polymer is used. In one version liquid photopolymer is used, and the process is basically a room temperature process. The structure is cured by UV light (Figure 18.19). Compared to thermal imprint, optical/UV-NIL has certain strong points: UV curing can be much faster than thermal ramps, an obvious productivity feature. And if temperature ramps can be eliminated, polymer shrinkage problems can also be minimized. On the other hand, thermal NIL masters can be made of silicon, nickel or practically anything. Optical methods obviously require transparent masters, which limits the choices. For accurate



**Figure 18.19** UV-NIL: stamping of photopolymer with a transparent master stamp and curing by UV exposure. Reproduced from Aura *et al.* (2008) by permission of Elsevier

nanofabrication fused silica masters are used. In the micrometer range PDMS is an excellent master material. There are also equipment issues: optical paths have to be designed, and if both heat and light are needed, the design becomes even more difficult.

Thermal NIL can be done on 200 mm wafers (but the writing time and cost of a 200 mm stamp with nanostructures are astronomical), therefore many imprint methods “expose” smaller areas. This is similar to step-and-repeat optical lithography (Figure 10.2): stamp size is for example  $25 \times 25$  mm, and multiple stampings are needed to fill the wafer. The method is sometimes called step-and-stamp, and as S-FIL, for step-and-flash imprint lithography, in the case of UV-NIL.

## 18.7 Masters for Replication

Masters and molds can be made by a number of technologies:

- photolithography of SU-8
- photolithography and silicon etching
- photolithography and metal electroplating
- discharge machining
- laser machining
- mechanical milling
- natural objects.

Photoresists are common mold masters, especially SU-8, which is often used as a master for PDMS casting. The benefits of SU-8 are in the simplicity of making masters in various sizes, both laterally and in the  $z$ -direction, from micrometers to millimeters. Additionally, since SU-8 is a polymer, its thermal expansion is similar to other polymers, which reduces thermal stresses.

Silicon is an excellent master mold material in many cases: its machining accuracy is good, the surface finish is good (especially in wet etching) and with DRIE practically any shape can be made. Wet etching is more limited in shapes, but the  $54.7^\circ$  angle is good for detachment, see Figure 18.20.

Thomas Alva Edison used sputtered gold seed layer, wax mask and gold electroplating to fabricate phonograph masters. The technology entered production in 1901 and it could replicate  $125\text{ }\mu\text{m}$  pitch ( $200$  grooves/inch) structures  $25\text{ }\mu\text{m}$  thick into phonograph records. Electroplating is still a major method for mold master fabrication, with nickel as the metal of choice. In microfluidic applications the dimensions are similar to Edison’s, and in fact traditional machine tools can be used to fabricate the masters, but often the surface finish is too rough and pattern complexity makes machining throughput low, though it is a useful method for making a few test devices.



**Figure 18.20** Mold master should have positive slope for easy detachment; negative (retrograde) profile does not allow detachment

An important issue in all replication methods is the detachment (also known as de-embossing or release). In fact, it is often equal in importance to replication itself. There are a number of requirements that are essential and beneficial. First of all, the master must have positively sloped walls (Figure 18.20). Release is not possible with retrograde walls.

Another requirement is surface smoothness: the smoother the surface, the easier the release. Micro-fabricated master molds are very smooth, especially photoresist mold (and metal mold made using resist molds) and anisotropically wet etched silicon molds. Plasma etched molds are somewhat rougher. Milled and machined molds are clearly much rougher.

Chemical bond formation between master and replica should be avoided. This is especially important in molding processes where monomers are chemically reactive, not just physically flowing. Diamond is a good choice for a mold because of its inertness and abrasion resistance. Another way is to deposit a sacrificial layer on the mold master, and release the structures by etching this sacrificial layer away.

Alternatively, the mold can be coated by low-surface-energy material. Teflon-like films and fluorinated SAMs are obvious candidates, because of their extremely low friction. Alternatively, the surface of the polymer can be treated with fluoropolymer and then stamped.

Often polymers can be used as masters and molds for other polymers. PDMS has been used to emboss polycarbonate, and PDMS is also an excellent transparent material for UV embossing masters. Of course, thermal and other limitations apply, but clearly the choices are many. The polymer masters are soft, while the silicon and metal masters are rigid. Depending on process details,

either property can be beneficially utilized, for example a soft master will conform and make intimate contact, allowing large-area stamping which is not possible with a rigid master unless the wafer and master are very flat and carefully leveled.

Molding can be continued to further generations: instead of using the molded piece itself, it can be used as a new mold. This process can be continued for many generations in certain applications before the quality of molded pieces becomes unacceptable. However, each generation results in a reverse polarity structure of its parent, so it is necessary to decide beforehand which generation is going to be used. If the master fabrication process is expensive, as in X-ray LIGA, it is useful to have a slave master made of the expensive original, and thus limit the costs.

Master fabrication can sometimes benefit from existing structures: both butterfly wings and lotus leaves have been used as masters for replication, the former for optical effects, the latter for superhydrophobicity. In order to use the master repeatedly, it has to be transferred to some more stable material, but there are many choices.

## 18.8 Processing on Polymers

Polymer etching is not often done because lithography, imprinting and casting produce polymer microstructures easily. Thermosets must sometimes be etched because molding is not possible. Polymers are etched by oxygen-containing gases, with or without additional gases, for example O<sub>2</sub>/CF<sub>4</sub>, or O<sub>2</sub>/Ar. Non-erodible masks like aluminum and chromium are used in deep etching. But as in silicon etching, achieving anisotropy is difficult because of the spontaneous chemical reactions. In most cases the rates are limited to about 1 μm/min. Wet etching is usually not an option for polymers. In theory solvent etching could be done by suitable selection of polymer and solvent. Sometimes NaOH etching is done, not for patterning, but rather for surface treatment.

Polymers can be polished by CMP and this has been used in the creation of thermal insulation: a deeply etched cavity wafer is coated by BCB, which is planarized by CMP. Devices are then processed on BCB. Thermal isolation is often done by etching away the silicon wafer, leaving only a thin membrane. This approach has problems with mechanical strength. BCB has a high operating temperature, but at about 400 °C it is still low compared to inorganic materials.

### 18.8.1 Deposition of polymers

The adhesion of polymers on thin films and thin films on polymers is generally poor because polymers do not

readily form bonds. Adhesion then depends on mechanical interlocking (surface roughness) and surface cleanliness. Surface cleaning can be done prior to metal deposition by many techniques, for example baking, wet treatment, plasma treatment. These have the effect of removing contaminants and absorbed water vapor, and etching also induces surface roughening, which is beneficial for adhesion. Use of adhesion promoter layers underneath the main metal can be made as usual, for example titanium or chromium. Evaporated films have poor adhesion on polymers, but sputtering fares somewhat better. Sputtered atoms have kinetic energy and impinge on the polymer surface, and some of them are implanted inside the polymer, creating a mixed layer which holds the substrate and film together. If layer thickness is measured on a polymer film, the result will differ from an inorganic substrate, because some of the deposited atoms will find their way inside the polymer, due to both their kinetic energy and polymer porosity.

Adhesion can be improved by nano- and microstructuring the surfaces. Figure 18.21 shows one technique for improved polymer adhesion. It utilizes an isotropic etch profile to create undercut structures which will lock polymer structures, just as a retrograde profile will prevent molded piece detachment from the mold master (Figure 18.20).

One technique to secure mechanical interlocking is to cast polymer over T-shaped or mushroom-shaped metals (Figure 18.22). Overplating (Figure 5.13) is performed on a standard resist pattern. The T-shape of the plated metal now securely keeps the cast PDMS in place, and the structure tolerates further processing, in this case through-wafer DRIE. The resulting membrane is flexible due to PDMS and can be magnetically actuated due to the metallic coil structure. The standard approach does not work well, even with the adhesion layer, because the large thermal expansion of PDMS during resist baking will lead to metal detachment.



**Figure 18.21** Isotropically etched holes anchor the polymer mechanically



**Figure 18.22** Flexible PDMS membrane for micropump. Left, standard approach of metal plating on top of PDMS; right, PDMS casting on overplated metal. Adapted from Yin *et al.* (2007)

Resist spinning on polymers requires careful attention to solvents used in resists. A reaction between polymer and solvent can render the whole process unusable. Therefore alternative methods like shadow masks are often used (see Chapter 23) because neither lithography nor etching is required. This is beneficial because the typical 120 °C temperature of resist baking can already result in thermal stresses that will delaminate the metal from the polymer.

Thin-film metals have higher resistivity than bulk materials, and thin-film metals on polymers have even higher resistivities than those on silicon or glass. On PMMA or other fairly non-porous polymer metal, the resistivity can be for example 10% or 100% higher than on reference oxide, but on porous polymer like PDMS the resistivity can be even 1000% higher than on oxide. Post-deposition annealing for resistivity reduction is not an option with most polymers.

In Figure 18.23 PDMS is used to make highly flexible antennas. The process uses aluminum as a sacrificial layer, copper as the antenna conductor and chromium as an adhesion layer and barrier between the copper and oxide. PDMS is spin coated on top and cured. Another PDMS piece is then bonded on top of the first PDMS, and TMAH

**Figure 18.23** Metallized flexible PDMS antenna. Adapted from Tiercelin *et al.* (2006)

is used to etch aluminum. PDMS bonding to a copper carrier wafer finalizes the process. Unlike the process shown in Figure 18.22, there are no process steps above room temperature once PDMS has been cured, so the large CTE of PDMS does not come into play.

PECVD films can be deposited on polymers, temperature permitting. It is often possible to lower the deposition temperature and get reasonable film quality, even though lower temperature invariably correlates with poorer film quality. ALD alumina ( $\text{Al}_2\text{O}_3$ ) is usually deposited at around 220 °C, but it can be done at 80 °C which enables deposition on polymers. However, its properties are different; for instance, it etches much faster in wet etching than the standard film.

## 18.9 Polymer Bonding

The basics of bonding were presented in Chapter 17. In this chapter we will explore applications specific to polymers. As in all bonding processes, a few universal requirements apply: the surfaces to be bonded must be clean

of particles, have proper bond-forming surface chemistry, and the substrates must be flat and smooth to ensure intimate contact between the wafers to be bonded. Adequate bond strength at the lowest possible temperature is desired, and with polymers there is little room for annealing for bond strength improvement.

Bond strengths of polymer–polymer bonds are typically in the 1–10 MPa range, and differ greatly depending on process details. The best results have been obtained for COC solvent bonding, with over 30 MPa, which is adequate for liquid chromatography chips.

### 18.9.1 Thermal bonding

The term thermal bonding is often used to describe a process similar to fusion bonding or direct bonding in silicon technology: two wafers of the same material are bonded by application of elevated temperature. The terminology is somewhat confusing: thermal bonding can also refer to thermocompression bonding (where pressure is an important element). Vacuum-assisted thermal bonding is to be considered standard procedure in direct bonding: most fusion bonding methods use some vacuum to reduce gas bubbles between the wafers. At elevated temperatures polymers may degrade due to reactions with oxygen, therefore a vacuum is useful.

In polymer bonding the glass transition temperature plays a critical role: above  $T_g$  the materials become soft, which compensates some non-flatness and non-smoothness. In the case of two identical materials, interface bond strength should be identical to material bulk strength. Bonding identical polymers is easier than dissimilar materials because thermal mismatch is eliminated. One of the advantages of fusion bonding is that the final structure is made of a single material, and this is beneficial in microfluidics as all the walls of the system have identical contact angle, surface charge and adsorption.

Most polymer bonding processes operate between 100 and 200 °C: for example, 95–165 °C for PMMA (the values differ both because of experimental details like pressure and because of polymer molecular weight differences), 250 °C for BCB, 130–230 °C for parylene ( $T_g = 90$  °C, decomposition 290 °C), 160 °C for CYTOP fluoropolymer and around 200 °C for polyimides. Typical bonding times are a few hours. This is rather slow and alternative methods are often motivated by faster bonding processes. Pressures of 500–1000 N are usual.

Optimizing thermal bonding processes is difficult: differences between polymers mean that temperature–pressure relations have to be found for each polymer. Bonding is also dependent on chip design: bonding shallow, wide channels is much more

critical to roof sagging than bonding narrow, deep channels. A general purpose process has to be able to do both simultaneously, or else designers must be given restrictions on allowed structures.

### 18.9.2 Bonding by surface treatments

PDMS elastomer is soft and it is easy to achieve intimate contact. This is one reason for its adhesiveness. But because the PDMS surface is mostly methyl groups, they are not very reactive, and the bond is easily detached. This reversibility is advantageous sometimes, but if more permanent bonds are needed, oxygen plasma treatment of PDMS will turn some of the methyl groups to hydroxyl groups (–OH), which are much more reactive (Figure 12.6). Oxygen plasma-treated PDMS can be bonded to another PDMS piece permanently, and also to glass or oxide because their surfaces are also hydroxyl terminated. The bond strengths are on the order of 0.5 MPa, which is not much, but adequate for many applications. Contact angle measurement after plasma treatment can be used as a quick monitoring method: small contact angles, or hydrophilic surfaces, are essential to strong bonds.

One technique to ensure intimate contact between bonding surfaces is to make them soft by solvent treatment. The solvent should be selected to match the polymer, and properly selected solvent will soften only the very top layer of polymer, for example 50 nm in the case of DMSO and PMMA. This soft layer is sufficient for polymer chains to become mobile and make contact. A thicker solvated layer is in fact detrimental: softened polymer flow will clog microchannels. UV treatment can also be used to modify surface chemistry by breaking bonds (and by cleaning the surface). It can even be that  $T_g$  of the surface layer is lowered so much that bonding can be carried out below bulk  $T_g$ .

### 18.9.3 Localized bonding

Laser welding of polymers is possible but it must be applied to bond dissimilar wafers: the top wafer has to be transparent, but the bottom wafer has to be absorbing. Laser bonding benefits include minimized thermal distortions because the heating is very local.

Ultrasonics heats up the polymer pieces, and softening will help make a good contact between the wafers. The best results are obtained if there are structures designed to guide ultrasonic energy, which makes the method sensitive to chip design. Microwave welding relies on the heating of metal thin films, which will then soften the polymers, leading to bonding. The obvious drawback is the need to deposit and pattern metallizations on polymers.

#### 18.9.4 Adhesive bonding

The advantage of adhesive bonding is its universality: any two wafers can be joined together by an adhesive, irrespective of their surface chemistries, thermal expansion coefficients or other properties. Bond strengths in epoxy-based adhesive bonding can reach 20 MPa which is much higher than most polymer–polymer direct or thermal bonding methods. However, the foreign material between the wafers is sometimes undesirable, and direct bonding must then be used. For example, the adhesive may interact with the analytes in microfluidic systems. Thermal stability of adhesive bonds is limited, and their long-term behavior is poorly known.

Adhesives work by many different mechanisms, and not all of them are suitable for microsystems. Some adhesives solidify by drying, releasing gases which will be trapped at the bonding interface (unless one or both of the wafers are permeable to gases). Thermoplastic adhesives are applied hot, and are soft and make good contact. They solidify upon cooling, forming bonds, and are very general purpose adhesives. Epoxies are thermosets that experience hardening either by UV light or by heating. UV adhesives like SU-8 are good because the bonding temperature is low, but there is the limitation that one of the wafers has to be transparent. BCB is a thermoset polymer which can be cured by heating (to about 200–300 °C).

Thermoplastic adhesives tolerate a maximum of 300 °C, while some thermoset adhesives can tolerate even 450 °C. They are generally also more resistant to chemicals.

In many applications it is advantageous to have all the walls of a fluidic channel made of one material only. A simple full wafer, single photomask process for full SU-8 channels is illustrated in Figure 18.24 (left). The first layer, blanket exposed, serves as the floor; the second layer, exposed through a mask, defines the walls; and the third layer, on a Pyrex wafer, serves as the roof. After joining the wafers, the SU-8 on Pyrex is exposed through the Pyrex wafer to bond it to cured SU-8.

However, because of the large thermal expansion mismatch between SU-8 and silicon and glass, large SU-8 structures are prone to cracking. Therefore it is advisable to break up large SU-8 areas into smaller areas. As shown in Figure 18.24 (right), two photomasks are needed in this version. The bottom SU-8 is exposed through mask #1 but not developed. The second SU-8 is spin coated and exposed with the second photomask. Both layers are then developed. The third layer is spin coated on top of the Pyrex wafer as before.

In a similar fashion, three layers of SU-8 have been used to fabricate an electrospray nozzle (Figure 18.25). Additionally, the structure has been released from the supporting glass wafer by HF etching.



**Figure 18.24** SU-8 microfluidic channels by SU-8 adhesive bonding: left, simple channel by one-photomask process; right, two-photomask process with stress relief auxiliary channels. Adapted from Tuomikoski *et al.* (2005)



**Figure 18.25** Electrospray tip made of three-layer SU-8: after two layers (left); and after bonding (right). Reproduced from Tuomikoski *et al.* (2005) by permission of Wiley-VCH

Another benefit of adhesive bonding is the ease of creating patterns: either photoactive epoxy like SU-8 is defined by lithography, or non-photoactive glue is patterned by screen printing. Adhesive bonding can create overhanging structures which cannot be made by lithography or imprinting processes.

The drawback of adhesive bonding is that the “glue” must usually be flowable, and there is the danger that it will flow into microchannels and cavities already made. This can be minimized by working with thinner glues, but then the advantage of adhesive bonding starts to disappear: thick adhesive compensates wafer non-flatness, but with thin adhesive the usual flatness requirement re-emerges. Thin adhesive is also less forgiving of particle contamination. Dry resists (laminates) can be used to close structures, as will be discussed at the end of this chapter.



**Figure 18.26** Microfluidic channel with electrodes, lithography with auxiliary carrier film: SU-8 on PET foil, which is detached after SU-8 exposure. Adapted from Abgrall *et al.* (2006)

## 18.10 Polymer Devices

The rest of this chapter discusses examples of polymer microdevices. Various aspects include the interaction of

materials properties with fabrication and operation, and a comparison of different techniques and materials.

Dry film resists are useful because of their simplicity and ability to cross trenches (recall Figure 9.8). The drawback is poor resolution: dry resists are suitable for structures in the tens of micrometers range and larger. Sometimes standard resist and an auxiliary film can serve the same purpose, as shown in Figure 18.26: resist is spin coated on PET foil for example, and this stack is then inverted on top of the structure. Exposure through the PET film hardens the resist, and PET film is peeled off.

The Braille actuator shown in Figure 18.27 uses three layers of PDMS. It is very similar to the microvalve shown in Figure 18.11: both devices use pneumatic pressure to move the PDMS membrane up and down.

An in-plane fluidic valve (diode) has also been made of PDMS (Figure 18.28). A flap valve by design, the large vertical PDMS flap deflects to the right, but is prevented from deflecting to the left, creating a fluidic diode.

PDMS permeability to oxygen is a benefit in cell growth chambers, but in many applications this permeability is a drawback: drug stability is compromised if oxygen permeates through the device and makes contact with the drug, and in PCR DNA amplification at 95 °C operating temperature water will be vaporized and liquid volume will decrease over time. Of course other materials can be used instead of PDMS, or else additional layers can be added to the structure: for example, in Figure 18.29 a low-permeability Mylar film is bonded to PDMS to reduce vapor losses.



**Figure 18.27** Fabrication steps of a Braille actuator: bonding of three PDMS pieces. Reproduced from Moraes *et al.* (2009) by permission of IOP



**Figure 18.28** PDMS fluidic diode: left-to-right flow is allowed, right-to-left flow is blocked. Adapted from Adams *et al.* (2005)



**Figure 18.29** Drug delivery device: SU-8 mold for PDMS casting, Mylar vapor barrier. Reproduced from Su *et al.* (2002), copyright 2002, by permission of IEEE



**Figure 18.30** Polymer microprocessing and devices

## 18.11 Polymer Overview

Polymers are a vast group of materials, with widely different chemical structures, physical properties, processing methods and device applications. The mind map of Figure 18.30 lists some important aspects of polymers for microfabrication.

## 18.12 Exercises

1. If SU-8 structures 50 µm thick need to be exposed, what is the smallest feature size that can be fabricated?
2. What degree of positional repeatability can be achieved in stamping if the polymer stamp CTE is 60 ppm/°C and the cleanroom temperature control is  $\pm 1$  °C?
3. Design a fabrication process for the SU-8 microvalve of Figure 18.7.
4. How can you make hemispherical microlenses by hot embossing?
5. Draw a time–height plot to show how feature filling evolves in imprinting/embossing: (a) for submicrometer features; (b) for 20 µm features.
6. What ratios of original NIL resist height to residual thickness are feasible?
7. Calculate silicon wafer bow due to stress from an SU-8 layer 200 µm thick.
8. Explain the fabrication process of the fluidic diode of Figure 18.28.
9. Design a fabrication process for the three-layer PDMS valve shown below.



10. Invent a process for making superhydrophobic surfaces by NIL using real lotus leaves as a starting point.

11. Explain step by step the fabrication of an SU-8 channel with electrodes as in Figure 18.26.

## References and Related Reading

- Abgrall, P. *et al.* (2006) Novel fabrication method of flexible and monolithic 3D microfluidic structures using lamination of SU-8 films, *J. Microelectromech. Syst.*, **16**, 113–121.
- Adams, M.L. *et al.* (2005) Polydimethylsiloxane based microfluidic diode, *J. Micromech. Microeng.*, **15**, 1517–1521.
- Aura, S. *et al.* (2008) Novel hybrid material for microfluidic devices, *Sens. Actuators*, **B132**, 397–403.
- Beck, M. *et al.* (2002) Improving stamps for 10 nm level wafer scale nanoimprint lithography, *Microelectron. Eng.*, **61–62**, 441–448.
- Becker, H. and C. Gärtner (2008) Polymer microfabrication technologies for microfluidic systems, *Anal. Bioanal. Chem.*, **390**, 89–111.
- Chen, K.-S., K. Lin and F.-H. Ko (2005) Fabrication of 3D polymer microstructures using electron beam lithography and nanoimprinting technologies, *J. Micromech. Microeng.*, **15**, 1894–1903.
- del Campo, A. and C. Greiner (2007) SU-8: a photoresist for high-aspect-ratio and 3D submicron lithography, *J. Micromech. Microeng.*, **17**, R81–R95.
- Dixit, P. *et al.* (2007) Fabrication and characterization of fine pitch on-chip copper interconnects for advanced wafer level packaging by a high aspect ratio through AZ9260 resist electroplating, *J. Micromech. Microeng.*, **17**, 1078–1086.
- Giboz, J., T. Copponnex and P. Mele (2007) Microinjection molding of thermoplastic polymers: a review, *J. Micromech. Microeng.*, **17**, R96–R109.
- Guo, J.L. (2007) Nanoimprint lithography: methods and material requirements, *Adv. Mater.*, **19**, 495–513.
- Haas, K.-H. (2004) Hybrid Inorganic-organic polymers based on organically modified Si-alkoxides, *Adv. Eng. Mater.* pp. 571–582
- Heckele, M. and W.K. Schomburg (2004) Review on micro molding of thermoplastic polymers, *J. Micromech. Microeng.*, **14**, R1–R14.
- Kang, W.-J. *et al.* (2006) Novel exposure methods based on reflection and refraction effects in the field of SU-8 lithography, *J. Micromech. Microeng.*, **16**, 821–831.
- Kim, E., Y. Xia and G.M. Whitesides (1996) Micromolding in capillaries: applications in materials science, *J. Am. Chem. Soc.*, **118**, 5722–5731.
- Koukharenko, E. *et al.* (2005) A comparative study of different thick photoresists for MEMS applications, *J. Mater. Sci.: Mater. Electron.*, **16**, 741–747.
- Kudo, H. *et al.* (2006) A flexible and wearable glucose sensor based on functional polymers with Soft-MEMS techniques, *Biosens. Bioelectron.*, **22**, 558–562.
- Lee, D.-W. and Y.-S. Choi (2008) A novel pressure sensor with a PDMS diaphragm, *Microelectron. Eng.*, **85**, 1054–1058.

- Lee, S.J. *et al.* (2007) Characterization of laterally deformable elastomer membranes for microfluidics, *J. Micromech. Microeng.*, **17**, 843.
- Lin, B.-C. and Y.-C. Su (2008) On-demand liquid-in-liquid droplet metering and fusion utilizing pneumatically actuated membrane valves, *J. Micromech. Microeng.*, **18**, 115005.
- Loechel, B. (2000) Thick-layer resists for surface micromachining, *J. Micromech. Microeng.*, **10**, 108.
- Luharuka, R. *et al.* (2006) Improved manufacturability and characterization of a corrugated Parylene diaphragm pressure transducer, *J. Micromech. Microeng.*, **16**, 1468–1474.
- Mata, A., A.J. Fleischman and S. Roy (2006) Fabrication of multi-layer SU-8 microstructures, *J. Micromech. Microeng.*, **16**, 276–284.
- Meng, E., P.-Y. Li and Y.-C. Tai (2008) Plasma removal of Parylene C, *J. Micromech. Microeng.*, **18**, 045004.
- Moraes, C. *et al.* (2009) Solving the shrinkage-induced PDMS alignment registration issue in multilayer soft lithography, *J. Microelectromech. Syst.*, **19**, 065015.
- Prakash, A.R. *et al.* (2006) Small volume PCR in PDMS biochips with integrated fluid control and vapour barrier, *Sens. Actuators*, **B113**, 398–409.
- Pranov, H. *et al.* (2006) On the injection molding of nanosstructured polymer surfaces, *Polym. Eng. Sci.*, **46**, 160–171.
- Samel, B., M.K. Chowdhury and G. Stemme (2007) The fabrication of microfluidic structures by means of full-wafer adhesive bonding using a poly(dimethylsiloxane) catalyst, *J. Micromech. Microeng.*, **17**, 1710–1714.
- Schift, H. (2008) Nanoimprint lithography: an old story in modern times? A review, *J. Vac. Sci. Technol.*, **B26**, 458–480.
- Seidemann, V., S. Bütfisch and S. Büttgenbach (2002) Fabrication and investigation of in-plane compliant SU8 structures for MEMS and their application to micro valves and micro grippers, *Sens. Actuators*, **A97–98**, 457–461.
- Seo, J. and L.P. Lee (2004) Disposable integrated microfluidics with self-aligned planar microlenses, *Sens. Actuators*, **B99**, 615–622.
- Stephan, K. *et al.* (2007) Fast prototyping using a dry film photoresist: microfabrication of soft-lithography masters for microfluidic structures, *J. Micromech. Microeng.*, **17**, N69–N74.
- Su, Y.-C., L. Lin and A.P. Pisano (2002) A water-powered osmotic microactuator, *J. Microelectromech. Syst.*, **11**, 736.
- Sun, Y. and Y.C. Kwok (2006) Polymeric microfluidic system for DNA analysis: a review, *Anal. Chim. Acta*, **556**, 80–96.
- Tiercelin, N. *et al.* (2006) Polydimethylsiloxane membranes for millimeter-wave planar ultra flexible antennas, *J. Micromech. Microeng.*, **16**, 2389–2395.
- Tsao, C.-W. and D.L. DeVoe (2009) Bonding of thermoplastic polymer microfluidics, *Microfluid Nanofluid*, **6**, 1–16.
- Tuomikoski, S. and S. Franssila (2005) Free-standing SU-8 microfluidic chips by adhesive bonding and release etching, *Sens. Actuators*, **A120**, 408–415.
- Tuomikoski, S. *et al.* (2005) Fabrication of enclosed SU-8 tips for electrospray ionization-mass spectrometry, *Electrophoresis*, **26**, 4691–4702.
- Worgull, M. *et al.* (2006) Modeling and optimization of the hot embossing process for micro- and nanocomponent fabrication, *J. Microlithogr. Microfabr. Microsyst.*, **5**, 011005.
- Yin, H.-L. *et al.* (2007) A novel electromagnetic elastomer membrane actuator with a semi-embedded coil, *Sens. Actuators*, **A139**, 194–202.

# Glass Microprocessing

The term glass is used as a shorthand for many different materials, but the common silicate glasses are amorphous silica with alkali metal oxides. Glass used for beverage bottles contains 72.5% SiO<sub>2</sub>, 13% Na<sub>2</sub>O, 9.3% CaO, with the remainder consisting of K<sub>2</sub>O, MgO, Al<sub>2</sub>O<sub>3</sub> and Fe<sub>2</sub>O<sub>3</sub>. This basic formula has remained pretty much unchanged since antiquity.

Borosilicate glasses are important in microsystems because their coefficients of thermal expansion can be matched with silicon, important for stress-free bonding. Each manufacturer has slightly different compositions, but Pyrex 7740 borosilicate glass suitable for anodic bonding has a composition of 80.3% SiO<sub>2</sub>, 12.2% B<sub>2</sub>O<sub>3</sub>, 4.0% Na<sub>2</sub>O, 2.8% Al<sub>2</sub>O<sub>3</sub>, 0.4% K<sub>2</sub>O and 0.3% CaO. Other anodically bondable glass might replace Na<sub>2</sub>O for Li<sub>2</sub>O and include some Fe<sub>2</sub>O<sub>3</sub> and TiO<sub>2</sub>. The wide variability of glass properties can be seen in Table 19.1 where the properties of different borosilicate glasses do vary a lot.

There are at least 3000 varieties of glass. While silicon properties can be tailored by doping and crystal orientation, glasses can be tailored by a huge repertoire of methods. First of all, glass composition can be varied, and sometimes very small changes in composition lead to drastically different properties: changing Na<sub>2</sub>O content from 15.75% to 15.5%, Al<sub>2</sub>O<sub>3</sub> content from 3.4% to 2.2% and increasing MgO from 3.0% to 3.2% make glass biosoluble. Pyrex's thermal expansion can be tailored by its composition, but it is also affected by the cooling rate used during solidification. Tempering can be used to introduce compressive stress in the surface layers of glass, blocking the propagation of cracks and greatly improving mechanical strength.

The emphasis in this chapter will be on Pyrex-type glasses that are available in wafer format and widely used in MEMS and microfluidics. Flat-panel display processing on large glass panes is discussed in Chapter 37 and only briefly touched upon here.

## 19.1 Structure and Properties of Glasses

Glasses are metastable liquids without long-range order. Glass networks are based on Si–O bonds (Figure 19.1). These bonds form tetrahedral basic units (Figure 13.18). When dopant atoms are incorporated into a silicon dioxide network, they can take either substitutional or interstitial positions. Boron can take the position of a silicon atom in the network and form oxide (B<sub>2</sub>O<sub>3</sub>). Sodium, potassium and lead are interstitial network modifiers which bond to one silicon atom only, because they have a valence of one. Therefore they do not contribute to networked structures.



**Figure 19.1** The structure of glass: Si–O network with metal atoms dispersed (the fourth bond of silicon is out-of-plane)

As temperature rises, the electrical behavior of glass is not too different from that of semiconductors. Thermally generated charge carriers are important in many applications, but in glasses these thermally generated charge carriers are sodium ions, not holes or electrons. In Pyrex glass the room temperature resistivity is  $10^{14}$  ohm-cm, at  $250^{\circ}\text{C}$  it is  $10^8$  ohm-cm, but at  $350^{\circ}\text{C}$  only  $10^6$  ohm-cm. Anodic bonding is also based on sodium ionic conduction (at  $300\text{--}500^{\circ}\text{C}$ ).

Glasses do not experience sharp melting but gradual softening. As a simplification the glass transition temperature is often used to characterize glasses: at  $T_g$  glass behavior changes from a brittle–elastic solid to viscous melt. Room temperature viscosities of glasses are  $10^{18}$  Pa-s, and  $T_g$  is the temperature when viscosity equals  $10^{12.3}$  Pa-s. The usable temperature range of glass wafers is limited by softening, which takes place at about  $820^{\circ}\text{C}$  for Pyrex glasses. In practice the processing temperatures are limited to about  $600^{\circ}\text{C}$  because even minor dimensional and shape changes, for example increased warp, may render bonding impossible and lithography out of focus. Table 19.1 lists some important properties of borosilicate glasses, and compares them to quartz glasses.

The proper term for pure amorphous  $\text{SiO}_2$  is fused silica even though it is often called “quartz.” It is very different from glasses: it does not exhibit temperature-dependent resistivity because there are no metal oxides which dissociate and release metal and oxygen ions.

One major microfabrication difference between soda lime glass and fused silica is optical transmission: the elimination of metals leads to superior optical transmission. Fused silica is transparent from NIR to deep UV while soda lime transmission in UV is strongly

attenuated (Figure 8.9). This means that photomasks for DUV lithography must be made on “quartz” but above 400 nm exposure wavelength the much cheaper soda lime glass can be used. The coefficient of thermal expansion for soda lime glass is  $10\text{ ppm}^{\circ}\text{C}$  (cf.  $2.6\text{ ppm}^{\circ}\text{C}$  for Si) and as a photomask material soda lime glass is limited to applications above  $3\text{ }\mu\text{m}$  linewidths where dimensional control requirements are lax (recall Exercise 9.4). Fused silica with its extremely low CTE retains its dimensions much better under intense UV illumination.

Quartz is pure crystalline silica. Because quartz is crystalline, it can be anisotropically wet etched by HF while glasses are etched isotropically because of their amorphous structure. And because of crystallinity, quartz properties are anisotropic, for example thermal conductivity parallel and perpendicular to the  $z$ -axis differs by a factor of two, and Young’s modulus parallel to the  $z$ -axis is 97 GPa but 76 GPa perpendicular to it. These properties have been used to make micromechanical devices.

CVD oxides are sometimes called glasses, for example undoped silica glass (USG) and phosphorus-doped silica glass (PSG). These are not glasses proper because they do not contain metal oxides. They are more akin to silica in chemical composition. Spin-on glasses come in two main varieties, silicate (inorganic) and siloxane (inorganic–organic hybrid), but neither is glass in the traditional sense. All these “glasses” share some properties with glasses, like amorphous structure, electrical and thermal insulation, and optical transparency, but not ionic conductivity, or sodium contamination danger.

## 19.2 Glass Substrates

Glass can be cut into many shapes, and it is available in silicon-compatible formats: round wafers, even with wafer flat, even though glass is amorphous and flat in silicon is used to identify crystal orientation. Flats are useful when bonding glass to silicon wafers with flats.

The glass wafer fabrication process is not too different from the silicon wafering process. The main steps are:

1. Shape cut
2. Edge grinding
3. Lapping
4. Polishing
5. Cleaning
6. Inspection
7. Packaging

An example of glass wafer specification is shown in Table 19.2. Because there are many glass varieties, the specs are never as detailed or standardized as with silicon.

**Table 19.1** Quartz vs. borosilicate glasses

|                                                      | Quartz glasses            | Borosilicates             |
|------------------------------------------------------|---------------------------|---------------------------|
| Density ( $\text{g}/\text{cm}^3$ )                   | 2.0–2.2                   | 2.25–2.45                 |
| Tensile strength (MPa)                               | 70–120                    | 80–150                    |
| Compressive strength (MPa)                           | 1600–2000                 | 600–1000                  |
| Young’s modulus (GPa)                                | 62–75                     | 65–85                     |
| CTE ( $10^{-6}/^{\circ}\text{C}$ )                   | 0.53                      | 3–6                       |
| Resistivity (ohm-cm)                                 | $10^{18}\text{--}10^{19}$ | $10^{14}\text{--}10^{18}$ |
| Dielectric constant $\epsilon_r$                     | 3.7–3.9                   | 4.5–8                     |
| Thermal conductivity (W/m-K)                         | 4.8                       | 2.0–3.8                   |
| Glass transition <sup>a</sup> ( $^{\circ}\text{C}$ ) | 1300                      | 600                       |

<sup>a</sup>Defined as temperature where viscosity  $\eta = 10^{12.3}$  Pa-s.  
Source: Hülsenberg, D., A. Harmisch and A. Bismarck (2008)

**Table 19.2** Glass wafer specifications

|                  |                                                    |
|------------------|----------------------------------------------------|
| Diameter         | $100.0 \pm 0.3$ mm                                 |
| Thickness        | $500 \pm 25$ $\mu\text{m}$                         |
| Bow/warp         | <10 $\mu\text{m}$                                  |
| Top side         | Polished                                           |
| Back side        | Polished                                           |
| Roughness        | <1.5 nm RMS                                        |
| TTV <sup>a</sup> | <10 $\mu\text{m}$                                  |
| Primary flat     | 32.5 mm                                            |
| Particles        | <5 $\mu\text{m}$                                   |
| Scratch and dig  | 60–40 according to MIL-PRF-13830                   |
| Edge exclusion   | 6 mm                                               |
| Packaging        | Under class 1000 according to Federal Standard 209 |

<sup>a</sup>Total Thickness Variation.

Glass wafer thicknesses range from 50  $\mu\text{m}$  to 10 mm and in general glass wafers can be found with a very wide range of specs. In many respects glass wafer specs are similar to silicon wafer specs, for instance wafers with tighter TTV can be bought, or wafers with less than 0.5 nm RMS roughness (surface roughness is an important factor for bondability). Glass wafers are available up to 300 mm in diameter, just like silicon. Thick glass wafers are standard in the millimeter range. Glass is commonly available also as square or rectangular sheets, ranging in size from microscope slides (76  $\times$  25 mm) to flat-panel display “mother panes” (2160  $\times$  2460 mm).

### 19.3 General Processing Issues with Glasses

Some problems of glass substrates are related to their processing in silicon-oriented labs. Even though glass wafers are round like silicon, have flats like silicon and are available in the same thicknesses as silicon, complications can still arise, especially in automated tools. The detection of the presence and movements of wafers in processing equipment is based on either optical or capacitive sensors, and these are fooled by transparent dielectric wafers. Amorphous silicon or polysilicon deposition on the wafer back side can be used as a preventive measure, but the role of this extra film needs to be considered for all process steps and tools.

Glasses contain alkali metals which are serious contaminants to silicon transistors. There are various ways of dealing with this: use fused silica instead of glass (which will make bonding very difficult); limit glass processing to specific equipment and/or cleanroom areas; or take special precautions, like depositing barriers on glass to prevent

sodium diffusion. This is a standard starting step in flat-panel display manufacturing, and it works: the thin-film transistors in displays are silicon devices which work well for extended periods.

In this section we will briefly discuss various processing issues and then devote more attention to two technologies, etching and bonding. Glasses are also machined by various non-standard techniques (non-standard from a microfabrication point of view), powder blasting for example. This will be discussed in Chapter 23. Various grinding, milling and drilling methods will be discussed in Chapter 24.

#### 19.3.1 Surface preparation

Glass can be cleaned with RCA cleans just like silicon wafers; in fact, the RCA clean was invented for glass cleaning in TV tube manufacturing. The surface of the glass is chemically identical to oxidized silicon, with Si–OH as the typical surface termination. Surface functionalization techniques developed for one can easily be transferred to the other, like silane coatings for hydrophobicity.

#### 19.3.2 Lithography

The exposure of resist on a glass wafer requires more energy than exposure on silicon because there is no reflected light. An additional 50% of the exposure dose might be used as a starting point. Reflections from etched structures or thin films need to be accounted for, because they can cause unwanted exposure (see Figure 10.9).

Fabrication on glass substrates offers intriguing ways for self-alignment by back-side exposure. A bottom gate thin-film transistor (TFT) is shown in Figure 19.2. After chromium bottom gate lithography, etching and stripping, a stack of PECVD oxide (gate oxide), a-Si:H (channel) and nitride (passivation) are deposited. Photoresist is applied on the top side, but exposure is made from the back side, with the Cr gate blocking light (photomasks are glass plates with chromium patterns on them!). Resist is then developed and nitride etched. After resist stripping and wafer cleaning, chromium is deposited. During annealing, chromium silicide will form on the a-Si layers, but not on the nitride (recall Figure 7.15).

#### 19.3.3 Deposition on glass

Basically, deposition on glass is identical to deposition on silicon, with only minor differences. Sputtering and evaporation, and PECVD, can be done as on silicon. With thermal CVD temperature limitations arise: for example,



**Figure 19.2** Top: Cr gate has been patterned on the glass substrate, and PECVD gate oxide, a-Si:H channel and nitride stopper layers have been deposited. Bottom: resist on the top side is exposed through the back side, making S/D metallization self-aligned to the gate. Redrawn after Hirano *et al.* (1996)

polysilicon deposition by LPCVD around  $600^{\circ}\text{C}$  is at the limit of mechanical strength of glass wafers. Another issue is contamination: very few laboratories allow polysilicon deposition on glass wafers.

Film stresses are always present. When processing thick SU-8 resist on thin microscope glass, considerable bending can occur, even rendering the whole device useless. Thicker glass would help, but sometimes optical thickness considerations require thin substrates.

Because glass is used in optical devices, transparent conducting oxides (TCOs) are often needed, and indium-doped tin oxide (ITO), fluorine-doped tin oxide (FTO) and the like are used. These films can be deposited by vacuum techniques like sputtering, evaporation and (PE)CVD, but in large-area devices like flat-panel displays (FPDs) and solar cells, pyrolysis and various wet processes are also used.

Another big optical application is windowpane coating for self-cleaning and optical and thermal properties.

Titanium dioxide in its crystalline anatase phase is a photocatalyst that can break down organic dirt on windows, which is then flushed away by rain. Thin-film multilayer filters are deposited on glass, for example by tailoring infrared reflectivity for minimizing thermal losses in windows or improving windshield optics. Metallic and dielectric layers with thicknesses from 10 nm onward are sputtered by the square meter in the glass industry for building and automotive applications. Over 90% of the sputtering target industry (by weight) is geared up for these applications.

### 19.3.4 Glass polishing

While silicon microtechnology adopted polishing by CMP only recently, glass polishing for microscope and telescope lenses is 400-year-old technology. In fact, early CMP equipment was directly borrowed from the glass polishing industry. Melted glass as well as spin-coated glass can be polished to create smooth surfaces, for either ease of further processing, or optical properties.

### 19.3.5 Glass embossing, molding and blowing

Glass softens upon heating and embossing can be done to create glass microstructures. The principles are the same as discussed for polymer embossing in Sections 18.3–18.5. First of all, embossing must be done above  $T_g$ . For glasses the transition temperatures are around  $400\text{--}800^{\circ}\text{C}$ . Second, reaction between master and glass must be avoided. Because of the higher temperatures involved in glass embossing, this is more difficult than with polymers, but non-oxidizing materials like TiN and TiAlN have been successfully used. For de-embossing, the surface quality of the master is important, and microfabricated silicon and nickel promise smooth-enough surfaces.

Glass molding is an old technique for lens fabrication: instead of polishing each lens individually, molten glass is poured over a mold, and lenses are formed. The basic idea and lens shapes are identical to polymer lenses (Figure 18.10), but of course the mold material has to be high-temperature resistant. The method is best suited for cheap mass-produced lenses.

The silicon mold can be made by RIE, enabling delicate shapes that are not possible with glass etching. Figure 19.3a shows the starting situation: the silicon mold has been filled with borofloat glass (at temperatures around  $800^{\circ}\text{C}$ ). Three possible process options then emerge: in Figure 19.3b glass etching and polishing end up with thick glass structures embedded in silicon. These might be used as thermal isolation islands. In



**Figure 19.3** (a) Silicon mold made by RIE, filled by molten glass with three alternative process continuation options: (b) glass etching/polishing to planarize; (c) silicon etching/polishing to planarize; and (d) complete silicon etching



**Figure 19.4** Polysilicon micromotor before glass coating and after glass casting and polishing. Reproduced from Yasseen *et al.* (1999), copyright 1999, by permission of IEEE

Figure 19.3c silicon etching and polishing result in embedded silicon islands inside glass. These might be used as conductive paths or as thermal sinks. Finally, in Figure 19.3d all the silicon has been etched away to create glass microstructures of high aspect ratio. One example is shown in Figure 19.4, namely a polysilicon mold, with and without molded glass.

## 19.4 Glass Etching

### 19.4.1 Wet etching

Etching of glass follows the same basic ideas as etching of silicon dioxide. Fused silica etching is very similar

to thermal oxide etching. Hydrofluoric acid etches silica according to Equations 19.1 and 19.2. The resulting etch profile is isotropic (Figure 19.5):



The insoluble  $\text{SiF}_4$  is turned to soluble  $\text{H}_2\text{SiF}_6$ . Pyrex and other glasses are much more difficult to etch because insoluble fluorides are formed, for example  $\text{AlF}_3$  and  $\text{CaF}_2$ . These etch products slow down the etch rate but are removed by undercutting.

The Pyrex etch rate in hydrofluoric acid (49%) is about  $8 \mu\text{m}/\text{min}$  at room temperature. Etching  $300 \mu\text{m}$  can be done in 40 min, and the resulting surface roughness is about 20 nm. A microfluidic nozzle made by double-sided etching of a Pyrex wafer  $500 \mu\text{m}$  thick is shown in Figure 19.5. Amorphous LPCVD silicon was used as the etch mask. The addition of some HCl to HF improves smoothness: the insoluble metal fluorides are dissolved by chlorine, and surface roughness below 10 nm can be obtained in deep etching. HCl has only a minor effect on etch rate and the most common mask materials behave similarly with regard to HF and 10:1 HF:HCl mixture. But remember that, because each manufacturer has a proprietary glass composition, glass etch rates are variable. High aluminum oxide content will reduce the etch rate, and the same applies to magnesium and calcium.

When glass or fused silica wafers are etched, the etched depth can be anything up to  $500 \mu\text{m}$ , and the issue of etch mask stability becomes paramount. This applies to both



**Figure 19.5** Wet etching Pyrex wafer from both sides: a-Si mask, 49% HF. Reproduced from Saarela *et al.* (2009) by permission of IOP



**Figure 19.6** Tensile stresses in thin films tend to open cracks which allow etchant to penetrate through the mask

wet and dry etching, but for different reasons. Photore sist masks fail by adhesion loss. Concentrated HF is a very aggressive etchant, and it etches rapidly under the mask. If there is, for example, a water-rich interface layer between the wafer and resist, etchant will penetrate very fast laterally. Faster lateral than vertical etching is often seen in isotropic etching. In wet etching, mask mechanical strength is important, especially when large undercut develops, and long overhangs are formed.

Common hard masks for glass etching include metals (Ni, Cr, Au) and CVD films: PECVD a-Si and SiC and LPCVD a-Si. In theory all these are resistant to HF, but often fail in practice. The main explanation is tensile film stress. Any cracks in tensile-stressed film tend to enlarge as shown in Figure 19.6, and the etchant will eventually penetrate through the mask. The surface of the mask matters: if it is hydrophobic, it lasts longer because it will reject aqueous etchant, while if it is hydrophilic the surface will attract etchant.

Double layer films are often used as etch masks: Cr/Au is an obvious choice, because gold needs an adhesion promotion layer. But it is useful to deposit multiple layers of gold, as the later layers will seal some of the cracks in the preceding films, improving mask quality.

#### 19.4.2 RIE

Oxide etching is driven by ion bombardment, therefore anisotropy is good, but mask selectivity is low. The problem of mask erosion really arises when deep etching is done. Non-erodible masks for fluorine plasmas include many metals (Ni, Cr, Al, Cu), oxides and nitrides ( $\text{Al}_2\text{O}_3$ , AlN). If these are deposited by PVD or CVD, the practical thickness limit is a few micrometers. Electroplated metals nickel and copper can be made to any thickness. The same applies to thick resists. Thick-resist SU-8 has been found to be quite a good mask for glass DRIE. The stability of SU-8 results in the problem of resist stripping



**Figure 19.7** Pyrex glass DRIE: 80 µm etched depth with nickel mask 5 µm thick. Reproduced from Kolari *et al.* (2008) by permission of IOP

afterward. A profile of plasma-etched Pyrex is shown in Figure 19.7. The sidewall angle is steep, but not vertical.

Oxide and glass RIE is critically dependent on the exact composition of the material. In both plasma and wet etching pure silica ( $\text{SiO}_2$ ) is easier to etch than glass with metal oxides ( $\text{Na}_2\text{O}$ ,  $\text{CaO}$ ,  $\text{K}_2\text{O}$ ,  $\text{MgO}$ ,  $\text{Al}_2\text{O}_3$ ,  $\text{Fe}_2\text{O}_3$ ). In plasma etching the lower volatility of the metal fluorides results in a lower rate and larger risk of residues.

Silicon is etched in fluorine plasmas but if the silicon is thick enough, and selectivity reasonable, deep glass etching can be done with a silicon mask. Complete silicon wafers with through-wafer holes have been used as etch masks for glass DRIE. Both anodic bonding and simple mechanical clamping have been employed.

One RIE etch application of glass/fused silica is making masters for UV NIL (see Figure 18.19). There the problem is not etched depth or mask erosion but feature size control, sidewall angle control and smoothness.

It should be remembered that there are good alternatives to glass etching. One technique is to make a silicon mold and cast molten glass on it, and remove the silicon (Figure 19.3), leaving glass microstructures whose dimensional accuracy is determined by the silicon DRIE process. In the case of larger structures drilling, powder blasting and spark-assisted etching can be used. There are also laser processes to pattern glass, and some of these are described in Chapter 24.

#### 19.5 Glass Bonding

All the usual requirements for successful bonding apply to glass bonding: flat, smooth surfaces, cleaned of

particles and with proper bond forming surface chemistry, are needed. Various surface cleaning and activation processes have been tried, including RCA-1, sulfuric acid, nitric acid, KOH, acetone and other solvents. Because the surface chemistry of glasses is similar to oxidized silicon, RCA cleans (ammonia/peroxide, see Chapter 12) are most common.

Thermal fusion bonding takes place above the glass transition temperature, which translates to about  $600^{\circ}\text{C}$  for Pyrex glasses. Bonding times range from 1 to 6 hours, with extended cooldown. Specialty glasses may have much lower or higher values of  $T_g$ , but fusion bonding must be between two identical wafers. Fused silica bonding can be done around  $1000^{\circ}\text{C}$ , but because fused silica does not soften like glasses, bonding is very sensitive to flatness and smoothness deviations.

Anodic bonding of glass and silicon (Section 17.3) is a very special case of direct bonding, and it requires special properties from the glass, namely suitable sodium concentration and matching CTE. A glass wafer is connected as a cathode and a silicon wafer as an anode. Current increases rapidly at the initiation of bonding because the contact area increases, and then decreases exponentially as oxygen ions react at the interface to form  $\text{SiO}_2$  and the oxide becomes thicker (Figure 19.8). When the current has dropped to 10% of its peak value, bonding is termed finished.

Bonding glass to glass anodically is possible if thin films are deposited on one of the wafers. Silicon nitride, silicon carbide, polysilicon and a-Si can be used. When a voltage is applied, sodium ions migrate toward the interface, which acts as a diffusion barrier for sodium (silicon dioxide does not work because it is not a barrier for sodium). A depletion layer is formed and electrostatic force pulls the wafers strongly together, just as in standard anodic bonding. Oxygen ions diffuse toward the interface and react there to form Si–O bonds. The interlayer film should be free of hydrogen, because otherwise oxygen



**Figure 19.9** Non-bonded area and bond strength as a function of temperature and voltage, for anodic bonding of two glass wafers with polysilicon interlayer. Adapted from Wei *et al.* (2003)

will react with the hydrogen and not with silicon. Other outgassing products from PECVD films are also problematic. Bondable area and bond strength increase with voltage and temperature as expected (Figure 19.9), but often it is desirable to have reasonable bond strength at low temperature.

Glass–glass bonding can be done in a liquid environment. Two cleaned wafers with a proper surface finish are brought together under liquid, and capillary forces draw the wafers together (this is something that must be avoided in release etching of micromechanical free-standing structures, as will be discussed in Chapter 29). If the surfaces are clean and smooth, fairly strong bonds can form even at room temperature, and annealing can be done to increase bond strength.

Various wet cleaning strategies have been developed for liquid bonding. Sulfuric acid (with and without peroxide) cleaning has been shown to work, with or without RCA-1 (ammonia/peroxide) activation. The actual bonding takes place under a DI water stream.

Another bonding technique uses spin coating of a sodium silicate layer, a few tens of nanometers thick, on top of one of the wafers, followed by a  $90^{\circ}\text{C}$  anneal for 1 h. Bond strengths similar to silicon–silicon fusion bonding are possible, showing that Si–O–Si bonds have been formed at high density across the interface.

Adhesive bonding is an option in many microfluidic devices. Again, the principles described in Chapter 17 apply here: adhesive bonding is a low-temperature process which is insensitive to some particle contamination, and it is quite general purpose.

Because glasses are transparent, UV curing adhesives are handy. But adhesive bonding introduces additional



**Figure 19.8** Anodic bonding current vs. time: initial current peak, followed by a long tailout

materials, which may interfere with analytes in microfluidic systems.

## 19.6 Glass Devices

Capillary electrophoresis chips are the most studied microfluidic devices. They can be made of polymers but glass devices are also widely used. Most often they are made simply by etching fluidic channels in one wafer (in the case of polymers, by casting or embossing) and bonding it to a blanket wafer which has the fluidic inlets. In case the capping wafer is the same material, thermal or fusion bonding is performed, or a PDMS cover can be attached (permanently or intermittently). The chips do not usually have metal electrodes integrated; rather Eppendorf tubes are attached to fluidic inlets and platinum wires stuck in. This works well for high-voltage electroosmotic pumping, but if electrochemical detection is used, the detection electrodes need to be integrated in the fluidic channels (recall Figure 18.26).

In the capillary electrophoresis (CE) device shown in Figure 19.10 the fluidic channels are HF etched in the top wafer, and fluidic inlets are drilled in the top wafer, too. Electrodes are fabricated in the bottom wafer. In order to have a flat surface for fusion bonding, the metal electrodes are recessed (Figure 19.11). A glass wafer is etched to the same depth as the designed metal thickness. After etching of the glass, the resist is dried by baking, and used again, as a lift-off mask for metal deposition. Next, an acetone wash removes photoresist, and together with it the metal that was deposited on top (see Chapter 23 for more on lift-off). The resulting metal surface is fairly close to the



**Figure 19.11** Embedded metal electrodes: photoresist masked etching of glass wafer, metal deposition without removing the resist, followed by lift-off. Metal thickness is designed to match the etched depth. Redrawn from Keynton (2005)

original glass surface, except for minor inaccuracies in both etched depth and metal thickness.

Glass–glass fusion bonding is also used in the microfluidic nebulizer shown in Figure 19.12. The nebulizer has heater electrodes that vaporize incoming liquid and mix it with nebulizer gas, shooting out a hot vapor jet from a nozzle. This vapor jet is ionized and the ions analyzed by a mass spectrometer. The same device can be made of silicon but glass is preferred because it is thermally insulating: in order to vaporize high-boiling-point analytes,



**Figure 19.10** Capillary electrophoresis (CE) chip: fluidic channels (gray) etched in top glass wafer and electrodes and pads (black) in bottom glass wafer. Fluidic reservoirs for buffer (B), sample (S), sample waste (SW) and buffer waste (BW) have high-voltage electrodes. Low-voltage electrodes are for conductivity detection (CD)



**Figure 19.12** Fusion bonded glass–glass fluidic nebulizer chip. Reproduced from Saarela *et al.* (2007), Royal Society of Chemistry



**Figure 19.13** Glass–glass bonded PCR reactor. Redrawn from Easley *et al.* (2007)

the chip operates at 400 °C; in a glass chip this hot zone is local, but in a silicon chip heat spreads all over.

When fast temperature ramping is required, it is essential to minimize thermal mass. This can be achieved by etching away excessive glass. In the PCR reactor of Figure 19.13 this has been done after bonding. The channels end up being suspended, with thin roofs and floors. Timed etching was used, so ultimate thinness cannot be achieved.

Glass is the material of choice for not only high-temperature devices like the nebulizer and the PCR reactor, but also for cryogenic devices, and because of the thermal insulation. The cryogenic cooler of Figure 19.14 is fabricated from three glass wafers, requiring seven photomasks. The top wafer is 175 µm thick, and flow channels 50 µm deep are fabricated by HF etching. The middle wafer is 145 µm thick, and again the flow channels are HF etched, but the through-wafer vias are made by powder blasting. In the bottom wafer all the structures are powder blasted because they are large, deep and non-critical, for example the bottom wafer is



**Figure 19.14** Cryogenic cooler with three wafers fusion bonded. Reproduced from Lerou *et al.* (2006) by permission of IOP

thinned to improve thermal isolation. Fusion bonding of the three wafers is done in a single step.

Glass blowing is an old technique that has been given a microfabrication twist. A thin glass sheet is bonded in air to a cylindrical cavity formed by DRIE (Figure 19.15). When the device is heated to 850 °C, the trapped air expands, blowing the thin, softened glass sheet into a spherical bubble. Again in the spirit of traditional glass blowing, the structure is annealed at 560 °C for 30 min to remove stresses and slowly cooled below the strain point, 510 °C for Pyrex. In further processing the device is filled with rubidium chloride and barium, for atomic clock action.

The DNA processing chip of Figure 19.16 consists of three glass wafers and a PDMS layer. The top glass wafer holds the fluidic channels, the second glass wafer the thermal elements, namely heaters and temperature sensors. The PDMS layer acts as a movable membrane valve and is actuated by vacuum/air pressure in the bottom manifold wafer. Microfluidic chips like this offer benefits in DNA processing in many ways: small sample volumes (nanoliters) are important for reagent cost minimization, small size is also important for rapid temperature ramping, and the integration of many analysis steps on one device eliminates manual labor, saves time and improves reproducibility. The PCR chip of Figure 25.7 is similarly of glass–polymer hybrid design.

## 19.7 Specialty Glasses

The variety of glasses is vast, so we offer here only a short glimpse of some specialty glasses. Vycor is porous glass. Various annealing steps cause phase separation, and etching in HCl and NaOH removes practically everything else except silica (with a few percent sodium borate). By



**Figure 19.15** Blown glass sphere atomic clock. Reproduced from Eklund *et al.* (2008), copyright © 2008 by permission of Elsevier



**Figure 19.16** Bioprocessor for nanoliter DNA samples, fabricated on a glass/glass/PDMS/glass wafer stack of 100 mm diameter. Reproduced from Blazej *et al.* (2006), copyright 2006 National Academy of Sciences USA

tailoring the annealing steps and etching conditions, various pore sizes in the range of 10–100 nm are available. Such porous glass has a huge surface area which is useful in many biochips.

Photostructurable glasses (e.g. Foturan) can be exposed by UV light, and after annealing the exposed areas can be anisotropically etched in HF. These glasses contain about 70% silica, 10% Li<sub>2</sub>O, and the usual alkali metal oxides plus small amounts (<1%) of cerium, silver and antimony and tin oxides. When hit by UV radiation cerium ions absorb the light and turn Ce<sup>3+</sup> into Ce<sup>4+</sup> and release an electron, which reacts with Ag<sup>+</sup> to form metallic silver. During the thermal treatment atomic silver forms nuclei and lithium metasilicate crystallizes. This crystallized phase can be etched in 10% HF with 20:1 selectivity to amorphous glass. The etch rate can be up to 20 μm/min. Etched depth is controlled by UV exposure dose. For example, 8 J/m<sup>2</sup> is used to expose through the wafer, while 3 J/m<sup>2</sup> exposes half the wafer thickness.

The problem is that crystal size is quite large, in the range of 10 micrometers, which means that small structures cannot be made and sidewalls will be rough. T<sub>g</sub> values of photostructurable glasses are around 500 °C with CTEs ranging from 7 to 10 ppm/°C, which means that bonding with silicon is out of the question. Figure 19.17 shows a MEMS combustor. The middle wafer is made of photostructurable glass, including the movable piston. De-



**Figure 19.17** MEMS combustor and reciprocating device. Reproduced from Lee *et al.* (2002) by permission of IOP

spite gas leakage between the movable piston and frame, 6 mm movement has been achieved within 0.1 s.

## 19.8 Exercises

1. Design a process for molding glass microlenses.
2. List the processes and phenomena that become important for borosilicate glass as temperature goes up.
3. Estimate the device dimensions for the PCR reactor of Figure 19.13. How thin a glass is there after etching?
4. How could the glass melting process of Figure 19.3 be used to fabricate a thermally insulated PCR reactor?
5. In the cryogenic cooler of Figure 19.14 some structures are wet etched, others powder blasted. Which ones and why?
6. Design an alternative version of the nebulizer of Figure 19.12 but with the heater inside the channel!

## References and Further Reading

- Bien, D. C. S. *et al.* Characterization of masking materials for deep glass etching, *J. Micromech. Microeng.*, **13**, S34.
- Blazej, R., Kumaresan, P. and Mathies, R. A. (2006) Micro-fabricated bioprocessor for integrated nanoliter-scale Sanger DNA sequencing, *Proc. Natl Acad. Sci.*, **103**, 7240–7245.
- Danel, J. S. and G. Delapierre (1991) Quartz: a material for microdevices, *J. Micromech. Microeng.*, **1**, 187–198.
- Easley, C. J., J. A. C. Humphrey and J. P. Landers (2007) Thermal isolation of microchip reaction chambers for rapid non-contact DNA amplification, *J. Micromech. Microeng.*, **17**, 1758–1766.
- Eklund, E. J. *et al.* (2008) Glass-blown spherical microcells for chip-scale atomic devices, *Sens. Actuators*, **A143**, 175–180.
- Hirano, N. *et al.* (1996) A 33cm diagonal high-resolution TFT-LCD with fully self-aligned a-Si TFT, *IEICE Trans. Electron.*, **E79**, 1103.
- Hülsenberg, D., A. Harnisch and A. Bismarck (2008) **Microstructuring of Glasses**, Springer.
- Iliescu, C., B. Chen and J. Miao (2008) On the wet etching of Pyrex glass, *Sens. Actuators*, **A143**, 154–161.
- Keynton, R. S. (2004) Design and development of microfabricated capillary electrophoresis devices with electrochemical detection, *Anal. Chim. Acta.*, **507**, 95–105.
- Kim, M.-S. *et al.* (2005) Fabrication of microchip electrophoresis devices and effects of channel surface properties on separation efficiency, *Sens. Actuators*, **B107**, 818–824.
- Kolari, K., V. Saarela and S. Franssila (2008) Deep plasma etching of glass for fluidic devices with different mask materials, *J. Micromech. Microeng.*, **18**, 064010.
- Lee, D. H. *et al.* (2002) Fabrication and test of a MEMS combustor and reciprocating device, *J. Micromech. Microeng.*, **12**, 26–34.
- Lee, J.-H. *et al.* (2008) A simple and effective fabrication method for various 3D microstructures: backside 3D diffuser lithography, *J. Micromech. Microeng.*, **18**, 125015.
- Lerou, P. P. P. M. *et al.* (2006) Fabrication of a micro cryogenic cold stage using MEMS-technology, *J. Micromech. Microeng.*, **16**, 1919–1925.
- Saarela, V. *et al.* (2007) Glass microfabricated nebulizer chip for mass spectrometry, *Lab Chip*, **7**, 644–646.
- Saarela, V. *et al.* (2009) Microfluidic heated gas jet shape analysis by temperature scanning, *J. Micromech. Microeng.*, **19**, 055001.
- Wang, H. Y. *et al.* (1997) Low temperature bonding for micro-fabrication of chemical analysis devices, *Sens. Actuators*, **B45**, 199–207.
- Wei, J. *et al.* (2003) Low temperature glass-to-glass wafer bonding, *IEEE Trans. Adv. Packag.*, **26**, 289–294.
- Yasseen, A. A., J. D. Cawley and M. Mehregany (1999) Thick glass film technology for polysilicon surface micromachining, *J. Microelectromech. Syst.*, **8**, 172–179.

# Anisotropic Wet Etching

Microsystems technology relies on anisotropic wet etching of silicon for many major applications. Bulk micromechanics, the machining of silicon by anisotropic wet etching, depends on silicon crystal plane-dependent etching, and many surface micromechanical and SOI devices make use of silicon wet etching for auxiliary structures, even though the main device features are defined by plasma etching. Because  $<100>$  silicon is the workhorse of microsystems, our discussion concentrates on it. Both  $<110>$  and  $<111>$  etching will be briefly discussed.

Etched grooves, trenches and wells exemplify the basic features of crystal plane-dependent etching. They can be used as sample wells and flow channels in microfluidics, or as optical fiber alignment fixtures. Other basic structures are membranes (diaphragms), beams and cantilevers. Mechanical devices like pressure sensors, resonators and AFM cantilevers rely on these basic elements. Through-wafer structures include nozzles and orifices, for example for ink jets or micropipettes.

## 20.1 Basic Structures on $<100>$ Silicon

Anisotropic etching relies on aligning the structures with wafer crystal planes. The primary flat, which is along the  $[110]$  direction, is used as a reference. Rectangular structures with concave corners are easily made, with four  $(111)$  sidewalls and the  $(100)$  plane as the bottom (Figure 20.1).

Self-limiting depth is the depth where the slow etching  $(111)$  planes meet. The angle between the  $(100)$  and  $(111)$  planes is  $54.7^\circ$  and the self-limiting depth ( $d$ ) for mask opening  $W_{\text{mask}}$  is calculated from

$$d = \frac{W_{\text{mask}}}{\sqrt{2}} \quad (20.1)$$



**Figure 20.1** Anisotropic wet-etched profiles in  $<100>$  wafer. The sloped sidewalls are the slow etching  $(111)$  planes; the horizontal planes are  $(100)$ . Etching will terminate if the slow etching  $(111)$  planes meet

A pattern  $742 \mu\text{m}$  wide on the mask will then result in a through-wafer “zero-sized” hole, for a wafer  $525 \mu\text{m}$  thick.

Typical mask films for anisotropic wet etching, thermal oxide and LPCVD nitride are deposited on both sides of the wafer. This is useful in protecting one side while processing the other, but it can also be utilized to speed up processing: this was used in microthruster (Figure 11.7) etching from both sides simultaneously. In the AFM cantilever-tip structure (Figure 20.2) the tip is etched first, followed by oxide etch and  $\text{p}^{++}$  diffusion. Another oxidation is done for backside mask and front side protection.

## 20.2 Etchants

A number of alkaline etchants have been tried for crystal plane-dependent etching but KOH has emerged as the main etchant. Typical etch rates are about  $1 \mu\text{m}/\text{min}$  which translates to 6 hours for through-wafer etching of  $380 \mu\text{m}$  wafers. KOH poses a contamination hazard for CMOS work, therefore CMOS-compatible etchants are desirable. Tetramethyl ammonium hydroxide,  $(\text{CH}_3)_4\text{NOH}$ , usually known as TMAH, is such a compound. In fact, both NaOH and TMAH are used as photoresist developers,



**Figure 20.2** AFM cantilever and tip: (a) oxide-masked etching of tip; (b)  $p^{++}$  boron doping on front side; (c) Thermal oxidation and KOH etching from back side, stopping on  $p^{++}$  layer

in diluted concentrations and at room temperature, so the contamination danger can be handled with proper working procedures. Organic amines have also been used

for anisotropic etching, most notably ethylene diamine ( $(\text{NH}_2)(\text{CH}_2)_2\text{NH}_2$ ) mixture with pyrocatechol and water, known as EDP or EPW. Hydrazine ( $\text{N}_2\text{H}_4$ ) has also been tried. Both amines pose occupational safety and health hazards, and thus are not widely used. Ammonia has been shown to etch silicon reasonably well, but the stability of ammonia etch baths during extended etching needs special attention. Etch rates of  $1.5 \mu\text{m}/\text{min}$  at  $70^\circ\text{C}$  have been demonstrated, with high selectivity against oxide and aluminum masks, and very smooth surfaces of  $2.4 \text{ nm RMS}$  roughness, whereas typical KOH-etched surfaces exhibit  $5\text{--}10 \text{ nm RMS}$  roughness.

Even though all the alkaline etchants share the same basic features of etching  $(100)$  crystal planes quickly and  $(111)$  planes slowly, the actual selectivity between the crystal planes needs careful attention. KOH exhibits 200:1 selectivity between  $(100)$  and  $(111)$ , whereas selectivity is only 30:1 in TMAH. Exact selectivities are dependent on etchant concentration and temperature. But when other crystal planes are considered, even more differences pop up: when planes like  $(110)$  and high-index planes like  $(311)$  are studied, the differences multiply. Figure 20.3 shows the etch rates for KOH. It is important for etch geometry which are the (local) maximum etch rate planes. Table 20.1 lists etch rate relative to  $(110)$ .

Isopropanol (IPA) addition to KOH will change the relative etch rates of crystal planes and, depending on exact conditions, either the  $(100)$  or  $(110)$  planes will be the maximum etch rate planes. Other surfactants have also been used to modify crystal plane selectivities.

Because etch times are rather long, etchant stability over hours and days needs special attention. When silicon is etched in TMAH, the etched silicon dissolves



**Figure 20.3** Etch rates in different crystal directions in 50% KOH at  $78^\circ\text{C}$ : (a)  $<100>$  Si: fast, but not maximum etching in  $(010)$  direction; (b)  $<110>$  Si:  $(010)$  near maximum etch rate. Reproduced from Seidel *et al.* (1990a) by permission of Electrochemical Society Inc.

**Table 20.1** Etch rates normalized to (110) crystal plane in 30% KOH

|       |       |       |       |
|-------|-------|-------|-------|
| (110) | 1.00  | (100) | 0.548 |
| (210) | 1.072 | (211) | 0.906 |
| (221) | 0.491 | (310) | 1.00  |
| (311) | 0.987 | (320) | 1.06  |
| (331) | 0.797 | (530) | 1.069 |
| (540) | 1.039 | (111) | 0.004 |

Source: Sato, K. et al. (1998).

into the etchant solution and changes its properties, especially selectivities. In order to stabilize the etchant, some silicon is dissolved into TMAH before etching, in a concentration higher than that caused by etched silicon. Pyrocatechol is employed in EDP for similar reasons: the decomposition of ethylene diamine releases small amounts of pyrocatechol, which changes the etchant composition, but if pyrocatechol is added in large amounts to begin with, the decomposition has negligible effect.

## 20.3 Etch Masks and Protective Coatings

Silicon dioxide and silicon nitride are the common masking materials for anisotropic wet etching. Oxide etch rate in KOH is a few nanometers per minute, while TMAH and EDP etch oxide hardly at all. Nitride is more resistant than oxide in both etchants. Silicon etch rate and mask etch rate depend on temperature and concentration but some general guidelines can be given. An oxide thickness of  $2\text{ }\mu\text{m}$  is needed for through-wafer etching in KOH whereas  $200\text{ nm}$  is sufficient in TMAH or EDP. Plasma oxides etch fastest and thermal oxides slowest. LPCVD nitride films of  $200\text{ nm}$  are thick enough for almost any applications, but PECVD nitrides have widely varying properties.

As a practical issue it should be noted that thermal oxide and LPCVD nitride are furnace processes and film is grown/deposited on both sides of the wafer so that the wafer back side is protected. This is important when deep etching is done. PECVD deposition is usually on the front side of the wafer only.

All silicon etchants etch aluminum, which means that either aluminum deposition has to be done after silicon etching, or aluminum has to be protected during silicon etching. In some cases aluminum has been replaced by another metal, like gold. Some relief can be achieved by saturating TMAH solution with silicon, but typically only very short alkaline etchings are done after metallization.

### 20.3.1 Peeling masks/nested masks

Photoresist coating over severe topography can be eliminated by double masking (also known as peeling masks or nested masks): two different mask materials are patterned on a planar wafer, before the first silicon etching step (Figure 20.4). Mask 1 is discarded after the first silicon etching step, and silicon etching continues



**Figure 20.4** Peeling mask/nested mask: nitride (medium gray) deposition and patterning; CVD oxide (dark gray) deposition and patterning; first silicon etching; oxide etching in HF; second silicon etching with nitride mask



**Figure 20.5** Symmetrical bulk silicon micromachined accelerometer: wafer  $380\text{ }\mu\text{m}$  thick has been etched through from both sides simultaneously. SEM courtesy VTI Technologies

with the second mask. Combinations of resist, oxide and nitride are common. An accelerometer fabricated by peeling masks and double-sided symmetric wet etching is shown in Figure 20.5.

## 20.4 Etch Rate and Etch Stop

Silicon etch rate follows the Arrhenius exponential law: for 30% KOH at 60 °C it is 24 µm/h, at 80 °C it is 79 µm/h and at 100 °C it reaches 225 µm/h. KOH, TMAH and EDP practical etch rates are of the order of 0.5–1 µm/min for (100) crystal planes. Table 20.2 lists the major properties of alkaline etchants for silicon.

In addition to silicon etch rate, other factors must be considered: mask etch rate and surface roughness often increase when silicon etch rate increases. The surface roughness of deep (almost through-wafer) KOH etched silicon is about 10–50 nm (RMS). This depends on a number of factors: KOH concentration and temperature (high and low temperatures give rougher surfaces), agitation (ultrasonics reduces roughness) and additives (arsenic and antimony oxides improve smoothness, and surfactants do so too). However, other properties also change: surfactants affect crystal plane selectivities, and the process needs to be characterized anew.

**Table 20.2** Alkaline anisotropic etchants: some main features of etchants

| Etchant                                                  | KOH    | TMAH   | EDP/EPW       |
|----------------------------------------------------------|--------|--------|---------------|
| Rate (at 80 °C)<br>(µm/min)                              | 1      | 0.5    | 1 (at 115 °C) |
| Typical concentration                                    | 40%    | 25%    | 80%           |
| Selectivity<br>(100):(111)                               | 200:1  | 30:1   | 35:1          |
| Selectivity<br>Si:SiO <sub>2</sub>                       | 200:1  | 2000:1 | 10 000:1      |
| Selectivity<br>Si:Si <sub>3</sub> N <sub>4</sub>         | 2000:1 | 2000:1 | 10 000:1      |
| Etch stop factor<br>(10 <sup>20</sup> cm <sup>-3</sup> ) | 10     | 100    | 50            |

Etch stop is an idealization; infinite selectivities are not encountered in the real world. High selectivity is termed etch stop when selectivity is so high that etch timing becomes non-critical. Etch stop can happen through various mechanisms. The etch rate of boron-doped silicon decreases rapidly when doping levels exceed 10<sup>19</sup> cm<sup>-3</sup>, as shown in Figure 20.6. The exact mechanism is unknown but high stresses in heavily doped film may play



**Figure 20.6** The p++ etch stop: (a) with KOH concentration as a parameter; (b) with etch temperature for 24% KOH as a parameter. Reproduced from Seidel *et al.* (1990b) by permission of Electrochemical Society Inc.

a part. This property is frequently used in bulk MEMS, as a way to fabricate simple mechanical structures. It is, however, not possible to fabricate electrical devices on such highly doped material. For instance, piezoresistors cannot be made because the  $p^{++}$  etch stop doping level is higher than the piezoresistor doping level. The stresses in  $p^{++}$  doped structures make them mechanically inferior to lightly doped material. Furthermore, slips are introduced in silicon because of high stresses, and this makes bonding of highly doped wafers difficult.

On the positive side, more shapes can be made. Simple through-wafer etching can produce only square



**Figure 20.7**  $p^{++}$  etch stop: dots and rectangles were masked during doping and are etched in KOH, while  $p^{++}$ -doped areas form a silicon membrane

and rectangular nozzles, but with the boron etch stop technique all shapes become possible. As shown in Figure 20.7, circular nozzles can be defined by lithography and boron ion implantation. Features can also be placed close to each other. KOH etching stops at  $p^{++}$  regions, but continues in the areas that were protected by photoresist during implantation. In this application back side to front side alignment is non-critical: the nozzle shape is defined by the front-side implantation mask only.

#### 20.4.1 Electrochemical etch stop

When a silicon wafer is an anode in an alkaline etching solution, biased positively, above the passivation potential the surface will be oxidized, which stops silicon dissolution. The n-type layer of a pn structure can similarly be protected (Figure 20.8). Positive potential, above the passivation potential, is applied to the n-type layer. Etching of p-type silicon continues until the diode is destroyed, and n-type silicon is then passivated. Electrochemical etch stop can be achieved with lightly doped material, and it is therefore possible to fabricate a number of devices on such material.

### 20.5 Front-Side Processed Structures

Initially the etched shapes are determined by the fast etching planes, usually the (100) planes. But in the end it is



**Figure 20.8** (a) Electrochemical cell for silicon electrochemical etching in KOH: p-type silicon etched; n-silicon passivated by anodic oxide. Reproduced from Wong *et al.* (1992) by permission of Electrochemical Society Inc. (b) Passivation potential and anodic oxidation regime. Reproduced from Collins (1997) by permission of IEEE



**Figure 20.9** The largest rectangle bounded by [110] orientations along and perpendicular to flat will be etched: original mask openings in white, final etched shapes shown by dotted lines. Oxide shown gray. Free-standing oxide bridge will be formed in the middle geometry. Defects of arbitrary shape will end up as rectangles

the slow etching planes, usually the (111) planes, that determine the shape. Mask patterns are usually aligned to major [110] crystal axes (Figure 20.9), but due to either wafer miscut or alignment error, patterns can be off-axis. Anisotropic wet etching carves out the shape that is determined by the largest rectangle bounded by the pattern. This can be used to fabricate free-standing structures by undercutting, as in the middle structure of Figure 20.9. The nitride clips of Figure 20.10 are formed similarly.

Note that inverse polarity structures (opposite mask oxide vs. silicon openings) behave rather differently in anisotropic wet etching (Figure 20.11).

Cantilevers and bridges can be made by front-side micromachining by undercutting. The structures are aligned not to the main axes of silicon, but for example 45° off, so that fast etching planes appear.

Sensor resistors, catalyst metals, AFM tips or other microstructures must be integrated with cantilevers. Whatever structures are made, they have to be processed before the silicon release etch because topology and topography do not allow lithography after release.

If the structures are aligned along the [100] direction (45° relative to wafer flat), instead of the usual flat direction [110], new possibilities arise. The basic situation is shown in Figure 20.12.



**Figure 20.10** Silicon nitride microclips for optical fiber positioning. Reproduced from Bostock *et al.* (1998) by permission of IOP



**Figure 20.11** Etched shape difference with mask polarity: upper row shows the mask patterns (oxide mask shown gray); lower row shows structures after a short anisotropic wet etch



**Figure 20.12** Orientation of structures on (100) wafer. Alignment to wafer flat leads to 54.7° angles and {111} sidewalls. Alignment of 45° relative to flat leads to {110} walls and {100} vertical walls result when rates of {110} relative to {100} fulfill conditions 20.2 and 20.3. Reproduced from Powell and Harrison (2001) by permission of IOP



**Figure 20.13** (a) An etched well with  $45^\circ$  slanted sidewalls in  $<100>$  wafer by  $45^\circ$  degree off-orientation. Reproduced from Strandman *et al.* (1995) by permission of IEEE. (b) Mesas with  $90^\circ$  angles in  $<100>$  wafer, before and after etch mask removal. Reproduced from Vazsonyi *et al.* (2003) by permission of IOP

For instance,  $45^\circ$  walls suitable for fiber coupling mirrors and  $90^\circ$  sidewall mesas can be made. These structures depend on the relative etch rates of (100) and (110) planes according to

$$\text{rate}\{100\}/\text{rate}\{110\} < 1/\sqrt{2} \quad 90^\circ \text{ walls} \quad (20.2)$$

$$\text{rate}\{100\}/\text{rate}\{110\} > \sqrt{2} \quad 45^\circ \text{ walls} \quad (20.3)$$

Condition (20.2) leads to vertical walls which are (100) planes, and condition (20.3) leads to  $45^\circ$  walls which are (110) walls. This is shown in Figure 20.13. Also shown are vertical (100) walls. However, severe undercut is unavoidable in order to make vertical walls in  $<100>$  silicon.

KOH etchant of 25–50% fulfills condition (20.2), and KOH–IPA solution is an example of condition (20.3). When the rate condition is close to the limit values, as with <25% TMAH, inadequate stirring or some other disturbance can lead to unexpected changes in final shapes.

## 20.6 Convex Corner Etching

The etch rate of (100) planes is high relative to (111) planes. When simple concave shapes are etched, the fast etching planes will disappear and the slow etching (111) planes will dominate in the final structure. The fastest etching planes, usually (110) and some high-index planes like (311), are not present in the simple rectangular wells, channels and nozzles, which have only concave  $90^\circ$  inside corners. Convex corners reveal these high-etch-rate

planes, and etching under the mask takes place. The time evolution of a convex corner shape is shown in Figure 20.14. The etched shape is initially determined by the fast etching planes, but the slow etching (111) planes will finally be limiting the structures.

Depending on initial mask size and etch time, either flat mesas or sharp silicon needles will be produced (Figure 20.15).

Nitride bridges, membranes and cantilevers, as well as  $p^{++}$  doped silicon structures, are made by corner undercutting (Figure 20.16).

### 20.6.1 Corner compensation

We noted above that convex corners are dominated by (311) planes (Figure 20.15). In many designs it would be very useful to have sharp corners. This is possible with a little extra effort in mask design by adding compensation structures, shown in Figure 20.17. The fast etching planes start to erode at convex corners. But the final convex corner is protected by this sacrificial structure so that after the compensation structure has been etched away, a rectangular corner remains.

Timing is the difficult part: if etching is stopped too early, a peak remains on the corner. Overetching leads to a structure with an undercut corner, similar to the non-compensated case, but with less undercut. Even though this method looks perfect in two dimensions, it leaves some small  $<311>$  surfaces in three dimensions. Another shortcoming of this method is that it can take a lot of space to form these compensation structures.



**Figure 20.14** Time evolution of convex corner undercutting. Reproduced from Shikada *et al.* (2001) by permission of Springer



**Figure 20.15** Left: etching a silicon mesa. Reproduced from Schröder *et al.* (2001), copyright 2001, by permission of IEEE. Right: silicon pyramid. Reproduced from Wilke *et al.* (2005) by permission of IOP



**Figure 20.16** Cantilever and bridge structures by front-side etching. Underetching from convex corners is used, with structures aligned to [110] main axes on a wafer. Simple rectangular holes along [110] axes result in V-grooves only. Reproduced from Ma *et al.* (2009) by permission of IOP



**Figure 20.17** (a) Different designs for corner compensation. (b) Optical microscope image of a compensated corner after etching. Courtesy Ville Voipio and Seppo Marttila, Aalto University



**Figure 20.18** Silicon membranes: (a) timed; (b) etch stop; (c) SOI membranes

## 20.7 Membrane Fabrication

There are two basic membrane (diaphragm) structures: either the membrane is made of a deposited film, or it is made of single crystal silicon. In the first case etching is quite simple: all silicon is removed, and the thin film remains. There are two main considerations for the membrane material: it has to be (slightly) tensile stressed because a compressively stressed film would buckle, and too high a tensile-stressed film would crack. The film has also to be resistant to alkaline etchants. Silicon nitride fulfills both requirements, and it is widely used. It is also electrically (and thermally) insulating so that resistors can be readily deposited on it, and it is optically transparent. (We use the word membrane for a continuous film: some workers in the fluidics field assume the membrane to be permeable, as in filters. We call such structures perforated membranes.)

Silicon membrane fabrication pictured in Figure 20.18a relies on timed etching but this is a very unsatisfactory approach, especially if thin membranes are needed. Something like  $40\text{ }\mu\text{m}$  is the thinnest that can reasonably be made by timed etching in a manufacturing environment.

Etch stop techniques ( $\text{pn}$  junction,  $\text{p}^{++}$ ) offer better thickness control, and SOI wafers are also amenable to very good membrane thickness control. Etch stops based on  $\text{p}^{++}$  and electrochemical techniques differ with respect to the final membrane properties: the  $\text{p}^{++}$  membrane is highly doped and highly stressed (because boron is a small atom and the lattice has contracted due to the high concentration of boron), so it cannot be used for electrical devices and it is no good for mechanical devices either. It can be used for passive structures, however. Standard  $\text{p}^{++}$  etch stop has two variants: either the  $\text{p}^{++}$  layer is made by diffusion (or implantation) or it is an epitaxial layer. Because

the doping levels required for etch stop are very high indeed, diffusion  $\text{p}^{++}$  is limited to very thin membranes.

Alternatively, Si:Ge:B etch stop can be used: germanium, a big atom, is introduced to compensate for the small boron atoms. Electronically nothing changes because germanium is isoelectronic with silicon. Si:Ge:B etch stop layers are always grown by epitaxy. Usually a structural layer with freely chosen doping is epitaxially grown on top of the Si:Ge:B etch stop layer. This gives a lot of freedom, but epitaxy is an expensive process.

The  $\text{pn}$  junction etch stop has the same variants as  $\text{p}^{++}$  as far as doping methods are concerned, that is diffusion and epitaxy, therefore the same thickness limits. Additionally, the n-layer has to be electrically contacted, and this contact has to be protected from alkaline silicon etchant. Holders of various designs have been invented, with the drawback that part of the wafer front side is used for sealing the holder, leading to silicon real-estate loss, sometimes up to 20% fewer chips than in free etching.

SOI wafers offer an elegant but somewhat expensive way of making membrane structures. The buried oxide



**Figure 20.19** Metallized nitride membrane to be bonded to a backplate chip to make a microphone. Adapted from Kwon and Lee (2007)

of SOI acts as an etch stop layer, leaving the SOI device layer untouched by the etch process. Bonded SOI device layer thicknesses are usually specified at about 10%, so that a 10  $\mu\text{m}$  membrane with  $\pm 1 \mu\text{m}$  thickness variation results.

Microphones are pressure sensors too, and many structural features of the two are identical. The microphone of Figure 20.19 consists of a metallized silicon nitride membrane and an acoustic backplate. The air gap is defined by wet etching (9  $\mu\text{m}$ ) plus Au/Sn solder thickness (3  $\mu\text{m}$ ). Acoustic holes are defined by DRIE (150  $\mu\text{m}$  deep, 60  $\mu\text{m}$  in diameter). Thin and large membranes and small gaps equal sensitive microphones, but small gaps are prone to stiction during fabrication and problematic because of condensing water.

The thermal pressure sensor shown in Figure 20.20 has a number of important points on membrane structures. The top wafer has two thin silicon supports and a large silicon mass (boss) etched in it. The device responds to pressure because of the thin support beams, and it will move downward retaining parallelism with the lower wafer because the large boss does not bend. Comparison of thin membrane and thick bossed membrane was shown in Figure 17.14. Second, the thick boss will act as a heat sink (due to the excellent thermal conductivity of silicon). The central wafer has a nitride membrane. The heater resistor and thermopile detector are patterned on it. The fact that nitride is an electrical and thermal insulator is crucial. The heat produced by the resistor is lost to the gas, and when pressure reduces the gap, heat



**Figure 20.20** Thermal pressure sensor: flexible silicon heat sink will bend when pressure is applied, and the narrower gap enhances heat conduction from the heater resistor to thermopile detector



**Figure 20.21** Nitride membrane with silicon heat spreader island, with the two photomasks used in its fabrication. Adapted from Briand *et al.* (2000)

transfer to the boss is enhanced. Temperature is sensed by the thermopile detector. Similar bossed design is also used in capacitive pressure sensors.

A silicon boss can be made for thermal uniformity reasons in a micro hotplate sensor (Figure 20.21). The first mask defines the silicon island size (which depends on both mask dimensions and initial wet etch depth) and the second mask defines the size of the membrane (together with wafer thickness).

Membrane devices can be made in three basic ways:

1. Membrane first, thin films afterward.
2. Thin films first, membrane etching at the end.

3. Partial wafer etching first, then thin films, then final silicon removal at the end.

The first method is easy in the sense that the harsh KOH (or TMAH) etching step is already done, and the chemical stability of thin films need not be considered. The drawback is that the wafer becomes rather fragile when the membrane is made, and the larger the membrane, the bigger the concern. This applies to processing in general (spin coating, high temperature steps, etc.) and thin-film stresses in particular: high stresses may damage the membrane. The “membrane-first” approach was used to make the membrane chip of the microphone in Figure 20.19. Electrode metallization (Cr/Ni/Au) and solder bonding metals (Au/Sn) were deposited afterward.

The membrane-last approach necessitates some sort of protection of the structures processed on top of the wafer. This protection may be polymer coating, but photoresist is not chemically tolerant. Instead, parylene, fluoropolymers, or black wax can be used. Other protective layers include the same materials that are used as etch masks, namely oxide and nitride. Instead of protective films, a single wafer holder can protect the top side during back-side KOH/TMAH etching. This approach leads inevitably to some loss of effective area on the wafer front. But the holder can provide other functions, like electrical contacts in electrochemical etching or the etch stop technique, which can compensate the costs. The membrane-last approach was used in the micro hotplate device of Figure 20.22. However, the chemically sensitive film was deposited after membrane formation because it does not tolerate KOH, unlike platinum, oxide and nitride.

The mixed approach tries to combine the best of both worlds: mechanical stability offered by a silicon



**Figure 20.22** Membrane-last micro hotplate sensor

“backbone” during processing, and shorter final etch time, which means lesser protection requirements for the front-side devices. Then, for instance, simple PECVD oxide can serve as a protective coating.

## 20.8 Through-Wafer Structures

A nozzle is a basic through-wafer structure. It can be done by single-sided lithography and etching: the nozzle size is determined by the mask size, wafer thickness and silicon crystal geometry. The condition for zero nozzle orifice on a  $<100>$  wafer is given by Equation 20.1. This simple process is very sensitive to wafer thickness variations and it is not practical for making small nozzles.

In the microthruster of Figure 11.7 the size of the nozzle depends critically on etch timing: in order to achieve inlet shapes as drawn, etching must be stopped immediately when the top and bottom etching fronts meet in the middle of the wafer, otherwise the structures start widening as shown in Figure 20.23. So, even though  $<100>$  silicon usually results in  $54.7^\circ$  angles, vertical walls can be obtained in  $<100>$  etching. The SEM micrograph



**Figure 20.23** Time evolution of etching through  $<100>$  silicon from two sides simultaneously



**Figure 20.24** Accelerometer by wet etching of <100> silicon. Courtesy VTI Technologies



**Figure 20.25** Hollow silicon tube for densitometer, double masking with thermal oxide and LPCVD nitride. Reproduced from Najmzadeh *et al.* (2007) by permission of IOP



**Figure 20.26** Cantilever biosensor. Adapted from Yue *et al.* (2004)

of Figure 20.24 shows what a wet-etched accelerometer looks like when etching is stopped at the right moment.

Through-wafer etching is often done in steps: etching some structures, doing further processing and continuing etching later on. In the densitometer of Figure 20.25 fusion bonding has been done on partially etched wafers. Wet etching was then continued, to create hollow tubes for liquid density measurements.

In the cantilever biosensor of Figure 20.26, etching from two sides simultaneously is used to increase the apparent etch rate. The etched shapes are not critical to device operation, and the functionality is in the gold-coated LPCVD nitride cantilever: antibodies are attached to the gold-coated cantilever via gold–sulphur (thiol) bonds. When antigens bind to antibodies, surface stress increases and the cantilever bends. This bending is detected optically. Silicon is very suitable for this application for many reasons: it can withstand LPCVD nitride deposition conditions, it can be etched selectively, it can be bonded to glass (and to PDMS) and its surface is suitable for fluidics.

### 20.8.1 Membranes with perforations

Perforated nitride membranes can be made by double-sided lithography and anisotropic wet etching. Top-side lithography determines the size for perforations. Plasma etching of nitride exposes silicon to wet etchant. Inverted pyramids will form in silicon, and etching will grind to a halt. But because (111) planes have a small but finite etch rate, the inverted pyramids will eventually merge,



**Figure 20.27** Perforated nitride membranes: simple and doubly supported designs. Adapted from Kuiper *et al.* (2000)

and etching then continues. This could be helped by designing filter openings not along the main crystal axes, but slanted. However, this strategy might still not work: hydrogen evolution may prevent etchant from entering small cavities. Back-side etching is therefore needed, too. Nitride is a very advantageous material for this application because of its very high etch tolerance in KOH and TMAH. Nitride is also mechanically a good material, but large membranes and high pressures pose problems. The formation of dual support bars (Figure 20.27) enables large-area membranes with high aperture ratios (high percentage of open area).

## 20.9 <110> Etching

Silicon of <110> orientation offers an interesting possibility anisotropically to wet-etch perfectly vertical walls when the mask is aligned so that slow etching (111) planes form the sidewalls. However, just as in the case of <100> silicon etching, the relative rates of different crystal planes can be changed by etchant concentration and temperature. It is possible to find conditions where a square bottom profile can be achieved, for instance KOH (23% wt)-H<sub>2</sub>O-isopropanol (10–15% wt) at 85 °C or 30% KOH at 70 °C give the etch profile shown in Figure 20.28.

Under other etch conditions a self-limiting shape, a U-groove, is encountered, for instance with 40% KOH at 70 °C (Figure 20.29). U-grooves are self-limiting just like V-grooves on (100) wafers, when planes that etch slower than (110) appear.

Etching will proceed until the six slow etching (111) planes meet. The self-limiting depth  $D$  for U-grooves



**Figure 20.28** Rectangular groove bottoms in KOH-IPA etching of <110> silicon. Reproduced from Dwivedi *et al.* (2000) by permission of Elsevier



**Figure 20.29** Etching of <110> silicon: slow etching (111) planes form vertical sidewalls. Depending on etchant concentration, composition and temperature, slow etching planes start to limit the groove

for initial mask opening sizes  $a$  and  $b$  (Figure 20.30) is given by

$$D = \frac{(a + \sqrt{2} \times b)}{2 \times \sqrt{6}} \quad (20.4)$$

A major limitation of vertical walled structures on (110) silicon is that only diamond-shaped structures will have all four walls vertical. Diamonds have 70.5° and 109.5° angles. Rectangular shapes will turn into hexagons, but diamonds oriented with crystal axes will retain their shape in the etching process.

There is no fundamental limitation as to size of wet-etched structures. In practice minimum size may be limited by lithography or by crystal plane etch selectivity. With 100 nm initial size and 100:1 etch selectivity, a



**Figure 20.30**  $<110>$  etched shapes: solid lines indicate mask openings, dotted lines final etched shapes. Diamonds oriented along major crystal axes retain their shape

100 nm undercut appears after etching 10  $\mu\text{m}$  deep, but finite crystal plane selectivity can be designed in, and in fact structures with aspect ratios greater than 100:1 have been made.

Chemical microreactors form a broad class of micro-fabricated devices, usually with various chamber/flow channel geometries. The hydrogen separation device shown in Figure 20.31 is one example of the benefits that microfabrication has to offer. Hydrogen can diffuse through palladium–silver thin film, but other gases cannot. The reactor is fabricated on  $<110>$  silicon, with etching stopping on the oxide. The oxide is etched away to reveal



**Figure 20.31** A microreactor for hydrogen separation. KOH etching of  $<110>$  silicon stops on oxide, HF etching removes oxide, and palladium membrane remains. Pd allows hydrogen diffusion but blocks all other gases. Adapted from Tong (2003)

the Pd–Ag membrane. The flow channels are formed by anodic bonding. Defect-free thin metal membranes can be made reproducibly because fabrication takes place in a cleanroom, and because the silicon surface is extremely flat and smooth. Higher separation selectivity between hydrogen and other gases is possible because thin, yet defect-free, membranes do not leak. Moreover, the membranes tolerate high pressures because silicon etching can be used to leave silicon support struts as desired. This enables higher pressures and higher gas fluxes.

## 20.10 $<111>$ Silicon Etching

$<111>$  silicon wafers cannot be etched in KOH because (111) planes are the slow etching planes. If, however, initial trenches are opened by plasma etching, other crystal planes will be exposed. The depth of the structure is determined by the initial plasma etch step because the bottoms are (111) planes just like the wafer surface and they do not etch further in KOH.

The sixfold symmetry which was seen in the vertex view of silicon crystal (Figure 4.6) is evident in  $<111>$  wafers (Figure 20.32). Triangular and hexagonal patterns will retain their shapes if oriented along the proper crystal planes. The sidewalls will be either 70.5° or 90°. Structures with concave corners will result in hexagonal final shapes as (111) planes meet.

Free-standing thin-film structures can be made by etching an initial release hole, and then continuing with anisotropic wet etching. Complete undercutting leads to free-standing structures not unlike those made on (100) silicon. However, lateral undercutting in some directions is fairly large, as shown in Figure 20.33.

If free-standing silicon bridges and beams need to be made (Figure 20.34), multiple etch and deposition steps are needed.

### Process flow for $<111>$ silicon microbridge

1. Oxide mask
2. First silicon RIE
3. CVD oxide
4. Oxide RIE
5. Second silicon RIE
6. Silicon wet etching
7. Oxide removal



**Figure 20.32**  $<111>$  silicon crystal planes. Note the hexagonal symmetry. Not all walls are bound by slow etching (111) planes. Reproduced from Park *et al.* (1999) by permission of Institute of Pure and Applied Physics



**Figure 20.33** Etching of  $<111>$  silicon bridge: two rectangular pattern openings are undercut, and etching will proceed until slow etching (111) planes are met. Undercutting to the left and right of the bridge is large compared to bridge width. Reproduced from Park *et al.* (1999) by permission of Institute of Pure and Applied Physics

This microbridge can be of any doping type, while (100) silicon bridges are made either by  $p^{++}$  or  $pn$  etch stop or by timed etching. The bridge thickness of a (111) bridge is determined by the first silicon RIE step, and release gap thickness by the second silicon RIE step. The depths of RIE steps are not very accurate but because the bridge roof and ceiling are slow etching (111) planes, surface quality is excellent.

## 20.11 Comparison of $<100>$ , $<110>$ and $<111>$ Etching

If an initial trench has been etched in the wafer by anisotropic plasma etching (i.e., vertical sidewalls, Figure 20.35), anisotropic wet etching will proceed until slow etching (111) planes are encountered. On a (100) wafer this will result in a rhombohedral structure with  $54.7^\circ$  angles. On a (110) wafer the flat bottom will be further etched and, depending on relative etch rates in the etchant in question, either a flat bottom remains or a U-groove sets in. On (111) wafers either vertical or slanted walls will result, depending on pattern orientation.

Anisotropic wet etching of silicon has served the MEMS community well for decades and is still going strong. There are of course applications where (D)RIE excels, and wet etching is no substitute, but the opposite is also true in certain applications. Batch processing with excellent throughput, uniformity and surface quality is a strong point of wet etching. Exact slope angles determined by silicon crystal planes cannot be matched by DRIE: obtaining perfectly vertical walls by DRIE is only approximate and often a wet etch step is done to reveal the vertical crystal planes, as will be shown in Figure 21.24. Wet etching is also used to smooth surfaces after DRIE.

Integrating wet etching into a process flow requires a number of issues to be tackled: mask oxide thickness is on the limit of practical thermal oxides for through-wafer etching with KOH, but then TMAH can be used. However, the two are not direct substitutes for each other. Use of a silicon nitride etch mask usually solves masking



**Figure 20.34** Silicon bridges in (111) silicon. The first RIE defines silicon bridge thickness. A spacer is formed before the second RIE step which defines the release gap. The spacer protects the bridge during undercutting etch in KOH. Reproduced from Park *et al.* (1999) by permission of Institute of Pure and Applied Physics



**Figure 20.35** RIE vertical walled starting shape (dotted), followed by KOH wet etching: top, <100>; middle, <110>; bottom, <111> wafers

problems. DRIE will be discussed in the following chapter and Chapter 30 discusses MEMS applications at length.

## 20.12 Exercises

- What is the activation energy of <110> silicon etching in 30% KOH?

|  | 30  | 40  | 50   | 60 | 70 | 80  | 90  | 100 °C   |
|--|-----|-----|------|----|----|-----|-----|----------|
|  | 4.7 | 9.8 | 19.4 | 37 | 68 | 121 | 209 | 350 μm/h |

- KOH etch rates ( $\mu\text{m}/\text{h}$ ) for <100> silicon are given below. Calculate the activation energies for the different KOH concentrations!

|     | 40 °C | 60 °C | 80 °C | 100 °C |
|-----|-------|-------|-------|--------|
| 20% | 7.09  | 26.7  | 86.3  | 246    |
| 30% | 6.48  | 24.4  | 79    | 225    |
| 40% | 5.28  | 19.9  | 64.4  | 183    |
| 50% | 3.77  | 14.2  | 45.9  | 131    |

Data from Lang (1996).

- Silicon <100> wet etch rate in 25% KOH at 90 °C has been measured to be 2.5  $\mu\text{m}/\text{min}$ , and activation

energy was determined to be 0.61 eV (59 kJ/mol). If membranes 40  $\mu\text{m}$  thick are made in wafers 380  $\mu\text{m}$  thick, and etch bath temperature is controlled to  $\pm 1^\circ\text{C}$ , how does this affect membrane thickness control?

- In Exercise 1.5 pressure sensor membrane deflection was calculated. Compare this to the surface roughness of a KOH etched wafer.
- Micromechanical pressure sensor chips have diaphragms 40  $\mu\text{m}$  thick which are 1  $\times$  1 mm in area. How many such chips can be made on
  - 380  $\mu\text{m}$  thick 3 inch wafers?
  - 525  $\mu\text{m}$  thick 100 mm wafers?
  - 675  $\mu\text{m}$  thick 150 mm wafers?
- Nozzles are fabricated by etching anisotropically through a <100> silicon wafer 380  $\mu\text{m}$  thick. A mask pattern 540  $\mu\text{m}$  wide is used. Calculate the size of holes produced by an ideal process. Then calculate the effect of the following real-world uncertainties:
  - wafer thickness variation  $380 \pm 5 \mu\text{m}$
  - total thickness variation (TTV) of 1  $\mu\text{m}$
  - <100>:<111> crystal plane selectivity 33:1 vs. 30:1.

7. If a piezoresistive pressure sensor membrane is made in an epitaxial layer, and diaphragm etching is stopped by a pn junction etch stop, how do the following affect membrane thickness:
- (a) wafer thickness
  - (b) wafer TTV
  - (c) epitaxial layer thickness?
8. Detail all the fabrication steps of the AFM cantilever–tip of Figure 20.2.
9. Detail the process sequence for making the device shown in Figure 20.7.
10. What is the angle between the (111) and (311) planes shown in Figure 20.29?
11. Design “corner compensation” structures that will result in a circular hole in a wet etched <100> wafer.
12. Design a process and mask set for the fabrication of silicon bridges on (110) wafers.
13. Calculate the aperture ratios (percentage of open area) in the two filter designs of Figure 20.27.
14. The deflection of a circular membrane under pressure is given by  $h = 0.666(r^4\Delta p/Et)^{1/3}$ , where  $r$  is diaphragm radius,  $t$  thickness and  $E$  Young’s modulus. What deflection corresponds to 25 mtorr pressure difference? What is the corresponding capacitance change?
15. How small an opening can you make by wet etching in (a) bulk wafers and (b) SOI wafers for the structure shown below? Assume top-side etched depth to be 10  $\mu\text{m}$ .



16. How is nozzle shape control different below if etching is done (a) from both sides simultaneously, (b) from the top side first, then thermally oxidized and etched from the back side?



17. How would you fabricate the optical fiber positioning device shown below? Draw photomasks and cross-sections after each silicon etch step.



Reproduced from Hoffman and Voges (2002) by permission of IOP

18. Design a process and calculate the mask dimensions for a  $500 \times 500 \mu\text{m}$  nitride membrane with a  $300 \times 300 \mu\text{m}$  silicon island, 20  $\mu\text{m}$  thick, for the heat spreader structure of Figure 20.21.
19. Estimate the dimensions of structures in the accelerometer of Figure 20.5!

## References and Related Reading

- Bostock, R.M. *et al.* (1998) Silicon nitride microclips for the kinematic location of optic fibres in silicon V-shaped grooves, *J. Micromech. Microeng.*, **8**, 343–360.
- Briand, D. *et al.* (2000) Design and fabrication of high-temperature micro-hotplates for drop-coated gas sensors, *Sens. Actuators*, **B68**, 223–233.
- Brida, S. *et al.* (2000) Microstructures etched in doped TMAH solutions, *Microelectron. Eng.*, **53**, 547–551.
- Collins, S.C. (1997) Etch stop techniques for micromachining, *J. Electrochem. Soc.*, **144**, 2242.
- Dwivedi, V.K. *et al.* (2000) Fabrication of very smooth walls and bottoms of silicon microchannels for heat dissipation of semiconductor devices, *Microelectron. J.*, **31**, 405.
- Elwenspoek, M. and H. Jansen (1998) **Silicon micromachining**, Cambridge University Press.
- Fang, W. (1998) Design of bulk micromachined suspensions, *J. Micromech. Microeng.*, **8**, 263–271.
- Hannemann, B. and J. Fruhauf (1998) New and extended possibilities of orientation dependent etching in microtechnics, Proceedings of IEEE MEMS’98, p. 234.
- Hoffmann, M. and E. Voges (2002) Bulk silicon micromachining for MEMS in optical communication systems, *J. Micromech. Microeng.*, **12**, 349.

- Kovacs, G.T.A. *et al.* (1998) Bulk micromachining of silicon, *Proc. IEEE*, **86**, 1543.
- Kuiper, S. *et al.* (2000) Wet and dry etching techniques for the release of sub-micrometre perforated membranes, *J. Microelectromech. Syst.*, **10**, 171–174.
- Kwon, H.-S. and K.-C. Lee (2007) Double-chip condenser microphone for rigid backplate using DRIE and wafer bonding technology, *Sens. Actuators*, **A138**, 81–86.
- Lang, W. (1996) Silicon microstructuring technology, *Mater. Sci. Eng.*, **R17**, 1–55.
- Lindroos, V. *et al.* (2010) **Handbook of Silicon Based MEMS Materials and Technologies**, Elsevier.
- Ma, T., Y. Liu and T. Li (2009) A <100> direction front-etched membrane structure for a micro-bolometer, *J. Micromech. Microeng.*, **19**, 035022.
- Najmzadeh, M., S. Haasl and P. Enoksson (2007) A silicon straight tube fluid density sensor, *J. Micromech. Microeng.*, **17**, 1657–1663.
- Oosterbroek, R.E. *et al.* (2000) Etching methodologies in <111>-oriented silicon wafers, *J. Microelectromech. Syst.*, **9**, 390.
- Pal, P., K. Sato and S. Chandra (2007) Fabrication techniques of convex corners in a (100)-silicon wafer using bulk micromachining: a review, *J. Micromech. Microeng.*, **17**, R111–R133.
- Park, S. *et al.* (1999) Mesa-supported, single-crystal microstructures fabricated by the surface/bulk micromachining process, *Jpn. J. Appl. Phys.*, **38**, 4244.
- Powell, O. and H. Harrison (2001) Anisotropic etching of {100} and {110} planes in (100) silicon, *J. Micromech. Microeng.*, **11**, 217.
- Sasaki, M. *et al.* (2000) Anisotropically etched Si mold for solid polymer dye microcavity laser, *Jpn. J. Appl. Phys.*, **39**, 7145.
- Sato, K. *et al.* (1998) Characterization of orientation-dependent etching properties of single-crystal silicon: effects of KOH concentration, *Sens. Actuators*, **A64**, 87–93.
- Schröder, H. *et al.* (2001) Convex corner undercutting of {100} silicon in anisotropic KOH etching: the new step-flow model of 3-D structuring and first simulation results, *J. Microelectromech. Syst.*, **10**, 88–97.
- Seidel, H. *et al.* (1990a) Anisotropic etching of crystalline silicon in alkaline solutions I, *J. Electrochem. Soc.*, **137**, 3612.
- Seidel, H. *et al.* (1990b) Anisotropic etching of crystalline silicon in alkaline solutions II, *J. Electrochem. Soc.*, **137**, 3626.
- Shikida, M. *et al.* (2000) Differences in anisotropic etching properties of KOH and TMAH solutions, *Sens. Actuators*, **80**, 179.
- Shikida, M. *et al.* (2001) A new explanation of mask undercut in anisotropic silicon etching: saddle point in etching rate diagram, *Proceedings of Transducers'01*, p. 648.
- Strandman, C. *et al.* (1995) Fabrication of 45° degree mirrors together with well-defined V-grooves using wet anisotropic etching of silicon, *J. Microelectromech. Syst.*, **4**, 214.
- Tong, H.D. (2003) Microfabrication of palladium–silver alloy membranes for hydrogen separation, *J. Microelectromech. Syst.*, **12**, 622–629.
- Vazsonyi, E. *et al.* (2003) Anisotropic etching of silicon in a two-component alkaline solution, *J. Micromech. Microeng.*, **13**, 165.
- Wilke, N. *et al.* (2005) Process optimization and characterization of silicon microneedles fabricated by wet etch technology, *Microelectron. J.*, **36**, 650–656.
- Wong, S.S. *et al.* (1992) An etch stop utilizing selective etching of n-type silicon by pulsed potential anodization, *J. Microelectromech. Syst.*, **1**, 187.
- Yue, M. *et al.* (2004) A 2-D Microcantilever array for multiplexed biomolecular analysis, *J. Microelectromech. Syst.*, **13**, 290–299.

# Deep Reactive Ion Etching

Reactive ion etching (RIE) (which we will use as a synonym for plasma etching) opens up many possibilities which cannot be done by wet etching. Practically any size and shape can be made by RIE. All sorts of mechanical devices, like springs, beams and plates, are fabricated by plasma etching, as are vertical capacitors and mirrors, in-plane and out-of-plane microneedles and nozzles, and fluidic filters and separation devices.

DRIE is an extension of RIE for deep etching, but depth is very different from application to application. For example, in SOI MEMS there is a need to etch the SOI device layer, which can be 5–50 µm, but in many cases also the handle wafer needs to be etched, that is 300–500 µm. While typical RIE rates are on the order of 0.1–1 µm/min, DRIE rates are 2–20 µm/min. Even higher rates have been reported, but then other criteria have been sacrificed: side-wall profile control is poor and high rates are only available for designs which have small etchable area.

This chapter concentrates on silicon DRIE processes, with some basic structures discussed. Comparison is often made to anisotropic wet etching because sometimes DRIE and KOH can both be used, and selection has to be reasoned. More advanced device examples will be found in Chapter 30 on MEMS integration.

## 21.1 RIE Process Capabilities

Small linewidths can be made by RIE. If lithographers have been able to print something on the wafer, etch engineers have always been able to etch those patterns. RIE was adopted in IC fabrication at about 3 µm linewidths, and it has been scaled down to current generation 45 nm processes. RIE is therefore suitable for nanostructure fabrication.

Structures of high aspect ratio can be made (Figure 21.1). Pillars and holes can be very deep and narrow, with 100:1 record structures used in DRAM

trench capacitors (10 µm deep, 100 nm wide). In MEMS through-wafer structures 10 µm wide and 400 µm deep have been made. Making structures of high aspect ratio is by no means easy, and etch rate slows down in such deep cavities, as will be discussed later in this chapter.

Combining the narrow gap and high aspect ratio capabilities, RIE enables comb-drive devices. Comb drives are widely used as sensors and actuators in microsystems. Basically a comb drive is a vertical capacitor with moving and fixed capacitor plates. The principle is shown in Figure 21.2 and a SEM micrograph of it is shown in Figure 21.3. All the capacitor fingers are undercut, but every other one is connected to a fixed anchor and every other to a movable plate (more on this in Chapter 29 on surface micromachining).

A comb drive can be used as a capacitive sensor, with its capacitance given by Equation 21.1 and force by Equation 21.2. DRIE is important in many ways, because it can be used to fabricate narrow gaps, spacing  $d$  is small and the number of finger pairs,  $n$ , can be made large and, because deep structures can be etched, height  $h$  and area  $A$  will be large:

$$C = \frac{n\epsilon A}{d} \quad (21.1)$$

$$F = \frac{1}{2}CV^2 = \frac{n\epsilon A}{2d}V^2 \quad (21.2)$$

In sensor applications the gap between the capacitor plates changes due to some external force, for example acceleration (in accelerometers and gyroscopes) or gravity (inclinometers). High sensitivity is possible because of large capacitor area. When a comb drive is used as an electrostatic actuator, the narrow gap means low actuation voltages, and large area equals large forces. Long actuation distances of tens of micrometers can be made, and this is used for example in the laterally moving optical attenuator of Figure 21.4. DRIE is essential also in fabricating



**Figure 21.1** Nanopillars (left) and micropillars by DRIE (right). Courtesy Nikolai Chekurov Aalto University, and Kai Kolari, VTT



**Figure 21.2** Comb-drive actuator

the vertical mirror which is inserted in the optical path. A variable optical attenuator (VOA) has been made by two  $45^\circ$  mirrors, one fixed, one movable (Figure 21.4). DRIE is essential for both the vertical wall and  $45^\circ$  angle.

The third major difference to wet etching is shape freedom: RIE will truthfully replicate any shape that has been patterned, irrespective of wafer crystal orientation. This opens up endless opportunities for device design. In making watch springs and other mechanical structures, the capability to fabricate variable cross-sections opens up new possibilities to fine-tune mechanical features, unlike traditional machining (Figure 21.5).

RIE produces vertical walls. This is true of anisotropic wet etching of  $<110>$  silicon as well, but there are limitations in wet etching: it is only possible along certain crystal planes, and cannot for example make two mirrors that are  $90^\circ$  to each other. With RIE, any shapes can



**Figure 21.3** SEM micrograph of a comb-drive actuator (left) and vertical mirror at the end of the actuator (right). Reproduced from Acar and Shkel (2005) by permission of IOP and from Yun *et al.* (2006), copyright 2006, by permission of Elsevier



**Figure 21.4** Variable optical attenuator (VOA) with fixed and movable  $45^\circ$  vertical mirrors (left); optical fibers inserted into VOA (right). Reproduced from Kim and Kim (2005) by permission of IOP



**Figure 21.5** Mechanical parts for watches. Courtesy CSEM

be made, as the gyroscope of Figure 21.6 reveals. RIE walls are seldom exactly  $90^\circ$ , but for example  $90 \pm 2^\circ$ , which may, or may not, be critical. In the case of vertical capacitors (and the gyroscope is one) and mirrors,  $90^\circ$  sidewalls are a must. Negative angles ( $>90^\circ$ ), also called retrograde, are problematic in many cases, for example in micromolding and casting, see Figure 18.20.

Micro-optics offers a wealth of examples where the special qualities of DRIE are utilized. The shape freedom is used to fabricate curving and spiraling waveguides (Figure 21.7). Sometimes the radius of curvature is as small as  $5\text{ }\mu\text{m}$ , so the small feature size capability of DRIE is important. There are many applications where subwavelength sizes are needed, for example in antireflection “coatings” by artificial refractive index structures (Figure 21.7).

Another important issue in micro-optics is the sidewall quality. Most applications would prefer perfectly vertical

sidewalls, but sometimes slanted walls or continuous surface profiles are needed. Any roughness on the sidewalls is usually detrimental to optical performance, and DRIE is sometimes followed by additional smoothing steps (Section 21.6 below).

In order to increase device packing density, the front surface area should be reserved for active devices only. This is especially important for optical devices like solar cells and CMOS camera chips. One solution is through-silicon vias (TSVs), making contacts on the back of the wafer, and having vias through the wafer to devices on the front side. DRIE and high aspect ratios are involved, but the etched depth is typically only  $30\text{--}100\text{ }\mu\text{m}$ , depending on whether etching is done from the front or back side. In all cases it is important that via profile is positively sloping, or at least not retrograde, because the vias need to be filled by a conductor later on. TSV will be discussed in Chapter 39.



**Figure 21.6** Gyroscope. Courtesy VTI Technologies

## 21.2 RIE Process Physics and Chemistry

Fluorine, chlorine and bromine etch silicon. Fluorine is chosen for MEMS DRIE because of its etch rate: etch rates an order of magnitude higher are possible with SF<sub>6</sub> than with Cl<sub>2</sub> or HBr. More fluorine can be generated at the same power level, and fluorine is very reactive. Fluorine, however, does not easily result in a perfectly anisotropic profile, but in many MEMS applications near-vertical profiles can be tolerated. In DRAM trench capacitor etching profile control is of paramount importance, and Cl<sub>2</sub> and HBr are used, and the etched depth is small enough so that a slower rate can be tolerated. Two major variants of MEMS DRIE processes exist: Bosch and cryogenic. Both take advantage of sidewall passivation mechanisms (Figure 11.5), but differ in many details.

In the Bosch process (named after the company which developed it) SF<sub>6</sub> and C<sub>4</sub>F<sub>8</sub> gases are pulsed: an SF<sub>6</sub>

pulse etches a few micrometers of silicon, but etching is not completely anisotropic. A C<sub>4</sub>F<sub>8</sub> pulse is then applied, and a fluoropolymer protective film is deposited all over the wafer. The next SF<sub>6</sub> etching pulse removes the polymer film from the trench bottom by ion-assisted etching, but the sidewalls do not experience ion bombardment and remain protected (though are slightly etched by fluorine radicals). After removing the protective film from the trench bottom, SF<sub>6</sub> etching of silicon can continue. The next C<sub>4</sub>F<sub>8</sub> pulse deposits a new protective film and then another SF<sub>6</sub> pulse is fed into the reactor. The pulsed operation leads to an undulating sidewall (shown schematically in Figure 21.8 and in the SEM micrograph of Figure 21.9 (right)). It is important to remove the fluoropolymer film completely at the end of the etching process because it can interfere with the following steps, for example by preventing wet cleaning via its hydrophobicity.

In cryogenic deep etching continuous SF<sub>6</sub>/O<sub>2</sub> flow is used at -120 °C. Etching proceeds vertically because a film of SiO<sub>x</sub>F<sub>z</sub> from the reaction products is deposited on the cooled feature walls, and it is removed from horizontal surfaces by ion bombardment, leaving the sidewalls protected. Lateral etching by spontaneous chemical reaction is also suppressed by low temperature as predicted by the Arrhenius law (Equation 1.1). The smooth sidewall of the cryogenic process is contrasted to the undulating Bosch profile in the SEM micrographs of Figure 21.9.

Both the Bosch and cryogenic processes are low-pressure, high-density plasma processes. Various process parameters can be used to fine-tune DRIE processes, as shown in Figure 21.10. Process inputs like pressure, RF power, flow rate and temperature influence ion density, energy and angular distribution; radical density, surface reaction probability and desorption probability are also determined by machine parameters. In the end we are not interested in ion angular distributions, but



**Figure 21.7** RIE in micro-optics: left, spiraling waveguides, reproduced from Xu *et al.* (2009) by permission of Optical Society of America; right, end-of-ridge waveguide antireflective facets, reproduced from Schmid *et al.* (2007) by permission of Optical Society of America



**Figure 21.8** Bosch process, etch and passivation pulses repeated: (a)  $\text{SF}_6$  isotropic etch step; (b)  $\text{C}_4\text{F}_8$  passivation layer deposition; (c) next etch step

in wafer-level measurable quantities, like etch rate, selectivities, sidewall angle and the like.

Some general guidelines can be given regarding process optimization, but process recipes are very much reactor dependent, and not easily transferred from one etcher to another. Higher RF power results in higher etch rate, as expected, but the etch rates of oxide and resist also increase, lowering selectivities, so some balance must be obtained. Lower pressure is good for anisotropy because at low pressure there are fewer ion collisions

and more directional ion bombardment. But if high flow rates are needed (and they are needed to provide enough etchant), high pumping capacity is mandatory. Temperature increase has to be prevented because the resist is degraded when temperatures approach and exceed  $100^\circ\text{C}$ . Passivation film thickness (either from  $\text{C}_4\text{F}_8$  source gas or from  $\text{SiO}_x\text{F}_y$  reaction products) is good for anisotropy, but the passivation film slows down etching, and there is a danger of overpassivation, which leads to residues, and in the extreme to silicon nanoglass, often known as black silicon (because it is an efficient absorber of light).

### 21.3 Deep Etching

Because DRIE etch rates are high, through-wafer etching is a standard step. This can be used to make holes, nozzles, membranes, cantilevers and others. Figure 21.11 shows a through-wafer etched triangular aligner for optical fiber. Its size is designed to match the fiber size, obviously, but the flexible fins allow for minor dimensional variations and ensure intimate contact. Holders of silicon nitride for fiber alignment made by wet etching were shown in Figure 20.10, but these were for in-plane fiber alignment, and wet etching could not be used to make a through-wafer version.

In Figure 17.2a a variable capacitor was shown. DRIE through-wafer etching releases the aluminum membrane and allows it to move. Aluminum is used as an etch mask, and in fluorine plasmas very high etch selectivity is obtained against aluminum because  $\text{AlF}_3$  is involatile. If the variable capacitor were made by wet etching, the process flow would need to be completely different because aluminum does not tolerate KOH or TMAH. Making the aluminum membrane would then need to be done after silicon etching.



**Figure 21.9** Silicon microstructures by cryogenic (left) and Bosch processes (right). Courtesy Ali Shah, Aalto University and Mikael Sterner and Joachim Oberhammer, KTH Royal Institute of Technology



**Figure 21.10** The interdependence of reactor parameters to plasma parameters and etch responses on a wafer. Courtesy Lauri Sainiemi, Aalto University



**Figure 21.11** A holder for optical fiber. Reproduced from Holm *et al.* (2000), copyright 2000, by permission of Elsevier

Both the thin SOI device wafer and the thick handle wafer can be etched by DRIE. In the torsion mirror of Figure 21.12, all thin-film deposition and patterning steps on the SOI device layer are finished, and device layer silicon is then etched. Back-side lithography and handle wafer etching then commence. In this case all the critical structures are in the device layer, and back-side etching is non-critical, so both wet etching and DRIE can be used. BOX etching finally releases the torsion bars and combs and allows them to move.

## 21.4 Combining Anisotropic and Isotropic DRIE

As discussed in Chapter 11, anisotropic etching is the special case and isotropic profile is the default profile. The high rate offered by DRIE is beneficial also in isotropic etching. The combination of anisotropic and isotropic etching steps can be used to make free-standing single crystal silicon structures with vertical walls (Figure 21.13). The method relies on anisotropy/isotropy control in many steps.

### Process flow for suspended silicon bridges

- Oxide mask
- DRIE for the trench
- Oxide CVD
- Oxide RIE: bottom cleared, sidewalls remain
- Isotropic silicon etching

This method has many limitations, the most important being that the width of the released structure and its thickness are related, and the method is best suited for narrow and thick structures. The underside of the structure is not well defined, which may be a limitation in some applications.

Silicon bridges with excellent dimensional control can be achieved when SOI wafers are used (Figure 21.14).



**Figure 21.12** Schematic comb-drive torsional analog mirror: left, before SOI BOX etching; right, SEM micrograph after HF etching. Reproduced from Tsou *et al.* (2005) by permission of IOP



**Figure 21.13** Suspended silicon bridge fabrication: (a) oxide-masked DRIE; (b) CVD oxide deposition and anisotropic oxide etching; (c) isotropic etching of silicon

Thermal oxide masks the DRIE of SOI device layer etching. CVD oxide is then deposited. Next, anisotropic RIE of oxide leaves sidewalls protected, but CVD oxide on the bottom is etched. So far the bulk silicon and SOI processes are identical, but then oxide etching continues and BOX is etched, too. The oxide mask on device silicon is also consumed, so the three oxide thicknesses have to be carefully designed. After etching through BOX, the isotropic silicon etch process step releases the bridge. The bridge is protected by thermal mask oxide from the top, by CVD oxide on sidewalls and by BOX from the bottom, and the bridge dimensions are preserved during the isotropic silicon etch step. The stress test structure of Figure 2.2 was made with this process. Bridge doping is a free parameter, while in many wet-etched versions p<sup>++</sup> or pn etch stop techniques necessitate certain doping levels.



**Figure 21.14** Microbridge in SOI device layer: (a) mask oxide, SOI device silicon and BOX anisotropic etching with resist mask; (b) PECVD oxide; (c) oxide anisotropic etching; (d) SOI handle silicon isotropic etching. Redrawn from Sainiemi *et al.* (2009)

The same method has been used to create buried microchannels (Figure 21.15). Anisotropic silicon etching is used to etch a narrow trench, which is protected by CVD oxide. Isotropic silicon etching creates a circular channel. Conformal deposition closes the channel. Because thin-film deposition thicknesses are practically limited to a few micrometers, the initial trench opening has to be in the micrometer range.

## 21.5 Microneedles and Nozzles

In-plane microneedles can be made very long because silicon is strong. A length of a few millimeters is not unusual. Out-of-plane microneedles are limited to the wafer thickness, in practice a few hundred micrometers. There are needles and nozzles for biomedical applications (blood extraction and drug injection), for chemical applications (for sample droplet dispensing, spraying and vaporization), printers (ink jets, DNA array spotting), and for cooling, aerosol generation and cell probing.

The in-plane fluidic microneedle shown in Figure 21.16 makes use of DRIE's special capabilities, as it could not be fabricated by wet etching. The process flow is as follows. The buried channel is formed as described above using both anisotropic and isotropic DRIE, and conformal CVD. The fluidic reservoir part (along cut line AA') is very wide. It will be coated by CVD but not closed. Needle shape is determined by the next DRIE step. Chip size is defined on the same mask. Needle thickness is determined by the back-side DRIE release etch.



**Figure 21.15** Buried microchannels: (a) anisotropic DRIE, sidewall spacer formation and isotropic DRIE; (b) removal of spacer and conformal CVD. SEM micrograph from de Boer *et al.* (2000) by permission of IEEE



**Figure 21.16** In-plane microneedle: buried microchannel, Reproduced from Paik *et al.* (2004), copyright 2004, by permission of Elsevier

Deep structures can be etched easily, but lithography over deep steps is difficult. Double masking (or peeling or nested masks) can be used just as in wet etching (Figure 20.4). For example, oxide is grown and patterned, and a second pattern of resist is done on top (Figure 21.17). The resist serves as a mask during the first shallow DRIE step



**Figure 21.17** Nested mask: (a) oxide lithography and etching, followed by new resist spinning and lithography and silicon etching with the resist mask; (b) resist mask removal and second silicon etching with oxide mask

and is then stripped away. The oxide revealed underneath will act as a mask for the second, deeper DRIE step.

There are various ways of making nozzles with the through-wafer fluidic channel. In this section we will compare many different versions, with regard to lithography–etching interplay: the number of process steps differs, and masking and alignment requirements vary. In every case, we are talking about arrays of nozzles, because here DRIE excels over wet etching. In wet etching nozzle density is limited to about  $4/\text{mm}^2$  while DRIE nozzle arrays can have densities of  $100/\text{mm}^2$  easily, and  $1000/\text{mm}^2$  with some process tuning. The most common limiting factor is the through-wafer etching at high aspect ratio:  $10\ \mu\text{m}$  holes equal 40:1 aspect ratios, and most applications will settle for larger channels (even though the nozzles can be made very small).

The first approach uses three front-side lithography and DRIE steps (Figure 21.18). If the first DRIE etch (determining nozzle height) is shallow, resist spinning over topography can reasonably be done. The second lithography defines the nozzle itself. The third patterning



**Figure 21.18** Nozzle array with common fluid reservoir: three lithography and three DRIE steps. Redrawn from Yu *et al.* (2009)

on the wafer back side is non-critical for both lithography and DRIE: the structures are large, and alignment requirements are lax. All the nozzles will be squirting the same fluid because of the common reservoir but the density of nozzles is very high.

The DRIE requirements for the nozzles of Figure 21.19 are much more demanding than in the previous example. For 100 µm nozzles aspect ratios of 4:1 are needed in through-wafer DRIE. On the other hand, only two lithography steps are needed, and both of them are done on a planar surface. Alignment is critical to make the top and bottom channel openings coincide.

In both the above cases simple silicon nozzles sufficed, but if for instance a dielectric nozzle is needed, the process becomes a little more complex (Figure 21.20). This



**Figure 21.20** Oxide nozzle for patch clamp cell probing. After two-step DRIE with peeling masks: (a) thermal oxidation; (b) anisotropic oxide RIE; (c) anisotropic and isotropic silicon RIE; (d) back-side silicon DRIE and thermal oxidation. Adapted from Lehnert *et al.* (2002)



**Figure 21.19** Silicon nozzle array: (a) back-side DRIE; (b) front-side DRIE; (c) SEM micrographs of nozzles. Reproduced from Deng *et al.* (2006), copyright 2006, by permission of Elsevier

device is used in patch clamp cell membrane electrical studies, and the device has to have gigaohm resistance. A peeling mask process similar to Figure 21.17 is used to create the initial top pattern. The wafer is then thermally oxidized, and oxide RIE is performed. Oxide on the front surface is thicker than elsewhere, because there was mask oxide to begin with. The oxide on the sidewalls is preserved because oxide RIE is anisotropic, but the bottoms of etched features are cleared of oxide. Silicon DRIE is then continued, to form an oxide nozzle (Figure 21.21). Another oxidation step ensures dielectric isolation everywhere.

The side-opened microneedle (Figure 1.13) provides a sharp tip for effective piercing of the epithelium, but because the fluidic channel is not at the tip, it will not



**Figure 21.21** Oxide nozzles of previous figure as seen by a SEM. Reproduced from Lehnert *et al.* (2002), copyright 2002, American Institute of Physics

be easily clogged. The fabrication process is depicted in Figure 21.22. Front-to-back alignment is needed, but because device dimensions are fairly large, 100 µm, alignment is not difficult. The process flow is as follows:

### Process flow for the side-opened microneedle:

| Step                      | Comment                                                                                                                      |
|---------------------------|------------------------------------------------------------------------------------------------------------------------------|
| Thermal oxidation         | Both sides of wafer, protects front                                                                                          |
| Back-side lithography     | Non-critical lithography                                                                                                     |
| Oxide RIE on back         | Etch mask pattern                                                                                                            |
| DRIE from backside        | Fluid access channel                                                                                                         |
| Oxide etching in HF       | Oxide removed from both sides                                                                                                |
| Wet oxidation             | Front etch mask and etch stop in channel                                                                                     |
| Top-side lithography      | Needle lateral shape defined                                                                                                 |
| Oxide RIE on top          | Etch mask pattern, no back-side protection                                                                                   |
| DRIE sequence on top side | First isotropic shallow etch, 30 µm<br>Second anisotropic medium etch, 100 µm<br>Third isotropic narrowing etch, until oxide |
| Oxide etch                | HF etching opens channels and clears tip                                                                                     |



**Figure 21.22** Microneedle with a sharp tip for piercing and side-opened liquid channel. See also SEM micrograph of the finished needle in Figure 1.13. Reproduced from Griss and Stemme (2003), copyright 2003, by permission of IEEE

scalloping) in the Bosch process makes it unsuitable for many optical applications. There are ways to minimize undulation for example by using shorter pulses, which unfortunately results in a slower etch rate. The sidewall angle may also change, so undulation minimization will affect many other process responses.

High ion energy is useful because it can sputter away material, but it is non-selective etching. It may be beneficial in the early stages of etching to have a non-selective etch step which will remove everything, for example native oxide. But too high an ion energy starts to affect the shape. Sharp corners will be rounded by ion bombardment and deflected high-energy ions will lead to trenching at the feature bottoms, Figure 21.23d. Deflected ions in general will result in deviations about verticality, for instance barreling (bowing) of the profile (Figure 21.23b).

The sidewalls of DRIE etched walls are quite rough, with 50–100 nm RMS roughness typical. Optical scattering losses increase rapidly when roughness exceeds about 5% of wavelength (which corresponds to 20–30 nm for visible wavelengths). A combined plasma–wet process can provide benefits: wet smoothing after dry etching. A short KOH/TMAH/EDP etch step etches of course in crystal plane-dependent fashion and if suitable crystal planes are found, the surface quality will be equal to wet etching. These walls are very smooth and precisely oriented, as witnessed by their use as a master for polymer optical

## 21.6 Sidewall Quality

As a first approximation DRIE sidewalls are vertical (Figure 21.9) and this is important in many applications. Vertical capacitors and mirrors necessitate perfectly vertical walls. In an optical etalon (Exercise 30.13) the vertical mirrors should be perfectly parallel, otherwise the light reflected in between will rapidly leak out. There are many possible non-idealities. The first is a non-vertical but smooth sidewall, with a deviation of a few degrees from the vertical (Figure 11.4). A similar negative (retrograde) slope is also common (Figure 21.1 (left)) and they can be tuned to some extent by changing the process parameters. Other sidewall non-idealities include Bosch process undulation (Figure 21.9b) and roughness and bowing, which are shown schematically in Figure 21.23. The periodic undulation (also known as



**Figure 21.23** DRIE sidewall non-idealities: (a) retrograde profile; (b) barreling (bowing); (c) roughness; (d) faceting (top) and trenching (bottom)



**Figure 21.24** Bosch process ripple removed by wet etching in EPW, simultaneously ensuring vertical, crystal plane-defined walls. Reproduced from Sasaki *et al.* (2000) by permission of Institute of Pure and Applied Physics

device casting (Figure 21.24). This typically translates to 10–30 nm RMS roughness.

Smoothing is also important for mechanical strength: a surface with cracks and hillocks is mechanically weak because the irregularities act as starting points for fracture. A two-step dry-dry etching will help: after the DRIE step

an isotropic low-power plasma etch is done to remove the damaged surface layer. A factor of two increase in mechanical strength can be achieved. The role of lapping in removing grinding damage is analogous. Hydrogen annealing, which is used in Smart-cut SOI wafer fabrication (Section 22.5), smoothes out small irregularities, and it is a useful post-DRIE treatment too; for example, smoothing requires a 1000–1100 °C anneal for 10 minutes.

Post-etch smoothing can take many forms. Anisotropic wet etching obviously works better for rectangular shapes (on <100> silicon) and hexagonal shapes (on <111> silicon) than for circular shapes. Thermal oxidation plus HF wet etching is also possible (Figure 21.25). If the process is continued by LPCVD polysilicon deposition and poly oxidation, smoothness improves and simultaneously the etched hole can be made smaller. All these steps increase process complexity, so there has to be a strong motivation for such multistep smoothing/narrowing.

## 21.7 Pattern Size and Pattern Density Effects

Etch rates are very much dependent on particular process details, including etch reactor geometry, RF power coupling scheme, wafer temperature and gas flow rate. They also depend on device layout. First of all, the etch rate depends on etchable area: more area, slower rate, and vice versa. Second, the etch rate of small features is slower than that of large features. Third, etch rate slows down. This happens in high aspect ratio structures. All these phenomena necessitate a more detailed analysis of how the interactions of chemical and physical processes really work in etching.

The loading effect, or area-dependent reaction rate, is a common phenomenon in chemical reactions. For a process optimized for certain etchable area, the flow may not be high enough to supply reactants to keep an identical etch rate when area is increased. This is a major problem for ASIC manufacturers who face hundreds of different designs.

The loading effect is very general and operates in all etching processes. It manifests itself when reactions are under a mass transport/diffusion-limited regime. If the flow of reactants is insufficient, the rate cannot be maintained. On the other hand, if the reaction is controlled by surface reactions (i.e., there is ample supply of reactants, and the etching process is limited by surface reactions), the process does not exhibit loading effects.

Loading effects operate at various scales:

- In batch reactors the etchable area changes because the number of wafers changes.



**Figure 21.25** Reducing ripple in the Bosch process by postprocessing. Reproduced from Matthews and Judy (2006), copyright 2006, by permission of IEEE

- In single wafer reactors different chip designs have different etchable areas.
- The local pattern density on a chip is different in every design (microloading).

Microloading manifests itself as an etch depth difference between isolated and array features: there is more material to be etched in arrays, therefore the rate is slower. Identical trench widths will result in different etched depths depending on pattern density in the neighborhood (Figure 21.26). Other pattern dependences discussed below are deceptively similar, yet different.

### 21.7.1 RIE lag and aspect ratio-dependent etching

Plasma etching of structures of 1:1 aspect ratio is fairly straightforward, but somewhere around 2:1 aspect ratio a phenomenon known as RIE lag manifests itself: smaller features etch slower than larger features (Figure 21.26).

Gas conductance in deep narrow holes is low and the reactants simply cannot reach the bottom effectively (or the reaction products removed). Ion bombardment is also affected: ions experience sidewall collisions in deep structures, and the bombardment at the bottom is reduced. These effects lead to a reduced etch rate in deep structures of high aspect ratio. RIE lag can be seen from a single experiment with a test structure that contains many different linewidths. RIE lag is not related to RIE reactors; it is present in all plasma etching systems irrespective of actual reactor design.

Aspect ratio-dependent etching (ARDE) is a dynamic effect: the etch rate slows down as etching proceeds, for every linewidth! The basic mechanism for RIE lag and ARDE is the same. But in order to see ARDE, many wafers have to be etched, with different etch times. To appreciate the difference, see Figure 21.27 (left) where etch rate is plotted vs. etch time. As etching proceeds, the etch rate slows down. This is true for 2  $\mu\text{m}$ , 5  $\mu\text{m}$  and all other linewidths alike. ARDE is therefore seen as a downward-sloping line in Figure 21.27 (right). RIE lag, on the other hand, is the vertical displacement of the two lines: wider lines are higher up on the chart.

DRIE is fairly straightforward for structures with aspect ratios of 10:1 while 20:1 is much more demanding. And even though 40:1 has been demonstrated in the lab, it is not to be considered a standard fabrication step. For



**Figure 21.26** Microloading vs. RIE lag. Microloading results in different etched depths for identical lines, with lines in low pattern density areas resulting in deeper structures. RIE lag: smaller etch rate for narrower lines



**Figure 21.27** Raw data of etched depth vs. etch time for 2  $\mu\text{m}$  and 5  $\mu\text{m}$  features (left); plotted as rate vs. time (right), RIE lag is the vertical distance between the lines and ARDE is the slope

380  $\mu\text{m}$  wafers these numbers translate to feature sizes about 40, 20 and 10  $\mu\text{m}$ . In bonded SOI wafers, device layer thicknesses range from 5  $\mu\text{m}$  upward. Feature size is then limited by lithography and undercutting of the pulsed (Bosch) process, rather than by aspect ratio effects.

Another charging-related non-ideality in DRIE is the notching effect (or footing effect): when the silicon etching end point is reached, the underlying oxide (either oxide on the back side of a bulk wafer, or BOX) becomes charged. This charging leads to repellency of incoming ions, and they are deflected sideways, enhancing lateral etching near the silicon/oxide interface (Figure 21.28). Note that RIE lag has an effect on notching: the larger features have experienced longer overetching, therefore the notching effect has had more time to operate. Charge can

accumulate on isolated conductors, and the oxide beneath these conductors can be damaged by this charge accumulation. Not only plasma etching, but all plasma processes, PECVD and sputtering are potential sources of oxide damage.

## 21.8 Etch Residues and Damage

Etch anisotropy relies on passivation films, and this means that we intentionally deposit material on the wafer; not all of it is beneficial. Too much passivation results in a reduced etch rate, and really heavy passivation equals etch stop. This can happen locally, so that some areas are not attacked by the etchant, leading to roughness and in extreme cases pillar-like structures. Similar pillar-like residues result from particles on the surface. In the case of a completely anisotropic process, even nanoscopic particles act as etch masks (Figure 21.29).

Similar to overpassivation, redeposition can lead to roughness and pillar-like structures. If the etch product is only marginally volatile, it can be redeposited on the wafer surface and act as a mask. It is also possible that the mask material is redeposited: high ion energy can detach atoms from an inert mask (remember that ion beams can etch anything) and if redeposited on the wafer, they will certainly act as masks; after all, mask material is chosen to be non-etchable.

Often etch products are incorporated into a sidewall passivation film. They can also react with photoresists. Photoresist removal after aggressive plasma etching is therefore difficult. Aggressive etching will also etch the resist mask, and line narrowing can take place. The opposite effect can also happen (but not often!): so much



**Figure 21.28** Notching effect. Reproduced from Chekurov *et al.* (2007) by permission of IOP



**Figure 21.29** Nanoparticle-masked etching. Reproduced from Sainiemi *et al.* (2007) by permission of IOP

material (etch products, redeposited non-etchable material, passivation film, resist debris) is deposited on the resist sidewall that in fact linewidth increases!

RIE lag is not the only lag that is encountered in etching. Etch initiation lag is also typical. If etched depth is plotted vs. etch time, the line should go through the origin, but often there is a time lag before etching starts. Native oxides is one explanation, and other residues can also be present on the wafer surface. The “first-wafer effect” is often seen in RIE: the etch rate only stabilizes after a few wafers have been etched. There are both thermal and chemical reasons for this. RIE is an energetic process and the system heats up during RIE. And even if the etch reactor is clean to begin with, passivation films will soon cover its inside surfaces. If the reactor is used for very different materials, using different gases, like SF<sub>6</sub>-based silicon etching and Cl-based GaAs etching, the etch residues in the chamber will foul the process.

## 21.9 DRIE vs. Wet Etching

Both DRIE and anisotropic wet etching have their advantages, as in Tables 21.1 and 21.2, and in many applications both etching techniques are needed. The decision in favor of either technique depends not only on technological factors like etched shape, sidewall angle or surface quality, but on practical issues like etch rate, back-side protection or equipment availability.

The ink jet example of Figure 21.30 shows how many different etch techniques are utilized in one device: the manifold etching through the wafer is done by TMAH anisotropic wet etching, the critical inlet channel is defined by DRIE, and the chamber is made hemispherical by isotropic plasma etching. The nozzle

**Table 21.2** Anisotropic wet etching main features

- Very accurate dimensional control by crystal plane-dependent etching
- Structural shapes limited by crystal plane-dependent etching
- Accurate 45°, 54.7°, 70.5° or 90° sidewall
- Smooth and well-defined surfaces
- About 4–8 hours for through-wafer etching for a single wafer
- About 4–8 hours for through-wafer etching for a batch of 25 wafers
- Etches both sides, protection needed on back side
- Etches both sides, symmetric structures can be made in a single etch step
- Aggressive to metals and many other materials, protective layers needed
- Limited selection of mask materials, thick oxide and LPCVD nitride standard
- Many etch stop mechanisms available: boron p++, pn junction, SOI BOX



**Table 21.1** DRIE main features

- Any shape can be made
- Tightly spaced structures can be made
- Vertical structures of high aspect ratio are possible
- Difficulty in making silicon membranes unless SOI wafers used
- Photoresist masking is possible
- Single-sided processing, no back-side protection needed
- 1–3 hours for through-wafer etching in single wafer operation
- 1–3 days to etch a batch of 25 wafers through-the-wafer

**Figure 21.30** Ink jet etching features: chamber is etched by isotropic plasma etching, anisotropic plasma etching for inlet channel, anisotropic spacer etching for nozzle guides and anisotropic TMAH wet etching for the manifold. Reproduced from Shin *et al.* (2003) by permission of IEEE

guides are spacers formed by plasma etching. In a flame ionization detector (Figure 1.10), the nozzle chamber is KOH etched in silicon, and micronozzles and sample gas channels are etched in silicon by DRIE. Pyrex glass wafers are isotropically wet etched in HF.

## 21.10 Exercises

- What must SF<sub>6</sub> gas flow at least be in a DRIE reactor if the silicon etch rate is 10 µm/min, wafer size 150 mm and etchable area 20%?
- Determine the DRIE single crystal silicon etch rate from the following trench etching data:

| Etch time<br>(min) | Etched depth (µm) |            |            |
|--------------------|-------------------|------------|------------|
|                    | 80 µm wide        | 40 µm wide | 12 µm wide |
| 20                 | 109               | 104        | 85         |
| 40                 | 205               | 193        | 156        |
| 60                 | 292               | 278        | 215        |

- How much etch non-uniformity can native oxide cause in silicon (D)RIE?
- Design a fabrication process for the cell filter shown below.



- Reproduced from Prince *et al.* (2007), by permission of *Proc. IMechE*
- Compare silicon tip fabrication by anisotropic wet etching, plasma etching and thermal oxidation.
  - Draw top view mask layouts to show how the microneedle of Figure 1.13 and 21.22 is made.
  - Explain the dimensions and film thicknesses and quantify the narrowing of the 3 µm hole shown in Figure 21.25.
  - How could you make a DRIE version of the filter shown in Figure 20.27? Could you improve the aperture ratio (percentage of open area)?

- What areal densities of nozzles could be made by the processes of Figures 21.18, 21.19 and 21.20?
- Design a fabrication process for the variable optical attenuator of Figure 21.4.
- Explain step by step the fabrication of the comb-driven torsional mirror of Figure 21.12!
- Explain in detail the process steps of buried channel formation (Figure 21.15).

## References and Related Reading

- Acar, C. and A.M. Shkel (2005) Structurally decoupled micro-machined gyroscopes with post-release capacitance enhancement, *J. Micromech. Microeng.*, **15**, 1092–1101.
- Agarwal, R., S. Samson and S. Bhansali (2007) Fabrication of vertical mirrors using plasma etch and KOH:IPA polishing, *J. Micromech. Microeng.*, **17**, 26–35.
- Chekurov, N. *et al.* (2007) Atomic layer deposition enhanced rapid dry fabrication of micromechanical devices with cryogenic deep reactive ion etching, *J. Micromech. Microeng.*, **17**, 1731–1736.
- Chen, K.-S. *et al.* (2002) Effect of process parameters on the surface morphology and mechanical performance of silicon structures after deep reactive ion etching (DRIE), *J. Microelectromech. Syst.*, **11**, 264.
- de Boer, M.J. *et al.* (2000) Micromachining of buried micro channels in silicon, *J. Microelectromech. Syst.*, **9**, 94.
- Deng, W. *et al.* (2006) Increase of electrospray throughput using multiplexed microfabricated sources for the scalable generation of monodisperse droplets, *J. Aerosol Sci.*, **37**, 696–714.
- Gottsch, R.A. *et al.* (1992) Microscopic uniformity in plasma etching, *J. Vac. Sci. Technol.*, **B10**, 2133–2147.
- Griss, P. and Stemme, G. (2003) Side-opened out-of-plane microneedles for microfluidic transdermal liquid transfer, *J. Microelectromech. Syst.*, **12**, 296.
- Holm, J. *et al.* (2000) Through-etched silicon carriers for passive alignment of optical fibers to surface-active optoelectronic components, *Sens. Actuators*, **82**, 245–248.
- Jansen, H.V. *et al.* (2009) Black silicon method X: a review on high speed and selective plasma etching of silicon with profile control: an in-depth comparison between Bosch and cryostat DRIE processes as a roadmap to next generation equipment, *J. Micromech. Microeng.*, **19**, 033001.
- Kiihamäki, J. and S. Franssila (1999) Pattern shape effects and artefacts in deep silicon etching, *J. Vac. Sci. Technol.*, **A17**, 2280.
- Kim, C.-H. and Y.-K. Kim (2005) MEMS variable optical attenuator using a translation motion of 45° tilted vertical mirror, *J. Micromech. Microeng.*, **15**, 1466–1475.
- Lee, M.-C.M and M.C. Wu (2006) Thermal annealing in hydrogen for 3-D profile transformation on silicon-on-insulator and sidewall roughness reduction, *J. Microelectromech. Syst.*, **15**, 338–343.

- Lehnert, T. *et al.* (2002) Realization of hollow SiO<sub>2</sub> micronozzle for electrical measurements on living cells, *Appl. Phys. Lett.*, **81**, 5063–5065.
- MacDonald, N.C. (1996) SCREAM MicroElectroMechanical Systems, *Microelectron. Eng.*, **32**, 49.
- Matthews, B. and J.W. Judy (2006) Design and fabrication of a micromachined planar patch-clamp substrate with integrated microfluidics for single-cell measurements, *J. Microelectromech. Syst.*, **15**, 214–222.
- Paik, S.-J. *et al.* (2004) In-plane single-crystal-silicon microneedles for minimally invasive microfluid systems, *Sens. Actuators*, **A114**, 276–284.
- Prince, M. *et al.* (2007) The development of a novel Bio-MEMS filtration chip for the separation of specific cells in fluid suspension, *Proc. IMechE, Part H: J. Eng. Med.*, **221**, 113–128.
- Ranganathan, N. *et al.* (2008) The development of a tapered silicon micro-micromachining process for 3D microsystems packaging, *J. Micromech. Microeng.*, **18**, 115028.
- Sainiemi, L. *et al.* (2007) Rapid fabrication of high aspect ratio silicon nanopillars for chemical analysis, *Nanotechnology*, **18**, 505303.
- Sainiemi, L. *et al.* (2009) Fabrication of thermal microbridge actuators and characterization of their electrical and mechanical responses, *Sens. Actuators*, **A149**, 305–314.
- Sasaki, M. *et al.* (2000) Anisotropically etched Si mold for solid polymer dye microcavity laser, *Jpn. J. Appl. Phys.*, **39**, 7145.
- Schmid, J.H. *et al.* (2007) Gradient-index antireflective subwavelength structures for planar waveguide facets, *Opt. Lett.*, **32**, 1794–1796.
- Shin, S.J. *et al.* (2003) Firing frequency improvement of back shooting inkjet printhead by thermal management, *Proceedings of Transducers'03*, p. 380.
- Sterner, M., N. Roxhed, G. Stemme and J. Oberhammer (2007) Mechanically tri-stable SPDT metal-contact MEMS switch embedded in 3D transmission line, *Proceedings of the 37th European Microwave Conference, 2007*, pp. 1225–1228.
- Tsou, C. *et al.* (2005) A novel self-aligned vertical electrostatic combdrives actuator for scanning micromirrors, *J. Microelectromech. Syst.*, **15**, 855–860.
- Xu, D.-X. *et al.* (2009) Spiral cavity Si wire resonators as label-free biosensors, *OSA Integrated Photonics and Nanophotonics Research and Applications (IPNRA)*, paper IMB2.
- Yu, L.M. *et al.* (2009) A microfabricated electrode with hollow microneedles for ECG measurement, *Sens. Actuators*, **A151**, 17–22.
- Yun, S.-S., S.-K. You and J.-H. Lee (2006) Fabrication of vertical optical plane using DRIE and KOH crystalline etching of (110) silicon wafer, *Sens. Actuators*, **A128**, 387–394.

# Wafer Engineering

Silicon crystal growth influences wafer processing in various ways: for example, the quality of thin thermal oxide depends on microscopic voids formed during crystal pulling, and oxygen content and distribution inside wafers are essential for trapping metallic impurities. In spite of continuous developments, wafers never become “perfect” because each new generation of devices sets different demands on wafers, for example larger and thicker wafers are needed, which changes crystal pulling conditions and thermal balances, leading to different dominating defect mechanisms. These wafers will be processed at lower temperatures, using novel processes, which will change the relative importance of impurities and defects.

When silicon-on-insulator (SOI) wafers are made, different techniques result in different possible thicknesses for buried oxide layer and the top device silicon. Defects in different SOI technologies are different, and have varying degrees of impact on devices. In epitaxy multiple layers enhance possibilities for optimizing various device design targets, but bulk wafers are always being considered as an option because of the higher price of epi (and SOI).

In addition to silicon-to-silicon bonding, there are other possible materials that can be combined with silicon in order to achieve the best of both worlds. Silicon-on-glass leads to transparency, while indium phosphide-on-silicon leads to high-speed electronics, and silicon-on-diamond is the ultimate thermal conduction solution for high-power devices. Wafer bonding is the key technology to make these heterogeneous substrates.

## 22.1 Silicon Crystals

Vacancies and interstitials will always be present even in the best possible material because they are created by thermal equilibrium processes (there are also vacancies and interstitials resulting from the crystal pulling process).

Vacancies and interstitials diffuse like dopant atoms. They also aggregate at temperatures of 900–1050 °C. Vacancy clusters are voids, and aggregates of atoms are precipitates, for example dopant atoms or oxygen above the solubility limit will form precipitates (see Figure 14.2 for solubility vs. temperature). Silicon interstitials can also cluster, forming dislocation loops (also known as L-pits). Processes which cause volume changes, like thermal oxidation, are prone to produce defects. Oxidation-induced stacking faults (OISFs) are one class of such defects. On the other hand, the volume changes associated with oxidation inject vacancies into silicon, and these affect diffusing species. The temperature differences between wafer center and edge can generate dislocations, so the material that was perfect to begin with is modified critically if improper processing is carried out.

When a wafer is heated non-uniformly in a high-temperature step, for example during epitaxy or oxidation, temperature differences between wafer center and edge can easily lead to thermal stresses above silicon yield strength. These stresses will relax by forming defects, for example slip lines. The order of magnitude of thermally generated stress  $\sigma$  can be gauged by

$$\sigma = \varepsilon E = \alpha L \times \Delta T \times E \quad (22.1)$$

where strain  $\varepsilon$  depends on the silicon coefficient of thermal expansion,  $\alpha$ , temperature difference  $\Delta T$  and wafer size,  $L$ . Silicon yield strength (or critical shear stress) is strongly temperature dependent: at 850 °C it is about 50 MPa, at 1000 °C only on the order of 10 MPa, and about 1 MPa at 1200 °C.

Oxygen is always present in CZ silicon because of the silica crucible (Figure 22.1). All dissolved oxygen does not remain in the melt or get incorporated into the ingot: most of it is evaporated as SiO. Oxygen at 5–20 ppma (according to ASTM standard F121-83) will end up in the crystal.



**Figure 22.1** Oxygen from fused silica ( $\text{SiO}_2$ ) crucible and carbon from graphite holder

The silica crucible is not mechanically strong enough at about 1400–1500 °C temperatures, and a graphite susceptor provides mechanical strength. The silica crucible reacts with the graphite holder according to the equation  $\text{SiO}_2 + 3 \text{ C} \Rightarrow \text{SiC} + 2 \text{ CO}$ . This carbon monoxide gas is the source of carbon which is always present in CZ crystals, at concentrations about  $10^{16} \text{ cm}^{-3}$ , below 1 ppm.

Oxygen is initially dissolved in interstitial sites but can precipitate during thermal treatments. These precipitates, often known simply as O<sub>2</sub>P, are important in many respects, for example O<sub>2</sub>P are beneficial for wafer mechanical strength because they will prevent slip lines from spreading in the wafer. Carbon impurities act as nucleation sites and centers for oxygen precipitation. Oxygen precipitates are prime examples of bulk micro defects (BMDs). Oxygen concentration can have a drastic effect on KOH wet etching of silicon: oxygen precipitates cause local stresses which change the relative etch rates of crystal planes. This will be discussed in Chapter 30 (see Figure 30.4). Very small and uniform oxygen precipitates can be achieved with low oxygen concentrations, for example 7 ppma. This may, however, be too low for ICs, and wafer selection for CMOS-MEMS integration is difficult.



**Figure 22.2** Vacancy cluster (COP) size vs. density at different pull rates. Reproduced from Aminzadeh *et al.* (2002), copyright 2002, by permission of IEEE

Microvoids are clusters of vacancies formed inside the ingot during crystal pulling. They are known, for historical reasons, as COPs (Crystal-Originated Particles).

Typical COP sizes are 50–200 nm, and they are found in concentrations of  $10^4$ – $10^6 \text{ cm}^{-3}$  (Figure 22.2). When wafers are cut and polished, COPs end up at wafer surface after alkaline cleaning step during wafer processing. A COP is detected by laser scattering because it reflects light almost like a particle (advanced multiangle scatterometry tools can distinguish COPs from particles). Vacancy clusters were therefore classified as particles, and were given the name COP. It was the fact that the number of COPs did not decrease in cleaning (and it could in fact increase!) that lead to a reassessment of their nature.

COP formation during crystal pulling depends on V/G ratio: pull rate to temperature gradient at crystallization interface. When V/G is high, usually at high pull rates, COPs (microvoids) are formed throughout the crystal. At medium pull rates oxidation induced stacking faults (OISF) appear at periphery. Yet lower pull rates lead to L-pits appear outside the OISF ring and at very low pull rates OISF disappears and L-pits are everywhere. Because pull rates tend to be smaller for larger diameter crystals, larger wafers have quite different defect profile from smaller crystals.

## 22.2 Gettering

Defects are not passive objects: they change and morph during wafer processing, especially during high-temperature steps. For example, metallic impurities diffuse, agglomerate and trap charge carriers. Metallic impurities come from EGS polysilicon, the silica crucible, the graphite crucible and heaters and other hot parts of the crystal growth system. The segregation

coefficients of most metals (Table 4.2) are very small, and the crystal is purified relative to melt. Metals are, however, fast diffusers in silicon, and they react with other defects, forming clusters. Metals affect electronic devices by creating trapping centers in silicon midgap, reducing minority carrier lifetimes and lowering mobility. Metals can also precipitate at the Si/SiO<sub>2</sub> interface and reduce oxide quality. The allowed iron level in advanced silicon wafers is limited to  $10^{10} \text{ cm}^{-3}$ .

In order to contain the impurities, and keep the active transistor area defect-free, a multizone approach to wafers has been devised. The wafer is designed to have different zones with different roles. The top surface is the prime quality area, with minimized particles, minimal roughness and ultimate flatness. Under the surface is the defect-free region, known as the denuded zone (DZ). The rest of the wafer can be used as a getter, a sink for collecting and holding impurities. The back side of the wafer can also be made to act as a getter. These zones are depicted in Figure 22.3.

This multizone approach is clearly not suitable for devices that extend through the wafer, like power transistors, solar cells or bulk MEMS, because of the vertical non-uniformity it introduces. If both ICs and MEMS devices are made on the same wafer, it is beneficial to have small, uniform oxygen precipitates, as a compromise which satisfies to some extent the demands of both internal gettering and vertical uniformity.

Oxygen is depleted in the top silicon layer by annealing in hydrogen. Oxygen outdiffuses from the surface, and an oxygen-depleted region develops. This denuded zone thus has low oxygen concentration and minimized oxygen-induced defects. It is formed in three steps:

1. The outdiffusion step ( $1100\text{--}1200^\circ\text{C}$ ; 1–4 h): oxygen diffuses out of the surface region, leaving  $<5 \text{ ppma}$  oxygen.
2. The nucleation step at  $600^\circ\text{C}$ : SiO<sub>x</sub> formed homogeneously throughout the wafer volume.



**Figure 22.3** Wafer cross-section with denuded zone (not to scale)

3. SiO<sub>x</sub> precipitate growth and gettering ( $950\text{--}1200^\circ\text{C}$ , 4–16 h).

Denuded zone depth depends strongly on device requirements and can range from 10 to 40 μm.

Similar treatments can be done to create a bulk wafer which resembles an epitaxial wafer: a p-type wafer is annealed in hydrogen at  $1200^\circ\text{C}$ , leading to boron and oxygen outdiffusion. A reduction of boron concentration by an order of magnitude can be achieved, essentially forming p<sup>-</sup> on a p<sup>+</sup> type structure (similar to Figure 4.15). The denuded zone is formed simultaneously. Also, vacancy clusters (COPs) are annealed out and the wafer surface is smoothed during the hydrogen anneal.

Gettering of impurities can be done either inside silicon (intrinsic gettering, IG) or at the wafer back side (external gettering, EG). In both cases four essential steps must take place: (1) release of metals from active circuit region; (2) transport of these metals to the gettering zone; (3) capture; and (4) retention for the rest of the process.

Extrinsic gettering on the wafer back side can be achieved by a number of techniques: a damaged layer from sand blasting or laser damage, polysilicon thin film, phosphorus doping and ion implantation damage are possible. The number of gettering sites increases in these steps, or metal diffusion is modified, as in the case of phosphorus. Ion implantation damage is annealed away during high-temperature steps, and its gettering ability reduced, but new back-side implants can be made. Extrinsic gettering can be added to a process flow before critical high-temperature steps. Solar cell fabrication, which is very cost sensitive, uses gettering to ensure cleanliness in a few critical steps only.

External gettering becomes difficult with growing wafer size and even completely unusable. Large-diameter wafers need to be thicker for mechanical strength, and diffusion distances for impurities increase. Simultaneously, process temperatures are getting lower (see Chapter 26 for more details), which further decreases the efficacy of back-side gettering. And finally, double-sided polishing and the need to keep the wafer back side flat and clean eliminate many external gettering options.

Intrinsic gettering is closely related to bulk microdefects (BMDs) and the thermal cycles that the wafer will experience during processing. Oxygen precipitates act as precipitation sites for other impurities, creating an impurity gradient which drives impurities toward designed precipitation sites. Wafer oxygen concentration is thus critical for intrinsic gettering. By and large, intrinsic gettering is determined when wafer processing begins. On the other hand, oxygen precipitates mechanically strengthen the wafer, which is even more important for large-diameter wafers.

## 22.3 Wafer Mechanical Specifications

Wafer thickness refers to centerpoint thickness (Table 22.1). It is difficult to produce tight thickness specifications because some wafering steps are batch processes for many wafers at a time and some are single wafer steps, therefore variations are inevitable. Wafer thicknesses are compromises between material usage and mechanical strength. Mechanical strength is especially important in high-temperature steps as many mechanical properties (for instance, yield strength) are strongly temperature dependent. MEMS devices which extend through the whole wafer require exacting thickness control. Anisotropic wet etching results in slanted 54.7° sidewalls which waste area, and more so for thicker wafers. In DRIE thick wafers lead to longer etch times.

Specialty wafers with practically any thicknesses are available but thin-wafer handling is very difficult. Mechanical stability increases with thickness, and thickness has to increase with wafer size. Through-wafer MEMS have not been done on 300 mm so far, and 200 mm is on the fringe, too. Thicker wafers are readily available in thicknesses of 1–1.5 mm.

Wafers bend and warp at high temperatures, either because their front and back sides are heated non-uniformly (e.g., lamp heating from the top side only), or because films deposited on one side of the wafer introduce stress. This is sometimes compensated by the fact that both sides of the wafer are coated, as in thermal oxidation, but when the film on either side is etched, asymmetry is created and bowing develops. Even though epitaxial films are assumed to be zero-stress films, doping leads to bowing. Because boron atoms are smaller than silicon atoms, the resulting film will be tensile stressed (and large arsenic atoms result in compressive stresses): the higher the doping level, the more the stress. Boron doping of  $10^{18} \text{ cm}^{-3}$  corresponds to  $10^{-4}$  lattice mismatch, which leads to bowing by tens of micrometers. This makes wafer bonding, for instance, much more difficult.

Bow and warp relate to shape deformations of free, unclamped wafers (Figure 22.4). Wafers can be concave



**Figure 22.4** Bow (left): deviation of median surface from reference plane; warp (right): range of median surface distances from reference plane

or convex, or undulating. Bow may be eliminated by clamping, that is forcing the wafer flat on a chuck. Warp is the difference between the maximum and minimum distances of the median surface. Warp is a bulk property, in contrast to flatness, which is a surface property. Warp and bow can develop during high-temperature process steps, or result from ingot sawing and lapping operations. The presence of excessive bow and warp will affect lithographic performance via depth-of-focus problems.

Wafer surface topography can be divided into a few distinct scales: roughness is atomic/nanoscale, flatness is chip-scale and bow and warp are wafer-scale phenomena. Smoothness and flatness are essential parameters for fusion bonding: wafers with 0.1 nm roughness are preferred for fusion bonding. Anodic bonding is more forgiving to surface roughness, and wafers with 0.5 nm roughness are fine for anodic bonding.

Flatness is characterized by total indicator reading (TIR). It is a front-side reference measurement. TIR is defined as the sum of the maximum positive and negative deviations from a reference plane. If this reference plane is chosen to coincide with the focal plane of the mask aligner, focal plane deviation (FPD) is defined as the largest deviation, positive or negative, from this plane (Figure 22.5). TIR and FPD are measured for clamped wafers, while bow and warp are measured for unclamped (free) wafers.

Flatness is measured over an area that is relevant to the lithography process and chip size. It directly impacts linewidth variation through lithographic depth of focus. Lithographic processes utilizing 1× full wafer imaging systems are sensitive to global flatness, whereas step-and-

**Table 22.1** Standard wafer sizes and thicknesses (centerpoint thickness)

|        |                   |                                                         |
|--------|-------------------|---------------------------------------------------------|
| 3 inch | 380 $\mu\text{m}$ |                                                         |
| 100 mm | 525 $\mu\text{m}$ | (380 $\mu\text{m}$ for MEMS; thinner wafers exist)      |
| 150 mm | 625 $\mu\text{m}$ | (380 $\mu\text{m}$ for MEMS; 250 $\mu\text{m}$ minimum) |
| 200 mm | 725 $\mu\text{m}$ | (500 $\mu\text{m}$ MEMS)                                |
| 300 mm | 770 $\mu\text{m}$ |                                                         |



**Figure 22.5** Left, total indicator reading (TIR); right, focal plane deviation (FPD)

repeat imaging systems are sensitive to local site flatness, over an exposure area which is less than 10 cm square.

## 22.4 Epitaxial Wafers

Epitaxial wafers offer extreme purity: carbon and oxygen, which are always present in CZ wafers, are practically absent in epitaxial layers. If there are COPs in bulk material, they will be buried by the epitaxial layer, reducing a 100 nm void to a few nanometers. Epitaxial layers are not defect-free, however; stacking faults created in epitaxial growth are the largest yield limiters in epitaxy.

Whereas CZ wafers have cylindrical distribution of doping and defects because of the rotation during crystal pulling, epitaxial deposition is highly uniform. Epi doping uniformity is typically below 4% and thickness uniformity about 1%. Epitaxial deposition is very reproducible, for both resistivity and thickness. And while resistivity in an ingot has lengthwise gradient, and bulk wafers therefore have slightly different resistivities, epitaxial wafers have identical resistivity.

Minimum thickness by CVD homoepitaxy is about 0.5 μm, and the maximum thickness is determined by economics of epitaxial growth, not by physics and chemistry. Practical maximum epitaxial thicknesses are about 100 μm for certain power semiconductor devices. Epitaxial wafers have applications in almost all areas of microfabrication (Table 22.2), but the wafer cost limits their use to more expensive applications only. In CMOS, one-third of wafer usage is epi, two thirds bulk.

Because p<sup>++</sup> etch stop material is too heavily doped for electrical devices and mechanically poor due to tensile stresses from high boron concentration, alternatives have been explored. An advanced etch stop structure relies on a double epitaxial layer structure: an etch stop layer and a device layer. The first epilayer is heavily boron doped, but in order to minimize mechanical stresses from boron doping, the film is compensated by germanium ( $10^{21} \text{ cm}^{-3}$  germanium,  $10^{20} \text{ cm}^{-3}$  boron) (Figure 22.6). The boron atom is smaller than the silicon one, and germanium one is larger, which prevents stresses from volume mismatch building up. Germanium is a column IV element (beneath



**Figure 22.6** SIMS doping profile of Si:Ge:B epitaxial etch stop layer. Courtesy Okmetic

silicon in the periodic table), therefore isoelectronic with silicon, so no electrical effects are introduced. The second layer, lightly doped, is deposited on top of the Si:Ge:B etch stop layer. This second layer is the actual device layer, and we can choose device doping level freely.

## 22.5 SOI Wafers

Three SOI techniques will be discussed in this chapter:

- Bonded SOI (also known as BESOI, for bond and etch-back SOI)

**Table 22.2** Epitaxial wafer applications

| Technology     | Subst.         | Epi             | $\rho$ (ohm-cm) | Thick (μm) | Motivation   |
|----------------|----------------|-----------------|-----------------|------------|--------------|
| CMOS           | p <sup>+</sup> | p               | 5–10            | 5–20       | Latch-up     |
| Power MOS      | n <sup>+</sup> | n               | 5–10            | 10–20      | Conductivity |
| Analog bipolar | p <sup>+</sup> | p               | 1–20            | 10–100     | Speed        |
| MEMS           | p              | n               | 1–10            | 7–150      | Etch stop    |
| MEMS           | p              | p <sup>++</sup> | 0.005           | 3–5        | Etch stop    |

- Smart-cut (ion cut layer transfer)
- SIMOX (implanted oxygen).

Each has its characteristic SOI device layer thickness as well as typical buried oxide (BOX) thickness. They also differ in defect density and in the applicability of cavity structure formation. Bonded SOI is best suited for thick device layer applications in MEMS. BOX thickness is quite freely chosen. SIMOX and Smart-cut are used for thin SOI device layers in CMOS. In SIMOX both BOX and device silicon thicknesses are limited, while in Smart-cut silicon thickness is limited but BOX thickness is a free variable. SOI specifications and applications are listed in Table 22.3.

Intermediate SOI device layer thicknesses are difficult to make and usually involve an epitaxial step, as will be discussed below.

### 22.5.1 Bonded SOI

#### 22.5.1.1 The bonding process

Bonding is a straightforward way to make SOI structures. The bonded SOI technique uses the bonding of two wafers (one or both oxidized) followed by thinning (Figure 17.3). Bonding without thinning has its applications too, see Figures 1.17 and 30.16.

Wafer bonding allows independent optimization of the top device layer and the supporting handle wafer. The handle wafer is chosen for mechanical support, thermal compatibility, micromachining, doping level or some other property. The device layer can have material, crystal orientation, doping level or thickness tailored to the particular device design, irrespective of handle wafer properties.

Thinning of the device wafer involves grinding, polishing and etching, much like in silicon wafering (Table 4.3). Thinning down to  $10\text{ }\mu\text{m}$  thickness is reasonably easy, and  $5\text{ }\mu\text{m}$  can be done. For layers thinner than that, special techniques are required: either real-time thickness monitoring during final polishing, or etch stop layers.

Figure 22.7 shows a process for thin device layer bonded SOI. Epitaxial layers with different etching properties have to be grown on the device wafer before bonding. Grinding (or etching) removes the bulk of the



**Figure 22.7** Thin bonded SOI: top carrier wafer with two epitaxial layers and thermal oxide is bonded to an oxidized handle wafer. KOH etching of carrier wafer until  $\text{p}^{++}$  etch stop layer, and HF:HNO<sub>3</sub> etch to remove  $\text{p}^{++}$  silicon

silicon, and selective etching (KOH/TMAH) removes the remaining material until the  $\text{p}^{++}$  etch stop layer is met. High boron doping is used but a second epitaxial layer is grown on it. The highly doped etch stop layer can then be removed by, for example, 1–3–8 etchant (HF, HNO<sub>3</sub> and CH<sub>3</sub>COOH in the volume ratio of 1:3:8) which does not etch lightly doped material. Etch stop layers enable fabrication of device silicon layers 100 nm thick with  $\pm 5\text{--}10\text{ nm}$  variation.

Bonding two oxidized wafers instead of one bare and one oxidized wafer is useful when thick BOX is required. The thickest BOXs made of thermal oxides are  $4\text{ }\mu\text{m}$  ( $2\text{ }\mu\text{m}$  of oxide on both wafers) thick. Bonding of wafers with thick deposited (CVD) oxides has been actively studied, but the films are generally not smooth enough for good bonding. If CMP is used to polish the surface (Figure 16.10), process cost rapidly increases.

Edge processing is needed in BESOI, as shown in Figure 22.8. Wafer edges are rounded because sharp edges are crack-prone. Some 3 mm of device layer silicon is removed from the edges, which means that it is easy to identify the SOI device side.

**Table 22.3** SOI wafer applications

| Application | Device layer        | Buried oxide          | SOI technology   |
|-------------|---------------------|-----------------------|------------------|
| CMOS        | 10–200 nm           | 200–400 nm            | Smart-cut, SIMOX |
| Bipolar     | 1–10 $\mu\text{m}$  | 0.1–1.0 $\mu\text{m}$ | Various          |
| MEMS        | 5–50 $\mu\text{m}$  | 0.5–4 $\mu\text{m}$   | Bonded SOI       |
| Power IC    | 1–100 $\mu\text{m}$ | 1–4 $\mu\text{m}$     | Bonded SOI       |



**Figure 22.8** Bonded SOI edge treatment: top, after thinning; bottom, after edge treatment

In hydrophilic bonding there are two possibilities for the pair to be bonded: a silicon wafer and an oxidized wafer, or two oxidized wafers. The latter results in reduced bond strength, just 70–80% of the former, but the resulting structure is symmetric with respect to interfaces. In SOI wafer specifications it is stated which wafer has thermal oxide on it.

#### 22.5.1.2 Bonding chemistry and physics

The water released during the formation of Si–O–Si bonds will oxidize silicon further ( $\text{Si} + 2 \text{H}_2\text{O} \Rightarrow \text{SiO}_2 + 2 \text{H}_2$ ; wet oxidation!). The effect of this oxide is the more important, the thinner the oxide on the wafers; if wafers with thick oxides are bonded, water diffusion will be slow and additional oxidation minuscule. A combination of thin (or native) oxide wafer and thick oxide wafer is a compromise: oxidation will proceed according to the aforementioned equation, strengthening the bond, and hydrogen can dissolve in the oxide, preventing the build-up of gas bubbles at the interface.

While hydrophilic bonding is mainstream technology, hydrophobic bonding is of interest in some applications. For example, forming pn diodes by bonding requires bonding without interfacial oxide. HF-last cleaned wafers have a hydrogen-terminated surface (Figure 12.1) and bonding proceeds as described by



In the case of hydrophobic surfaces ( $-\text{Si}-\text{H}$  terminated) roughness is on the order of 0.5 nm and their bonding properties are worse than those of hydrophilic surfaces. Hydrogen bonds between HF units are weak. Hydrogen bubble prevention is very important in hydrophobic bonding. Hydrogen will diffuse along the bonding interface, and will not dissolve into bulk silicon below 500 °C. The bond energies of hydrophobic bonding are much lower



**Figure 22.9** Surface energies for hydrophilic (HL) and hydrophobic (HB) bonding. Reproduced from Tong and Gösele (1999) by permission of John Wiley & Sons, Inc.

than those of hydrophilic bonding at low temperatures (as shown in Figure 22.9), but they can be improved by annealing.

#### 22.5.2 Smart-cut™ ion cut layer transfer

In the Smart-cut method hydrogen is implanted into an oxidized wafer (called the donor wafer) which is subsequently bonded to a handle wafer. The donor wafer is then split along the mechanically weak region that was created by hydrogen implantation (Figure 22.10). The hydrogen implantation method has been patented, under the name Smart-cut, and wafers manufactured with the method are marketed as Unibond.

Hydrogen bubble-induced layer splitting is based on hydrogen implantation. Gas bubbles form at the depth of maximum hydrogen concentration. These bubbles lead to mechanical weakening of the silicon material and microcracks lead to cleavage of the implanted layer when a suitable thermal treatment or mechanical pressure is applied. The donor wafer provides mechanical strength during thermal or mechanical treatment, and the whole wafer can be split. In the case of a thin silicon layer, hydrogen bubbles can burst the thin top layer.

#### Process flow: Smart-cut

- Thermal oxidation of donor wafer
- $\text{H}^+$  implantation into donor wafer (60 kV,  $6 \times 10^{16} \text{ cm}^{-2}$ )

- Hydrophilic bonding at room temperature
- Anneal at 400–600 °C to split the wafers
- High-temperature anneal at 1100 °C for 2 h strengthens the chemical bonds
- Final polishing or hydrogen anneal

The hydrogen dose required for bubble formation is  $3\text{--}10 \times 10^{16} \text{ cm}^{-2}$ . The thickness of the splitting layer is related to the  $\text{H}^+$  energy, which can accurately and easily be controlled. CMP or hydrogen annealing is necessary to eliminate the microroughness of the SOI device layer, even though the layer thickness just after splitting is uniform to a few nanometers.

An alternative way to detachment is mechanical force. Water jets or pressurized gas can be used. The bonding energy at the bonding interface is much higher than that in the H-implanted region which is embrittled. Thus, even at room temperature, the H-implanted layer can be peeled off from the donor wafer.

Smart-cut is a quite generic technique: implantation and mechanical splitting have been applied to transfer thin layers of many precious materials onto cheaper handle wafers. In the case of a silicon handle, the bonding process is always the same, that is the familiar silicon-to-oxide bonding. The device layer material may impose some limitations, but for example germanium-on-insulator (GeOI) and strained silicon-on-insulator (SSOI) have been made in straightforward fashion.

### 22.5.3 Separation by implantation of oxygen (SIMOX)

In SIMOX technology, SOI structure is realized in two main steps. The first step is oxygen ion implantation into a silicon wafer and the second step is a high-temperature anneal during which the implanted oxygen atoms form a buried oxide layer inside the silicon. Process conditions are:

- oxygen dose  $2 \times 10^{18} \text{ cm}^{-2}$
- oxygen energy 150–200 keV
- wafer temperature 550–650 °C

Good uniformity of both the silicon device layer and BOX thickness can be achieved. SIMOX is best suited for thin-silicon/thin-BOX applications. Typically both layers are on the order of 100 nm thick. Increasing oxygen ion energy would lead to deeper penetration, but at the cost of more expensive equipment.

SIMOX material exhibits inherent defect problems: the device silicon layer is damaged by the implantation process and it cannot be fully recovered during annealing. SIMOX dislocation densities can be  $10^6/\text{cm}^2$ , orders of magnitude more than in bulk silicon. Implantation time poses another limitation: the required doses are two orders of magnitude higher than those in common usage in CMOS manufacturing.

An improvement of SIMOX quality can be achieved by internal oxidation (ITOX), shown in Figure 22.11. During



**Figure 22.10** Smart-cut: (a)  $\text{H}^+$  implantation into an oxidized donor wafer; (b) donor wafer is bonded to a handle wafer; (c) cleavage along the weak region created by implantation results in a SOI wafer



**Figure 22.11** SIMOX SOI wafer fabrication: oxygen implantation through oxide to create buried oxide inside silicon, followed by thermal oxidation (called ITOX, for internal oxidation) which anneals silicon and thickens and densifies buried oxide

ITOX the device silicon layer is annealed and simultaneously BOX is densified. Oxidation will consume silicon and make the device silicon layer thinner. SIMOX BOX electrical breakdown voltages are 3–5 MV/cm, but can be increased to 8 MV/cm by ITOX. Thermal oxides exhibit  $\geq 10$  MV/cm breakdown voltages.

Hydrogen implantation in Smart-cut also produces damage, but hydrogen is a light ion, and the damage can be more readily repaired. From a productivity point of view, hydrogen implantation dose (and thus time) in Smart-cut is 10–100 times less than in SIMOX.

One interesting option with SIMOX is patterned (or partial) isolation: oxide masked implantation will result in BOX only at desired locations (Figure 22.12). This can mean local patterns, or larger scale patterns: for example,



**Figure 22.12** Masked implant for SIMOX: BOX is formed only locally. Reproduced from Cheng *et al.* (2005), copyright 2005, Elsevier

the DRAM cell array is done on bulk, while the peripheral circuits are made on SOI.

## 22.6 Bonding Mechanics

Because of local roughness and global waviness, the two wafers will not touch fully. It is possible to estimate the dimensions of cavities which can be closed in the bonding process. The same equations govern both random cavities from wafer irregularities as well as micromachined cavities.

Gap closing is a function of wafer thickness ( $t$ ), wafer mechanical strength determined by Young's modulus ( $E$ ) and Poisson's ratio  $\nu$ , and surface energy  $\gamma$  (about 100 mJ/m<sup>2</sup> for room temperature silicon–silicon bonding) (Figure 22.13).

Cavities of radius  $R$  ( $R > 2t$ ,  $R \gg h$ ) will be closed if the distance between the wafers,  $h$ , is smaller than that given by Equation 22.3, and for cavities of radius  $R$  ( $R < 2t$ ,  $R \gg h$ ) it is given by Equation 22.4:

$$h \leq \frac{R^2}{\sqrt{\frac{2Et^3}{3\gamma(1-\nu^2)}}} \quad (22.3)$$

$$h \leq 3.5\sqrt{\frac{R\gamma(1-\nu^2)}{E}} \quad (22.4)$$

For dissimilar materials (different  $E$ ) more complex formulas have to be used.

Particles between wafers cause non-bonding areas (voids) because wafers cannot conform abruptly to particles (Figure 22.14). The radius of non-bonding area is given by

$$R = \sqrt[4]{\frac{2Et^3}{3\gamma(1-\nu^2)}}\sqrt{h} \quad (22.5)$$



**Figure 22.13** Geometry for analyzing closing of cavities for the case height  $h \ll$  radius  $R$ . Wafer thickness is  $t$



**Figure 22.14** Particle-caused void in bonding: left, a large particle leads to a non-bonded area much larger than the particle itself; right, wafers conform to particles below critical size



Below a critical particle size  $h_{\text{crit}}$  the wafers are able to conform to particles, and void size is practically identical to particle size. This critical size is given by

$$h_{\text{crit}} = 5 \sqrt{\frac{t\gamma}{E}} \quad (22.6)$$

## 22.7 Advanced Wafers

Bonding wafers with more layers on them leads to more complex wafers. This approach can be continued with more wafers (and more grinding, etching and CMP steps), but of course each additional wafer incurs extra costs. One such application is shown in Figure 39.10; there, two buried conductor layers of  $\text{WSi}_2$  are employed.

Bonding of dissimilar materials is made easier if surface quality is good, and low-temperature annealing is sufficient to achieve adequate bond strength. Then it is possible to bond for example silicon to III-V semiconductors like InP or GaAs. The process shown in Figure 22.15 uses epitaxial InGaAs etch stop layer to protect the InP device layer during thinning. Plasma activation of InP enables low-temperature bonding. Of course, all the usual requirements of smoothness and flatness must be fulfilled, too.

### 22.7.1 Silicon-on-diamond

One of the drawbacks (in many but not all applications) of SOI wafers is thermal insulation due to BOX. The alternative dielectrics silicon nitride or aluminum oxide have a thermal conductivity an order of magnitude higher. The ultimate thermal conduction material is diamond, with a thermal conductivity a thousand times higher than silicon dioxide, yet it is an electrical insulator. The wafers are dubbed SOD, for silicon-on-diamond (Figure 22.16). SOD wafer fabrication starts by CVD diamond deposition (Equation 5.11) on a hydrogen-implanted, Smart-cut-like wafer. As discussed in Chapter 5, it is not really diamond,

**Figure 22.15** Heterogeneous bonding: InP to silicon bonding using InGaAs etch stop layer. Redrawn from Arokiaraj *et al.* (2006)

but diamond-like carbon (DLC) and its thermal conductivity is less than that of diamond gems, yet better than that of copper, for films a few micrometers thick. In order to improve the mechanical strength, a thick polysilicon can be deposited on it. Or, if the device processing only tolerates low-temperature processing, copper electroplating instead of polysilicon can be used, to utilize the high thermal conductivity of copper. The cleavage of the carrier wafer at  $450^\circ\text{C}$  completes the process. Strong Si-C bonds are formed at the interface, according to the reaction



During the annealing steps this water will oxidize the silicon wafer. Hydrogen released in wet oxidation will be incorporated in the DLC film, which has hydrogen in it to begin with.

## 22.8 Variety of Wafers

A wafer manufacturer has thousands of specifications. Ingots are different, and the wafering process, especially the



**Figure 22.16** Silicon-on-diamond (SOD): diamond and copper deposition on hydrogen-implanted handle wafer, with splitting in Smart-cut fashion

thermal treatments, adds to the variety. DSP wafers, epitaxial wafers and SOI wafers increase this variety further, as shown in Figure 22.17.

In CMOS bulk wafers command about 60% of the market, epitaxial wafers about 30% and SOI less than 10%. These values are in terms of dollars; in terms of area the shares of epi and SOI are smaller. In MEMS DSP wafers, SOI and epi are more prominent, but overall the MEMS wafer market is just a few percent of the total wafer market. In addition to those prime wafer markets, there are other markets, for solar silicon, test wafers, reclaimed wafers and non-polished wafers. The latter three are important for non-device applications as monitors for individual process steps, or for equipment testing and development. In 2010, 450 mm wafers

**Figure 22.17** From ingot to customer

are available only as test wafers, with basically the mechanical features in place, namely diameter and thickness. These wafers are used by equipment makers in developing reactors, robotics, and associated tools and techniques.

## 22.9 Exercises

- If the CZ ingot neck is 2 mm in diameter, what is the maximum ingot weight that can be pulled before silicon yields catastrophically?
- If COP density in ingot is  $10^5 \text{ cm}^{-3}$ , what is COP density on the wafer surface?
- What is the maximum device silicon thickness in (a) SIMOX and (b) Smart-cut if a 200 keV implanter is used?
- If ultrathin SOI needs to be made, Smart-cut device silicon has to be thinned down, in the extreme case of 6 nm MOS transistors to, for example, 2 nm. Analyze the strengths and weaknesses of doing this 50 nm  $\rightarrow$  2 nm thinning by the following techniques:
  - wet etching in TMAH
  - plasma etching in  $\text{Cl}_2$  or  $\text{SF}_6$
  - CMP
  - thermal oxidation + HF wet etching.
- How thick a silicon dioxide layer will be formed inside silicon when the implant dose is  $2 \times 10^{18} \text{ cm}^{-2}$  in SIMOX?
- If hydrogen ion currents are in the milliamperes range, how many wafers per hour can be implanted for Smart-cut?

7. What is the critical particle radius for 100 mm silicon wafer bonding?
8. What is the non-bonded area caused by a  $0.3 \mu\text{m}$  particle on 150 mm wafers? If 150 mm wafers are specified to have 50 particles of size  $0.3 \mu\text{m}$ , what fraction of wafer area will be unbonded?
9. Analyze metallic bonding by copper as shown below. What are its strengths and limitations?



Redrawn after Chen *et al.* (2005)

## References and Further Reading

- Alexe, M. and U. Gösele (2004) **Wafer Bonding**, Springer.
- Aminzadeh, M. *et al.* (2002) Pseudo epi – cost reduction approach and a paradigm shift in substrate material, *IEEE Trans. Semicond. Manuf.*, **15**, 486–492.
- Arokiaraj, J., S. Vicknesh and A. Ramam (2006) Integration of indium phosphide thin film structures on silicon substrates by direct wafer bonding, *J. Phys.: Conf. Ser.*, **34**, 404–409.
- Borghesi, A. *et al.* (1995) Oxygen precipitation in silicon, *J. Appl. Phys.*, **77**, 4169.
- Celler, G.K. and S. Cristoloveanu (2003) Frontiers in silicon-on-insulator, *J. Appl. Phys.*, **93**, 4955–4978.
- Chen, K.N. *et al.* (2005) Process development and bonding quality investigations of silicon layer stacking based on copper wafer bonding, *Appl. Phys. Lett.*, **87**, 031909
- Cheng, X. *et al.* (2005) Patterned silicon-on-insulator technology for RF Power LDMOSFET, *Microelectron. Eng.*, **81**, 150–155.
- Dornberger, E., D. Temmler and W. van Ammon (2002) Defects in silicon crystals and their impact on DRAM device characteristics, *J. Electrochem. Soc.*, **149**, G226–G231.
- Falster, R., V.V. Voronkov and F. Quast (2000) On the properties of the intrinsic point defects in silicon: a perspective from crystal growth and wafer processing, *Phys. Stat. Solidi (b)*, **222**, 219–244.
- Ghyselen, B. *et al.* (2004) Engineering strained silicon on insulator wafers with the Smart Cut™ technology, *Solid-State Electron.*, **48**, 1285–1296.
- Gösele, U. *et al.* (1999) Wafer bonding for microsystems technologies, *Sens. Actuators*, **74**, 161–168.
- Graff, K. (2000) **Metal Impurities in Silicon-Device Fabrication**, 2nd edn, Springer.
- Matsumura, A. (2003) Technological innovation in low-dose SIMOX wafers fabricated by an internal thermal oxidation (ITOX) process, *Microelectron. Eng.*, **66**, 400–414.
- Miller, D.C. *et al.* (2007) Characteristics of a commercially available silicon-on-insulator MEMS material, *Sens. Actuators*, **A138**, 130–144.
- Muller, T. *et al.* (2000) Assessment of silicon wafer material for the fabrication of integrated circuits sensors, *J. Electrochem. Soc.*, **147**, 1604–1611.
- Müssig, H.-J. *et al.* (2001) Can Si(113) wafers be an alternative to Si(001)? *Microelectron. Eng.*, **56**, 195.
- Pettinato, J.S. and D. Pillai (2005) Technology decisions to minimize 450-mm wafer size transition risk, *IEEE Trans. Semicond. Manuf.*, **18**, 501–509.
- Sama, S. *et al.* (2001) Investigation of Czochralski silicon growth with different interstitial oxygen concentrations and point defect populations, *J. Electrochem. Soc.*, **148**, G517.
- Shimura, F. (ed.) (1994) **Semiconductors and Semimetals: Oxygen in silicon**, Willardson.
- Tang, Z. *et al.* (2009) Effect of nanoscale surface topography on low temperature direct wafer bonding process with UV activation, *Sens. Actuators*, **A151**, 81–86.
- Taraschi, G., A.J. Pitera and E.A. Fitzgerald (2004) Strained Si, SiGe, and Ge on-insulator: review of wafer bonding fabrication techniques, *Solid-State Electron.*, **48**, 1297–1305.
- Tong, Q.-Y. and U. Gösele (1999) **Semiconductor Wafer Bonding**, John Wiley & Sons, Inc.
- Varma, C.M. (1997) Hydrogen-implant induced exfoliation of silicon and other crystal, *Appl. Phys. Lett.*, **71**, 3519.
- von Ammon, W., E. Dornberger and P.O. Hansson (1999) Bulk properties of very large diameter silicon single crystals, *J. Cryst. Growth*, **198/199**, 390–390.

# Special Processes and Materials

This chapter is different from the previous ones: instead of one topic, it is a collection of various specialty techniques used in wafer processing. Some of the techniques are materials specific, some are excellent in one limited application and some require non-standard equipment. Many of them are quite simple, but on the other hand they may not offer full microfabrication benefits. All these techniques can be applied on a wafer scale, just like other microfabrication processes, and this is the difference from techniques to be presented in the following chapter: those are serial methods for (slowly) writing one pattern at a time.

## 23.1 Substrates other than Silicon

A plethora of materials have been used in addition to silicon. Some of these have already been discussed: glasses, fused silica and crystalline quartz (Chapter 19), and polymers, for instance PET and Kapton (Chapter 18). Other substrate materials used in microfabrication include:

- steel
- nickel
- alumina
- AlN
- ZnO
- GaN
- SiC
- sapphire
- PCBs
- LTCCs.

Steel is cheap and available in large areas, which is good for solar cells; nickel can act as an electrode in a fuel cell; alumina and aluminum nitride have excellent microwave properties; ZnO is a transparent

conducting glass; GaN can be used for UV detection; SiC is excellent in high-temperature, harsh-environment cases; sapphire is hard and transparent and can support silicon epitaxy; LTCCs (Low-Temperature Co-fired Ceramics) are ceramics that can be processed easily. PCBs (Printed Circuit Boards) are polymer plates with thick copper metallization ready-made on them. So for larger linewidth applications which require metals on a thermally insulating substrate, PCBs offer a quick starting point.

Compared to round and standardized silicon wafers, these exotic substrates have a number of issues that need to be addressed. Square and rectangular shapes are not well suited to photoresist spinning, but of course laminate resist and spray resist are options. Non-standard sizes complicate processing in equipment designed for silicon processing. Multicrystalline silicon solar cells are most often 5 and 6 inch squares, and they are much thinner than silicon wafers, for example 200 µm. Handling thin wafers is more delicate than with thicker ones. There are also very thick substrates, and for example thermal equilibrium of a thick insulating substrate in a single wafer tool may take a lot longer than with a thin silicon wafer of high thermal conductivity. And some substrates are 10 times heavier than silicon of the same diameter, which may be an issue for handler robots.

Silicon and glass are flat and smooth, which does not apply to many other substrates. For instance, polished metals might have a surface roughness of 100 nm, a hundred times more than silicon. Thin films of 100 nm thickness cannot really cover such a mountainous substrate very well. Some of the substrates are porous, for example all polymers are somewhat porous. This leads to problems in vacuum systems: water vapor (and other chemicals) are slowly released from pores, leading to vacuum poisoning and subsequently to poor metal quality.

## 23.2 Pattern Generation

The general purpose pattern generation methods discussed in Chapter 8 are applicable to all possible pattern shapes: rectangles, circles, wedges, spirals, etc. There are, however, ways to make patterns in much simpler ways, but these methods are usually capable of producing just one shape only, for example lines, dots or circles.

### 23.2.1 Holographic lithography

Holographic lithography, also known as interference lithography, uses intersecting laser beams to create line gratings, dot arrays and 3D meshes. It is very easy to create regular patterns by interfering laser beams, with minimum linewidths in the 100 nm range for visible wavelength lasers. It is also possible to vary the line-to-space ratio, and to produce, for example, 200 nm lines with 200 nm spaces, or with 1000 nm spaces. 3D shapes can also be made (Figure 23.1).

### 23.2.2 Microstereolithography

Rapid prototyping of 3D objects can be accomplished by the photohardening of negative-resist types of polymers. There are several different ways of doing this:

- physical mask and blanket exposure of 2D layers
- virtual mask and blanket exposure of 2D layers
- $x - y$  scanning laser and mechanical  $z$ -stage control
- two-photon laser polymerization

(the latter two are serial microfabrication methods and belong to Chapter 24, really).

In all the approaches the 3D design is broken down to 2D layers which will be hardened layer by layer starting from the bottom layer. After a layer has been exposed,

the  $z$ -controller lowers the stage, and new photopolymer is allowed to cover the previous layer (Figure 23.2). In the masked versions each layer requires a mask, and clearly the physical mask approach is not amenable to any but the simplest of designs. Virtual masks are patterns generated by micromirrors or video projectors that use software to create exposure pattern on-the-fly.

Exposure times are similar to lithography (and no wonder: similar photoactive polymers are used), with a few seconds for layer thicknesses of tens of micrometers. Structures with up to a thousand layers have been made, in a few hours. Minimum feature sizes depend on mask technology but tens of micrometers and larger are usual. If direct laser writing is used, even micrometer sizes are possible. The layer-by-layer structure of the finished objects is clearly seen in Figure 23.3.

In addition to polymers, metallic and ceramic microstructures can be made by stereolithography. Metallic or ceramic powders, with solvents and binders, are hardened just like photoactive polymers. And it is also possible to make metallic microstructures in a roundabout way: a polymeric 3D structure is made by stereolithography, and it is filled by electroplating. Polymer is then removed. Compared to lithographic resist molds, stereolithography enables retrograde shapes and hollow structures to be made.

### 23.2.3 Block copolymer lithography

Block copolymers (BCPs) consist of blocks that are bonded together covalently, but free to rotate and bend in space. If the two blocks are chosen to repel each other, for example by having hydrophobic and hydrophilic ends in the chains, the block copolymer will self-organize into a regular pattern.

The natural size scale for block (co)polymer structures is the size of macromolecules, 5–50 nm. This is very



**Figure 23.1** Built-in 3D meshes by three-laser interference lithography. Reproduced from Sato *et al.* (2006) by permission of IOP



**Figure 23.2** Mask projection stereolithography: lamp, optical system, micromirror array virtual mask, resin tank and *z*-stage. Adapted from Kang *et al.* (2005)



**Figure 23.3** Example 3D objects made by microstereolithography: the magnification on the right clearly reveals the layer-by-layer structure. Reproduced from Stampfli *et al.* (2008) by permission of IOP

attractive for making nanostructures. There are a number of drawbacks, however: the phase separation process is very slow; and a 24 h bake might be needed to form the pattern. Also, while local order can be very good, long-range order of BCP patterns is often less perfect. This might be acceptable for some applications, but fatal in many others.

Depending on the particular polymers, many shapes are possible. For example, dot and hole arrays and line

gratings can be made (Figure 23.4). It is essential that one of the polymers can be selectively etched relative to the other, to create a pattern that can be used as an etch or plating mask.

BCP patterns are molecule size and also molecule thick, 5–50 nm, which means that they are very thin and not very good as etch masks. In Figure 23.5 the BCP has been used as a mask for SiO<sub>2</sub> etching, and the oxide then acts as a hard mask for SOI silicon layer etching.



**Figure 23.4** Block copolymer lithography: left, PS–PB BCP self-organized lines, reproduced from S.O. Kim *et al.* (2003) by permission of Nature Publishing Group; right, PS–b–PMMA BCP self-organized dots between lithographically defined lines, reproduced from Park *et al.* (2003), copyright 2003, Elsevier



**Figure 23.5** Membrane filter fabrication by BCP lithography: PS BCP nanostructures, other processing standard. Reproduced from Black *et al.* (2006), copyright 2006, American Institute of Physics

#### 23.2.4 Colloidal bead lithography

Colloidal lithography is also known by the names of nanobead patterning and micro- and nanoparticle assembly. All the same, the method is based on micro- or nanobeads assembling into a regular hexagonal array upon drying (Figure 23.6). The basic arrangement for nanobead lithography is very easy: spinning or otherwise spreading a nanobead-containing solution on a wafer, and letting it dry in a controlled manner so that the



**Figure 23.6** Bead lithography: single layer of polystyrene beads, after drying. Courtesy Kestas Grigoras, Aalto University

beads will self-organize. Both single layer structures and multilayers can be done, see Figure 23.3.

The beads act like photoresist patterns, for example, and they can be used as etch masks. Or metal can be evaporated in the spaces between the beads, Figure 23.7. These spaces between the beads are triangular, but annealing and ball-up (surface energy minimization) will lead to round metallic nanostructures. Alternatively, the bead array can be buried completely by a thick polymer, and then the beads dissolved, leaving an array of empty spaces, known as inverse opals.



**Figure 23.7** Evaporated metal through bead pattern, and after bead dissolution

## 23.3 Patterning

### 23.3.1 Microcontact printing ( $\mu$ CP)

Microcontact printing ( $\mu$ CP) is a microlithographic version of ink-and-stamp patterning: a soft polymeric stamp (most often made of PDMS) is wetted by “ink,” for example alkanethiol  $\text{CH}_3(\text{CH}_2)_{15}\text{SH}$  or octadecyltrichlorosilane OTS (Figure 5.14), and the wetted stamp is pressed against a gold surface (Figure 23.8). A reaction between thiol and gold leaves a self-assembled monolayer (SAM) pattern on the wafer because the bonds between the ink and surface are stronger than the bonds between the stamp and ink.

Microcontact printing is an example of so-called soft lithographies, which include a number of methods where soft stamps and masters are utilized, as opposed to hard masks of chromium-on-glass type. The claims that soft lithography techniques work without cleanrooms is not true: any technique that intends to produce micron and submicron features requires cleanliness matching those pattern sizes. But if  $100\text{ }\mu\text{m}$  lines are made, then micrometer particles obviously are not of great concern and relaxed cleanliness requirements apply.

SAMs are usually only  $2\text{--}3\text{ nm}$  thick, and their usefulness as a plating, etch or lift-off mask needs to be improved, even though  $20\text{--}30\text{ nm}$  etched depths have been demonstrated, but this is clearly not enough for the majority of applications.

The stamps are soft and elastic, which ensures an intimate contact with the substrate, and allows for minor irregularities. Soft materials are sensitive to printing force: too much pressure will deform the soft stamp and the patterns will be distorted. Stiffer materials offer higher resolution, but worse contact. Hybrid stamps with a hard backplate/soft cushion and a stiff stamping surface have been devised in order to have the best of both worlds (Figure 23.9).

Of course the patterns need not be flat: in fact pyramid-shaped tips are useful (Figure 23.10). If a very small



**Figure 23.8** Microcontact printing on a gold coated surface: left, alkanethiol-inked PDMS stamp and gold-coated wafer; middle, alkanethiol stamped against the gold surface; right, alkanethiol pattern on wafer, PDMS stamp lifted



**Figure 23.9** Composite stamp. Adapted from Michel *et al.* (2001)



**Figure 23.10** PDMS pyramid stamper. Adapted from Huo *et al.* (2008)

pattern is needed, the tip is gently pressed, but if larger patterns are needed, more force can be applied and the soft PDMS will be pressed against the substrate, forming a larger pattern.

The inks must balance between two opposing trends: shorter molecules form better SAMs, but also diffuse faster. Longer molecules result in imperfect SAMs, but experience less diffusion, therefore they allow more accurate pattern edges.

Contact area plays an important role: light-field structures, with small contact area, are easier because the separation force is small. Structures with aspect ratios around 1:1 and structures with uniform fill factors, such as periodic gratings, are easy. If aspect ratios of structures to be stamped differ from unity or from each other considerably, stamping becomes problematic. Structures of low aspect ratio, say 0.1, are subject to sagging, and those with high aspect ratio are subject to lateral instability and collapse (Figure 23.11).



**Figure 23.11** Problems with soft stamps: when spacing is sparse, sagging can take place. In stamps of high aspect ratio, lateral collapse can take place

### 23.3.2 Stamping non-planar objects

PDMS is flexible, and this opens up special applications: patterns can be contact printed on curved surfaces. Gratings on optical fibers have been realized. Similarly, a round object can be rolled over a PDMS stamp and a spiral structure created. Microcoils have been made this way. Alternatively, the PDMS piece can be curved, and used as mold.

## 23.4 Powder Blasting

In powder blasting abrasive particles are ejected toward the wafer at about 100 m/s, leading to removal rates on the order of tens of micrometers per minute. Material removal is based on mechanical crack propagation. This is the same as grinding in wafer thinning, and therefore the methods share the same strengths and limitations: a very high rate of material removal, but rough and damaged surfaces, with the damage extending micrometers inside the material. A powder blasting system is shown schematically in Figure 23.12. In order to powder-blast a full wafer, a scanning system has to be implemented.

When abrasive particles of tens of micrometers are used, the rate is high, up to 1 mm/min, but surface roughness is similar to particle size. When smaller particles are used, the rate is lower but submicrometer roughness can be obtained. A powder-blasted profile is sloped, and while powder blasting can be used to make through-wafer holes, these are not holes with high aspect ratios. Powder blasting is extensively used to blast holes in glass wafers because both wet and dry etching of deep holes is difficult.

Masking in powder blasting requires something that is not eroded by abrasive particles. Hard materials like perforated steel and thick electroplated metals like nickel are candidates. Selectivities on the order of 50:1 are possible with metal masks. Photoresists cannot be used: they are too brittle and will crack. Soft, elastomeric materials



**Figure 23.12** Powder blasting. Reproduced from Cui (2005) with kind permission of Springer Science and Business Media

like PDMS absorb energy from the particles, and are deformed, not broken. Selectivities of 100:1 to 1000:1 are possible with elastomeric masks. The minimum feature sizes that can be made depend not only on mask size, but also on particle size used, with minimum linewidth roughly 10 times the particle size.

One special application of powder blasting is inclined blasting (Figure 23.13). Because the particle jet is directed, the sample can be tilted relative to the beam. This enables free-standing structures to be made.

## 23.5 Deposition

### 23.5.1 Lift-off metallization

Lift-off is metallization with sacrificial resist: after lithography, metal deposition is done on the resist pattern, followed by resist dissolution in solvent and



**Figure 23.13** Inclined angle powder blasting, quartz paddle resonator. Courtesy Martin Gijs, EPFL



**Figure 23.14** Lift-off process: left, metal deposition on resist pattern; right, resist dissolution and metal lift-off

lift-off, with all the metal that is not in contact with substrate being removed (Figure 23.14).

Lift-off metallization should have poor step coverage, and therefore the method of choice is evaporation, even though sputtering can be used too. The deposition process has, however, photoresist-imposed limitations: it must take place under about  $150^{\circ}\text{C}$  temperature because of resist thermal stability. There is always some deposition on the sidewalls, too, but if the films are thin, they are discontinuous, and resist dissolution can take place.

Special techniques have been devised to make lift-off easier. One way is to use negative resists which typically have negative sidewall angles (Figure 9.10). Another technique is to use a two-layer resist (Figure 23.15). The bottom resist, called the lift-off resist (LOR), can be thick because it has no role in defining linewidth. LOR's main



**Figure 23.15** Negative resist and two-layer lift-off resist. SEM micrographs reproduced from Chen *et al.* (2004), copyright 2004, Elsevier

role is to undercut the top resist in development, and it is designed for a high development rate (and good adhesion and thermal stability). The top resist is thin to enable high-resolution patterning.

Lift-off is used in magnetic recording head (GMR heads) fabrication: lead metallization has to make contact with the GMR element as closely as possible. This self-alignment is shown in Figure 23.16 and described in the process flow below.

### Process flow for GMR sensor metallization

- GMR sensor deposition
- Lithography of lift-off resist
- Ion beam etching of GMR layers
- Deposition of hard bias metal
- Deposition of lead metal
- Resist stripping (= lift-off)

Ion beam etching (ion milling) is used to etch the GMR sensor (Figure 7.19) because there are so many different materials that finding RIE gases would be impossible, and therefore argon ion milling is the straightforward way. After milling the GMR head, the same resist acts as the metallization lift-off mask: hard bias metal (CoPtCr) and lead metallization (Au for resistance minimization) are sputtered, and lifted off. Because one resist does it all, the metallization will be automatically aligned to the GMR sensor.



**Figure 23.16** GMR head fabrication: (a) ion beam etching of GMR sensor with two-layer lift-off resist profile; (b) two-layer metallization; (c) lift-off. Adapted from Chilدرس and Fontana (2005)

Lift-off is very general: all metals and their alloys can be patterned with the same basic process. Lift-off is suited for hard-to-etch metals like gold and platinum. A GMR head is also suitable for ion beam etching because the total thickness is only about 40 nm and the slow rate is therefore not a major handicap.

Lift-off is, however, a special technique, and not an alternative to etching in general. If an etching process exists, it is universally used, and lift-off is reserved for those special applications where etching is difficult or impossible. Because lift-off calls for poor step coverage, it is difficult to make lift-off metallization over topography.

### 23.5.2 Shadow masks

Shadow masks (also known as stencil masks) are mechanical aperture plates. Shadow mask patterning is basically lift-off with a mechanical mask instead of a resist mask. The shadow mask is aligned with and temporarily attached to a substrate, and this stack is then positioned in



**Figure 23.17** Deposition with a shadow mask

the deposition system (Figure 23.17). Basically any deposition technique can be used, but line-of-sight methods like evaporation lead to the smallest penumbra under the mask. The smaller gap between the stencil and the wafer minimizes unwanted deposition, too.

Some materials are so sensitive that their deposition has to be the very last process step, for example (bio)chemical sensor films. The application of photoresist on these films is not possible, and solvent dissolution cannot be used, ruling out lift-off. Another reason for using stencil masks is that wafer topology does not allow photoresist spinning: for example, there are through-wafer holes, or free-standing cantilevers. And the shadow mask saves a lot of process steps: resist spinning, baking, exposure, development, rinsing and drying are eliminated.

If the shadow mask and wafer can be aligned with each other in a mask aligner or a bond aligner, micrometer alignment accuracy is possible; often, however, shadow masks are only used for non-critical applications where manual  $\pm 10 \mu\text{m}$  alignment is enough. Minimum linewidths that are possible with shadow masks are in the 10 nm range. In practice, however, such small apertures are quickly clogged by the deposited film, and the sidewall profiles of the deposited structures are far from ideal. But in theory any hole larger than atomic size can be used as a shadow mask. One special limitation of shadow masks is the impossibility of donut-shaped structures. There are, however, tricks around this: by using very narrow bridges and relying on deposition penumbra under the mask, donut shapes can be patterned. The general solution is to perform shadow mask deposition twice, with “half-masks.”

### 23.5.3 Screen printing

Screen printing is an old technique which has emerged as the major technique in solar cell metallization. A perforated screen holding the metallization pattern is



**Figure 23.18** Screen printing: a squeegee pushes metal paste through the gridded screen

stretched over the wafer, and paste is applied to the screen. A squeegee scans the screen, pushing paste through the holes in it (Figure 23.18). Additional drying and firing steps are then required to finish metallization, but this is similar to other thin-film techniques: the film is seldom in final form immediately after deposition. Screen printing is cheaper than vacuum deposition methods, and it can produce thick metal wires, for example 10 µm, reducing resistive losses. Limitations are similar to shadow masking: for example, donut-shaped patterns cannot be made.

Other applications of screen printing in microfabrication include adhesive and glass-frit bonding: the adhesive or glass frit is defined by screen printing. In all these applications rather wide lines are satisfactory, in the range of tens or hundreds of micrometers.

#### 23.5.4 Inclined angle evaporation

The directionality of evaporation, its line-of-sight deposition geometry, is favorable for lift-off, and if this is combined with a tilted wafer, very small structures can be made (Figure 23.19). Some of the smallest ever MOSFETs have been demonstrated by oblique angle evaporation.

#### 23.5.5 Glancing angle deposition

Columnar growth (Figure 7.2) is taken to extremes in glancing angle deposition (GLAD). The wafer is tilted to about 80° relative to the arriving atoms, and the atoms arrive obliquely. With low surface mobility and a shadowing effect from already deposited atoms, tall, narrow crystals will grow. If the initial surface is smooth, random columns will form, and if there are patterns, they will act



**Figure 23.19** Inclined angle evaporation leads to structures smaller than the lithographic feature size

as shadowing elements and GLAD film deposits on top of those (Figure 23.20).

GLAD films are highly porous but electrically conductive. They can be used as top electrodes in chemical sensors: fluids can penetrate through the GLAD film, and interact with the sensor film. Alternatives to GLAD top electrodes include thin percolated metal films – films that are so thin that they are physically discontinuous but electrically continuous. Percolated films have much higher resistance than the much thicker GLAD films. Normal films can also be used, but deposition, lithography and etching of those films on top of soft polymeric sensor films is undesirable.

## 23.6 Porous Silicon

Silicon is not etched in HF. If, however, silicon is made an anode in an electrochemical etching set-up (Figure 23.21), etch rates of about 1 µm/min are observed. Hydrofluoric acid, with or without ethanol and/or water, is used as an electrolyte. Platinum is the standard cathode. Depending on current density, silicon can be etched in two rather different modes: pore formation and electropolishing (Figure 23.22). Both electropolishing and pore formation take place in the anodic regime. In pore formation etching proceeds anisotropically downward leaving a silicon “skeleton” with up to 80% empty space.

The reactions that take place in HF electrolyte are given by





Pore formation starts at the wafer surface from a defect or an intentional initial pit. Electronic holes from the bulk silicon are transported to the surface, and they react at the defect or pit. Further etching occurs at the newly formed pore tips because they attract more holes due to higher electric field strength, and the process leads to a uniform porous layer depth as the holes are consumed by the growing tips and other surfaces are depleted of holes. This etching mode takes place under low hole concentration, and it is limited by hole diffusion, not by mass transfer in the electrolyte cell. If the hole density increases, some holes reach the surface and react there, leading to surface smoothing. This is the electropolishing regime and ionic transfer from the electrolyte plays a role. These two regimes are pictured in Figure 23.22.



**Figure 23.21** Electrochemical etching set-up for making porous silicon



**Figure 23.20** GLAD films: left, random crystals; right, seeded crystals. Courtesy Marianna Kemell, University of Helsinki



**Figure 23.22** Regimes of silicon anodic etching in HF: porous silicon formation and electropolishing. Reproduced from Collins (1997) by permission of Electrochemical Society Inc.

In p-type silicon holes are naturally present, and in n-type silicon they can be generated by illumination. A very wide range of pore sizes from 2 nm to 20  $\mu\text{m}$  can be etched by varying electrolyte concentration, current density and illumination. Pore size dependence on silicon resistivity is shown in Figure 23.23. As a rough guide for n-type silicon, the pore diameter in micrometers is half the resistivity in ohm centimeters: for 1  $\mu\text{m}$  pores in 2 ohm-cm, and 20 nm pores in 0.04 ohm-cm material.

If the initial pits are prepared by lithography and etching, regular arrays of pores can be made (Figure 23.24). Because etch rate depends on etching current, current modulation can be used to increase etch rate, periodically, leading to interesting 3D shapes.



**Figure 23.23** Pore size ranges of electrochemically etched silicon: macroporous, mesoporous and microporous regimes. Reproduced from Lehmann (1995) by permission of IEEE

There are a couple of drawbacks in electrochemical etching (and deposition): electrical contact has to be made to the wafer back side, and this contact has to tolerate the etchant. Concentrated HF (49%) is often employed, which seriously limits the choice of metals. Alternatively, a wafer holder can be used to protect the wafer back side, and any metal is good. However, such a holder takes up area on the wafer front, reducing the number of usable chips.

Porous silicon is single crystalline silicon even though it is a sponge-like network rather than continuous material. Epitaxial deposition on porous silicon is possible, and other thin films can be deposited too. Depending on the process, either pores will be filled or porous material will be covered with a continuous surface of thin-film material.

Conformal CVD into macroporous grooves is no different from CVD into etched grooves of similar dimensions

### 23.6.1 Sacrificial structures using porous silicon

The electrochemical etch rate of n-type silicon (10–20 ohm-cm) in HF electrolyte is very low compared to p-type silicon or low-resistivity n-type silicon (about 0.01 ohm-cm). Doping (by diffusion or epitaxy) can therefore be used to create local porous silicon patterns. Alternatively, protective etch masks can be used as in any etching. Photoresist, silicon nitride, amorphous silicon and silicon carbide are candidates; silicon dioxide cannot be used because of the HF electrolyte, and photoresist is limited to cases with dilute HF.

Free-standing microstructures can be made by depositing films on top of porous silicon, patterning them and then removing the porous silicon (Figure 23.25). Porous silicon presents a curious case where etch selectivity can be obtained between silicon and silicon: porous silicon etching proceeds rapidly because the sidewalls between the pores can be as thin as a few nanometers, whereas solid silicon is attacked from the top surface only. Etch selectivity can be as high as 100 000:1. The material for the free-standing structure can be for instance silicon nitride, but epitaxial silicon can also be used.

Porous silicon is a mechanically weak material, and it can be destroyed by capillary forces during drying. It can also be destroyed by gas bubbles: KOH etching releases hydrogen (Equation 11.1) and if gas evolution is rapid the bubbles can burst porous structures. For this reason dilute KOH, 0.1–1%, is used, rather than the 20–50% typical of silicon anisotropic etching.

In a modification of the above scheme, a free-standing structure can be made from bulk single crystal silicon. A p-type silicon substrate (10 ohm-cm) is used. A deep, heavy p-diffusion is done, followed by a shallow



**Figure 23.24** Porous silicon: (a) random initial pits; (b) lithographically defined initial pits and constant current etching, reproduced from Grigoras *et al.* (2001); (c) lithographically defined and modulated etching, reproduced from Trifonov *et al.* (2008), copyright 2008, Elsevier



**Figure 23.25** Fabrication of a free-standing bridge on a p-type substrate: left, n-diffusion of selected areas, followed by electrochemical etching; middle, bridge material deposition; right, removal of porous silicon in dilute KOH resulting in a bridge over a cavity. Adapted from Hedrich *et al.* (2000)



**Figure 23.26** Single crystal silicon bridges by porous silicon sacrificial layer: left, p-type and n-type diffusions; middle, electrochemical etching; right, KOH etching of porous silicon. Adapted from Lee *et al.* (2000)

n-diffusion (Figure 23.26). Electrochemical etching attacks the p-diffusion and transforms it into porous silicon. The n-type silicon is intact in electrochemical etching. A short KOH etching removes the porous silicon, leaving an n-type single crystal silicon microbridge.

## 23.7 Molding with Lost Mold

Hard-to-etch materials can be made into patterns by the following methods:

- ion beam etching (argon ion milling)
- selective deposition
- blanket deposition and polishing
- molding and mold dissolution.

Ion beam milling is a brute force method. It suffers from very slow rates. Selective deposition depends critically on chemical surface processes which are hard to control. Highly conformal deposition combined with polishing is quite universal, but conformal deposition processes are rare.

Molding is rather a universal process because so many different ways of transporting the material are available. The reverse of the final pattern is fabricated in silicon or resist, filled with the desired material, and the mold removed. AFM tips of silicon nitride are made this way: inverse pyramids are etched into silicon, filled by LPCVD nitride, and silicon is etched away. The diamond structures shown in Figure 23.27 are made similarly: CVD diamond deposition plus silicon wafer etching.

When the mold is completely removed, shape freedom is unlimited. These processes are variously called dissolved wafer, lost mold or disposable mold processes. Because there are no detachment problems (Figure 18.20), the most pressing issue is filling of the mould. If the material to be molded can fill retrograde features, then any shape can be made.

Etch selectivity between silicon and the molded material limits the use of this method: the usual silicon etchants, hot concentrated KOH or HF:HNO<sub>3</sub> mixtures, are very aggressive solutions. Alternatively, silicon can be removed by isotropic SF<sub>6</sub> plasma etching or by XeF<sub>2</sub> or ClF<sub>3</sub> dry etching. A number of devices have been made with silicon molds: for



**Figure 23.27** Diamond microstructures made with silicon wafer disposable molds. Reproduced from Björkman *et al.* (1999) by permission of Elsevier



**Figure 23.28** Silicon mold (top) and cast PDMS replica (bottom): the whole silicon wafer has been etched away to release the PDMS “mushroom”, courtesy Lauri Sainiemi, Aalto University

instance, PZT ultrasonic transducers and parylene needles, in addition to hollow diamond channels. In Figure 23.28 the PDMS “mushrooms” have been made by etching a mold in silicon in a fashion similar to Figure 21.15, filling with PDMS and then completely removing the silicon wafer.

Resist molds are easy to make and electroplating into resist is widely used. Resist molds are, however, usable only once. But resist can be used as a master to make PDMS parts, which can be used as disposable molds. In Figure 23.29 stainless steel powder (with solvent and binder) has been cast into PDMS mold and cured. After



**Figure 23.29** Steel powder molded by a PDMS mould. Courtesy Pia Suvanto, Aalto University

detachment, it is further annealed to improve its mechanical properties.

## 23.8 Exercises

1. How does shadow mask thickness affect the resulting linewidth and profile?
2. What is the gap effect on feature size and shape in shadow mask patterning?
3. When new dielectrics are tested, it is usual to deposit the top electrode through a shadow mask because of speed and simplicity. How much error will be introduced to dielectric constant  $\epsilon_r$  if shadow mask dimensional control is  $100 \pm 5 \mu\text{m}$ ?
4. It has recently been proposed to use shadow masks in ion implantation. Explore the issues that need to be addressed for such an approach.

5. Explain in detail how donut shapes can be made using shadow masks.
6. PDMS with CTE of 300 ppm/ $^{\circ}\text{C}$  is made by molding over a 100 mm silicon wafer. By how much will features shrink? What positional accuracy can be achieved?
7. Explain in detail how the hollow diamond tubes shown in Figure 23.27 are made!
8. What dimensional limitations apply to structures made by the porous silicon sacrificial layer technique?
9. Estimate the feature sizes that can be made by inclined angle evaporation in Figure 23.19!
10. How could shadow mask and lift-off types of methods work for liquid phase patterning?
11. How can the porosity of porous silicon be measured by weighing?
12. What percentage of silicon atoms are surface atoms if porous silicon has a 5 nm pore size and 3 nm wall thickness?
13. Explain the difference (in materials, applications, processes, temperatures, etc.) between the etching and lift-off method.

## References and Related Reading

- Björkman, H. *et al.* (1999) Diamond replicas from microstructured silicon masters, *Sens. Actuators*, **73**, 24.
- Black, C.T. *et al.* (2006) Highly porous silicon membrane fabrication using polymer self-assembly, *J. Vac. Sci. Technol.*, **B24**, 3188–3191.
- Chen, Y., K. Peng and Z. Cui (2004) A lift-off process for high resolution patterns using PMMA/LOR resist stack, *Microelectron. Eng.*, **73–74**, 278–281.
- Childress, J.R. and R.E. Fontana (2005) Magnetic recording read head sensor technology, *C. R. Physique*, **6**, 997–1012.
- Collins, S.C. (1997) Etch stop techniques for micromachining, *J. Electrochem. Soc.*, **144**, 2242.
- Cui, Z. (2005) **Micro-Nanofabrication**, Higher Education Press/Springer.
- Ekkels, P. *et al.* (2009) Evaluation of platinum as a structural thin film material for RF-MEMS devices, *J. Micromech. Microeng.*, **19**, 065010.
- Graff, M. *et al.* (2004) Microstenciling: a generic technology for microscale patterning of vapor deposited materials, *J. Microelectromech. Syst.*, **13**, 956–962.
- Grigoras, K., A.J. Niskanen and S. Franssila (2001) Plasma etched initial pits for electrochemically etched macroporous silicon structures, *J. Micromech. Microeng.*, **11**, 371–375.
- Hawkeye, M.H. and M.J. Brett (2007) Glancing angle deposition: Fabrication, properties, and applications of micro- and nanostructured thin films, *J. Vac. Sci. Technol. A*, **25**, 1317–1335.
- Hedrich, F., S. Billat and W. Lang (2000) Structuring of membrane sensors using sacrificial porous silicon, *Sens. Actuators*, **84**, 315.
- Huij, J.C. (2003) Guided molecular self-assembly: a review of recent efforts, *Smart Mater. Struct.*, **12**, 264–271.
- Hulteen, J.C. and R.P. Van Duyne (1995) Nanosphere lithography: a materials general fabrication process for periodic particle array surfaces, *J. Vac. Sci. Technol.*, **A13**, 1553–1558.
- Huo, F. *et al.* (2008) Polymer pen lithography, *Science*, **321**, 1658–1660.
- Kang, H.-W., J.-W. Rhie and D.-W. Cho (2009) Development of bi-pore scaffold using indirect solid freeform fabrication based on microstereolithography technology, *Microelectron. Eng.*, **86**, 941–944.
- Kim, H.-C. and W.D. Hinsberg (2008) Surface patterns from block copolymer self-assembly, *J. Vac. Sci. Technol.*, **A26**, 1369–1382.
- Kim, S.O. *et al.* (2003) Epitaxial self-assembly of block copolymers on lithographically defined nanopatterned substrates, *Nature*, **424**, 411–414.
- Kim, G.M., M.A.F. van den Boogaart and J. Brugger (2003) Fabrication and application of a full wafer size micro/nano stencil for multiple length-scale surface patterning, *Microelectron. Eng.*, **67–68**, 609–614.
- Lee, C.-S., J.-D. Lee and C.-H. Han (2000) A new wide-dimensional freestanding microstructure fabrication technology using laterally formed porous silicon as a sacrificial layer, *Sens. Actuators*, **84**, 181.
- Lehmann, V. (1995) Porous silicon – a new material for MEMS, Proceedings of IEEE MEMS'95, p. 1.
- Michel, B. *et al.* (2001) Printing meets lithography: soft approaches to high-resolution patterning, *IBM J. Res. Dev.*, **45**, 697–719.
- Park, C., J. Yoon and E.L. Thomas (2003) Enabling nanotechnology with self assembled block copolymer patterns, *Polymer*, **44**, 6725–6760.
- Pawlowski, A.-G., A. Sayah and M.A.M. Gijs (2005) Accurate masking technology for high-resolution powder blasting, *J. Micromech. Microeng.*, **15**, S60–S64.
- Perl, A., D.N. Reinhoudt and J. Huskens (2009) Microcontact printing: limitations and achievements, *Adv. Mater.*, **21**, 2257–2268.
- Sato, H. *et al.* (2006) An all SU-8 microfluidic chip with built-in 3D fine microstructures, *J. Microelectromech. Syst.*, **16**, 2318–2322.
- Sayah, A. *et al.* (2005) Elastomer mask for powder blasting microfabrication, *Sens. Actuators*, **A125**, 84–90.
- Shibata, T. *et al.* (2002) Stencil mask ion implantation technology, *IEEE Trans. Semicond. Manuf.*, **15**, 183–188.

- Stampfl, J. *et al.* (2008) Photopolymers with tunable mechanical properties processed by laser-based high-resolution stereolithography, *J. Micromech. Microeng.*, **18**, 125014.
- Sun, C. *et al.* (2005) Projection micro-stereolithography using digital micro-mirror dynamic mask, *Sens. Actuators*, **A121**, 113–120.
- Syms, R.R.A. *et al.* (2009) Silicon microcontact printing engines, *J. Micromech. Microeng.*, **19**, 025027.
- Trifonov, T. *et al.* (2008) Macroporous silicon: a versatile material for 3D structure fabrication, *Sens. Actuators*, **A141**, 662–669.

# Serial Microprocessing

Microfabrication manufacturability stems largely from the fact that millions or even billions of identical structures are made simultaneously, by lithography, imprinting, embossing or molding, and then etched, implanted, metallized or otherwise further processed in parallel fashion. In this chapter we will discuss different breeds of processes: single features are made, slowly and painstakingly, but with great precision. Pixel by pixel, single feature processing is slow, but there are major benefits: the cost and waiting time for the photomask/master/mold is eliminated, and new designs can be implemented on-the-fly by changing the code that drives the writing tool. The cycle time of experimenting becomes much faster than with mask/master/mold-based techniques. These processes are used in research and development, and whenever a limited number chips are enough. And of course they are used when no alternatives exist, as in photomask writing.

There are two main varieties of serial microprocessing tools: beam based and tip based. The beam spot size and the tip radius vary greatly, but all rely on scanning the sample surface to create the pattern. Focused ion beams (FIBs), focused electron beams (FEBs) and laser beams are all used, with or without additional gases (Figure 24.1). The same beams can also be used to expose resist, but in this chapter we are interested in fabricating the microstructures directly in the final material, sidestepping the intermediate polymer step. The tip-based methods have also been used to write resist patterns, but again we are interested in direct etching/deposition of the final structures.

Vacuum, atmospheric and liquid phase systems are all in use. Ion and electron beams are of course vacuum techniques, but lasers and tips can operate in all environments. However, there are no universal techniques: all are more or less specialized in some materials and some environments.

## 24.1 Focused Ion Beam (FIB) Processing

Ion beams are energetic, as we know from ion implantation (Chapter 15). The processes that occur include not only implantation but also amorphization and sputtering, that is ion beam etching. Secondary electrons are also emitted (Figure 24.2).

In order to achieve high enough beam current, a liquid gallium ion source is commonly used in FIB systems (and in SIMS analysis systems, too). In serial microprocessing ion energies are similar to ion implantation (e.g., 50 keV), but they are focused to very small spot sizes, for example 10 nm at 10 A/cm<sup>2</sup> current density. Sputtering, which is also present in implantation to a minor extent, can be the dominant phenomenon in FIB. The ion beam-induced secondary electron emission is most useful and in fact FIB systems act as electron microscopes, too.

Ion beam etching was mentioned in Chapter 11. In ion beam etching (as in ion implantation) a broad beam system is used. The mechanism is the same for focused beams but now we do not need a mask because etching only takes place where the beam hits. This is used in repairing photomasks: chromium pinspots, protrusions and bridges are etched away (Figure 8.10). The small spot size of FIB is essential in this application. The imaging capability of secondary electrons is also essential.

The FIB etching process is slow, but the rate can be enhanced by adding a reactive gas, for example fluorine-containing gases like trifluoroacetamide (C<sub>2</sub>H<sub>2</sub>F<sub>3</sub>NO) for silicon etching. The same trick is used in broad beam etching where the method is called CAIBE (for chemically assisted ion beam etching) and also RIBE (for reactive ion beam etching). In addition to the slow rate, another major problem with ion beam etching is the high energy which erodes sharp corners, leading to rounding (Figure 24.3).