1 Introduction
This is the material for an intermediate-level MOSFET analog circuit design course, held at JKU under course number 336.009 (“KV Analoge Schaltungstechnik”).
The course makes heavy use of circuit simulation, using Xschem for schematic entry and ngspice for simulation. The 130nm CMOS technology SG13G2 from IHP Microelectronics is used.
Tools and PDK are integrated in the IIC-OSIC-TOOLS Docker image, which will be used during the coursework.
All course material (source code of this document, Jupyter notebooks for calculations, Xschem circuits, etc.) is made publicly available on GitHub (follow this link) and shared under the Apache-2.0 license.
Please feel free to submit pull requests on GitHub to fix errors and omissions!
The production of this document would be impossible without these (and many more) great open-source software products: VS Code
, Quarto
, Pandoc
, TexLive
, Jupyter Notebook
, Python
, Xschem
, ngspice
, CACE
, pygmid
, schemdraw
, Numpy
, Scipy
, Matplotlib
, Pandas
, Git
, Docker
, Ubuntu
, Linux
, …
1.1 IHP’s SG13G2 130nm CMOS Technology
SG13G2 is the name of a 130nm CMOS technology (strictly speaking BiCMOS) from IHP Microelectronics. It features low-voltage (thin-oxide) core MOSFET, high-voltage (thick-oxide) I/O MOSFET, various types of linear resistors, and 7 layers of Aluminum metallization (5 thin plus 2 thick metal layers). This PDK is open-source, and the complete process specification can be found at SG13G2 process specification. While we will not do layouts in this course, the layout rules can be found at SG13G2 layout rules.
For our circuit design, the most important parameters of the available devices are summarized in the following table:
Component | Device Name | Specifications |
---|---|---|
Low-voltage (LV) NMOS | sg13_lv_nmos |
operating voltage (nom.) \(V_\mathrm{DD}=1.5\,\text{V}\), \(L_\mathrm{min}=0.13\,\mu\text{m}\), \(V_\mathrm{th}\approx 0.5\,\text{V}\); isolated NMOS available |
Low-voltage (LV) PMOS | sg13_lv_pmos |
operating voltage (nom.) \(V_\mathrm{DD}=1.5\,\text{V}\), \(L_\mathrm{min}=0.13\,\mu\text{m}\), \(V_\mathrm{th}\approx -0.47\,\text{V}\) |
High-voltage (HV) NMOS | sg13_hv_nmos |
operating voltage (nom.) \(V_\mathrm{DD}=3.3\,\text{V}\), \(L_\mathrm{min}=0.45\,\mu\text{m}\), \(V_\mathrm{th}\approx 0.7\,\text{V}\); isolated NMOS available |
High-voltage (HV) PMOS | sg13_hv_pmos |
operating voltage (nom.) \(V_\mathrm{DD}=3.3\,\text{V}\), \(L_\mathrm{min}=0.45\,\mu\text{m}\), \(V_\mathrm{th}\approx -0.65\,\text{V}\) |
Silicided poly resistor | rsil |
\(R_\square=7\,\Omega \pm 10\%\), \(\text{TC}_1=3100\,\text{ppm/K}\) |
Poly resistor | rppd |
\(R_\square=260\,\Omega \pm 10\%\), \(\text{TC}_1=170\,\text{ppm/K}\) |
Poly resistor high | rhigh |
\(R_\square=1360\,\Omega \pm 15\%\), \(\text{TC}_1=-2300\,\text{ppm/K}\) |
MIM capacitor | cap_cmim |
\(C'=1.5\,\text{fF}/\mu\text{m}^2 \pm 10\%\), \(\text{VC}_1=-26\text{ppm/V}\), \(\text{TC}_1=3.6\text{ppm/K}\), breakdown voltage \(>15\,\text{V}\) |
MOM capacitor | n/a | The metal stack is well-suited for MOM capacitors due to 5 thin metal layers, but no primitive capacitor device is available at this point. |
1.2 Schematic Entry Using Xschem
Xschem is an open-source schematic entry tool with emphasis on integrated circuits. For up-to-date information of the many features of Xschem and the basic operation of it please look at the available online documentation. Usage of Xschem will be learned with the first few basic examples, essentially using a single MOSFET. The usage model of Xschem is that the schematic is hierarchically drawn, and the simulation and evaluation statements are contained in the schematics. Further, Xschem offers embedded graphing, which we will mostly use.
A summary of important Xschem keyboard shortcuts is provided in Section 19.
1.3 Circuit Simulation Using ngspice
ngspice is an open-source circuit simulator with SPICE dependency (Nagel 1975). Besides the usual simulated types like op
(operating point), dc
(dc sweeps), tran
(time domain), ac
(small-signal frequency sweeps), and noise
(small-signal noise analysis), ngspice offers a script-like control interface, where many different simulation controls and result evaluations can be done. For detailed information please refer to the latest online manual.
Important ngspice simulation commands and options (e.g., how to control convergence settings) are listed in Section 20.
1.4 Integrated IC Design Environment (IIC-OSIC-TOOLS)
In order to make use of the various required components (tools like Xschem and ngspice, PDKs like SG13G2) easier, we will use the IIC-OSIC-TOOLS. This is a pre-compiled Docker image which allows to do circuit design on a virtual machine on virtually any type of computing equipment (personal PC, Raspberry Pi, cloud server) on various operating systems (Windows, macOS, Linux). For further information like installed tools, how to setup a VM, etc., please look at IIC-OSIC-TOOLS GitHub page.
Please make sure to receive information about your personal VM access ahead of the course start.
Experienced users can install this image on their personal computer, for JKU students the IIC will host a VM on our compute cluster and provide personal login credentials.
In this course, we assume that students have a basic knowledge of Linux and how to operate it using the terminal (shell). If you are not yet familiar with Linux (which is basically a must when doing integrated circuit design as many tools are only available on Linux), then please check out a Linux introductory course or tutorial online, there are many resources available.
A summary of important Linux shell commands is provided in Section 18.
1.5 Setting up the Design Directory
- Open your VM by entering the URL in your browser.
- Open a terminal (third icon in the taskbar at the bottom). You should get the following prompt:
/foss/designs >
- Clone the git repository into the current directory:
git clone https://github.com/iic-jku/analog-circuit-design.git
- This GitHub repository includes a file called
.designinit
, which sets the PDK and certain paths. However, this must be located in/foss/designs/
- Therefore, we first need to copy it there:
cp analog-circuit-design/.designinit .
- Then we adjust the variable
XSCHEM_USER_LIBRARY_PATH
by opening the file in an editor e.g.nano .designinit
- Change the last line from
export XSCHEM_USER_LIBRARY_PATH=$DESIGNS/xschem
toexport XSCHEM_USER_LIBRARY_PATH=$DESIGNS/analog-circuit-design/xschem
- Change the last line from
- To apply the changes, we need to close the current terminal window:
exit
- Open again a terminal
- Test if the correct PDK gets selected:
echo $PDK
(you should getsg13g2
as the answer) - Change into the GitHub repository:
cd analog-circuit-design
- Start xschem using
xschem
or directly open a specific schematic usingxschem xschem/dc_lv_nmos.sch
1.5.1 Creating Backups
You can easily create backups of your work by creating a zip archive of the complete directory:
- Change to the parent directory:
cd /foss/designs
- Create a zip archive from the complete design folder:
zip backup.zip analog-circuit-design -r
1.5.2 Updating the Repository
- Create a backup!
- Go to directory: e.g.
cd /foss/designs/analog-circuit-design
- Fetch newest changes from the origin:
git fetch origin
- Merge changes from the origin into local branch ‘master’:
git merge origin/main
It is possible that git merge
does not complete successfully. Either you are able to resolve the merge conflict manually, or it may be easier to make a fresh clone of the repository and adding your local changes manually from the backup.
Please think twice before executing any git command without a backup, as this could lead to permanent loss of data!
2 First Steps
In this first chapter we will learn to use Xschem for schematic entry, and how to operate the ngspice SPICE simulator for circuit simulations. Further, we will make ourself familiar with the transistor and other passive components available in the IHP Microelectronics SG13G2 technology. While this is strictly speaking a BiCMOS technology offering MOSFETs as well as SiGe heterojunction bipolar transistors (HBTs), we will use it as a pure CMOS technology, which is available from IHP under the name SG13C.
2.1 The Metal-Oxide-Semiconductor Field-Effect-Transistor (MOSFET)
In this course, we will not dive into semiconductor physics and derive the device operation bottom-up starting from a fundamental level governed by quantum mechanics or a simplified solid-state physics based approach resulting in the well-known square-law model. Instead, we will treat the MOSFET behaviorally by assuming a 4-terminal device, and the performance of this device regarding its terminal voltages and currents we will largely derive from the simulation model.
Since we have an emphasis on integrated circuit design in this course the size of the MOSFET can be adapted by changing its width \(W\) and its length \(L\). As we will see later, \(L\) has a profound impact on the MOSFET performance allowing to trade-off speed versus output conductance versus device-to-device matching. The width \(W\) is more of a scaling parameter to adapt the current density (strictly speaking charge density) forming in the MOSFET channel to a desired current. More about this later.
The circuit symbol that we will use for the n-channel MOSFET is shown in Figure 1, and for the p-channel MOSFET it is shown in Figure 2. A control voltage between gate (“G”) and source (“S”) controls the current flow between drain (“D”) and source. The MOSFET is a 4-terminal device, so the bulk (“B”) can also control the drain-source current flow. Often, the bulk is connected to source, and then the bulk terminal is not shown to declutter the schematics.
Strictly speaking is the drain-source current of a MOSFET controlled by the voltage between gate and bulk (\(V_\mathrm{GB}\)) and the voltage between drain and source (\(V_\mathrm{DS}\)). Since bulk is often connected to source anyway, and many circuit designers historically were already familiar with the operation of the bipolar junction transistor (BJT), it is common to consider the gate-source voltage (besides the drain-source voltage) as the controlling voltage.
This focus on gate-source suggests that the source is special compared to the drain. In a typical physical MOSFET, however, the drain and source are constructed exactly the same (i.e., the MOSFET is a symmetric device), and which terminal is drain, and which terminal is source, is only determined by the applied voltage potentials, and can change dynamically during operation (think of a MOSFET operating as a switch… which side is the drain, which side is the source?).
Unfortunately, this focus on a “special” source has made its way into some MOSFET compact models. The model that is used in SG13G2 luckily uses the PSP model, which is formulated symmetrically with regards to drain and source, and is thus very well suited for analog and RF circuit design. For a detailed understanding of the PSP model please refer to the model documentation.
For hand calculations and theoretical discussions we will use the following simplified large-signal model, shown in Figure 3. A current source \(I_\mathrm{D}\) models the current flow between drain and source, and it is controlled by the three control voltages \(V_\mathrm{GS}\), \(V_\mathrm{DS}\), and \(V_\mathrm{SB}\). Note that in this way (since \(I_\mathrm{D}= f(V_\mathrm{DS})\)) also a resistive behavior between D and S can be modelled. In case that B and S are shorted then simply \(V_\mathrm{SB} = 0\) and \(C_\mathrm{SB}\) is shorted.
In an ideal MOSFET no dc current is flowing into the gate, the behavior is purely capacitive. We model this by two capacitors: \(C_\mathrm{GG}= C_\mathrm{GS}+ C_\mathrm{GD}+ C_\mathrm{GB}\) is the total capacitance when looking into the gate of the MOSFET. \(C_\mathrm{GS}\) is usually the dominant capacitance, and \(C_\mathrm{GD}\) models the capacitive feedback between D and G, usually induced by a topological overlap capacitance in the physical construction of the MOSFET. This capacitance is often small compared to \(C_\mathrm{GS}\), but in situations where we have a large voltage swing at the drain this capacitance will be affected by the Miller effect (see Section 16.1). In hand calculations we will often set \(C_\mathrm{GD}= C_\mathrm{GB}= C_\mathrm{DB}= C_\mathrm{SB}= 0\).
To model a physical MOSFET there will be also a requirement for resistors in the model to account for terminal access resistances (\(R_\mathrm{G}\), \(R_\mathrm{D}\), and \(R_\mathrm{S}\)) as well as resistors to model second-order effects like non-quasistatic operation. For lower frequencies and bulk MOSFETs we will not consider these resistors, and just deal with the capacitive behavior.
Now, as we are skipping the bottom-up approach of deriving the MOSFET large-signal behavior from basic principles, we need to understand the behavior of the elements of the large-signal model in Figure 3 by using a circuit simulator and observing what happens. And generally, a first step in any new IC technology should be to investigate basic MOSFET performance, by doing simple dc sweeps of \(V_\mathrm{GS}\) and \(V_\mathrm{DS}\) and looking at \(I_\mathrm{D}\) and other large- and small-signal parameters.
As a side note, the students who want to understand MOSFET behavior from a physical angle should consult the MOSFET chapter from the JKU course “Design of Complex Integrated Circuits” (VL 336.048). A great introduction into MOSFET operation and fabrication is given in (Hu 2010), which is available freely online and is a recommended read. A very detailed description of the MOSFET (leaving usually no question unanswered) is provided in (Tsividis and McAndrew 2011).
Now, in order to get started, basic Xschem testbenches are prepared, and first simple dc sweeps of various voltages and currents will be done. But before that, please look at the import note below!
Throughout this material, we will largely stick to the following notation standardized by IEEE:
- A dc quantity is shown with an upper-case variable name with upper-case subscripts, like \(V_\mathrm{GS}\).
- Double-subscripts denote dc sources, like \(V_\mathrm{DD}\) and \(V_\mathrm{SS}\).
- An ac (small-signal) quantity (incremental quantity) has a lower-case variable name with a lower-case subscript, like \(g_\mathrm{m}\).
- A total quantity (dc plus ac) is shown as a lowercase variable name with upper-case subscript, like \(i_\mathrm{DS}\).
- An upper-case variable name with a lower-case subscript is used to denote RMS quantities, like \(I_\mathrm{ds}\).
In contrast to the passive devices resistor \(R\), inductor \(L\), and capacitor \(C\), which can only dissipate energy (and are often treated in a linearized fashion), transistors (like the MOSFET) are called “active”, since they can provide signal power amplification. However, transistors can not create energy out of thin air, but merely convert dc energy (supplied by the power supply) into ac energy. They have to have nonlinear transfer characteristics to do this, but it has been shown that a piecewise-linear characteristic is sufficient (Jakoby 2022). This is very good news for circuit design, as usually we strive for linear behaviour!
2.1.1 Large-Signal MOSFET Model
We start with an investigation into the large-signal MOSFET model shown in Figure 3 by using the simple testbench for the LV NMOS shown in Figure 4.
For modelling the MOSFET behavior in a circuit simulator like ngspice different models are available. Some of these models have been widely adopted, like the BSIM (Berkeley Short-channel IGFET Model) or PSP (Philips Penn State) model. The PSP model version 103.6 is used in the IHP SG13G2 PDK for the LV and HV MOSFET. This model has several advantages:
- Physics-based surface-potential model
- Symmetric formulation with respect to drain and source
- Support for mobility reduction, velocity saturation, DIBL, gate current, lateral doping gradient effects, STI stress, NQS, etc.
The PSP 103.6 model documentation can be found here. In chapter 8 the dc operating point output of the model (these parameters can be queried in ngspice) is explained, which is helpful to interpret the simulation output.
Please try to execute the following steps and answer these questions:
- Get the LV NMOS testbench (available at https://github.com/iic-jku/analog-circuit-design/blob/main/xschem/dc_lv_nmos.sch) working in your IIC-OSIC-TOOLS environment.
- Make yourself familiar with Xschem (change the schematic in various ways, run a simulation, graph the result).
- Make yourself familiar with ngspice (run various simulations, save nets and parameters, use the embedded Xschem graphing, explore the interactive ngspice shell to look at MOSFET model parameters).
- Explore the LV NMOS
sg13_lv_nmos
:- How is \(I_\mathrm{D}\) affected by \(V_\mathrm{GS}\) and \(V_\mathrm{DS}\)?
- Change \(W\) and \(L\) of the MOSFET. What is the impact on the above parameters? Can you explain the variations?
- Look at the capacitance values for \(C_\mathrm{GS}\), \(C_\mathrm{GB}\), \(C_\mathrm{GD}\), and \(C_\mathrm{DB}\). How are they affected by \(W\) and \(L\) and by changing the bias conditions (play with \(V_\mathrm{GS}\) and \(V_\mathrm{DS}\))?
- When looking at the model parameters in ngspice, you see that there is a \(C_\mathrm{GD}\) and a \(C_\mathrm{DG}\). Why is this, what could be the difference? Sometimes these capacitors show a negative value, why? (Hint: Study Note 1)
- Build testbenches in Xschem for the LV PMOS, the HV NMOS, and the HV PMOS. Explore the different results.
- For a given \(W\) and \(L\), which device provides more drain current? How are the capacitances related?
- If you would have to size an inverter, what would be the ideal ratio of \(W_p/W_n\)? Will you exactly design this ratio, or are the reasons to deviate?
- There are LV and HV MOSFETs, and you investigated the difference in performance. What is the rationale when designing circuits for selection either an LV type, and when to choose an HV type?
- Build a test bench to explore the body effect, start with LV NMOS.
- What happens when \(V_\mathrm{SB}\neq 0\)?
2.1.2 Small-Signal MOSFET Model
As you have seen in the previous investigations, the large-signal model of Figure 3 describes the behavior of the MOSFET across a wide range of voltages applied at the MOSFET terminals. Unfortunately, for hand analysis dealing with a nonlinear model is close to impossible, at the very least it is quite tedious.
However, for many practical situations, we bias a MOSFET with a set of dc voltages applied to its terminal, and only apply small signal excursions during operation. If we do this, we can linearize the large-signal model in this dc operating point, and resort to a small-signal model which can be very useful for hand calculations. Many experienced designers analyze their circuits by doing these kind of hand calculations and describing the circuit analytically, which is a great way to understand fundamental performance limits and relationships between parameters.
We will use the small-signal MOSFET model shown in Figure 5 for this course. The current-source \(i_\mathrm{d}= g_\mathrm{m}v_\mathrm{gs}\) models the drain current \(I_\mathrm{D}\) as a function of \(V_\mathrm{GS}\) with
\[ g_\mathrm{m}= \frac{\partial I_\mathrm{D}(V_\mathrm{GS}, V_\mathrm{DS}, V_\mathrm{SB})}{\partial V_\mathrm{GS}}, \]
and the resistor \(g_\mathrm{ds}\) models the dependency of the drain current by \(V_\mathrm{DS}\):
\[ g_\mathrm{ds}= \frac{\partial I_\mathrm{D}(V_\mathrm{GS}, V_\mathrm{DS}, V_\mathrm{SB})}{\partial V_\mathrm{DS}} \]
The drain current dependency on the source-bulk voltage (the so-called “body effect”) is introduced by the current source \(i_\mathrm{d}= g_\mathrm{mb} v_\mathrm{sb}\):
\[ g_\mathrm{mb}= \frac{\partial I_\mathrm{D}(V_\mathrm{GS}, V_\mathrm{DS}, V_\mathrm{SB})}{\partial V_\mathrm{SB}} \]
As has been mentioned before, in many situations (and whenever we want to use a simplified model) we connect source and bulk of the MOSFET together. This results in the much simplified small-signal model shown in Figure 6.
As any electronic device the MOSFET introduces noise into the circuit. In this course we will only consider the drain-source current noise of the MOSFET, given by
\[ \overline{I_\mathrm{n}^2} = 4 k T \gamma g_\mathrm{d0}, \tag{1}\]
where \(\overline{I_\mathrm{n}^2}\) is the one-sided power-spectral density of the noise in A\(^2\)/Hz; \(k\) is the Boltzmann constant; \(T\) is the absolute temperature; \(\gamma\) is a (fitting) parameter in simplified theory changing between \(\gamma = 2/3\) in saturation and \(\gamma =1\) for triode operation; \(g_\mathrm{d0}\) is equal to \(g_\mathrm{m}\) in saturation and \(g_\mathrm{ds}\) in triode).
Sometimes we will refer to different operating modes of the MOSFET like “saturation” or “triode.” Generally speaking, when the drain-source voltage is small, then the MOSFET acts as a voltage-controlled resistor (since the impact of both \(V_\mathrm{GS}\) and \(V_\mathrm{DS}\) on \(I_\mathrm{D}\) is large), and this mode of operation we call “triode” mode.
When the drain-source voltage \(V_\mathrm{DS}\) is increased, at some point the drain-source current saturates and is only a weak function of the drain-source voltage, while still being well controlled by \(V_\mathrm{GS}\). This mode is called “saturation” mode.
As you can see in the large-signal investigations, these transitions happen gradually, and it is difficult to define a precise point where one operating mode switches to the other one. In this sense we use terms like “triode” and “saturation” only in an approximate sense.
We can also consider an even more reduced small-signal MOSFET model compared to Figure 6, which is shown in Figure 7. In this, we just consider the transconductance \(g_\mathrm{m}\), the input capacitor \(C_\mathrm{gg}\), as well as the output conductance \(g_\mathrm{ds}\). Note that we can redraw the pi-model of Figure 7 into the \(\tau\)-model of Figure 8. Depending on the circuit configuration, either the first or the second form results in simpler calculations of the circuit equations.
A metric which is useful to assess the speed of a MOSFET is the so-called transit frequency \(f_\mathrm{T}\). It is defined as the frequency where the small-signal current gain (output current divided by the input current) of a MOSFET driven by a voltage-source at the input and loaded by a voltage source at the output drops to unity (reaches one). It can easily be derived using the simplified MOSFET small-signal model of Figure 6 by driving it with a voltage source and shorting the output to (neglecting the feed-forward current introduced by \(C_\mathrm{gd}\)) \[ \omega_\mathrm{T} = 2 \pi f_\mathrm{T} \approx \frac{g_\mathrm{m}}{C_\mathrm{gg}} = \frac{g_\mathrm{m}}{C_\mathrm{gs}+ C_\mathrm{gd}+ C_\mathrm{gb}}. \tag{2}\] This frequency is an extrapolated frequency where the MOSFET operation is dominated by several second-order effects (hence Equation 2 is not valid any longer). A rule-of-thumb is to use a MOSFET up to approximately \(f_\mathrm{T} / 10\). In any case, \(f_\mathrm{T}\) is a proxy of the speed of a MOSFET; in other words, how much input capacitance \(C_\mathrm{gg}\) is incurred when creating a certain \(g_\mathrm{m}\).
As a home exercise, try to derive Equation 2 starting from Figure 6. By showing this transformation you can proof that indeed both circuits are electrically equivalent.
Now we need to see how the small-signal parameters seen in Figure 5 can be investigated and estimated using circuit simulation.
Please try to execute the following steps and answer the following questions:
- Reuse the LV NMOS testbench (available at https://github.com/iic-jku/analog-circuit-design/blob/main/xschem/dc_lv_nmos.sch).
- Explore the LV NMOS
sg13_lv_nmos
:- How are \(g_\mathrm{m}\) and \(g_\mathrm{ds}\) changing when you change the dc node voltages?
- What is the ratio of \(g_\mathrm{m}\) to \(g_\mathrm{mb}\)? What is the physical reason behind this ratio (you might want to revisit MOSFET device physics at this point)?
- Take a look at the device capacitances \(C_\mathrm{gs}\), \(C_\mathrm{gd}\), and \(C_\mathrm{gb}\). Why are they important? What is the \(f_\mathrm{T}\) of the MOSFET?
- Look at the drain noise current according to the MOSFET model and compare with a hand calculation of the noise. In the noise equation there is the factor \(\gamma\), which in triode is \(\gamma=1\) and in saturation is \(\gamma=2/3\) according to basic text books. Which value of \(\gamma\) are you calculating? Why might it be different?
- Go back to your testbench for the LVS PMOS
sg13_lv_pmos
:- What is the difference in \(g_\mathrm{m}\), \(g_\mathrm{ds}\), and other parameters between the NMOS and the PMOS? Why could they be different?
A Maxwell capacitance matrix (Maxwell 1873) provides the relation between voltages on a set of conductors and the charges on these conductors. For a given conductor set with \(N\) conductors (and thus \(N\) terminals) the relation is \[ \mathbf{Q} = \mathbf{C} \cdot \mathbf{V} \] where \(\mathbf{Q}\) is a vector of the charges on the \(N\) conductors, \(\mathbf{C}\) is a \(N \times N\) capacitance matrix, and \(\mathbf{V}\) is the potential vector. In the case of two conductors and physical capacitances between them, \(\mathbf{C}\) is given by \[ \mathbf{C} = \begin{pmatrix} C_{11} + C_{12} & -C_{12} \\ -C_{21} & C_{21} + C_{22} \\ \end{pmatrix} \] where \(C_{xx} = \partial Q_x / \partial V_x\) is the auto capacitance from a conductor \(x\) towards infinity (ground), and \(C_{xy} = \partial Q_x / \partial V_y\) is the mutual capacitance from node/conductor \(x\) to node/conductor \(y\). For a physical capacitor \(C_{xy} = C_{yx}\).
Using the above equation to calculate \(Q_1\) (the charge on conductor \(1\)) results in \[ Q_1 = ( C_{11} + C_{12} ) V_1 - C_{12} V_2 = C_{11} (V_1 - 0) + C_{12} (V_1 - V_2) \] which is the expected result.
Such a Maxwell capacitance formulation is also used in the MOSFET model to describe the charge at a terminal as a function of potential at another terminal. So, \[ C_\mathrm{GD}= \frac{\partial Q_\mathrm{G}}{\partial V_\mathrm{D}} \] or \[ C_\mathrm{GG}= \frac{\partial Q_\mathrm{G}}{\partial V_\mathrm{G}} \] with \(Q_\mathrm{G}\) the charge at terminal G in response to either \(V_\mathrm{D}\) or \(V_\mathrm{G}\). Note that in a MOSFET, generally \(C_{xy} \ne C_{yx}\)!
2.2 Conclusion
Congratulations for making it thus far! By now you should have a solid grasp of the tool handling of Xschem and ngspice, and you should be familiar with the large- and small-signal operation of both NMOS and PMOS, and the parameters describing these behaviors. If you feel you are not sufficiently fluent in these things, please go back to the beginning of Section 2.1 and revisit the relevant sections, or dive into further reading about the MOSFET operation, like in (Hu 2010).
3 Transistor Sizing Using gm/ID Methodology
When designing integrated circuits it is an important question how to select various parameters of a MOSFET, like \(W\), \(L\), or the bias current \(I_\mathrm{D}\). In comparison to using discrete components in PCB design, or also compared to a bipolar junction transistor (BJT), we have these degrees of freedom, which make integrated circuit design so interesting.
Often, transistor sizing in entry-level courses is based on the square-law model, where a simple analytical equation for the drain current can be derived. However, in nanometer CMOS, the MOSFET behavior is much more complex than these simple models. Also, this highly simplified derivations introduce concepts like the threshold voltage or the overdrive voltage, which are interesting from a theoretical viewpoint, but bear little practical use.
One of the many simplifications of the square-law model is that the mobility of the charge carriers is assumed constant (it is not). Further, the existence of a threshold voltage is assumed, but in fact this voltage is just existing given a certain definition, and depending on definition, its value changed. In addition, in nm CMOS, the threshold voltage is a function on many thing, like \(W\) and \(L\).
An additional shortcoming of the square-law model is that it is only valid in strong inversion, i.e. for large \(V_\mathrm{GS}\) where the drain current is dominated by the drift current. As soon as the gate-source voltage gets smaller, the square-law model breaks, as the drain current component based on diffusion currents gets dominant. Modern compact MOSFET models (like the PSP model used in SG13G2) use hundreds of parameters and fairly complex equations to somewhat properly describe MOSFET behavior over a wide range of parameters like \(W\), \(L\), and temperature. A modern approach to MOSFET sizing is thus based on the thought to use exactly these MOSFET models, characterize them, put the resulting data into tables and charts, and thus learn about the complex MOSFET behavior and use it for MOSFET sizing.
Being a well-established approach we select the \(g_\mathrm{m}/I_\mathrm{D}\) methodology introduced by P. Jespers and B. Murmann in (Jespers and Murmann 2017). A brief introduction is available here as well.
The \(g_\mathrm{m}/I_\mathrm{D}\) methodology has the huge advantage that it catches MOSFET behavior quite accurately over a wide range of operating conditions, and the curves look very similar for pretty much all CMOS technologies, from micrometer bulk CMOS down to nanometer FinFET devices. Of course the absolute values change, but the method applies universally.
3.1 MOSFET Characterization Testbench
In order to get the required tabulated data we use a testbench in Xschem which sweeps the terminal voltages, and records various large- and small-signal parameters, which are then stored in large tables. The testbench for the LV NMOS is shown in Figure 9, and the TB for the LV PMOS is shown in Figure 10.
The testbenches are relatively straightforward, with one exception: The drain current noise is sensed via the drain voltage source vd
and converted to a noise voltage (node n
) using a current-controlled voltage source (CCVS). This is necessary as the .noise
simulation statement works with voltages.
We will use Jupyter notebooks to inspect the resulting data, and interpret some important graphs. This will greatly help to understand the MOSFET behavior.
3.2 NMOS Characterization in Saturation
First, we will start looking at the LV NMOS. In Section 3.3 we have the corresponding graphs for the LV PMOS. In this lecture, we will only use the LV MOSFETs. While there are also the HV types available, they are mainly used for high-voltage circuits, like circuits connecting to the outside world. Here, we only will design low-voltage circuits running at a nominal supply voltage of \(1.5\,\text{V}\), so only the LV types are of interest to us.
In the plots that follow we will set \(V_\mathrm{ds}= V_\mathrm{DD}/2\) to keep the MOSFET in saturation (as this is the region of operation where most MOSFET are operated when working in class-A). We will later also look at the MOSFET performance in triode operation, as this is the operation mode where the MOSFET is used as a switch. While the \(g_\mathrm{m}/I_\mathrm{D}\) method is primarily intended to be used for circuits where the MOSFETs are held in saturation and are biased by a certain bias current (usually referred to class-A), the generated tables using the testbenches of Section 3.1 contain the data for all MOSFET bias points.
The first import graph is the plot of \(g_\mathrm{m}/I_\mathrm{D}\) and \(f_\mathrm{T}\) versus the gate-source voltage \(V_\mathrm{GS}\). First let us answer the question why \(g_\mathrm{m}/I_\mathrm{D}\) is a good parameter to look at, and actually this is also the central parameter in the \(g_\mathrm{m}/I_\mathrm{D}\) methodology. In many circuits we want to get a large amplification from a MOSFET, which corresponds to a large \(g_\mathrm{m}\). We want to achieve this by spending the minimum biasing current possible (ideally zero), as we almost always design for lowest power consumption. Thus, a high \(g_\mathrm{m}/I_\mathrm{D}\) ratio is good.
Designing for minimum power consumption is pretty much always mandated. For battery-operated equipment it is a paramount requirement, but also in other equipment electrical energy consumption is a concern, and often severely limited by the cooling capabilities of the electrical system.
However, as can be seen in the below plot, there exists a strong and unfortunate trade-off with device speed, characterized here by the transit frequency \(f_\mathrm{T}\). It would be ideal if there exists a design point where we get high transconductance per bias current concurrently to having the fastest operation, but unfortunately, this is clearly not the case. The \(g_\mathrm{m}/I_\mathrm{D}\) peaks for \(V_\mathrm{GS}< 0.3\,\text{V}\), and the highest speed we get at \(V_\mathrm{GS}\approx 1.2\,\text{V}\). The dashed vertical line plots the nominal threshold voltage, as you can see in this continuum of parameter space, it marks not a particularly special point.
Note that \[ \frac{g_\mathrm{m}}{I_\mathrm{D}} = \frac{1}{n V_\mathrm{T}} \tag{3}\] for a MOSFET in weak inversion (i.e., small gate-source voltage). \(n\) is the subthreshold slope, and \(V_\mathrm{T} = k T / q\) which is \(25.8\,\text{mV}\) at \(300\,\text{K}\). We thus have \(n \approx 1.38\) for this LV NMOS, which falls nicely into the usual range for \(n\) of \(1.3\) to \(1.5\) for bulk CMOS (FinFET have \(n\) very close to \(1\)).
For the classical square-law model of the MOSFET in strong inversion, \(g_\mathrm{m}/I_\mathrm{D}\) is given as \[ \frac{g_\mathrm{m}}{I_\mathrm{D}} = \frac{2}{V_\mathrm{GS}- V_\mathrm{th}} = \frac{2}{V_\mathrm{od}} \tag{4}\] with \(V_\mathrm{th}\) the threshold voltage and \(V_\mathrm{od}\) the so-called “overdrive voltage.” The latter is sometimes also dubbed the effective gate-source voltage \(V_\mathrm{eff}\) (Carusone, Johns, and Martin 2011).
Why are we so often using a temperature of \(300\,\text{K}\) for a typical condition? As this corresponds to roughly \(27^{\circ}\text{C}\), this accounts for some self heating compared to otherwise cooler usual room temperatures. Further, engineers like round numbers which are easy to remember, so \(300\,\text{K}\) is used as a proxy for room temperature.
As we can also see from belows plot, the peak transit frequency of the LV NMOS is about \(75\,\text{GHz}\), which allows building radio-frequency circuits up to ca. \(f_\mathrm{T} / 10 = 7.5\,\text{GHz}\), which is a respectable number. It is no coincidence, that the transition for RF design in the GHz-range switched from BJT-based technologies to CMOS roughly in the time frame when 130nm CMOS became available (ca. 2000).
Note that \(f_\mathrm{T}\) saturates and even decreases again at around \(V_\mathrm{GS}= 1.2\,\text{V}\) due to second-order effects of the transistor like velocity saturation and DIBL. Velocity saturation describes the saturation of the velocity of the electrons at a certain \(V_\mathrm{GS}\). As a consequence, also \(g_\mathrm{m}\) saturates. Since \(f_\mathrm{T} \propto g_\mathrm{m}/ C_\mathrm{gs}\), \(f_\mathrm{T}\) can not further increase. DIBL explains the effect when a too high \(V_\mathrm{GS}\) is applied and therefore the channel is confined to a narrow region at the surface, leading to more carrier scattering and thus lower mobility.
The following figure plots \(f_\mathrm{T}\) against \(g_\mathrm{m}/I_\mathrm{D}\) for several different \(L\). As you can see, device speed maximizes for a low \(g_\mathrm{m}/I_\mathrm{D}\) and a short \(L\). Note, that the drain-source voltage is kept at \(V_\mathrm{DS}= 0.75\,\text{V} = V_\mathrm{DD}/ 2\), which is a typical value keeping the MOSFET in saturation across the characterization sweeps. Further, the source-bulk voltage is kept at \(V_\mathrm{SB}= V_\mathrm{S} - V_\mathrm{B} = 0\,\text{V}\), which means bulk and source terminals are connected. If \(V_\mathrm{SB}\neq 0\,\text{V}\), then the so-called body effect, bulk effect or back-gate effect occurs.
If \(V_\mathrm{SB}> 0\,\text{V}\), then \(V_\mathrm{th}\) increases. If \(V_\mathrm{SB}< 0\,\text{V}\), then \(V_\mathrm{th}\) decreases. At first glance, this effect may sound unwanted (and often also is), however, improved circuit designs by changing the bulk potential can be realized. For example, circuit 16 in (Pretl and Eberlein 2021) shows an implementation of a current mirror by exploiting the body effect. However, keep in mind that a deep-n-well (DNW), also called triple-well, is needed for NMOS transistors when the bulk is not tied to GND. This DNW may not be supported by the used PDK or can add additional costs to the wafer processing.
The next plot shows the ratio of \(g_\mathrm{m}/ g_\mathrm{ds}\) versus \(g_\mathrm{m}/I_\mathrm{D}\). The ratio \(g_\mathrm{m}/ g_\mathrm{ds}\) is the so-called “self-gain” of the MOSFET, and shows the maximum voltage gain we can achieve in a single transistor configuration. As one can see the self gain increases for increasing \(L\), but this also gives a slower transistor, so again there is a trade-off. This plot allows us to select the proper \(L\) of a MOSFET if we know which amount of self gain we need.
The following figure plots the drain current density \(I_\mathrm{D}/W\) as a function of \(g_\mathrm{m}/I_\mathrm{D}\) and \(L\). With this plot we can find out how to set the \(W\) of a MOSFET once we know the biasing current \(I_\mathrm{D}\), the \(L\) (selected according to self gain, \(f_\mathrm{T}\), and other considerations) and the \(g_\mathrm{m}/I_\mathrm{D}\) design point we selected. The drain current density \(I_\mathrm{D}/W\) is a very useful normalized metric to use, because the physical action in the MOSFET establishes a charge density in the channel below the gate, and the changing of the \(W\) of the device merely transforms this charge density into an absolute parameter (together with \(L\)).
The following plot shows the minimum drain-source voltage \(V_\mathrm{ds,sat}\) that we need to establish in order to keep the MOSFET in saturation. As you can see, this value is almost independent of \(L\), and increases for small \(g_\mathrm{m}/I_\mathrm{D}\). So for low-voltage circuits, where headroom is precious, we tend to bias at \(g_\mathrm{m}/I_\mathrm{D}\ge 10\), wheres for fast circuits we need to go to small \(g_\mathrm{m}/I_\mathrm{D}\le 5\) requiring substantial voltage headroom per MOSFET stage that we stack on top of each other.
For analog circuits the noise performance is usually quite important. Thermal noise of a resistor (the Johnson-Nyquist noise) has a flat power-spectral density (PSD) given by \(\overline{V_\mathrm{n}^2}/\Delta f = 4 k T R\), where \(k\) is Boltzmann’s constant, \(T\) absolute temperature, and \(R\) the value of the resistor (the unit of \(\overline{V_\mathrm{n}^2}/\Delta f\) is \(\text{V}^2/\text{Hz}\)). This PSD is essentially flat until very high frequencies where quantum effects start to kick in.
We usually leave the \(\Delta f\) away for a shorter notation, so we write \(\overline{V_\mathrm{n}^2}\) when we actually mean \(\overline{V_\mathrm{n}^2}/\Delta f\). In case of doubt look at the unit of a quantity, whether is shows \(\text{V}^2\) or \(\text{V}^2/\text{Hz}\) or \(\text{V}/\sqrt{\text{Hz}}\) (or \(\text{I}^2\) or \(\text{I}^2/\text{Hz}\) or \(\text{I}/\sqrt{\text{Hz}}\)).
Please also note that the pair of \(k T\) pretty much always shows up together, so when you do a calculation and you miss the one or the other, that is often a sign for miscalculation. Boltzmann’s constant \(k = 1.38 \cdot 10^{-23}\,\text{J/K}\) is just a scaling factor from thermal energy expressed as a temperature \(T\) to energy \(E = k T\) expressed in Joule.
Further, when working with PSD there is the usage of a one-sided (\(0 \ge f < \infty\)) or two-sided power spectral density (PSD) (\(-\infty < f < \infty\)). The default in this lecture is the usage of the one-sided PSD.
In this lecture the only MOSFET noise we consider is the drain noise (as discussed in Section 2.1.2), showing up as a current noise between drain and source. For a realistic MOSFET noise model, also a (correlated) gate noise component and the thermal noise of the gate resistance needs to be considered.
The factor \(\gamma\) (Equation 1) is a function of many things (in classical theory, \(\gamma = 2/3\) in saturation and \(\gamma = 1\) in triode), and it is characterized in the following plot as a function of \(g_\mathrm{m}/I_\mathrm{D}\) and \(L\). So when calculating MOSFET noise we can lookup \(\gamma\) in the below plot, and use Equation 1 to calculate the effective drain current noise.
In a MOSFET, unfortunately, besides the thermal noise according to Equation 1, there is also a substantial low-frequency excess noise, called “flicker noise” due to its characteristic \(\overline{I_\mathrm{d,nf}^2} = K_\mathrm{f}/f\) behavior (this means that this noise PSD decreases versus frequency). In order to characterize this flicker noise the following plot shows the cross-over frequency \(f_\mathrm{co}\), where the flicker noise is as large as the thermal noise. As can be seen in the below plot, this frequency is a strong function of \(L\) and \(g_\mathrm{m}/I_\mathrm{D}\). Generally, the flicker noise is proportional to \((W L)^{-1}\), so the larger the device is, the lower the flicker noise. The parameter \(g_\mathrm{m}/I_\mathrm{D}\) largely stays constant when we keep \(W/L\) constant, so for a given \(g_\mathrm{m}/I_\mathrm{D}\) flicker noise is proportional to \(1/L^2\). However, increasing \(L\) lowers device speed dramatically, so here we have a trade-off between flicker-noise performance and MOSFET speed, and this can have dramatic consequences for high-speed circuits.
The physical origin of flicker noise is the crystal interface between silicon (Si) and the silicon dioxide (SiO2). Since these are different materials, there are dangling bonds, which can capture charge carriers traveling in the channel. After a random time, these carriers are released, and flicker noise is the result. The amount of flicker noise is a function of the manufacturing process, and will generally be different between device types and wafer foundries.
As you can see in the following plot, \(f_\mathrm{co}\) can reach well into the 10’s of MHz for short MOSFETs, significantly degrading the noise performance of a circuit.
During the design phase, it might be convenient to have an overview of the most relevant sizing plots of the NMOST on one page. This overview can be downloaded here.
3.3 PMOS Characterization in Saturation
In the following, we have the same plots as discussed in Section 3.2, but now for the PMOS.
In all PMOS plots we plot positive values for voltages and currents, to have compatible plots to the NMOS. Of course, in a PMOS, voltages and currents have different polarity compared to the NMOS.
\(g_\mathrm{m}/I_\mathrm{D}\) and \(f_\mathrm{T}\) versus the gate-source voltage \(V_\mathrm{GS}\):
\(f_\mathrm{T}\) against \(g_\mathrm{m}/I_\mathrm{D}\) for several different \(L\). One can see a significantly lower top speed for the PMOS compared to the NMOS, which means for high-speed circuits the NMOS should be used. The reason for this is the approximately two to three times higher mobility of electrons compared to the mobility of holes.
\(g_\mathrm{m}/ g_\mathrm{ds}\) versus \(g_\mathrm{m}/I_\mathrm{D}\). Unfortunately, one can see a modelling error for the PMOS in this plot. The self gain \(g_\mathrm{m}/ g_\mathrm{ds}\) reaches non-physical values, which indicates an issue with the \(g_\mathrm{ds}\) modelling for the PMOS. We can not use these values for our circuit sizing, so we will use the respective NMOS plots also for the PMOS.
This example shows how important it is to benchmark the device models when starting to use a new technology. Modelling artifacts like the one shown are quite often happening, as setting up the device compact models and parametrize them according to measurement data is a very complex task. In any case, just be aware that modelling issues could exist in whatever PDK you are going to use!
Drain current density \(I_\mathrm{D}/W\) as a function of \(g_\mathrm{m}/I_\mathrm{D}\) and \(L\):
Minimum drain-source voltage \(V_\mathrm{ds,sat}\) versus \(g_\mathrm{m}/I_\mathrm{D}\) and \(L\):
Noise factor \(\gamma\) versus \(g_\mathrm{m}/I_\mathrm{D}\) and \(L\):
Flicker noise corner frequency \(f_\mathrm{co}\) versus \(g_\mathrm{m}/I_\mathrm{D}\) and \(L\). If you compare this figure carefully with the NMOS figure you can see that for some operating points the flicker noise for the PMOS is lower than for the NMOS. This is often true for CMOS technologies, so it can be an advantage to use a PMOS transistor in places where flicker noise is critical, like an OTA input stage. Using PMOS has the further advantage that the bulk node can be tied to source (which for NMOS is only possible in a triple-well technology, which is often not available), which gets rid of the body effect.
During the design phase, it might be convenient to have an overview of the most relevant sizing plots of the PMOST on one page. This overview can be downloaded here.
3.4 NMOS and PMOS Characterization in Triode
Besides using the MOSFET as a transconductor in saturation we often use the MOSFET as a switch in triode mode (to either switch voltages or currents). In this triode/switch mode of operation we are mainly interested in two parameters:
- The resistance of the switch/MOSFET when it is turned on (\(R_\mathrm{on} = 1 / g_\mathrm{ds}\)).
- The shunt capacitance of the switch when it is turned off (\(C_\mathrm{off}\) is defined by the coupling capacitances between drain and source).
In the operation of the NMOS as a switch the gate is usually pulled to \(V_\mathrm{DD}\) and the bulk is permanently connected to \(V_\mathrm{SS}\) to achieve the lowest \(R_\mathrm{on}\) (to turn the switch off the gate is pulled towards \(V_\mathrm{SS}\)). Likewise, to turn on a PMOS, the gate is usually pulled to \(V_\mathrm{SS}\) and the bulk is connected to \(V_\mathrm{DD}\). In this situation the drain/source potential is somewhere between \(V_\mathrm{DD}\) and \(V_\mathrm{SS}\), so the MOSFET will experience a \(V_\mathrm{SB}\neq 0\), leading to a quite noticeable bulk effect. Once the drain/source potential is sufficiently high for NMOS (low PMOS) the switch resistance will drastically degrade.
In order to get a feeling for the behavior of the MOSFET as a switch the following characterization plots show \(R_\mathrm{on}\) and \(C_\mathrm{off}\) for the NMOS and the PMOS, respectively. Both plots are for \(L = L_\mathrm{min} = 0.13\mu m\), as for switches usually minimum length devices are used. Only for special applications (e.g., the drain-source leakage current in off-mode is a concern) MOSFETs with increased \(L\) are used.
As can be seen in the previous plot, an NMOS can be used to switch at potentials close to \(V_\mathrm{SS}\), while a PMOS is the better choice when switching at potentials close to \(V_\mathrm{DD}\). To construct a switch which can work for all voltage levels between \(V_\mathrm{DD}\) and \(V_\mathrm{SS}\) an NMOS is put in parallel to a PMOS, resulting in the well-known transmission gate.
As the switch on-resistance gets lower when increasing \(W\), the off-capacitance gets larger. Thus, a good performance indicator for comparing switches in a given technology is the \(R_\mathrm{on} C_\mathrm{off}\) product.
When designing switches, often the unwanted effects of charge injection and clock feed-through have to be considered and, as always, a trade-off has to be chosen. More details can be found in literature, e.g, (Carusone, Johns, and Martin 2011) or (Razavi 2017).
4 First Circuit: MOSFET Diode
The first (simple) circuit which we will investigate is a MOSFET, where the gate is shorted with the drain, a so-called MOSFET “diode”, which is shown in Figure 11. This diode is one half of a current mirror, which we will investigate in Section 6.
Why looking at a single-transistor circuit at all? By starting with the simplest possible circuit we can develop important skills in circuit analysis (setting up and calculating a small-signal model, calculating open-loop gain, calculate noise, etc.) and Xschem/ngspice simulation testbench creation. We safely assume that also the Mona Lisa was not Leonardo da Vinci’s first painting, so let’s start slow.
This diode is usually biased by a current source, shown as \(I_\mathrm{bias}\) in the figure. Depending on MOSFET sizing with \(W\) and \(L\), a certain gate-source voltage \(V_\mathrm{GS}\) will develop. This voltage can be used as a biasing voltage for other circuit parts, for example.
It is important to realize that this configuration employs a feedback loop for operation. The voltage at the drain of the MOSFET is sensed by the gate, and the gate voltage changes until \(I_\mathrm{D}\) is exactly equal to \(I_\mathrm{bias}\). In this sense this is probably the smallest feedback circuit one can build.
4.1 MOSFET Diode Sizing
We will now build this circuit in Xschem. For sizing the MOSFET we will use the \(g_\mathrm{m}/I_\mathrm{D}\) methodology introduced in Section 3.
Please build a MOSFET diode circuit in Xschem where you use an LV NMOS, set \(I_\mathrm{bias} = 20\,\mu\text{A}\), \(L = 0.13\,\mu\text{m}\), and we want to use \(g_\mathrm{m}/I_\mathrm{D}= 10\) (often a suitable compromise between transistor speed and \(g_\mathrm{m}\) efficiency).
- Use the figures in Section 3.2 to find out the proper value for \(W\).
- What is the \(f_\mathrm{T}\) for this MOSFET? What is the value for \(g_\mathrm{m}\) and \(g_\mathrm{ds}\)?
- Draw the circuit in Xschem, and simulate the operating point. Do the values match to the values found out before during circuit sizing?
Before continuing, please finish the previous exercise. Once you are done, compare with the below provided solution.
- Using the fact that \(I_\mathrm{bias} = I_\mathrm{D} = 20\,\mu\text{A}\) and \(g_\mathrm{m}/I_\mathrm{D}= 10\) directly provides \(g_\mathrm{m}= 0.2\,\text{mS}\).
- Using the self-gain plot, we see that \(g_\mathrm{m}/g_\mathrm{ds}\approx 21\), so \(g_\mathrm{ds}\approx 9.5\,\mu\text{S}\). The \(f_\mathrm{T}\) can easily be found in the respective plot to be \(f_\mathrm{T} = 23\,\text{GHz}\).
- The \(W\) of the MOSFET we find using the drain current density plot and the given bias current. Rounding to half-microns results in \(W = 1\,\mu\text{m}\).
- Since we are looking at the graphs, we further find \(\gamma = 0.84\), \(V_\mathrm{ds,sat} = 0.18\,\text{V}\), and \(f_\mathrm{co} \approx 15\,\text{MHz}\).
- In addition, we expect \(V_\mathrm{GS}\approx 0.6\,\text{V}\).
An example Jupyter notebook to extract these values accurately you can find here. An Xschem schematic for this exercise is provide as well.
4.2 MOSFET Diode Large-Signal Behavior
As discussed above, the MOSFET diode configuration is essentially a feedback loop. Before we will analyze this loop in small-signal, we want to investigate how this loop settles in the time domain, and by doing this we can observe the large-signal settling behavior. To simulate this, we change the dc bias source from the previous example to a transient current source, which we will turn on after some picoseconds. The resulting Xschem testbench is shown in Figure 12.
When simulating the circuit in Figure 12 another interesting effect can be observed: While the turn-on happens quite rapidly (essentially the bias current source charges the gate capacitance, until the gate-source voltage is large enough that the drain current counteracts the bias current), the turn-off shows a very long settling tail. This is due to the fact that as the gate capacitance is discharged by the drain current, the \(V_\mathrm{GS}\) drops, which in turn reduces the drain current, which will make the discharge even slower. We have an effect similar to a capacitor discharge by a diode (Hellen 2003).
It is thus generally a good idea to add power-down switches to the circuits to disable the circuit quickly by pulling floating nodes to a defined potential (usually \(V_\mathrm{DD}\) or \(V_\mathrm{SS}\)) and to avoid long intermediate states during power down. This will also allow a turn-on from a well-defined off-state.
Exemplary implementations of power-down switches are shown in the Xschem implementation of the improved OTA in Figure 39. Which \(W/L\) ratio should these transistors have? Well, in general, switches often have minimum length to be fast and to have low \(R_\mathrm{on}\). However, if there are no critical specifications (e.g., like power-down time), then these transistors are often used as dummy transistors for other circuit parts and are sized to fit the layout best.
4.3 MOSFET Diode Small-Signal Analysis
We now want to investigate the small-signal behavior of the MOSFET diode. Based on the small-signal model of the MOSFET in Figure 5 we realize that gate and drain are shorted, and we also connect bulk to source. We can thus simplify the circuit to the one shown in Figure 13.
For small-signal analysis we would not need to declare one node as the ground potential. However, when doing so, and selecting the ground node strategically, we can simplify the analysis, as we usually do not formulate KCL for the ground node (as we have only \(N-1\) independent KCL equations, \(N\) being the number of nodes in the circuit), and the potential difference equations are simpler if one node is at \(0\,V\).
For calculating the small-signal impedance of the MOSFET diode we formulate Kirchhoff’s current law (KCL, also Kirchhoff’s first law or Kirchhoff’s junction rule) at the top node to get \[ i_\mathrm{bias} - s C_\mathrm{gs}v_\mathrm{gs}- g_\mathrm{m}v_\mathrm{gs}- g_\mathrm{ds}v_\mathrm{gs}= 0. \]
It follows that \[ Z_\mathrm{diode}(s) = \frac{v_\mathrm{gs}}{i_\mathrm{bias}} = \frac{1}{g_\mathrm{m}+ g_\mathrm{ds}+ s C_\mathrm{gs}}. \tag{5}\]
When neglecting \(g_\mathrm{ds}\), at dc we get \(Z_\mathrm{diode} = 1 / g_\mathrm{m}\), which is an important result and should be memorized.
In circuit analysis it is often algebraically easier to work with admittance instead of impedance, so please remember that Ohm’s law for a conductance is \(I = G \cdot V\), and for a capacitance is \(I = s C \cdot V\). When writing equations, it is also practical to keep \(s C\) together, so we will strive to sort terms accordingly.
Looking at Equation 5 we see that for low frequencies, the diode impedance is resistive, and for high frequencies it becomes capacitive as the gate-source capacitance starts to dominate. The corner frequency of this low-pass can be calculated as \[ \omega_\mathrm{c} = \frac{g_\mathrm{m}+ g_\mathrm{ds}}{C_\mathrm{gs}} \approx \omega_\mathrm{T} \] which is pretty much the transit frequency of the MOSFET!
4.4 MOSFET Diode Stability Analysis
Figure 14 shows a negative feedback system with input \(X(s)\) and output \(Y(s)\), where \(H_\mathrm{ol}(s)\) is the transfer function of the feed-forward path (also called open-loop gain) and \(G(s)\) is the transfer function of the feedback network. The loop-gain is the product of both transfer functions \(T(s) = H_\mathrm{ol}(s) G(s)\) and is used for the stability analysis. The closed-loop gain is defined as \(H_\mathrm{cl}(s) = Y(s) / X(s)\) can be derived with \(Y(s) = H_\mathrm{ol}(s) [X(s) - Y(s) G(s)]\) to be \[ H_\mathrm{cl}(s) = \frac{Y(s)}{X(s)} = \frac{H_\mathrm{ol}(s)}{1 + H_\mathrm{ol}(s) G(s)} = \frac{H_\mathrm{ol}(s)}{1 + T(s)} \tag{6}\]
If the open-loop gain is sufficiently large \(H_\mathrm{ol}(s) \gg 1\), then the closed-loop gain simplifies to \(H_\mathrm{cl}(s) \approx 1 / G(s)\). This result is convenient, since it is independent of \(H_\mathrm{ol}(s)\). Therefore, the overall gain is only set with the feedback gain \(G(s)\) in operational amplifier circuits.
In the case of the MOSFET diode, \(G(s) = 1\) and therefore \(T(s) = H_\mathrm{ol}(s)\) and \(H_\mathrm{cl}(s) \approx 1\). Therefore, this chapter uses open-loop gain and loop-gain as synonyms.
The gain-bandwidth product (GBP or GBWP) or transit frequency \(f_\mathrm{T}\) of a first-order open-loop system is the product of the open-loop dc gain \(H_\mathrm{ol,dc} = H_\mathrm{ol}(f = 0\,\text{Hz})\) and the open-loop \(-3\,\text{dB}\) cut-off frequency \(f_\mathrm{c,ol}\) of \(H_\mathrm{ol}(s)\). \[ \text{GBWP} = f_\mathrm{T,ol} = H_\mathrm{ol,dc} f_\mathrm{c,ol} \]
If a frequency-independent negative feedback \(G\) (e.g., a resistive divider) is applied to this open-loop system, the transit frequency changes to \[ f_\mathrm{T,cl} = f_\mathrm{T,ol} \sqrt{1 - G^2} \] Hence, the closed-loop transit frequency is slightly lower than the open-loop transit frequency \(f_\mathrm{T,cl} < f_\mathrm{T,ol}\).
The closed-loop \(-3\,\text{dB}\) cut-off frequency \(f_\mathrm{c,cl}\) can then be calculated from the open-loop transit frequency and the feedback gain. \[ f_\mathrm{c,cl} = H_\mathrm{ol,DC} f_\mathrm{c,ol} G = f_\mathrm{T,ol} G \]
This theory might be interesting when Middlebrook’s and Tian’s methods for loop gain analysis are later compared in the MOSFET diode testbench (see Figure 16).
The diode-connected MOSFET forms a feedback loop. What is the open-loop gain? For calculating it, we are breaking the loop, and apply a dummy \(C_\mathrm{gs}^{*}\) at the right side to keep the impedances correct. A circuit diagram is shown in Figure 15, we break the loop at the dotted connection. As we can see in this example, it is critically important when breaking up a loop for analysis (also for simulation!) to keep the terminal impedances the same. Only in special cases where the load impedance is very high or the driving impedance is very low is it acceptable to disregard loading effects!
We are now driving (with a voltage source) \(v_\mathrm{in}\) and record the output voltage \(v_\mathrm{out}\) to find the open-loop response \(H_\mathrm{ol}(s)\). By inspecting Figure 15 we see that \[ v_\mathrm{out} = - g_\mathrm{m}v_\mathrm{in} \frac{1}{g_\mathrm{ds}+ s C_\mathrm{gs}}. \]
The open-loop gain \(H_\mathrm{ol}(s)\) is thus \[ H_\mathrm{ol}(s) = \frac{v_\mathrm{out}}{v_\mathrm{in}} = -\frac{g_\mathrm{m}}{g_\mathrm{ds}+ s C_\mathrm{gs}} = - \frac{g_\mathrm{m}}{g_\mathrm{ds}} \frac{1}{1 + s \frac{C_\mathrm{gs}}{g_\mathrm{ds}}}. \tag{7}\]
Inspecting Equation 7 we realize that:
- The dc gain \(g_\mathrm{m}/ g_\mathrm{ds}\) is the self-gain of the MOSFET, so \(20 \log_{10}(0.2 \cdot 10^{-3} / 9.6 \cdot 10 ^{-6}) = 26.4\,\text{dB}\).
- There is a pole at \(\omega_\mathrm{p} = -g_\mathrm{ds}/ C_\mathrm{gs}\), which is at \(9.6 \cdot 10 ^{-6} / (2 \pi \cdot 1.4 \cdot 10^{-15}) = 1.1\,\text{GHz}\).
With this single pole location in \(H_\mathrm{ol}(s)\) this loop is perfectly stable at under all conditions (remember that a single pole results in a maximum phase shift of –90\(^\circ\)).
The question is now how to simulate this open-loop gain, i.e., how to break the loop open in simulation? In general there are various methods, as we can use artificially large (ideal) inductors and capacitors to break loops open and still establish the correct dc operating points for the ac loop analysis. This is called Rosenstark’s method (Rosenstark 1984). However, mimicking the correct loading can be an issue, and requires a lot of careful consideration.
There is an alternative method which breaks the loop open only by adding an ac voltage source in series (thus keeps the dc operating point intact), or injects current using an ac current source. Based on both measurements the open-loop gain can be calculated. This is called Middlebrook’s method (Middlebrook 1975) and is based on double injection, and we will use it for our loop simulations. This method is detailed in Section 15.
There are several other methods like Tian’s method (Tian et al. 2001), for example. A comprehensive overview can be found in (Neag et al. 2015) which describes ten different simulation-based loop gain analysis methods.
We now want to simulate the open-loop transfer function \(H_\mathrm{ol}(s)\) by using Middlebrook’s method and confirm our analysis above.
Please build a simulation testbench in Xschem to simulate the open-loop transfer function of the MOSFET diode. Confirm the dc gain and pole location as given by Equation 7.
If you are getting stuck you can look at this Xschem testbench, shown in Figure 16.
From simulation we see that the open-loop gain is \(24.9\,\text{dB}\) at low frequencies, which matches quite well our prediction of \(26.4\,\text{dB}\). In the Bode plot we see a low-pass with a \(-3\,\text{dB}\) corner frequency of \(1.4\,\text{GHz}\), which again is fairly close to our prediction of \(1.1\,\text{GHz}\).
Keep in mind that the above simulation only verifies the small-signal stability in one certain operating point. If we later look at the stability of an OTA it might be a good idea to verify the small-signal stability in different operating points.
Furthermore, one can apply a step response to the closed-loop system input and estimate the phase margin from the overshoot at the output (see “Automatic Control” lecture). One could also use a step-wise step response to simulate different operating points for a certain time (see “Introduction in Integrated Circuit Design” lecture).
4.5 MOSFET Diode Noise Calculation
As a final exercise on the MOSFET diode circuit we want to calculate the output noise when we consider \(V_\mathrm{GS}\) the output reference voltage which is created when passing a bias current through the MOSFET diode. The bias current we will assume noiseless.
We are going to use the small-signal circuit shown in Figure 17.
As we have already calculated the small-signal diode impedance in Equation 5 we will use this result, and just note that the drain current noise of the MOSFET flows through this impedance. The noise voltage at \(v_\mathrm{gs}\) is thus given as \[ \overline{V_\mathrm{n}^2} = |Z_\mathrm{diode}|^2 \overline{I_\mathrm{n,d}^2}. \]
The drain current noise of the MOSFET is given as (introduced in Section 2.1.2) \[ \overline{I_\mathrm{n,d}^2} = 4 k T \gamma g_\mathrm{m}. \]
For low frequencies (ignoring \(g_\mathrm{ds}\) and \(C_\mathrm{gs}\)) we get \[ \overline{V_\mathrm{n}^2} = |Z_\mathrm{diode}|^2 \overline{I_\mathrm{n,d}^2} = \frac{1}{g_\mathrm{m}^2} 4 k T \gamma g_\mathrm{m}= \frac{4 k T \gamma}{g_\mathrm{m}} \] which is the thermal noise of a resistor of value \(1 / g_\mathrm{m}\) enhanced by the factor \(\gamma\).
We now calculate the full equation, and after a bit of algebra arrive at \[ \overline{V_\mathrm{n}^2}(f) = \frac{4 k T \gamma g_\mathrm{m}}{(g_\mathrm{m}+ g_\mathrm{ds})^2 + (2 \pi f C_\mathrm{gs})^2}. \tag{8}\]
If we are interested in the PSD of the noise then Equation 8 gives us the result. If we are interested in the rms value (the total noise) we need to integrate this equation, using the following identity:
\[ \int_0^\infty {\frac{a}{b^2 + c^2 f^2} df} = \frac{\pi}{2} \frac{a}{b \cdot c} \tag{9}\]
Using the integral help in Equation 9, we can easily transform Equation 8 to \[ V_\mathrm{n,rms}^2 = \int_0^\infty \overline{V_\mathrm{n}^2}(f) df = \frac{k T \gamma g_\mathrm{m}}{(g_\mathrm{m}+ g_\mathrm{ds}) C_\mathrm{gs}}. \tag{10}\]
The form of Equation 10 is the exact solution, but we gain additional insight if we assume that \(g_\mathrm{m}+ g_\mathrm{ds}\approx g_\mathrm{m}\) and then \[ V_\mathrm{n,rms}^2 = \frac{k T \gamma}{C_\mathrm{gs}}. \tag{11}\]
Inspecting Equation 11 we see our familiar \(kT/C\) noise multiplied by the factor \(\gamma\)!
If you have never calculated this before then you should work through the following: Calculate the total output noise of an \(RC\)-lowpass filter. Formulate the transfer function in the Laplace domain, and put the eqivalent resistor noise voltage source at the input, calculate the transfer to the output, and then integrate the output PSD (like we did for the MOSFET diode noise).
You fill find that the output noise is \[ V_\mathrm{n,rms}^2 = \frac{k T}{C} \] which is independent of \(R\)! This is a surprising result, and is the well-known \(kT/C\) noise. Intuitively, we could argue that the noise increases with larger \(R\), but at the same time, the bandwidth decreases and therefore \(R\) does not add additional noise.
Side note: The shortest derivation of this formula involves the equipartition theorem: Any system in thermal equilibrium has an energy of \(kT/2\) per degree of freedom. This \(RC\) system has one degree of freedom in the capacitor, and the stored energy in the capacitor is \(CV_\mathrm{rms}^2/2\). Equating both energies we find that \(V_\mathrm{rms}^2 = kT/C\).
To calculate the total output noise of a generalized passive network Bode’s noise theorem is quite practical (see Section 16.2).
Calculating the rms noise voltage for our MOSFET diode we get \[ \sqrt{V_\mathrm{n,rms}^2} = \sqrt{1.38 \cdot 10^{-23} \cdot 300 \cdot 0.84 / 1.4 \cdot 10^{-15}} = 1.58\,\text{mV}, \] which is a sizeable value! Think about it, can this rms noise voltage be measured with an oscilloscope? If not, why? We run circuits in this technology at \(V_\mathrm{DD}= 1.5\,\mathrm{V}\), which leaves us with a signal swing of ca. \(1.1\,\mathrm{V_{pp}}\) (single-ended), resulting in a dynamic range in this case of \(20 \log_{10} (0.39 / 1.58 \cdot 10^{-3}) \approx 48\,\text{dB}\) assuming a sinusoidal signal. In order to get a feeling which dynamic range is “good”, we can calculate the required dynamic range of a 16-bit audio ADC to be \(6.02 \cdot 16 + 1.76\,\text{dB} = 98.08\,\text{dB} \approx 100\,\text{dB}\). This calculation should make clear that, for example, the correct sizing of the sample&hold capacitor is crucial for low rms noise voltage.
In general, in integrated circuit design, we often have only small parasitic capacitances on many nodes that could sum up to unwanted high noise according to Equation 11. If one wants to lower the noise an increased capacitance could limit the bandwidth (and thus the \(kT/C\) noise).
Remember: Large bandwidth circuits integrate noise over a wide bandwidth resulting in (potentially) considerable rms noise. The way to lower the total noise is to lower the PSD of the noise contributions, which usually requires increased power consumption. So in a nutshell:
Large bandwidth plus small noise equals large power consumption.
Please build a simulation testbench in Xschem to simulate the noise performance of the MOSFET diode, and confirm the rms noise value that we just calculated. Look at the rms value and the PSD of the noise, and play around with the integration limits. What is the effect? Can you see the flicker noise in the PSD? How much is its contribution to the rms noise? What is the value of \(f_\mathrm{co}\), and does it correspond to the above calculated one?
If you are getting stuck you can look at this Xschem testbench, shown in Figure 18.
4.6 Conclusion
In this section we investigated the simple MOSFET-diode circuit. We learned important skills like how to derive a small-signal model and how to calculate important features like noise and open-loop gain for stability analysis. We introduced Middlebrook’s method to have a mechanism to open up loops in simulation (and calculation) without disturbing dc operating points or introduce errors by changing loading conditions.
If you feel that you have not yet mastered these topics or are uncertain in the operation of Xschem or ngspice, please go back to the beginning of the section and read through the theory and redo the exercises.
5 Common-Source Amplifier
We now want to step up our game, and use more components to design something useful. We will use a basic circuit structure, namely a single-ended common-source amplifier. The structure of this circuit, using a resistor as a load, is shown in Figure 19.
The function of this circuit is as follows: Assuming the MOSFET \(M_1\) is kept in saturation, then a small-signal voltage \(v_\mathrm{in}\) applied at the gate is converted into a drain current \(i_\mathrm{d}\) by the MOSFET’s transconductance \(g_\mathrm{m}\). Then, this current is converted into a voltage again in the resistor \(R_1\). Ultimately, we have a dc voltage gain \(A_\mathrm{v}\) of
\[ A_\mathrm{v} = \frac{v_\mathrm{out}}{v_\mathrm{in}} \approx - \frac{g_\mathrm{m}v_\mathrm{in} R_1}{v_\mathrm{in}} = - g_\mathrm{m}R_1. \]
As explained above, it is a good approach to see electronic circuit components as
- voltage-to-current (MOSFET as common-source or common-gate; resistor),
- current-to-voltage (resistor),
- current-to-current (MOSFET as common-gate), and
- voltage-to-voltage (MOSFET as common-drain)
conversions for better understanding.
5.1 Sense Amplifier Driving 50 Ohm Matched Load
Let us now size and design an exemplary implementation of this amplifier (of course using the \(g_\mathrm{m}/I_\mathrm{D}\) method). In order to have useful real-life specifications, we want to build an amplifier which can be used to sense an on-chip voltage and drive off-chip measurement equipment. Often, this equipment has an input impedance of 50\(\Omega\), and we want to have an impedance-matched output. The voltage gain shall be set to 1 (essentially, we want to sense a voltage and drive the measurement equipment).
The resulting circuit is shown in Figure 20. As the load is usually ground-referred, and we want to avoid a dc-block at the output, we use a PMOS amplifier stage (compare with Figure 19).
Please think about why exactly we want this measurement amplifier be based on a PMOS instead of an NMOS.
The power matching requirement at the output mandates that \(R_1 = R_\mathrm{load}\), so \(R'_1 = R_\mathrm{load} \parallel R_1 = 25\,\Omega\). The voltage gain requirement of \(|A_\mathrm{v}| = 1\) results in \(g_\mathrm{m}= 1 / R'_1 = 40\,\text{mS}\).
We now need to find \(W\) and \(L\) of \(M_1\) and calculate the required bias current \(I_\mathrm{D}\). We also need to find the proper \(V_\mathrm{GS}\) to set this current. As usual, we use a Jupyter notebook to calculate these values. Since we require modest speed of this buffer we use a \(g_\mathrm{m}/I_\mathrm{D}=8\) and set \(L = 0.13\,\mu\text{m}\). The notebook is available here. The resulting circuit including all component values is shown at the end of the notebook (the red input capacitor shows the \(C_\mathrm{gg}\) of the MOSFET).
When sizing the MOSFET for this example we found that we need a fairly large \(W\), resulting in a MOSFET aspect ratio of \(W/L \gg 1000\). When constructing an integrated circuit out of individual MOSFET we strive for an overall IC dimension that is roughly quadratic. For MOSFET with large aspect ratios we need to get them into a comfortable shape.
In order to achieve this, we construct the MOSFET out of smaller pieces, and the size of this pieces (called “gate fingers”) are controlled by the parameter ng
. These MOSFET gate fingers all have the same \(L\), but their width is \(W_\mathrm{finger} = W / \text{ng}\). All this individual smaller MOSFET are connected in parallel.
In order to increase the MOSFET model accuracy, often the maximum value of \(W_\mathrm{finger}\) is limited. In the case of SG13G2 \(W_\mathrm{finger} \le 10\,\mu\text{m}\).
In order to construct even larger MOSFET, we can connect multiple MOSFET in parallel. We can do this in the circuit editor by placing and connected these MOSFET; but since this is often used there is a more convenient way: By using the parameter m
(“multiplier” or “multiplicity”) we instantiate \(m\) MOSFET connected in parallel.
When to use ng
and when to use m
? The use of ng
results in a more compact IC layout, and is thus generally preferable. Only in certain instances (e.g., when using a really large \(W\)) m
should be used. Further, the thoughtful use of ng
allows to construct all the NMOS and PMOS of a circuit out of the same gate finger elements. This will result in a very compact layout!
Please go through the sizing notebook of the measurement amplifier and double-check the calculations. Do you agree that the calculations are correct?
Once you agree with the circuit sizing please build an Xschem simulation testbench where you simulate the small-signal voltage gain \(A_\mathrm{v}\) of this measurement amplifier if it is driven with an ideal voltage source. Keep in mind that the maximum MOSFET finger width is 10µm in this technology, so you need to set the parameter ng
accordingly (see Important 1).
- What is the dc gain of this amplifier when loaded with 50\(\Omega\)?
- The dc gain is likely not exactly 0dB. Why is this so?
- Increase the width \(W\) of the PMOS until the gain is correct. What is the \(W\) that you had to set, and how much is \(I_\mathrm{D}\) now?
- What is the bandwidth (i.e., the -3dB corner frequency) of the output voltage, when the voltage source has a source resistance of 1k\(\Omega\)?
If you get stuck, here is the solution to this exercise, and it is also shown in Figure 21.
By now we have designed a measurement amplifier based on a common-source stage. One problem with this stage is the relatively large input capacitance \(C_\mathrm{GG}\) of approx. 0.3pF, which loads the input source. Another issue even more severe is that the fact that the bias point in this circuit is set by the dc voltage level at the input. In general, we want a setup where the bias points of the circuit are largely independent of the dc input voltages. This is why in integrated circuit design we often design differential circuits where the input and output signals are given by the differential voltages, and are largely independent from the common-mode voltages. This is usually an advantage.
6 Current Mirror
In this section we will look into a fundamental building block which is often used in integrated circuit design, the current mirror (R. Widlar 1965). A diagram is shown in Figure 22 with one MOSFET diode converting the incoming bias current into a voltage \(V_\mathrm{GS}= f(I_\mathrm{D1}) = f(I_\mathrm{bias})\), and two output MOSFETs working as current sources, which are biased from the diode. By properly selecting all \(W\) and \(L\) the input current can be scaled, and multiple copies can be created at once. Shown in the figure are two output currents \(I_\mathrm{out1}\) and \(I_\mathrm{out2}\), but any number of parallel branches can be realized (note that this is true for MOSFET as no gate current flows; for the case of BJTs, the base current flowing into the various transistors is subtracted from \(I_\mathrm{bias}\), so usually a compensation circuit is added).
Neglecting the impact of \(g_\mathrm{ds1}\) and \(g_\mathrm{ds2}\), the output current \(I_\mathrm{out1}\) is then given by \[ I_\mathrm{out1} \approx I_\mathrm{bias} \frac{W_2}{L_2} \frac{L_1}{W_1} \] and the output current \(I_\mathrm{out2}\) is given by \[ I_\mathrm{out2} \approx I_\mathrm{bias} \frac{W_3}{L_3} \frac{L_1}{W_1}. \]
For good matching care has to be taken that the MOSFET widths and lengths are constructed out of unit elements of identical size, where an appropriate amount of these single units are then arranged in series or parallel configuration to arrive at the target \(W\) and \(L\) (remember MOSFET parameters ng
and m
, see Important 1).
As we know from earlier investigations of the MOSFET performance in Section 3 the drain current of a MOSFET is a function of \(V_\mathrm{GS}\) and \(V_\mathrm{DS}\). As long as the MOSFET stays in saturation (i.e., \(V_\mathrm{DS}> V_\mathrm{ds,dsat}\)) the drain current is just a mild function of \(V_\mathrm{DS}\) (essentially the effect of \(g_\mathrm{ds}\), which is the output conductance of the MOSFET). A fundamental flaw/limitation of the basic current mirror shown in Figure 22 is the mismatch of the \(V_\mathrm{DS}\) of the MOSFETs. The input-side diode has \(V_\mathrm{GS}= V_\mathrm{DS1}\), whereas the output current sources have a \(V_\mathrm{DS2,3}\) depending on the connected circuitry. Improved current mirrors exist (fixing this flaw), however, when a current mirror is required with mediocre performance requirements this structure can be used for its simplicity.
Please construct a current mirror based on the MOSFET-diode which we sized in Section 4. The input current \(I_\mathrm{bias} = 20\,\mu\text{A}\), and we want three output currents of size \(10\,\mu\text{A}\), \(20\,\mu\text{A}\), and \(40\,\mu\text{A}\).
Sweep the output voltage of all three current branches and see over which voltage range an acceptable current is created. For which output voltage range is the current departing from its ideal value, and why?
You see that the slope of the output current is quite bad, as \(g_\mathrm{ds}\) is too large. We can improve this by changing the length to \(L = 5\,\mu\text{m}\) (for motivation, please look at the graphs in Section 3). In addition, for a current mirror we are not interested in a high \(g_\mathrm{m}/I_\mathrm{D}\) value, so we can use \(g_\mathrm{m}/I_\mathrm{D}= 5\) in this case. Please size the current mirror MOSFETs accordingly (please round the \(W\) to half micron, to keep sizes a bit more practical). Compare this result to the previous one, what changed?
In case you get stuck, here are Xschem schematics for the original and the improved current mirrors.
7 Differential Pair
Like the current mirror in Section 6 the differential pair is an ubiquitous building block often used in integrated circuit design (Blumlein 1937). The fundamental structure is given in Figure 23.
In order to understand its operation it is instructive to separate the input condition into (1) a purely differential voltage, and (2) a common-mode voltage, and see what the impact on the output currents is.
7.1 Differential Operation of the Diffpair
For a differential mode of operation we assume that the input common mode voltage is constant, i.e., \((V_\mathrm{in,p} + V_\mathrm{in,n})/2 = V_\mathrm{CM}\). The differential input voltage \(v_\mathrm{in} = V_\mathrm{in,p} - V_\mathrm{in,n}\), so that \[ V_\mathrm{in,p} = V_\mathrm{CM} + \frac{v_\mathrm{in}}{2} \] and \[ V_\mathrm{in,n} = V_\mathrm{CM} - \frac{v_\mathrm{in}}{2}. \]
For a small-signal differential drive the potential at the tail point stays constant and we can treat it as a virtual ground. The output current on each side is then given by (neglecting \(g_\mathrm{ds}\) and \(g_\mathrm{mb}\) of \(M_1\) and \(M_2\)) \[ i_\mathrm{out,p} = g_\mathrm{m1} \left( \frac{v_\mathrm{in}}{2} \right) \] and \[ i_\mathrm{out,n} = g_\mathrm{m2} \left( -\frac{v_\mathrm{in}}{2} \right). \]
Usually we assume symmetry in the differential pair, so \(g_\mathrm{m1} = g_\mathrm{m2} = g_\mathrm{m}\). The differential output current \(i_\mathrm{out}\) is then given by \[ i_\mathrm{out} = i_\mathrm{out,p} - i_\mathrm{out,n} = g_\mathrm{m}v_\mathrm{in} \tag{12}\]
We see in Equation 12 that the differential output current is simply the differential input voltage multiplied by the \(g_\mathrm{m}\) of the individual transistor. We also note that the bottom conductance \(g_\mathrm{tail}\) plays no role for the small-signal differential operation.
7.2 Common-Mode Operation of the Diffpair
Usually, the source conductance \(g_\mathrm{tail}\) is realized by a current source and ideally should be \(g_\mathrm{tail} = 0\). If this is the case, then the output currents are not a function of the common-mode input voltage (\(I_\mathrm{tail}\) is set by the tail current source), and \[ I_\mathrm{out,p} = I_\mathrm{out,n} = \frac{I_\mathrm{tail}}{2}. \]
However, if we assume a realistic tail current source then \(g_\mathrm{tail} > 0\). For analysis we can simply look at a half circuit since the circuit operation is symmetric. In order to simplify the analysis a bit we remove all capacitors from the MOSFET small-signal model and set \(g_\mathrm{ds}= g_\mathrm{mb}= 0\). We then arrive at the small-signal equivalent circuit shown in Figure 24 (note that we set \(v_\mathrm{in,p} = v_\mathrm{in,n} = v_\mathrm{in}\) and \(i_\mathrm{out,p} = i_\mathrm{out,n} = i_\mathrm{out}\) under symmetry considerations).
Formulating KVL for the input-side loop we get \[ v_\mathrm{in} = v_\mathrm{gs}+ \frac{i_\mathrm{d}}{g_\mathrm{tail}}. \]
With \(i_\mathrm{out} = i_\mathrm{d}= g_\mathrm{m}v_\mathrm{gs}\) we arrive at \[ i_\mathrm{out} = \frac{g_\mathrm{m}g_\mathrm{tail}}{g_\mathrm{m}+ g_\mathrm{tail}} v_\mathrm{in} \tag{13}\]
Interpreting Equation 13 we can distinguish the following extreme cases:
- If \(g_\mathrm{tail} = 0\) (ideal tail current source) then \(i_\mathrm{out} = 0\), the common-mode voltage variation from the input is suppressed and does not show up at the common-mode output current (which is constant due to the ideal tail current source). This is usually the case that we want to achieve.
- If \(g_\mathrm{tail} \rightarrow \infty\) then \(i_\mathrm{out} = g_\mathrm{m}v_\mathrm{in}\), which means the output current is a function of the MOSFET \(g_\mathrm{m}\). If everything is perfectly matched, then the differential output current is zero, but the common-mode output current changes according to the common-mode input voltage. In special cases this can be a wanted behavior, this configuration is called a “pseudo-differential pair.”
Note that the result of Equation 13 is also valid for the general case of a degenerated common-source transistor stage (see Figure 25). The effective transconductange \(g_\mathrm{m}'\) is given by (\(R_\mathrm{degen} = g_\mathrm{tail}^{-1}\))
\[ g_\mathrm{m}' = \frac{i_\mathrm{out}}{v_\mathrm{in}} = \frac{g_\mathrm{m}g_\mathrm{tail}}{g_\mathrm{m}+ g_\mathrm{tail}} = \frac{1}{g_\mathrm{m}^{-1} + R_\mathrm{degen}}. \]
When no degeneration resistor is used, then \(g_\mathrm{m}' = g_\mathrm{m}\). If a degeneration of \(R_\mathrm{degen} \gg g_\mathrm{m}^{-1}\) is used then \(g_\mathrm{m}' = 1 / R_\mathrm{degen}\). This result is worth memorizing.
8 A Basic 5-Transistor OTA
Suited with the insights of basic transistor operation (Section 2 and Section 3) and the working knowledge of the current mirror (Section 4 and Section 6) as well as the differential pair (Section 7) we can now start to design our first real circuit. A fundamental (simple) circuit that is often used for basic tasks is the 5-transistor operational transconductance amplifier (OTA). A circuit diagram of this 5T-OTA is shown in Figure 26.
While we repeat the basics of elementary MOSFET amplifier stages (like common-source stage, common-gate stage, and current mirror) in this course material, the following compendium (Murmann 2013) is recommended for review. It is freely available at https://github.com/bmurmann/Book-on-MOS-stages.
In addition, we can highly recommend these references Razavi (2017) for further study.
The operation is as follows: \(M_{1,2}\) form a differential pair which is biased by the current source \(M_5\). \(M_{5,6}\) form a current mirror, thus the input bias current \(I_\mathrm{bias}\) sets the bias current in the OTA. The differential pair \(M_{1,2}\) is loaded by the current mirror \(M_{3,4}\) which mirrors the drain current of \(M_1\) to the right side. Here, the currents from \(M_4\) and \(M_2\) are summed, and together with the conductance effective at the output node a voltage builds up.
An operational amplifier is a universal electronic building block characterized by (in the ideal case):
- infinite input impedance at the input ports (\(R_\mathrm{in} \rightarrow \infty\))
- zero output resistance, i.e., a voltage output (\(R_\mathrm{out} = 0\))
- infinite voltage gain (\(A_\mathrm{v} = V_\mathrm{out} / V_\mathrm{in} \rightarrow \infty\))
An operational transconductance amplifier is a building block characterized by (again in the ideal case):
- infinite input impedance at the input ports (\(R_\mathrm{in} \rightarrow \infty\))
- infinite output resistance, i.e., a current output (\(R_\mathrm{out} \rightarrow \infty\))
- infinite transconductance (\(G = I_\mathrm{out} / V_\mathrm{in} \rightarrow \infty\))
In integrated circuits, we very often load an OPA/OTA high-ohmic, i.e., with a capacitive load. Hence, an OTA can be used to create a voltage-mode amplifier with high gain, approaching the properties of the OPA. If an OTA is used to drive a low-ohmic load, the voltage gain will be low, and we have to use this block as a transconductance amplifier. Since the output changes behavior depending on high- or low-ohmic loading, we label the output in Figure 26 accordingly.
Why then implement an OTA instead of an OPA? Usually, an OTA is a simpler structure than an OPA. As a general rule, the simplest circuit that can do a job is usually the best choice.
We note that \(M_{1,2}\) and \(M_{3,4}\) need to be symmetric, thus will have the same \(W\) and \(L\) dimensioning. \(M_{5,6}\) we scale accordingly to set the correct bias current in the OTA.
8.1 Voltage Buffer with OTA
In order to design an OTA we need an application, and from this we need to derive the circuit specifications. We want to use this OTA to realize a voltage buffer which lightly loads an input voltage source and can drive a large capacitive load. Such a configuration is often used to, e.g., buffer a reference voltage that is needed (and thus loaded) by another circuit. The block diagram of this configuration is shown in Figure 27.
If the voltage gain of the OTA in Figure 27 is high, then \(V_\mathrm{out} \approx V_\mathrm{in}\). We now want to design an OTA for this application fitting the following specification values (see Table 2). These values are rather typical of what could be expected for such a buffer design.
Specification | Value | Unit |
---|---|---|
Supply voltage | \(1.45 < \underline{1.5} < 1.55\) | V |
Temperature range (industrial) | \(-40 < \underline{27} < 125\) | degC |
Load capacitance \(C_\mathrm{load}\) | \(50\) | fF |
Input voltage range (for buffering 2/3 bandgap voltage) | \(0.7 < \underline{0.8} < 0.9\) | V |
Signal bandwidth (3dB) | \(>10\) | MHz |
Output voltage error | \(<3\) | % |
Total output noise (rms) | \(<1\) | mVrms |
Supply current (as low as possible) | \(<10\) | µA |
Stability | stable for rated \(C_\mathrm{load}\) | |
Turn-on time (settled to with 1%) | \(<10\) | µs |
Externally provided bias current (nominal) | \(20\) | µA |
8.2 Large-Signal Analysis of the OTA
The first step when receiving a design task is to look at the specifications, and see whether they make sense. The detailed performance of the design will be the result of the circuit simulation, but before we step into sizing we need to do a few simple calculations to (a) do back-of-the-envelope gauging if the specification makes sense, and (b) use the derived analytical equations as guide for the sizing procedure.
In terms of large-signal operation, we will now check whether the input and output voltage range, as well as the settling time can be roughly met. Since we do not know yet the resulting \(V_\mathrm{GS}\) of the transistors we use \(0.6\,\text{V}\) as an initial guess. If we run into issues with that guess we know how to later steer the sizing procedure.
- When the input is at its maximum of \(0.9\,\text{V}\), we see that we need to keep \(M_1\) in saturation. We can calculate that \(V_\mathrm{DS1} = V_\mathrm{DD}- |V_\mathrm{GS3}| + V_\mathrm{GS1} - V_\mathrm{in} = 1.45 - 0.6 + 0.6 - 0.9 = 0.55\,\text{V}\), which leaves enough margin.
- When the input is at its minimum of \(0.7\,\text{V}\), we see that the \(V_\mathrm{DS5}\) of \(M_5\) is calculated as \(V_\mathrm{DS5} = V_\mathrm{in} - V_\mathrm{GS1} = 0.7 - 0.6 = 0.1\,\text{V}\), so this leaves little margin, but we can make \(V_\mathrm{GS1}\) smaller, so it should work out.
- For the output voltage, when the output voltage is on the high side, it leaves \(|V_\mathrm{DS4}| = V_\mathrm{DD}- V_\mathrm{out} = 1.45 - 0.9 = 0.55\,\text{V}\), which is enough margin.
In summary, we think that we can make an NMOS-input OTA like the one in Figure 26 work for the required supply and input- and output-voltages. If this would not work out, we need to look for further options, like a PMOS-input OTA, or a NMOS/PMOS-input OTA.
Another large-signal specification item that we can quickly check is the settling time. Under slewing conditions, the complete bias current in the OTA is steered towards the output (try to understand why this is the case), so when the output capacitor is fully discharged, and we assume just a linear ramp due to constant-current charging of the output capacitor, the settling time is \[ T_\mathrm{slew} \approx \frac{C_\mathrm{load} V_\mathrm{out}}{I_\mathrm{tail}} = \frac{50 \cdot 10^{-15} \cdot 1.3}{10 \cdot 10^{-6}} = 6.5\,\text{ns} \] so this leaves plenty of margin for additional slow-signal settling due to the limited bandwidth, as well as reducing the supply current.
The small-signal settling (assuming one pole at the bandwidth corner frequency) leads to an approximate settling time (1% error corresponds to \(\approx 5 \tau\)) of \[ T_\mathrm{slew} \approx \frac{5}{2 \pi f_\mathrm{c}} = \frac{5}{2 \pi \cdot 1 \cdot 10^{6}} = 0.8\,\mu\text{s}. \] which also checks out.
8.3 Small-Signal Analysis of the OTA
In order to size the OTA components we need to derive how the MOSFET parameters define the performance. The important small-signal metrics are
- dc voltage gain \(A_0\)
- gain-bandwidth product (GBW)
- output noise
The specification for GBW is directly given in Table 2, while the dc gain we have to calculate from the voltage accuracy specification. For a voltage follower in the configuration shown in Figure 27 the voltage gain is given by \[ \frac{V_\mathrm{out}}{V_\mathrm{in}} = \frac{A_0}{1 + A_0}. \tag{14}\]
So in order to reach an output voltage accuracy of at least 3% we need a dc gain of \(A_0 > 30.2\,\text{dB}\). To allow for process and temperature variation we need to add a bit of extra gain as margin.
In order to get the correct dc voltage per the specification we require the large-signal gain calculated with Equation 14. However, calculating the large-signal gain of a circuit is quite involved (usually mandating the use of a large-signal nonlinear model for the used components), so we typically resort to do a simpler small-signal calculation instead, like in Section 8.3. We deliberately introduce this error, but we should not get confused about the difference between large- and small-signal operation!
8.3.1 OTA Small-Signal Transfer Function
In order to derive the governing small-signal equations for the OTA we will make a few simplifications:
- We will set \(g_\mathrm{mb}= 0\) for all MOSFETs.
- We will further set \(C_\mathrm{gd}= 0\) for all MOSFETs except for \(M_4\) where we expect a Miller effect on this capacitor, and we could add its effect by increasing the capacitance at the gate node of \(M_{3,4}\) (for background please see Section 16.1). However, as this does not create a dominant pole in this circuit, we consider this a minor effect (see Equation 17). Thus, only \(C_\mathrm{gs34}\) is considered at the gate node of the current mirror load.
- We assume \(g_\mathrm{m}\gg g_\mathrm{ds}\), so we set \(g_\mathrm{ds1} = g_\mathrm{ds3} = 0\).
- The drain capacitance of \(M_2\) and \(M_4\), as well as the gate capacitance of \(M_2\) we can add to the load capacitance \(C_\mathrm{load}\). Note that \(C_\mathrm{gs2}\) can be added because of the feedback connection between the inverting input and the output. However, this is not shown in the small-signal equivalent circuits below, because we are interested in the open-loop transfer function.
The resulting small-signal equivalent circuit is shown in Figure 28.
Please review the MOSFET small-signal equivalent model in Figure 5 at this point. For the PMOS just flip the model upside-down.
We can further simplify the output side by recognizing that the impedance (when looking from the output down) is \(g_\mathrm{ds2}\) in series with \(g_\mathrm{ds5} + g_\mathrm{m12}\). This is a valid simplification as we treat \(M_1\) as a common-gate stage when looking from the output, and since it is loaded by a low impedance of \(g_\mathrm{m34}^{-1}\), we can approximate the impedance looking into the source of \(M_1\) with \(g_\mathrm{m12}^{-1}\). With the approximation that \(g_\mathrm{m}\gg g_\mathrm{ds}\) the parallel connection of \(g_\mathrm{m12}\) and \(g_\mathrm{ds5}\) is dominated by \(g_\mathrm{m12}\) (\(g_\mathrm{m12} + g_\mathrm{ds5} \approx g_\mathrm{m12}\)) and its series connection with \(g_\mathrm{ds2}\) (\(g_\mathrm{m12}^{-1} + g_\mathrm{ds2}^{-1} \approx g_\mathrm{ds2}^{-1}\)), resulting in the fact that we can ground the lower end of \(g_\mathrm{ds2}\). You should probably stop here and read this paragraph again, slowly.
Therefore, we can move \(g_\mathrm{ds2} + g_\mathrm{ds4}\) in parallel to \(C_\mathrm{load}\). Further, assuming a differential drive with a virtual ground at the tailpoint we can remove \(g_\mathrm{ds5}\). The current source \(g_\mathrm{m34} v_\mathrm{gs34}\) is replaced with the equivalent conductance \(g_\mathrm{m34}\). All the previous steps result in the further simplified equivalent circuit shown in Figure 29.
In the simplified circuit model in Figure 29 we can see that we have two poles in the circuit, one at the gate note of \(M_{3,4}\), and one at the output. Realizing that \(v_\mathrm{in,p} = v_\mathrm{in}/2\) and \(v_\mathrm{in,n} = - v_\mathrm{in}/2\) we can formulate KCL at the output node to \[ -g_\mathrm{m34} V_\mathrm{gs34} - \left( -g_\mathrm{m12} \frac{V_\mathrm{in}}{2} \right) - V_\mathrm{out} (g_\mathrm{ds2} + g_\mathrm{ds4} + s C_\mathrm{load}) = 0. \tag{15}\] We further realize that \[ V_\mathrm{gs34} = -g_\mathrm{m12} \frac{V_\mathrm{in}}{2} \frac{1}{g_\mathrm{m34} + s C_\mathrm{gs34}}. \tag{16}\]
By combining Equation 15 and Equation 16 and after a bit of algebraic manipulation we arrive at \[ A(s) = \frac{V_\mathrm{out}(s)}{V_\mathrm{in}(s)} = \frac{g_\mathrm{m12}}{2} \frac{2 g_\mathrm{m34} + s C_\mathrm{gs34}}{(g_\mathrm{m34} + s C_\mathrm{gs34}) (g_\mathrm{ds2} + g_\mathrm{ds4} + s C_\mathrm{load})}. \tag{17}\]
When we now inspect Equation 17 we can see that for low frequencies the (open-loop) gain is \[ A(s \rightarrow 0) = A_0 = \frac{g_\mathrm{m12}}{g_\mathrm{ds2} + g_\mathrm{ds4}} \tag{18}\] which is plausible, and confirms the requirement of a high impedance at the output node. For very large frequencies we get \[ A (s \gg) = \frac{g_\mathrm{m12}}{2 s C_\mathrm{load}} \tag{19}\] which is essentially the behavior of an integrator, and we can use Equation 19 to calculate the frequency where the gain drops to \(|A(s_\mathrm{ug})| = 1 = g_\mathrm{m12}/|2 j \omega_\mathrm{ug} C_\mathrm{load}|\): \[ f_\mathrm{ug} = \frac{g_\mathrm{m12}}{4 \pi C_\mathrm{load}} \]
When looking at Equation 17 we see that we have a dominant pole at \(s_\mathrm{p}\) and a pole-zero doublet with \(s_\mathrm{pd}\)/\(s_\mathrm{zd}\): \[ s_\mathrm{p} = -\frac{g_\mathrm{ds2} + g_\mathrm{ds4}}{C_\mathrm{load}} \tag{20}\] \[ s_\mathrm{pd} = -\frac{g_\mathrm{m34}}{C_\mathrm{gs34}} \tag{21}\] \[ s_\mathrm{zd} = -\frac{2 g_\mathrm{m34}}{C_\mathrm{gs34}} \tag{22}\]
Looking at Equation 21 and Equation 22 we see that this pair is intimately linked by the same parameters and can only move together. Hence we call it a “doublet.”
8.3.2 OTA Noise
For the noise analysis we ignore the pole-zero doublet due to \(C_\mathrm{gs34}\) (we assume minor impact due to this) and just consider the dominant pole given by Equation 20. For the noise analysis at the output we set the input signal to zero, and thus we arrive at the simplified small-signal circuit shown in Figure 30.
We see that \[ \overline{V_\mathrm{gs34}^2} = \frac{1}{g_\mathrm{m34}^2} \left( \overline{I_\mathrm{n1}^2} + \overline{I_\mathrm{n3}^2} \right). \]
Remember that uncorrelated noise quantities need to be power-summed (i.e., \(I^2 = I_1^2 + I_2^2\))!
We can then sum the output noise current \(\overline{I_\mathrm{n}}\) as \[ \overline{I_\mathrm{n}^2} = \overline{I_\mathrm{n2}^2} + \overline{I_\mathrm{n4}^2} + g_\mathrm{m34}^2 \frac{1}{g_\mathrm{m34}^2} \left( \overline{I_\mathrm{n1}^2} + \overline{I_\mathrm{n3}^2} \right) = 2 \left( \overline{I_\mathrm{n12}^2} + \overline{I_\mathrm{n34}^2} \right). \]
As a next step, let us rewrite the OTA transfer function \(A(s)\) (see Equation 17) by getting rid of the pole-zero doublet as a simplifying assumption to get \[ A'(s) = \frac{g_\mathrm{m12}}{g_\mathrm{ds2} + g_\mathrm{ds4} + s C_\mathrm{load}}. \tag{23}\]
Inspecting Equation 23 we can interpret the OTA transfer function as a transconductor \(g_\mathrm{m12}\) driving a load of \(Y_\mathrm{load} = g_\mathrm{ds2} + g_\mathrm{ds4} + s C_\mathrm{load}\). We can thus redraw Figure 27 in the following way, injecting the previously calculated noise current into the output node. The result is shown in Figure 32.
First we short the input terminal to ground and then we connect a current source \(I_\mathrm{out}\) at the output terminal, see Figure 31. Since we can neglect the gate leakage current into the inverting input terminal of the OTA, KCL at the output node is simply: \[ I_\mathrm{out} + g_\mathrm{m12}\left(-V_\mathrm{out}\right) = 0 \] Thus, the output impedance is easily calculated. \[ Z_\mathrm{out} = \frac{V_\mathrm{out}}{I_\mathrm{out}} = \frac{V_\mathrm{out}}{g_\mathrm{m12}V_\mathrm{out}} = \frac{1}{g_\mathrm{m12}} \]
We see that the feedback around the transconductor \(g_\mathrm{m12}\) creates an impedance of \(1/g_\mathrm{m12}\). We can now calculate the effective load conductance of \[ Y'_\mathrm{load} = g_\mathrm{ds2} + g_\mathrm{ds4} + s C_\mathrm{load} + g_\mathrm{m12} \approx g_\mathrm{m12} + s C_\mathrm{load}. \tag{24}\]
The output noise voltage is then (using Equation 1) \[ \overline{V_\mathrm{n,out}^2}(f) = \frac{\overline{I_\mathrm{n}^2}}{|Y'_\mathrm{load}|^2} = \frac{\overline{I_\mathrm{n}^2}}{g_\mathrm{m12}^2 + (2 \pi f C_\mathrm{load})2^2} = \frac{8 k T (\gamma_{12} g_\mathrm{m12} + \gamma_{34} g_\mathrm{m34})}{g_\mathrm{m12}^2 + (2 \pi f C_\mathrm{load})^2}. \]
We can use the identity Equation 9 to calculate the rms output noise to \[ V_\mathrm{n,out,rms}^2 = \int_0^\infty \overline{V_\mathrm{n,out}^2}(f) df = \frac{k T}{C_\mathrm{load}} \left( 2 \gamma_{12} + 2 \gamma_{34} \frac{g_\mathrm{m34}}{g_\mathrm{m12}} \right). \tag{25}\]
Inspecting Equation 25 we can see that the integrated output noise is the \(k T / C\) noise of the output load capacitor, enhanced by the \(\gamma_{12}\) of the input differential pair, plus a (smaller) contribution of the current mirror load \(M_{3,4}\). Intuitively, this result makes sense.
Please take your time and carefully go through the explanations and derivations for the 5-transistor-OTA in Section 8.2 and Section 8.3. Try to do the calculations yourself; if you get stuck, review the previous chapters.
8.4 5T-OTA Sizing
Outfitted with the governing equations derived in Section 8.3 we can now size the MOSFETs in the OTA, we remember that we have to size \(M_{1,2}\) and \(M_{3,4}\) equally.
First, we need to select a proper \(g_\mathrm{m}/I_\mathrm{D}\) for the MOSFET. Remembering Section 3 we see that for the input differential pair we should go for a large \(g_\mathrm{m}\), thus we select a \(g_\mathrm{m}/I_\mathrm{D}= 10\). As \(g_\mathrm{ds}\) of \(M_2\) could limit the dc gain (Equation 18) we go with a rather long \(L = 5\,\mu\text{m}\). For current sources a small \(g_\mathrm{m}/I_\mathrm{D}\) is a good idea, so we start with \(g_\mathrm{m}/I_\mathrm{D}=5\) (because we can not go too low because of \(V_\mathrm{ds,sat}\)) and also an \(L = 5\,\mu\text{m}\). The \(g_\mathrm{m}/I_\mathrm{D}\) is also useful to estimate the required drain-source voltage to keep a MOSFET in saturation (i.e., keep the \(g_\mathrm{ds}\) small) with this approximate relationship:
\[ V_\mathrm{ds,sat} = \frac{2}{g_\mathrm{m}/I_\mathrm{D}} \tag{26}\]
Please size the 5T-OTA according to the previous \(g_\mathrm{m}/I_\mathrm{D}\) and \(L\) suggestions. Please calculate the \(W\) of \(M_{1-6}\) and the total supply current. Please check wether gain error, total output noise, and turn-on settling is met with the calculated devices sizes and bias currents.
The sizing procedure and its calculation are best performed in a Jupyter notebook, as we can easily look up the exact data from the pre-computed tables:
8.5 5T-OTA Simulation
With the initial sizing of the MOSFETs of the 5T-OTA done, we can design the 5T-OTA circuit and setup a simulation testbench to check the performance parameters. Since this is the first time we draw a more complex schematic, and use a hierarchical design, we should note that drawing a schematic is an art, and there exists a set of rules and recommendations how to name pins, how to use annotations, and so on. Please read Section 21 before you start into your design work.
Please design the circuit of the 5T-OTA. Put the OTA circuit in a separate schematic, create a symbol for it, and use this symbol in a testbench you create in Xschem for this 5T-OTA used as a voltage buffer as shown in Figure 27. Use typical conditions for the simulation, and check how well the specification in Table 2 is met, and how well the derivations in Section 8.2 and Section 8.3 fit to the simulation results.
If you get stuck, you can find the testbench and 5T-OTA schematic here (for the small-signal analysis) and here (for the large-signal settling simulation). For interested students, the loop gain analysis with Middlebrook’s and Tian’s method of the 5T-OTA can be found here.
8.6 5T-OTA Simulation versus PVT
As you have seen in Section 8.5 running simulations by hand is tedious. When we want to check the overall performance, we have to run many simulations over various conditions:
- The supply voltage of the circuit has tolerances, and thus we need to check the performance against this variation.
- The temperature at which the circuit is operated is likely changing. Also the performance against this has to be verified.
- When manufacturing the wafers random variations in various process parameters lead to changed parameters of the integrated circuit components. In order to check for this effect, wafer foundries provide model files which shall cover these manufacturing excursions. Simplified, this leads to a slower or faster MOSFET, and usually NMOS and PMOS are not correlated, so we have the process corners SS, SF, TT, FS, and FF. So far, we have only used the TT models in our simulations.
The variations listed in the previous list are abbreviated as PVT (process, voltage, temperature) variations. In order to finalize a circuit all combinations of these (plus the variations in operating conditions like input voltage) have to be simulated. As you can imagine, this leads to a huge number of simulations, and simulation results which have to be evaluated for pass/fail.
There are two options how to tackle this efficiently:
- As an experienced designer you have a very solid understanding of the circuit, plus based on the analytic equations you can identify which combination of operating conditions will lead to a worst case performance. Thus, you can drastically reduce the number of corners to simulate, and you run them by hand.
- You are using a framework which highly automates this task of running a plethora of different simulations and evaluating the outcome. These frameworks are called simulation runners.
Luckily, there are open-source versions of simulation runners available, and we will use CACE in this lecture. CACE is written in Python and allows to setup a datasheet in YAML which defines the simulation problem and the performance parameters to evaluate against which limits. The resulting simulations are then run in parallel and the simulation data is evaluated and summarized in various forms.
There is a CACE setup available for our 5T-OTA. The datasheet describes the operating conditions and the simulations tasks. For each simulation a testbench template is needed, this one is used for ac simulations, this one is used for noise simulation, and this one is used for transient simulation.
After a successful run, a documentation is automatically generated. The result of a full run of this OTA design is presented here:
CACE Summary for ota-5t
netlist source: schematic
Parameter | Tool | Result | Min Limit | Min Value | Typ Target | Typ Value | Max Limit | Max Value | Status |
---|---|---|---|---|---|---|---|---|---|
Output voltage ratio | ngspice | gain | 0.97 V/V | 0.987 V/V | any | 1.000 V/V | 1.03 V/V | 1.006 V/V | Pass ✅ |
Bandwidth | ngspice | bw | 10e6 Hz | 15551000.000 Hz | any | 26912100.000 Hz | any | 34051700.000 Hz | Pass ✅ |
Output noise | ngspice | noise | any | 0.308 mV | any | 0.371 mV | 1 mV | 0.455 mV | Pass ✅ |
Settling time | ngspice | tsettle | any | 0.135 us | any | 0.142 us | 10 us | 0.155 us | Pass ✅ |
Plots
gain_vs_temp
gain_vs_vin
gain_vs_vdd
gain_vs_corner
bw_vs_temp
bw_vs_vin
bw_vs_vdd
bw_vs_corner
noise_vs_temp
noise_vs_vin
noise_vs_vdd
noise_vs_corner
settling_vs_temp
settling_vs_vin
settling_vs_vdd
settling_vs_corner
8.6.1 PVT Simulation Analysis
Looking at the CACE report in Note 2 we see that (luckily) the specification is met for all parameters. This is great news! We now have a design that we carefully simulated across PVT and other corners, and which is ready for layout. Once we have the layout ready, we can extract the wiring parasitic (\(R\) and \(C\)) as well as other layout-dependent effects like well proximity. Using this augmented netlist we can then again use CACE to check performance across conditions and parameter variations, and if we still pass all specification points then our design is finished.
9 Cascode Stage
As we have seen in Section 8 the performance of the OTA is generally quite acceptable (see Table 2), but we might want to aim for better output voltage accuracy. As our analysis has shown the output voltage tolerance is limited by the open-loop dc gain \(A_0\) of the OTA (see Equation 14), which in turn is limited by the output conductance of \(M_2\) and \(M_4\) in Figure 26, which is also confirmed by the analytical result in Equation 18.
During the sizing procedure we have seen that the achievable \(g_\mathrm{m}/ g_\mathrm{ds}\) ratio of a single MOSFET is limited, even if we increase \(L\). We are thus searching for a better option, and here (local) feedback in form of a cascode comes to help.
For analysis of a cascode, we use the following single-transistor stage shown in Figure 33.
In order to derive the operation of the cascode analytically, we draw the small-signal equivalent circuit in Figure 34. We assume that \(V_\mathrm{B}\) is a low-ohmic bias voltage, thus we replace it by ac ground. We further set \(g_\mathrm{mb}= 0\).
Since the gate is assumed at a fixed potential, we can put \(C_\mathrm{gs}\) in parallel to \(G_\mathrm{S}\) as \(G_\mathrm{S}^{*} = G_\mathrm{S} + s C_\mathrm{gs}\), and we can put \(C_\mathrm{gd}\) in parallel to \(G_\mathrm{D}\) as \(G_\mathrm{D}^{*} = G_\mathrm{D} + s C_\mathrm{gd}\). As a result we will disregard these capacitors for now, and just consider \(G_\mathrm{S}\) and \(G_\mathrm{D}\).
9.1 Cascode Output Impedance
As a first step, we want to calculate the output impedance at the drain of the MOSFET (i.e., looking into the drain). For this, we replace \(G_\mathrm{D}\) with a current source. The resulting small-signal equivalent circuit is shown in Figure 36.
We realize that \(i_\mathrm{out}\) flows through \(G_\mathrm{S}\) and drops \(v_\mathrm{gs}\) (note the sign): \[ v_\mathrm{gs}= -\frac{i_\mathrm{out}}{G_\mathrm{S}} \]
Further, \(v_\mathrm{out} = -v_\mathrm{gs}+ v_\mathrm{ds}\). Calculating KCL at the output node results in \[ i_\mathrm{out} - g_\mathrm{m}v_\mathrm{gs}- g_\mathrm{ds}v_\mathrm{ds}= 0. \]
Using the previously found identities, and after a bit of algebraic manipulations we arrive at \[ g_\mathrm{out} = \frac{i_\mathrm{out}}{v_\mathrm{out}} = \frac{g_\mathrm{ds}}{1 + \frac{g_\mathrm{m}+ g_\mathrm{ds}}{G_\mathrm{S}}} = \frac{g_\mathrm{ds}\cdot G_\mathrm{S}}{G_\mathrm{S} + g_\mathrm{m}+ g_\mathrm{ds}} \tag{27}\]
We find that if \(G_\mathrm{S} = 0\) (an open) then \(g_\mathrm{out} = 0\), and if \(G_\mathrm{S} = \infty\) (a short) then \(g_\mathrm{out} = g_\mathrm{ds}\). We can calculate the benefits of a cascode if we assume we put a cascode on top of a common-source transistor stage (thus \(G_\mathrm{S} = g_\mathrm{ds}'\)) and get \[ g_\mathrm{out} = \frac{g_\mathrm{ds}\cdot g_\mathrm{ds}'}{g_\mathrm{ds}' + g_\mathrm{m}+ g_\mathrm{ds}} \approx g_\mathrm{ds}' \frac{g_\mathrm{ds}}{g_\mathrm{m}}. \tag{28}\]
The output impedance of the lower MOSFET (\(g_\mathrm{ds}'\)) is reduced by the self-gain of the cascode transistor! This is a powerful technique to increase the output impedance of a transistor stage by cascoding, much better than increasing \(L\).
9.2 Cascode Input Impedance
To calculate the input impedance of a cascode (i.e., looking into the source) we replace \(G_\mathrm{S}\) with a current source. The resulting small-signal equivalent circuit is shown in Figure 37.
We note that \(v_\mathrm{gs}= -v_\mathrm{in}\) and that \(i_\mathrm{in}\) flows through \(G_\mathrm{D}\), resulting in \(v_\mathrm{D} = i_\mathrm{in} / G_\mathrm{D}\). Note that \(v_\mathrm{ds}= v_\mathrm{D} - v_\mathrm{in}\). Formulating KCL at the input node results in \[ i_\mathrm{in} + g_\mathrm{ds}v_\mathrm{ds}+ g_\mathrm{m}v_\mathrm{gs}= 0. \]
After some manipulation we find that \[ g_\mathrm{in} = \frac{i_\mathrm{in}}{v_\mathrm{in}} = \frac{(g_\mathrm{m}+ g_\mathrm{ds}) \cdot G_\mathrm{D}}{g_\mathrm{ds}+ G_\mathrm{D}}. \tag{29}\]
Setting \(G_\mathrm{D} = 0\) (an open) results in \(g_\mathrm{in} = 0\) as well, so the input impedance of the cascode is very large when the drain impedance is large.
However, setting \(G_\mathrm{D} = \infty\) (a short or low-ohmic impedance) results in the well-known result of \(g_\mathrm{in} = g_\mathrm{m}+ g_\mathrm{ds}\approx g_\mathrm{m}\), which means that the input impedance looking into a cascode is approximately \(1/g_\mathrm{m}\).
This has the practical benefit that a capacitance connected at this node results in a high-frequency pole, which is often not critical in terms of stability. Further, the voltage swing at a cascode input node is small due to the often small impedance, and this minimizes the Miller effect at connected inter-node capacitors (see Section 16.1).
10 Improved OTA
With the new learned know-how of the cascode stage we can set out to improve our original basic 5T-OTA design. Essentially this means to add cascodes to \(M_2\) and \(M_4\) in Figure 26. For symmetry reasons we will add cascodes to both sides, and the resulting schematic is shown in Figure 38.
The transistor name appendix “C” indicates a cascode device sitting atop its base transistor. The bias voltage \(V_\mathrm{bias1}\) is referenced to \(V_\mathrm{SS}\), the bias voltage \(V_\mathrm{bias3}\) is referenced to \(V_\mathrm{DD}\), and the floating bias voltage \(V_\mathrm{bias2}\) creates a voltage bias for \(M_\mathrm{1C}\) and \(M_\mathrm{2C}\) relative to the tail point, so that the \(V_\mathrm{DS}\) of \(M_{1,2}\) stays constant with a changing common-mode input voltage.
It is critically import for a stable performance across PVT that the bias voltages for the cascode gates are created in a manner that tracks variations with process, temperature, and supply voltage!
The current mirrors constructed out of \(M_\mathrm{5/5C,6/6C}\) and \(M_\mathrm{3/3C,4/4C}\) are a special kind of cascode current mirror for low-voltage operation, also referred to as high-swing cascode current mirror (Jespers and Murmann 2017). This type is very often used, as it forces the \(V_\mathrm{GS}\) and \(V_\mathrm{DS}\) of \(M_{5,6}\) (and \(M_{3,4}\)) to be equal, so the current mirror ratio is independent of \(g_\mathrm{ds}\).
Try to verify the above statement of equal drain-source voltages by deriving both, an equation for \(V_\mathrm{DS5}\) assuming a high-swing cascode current mirror (Figure 38) and \(V_\mathrm{DS5}\) in case of a simple cascode current mirror, where the reference branch \((M_\mathrm{6/6C})\) is comprised of two mosfet diodes.
Further, by properly selecting the bias voltages of the cascode a low-voltage operation is achieved as \(V_\mathrm{DS}\) can be minimized, allowing even triode operation of the current-mirror MOSFETs (as, noted above, a large \(g_\mathrm{ds}\) is not a big issue).
A simplified small-signal gain calculation of this improved OTA uses the result of Equation 18 and Equation 28 to arrive at the approximate dc gain of \[ A_0 \approx \frac{g_\mathrm{m12}}{g_\mathrm{ds2} \frac{g_\mathrm{ds2C}}{g_\mathrm{m2C}} + g_\mathrm{ds4} \frac{g_\mathrm{ds4C}}{g_\mathrm{m4C}}} \tag{30}\] leading to a significant boost in dc gain due to cascoding. We will use this increased gain to reduce the \(L\) of all MOSFET to
- save area (smaller \(L\) will lead to smaller \(W\) for a given \(W/L\) ratio) and
- will push the additional poles and zeros at the inner nodes of the cascode transistors (e.g., the connection of the drain of \(M_5\) to the source of \(M_\mathrm{5C}\)) to higher frequencies to result in stable behavior and a reasonable gain transfer function (too many poles and zeros in the pass band of the amplifier create many issues with stability margin).
10.1 Sizing the Improved OTA
Like the sizing of the 5T-OTA in Section 8.4 we will again use the \(g_\mathrm{m}/I_\mathrm{D}\) method using a Python notebook. Instead of using \(L = 5\,\mu\text{m}\) we will this time use a reduced \(L = 0.5\,\mu\text{m}\) for \(M_\mathrm{1/1C,2/2C,3/3C,4/4C}\) (for speed reasons) and \(L = 1\,\mu\text{m}\) for \(M_\mathrm{5/5C,6/6C}\) for better common-mode rejection (the tail current mirror is less critical in terms of speed and stability).
We set \(g_\mathrm{m}/I_\mathrm{D}= 13\) across the board for a good trade-off between speed, current efficiency, and voltage headroom for the MOSFETs (this is now way more critical than in the basic 5T-OTA as we stack now double as many MOSFET at the same supply voltage). Please look at Section 3 to confirm this choice.
Looking at this sizing result we see that we achieve an improved \(A0 > 43\,\text{dB}\) while meeting also the other performance requirements of Table 2 with margin. In addition, we check the voltage headroom of the critical MOSFET to see if we can squeeze it into the available supply voltage range, and see that this is possible with our above choice selection of parameters.
Please take a detailed look at the above sizing notebook and play with the numbers and calculations. Do you find a better trade-off for the input parameters? Can you understand the ratio behind the choices and calculations?
10.2 Designing the Improved OTA
Based on the collected experience in this lecture and the result of the sizing procedure in Section 10.1 you should be able to design this OTA. If you want, please go ahead and try an implementation and check its performance with CACE.
As an alternative there is a prepared OTA design shown in Figure 39 which we will discuss in detail next.
10.2.1 Discussion of the OTA Design
We will now do an analysis of the circuit design of the OTA including all the complications which make this design practical.
- For easier navigation, the device identifier are consistent with the circuit sketch in Figure 38.
- Some MOSFET dimensions are rounded to make a better fit in IC layout. Please also look carefully at \(W\), \(L\), and \(\mathrm{ng}\). The parameter \(\mathrm{ng}\) sets how the total \(W\) of a MOSFET should be split into individual MOSFET fingers with \(W_\mathrm{f} = W / \mathrm{ng}\). This is done to arrive at a suitably sized MOSFET physical implementation. As we will not deal with IC layout in this lecture we will leave it at that.
- In order to allow good matching in the IC layout, MOSFETs (and other components) have to be constructed from equal pieces. To that end, \(W/L\) scaling is done using unit elements (see finger width \(W_\mathrm{f}\)). Sometimes, besides \(W\) the length \(L\) has to be scaled, and this leads to the oddly-looking series stacking of some MOSFET (easily recognizable by the connected gates). In order to increase circuit readability, a subcircuit could be constructed hiding this series stacking of MOSFET, but it is sometimes easier to avoid subcircuits. There is a fine line in this trade, sometime a depth of 4 is the decision point between subcircuit use/no-use.
- As you can (hopefully) see the circuit is carefully drawn to ease readability. Important nets are named, text comments state certain properties like nominal voltage levels, bias currents, etc. Current sensing elements are added to directly see the dc currents in the circuit simulation.
- The bias voltage generation for the cascodes is included as well. The voltage drop for the bottom transistors is developed across a resistor (this is a simple but effective way, but other implementations are possible as well). We are using a dummy branch for bias generation (constructed with \(M_\mathrm{7/7C,8/8C}\) and \(R_3\)).
- The floating bias voltage \(V_\mathrm{bias2}\) is created by implementing a current source from \(V_\mathrm{DD}\) (\(M_\mathrm{9/9C}\)), then a MOSFET diode \(M_{11}\) with resistor \(R_4\), and an additional current source towards \(V_\mathrm{SS}\) (\(M_\mathrm{10/10C}\)). Instead of using the bottom current source the current in the tail current source \(M_\mathrm{5/5C}\) could have been increased, but matching is better with the chosen approach.
- Power-down transistors \(M_{\mathrm{pd,}x}\) are added to allow a proper shutdown of the circuit with a digital enable input. It is generally a good idea to clamp floating nodes in off-mode so that no issues during power-down (like increased leakage currents) or delayed startup or shutdown are occurring. It is further a good design principle to buffer all incoming digital signals with inverters connected to the local supply. This lowers the risk of unwanted noise coupling or excessive slew rates on the incoming digital signals.
- Sensitive bias nodes are buffered with decoupling capacitors. We are using MOSFET as nonlinear capacitors, which is not an issue in this application, but we value the increased capacitive density. Please note how the MOSFET are connected (to \(V_\mathrm{DD}\)? or to \(V_\mathrm{SS}\)?).
The resistor used in this circuit are subcircuits to allow series connection of unit resistor elements. The schematic of one element is shown in Figure 40. It is using an effective method to create a series string of connected resistors using wire bundles. Try to understand the circuit, consult the Xschem manual, and look at the resulting SPICE netlist to confirm your finding.
Note that a parallel connection of devices is effectively possible using the multiplier notation of Xschem.
10.3 Simulation of Improved OTA
Now that the circuit design of the improved OTA is done, we can use the same simulation testbenches as for the basic OTA. The testbenches are shown in Figure 41, Figure 42, and Figure 43.
Please use the above testbenches to simulate the improved OTA:
- Check the dc bias points. Are they good? How stable are they across PVT variations?
- What are the small-signal parameters like gain, noise and bandwidth? Are they fitting the specification?
- What is large-signal performance? Is the settling fast enough? Is the settling well behaved, i.e., are there overshoots or other strange ringing indicating potential stability issues?
- Try to improve the design. Change various device parameters and see what happens. Whenever you change something, check the dc operating point first. If the dc operating point is not good no further simulations make sense.
10.4 Corner Simulation of Improved OTA
Just like for the basic OTA we use the CACE system to check the performance of the improved OTA design holistically across variations like PVT and input signal variations. The results of the CACE run are shown below in Note 3.
CACE Summary for ota-improved
netlist source: schematic
Parameter | Tool | Result | Min Limit | Min Value | Typ Target | Typ Value | Max Limit | Max Value | Status |
---|---|---|---|---|---|---|---|---|---|
Output voltage ratio | ngspice | gain | 0.99 V/V | 1.000 V/V | any | 1.001 V/V | 1.01 V/V | 1.009 V/V | Pass ✅ |
Bandwidth | ngspice | bw | 10e6 Hz | 107908000.000 Hz | any | 226025000.000 Hz | any | 292975000.000 Hz | Pass ✅ |
Output noise | ngspice | noise | any | 0.346 mV | any | 0.407 mV | 1 mV | 0.497 mV | Pass ✅ |
Settling time | ngspice | tsettle | any | 0.196 us | any | 0.212 us | 5 us | 0.226 us | Pass ✅ |
Plots
gain_vs_temp
gain_vs_vin
gain_vs_vdd
gain_vs_corner
bw_vs_temp
bw_vs_vin
bw_vs_vdd
bw_vs_corner
noise_vs_temp
noise_vs_vin
noise_vs_vdd
noise_vs_corner
settling_vs_temp
settling_vs_vin
settling_vs_vdd
settling_vs_corner
The improved performance allows to improve the specifications in a few important points, notably the output voltage tolerance which is an important metric for a reference voltage buffer. We have intentionally increased the power consumption a little bit, but we negotiated with the chip lead designer a changed bias current level, so overall the situation is even slightly improved. The new situation with the improved design is summarized in Table 3 (unchanged entries are not shown).
Specification | Basic 5T-OTA | Improved OTA | Unit |
---|---|---|---|
Output voltage error | \(<3\) | \(<1\) | % |
Total output noise (rms) | \(<1\) | \(<0.5\) | mVrms |
Supply current (as low as possible) | \(<10\) | \(<20\) | µA |
Turn-on time (settled to with 1%) | \(<10\) | \(<1\) | µs |
Externally provided bias current (nominal) | \(20\) | \(5\) | µA |
11 Biasing
So far, we have pushed one unresolved issue in front of us. We have studied ways to multiply and scale bias currents using current mirrors (see Section 6) and we have also found ways to buffer (and scale) bias voltages (see Section 8.1, scaling can be readily achieved by using a resistive divider in the feedback path). We can generate bias voltages by running a current through a resistor. However, what has not been discussed so far is how to generate one stable bias current in the first place.
If we would have a stable reference voltage, then we can use the arrangement shown in Figure 44 to generate a bias current that is given by (note the use of feedback using an error amplifier which could be a simple OTA)
\[ I_\mathrm{bias} = \frac{V_\mathrm{ref}}{R_1}. \]
Using \(A_1\) and \(M_1\) the voltage \(V_\mathrm{ref}\) is regulated across \(R_1\) and available at the drain of \(M_1\).
The resistor \(R_1\) will show variation with the process tolerance, but we can either (a) use an external resistor for a precise current generation, or (b) use some sort of trimming to correct this resistor value. Depending on the further use of the bias currents a variation of the resistor might no be a bad thing. Assuming we use the resulting bias current to generate somewhere a bias voltage \(V_\mathrm{bias}\) by running the current through resistor \(R_2\), then this voltage is given by
\[ V_\mathrm{bias} = R_2 I_\mathrm{bias} = \frac{R_2}{R_1} V_\mathrm{ref}. \]
We can see in the above equation that the absolute values of \(R_1\) and \(R_2\) do not play a role, only the ratio is important. Luckily, components in integrated circuits match very well, so we can multiply and scale one reference voltage \(V_\mathrm{ref}\) across our IC.
Since all components on chip will experience manufacturing tolerances of at least \(\pm 10\%\) we strive for something more accurate. We can resort to an off-chip property like an externally provided reference current or voltage, or use the power supply as a voltage reference (often, power supply rails are specified to \(\pm 5 \ldots 10\%\)).
The only option left is to use a material property of the silicon itself for stable reference voltage generation.
11.1 Bandgap Reference
It has been realized that a bipolar junction transistor (BJT) has the wonderful property, that the base-emitter voltage \(V_\mathrm{BE}\) has the following approximate relationship vs. temperature (R. J. Widlar 1971)
\[ V_\mathrm{BE}\approx V_\mathrm{g0} \left( 1 - \frac{T}{T_0} \right) + V_\mathrm{BE0} \left( \frac{T}{T_0} \right) \tag{31}\]
where \(V_\mathrm{g0} = 1.205\,\text{V}\) is the bandgap voltage of silicon at \(0\,\text{K}\) and \(V_\mathrm{BE0}\) is the base-emitter voltage of a BJT as reference temperature \(T_0\). Further, the difference in \(V_\mathrm{BE}\) of two BJT operated at different emitter current densities \(J_1\) and \(J_2\) is given as
\[ \Delta V_\mathrm{BE}= \frac{k T}{q} \ln \left( \frac{J_1}{J_2} \right) \tag{32}\]
with \(k\) the Boltzmann constant, and \(q\) the elementary charge.
Adding Equation 31 and Equation 32 results in a reference voltage of value
\[ V_\mathrm{ref} = V_\mathrm{g0} \left( 1 - \frac{T}{T_0} \right) + V_\mathrm{BE0} \left( \frac{T}{T_0} \right) + \frac{k T}{q} \ln \left( \frac{J_1}{J_2} \right) \tag{33}\]
which can be made temperature-insensitive when the terms, which are a function of \(T\), cancel each other, and only
\[ V_\mathrm{ref} = V_\mathrm{g0} \tag{34}\]
remains. We thus have created an on-chip reference (called the bandgap voltage reference) which is almost independent of manufacturing tolerances with zero temperature coefficient. Of course, this is only true neglecting second-order effects, but nevertheless, reference accuracies of \(\pm 1 \ldots 3\%\) without trimming are perfectly possible.
The original implementation in (R. J. Widlar 1971) uses NPN transistors. The question is, where do we find BJT in a CMOS process? Luckily, when looking at the typical implementations, we find that there is a layer sandwich of P-N-P available. While the PNPs constructed parasitically out of this available layers are available for free without extra processing cost, they are very slow, show unusually small \(\beta < 10\), and the collector is tied to \(V_\mathrm{SS}\) as it is the substrate. Still, for bandgap references, they are very useful.
A simple implementation of a bandgap reference circuit is shown in Figure 45. If we scale \(R_1\) and \(R_2\) correctly then we can achieve Equation 34. Note that the output voltage is ca. \(1.2\,\text{V}\), so operating this circuit on low supply voltages will be problematic.
\(M_1\) to \(M_4\) are scaled in a way that in both branches the same bias current is flowing. Further, \(M_1\) and \(M_2\) ensure that there is the same potential at their sources. Since the PNP are scaled by the factor \(n\) (thus the current density is different) so that the following voltage develops across \(R_1\):
\[ \Delta V_\mathrm{BE}= \frac{k T}{q} \ln m \]
Hence, the bias current in all the branches is given by
\[ I_\mathrm{bias} = \frac{\Delta V_\mathrm{BE}}{R_1} = \frac{1}{R_1} \frac{k T}{q} \ln m. \tag{35}\]
Inspecting Equation 35 we see that \(I_\mathrm{bias} = k_1 T\) is a linear function of temperature \(T\), a property that is very useful and called PTAT (proportional to absolute temperature). With \(M_5\) we mirror this bias current into the output branch, and the output voltage \(V_\mathrm{ref}\) is then given by
\[ V_\mathrm{ref} = V_\mathrm{BE}+ \frac{R_2}{R_1} \frac{k T}{q} \ln m. \]
By proper selection of \(R_1\), \(R_2\) and \(n\) we can satisfy Equation 33 to result in Equation 34.
For an improved implementation of Figure 45, the current mirrors should be cascoded, and a startup circuit should be included to guarantee proper operation after enabling it. Further, Equation 31 and Equation 32 build on the relationship \(I_\mathrm{C} = f(V_\mathrm{BE})\), while we control \(I_\mathrm{E}\) in this circuit. If \(\beta\) is large then \(I_\mathrm{C} \approx I_\mathrm{E}\), but this is not the case for the used PNPs.
The circuit of Figure 45 has been implemented in Xschem and is shown in Figure 46. The current sources have been improved by using cascodes. No base current compensation is implemented, as it is assumed that the \(\beta\) of the PNP are similar although they are operated at different emitter current densities. Note the addition of a startup branch with \(M_\mathrm{startup}\) which is inactive during normal operation but will inject a startup current if no proper bias point has yet been found. There is no circuitry added for enabling/disabling the circuit, which would also be needed for a practical implementation.
The resistors \(R_{1-4}\) have been implemented out of unit elements of ca. \(5\,\text{k}\Omega\) for optimum matching. Building a bandgap for the first time on silicon likely will show a slightly deviating temperature coefficient, which is why we keep a few dummy resistors around in \(R_2\) to compensate the TC in a redesign.
The Xschem schematic is available here and the simulated reference voltage vs. temperature is shown in Figure 47. For a typical process we achieve a TC of \(\pm 0.2\%\).
Please note how tight the dc operating point is in this design to keep all MOSFET saturated. We only use \(100\,\text{mV}\) nominally as headroom. The circuit in Figure 46 works only marginally at \(V_\mathrm{DD}= 1.5\,\text{V}\), but would not work at \(1.2\,\text{V}\) or lower. Improved circuit architectures for \(<1\,\text{V}\) operation exist Eberlein, Panagopoulos, and Pretl (2018).
11.2 Banba Bandgap Reference
The Banba reference (Banba et al. 1999) is quite a bit trickier to design than the classical bandgap shown in Figure 45. It requires the use of an error amplifier; luckily, we can use the 5T-OTA which we designed in Section 8. Since a loop is involved the startup of this circuit is not easy and requires the use of a transient simulation and a proper pre-charge of critical nodes. We can use the ngspice scripting language to (a) set the temperature for a sweep, (b) run a transient simulation, and (c) capture the final reference voltage and save it.
A first design has been implemented and is shown in Figure 48. The supply voltage (which could be lower than \(1.5\,\text{V}\)) is limited by our OTA design; however, it works well at \(V_\mathrm{DD}= 1.5\,\text{V}\). The simulated reference voltage (which is scaled to roughly \(V_\mathrm{bandgap} / 2\)) is shown in Figure 49.
Note the self-biasing of the OTA with the current generated in the reference branches. The feedback loop needs capacitors for stabilization, and we use area-efficient MOSFET for this task. For a detailed explanation of this circuit please refer to (Banba et al. 1999), but in brief, the operation is as follows:
The CTAT and PTAT currents required for compensating each others TC’s are built using a \(\Delta V_\mathrm{BE}\) cell (given the PTAT current) with parallel resistors (providing the CTAT current). Voltages are sensed using an error amplifier and current sources are controlled to achieve matching currents in both branches. This current can then also be mirrored to flow through another resistor which can be scaled to produce the output voltage.
In comparison to the classical bandgap reference shown in Figure 45 (where the developed currents in all branches are PTAT) the currents developed in the Banba reference are constant vs. temperature.
The minimum supply voltage for the Banba reference is the \(V_\mathrm{BE}\) of the PNP plus the saturation voltage of one MOSFET current source, so ca. \(V_\mathrm{DDmin} \ge 0.8\,\text{V} + 0.2\,\text{V} \approx 1\,\text{V}\). Of course we also need to design an OTA which can work at this low supply.
12 A Fully-Differential OTA
To be added in a future release.
13 An RC-OPAMP Filter
To be added in a future release.
14 Summary & Conclusion
By now, you should be familiar with the use of a schematic entry tool (Xschem) and circuit simulator (ngspice). You have learned the basic performance trade-offs, and the large- and small-signal behavior of the MOSFET. You can use the \(g_\mathrm{m}/I_\mathrm{D}\) method to size MOSFET for class-A operation. You can design simple amplifiers based on OTA structures. In summary, you are on a good way to become a good analog or mixed-signal circuit designer!
We hope you have enjoyed these lecture notes! If you have feedback, suggestions, additions, or corrections, please send us an e-mail, create a GitHub issue, or provide a GitHub pull request. Thank you in advance for your contributions!
15 Appendix: Middlebrook’s Method
If we want to do a closed-loop gain analysis (for stability or other investigations), we have the need to break the loop at one point, apply a stimulus, and monitor the response on the other end. By doing this we want to keep the loading (i.e. the impedance) on both ends similar to the original case. To achieve this, we break the loop at one point by inserting (1) an ac voltage source, and (2) attach an ac current source, as shown in Figure 50 and Figure 51. The derivation of this approach is presented in (Middlebrook 1975), and has the big advantage that loading is not changed compared to the closed-loop situation, and the bias points are also unchanged.
For both cases we do an ac analysis, and find the corresponding transfer functions \(T_\mathrm{v}\) and \(T_\mathrm{i}\) as \[ T_\mathrm{v} = -\frac{V_\mathrm{r}}{V_\mathrm{f}} \] and \[ T_\mathrm{i} = -\frac{I_\mathrm{r}}{I_\mathrm{f}}. \]
Then, we can calculate the open-loop transfer function \(T(s) = H_\mathrm{ol(s)}\) as \[ T(s) = \frac{T_\mathrm{v} T_\mathrm{i} - 1}{T_\mathrm{v} + T_\mathrm{i} + 2} = \frac{V_\mathrm{r} I_\mathrm{r} - V_\mathrm{f} I_\mathrm{f}}{2 V_\mathrm{f} I_\mathrm{f} - V_\mathrm{r} I_\mathrm{f} - V_\mathrm{f} I_\mathrm{r}}. \]
The four ac quantities \(V_\mathrm{f}\), \(V_\mathrm{r}\), \(I_\mathrm{f}\), and \(I_\mathrm{r}\) we can readily find by circuit simulation or calculation.
Please note that Middlebrook’s method works well for \(T \gg 1\), so it will show inaccuracies at the crossover frequency of the open-loop gain at \(T(s) \approx 0\). An improved method (slightly more complicated) can be found in (Tian et al. 2001).
16 Appendix: Useful Circuit Theorems
16.1 Miller’s Theorem
Using Miller’s theorem we can find the equivalent circuit of an impedance connected between two nodes, and we know the transfer function between these nodes. The given situation is shown in Figure 52, and the equivalent circuit is shown in Figure 53.
Using Miller’s theorem (Sheikholeslami 2015) we can calculate \[ Z_1 = \frac{Z}{1 - A} = \frac{Z}{1 - V_\mathrm{B} / V_\mathrm{A}} \] and \[ Z_2 = \frac{Z}{1 - A^{-1}} = \frac{Z}{1 - V_\mathrm{A} / V_\mathrm{B}} \] to arrive at an equivalent circuit, given that \(A = V_\mathrm{B} / V_\mathrm{A}\) is the voltage gain between nodes A and B. A derivation of this theorem is relative straightforward considering the current through \(Z\) when looking into the impedance from either node A or node B and calculating an equivalent impedance causing the same current.
Note that if \(V_\mathrm{A} = V_\mathrm{B}\) then there is no current flow through \(Z\), and accordingly the impedances \(Z_1 = Z_2 = \infty\).
Miller’s theorem can be quite handy when an impedance is strapped between two nodes, and we want to break this connection in a calculation, e.g., considering the effect of \(C_\mathrm{GD}\) in a MOSFET.
16.2 Bode’s Noise Theorem
The total integrated noise of any (no matter how complicated) \(RLC\) network (interpreted as a one-port) is given by
\[ \overline{V_\mathrm{n}^2} = kT \left( \frac{1}{C_\infty} - \frac{1}{C_0} \right), \]
where \(C_\infty\) is the capacitance looking into the network with all resistors and inductors open-circuited, and \(C_0\) is the capacitance looking into the circuit when all inductors and resistors are shorted (Pavan 2019).
Reference (Pavan 2019) is an excellent read deriving Bode’s noise theorem from different angles.
17 Appendix: 5T-OTA Small-Signal Output Impedance
This section gives additional details to the analysis presented in Section 8.3. Here we provide the full calculation of the output impedance/conductance of the 5T-OTA for frequencies below the dominant pole, i.e. we neglect any capacitors.
17.1 Open-Loop Configuration
For the open-loop case, the gates of \(M_1\) and \(M_2\) are tied to ground and thus, both \(v_\mathrm{gs}\) are equal. \[ v_\mathrm{in,p}=v_\mathrm{in,p}=0~\text{V} \] \[ v_\mathrm{gs1} = v_\mathrm{gs2} \tag{36}\]
KCL at the output node: \[ i_\mathrm{out} -g_\mathrm{ds4} v_\mathrm{out} - g_\mathrm{m34}v_\mathrm{gs34} - i_{g_\mathrm{ds2}} - g_\mathrm{m2}v_\mathrm{gs2} = 0 \tag{37}\]
KCL at the tail node: \[ g_\mathrm{m1}v_\mathrm{gs1} + g_\mathrm{m2}v_\mathrm{gs2} + i_{g_\mathrm{ds2}} + g_\mathrm{ds5}v_\mathrm{gs2} = 0 \]
Using Equation 36 we can eliminate \(v_\mathrm{gs1}\) and solve for \(i_{g_\mathrm{ds2}}\). \[ i_{g_\mathrm{ds2}} = -\left(g_\mathrm{m1}+g_\mathrm{m2}+g_\mathrm{ds5}\right)v_\mathrm{gs2} \tag{38}\]
Furthermore, we need an expression for \(v_\mathrm{gs34}\). Ohm’s law at the conductance \(g_\mathrm{m34}\) will suffice. \[ v_\mathrm{gs34} = -\frac{g_\mathrm{m1}}{g_\mathrm{m34}}v_\mathrm{gs1} \tag{39}\]
KVL from the output node down to ground (over \(g_\mathrm{ds2}\) and \(g_\mathrm{ds5}\)) in combination with Equation 38 gives us an expression for \(v_\mathrm{gs2}\) \[ v_\mathrm{gs2} = -\frac{g_\mathrm{ds2}}{g_\mathrm{m1}+g_\mathrm{m2}+g_\mathrm{ds2}+g_\mathrm{ds5}}v_\mathrm{out} \tag{40}\]
Now, we can plug in all quantities into Equation 37. First, Equation 38 is inserted, which provides an expression for the current through the output conductance \(g_\mathrm{ds2}\) of \(M_2\). \[ i_\mathrm{out} -g_\mathrm{ds4} v_\mathrm{out} - g_\mathrm{m34}v_\mathrm{gs34} + \left(g_\mathrm{m1}+g_\mathrm{ds5}\right)v_\mathrm{gs2} = 0 \]
Second, \(v_\mathrm{gs34}\) is substituted by Equation 39. Since we have assumed a matched pair of transistors for the current mirror comprised of \(M_3\) and \(M_4\), \(g_\mathrm{m34}\) perfectly cancels out of the equation, and is effectively replaced by the transconductance \(g_\mathrm{m1}\) of the input transistor \(M_1\). \[ i_\mathrm{out} -g_\mathrm{ds4} v_\mathrm{out} + \left(2g_\mathrm{m1}+g_\mathrm{ds5}\right)v_\mathrm{gs2} = 0 \]
Third, Equation 40 gives as an expression for the last remaining unknown \(v_\mathrm{gs2}\). Thus, the factor in front of \(v_\mathrm{out}\) defines the conductance at the output node. \[ i_\mathrm{out} -\left[g_\mathrm{ds4} + \left(2g_\mathrm{m1}+g_\mathrm{ds5}\right)\frac{g_\mathrm{ds2}}{g_\mathrm{m1}+g_\mathrm{m2}+g_\mathrm{ds2}+g_\mathrm{ds5}}\right]v_\mathrm{out} = 0 \tag{41}\] Before, we interpret this result, we use are assumption of matched input transistors (\(g_\mathrm{m12}=g_\mathrm{m1}=g_\mathrm{m2}\)) and slightly rearrange the equation to give us more insight. \[ i_\mathrm{out} -\left[g_\mathrm{ds4} + \frac{g_\mathrm{ds2}\cdot\left(2g_\mathrm{m12}+g_\mathrm{ds5}\right)}{g_\mathrm{ds2}+\left(2g_\mathrm{m12}+g_\mathrm{ds5}\right)}\right]v_\mathrm{out} = 0 \tag{42}\] Now, we can identify the common equation of the total resistance of two parallel resistors. However, we are dealing with conductances here, so the same equation describes the total conductance of two conductances in series, while parallel conductances are simply summed. In parallel to \(g_\mathrm{ds4}\), there is effectively the series connection of \(g_\mathrm{ds2}\) and \(\left(2g_\mathrm{m12}+g_\mathrm{ds5}\right)\) at work. If we apply the general assumption of \(g_\mathrm{m}\gg g_\mathrm{ds}\), only the parallel connection of \(g_\mathrm{ds4}\) and \(g_\mathrm{ds2}\) remains. Therefore, moving \(g_\mathrm{ds2} + g_\mathrm{ds4}\) in parallel to \(C_\mathrm{load}\) in Section 8.3 was valid. \[ \frac{i_\mathrm{out}}{v_\mathrm{out}}\approx g_\mathrm{ds4} + g_\mathrm{ds2} \tag{43}\]
17.2 Closed-Loop Configuration
In contrast to the open-loop case, we keep the gate of \(M_1\) connected to ground and tie the input of \(M_2\) to the output node \(v_\mathrm{out}\). \[ v_\mathrm{in,n} = v_\mathrm{out} \tag{44}\]
KCL at the output node: \[ i_\mathrm{out} -g_\mathrm{ds4} v_\mathrm{out} - g_\mathrm{m34}v_\mathrm{gs34} - g_\mathrm{ds2}v_\mathrm{gs2} - g_\mathrm{m2}v_\mathrm{gs2} = 0 \tag{45}\]
We use KVL from the output node down to ground to find an expression for \(v_\mathrm{gs2}\). \[ v_\mathrm{gs2} = v_\mathrm{out} + v_\mathrm{gs1} \tag{46}\]
KCL at the tail node: \[ g_\mathrm{m1}v_\mathrm{gs1} + g_\mathrm{m2}v_\mathrm{gs2} + g_\mathrm{ds2}v_\mathrm{gs2} + g_\mathrm{ds5}v_\mathrm{gs2} = 0 \tag{47}\]
Using Equation 46 to substitute \(v_\mathrm{gs2}\) in {#eq-app-vbufzout-kcl-vtail-cl} we find an equation for \(v_\mathrm{gs1}\). \[ v_\mathrm{gs1} = -\frac{g_\mathrm{m2}+g_\mathrm{ds2}}{g_\mathrm{m1}+g_\mathrm{m2}+g_\mathrm{ds2}+g_\mathrm{ds5}}v_\mathrm{out} \tag{48}\]
Again, we derive the output conductance by plugging Equation 46, Equation 39 and Equation 48 step by step into Equation 45. First, we use Equation 46 to eliminate \(v_\mathrm{gs2}\). \[ i_\mathrm{out} -\left(g_\mathrm{ds4} + g_\mathrm{ds2} + g_\mathrm{m2}\right)v_\mathrm{out} - g_\mathrm{m34}v_\mathrm{gs34} - \left(g_\mathrm{ds2} + g_\mathrm{m2}\right)v_\mathrm{gs1} = 0 \]
Second, Equation 39 also holds for the closed-loop case and lets us eliminate \(v_\mathrm{gs34}\). \[ i_\mathrm{out} -\left(g_\mathrm{ds4} + g_\mathrm{ds2} + g_\mathrm{m2}\right)v_\mathrm{out} - \left(g_\mathrm{ds2} + g_\mathrm{m2} - g_\mathrm{m1}\right)v_\mathrm{gs1} = 0 \]
Third, we use Equation 48 to eliminate the remaining unknown \(v_\mathrm{gs1}\). \[ i_\mathrm{out} -\left(g_\mathrm{ds4} + g_\mathrm{ds2} + g_\mathrm{m2}\right)v_\mathrm{out} + \left(g_\mathrm{ds2} + g_\mathrm{m2} - g_\mathrm{m1}\right)\frac{g_\mathrm{m2}+g_\mathrm{ds2}}{g_\mathrm{m1}+g_\mathrm{m2}+g_\mathrm{ds2}+g_\mathrm{ds5}}v_\mathrm{out} = 0 \]
A more simpler result can be obtained, if we neglect \(g_\mathrm{ds2}\) and \(g_\mathrm{ds5}\) in Equation 48 first (\(g_\mathrm{m}\gg g_\mathrm{ds}\)) and then plug it into our main equation. Additionally, we use \(g_\mathrm{m12}=g_\mathrm{m1}=g_\mathrm{m2}\) to further simplify the equation. \[ i_\mathrm{out} -\left(g_\mathrm{ds4} + \frac{3}{2} g_\mathrm{ds2} + g_\mathrm{m12}\right)v_\mathrm{out} \approx 0 \]
If we apply \(g_\mathrm{m}\gg g_\mathrm{ds}\) again, we arrive at the same result which was used for the noise calculation in Section 8.3, compare the expression for \(Y'_\mathrm{load}\) given by Equation 24 . \[ i_\mathrm{out} -\left(g_\mathrm{m12}\right)v_\mathrm{out} \approx 0 \]
18 Appendix: Linux Cheatsheet
The most useful commands for the Linux command line are:
ls
to list files and directoriescd
to change directory (e.g.cd analog-circuit-design/xschem
)cd ..
to move one directory level downmkdir
to create a new directory (e.g.mkdir my_directory
)touch
to create an empty file (e.g.touch file.txt
)rm
to remove files (e.g.rm file.txt
)rm -r
to remove recursively, for example a directory (e.g.rm -r my_directory
)cp
to copy files (e.g.cp file.txt destination
)cp -r
to copy recursively a directory (e.g.cp -r directory destination
)mv
to rename files (e.g.mv file.txt new_name.txt
)mv
to move files into other directories (e.g.mv file.txt directory
)cat
to view the contents of a file (e.g.cat file.txt
)find
to search for files and directories (e.g.find /path -name "*.txt"
)nano
to edit file (e.g.nano file.txt
)Ctrl + C
to forcefully terminate a running processhtop
to open the “task manager”
More advanced commands can be found under https://www.geeksforgeeks.org/linux-commands-cheat-sheet.
19 Appendix: Xschem Cheatsheet
When opening Xschem, using Help -> Keys
a pop-up windows comes up with many useful shortcuts. The most useful are:
19.0.0.1 Moving around in a schematic:
Cursor keys
to move aroundCtrl-e
to go back to parent schematice
to descend into schematic of selected symboli
to descend into symbol of selected symbolf
full zoom on schematicShift-z
to zoom inCtrl-z
to zoom out
19.0.0.2 Editing schematics:
Del
to delete elementsIns
to insert elements from libraryEscape
to abort an operationCtrl-#
to rename components with duplicate namesc
to copy elementsAlt-Shift-l
to add wire labelAlt-l
to add label pinm
to move selected objectsShift-R
to rotate selected objectsShift-F
to mirror / flip selected objectsq
to edit propertiesCtrl-s
to save schematict
to place a textShift-T
to toggle theignore
flag on an instanceu
to undo an operationw
to draw a wireShift-W
draw wire and snap to close pin or net point&
to join, break, and collapse wiresA
to make symbol from schematicAlt-s
to reload the circuit if changes in a subcircuit were made
19.0.0.3 Viewing/Simulating Schematics
5
to only view probesk
to highlight selected netShift-K
to unhighlight all netsShift-o
to toggle light/dark color schemes
to run a simulationa & b
to add cursors to an in-circuit simulation graphf
full zoom on y- or x-axis in in-circuit simulation graph
20 Appendix: ngspice Cheatsheet
Here is an unsorted list of useful ngspice settings and command:
20.1 Commands
ac dec|lin points fstart fstop
performs a small-signal ac analysis with either linear or decade sweepdc sourcename vstart vstop vincr [src2 start2 stop2 incr2]
runs a dc-sweep, optionally across two variablesdisplay
shows the available data vectors in the current plotecho
can be used to display text,$variable
or$&vector
, can be useful for debugginglet name = expr
to create a new vector;unlet vector
deletes a specified vector; access vector data with$&vec
linearize vec
linearizes a vector on an equidistant time scale, do this before an FFT; withset specwindow=windowtype
a proper windowing function can be setmeas
can be used for various evaluations of measurement results (see ngspice manual for details)noise v(output <ref>) src (dec|lin) pts fstart fstop
runs a small-signal noise analysisop
calculates the operating point, useful for checking bias points and device parametersplot expr vs scale
to plot somethingprint expr
to print it, useprint all
to print everythingremzerovec
can be useful to remove vectors with zero length, which otherwise cause issues when saving or plotting datarusage
plot information about resource usage like memorysave all
orsave signal
specifies which data is saved during simulation; this lowers RAM usage during simulation and size of RAW file; do save before the actual simulation statementsetplot
show a list of available plotsset var = value
to set the value of a variable; use variable with$var
;unset var
removes a variableset enable_noisy_r
to enable noise of behavioral resistors; usually, this is a good ideashell cmd
to run a shell commandshow : param
, likeshow : gm
shows the \(g_\mathrm{m}\) of all devices after running an operating point withop
spec
plots a spectrum (i.e. frequency domain plot)status
shows the saved parameters and nodestf
runs a transfer function analysis, returning transfer function, input and output resistancetran tstep tstop <tstart <tmax>>
runs a transient analysis untiltstop
, reporting results withtstep
step size, starting to plot attstart
and performs time steps not larger thentmax
wrdata
writes data into a file in a tabular ASCII format; easy to further processwrite
writes simulation data (the saved nodes) into a RAW file; default is binary, can be changed to ASCII withset filetype=ascii
; withset appendwrite
data is added to an existing file
20.2 Options
Use option option=val option=val
to set various options; important ones are:
abstol
sets the absolute current error tolerance (default is 1pA)gmin
is the conductance applied at every node for convergence improvement (default is 1e-12); this can be critical for very high impedance circuitsklu
sets the KLU matrix solverlist
print the summary listing of the input datamaxord
sets the numerical order of the integration method (default is 2 for Gear)method
set the numerical integration method togear
ortrap
(default istrap
)node
prints the node tableopts
prints the option valuestemp
sets the simulation temperaturereltol
set the relative error tolerance (default is 0.001 = 0.1%)savecurrents
saves the terminal currents of all devicessparse
sets the sparse matrix solver, which can run noise analysis, but is slower thanklu
vntol
sets the absolute voltage error tolerance (default is 1µV)warn
enables the printing of the SOA warning messages
20.3 Convergence Helper
option gmin
can be used to increase the conductance applied at every nodeoption method=gear
can lead to improved convergence.nodeset
can be used to specify initial node voltage guesses.ic
can be used to set initial conditions
21 Appendix: Circuit Designer’s Etiquette
21.1 Prolog
A consistent naming and schematic drawing style, as well as VHDL/Verilog coding scheme, is a huge help in avoiding errors and increasing productivity. Even if just one person works on a design, the error rate is lowered. If multiple persons work together in a team, a consistent working style is a big help for smooth cooperation without misunderstanding each other’s intentions. Consistency also helps to reuse existing blocks. In a well-done design, the documentation is included in the schematic/source code, so there is no searching for a piece of documentation somewhere else (which is often not found anyway).
21.2 Pins
- Name package pins (interfacing with the outside the IC) in UPPERCASE, and all internal signals in lowercase.
- Supply voltages like VDD/VCC and ground like VSS/GND need to start with either
VDD
,VCC
,VSS
,VEE
orGND
, plus a suitable suffix. Examples:VDD1
,VDD_AMP
,vdd_ldo_out
,VSS_ANA
(uppercase means connected to a pin, lowercase means a VDD is created on-chip by, e.g., an LDO). - Preferred are
VDD
/VSS
for CMOS andVCC
/GND
for bipolar circuits. In BiCMOS circuitsVDD
/VSS
are preferred, as usually, the digital content is the major part. - Digital signals in an analog schematic should start with
di_
(for digital input) ordo_
(for digital output). Example:di_ctrl1
. In the rare case of a bi-directional digital signaldio_
can be used. - Name digital signals consistently:
di_pon
is active-high,di_pon_b
is active-low (_b
standing for the negating “bar”); as an alternative, this last signal could be nameddi_disable
.di_reset
is an active-high reset, but often a reset is active-low, so it needs to be nameddi_reset_b
(an alternative isdi_resetn
). - In mixed-voltage designs, it might be useful to append the voltage level of a signal to avoid connecting incompatible inputs and outputs. Example:
do_comp_1v2
ordi_poweron_3v3
. - Digital buses always have the MSB to the left and LSB to the right. Example:
do_adc[7:0]
. - Analog signals should start with a
v
for a voltage signal ori
for a current signal. It is often useful to include a value for bias signals or make the naming meaningful. RF signals, which are often neither voltage nor current signals, start the name withrf_
. Examples: Signal and pin names likeibias_30u
(30uA of bias current),vbg_1v2
(a bandgap voltage of 1.2V),vin_p
,v_filt_out_n
, andrf_lna_i
speak for themselves. - Appending analog signals with
_i
and_o
might be useful if a clear direction is obvious in the signal flow. If a signal is bi-directional, it is better to skip_io
. - Consistently use pin types
input
,output
, orinout
to indicate signal flow. Power supply pins are ofinout
type.
21.3 Schematics
- In analog schematics, add a textual note about basic circuit performance. For example, in an amplifier, note things like suitable supply range, typical and w.c. current consumption, gain, GBW, input voltage range, PSRR, and other useful information.
- If a circuit has a quirk or is particularly clever, add a note on how it works, so others can understand the function without excessive analysis (reviewing a circuit should not be a brain teaser).
- Use provided borders or drawing templates for schematics, and fill the data in, like circuit designer name, date, change history, project name, etc.
- Use a versioning system for your data, and check in often. This avoids data loss, and going back to an earlier design stage is simple.
SVN
is often preferable toGIT
for binary data. - Draw uncluttered clear circuits. Ideally, the circuit function is apparent by inspection quickly. Everyone can obscure an inverter so that it takes 5 minutes to recognize it, but this is not a good design.
- Don’t alter the standard grid setting while drawing schematics (also make sure that the pins in your drawn symbols are on the standard grid)! Off-grid schematic elements will haunt you and your colleagues forever!
- Once a schematic is finished, take the time to name component instances properly (you can use speaking names like
Rstab
or simply useR1
,R2
, etc.). Use iterated instances to clean up the circuit. Use wire bundles to clean up circuits where useful. A clever technique is to use bundles and iterated instances to efficiently draw large resistor ladders, for example (however, use with care). - Avoid connection-by-name, as it makes the circuit hard to read. However, there is a fine line to not cluttering circuits. Signals with many connections (
vdd
,vss
,pon
,pon_b
) are often better done with connection-by-name instead of drawing a wire. - Some tools allow the use of colored wires, which might be used to mark signal paths, bias lines, etc. However, this should not be overdone; use it with care.
- If you add auxiliary elements like current probes, ensure they get proper treatment when creating the netlist for the LVS (some elements should be shorted, and some elements simply taken out). Ideally, only use a single schematic for simulation, LVS, etc. By using tool features this can usually be done, and avoids the need to keep multiple schematics of one block in sync.
- Use annotations in the schematics to (1) denote current levels in branches, (2) denote bias voltage levels, (3) explain the function of logic input signals, and (4) put in logic tables if not obvious.
- Add comments concerning the layout, like matching devices, certain considerations of placement, sensitive nodes, etc.
- Add simple ASCII diagrams for timing signals if useful.
- Name internal signals (signals connected to pins are anyway named like the port) in a meaningful way; this makes tracking signals in simulation or layout much easier (automatic net names like
net0032
are of not much help). - Properly name instances, not just
I1
orI2
; better isamp1
,inv2
, etc. (a descriptor in a tool output likeI1/I13/I5/net017
is not helpful; compare that toadc1/bias/bg/vref_int
). - On check-and-save, never ignore warnings; just fix them! They will annoy you and others forever and might flag critical design flaws.
- Name cells interpretably, ideally making the function clear already by the name. It is often useful to prefix or postfix a cell by the project name and design iteration. Example: In the project
GIGAPROJECT
, the cells which are changed in the second design step are prefixed withg2_
, likeg2_amp_bias
. Of course, more letters as a project abbreviation are useful if a name collision is likely to happen. - Cell names in lowercase are a good choice, as otherwise, capitalization leads to inconsistency in cell names. Use
_
to break words instead ofCamelCase
, likeamp_bias_startup
. - When building a design, start with the hierarchy first; plan a suitable design structure, and define all interfaces. Implement simple behavioral models for every circuit block (either with controlled sources or using Verilog-A or VHDL/Verilog digital models). In this way, you can simulate the overall design early and find issues in the hierarchy or the interconnects. Then, populate the hierarchy with the detailed circuit designs in the leaf cells. At each point in the design process, you have a design that can be simulated, with some blocks as behavioral models and some blocks already designed. Try to avoid scattered circuit elements (digital or analog) in the hierarchy; it is better to push all components into the leaf cells.
- Avoid huge schematics, better break them down into smaller, maintainable, and self-contained blocks, and provide a simulation test bench for these simple blocks. In this way, later re-simulation across the hierarchy is easily possible.
- When building up the hierarchy, choose pin names and signal names as consistently as possible. Example: use the signal name
vref_int
when connecting two leaf cells with the pin namesvref_int_o
andvref_int_i
. - Avoid the excessive use of net breakers like small resistors, as they inhibit net tracing and can lead to simulation convergence issues. If a net breaker is needed (or a current should be probed) use a 0V dc voltage source.
21.4 Symbols
- Spend time drawing nice symbols! Ideally, the underlying circuit functionality is apparent by just looking at the symbol.
- Arrange the pins in a meaningful way.
- Group pins that belong together. An often useful arrangement is to locate the inputs on the left side, outputs on the right, digital control inputs at the bottom, and supplies at the top.
- Make the origin of a symbol in the top-left corner. In this way, symbols can be changed more easily, for example, by swapping out different versions of blocks.
- The cell name (and potentially library name) should be visible in the symbol, not only in the properties.
21.5 Design Robustness
- It is good practice to buffer incoming digital signals with a local inverter (connected to the local block supply) before connecting it to internal nodes. This improves the slew rate of the control signal and lowers the chance of unwanted cross-talk.
- Consider dummy elements for good matching, and try to make useful unit sizes of components. This will make the layout creation much smoother.
- The golden rule of good analog performance is good matching, and good matching is achieved by identical components (size, orientation, surroundings)! If the layout does not look nice (humans like symmetry), it will not perform well.
- Consider supply decoupling and bias voltage decoupling inside the cells. Often, dummy elements can be used for that. Be aware, however, of unwanted supply resonances (think bond wire
L
and decouplingC
) and slow transients of bias nodes after disturbance. - Always implement a proper power-down mode. Avoid floating nodes in off-mode. The better defined the on- as well as the off-mode are, the less the chance of leakage currents. Always simulate both modes (on and off), and also simulate a transient power-up of a circuit to identify issues with slow bias start or insufficient turn-off, or nasty feedback loop instabilities during transients.
- When drawing the first schematic, add parasitic capacitances to each node. If all nodes are labeled, a capacitor bank is easily put into one corner of the schematic with parasitic caps tied to the ground. Use 5fF as a starting value (and replace it later with the correct value from parasitic extraction). This accounts for some wiring parasitics in layout and helps to account for these layout impairments early in the design phase and later when simulating the schematic instead of the extracted netlist with parasitics.
21.6 Rules for Good Mixed-Signal and RF Circuits
- Separate analog and digital power supply, connect to package pins with multiple bond wires/bumps, and separate noisy and clean
vdd
/vss
from each other! - Prevent supply loops; keep
vdd
andvss
lines close to each other (incl. bond wires and PCB traces)! This minimizesL
and coupling factork
. - Some prefer a massive (punched) ground plane, which is possible if you have enough metal levels. With a ground plane, the return path of a signal or supply line is just a few microns away.
- Use chip-internal decoupling capacitors, and decouple bias voltages to the correct potential (
vdd
orvss
, or another node, depending on the circuit)! - Use substrate contacts and guard rings to lower substrate crosstalk but use a quiet potential for connection; use triple-well if available! Connecting a guard-ring/substrate contact to a noisy supply is a prime noise injector (usually unwanted).
- Physically separate quiet and noisy circuits (at least by the epi thickness)!
- Reduce circuit noise generation as much as possible (avoid switching circuits if possible, use constant-current circuits instead, and use series/shunt regulators for supply isolation).
- Reduce sensitivity of circuits to interference (by using a fully differential design with high PSRR/CMRR, symmetrical layout parasitics, and good matching)!
21.7 VHDL/Verilog Coding Guide
These recommendations are specifically targeted at Verilog; however, they apply similarly to VHDL.
- Use automatic checkers (linters) to see whether your code contains errors or vulnerabilities. Commercial or open-source tools allow this, e.g., Icarus Verilog (
iverilog -g2005 -tnull FILE.v
) or Verilator (verilator --lint-only -Wall FILE.v
). - Write readable and maintainable code; use speaking variable names, and use a naming convention for inputs (beginning with
i_
) and outputs (beginning witho_
). Active-low signals have an_n
or_b
in their name, likei_reset_n
. Use comments to explain the intention. - With a synchronous reset reset-related racing conditions are often avoided. If an asynchronous reset is desirable (which is often the case), ensure the reset signals are free from race conditions.
- Module-local registers and wires could append
_w
(for Verilogwire
) or_r
(for Verilogreg
) to make their function clear. This is not required in SystemVerilog where the unified typelogic
should be used. - Use an
assign
statement for logic as this often is easier to read than analways @(*)
block. The ternary operatorCOND ? TRUE : FALSE
can help with conditional assignments and is often a better choice than a (nested)if ... else
statement. - Declare all outputs explicitly with either
reg
orwire
. - Use local parameter definitions with
localparam
in a module to make the code easier to follow. Name parameters in UPPERCASE. - Take care to reset all registers to a defined state (in simulation and HW).
- Use the rule of “one file per module.” The filename shall match the module declaration.
- Use
`default_nettype none
at the beginning of a file containing a module definition. After the module you can use`default_nettype wire
. This will add a safety net against typos in signal names. - In a logic assign block, use
assign @(*) begin ... end
instead of spelling out the signals in the sensitivity list. Forgetting a signal could lead to serious mismatches between simulation and HW. - Make your code flexible by making bit widths and other values parameterized using a
localparam
or module parameter. - Be cautious of implicit type conversions and bit-width adaptions; better make explicit conversions and match bit widths in assignments.
- Use only blocking assignments (
=
) inalways @(*)
blocks, and only non-blocking assignments (<=
) in clockedalways @(posedge ...)
blocks.
21.8 Further Reading
- Good information about drawing schematics, design testbenches, etc: https://circuit-artists.com
- Sutherland/Mills, Verilog and SystemVerilog Gotchas - 101 Common Coding Erorrs and How to Avoid Them, Springer, 2010
- B. Razavi, The Analog Mind, recurrent column in IEEE Solid-State Circuits Magazine
References
Reuse
Citation
@online{pretl2025,
author = {Pretl, Harald and Koefinger, Michael and Dorrer, Simon},
title = {Analog {Circuit} {Design}},
date = {2025-01-03},
url = {https://iic-jku.github.io/analog-circuit-design},
doi = {10.5281/zenodo.14387481},
langid = {en-US}
}