Quantifying Clipping Softness

We present a formal description of clipping functions and a method to analyze their softness mostly audio applications (guitar electronics, audio DSP, etc.) in mind. We also present the softest clipping function, the Blunter, and report the results of an experiment showing that it is indeed the softest function given our description of clipping softness.

Introduction

Clipping is a fundamental concept in signal processing. In high fidelity applications it may be an undesirable artifact of limited headroom and/or failed gain staging, but it can also be an intentional creative effect like in guitar electronics or some music production gear. Either way, the clipping softness has major implications how it is perceived.

There are multiple studies about non-linear distortions in audio that include some analysis of hard and soft clippers. Some focus on detecting these kinds of distortions¹², others focus on how these distortions are perceived¹³⁴. However, these studies only use soft clippers as a part of their study, which is not directly about softness. In other words, these studies have not studied clipping softness itself in detail.

$\text{arcsinh}$ . Even if they do, it is still unclear how abruptly the clipper reaches from the threshold to the limit.

(a) Hard clipping

(b) Soft clipping

(c) Arctan soft clipping

Figure 1: Comparison of three clipping types. (a) has unambiguous limit/threshold. The threshold in (b) would not be immediately clear if we didn't show it. In fact, it is only known because we happened to use a soft clipper with a well defined threshold. However, where is the limit and threshold in (c)?

Furthermore, the value for the hypothetical threshold and limit changes as the gain of the input signal changes and/or the gain of the output signal changes. All real world systems have these parameters, often given by the designer (like guitar amplifier gain and volume controls) of the system, but sometimes they are implicit to the system. For example, an implicit output gain could be given by the choice of components in an electronic circuit and implicit input gain could be given by the loudness of a singer singing into a microphone. This leads to yet another question: how would you compare the clipping softness of systems with varying input and output gain characteristics?

Our model deals with the threshold and limit ambiguities by analyzing the second derivative of the clipping functions instead of analyzing some hypothetical thresholds and limits. The second derivative describes exactly how abruptly the changes in the input signal change with increasing input signal levels. We will also look at an alternative definition of softness based on change of higher order harmonics as input amplitude increases and present methods to normalize input and output gains to enable meaningful comparison of different clipper functions.

But before anything, we perform a quick review of any relevant background information. The reader is assumed to have some understanding about these topics, but the main purpose of the review is to frame the information in a way that suits our problem.

Background Review

$x(t)$ can be represented by a sum of sinusoids (Fourier series):

x (t) = \frac{a_{0}}{2} + \sum_{n = 1}^{\infty} (a_{n} \cos (\frac{2 π}{T} n t) + b_{n} \sin (\frac{2 π}{T} n t)),

$T$ $2\pi n / T$ $a_n$ $b_n$ $a_0 / 2$ $a_n + b_n$ $\sqrt{a_n^2 + b_n^2}.$ Each Fourier coefficient can be computed like so:

\begin{aligned} (1) & a_{n} & = \frac{2}{T} \int_{0}^{T} x (t) \cos (\frac{2 π}{T} n t) d t \\ (2) & b_{n} & = \frac{2}{T} \int_{0}^{T} x (t) \sin (\frac{2 π}{T} n t) d t \end{aligned}

Clipping a pure sinusoid generates harmonics. A common way of measuring overall distortion of a system is to feed a sinusoid to it and measure the amplitudes of the generated harmonics relative to the fundamental. This is known as THD (total harmonic distortion), and it can be computed using the Fourier coefficients like so:

\begin{matrix} (3) & THD = \sqrt{\frac{\sum_{n = 2}^{\infty} (a_{n}^{2} + b_{n}^{2})}{a_{1}^{2} + b_{1}^{2}}} . \end{matrix}

Definitions

Clipping Function

clipping function $f$ clipper $f'$ $f$ must be a Sigmoid-shaped function.

$f,$ a normalized clipping function is defined as

f_{1} (x) = A_{out} f (A_{in} x),

$A_\text{in}$ input gain $A_\text{out}$ is the output gain. Normalization is discussed later in more detail.

Clippers can be categorized in three categories:

Bounded clippers cannot increase their output once some limit is reached. In other words, bounded clippers are piece-wise functions that have zero derivative beyond any given signal level limit.
Converging clippers have a limit when signal level approaches infinity.
Diverging clippers do not converge. Output signal level can be increased indefinitely.

Asymmetric clippers can be any combination of these categories for each side of the waveform.

Real-world clippers seen in music production gear often have a non-flat frequency response and non-zero phase response⁵. This complicates any attempts to analyze softness significantly: any results will be frequency dependent. However, analyzing the non-linearities isolated from any linear effects is often preferred anyway for meaningful comparisons. Measuring a hard clipper processed with a steep low pass filter to be softer than a soft clipper processed with a high pass filter doesn't seem like a very useful result. Therefore, we will be assuming flat frequency response and zero phase if not stated otherwise.

Hardness and Softness

hardness $H_f$ softness $S_f$ of any given clipping function can be defined as

\begin{aligned} (4) & H_{f} & = max (| f_{1}^{″} (x) |) \\ (5) & S_{f} & = \frac{1}{H_{f}} . \end{aligned}

$\max$ $x$ axis or number of local extrema. Real world clippers may be (and commonly are) asymmetric, and they might have multiple local extrema, which might be the case when composing clippers. We think that the simplification is justified: it can be expected that the sharpest edge of the clipping function dominates the abruptness of change in harmonic content. We also expect our model to be predominantly used for individual clippers that are usually unimodal instead of multi-modal composed clippers. We will also simplify the study further by focusing on symmetric clippers for brevity.

$g(x) = h(kx),$ $k$ is constant, then the chain rule gives us

\begin{aligned} g^{'} (x) & = k h^{'} (k x) \\ g^{″} (x) & = k^{2} h^{″} (k x), \end{aligned}

which allows us to determine hardness in terms of unnormalized clippers like so:

\begin{matrix} (6) & H_{f} = A_{out} A_{in}^{2} max (| f^{″} (A_{in} x) |) . \end{matrix}

$A_\text{in}$ $\max(|f''(A_\text{in}x|),$ $\eqref{hardness1}$ can be simplified to

\begin{matrix} (7) & H_{f} = A_{out} A_{in}^{2} max (| f^{″} (x) |) . \end{matrix}

WTHD Hardness

The THD is not a good measure of how distorted an audio signal is perceived to be. For example, it completely ignores the fact that higher order harmonics are perceived more strongly and offensively⁴. A simple heuristic to accommodate for the perception of the higher order harmonics is to weight the harmonics' amplitudes linearly. We define weighted total harmonic distortion (WTHD) to be

WTHD = \sqrt{\frac{\sum_{n = 2}^{\infty} n (a_{n}^{2} + b_{n}^{2})}{a_{1}^{2} + b_{1}^{2}}} .

If we feed a sinusoid to a clipper, as the input amplitude of the input sinusoid changes, so does the the Fourier coefficients, and thus the WTHD as well. So the WTHD of a clipper can be described as a function of input amplitude:

\begin{matrix} (8) & {WTHD}_{f} (α) = \sqrt{\frac{\sum_{n = 2}^{\infty} n (a_{f n} (α)^{2} + b_{f n} (α)^{2})}{a_{f 1} (α)^{2} + b_{f 1} (α)^{2}}}, \end{matrix}

$\alpha$ is the changing input amplitude and

\begin{aligned} a_{f n} (α) & = \frac{2}{T} \int_{0}^{T} f (A_{in} α \sin (\frac{2 π}{T} t)) \cos (\frac{2 π}{T} n t) d t \\ (9) & b_{f n} (α) & = \frac{2}{T} \int_{0}^{T} f (A_{in} α \sin (\frac{2 π}{T} t)) \sin (\frac{2 π}{T} n t) d t \end{aligned}

$f.$ $f(A_\text{in}\alpha\sin(2\pi t/T))$ odd function $\cos(2\pi nt/T)$ even function $a_{fn}(\alpha) = 0.$ $\eqref{wthdf}$ to

\begin{matrix} (10) & {WTHD}_{f} (α) = \frac{\sqrt{\sum_{n = 2}^{\infty} n b_{f n} (α)^{2}}}{b_{f 1} (α)} . \end{matrix}

$\eqref{wthdf}$ $\eqref{wthdf0}$ for an alternative definition of hardness. We will define the WTHD hardness and WTHD softness to be

\begin{aligned} H_{WTHD f} & = max ({WTHD}_{f}^{'} (α)) \\ S_{WTHD f} & = \frac{1}{H_{WTHD f}} . \end{aligned}

This potentially more accurately would match the perception of softness. By measuring the change in WTHD, we are measuring how the clipping "feels". For example, an electric guitar player playing trough a hard clipper with low input gain might note that playing softly enough results in no distortion and playing harder immediately produces noticeable higher order harmonics.

The problem with this definition is that computing it for arbitrary functions would require computing the Fourier series repeatedly and observing how the WTHD changes. This could be computationally very expensive, so we will focus on the second derivative based definition instead. However, we will see later that the second derivative based and the WTHD based definitions are related to each other.

$A_\text{out}$ $A_\text{out}$ is not needed for this definition: WTHD is already normalized against the amplitude of the output fundamental.

We will be referring to non-WTHD (second derivative based) hardness as just hardness, WTHD is explicitly stated when referring to WTHD hardness from now on.

Hard Clipper and Quadratic Soft Clipper

hard clipper $h.$ $T$ is reached after which the output stays constant. We will set this threshold to one. Then, a positive side hard clipper can be described as

\begin{matrix} h_{+} (x) = {\begin{cases} x, & x \leq 1 \\ 1, & x \geq 1. \end{cases} \end{matrix}

quadratic soft clipper $s,$ $k \in (0, 1].$ $T - k,$ $T + k,$ $P$ $T - k$ $T + k.$ The resulting function is a valid clipping function, however we are only using it to analyze hard clipping, so we will ignore normalization. Then, a positive side soft clipper can be described as

\begin{aligned} P (x) & = a x^{2} + b x + c \\ a & = - \frac{1}{4 k} \\ b & = \frac{1}{2} + \frac{T}{2 k} \\ c & = - \frac{T^{2}}{4 k} + \frac{T}{2} - \frac{k}{4} \\ (11) & s_{+} (x) & = {\begin{cases} x, & x \leq 1 - k \\ P (x), & 1 - k \leq x \leq 1 + k \\ 1, & 1 + k \leq x . \end{cases} \end{aligned}

It is trivial to see that the maximum of the second derivative of the soft clipper is completely determined by the spline. We can also see that

H_{s +} = max (- s_{+}^{″} (x)) = max (- a) = \frac{1}{4 k},

$s$ $k$ and the limit of hardness and softness of the soft clipper as the knee size approaches to hard clipping is

\begin{aligned} H_{h} & = lim_{k \to 0 +} \frac{1}{4 k} = \infty \\ S_{h} & = lim_{k \to 0 +} \frac{1}{H_{h}} = 0, \end{aligned}

$\lim H_{\text{WTHD}h} = \infty$ as well. We leave this analysis as an exercise to the reader.

The analysis for negative side would be identical of course. However, it should be noted that asymmetrical hard clipping will always have a softness of zero, regardless of clipping thresholds of either side. In fact, the other side may not be clipped at all and the result is still zero. This may seem confusing and like it could undermine the usefulness of the model. And indeed, fully asymmetrical (one side linear) clipping will have a very distinct sound from symmetrical clipping. However, both asymmetrical and symmetrical hard clipping have an important property: once a threshold is reached (doesn't matter which one or both), the sound is immediately notably distorted. The abruptness of the clipping is what softness is measuring, a complete description of tonal characteristics of any given clipping function is outside of the scope of this study.

While we only needed to consider simplified unipolar hard clipper and soft clipper, their complete descriptions can be useful for DSP or other purposes. So for completeness, the full generic description of asymmetric hard clipper and quadratic soft clipper is as follows:

\begin{aligned} h (x) & = {\begin{cases} T_{-}, & x \leq T_{-} \\ x, & T_{-} \leq x \leq T_{+} \\ T_{+}, & T_{+} \leq x \end{cases} \\ P_{+} (x) & = - \frac{1}{4 k_{+}} x^{2} + (\frac{1}{2} + \frac{T_{+}}{2 k_{+}}) x + (- \frac{T_{+}^{2}}{4 k_{+}} + \frac{T_{+}}{2} - \frac{k_{+}}{4}) \\ P_{-} (x) & = \frac{1}{4 k_{-}} x^{2} + (\frac{1}{2} - \frac{T_{-}}{2 k_{-}}) x + (\frac{T_{-}^{2}}{4 k_{-}} + \frac{T_{-}}{2} + \frac{k_{-}}{4}) \\ s (x) & = {\begin{cases} T_{-}, & x \leq T_{-} - k_{-} \\ P_{-} (x), & T_{-} - k_{-} \leq x \leq T_{-} + k_{-} \\ x, & T_{-} + k_{-} \leq x \leq T_{+} - k_{+} \\ P_{+} (x), & T_{+} - k_{+} \leq x \leq T_{+} + k_{+} \\ T_{+}, & T_{+} + k_{+} \leq x, \end{cases} \end{aligned}

$T_- < 0,$ $T_+ > 0,$ $k_- \in (0, -T_-],$ $k_+ \in (0, T_+].$ Symmetric hard and soft clippers are simpler:

\begin{aligned} h (x) & = {\begin{cases} x, & | x | \leq T \\ T sign (x), & | x | \geq T \end{cases} \\ P (x) & = - \frac{1}{4 k} x^{2} + (\frac{1}{2} + \frac{T}{2 k}) x + (- \frac{T^{2}}{4 k} + \frac{T}{2} - \frac{k}{4}) \\ s (x) & = {\begin{cases} x, & | x | \leq T - k \\ sign (x) P (| x |), & T - k \leq | x | \leq T + k \\ T sign (x), & T + k \leq | x |, \end{cases} \end{aligned}

$T > 0$ $k \in (0, T].$

Normalization

$\eqref{a}$ $\eqref{b}$ will be

\begin{aligned} a_{n} & = \frac{2}{T} \int_{0}^{T} f (A_{in} \sin (\frac{2 π}{T} t)) \cos (\frac{2 π}{T} n t) d t \\ (12) & b_{n} & = \frac{2}{T} \int_{0}^{T} f (A_{in} \sin (\frac{2 π}{T} t)) \sin (\frac{2 π}{T} n t) d t, \end{aligned}

$a_n = 0$ and a simplified THD of

{THD}_{f} = \frac{\sqrt{\sum_{n = 2}^{\infty} b_{n}^{2}}}{b_{1}} .

It should also be noted that symmetric clippers will not produce any even harmonics⁶. This is not necessarily important for our analysis, but it is useful to know to decrease computation time when applicable.

$\text{arcsinh}$ $\text{arcsinh}$ has more noticeable clipping threshold, which is described well by the extreme of the second derivative. We will be focusing on more modest input gains (e.g. potentially used by mixing engineers and semi-clean electric guitar tones) in this study. Softness anyway has a more profound effect with lower gains since higher gains transform any periodic input to an approximated square wave.

The output gain controls the overall volume. It does not change the clipping characteristics, but the hardness (the second derivative) is directly proportional to it, so it must be normalized. It will be normalized by total power of the clipping function given some signal, which is commonly described by root mean square (RMS):

RMS = \sqrt{\frac{1}{t_{1} - t_{0}} \int_{t_{0}}^{t_{1}} x (t)^{2} d x} .

standard deviation $\sigma$ instead, which subtracts the DC from the signal. The standard deviation is defined as

\begin{matrix} (13) & σ = \sqrt{\frac{1}{t_{1} - t_{0}} \int_{t_{0}}^{t_{1}} (x (t) - μ)^{2} d x}, \end{matrix}

where

μ = \frac{1}{t_{1} - t_{0}} \int_{t_{0}}^{t_{1}} x (t) d x

mean $\mu = 0.$ $\text{RMS} = \sigma.$

$\sin(t) \in [-1, 1]$ $f$ $[-1, 1]$ $\arctan(x)$ $\arctan(h(x)).$ $[-1, 1],$ then (depending on THD normalization value) these clippers might result in an identical output gain normalization value. The hard clipping on the latter clipper would be completely ignored.

We need a signal that is roughly an average of all signals in some sense. A good candidate could be Gaussian noise, which has a probability mass function that follows the normal distribution. The probability mass describes how likely it is for a sample to get a specific value⁷. This is important for us, because we need heuristics to determine how our clipping function would transform any given input values to output values on average. Since we cannot know our input signals, a probabilistic approach seems appropriate.

Gaussian noise does have one huge issue for practical measurements: we would need to generate a huge amount of samples of it in order to converge. This could be done using any pseudo-random number generator with uniform distribution and Box-Muller transform⁸, but it would require huge amount of processing before it gets useful due to reliance of statistical convergence. Luckily, we don't need Gaussian noise, we just need anything that will give us a similar probability mass. It turns out that sampling the quantile function of any given distribution enough times at regular intervals will yield the corresponding probability mass⁹. This means that as our signal we can use the quantile function of a Gaussian called the probit function, which can be computed using

probit (t) = \sqrt{2} {erf}^{- 1} (2 t - 1),

$t \in (0, 1)$ $\text{erf}^{-1}$ $\eqref{stddiv}$ , we get our output gain normalization:

\begin{aligned} (14) & σ_{f} & = \sqrt{\int_{0}^{1} f (A_{in} probit (t))^{2} d t} \\ (15) & A_{out} & = \frac{1}{σ_{f}} \end{aligned}

$\text{probit}$ $\lim_{t\to 0} \text{probit}(t) = -\infty$ $\lim_{t\to 1} \text{probit}(t) = \infty.$ This means that the clipping function will considered in it's full domain (unlike when normalizing input gain), which is especially useful for composed clippers.

The Blunter

There exists a symmetric clipper that is softer than any other symmetric clipper for a range of THD normalization values, which we will call the Blunter. We will be referring to it quite a lot, so it worth defining and naming it. The unnormalized Blunter is defined as

\begin{aligned} B (x) & = {\begin{cases} 2 x - | x | x, & | x | \leq 1 \\ sign (x), & | x | \geq 1 \end{cases} \\ = {\begin{cases} - 1, & x \leq - 1 \\ 2 x + x^{2}, & - 1 \leq x \leq 0 \\ 2 x - x^{2}, & 0 \leq x \leq 1 \\ 1, & 1 \leq x . \end{cases} \end{aligned}

The normalized Blunter is

B_{1} (x) = A_{B out} B (A_{B in} x) .

$k = T$ $\eqref{hardness}$ $|(x - 2x^2)''| = 2,$ the hardness of the Blunter is

H_{B} = 2 A_{B out} A_{B in}^{2} .

$H_B$ would most certainly indicate that either the input gain, the output gain, or both are unnormalized. This is a bit difficult to prove analytically, but it can be experimentally shown that Blunter is indeed the softest clipper.

Finding the Softest Clipper

We provide a repository¹⁰ that contains code for multiple experiments and tests for this study. The main experiment in src/smoothest.c generates all potential symmetric clipping functions with given precision BASE. For each of the generated functions, a normalized input gain and output gain is calculated to finally find the hardness of the function. Finally, the function with minimum hardness is found. Names f and f_* refer to clipping function lookup-tables.

Counter

An algorithm was developed that generates all potential symmetric clipping functions given a discrete precision of BASE. The basic idea is based on a counter: take BASEBASE $O(\text{n}^\text{n}),$ $n = \text{BASE},$ so we need to find a way of skipping as many counts as possible. Our counting algorithm exploits clippers monotonicity and unimodal derivative to reduce counts.

An important concept for our algorithm is flushing. It is demonstrated in Figure 2 and can be described as follows: Knowing that the first derivative of a valid clipping function must be greater than or equal to zero, each time we increment a digit in the middle to a value that is greater than the digit on it's right side, we can duplicate the incremented digit to all of the digits on the right side (flush). Of course, a real counter would only increment digits in the middle when carrying, but the concept of flushing is important for us.

(a) Cannot increment just this one.

(b) Have to increment all these.

Figure 2: This could be the output of the counter at some point when counting in BASE=10. (a) shows how increasing a value in the middle breaks monotonicity, so we have to add one to each point on the right as well, which will give us (b).

Since the counter only generates the positive side, it's output has to be duplicated to the negative side. In our implementation, the actual duplication is done later in the processing chain, but our counter has to take it into account anyway. To preserve symmetry on duplication, the digit at index zero must be fixed to zero. Also, given unimodal derivative, the derivative at index zero must be non-zero. This means that the counter must count from index one, and we count each digit from one to BASE instead of zero to BASE-1 like a regular counter. Given these constraints, our counting algorithm can be described as follows:

Initialize the counter: the digit at index zero shall be set to zero, the rest shall be set to one.
Knowing that the first derivative is unimodal, we can deduce that the first derivative is also monotonically decreasing on the positive side, so we can skip all increments from the right that would increase the derivative from zero to one. Flush from the next digit on the right of the first digit with non-zero derivative.

(a) Cannot increment just this one.

(b) Have to increment all these.

Figure 3: To preserve monotonicity of the derivative, one must increment from the leftmost point where the derivative is zero.

The next few functions can be obtained by incrementing and flushing the next digits on the right by using reasoning from Step 2. The index for flushed digit can be cached to keep track where to flush next. Increment and flush until the digit equals BASE or until the index equals BASE.

Figure 4: Step 3 applies Step 2 repeatedly. Here we show how Step 3 is applied nine times to produce the first nine clippers of the sequence.

Since we are conceptually counting numbers here, once a to-be-incremented digit equals BASE, the digit from the left must be incremented. However, the first derivative is known to be decreasing, we can only increment the leftmost digit of a segment with same derivative—increasing any digit on the right side would increase the derivative, so find where the derivative changes and increment that. Normally when counting, incrementing a digit would zero all digits on the right side, but this would break monotonicity, so flush instead.

(a) Cannot increment this one.

(b) Have to increment from the left and flush.

Figure 5: Since we cannot increment the point shown in (a), we must find where the derivative changes. In this case, the leftmost point with the same derivative is the next one on the left, so increment/flush from there.

If the first digit equals BASE, then we are done. Otherwise, go to Step 2.

Code for generating next function in sequence is called f_next(), which can be found in src/shared.h. It has been verified to find all valid function tables by comparing it's generated sequence of function tables to the sequence of function tables generated by a naïve counter.

The precision of the generated tables is horrific at this point. Not only our lookup table consists of small integers, but the derivative also decreases in discrete steps. As seen in Figure 6, This means that the second derivative would consist of large spikes at these steps and zeros otherwise, so we must smooth out the steps.

(a) Generated arctan

(b) Generated derivative

(c) Generated second derivative

Figure 6: The generated unnormalized clipper (linearly interpolated) that most closely matched arctan (after filtering) in BASE=40. The counter's logic using discrete derivatives is clearly visible from the discrete steps seen in (b) and even in (a) to some extent. Taking the finite difference once more yields to a completely unusable second derivative seen in (c).

Filter

To smooth out the discrete steps, we had to process the clipper lookup table with a smoothing filter. Before filtering, it is important to have the duplication of the positive side to the negative side done. The filtering had three major requirements:

Good step response: the filter should preserve the clippers overall shape (no ringing).
Good stopband attenuation: the derivatives are extremely sensitive to high frequencies.
Zero-phase: f[0] must stay at zero. This is only possible after duplicating positive side to negative side.

$a_0 = 1/2$ $b_1 = 1/2$ . Being somewhat heavy handed with the filtering was justified by the fact than any clipper with hard edges could not be the softest, although we must be careful not to filter too much: this would make all generated clippers identical. Being single pole, the overall shape of the clipper was preserved well, and the multiple filters in series gave reasonably good attenuation at high frequencies. To keep it zero phase, the clipper would be duplicated, then both duplicates would be filtered, the first one from right to left, the other one from left to right. Then results were added together for the final zero-phase result. As an added bonus, being IIR, the infinite step response allows generating very good converging clippers.

Any low-pass filter will ruin the first samples it processes, so we had to extrapolate our clippers. We chose to do quadratic extrapolation by finding the first and second differing samples from the edges to estimate first and second derivatives at the edges. This implicitly assumes non-zero first derivative, which unfortunately slightly reduced the generators capability to generate bounded clippers (a flat tail extrapolated to non-flat), but it considerably improved it's capability to generate converging and diverging clippers, so it was worth it.

$\text{BASE}=40$ $\arctan$ $\arctan''$ well enough. The clipper's data will not be modified further. We expect that comparing millions of generated functions will give us statistically correct results despite the spike-noise.

(a) Generated arctan

(b) Generated derivative

(c) Generated second derivative

Figure 7: The generated clipper after filtering, linearly interpolated, that most closely matched arctan in BASE=40. Effects of the discrete derivatives are clearly significantly reduced.

Lookups

$\sin(t) \in \R$ $\text{probit}(t) \in \R,$ $\text{probit}$ ) could also get go out of bounds, so extrapolation was important as well. Linear extrapolation was not enough: it turned all clippers to diverging ones. Quadratic extrapolation more closely followed the overall shape the clipper. However, the parabola opening downwards on the positive side (and vice versa on the negative side) would break monotonicity if input amplitude exceeds the tip of the parabola. To fix this, we simply clamped the output beyond the tips.

Normalization

$A_{B\text{in}} \approx 1,$ which was about 2.22559 %. This is the upper limit where Blunter should be the softest. Lower values are also expected to make Blunter the smoothest, because in this range the sinusoid used for input gain normalization fully fits the polynomial section (no hard limiting). Higher values would favor diverging clippers since they have no hard limits. We observed that secant method gave us really good approximations very fast (less than three iterations on average). The Fourier series for the THD calculations were calculated using fixed point integers and pre-calculated tables of sines of frequencies of harmonics to improve computation speed.

$\text{THD} \approx 0,$ $A_\text{in} \approx 0.3.$ Having our first evaluated value at the first guess and the average minimum point, we could create a secant line between the observed point and the minimum point to get the next estimate.

(a) Zoomed out

(b) Zoomed in

Figure 8: THD as a function of input gain (in 0.1 sized steps). The wider view of (a) confirms that it is at least somewhat safe to consider THD to be monotonically increasing, so secant method is likely to not fail often due to local extrema. (b) shows that most generated clippers have a more or less linear region when input gain is roughly below 0.3, which we can use to estimate the second guess for the secant method.

$\eqref{f_rms}$ $\eqref{out_gain}$ . The code uses rms and ignores DC since symmetric clippers does not produce DC.

$\text{BASE}=40$ $\text{arcsinh},$ a scaled and shifted logistic function, and of course the Blunter. The mean of all absolute differences was measured to be 0.1236 %, which will greatly improve further when increasing BASE.

Hardness

$\eqref{hardness0}$ $A_\text{in} > 1,$ $A_\text{in} > 2,$ $\eqref{hardness}.$

Results

$\text{BASE} = 100,$ $B(1) = 1$ ). This confirms to a reasonable accuracy the softest function generated does in fact represent the Blunter.

Hard coded Blunter's precise softness was measured to be approximately 0.405966, which is considerably higher than what was measured from the generated function. This is expected: our generator generated the functions based on low precision discrete derivatives and second derivatives. Even after filtering, these discrete steps would still show in the generated function as spikes, as seen before in Figure 6 and Figure 7. However, since Blunter has a constant second derivative, we expect to have many spikes in the second derivative that are spread out as evenly as possible. Figure 9 (c) shows the second derivative of the generated function, where you can clearly see these spikes. While they are indeed very evenly spread out, they will nonetheless increase the measured hardness.

(a) Softest generated clipper

(b) Generated derivative

(c) Generated second derivative

Figure 9: Softest generated clipper and it's derivatives. Since it was expected to match the Blunter, the first derivative was expected to have a somewhat linear section, and the second derivative was expected to have a somewhat constant section, which is what we observed (albeit a bit noisy).

$f[0],$ which is what we also see in the Figure 9 (c). Also, the IIR filtering would somewhat gradually decrease the magnitude of the second derivative to zero at the end of the generated domain. It should be noted that the second derivative is very sensitive to these sort of inaccuracies, but we got a reasonably good quality second derivatives and a very precise result anyway.

Comparing Hardness to WTHD Hardness

$\eqref{wthdf}$ $b_n$ $\alpha,$ $\eqref{b_fn}$ $\eqref{b_n}$ .

$\alpha$ $\alpha = 1/32$ $\alpha = 1$ and computing the differences between adjacent HFC samples. Knowing that the 32 added Fourier series calculations would increase the computation time significantly, we decreased BASE to 80.

We also wrote another program that compares how the second derivative based hardness relates to WTHD hardness. This was to make sure that these significantly different definitions of hardness do indeed measure roughly the same thing. It must be noted that the definitions are quite different, so some discrepancy is expected.

Results

A total of 123 223 638 functions were analyzed. The softest function was found at index 3 235 483 with a WTHD hardness of 0.288513. Again, this function was compared with the Blunter. Now the mean of the absolute differences between these functions was 0.0196245, which is 1.57308 % relative to normalized Blunter's highest value, so even with this very different definition of hardness, we still found the softest to be remarkably close to the Blunter.

$\text{BASE} = 60,$ $\alpha$ would also cause significant noise. Furthermore, the definitions of hardness and WTHD hardness are based on fundamentally differing principles. But despite the large amounts noise and the differing hardness definitions, it is still clear that they are somewhat closely related. Clearly increasing hardness increases WTHD hardness on average.

$\eqref{qsoft}$ with knee size varying from 0.01 to 1 in 0.01 sized steps. As we can see, in the case of the quadratic soft clipper, both hardness definitions match extremely precisely. This emphasizes that the relationship between the definitions is real.

(a) WTHD hardness vs hardness for practically all clippers

Hardness vs WTHD Hardness on soft clipper

(b) WTHD softness vs softness for quadratic soft clippers with varying knee size

Figure 10: Comparison of our different hardness definitions. WTHD hardness is used on y-axis, hardness is used on x-axis.

Future Work

While it is expected that the constant second derivative makes the Blunter the softest for all THD normalization values below ours, our experiment only showed that this is the case in the upper limit. It is also expected that the Blunter is the softest clipper when including asymmetric clippers (if it is the softest on one side, why would the other one be any different?), but again, our experiment ignored those to keep computation times sensible. More experiments with different THD normalization values and asymmetric clippers are needed.

$\max$ $\max$ ignores all extrema other than the one with the highest magnitude. While we think that our assumption for the simplification is reasonable, it needs to be verified and potentially refined.

Before implementing the main experiment, we did some rough subjective tests to see if it is worth to conduct the study to begin with. We found that the second derivative is potentially audible, so we moved further with the study. However, those preliminary tests were not rigorous at all, they were only conducted to make sure that there is anything meaningful to be studied in softness to begin with. This is why these preliminary tests were not discussed in this study. Much more rigorous subjective tests are needed to confirm if either, both, or neither softness definitions actually matches perceived softness in any way.

Conclusion

We presented a method to quantify and analyze clipping softness to address the lack of work that solely focus on clipping softness. We defined clipping hardness and softness mathematically and used the definition to analyze hard clipper and verified that it has zero softness following intuition. We then discussed how input and output gains are normalized in detail to enable meaningful comparisons of clippers. We also presented the Blunter, a quadratic soft clipper, which we claimed to be the softest clipper given our model. The claim was backed with an experiment that showed that if we generate all potential clippers and find the softest one, the generated softest clipper will in fact be the Blunter. Finally, we showed that hardness is related to WTHD hardness.

References

1 Wilson, A., and B. M. Fazenda. ‘Profiling the Distortion Characteristics of Commercial Music Using Amplitude Distribution Statistics’, 2015. ↩ ↩

2 AUDIO CLIPPING DETECTION. Patent, issued August 2014. https://www.freepatentsonline.com/y2014/0226829.html. ↩

3 Wilson, Alex, and Bruno Fazenda. ‘Characterisation of Distortion Profiles in Relation to Audio Quality’, 09 2014. https://www.dafx14.fau.de/papers/dafx14_alex_wilson_categorisation_of_distort.pdf. ↩

4 Tan, Chin-Tuan, Brian Moore, and Nick Zacharov. ‘The Effect of Nonlinear Distortion on Perceived Quality of Music and Speech Signals’. Journal of the Audio Engineering Society 51 (11 2003): 1012–31. https://aes2.org/publications/elibrary-page/?id=12197 ↩ ↩

5 "Klon Centaur Analysis." https://www.electrosmash.com/klon-centaur-analysis ↩

6 Iii, Julius O. Smith. Physical Audio Signal Processing: For Virtual Musical Instruments and Digital Audio Effects. W3K Publishing, 12 2010. https://www.dsprelated.com/freebooks/pasp/Nonlinear_Distortion.html. ↩

7 Smith, Steven W. ‘The Scientist and Engineer’s Guide to Digital Signal Processing’, First., 19–23. California Technical Publishing, 1997. https://www.dspguide.com/ch2/4.htm. ↩

8 Box, George E. P., and Mervin E. Muller. ‘A Note on the Generation of Random Normal Deviates’. Annals of Mathematical Statistics 29 (1958): 610–11. https://api.semanticscholar.org/CorpusID:119971394. ↩

9 Metex. ‘Approximations Of The Inverse Error Function’. MIMIR GAMES, 6 2017. https://www.mimirgames.com/articles/programming/approximations-of-the-inverse-error-function/. ↩

10 Lauri Lorenzo Fiestas, . "Soft Clipper Analysis." (2026). https://github.com/PrinssiFiestas/soft-clipper-analysis ↩

Lauri Lorenzo Fiestas, March 2026