endolith · October 18, 2024 13:16 · tmbouman · Mar 14, 2021 · endolith · Mar 14, 2021
diff --git a/readme.txt b/readme.txt
 Somewhat crude THD+N calculator in Python

 Measures the total harmonic distortion plus noise (THD+N) for a given input 
 signal, by guessing the fundamental frequency (finding the peak in the FFT), 
 and notching it out in the frequency domain.

 Depends on Audiolab and SciPy
 * http://www.ar.media.kyoto-u.ac.jp/members/david/softwares/audiolab/
 * http://www.scipy.org/

 According to the never-wrong Wikipedia:
 * THD is the fundamental alone vs the harmonics alone
 * THD+N is the entire signal (not just the fundamental) vs the entire signal 
 with the fundamental notched out.  (For low distortion, the difference between 
 the entire signal and the fundamental is negligible.)
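
As a rough illustration of the THD+N definition above, here is a minimal sketch on a synthetic signal. The function name `thdn_ratio` and the `notch_halfwidth` parameter are made up for this example and are not part of the script. The test tone is chosen to fall exactly on an FFT bin, so no window is needed here.

```python
import numpy as np

def thdn_ratio(signal, notch_halfwidth=2):
    """RMS of the signal with the fundamental notched out, divided by the
    RMS of the entire signal (hypothetical helper, for illustration only)."""
    signal = signal - np.mean(signal)            # remove DC
    spectrum = np.fft.rfft(signal)
    peak = int(np.argmax(np.abs(spectrum)))      # guess the fundamental
    lo = max(peak - notch_halfwidth, 0)
    spectrum[lo:peak + notch_halfwidth + 1] = 0  # notch it out
    residual = np.fft.irfft(spectrum, n=len(signal))
    return np.sqrt(np.mean(residual**2)) / np.sqrt(np.mean(signal**2))

fs = 48000
n = 48000                                        # 1 Hz per FFT bin
t = np.arange(n) / fs
# 1 kHz fundamental plus a third harmonic at 1% of its amplitude
x = np.sin(2 * np.pi * 1000 * t) + 0.01 * np.sin(2 * np.pi * 3000 * t)

ratio = thdn_ratio(x)
print("THD+N: %.4f%% or %.1f dB" % (ratio * 100, 20 * np.log10(ratio)))
```

With a 1% harmonic and no added noise the result should come out very close to 1%, or about -40 dB.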

 Example of usage, with 997 Hz full-scale sine wave generated by Adobe Audition 
 at 96 kHz, showing the 16-bit quantization distortion:

 > python thdncalculator.py "perfect 997 Hz no dither.flac"
 Frequency:	997.000000 Hz
 THD+N:  	0.0016% or -96.1 dB

 Is this right?  Theoretical SNR of a FS sine is 1.761 + 6.0206*16 = 98.09 dB, 
 so the quantization noise should sit at -98.09 dB.  Close, at least.
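
The usual rule of thumb for an ideal N-bit quantizer driven by a full-scale sine is SNR = 6.02*N + 1.76 dB. A small sketch of that arithmetic follows; the function name is made up for this example.

```python
import math

def ideal_quantization_snr_db(bits):
    """SNR of a full-scale sine through an ideal quantizer, in dB.

    Equivalent to 20*log10(2**bits * sqrt(1.5)), i.e. about 6.02*bits + 1.76.
    """
    return 20 * math.log10(2 ** bits) + 10 * math.log10(1.5)

for bits in (8, 16, 24):
    print("%2d-bit: %.2f dB" % (bits, ideal_quantization_snr_db(bits)))
```

The measured THD+N should be roughly the negative of this figure when quantization noise is the only impairment.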

 The primary problem with the current script is that I don't know how much of 
 the region surrounding the peak to throw away.  Probably the way to match 
 other test equipment is to notch out a fixed bandwidth around the peak, 
 but is that really ideal?

 width = 50
 f[i-width: i+width+1] = 0

 Instead of a fixed width, it currently just tries to find the nearest local 
 minima and throw away everything between them.  It works for almost all cases, 
 but on peaks with wider "skirts", it stops at the first notch in the skirt.  
 Should the rest of the skirt be considered part of the peak or part of the 
 noise?
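
To make the local-minima search concrete, here is a small sketch on two hand-made magnitude arrays. The name `find_range_sketch` and the arrays are invented for this example; the search is bounded so it cannot run off either end of the array.

```python
import numpy as np

def find_range_sketch(mag, peak):
    """Return (lower, upper) so that mag[lower:upper] covers the peak and
    excludes the nearest local minimum on each side."""
    lower, upper = 0, len(mag) - 1        # fallbacks if no minimum is found
    for i in range(peak + 1, len(mag) - 1):
        if mag[i + 1] >= mag[i]:          # spectrum starts rising again
            upper = i
            break
    for i in range(peak - 1, 0, -1):
        if mag[i] <= mag[i - 1]:          # spectrum rises again to the left
            lower = i + 1
            break
    return lower, upper

# A narrow, clean peak, and a peak whose skirt has a dip on each side
clean = np.array([1, 2, 1, 3, 9, 4, 2, 3, 1], dtype=float)
skirt = np.array([1, 3, 5, 4, 6, 9, 6, 4, 5, 3, 1], dtype=float)

print(find_range_sketch(clean, int(np.argmax(clean))))
print(find_range_sketch(skirt, int(np.argmax(skirt))))
```

In the second array the search stops at the dips at indices 3 and 7, so the outer skirt bins at indices 2 and 8 are left in and get counted as noise. That is the "stuck at a notch" behaviour described above.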

 Also it computes the FFT of the entire file, which is a waste of time for long 
 recordings.  Use short samples.
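
One way to keep the FFT short is to analyze only an excerpt of the recording. This is a sketch under assumptions: the helper name `middle_excerpt` and the `seconds` parameter are hypothetical, and taking the excerpt from the middle of the file is just one reasonable choice.

```python
import numpy as np

def middle_excerpt(signal, fs, seconds=1.0):
    """Return at most `seconds` of audio taken from the middle of `signal`."""
    n = min(len(signal), int(round(seconds * fs)))
    start = (len(signal) - n) // 2
    return signal[start:start + n]

fs = 96000
t = np.arange(10 * fs) / fs               # ten seconds of a 997 Hz tone
signal = np.sin(2 * np.pi * 997 * t)

excerpt = middle_excerpt(signal, fs, seconds=1.0)
print("samples: %d, bin spacing: %.2f Hz" % (len(excerpt), fs / len(excerpt)))
```

The trade-off is frequency resolution: the bin spacing is fs divided by the excerpt length, so a shorter excerpt gives a wider main lobe around the fundamental.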

 Adobe Audition with dither (THD+N in dB):
 997 Hz 8-bit    -49.8
 997 Hz 16-bit   -93.4
 997 Hz 32-bit   -143.9

 Art Ludwig's Sound Files (http://members.cox.net/artludwig/):
 File                Claimed  Measured  (dB)
 Reference           0.0%     0.0022%   -93.3
 Single-ended triode 5.0%     5.06%     -25.9
 Solid state         0.5%     0.51%     -45.8
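
The tables quote THD+N both as a percentage and in dB. The two are related by dB = 20*log10(percent/100); a short sketch of the conversion, with helper names made up for this example:

```python
import math

def percent_to_db(percent):
    """Convert an amplitude ratio given in percent to decibels."""
    return 20 * math.log10(percent / 100.0)

def db_to_percent(db):
    """Convert decibels back to an amplitude ratio in percent."""
    return 100.0 * 10 ** (db / 20.0)

print("%.1f dB" % percent_to_db(5.06))
print("%.1f dB" % percent_to_db(0.51))
print("%.4f%%" % db_to_percent(-96.1))
```

Small disagreements between the two columns of a table can come from the percentage having been rounded before conversion.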

 Comparing a test device on an Audio Precision vs recorded into my 24-bit sound 
 card with this script:

 Freq (Hz)   AP THD+N    Script THD+N
 40          1.00%       1.04%
 100         0.15%       0.19%
 100         0.15%       0.14%
 140         0.15%       0.17%
 440         0.056%      0.057%
 961         0.062%      0.067%
 1021        0.080%      0.082%
 1440        0.042%      0.041%
 1483        0.15%       0.15%
 4440        0.048%      0.046%
 9974        7.1%        7.8%
 10036       0.051%      0.068%
 10723       8.2%        9.3%
 13640       12.2%       16.8%
 19998       20.2%       56.3%
 20044       0.22%       0.30%

 So it's mostly accurate.   Mostly.
diff --git a/thdncalculator.py b/thdncalculator.py
 from __future__ import division
 import sys
 from scikits.audiolab import flacread
 from scipy.signal import blackmanharris
 from numpy.fft import rfft, irfft
 from numpy import argmax, sqrt, mean, absolute, arange, log10

 def rms_flat(a):
    """
    Return the root mean square of all the elements of *a*, flattened out.
    
    """
    return sqrt(mean(absolute(a)**2))

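 def find_range(f, x):
    """Find range between nearest local minima from peak at index x
    
    Falls back to the ends of the array if no local minimum is found.
    """
    # Defaults, so the names are always bound even without a local minimum
    lowermin = 0
    uppermin = len(f) - 1
    # Stop one short of the end so that f[i+1] stays in bounds
    for i in arange(x+1, len(f)-1):
        if f[i+1] >= f[i]:
            uppermin = i
            break
    for i in arange(x-1, 0, -1):
        if f[i] <= f[i-1]:
            lowermin = i + 1
            break
    return (lowermin, uppermin)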

 filename = sys.argv[1]
 signal, fs, enc = flacread(filename)

 # Get rid of DC and window the signal
 signal -= mean(signal)
 windowed = signal * blackmanharris(len(signal))

 # Measure the total signal before filtering but after windowing
 total_rms = rms_flat(windowed)

 # Find the peak of the frequency spectrum (fundamental frequency), and filter 
 # the signal by throwing away values between the nearest local minima
 f = rfft(windowed)
 i = argmax(abs(f))
 print('Frequency: %f Hz' % (fs * (i / len(windowed))))
 lowermin, uppermin = find_range(abs(f), i)
 f[lowermin: uppermin] = 0

 # Transform noise back into the signal domain and measure it
 # Could probably calculate the RMS directly in the frequency domain instead
 noise = irfft(f)
 THDN = rms_flat(noise) / total_rms
 print("THD+N:     %.4f%% or %.1f dB" % (THDN * 100, 20 * log10(THDN)))
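
The comment about calculating the RMS directly in the frequency domain can be sketched with Parseval's theorem. This is an illustration, not part of the script; the function name `rms_from_rfft` is made up, and it assumes the spectrum came from `rfft` of a real signal of known length `n`.

```python
import numpy as np

def rms_from_rfft(spectrum, n):
    """RMS of the length-n real signal whose rfft is `spectrum`.

    rfft keeps only the non-negative frequencies, so every bin counts
    twice except DC and, for even n, the Nyquist bin.
    """
    power = np.abs(spectrum) ** 2
    total = power[0] + 2 * np.sum(power[1:])
    if n % 2 == 0:
        total -= power[-1]                # Nyquist bin appears only once
    return np.sqrt(total) / n

rng = np.random.default_rng(0)
for n in (1000, 1001):
    x = rng.standard_normal(n)
    time_rms = np.sqrt(np.mean(x ** 2))
    freq_rms = rms_from_rfft(np.fft.rfft(x), n)
    print(n, np.isclose(time_rms, freq_rms))
```

Used this way, the noise RMS could be taken from the notched spectrum without calling `irfft` at all.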