Feature | Importance |
---|---|
Number of peaks | 1.7676 |
Mean width of peaks | 1.8311 |
Mean height of peaks | 0.7818 |
Max. width of peaks | 1.9384 |
Max. height of peaks | 0.9907 |
Min. width of peaks | 1.8822 |
Min. height of peaks | 0.9474 |
There are several sources of background noise:
Discrete spectral interference
DSI (radio emissions)- sometimes, more than 100 radio stations can be identified in the raw signal
- these sources of DSI can be easily recognizable with
FFT
based on their modulation
Repetitive pulses interference
(power electronics)- represented by a corona discharge - hissing noise when standing below a high voltage transmission line is due to the corona discharge
Random pulses interference
RPI (lightning, switching operations, corona)Ambient
andamplifier
noise
- the biggest permanent source of DSI on the measured site was
radio transmitter
“Solec Kujawski”. Its carrier wave (225 kHz
) is clearly visible in almost all of acquired signals (see fig 1). false hit peaks
(most are corona discharges)
Perform basic univariate wavelet de-noising with:
Name of the parameter | Experts setting | SOMA’s range | Description |
---|---|---|---|
maxDistance (ticks) | 10 | <4,10> | distance that the symmetric peaks have to be within |
maxHeightRatio (%) | 0.25 | <0.05,0.5> | check if amplitudes of symmetric peaks exceeds this value) |
maxHeight (%) | 100 | <80, 140> | remove peaks with heights greater than this |
maxTicksRemoval | 500 | <50,500> | distance to remove after symmetric peaks |
Threshold coef. | 1 | (0,5> | used in univariate wavelet de-noising |
Mother wavelet | db4 | all members of the wavelet families | used in univariate wavelet de-noising |
Level of decomposition | 1 | {1,...6} | used in univariate wavelet de-noising |
- For removing coronas, each peak is compared to the next peak. If their distance in the signal is under a defined limit
maxDistance
, check if the signs of the peaks are opposite and the ratio of their amplitudes is higher than themaxHeightRatio
. Since the following oscillations can be misdetected as PD, then the peaks in the distancemaxTicksRemoval
behind the symmetric peak is cancelled. - Remove peaks with higher amplitude than the defined limit
maxHeight
- use a proper design of a representative subset (under-sampling method)
- the subset should contain all the kinds of class labels equally distributed, because there are various signals in both classes with various amounts of background noise. The background noise can cover the PD-pattern or form some false hit peaks, which should also be recognized by final processing. This phenomenon has to be reasonably represented in the chosen subset.
- Extraction of relevant parts of the signal by performing univariate discrete wavelet transform (DWT) de-noising (this suppresses most of the irrelevant small peaks so only the most significant peaks remain)
- Describe each peak with its starting index, amplitude, width
- Remove coronas and high peaks
- Calculate features
- Classification by Random Forest
28 feature columns and one class column for each signal:
- number of positive peaks
- number of negative peaks
- max width of peaks
- min width of peaks
- max amplitude of peaks
- min amplitude of peaks
- mean value of width of peaks
- mean value of amplitude of peaks
- four (one for each sinusoidal phase and one for all of them) added columns containing standard deviation of histograms of peaks positions, widths, and amplitudes
Search for Subset selection in thesis
- a rapidly decaying wavelike oscillation that has
0
mean transients
are the jagged edges in a signal- each scaled wavelet is shifted in time along the signal and compared with the original signal
- you can repeat this process for all the wavelet scales to get the coefficients as a function of wavelet
scale
andshift parameter
- a signal with 1,000 samples x 20 scales = 20,000 coefficients
- a wavelet with more
vanishing moments
is more complex p
vanishing moments -> polynomials up to the pth order will not be able to be identified by the wavelet
- output of wavelet transform ->
coefficients
- when you scale a wavelet by a factor of
2
, it decreases the frequency by half (anoctave
)
Mother Wavelet Examples:
- Refers to the process of
stretching
orshrinking
the wavelet in time
- scaling by
2
reduces the frequency byhalf
or by anoctave
- A stretched wavelet would capture lower frequencies whereas a squished wavelet can capture higher frequencies
- Denoising
- Compression of signals and images
- uses less coefficients (eliminates redundancy in coefficients)
- yields the same number of coefficents as the length of the signal
- scale:
2^j
(j = 1,2,3,4,...) - translation:
(2^j)*m
(m = 1,2,3,4,...) - D1 and A1 filters can reconstruct "subbanks" and cancel out aliasing from downsampling
*dwt
D
- detail coefficients
A
- approximation coefficients
- the
length
of the coefficients in each subbank ishalf
the number of the coefficients in the previous stage
- Time-Frequency Analysis
- Filtering of time-localized frequency components
- allows one to analyze the signal at
intermediary
scales between each octave (fine scale analysis) - these wavelets do NOT have
negative
frequency (easy for analysis) - needs more coefficients
- Perform a multi-level wavelet decomposition
- Identify a thresholding technique
- Threshold and reconstruct the signal
Does not represent abrupt transients efficiently, since it represents data as a sum of sine waves
which are not localized in time or space (oscillate forever)