This is an old revision of the document!
Morphological Image Analysis – Principles and Applications, Pierre Soille
The first question that may arise when we probe a set with a structuring element is “Does the structuring element fit the set?” The eroded set is the locus of points where the answer to this question is affirmative.
The eroded value at a given pixel $x$ is the minimum value of the image in the window defined by the structuring element when its origin is at $x$: $$[\varepsilon_{B}(f)](x) = \min_{b \in B} f(x+b)$$
81, 82
The dilation is the dual operator of the erosion and is based on the following question: “Does the structuring element hit the set?” The dilated set is the locus of points where the answer to this question is affirmative.
the dilated value at a given pixel $x$ is the maximum value of the image in the window defined by the structuring element when its origin is at $x$: $$[\delta{B}(f)](x) = \max_{b \in B} f(x+b)$$
83, 84
Only symmetric structuring elements containing their origin are considered. By doing so, we make sure that the arithmetic difference is always nonnegative.
The choice between internal or external gradient depends on the geometry and relative brightness of the objects to be extracted. For instance, an external gradient applied to a two or one pixel thick dark structure will provide a thin edge following the structure whereas an internal gradient will output a double edge (one on each side of the structure).
102
If the size of the SE is greater than 1, morphological gradients are referred
to as thick gradients: $$\rho_{nB} = \delta_{nB} - \varepsilon_{nB}$$.
Thick gradients give the maximum variation of the function in a neighbourhood
of size $n$. If the size $n$ equals the width $e$ of the transition between regions of homogeneous grey level, the thick gradient will output the contrast value $h$ between these regions. These gradients are therefore recommended when the transitions between objects are smooth. However, thick gradients output thick edges.
88
A combination of thick gradients of increasing size avoiding thick edges is presented in Sec. 4.6.
The opening $\gamma$ of an image $f$ by a structuring element $B$ is denoted by $\gamma_{B}(f)$ and is defined as the erosion of $f$ by $B$ followed by the dilation with the reflected SE $B$: $$\gamma_{B}(f) = \delta_{\check{B}}[\varepsilon_{B}(f)]$$
It is essential to consider the reflected SE for the dilation. Indeed, an erosion corresponds to an intersection of translations. It follows that a union of translations in the opposite direction (i.e., a dilation by the reflected SE) must be considered when attempting to recover the original image.
Geometric formulation in terms of SE fit using the question already introduced for the erosions: “Does the structuring element fit the set?” Each time the answer to this question is affirmative, the whole SE must be kept (for the erosion, it is the origin of the SE that is kept).
120, 121
The shape and size of the structuring element must be set according to the image structures that are to be extracted. For instance, if we are interested in removing all elongated objects while keeping disc shaped objects, the appropriate structuring element is a disc having a diameter larger than the width of the elongated objects.
The closing of an image $f$ by a structuring element $B$ is denoted by $\phi_{B}(f)$ and is defined as the dilation of $f$ with a structuring element $B$ followed by the erosion with the reflected structuring element $\check{B}$: $$\phi_{B}(f) = \varepsilon_{\check{B}}[\delta_{B}(f)]$$
Using set formalism, we have the following question for defining a closing: “Does the BE fit the background of the set?” If yes, then all points of the SE belong to the complement of the closing of the set.
123, 124
area opening – Removing all connected components whose area in number of pixels is smaller than a given threshold value $\lambda$: $$\gamma_{\lambda} = \bigvee_{i} \{ \gamma_{B_i} \text{is connected and } \text{card}(B_i) \geq \lambda \}$$
area closing – dual to area opening: $$\gamma_{\lambda} = \bigwedge_{i} \{ \phi_{B_i} \text{is connected and } \text{card}(B_i) \geq \lambda \}$$
At least $\lambda$ pixels of the considered structuring element $B$ have to fit the foreground pixels (denoted as $\gamma_{B, \lambda}$)
It can be shown that the parametric opening is equivalent to the intersection (point-wise minimum operator $\wedge$) between the identity transformation and the dilation by $\check{B}$ of the rank filter $\zeta$ using $B$ as kernel and $n - \lambda + 1$ as rank (such an interpretation is easier to implement and performs much faster): $$\gamma_{B, \lambda} = id \wedge \delta_{\check{B}} \zeta_{B, n - \lambda + 1}$$
Similarly, for parametric closing: $$\phi_{B, \lambda} = id \vee \varepsilon_{\check{B}} \zeta_{B, \lambda}$$
Parametric openings and closings are very useful in practice because they are much more flexible and less sensitive to noise than the corresponding morphological openings and closings.
129
The annular opening of an image is defined as the intersection between the dilation of the image with a ring shaped SE and the original image: $\delta_{\bigcirc}(f) \wedge f$, where $\bigcirc$ is a ring-shaped structuring element. Since the ring SE does not contain its origin, the input image is not included in its dilation by the ring SE.
Annular openings are useful for extracting clusters in an image since isolated blobs are not covered by the dilation of other blobs.
129
119
Used to detect concavity regions.
The choice of a given morphological filter is driven by the available knowledge about the shape, size, and orientation of the structures we would like to filter. Morphological top-hats proceed a contrario. Indeed, the approach undertaken with top-hats consists in using knowledge about the shape characteristics that are not shared by the relevant image structures. An opening or closing with a SE that does not fit the relevant image structures is then used to remove them from the image. These structures are recovered through the arithmetic difference between the image and its opening or between the closing and the image.
It is sometimes easier to remove relevant image objects than trying to directly suppress the irrelevant objects.
WTH of an image $f$ is the difference between the original image $f$ and its opening $\gamma$: $$\text{WTH}(f) = f - \gamma(f)$$
Since the opening is an anti-extensive image transformation, the grey scale values of the white top-hat are always greater or equal to zero.
136
BTH of an image $f$ is the difference between its closing $\phi$ and the original image $f$: $$\text{BTH}(f) = \phi(f) - f$$
Owing to the extensivity property of the closing operator, the values of the black top-hat images are always greater or equal to zero.
137 (4.16, 4.17)
In situations where the input image is corrupted by a high frequency noise signal, it must be filtered out before using top-hat transforms to avoid side effects. For example, a closing by a small SE should be considered before computing a white top-hat and an opening before a black top-hat.
If the image objects have all the same local contrast, i.e., if they are either all darker or brighter than the background, top-hat transforms can be used for mitigating illumination gradients. Indeed, a top-hat with a large isotropic structuring element acts as a high-pass filter. As the illumination gradient lies within the low frequencies of the image, it is removed by the top-hat. White top-hats are used for dark backgrounds and black top-hats for bright backgrounds.
138
If the contrast between the objects and the background is decreasing when the background is darkening, a better visual rendering may be obtained by dividing the input image by the closing (or opening).
139
In quality control applications where a series of objects are acquired at a fixed position, another solution consists in first capturing an image without any object and then perform the point-wise division of further image captures with this background image.
A simple neighbourhood-based morphological contrast operator can be obtained by computing in parallel the white and black top-hat of the image. The white top-hat is then added to the original image to enhance bright objects and the black top-hat is subtracted from the resulting image to enhance dark objects. We denote this top-hat contrast operator by $\kappa^{\text{TH}}$: $$\kappa^{\text{TH}} = id + WTH_{B} - BTH_{B} = 3id - \phi_{B} - \gamma_{B}$$
The output values falling outside the dynamic range of the input image, i.e. $[t_{\min}, t_{\max}]$, are set to $t_{\min}$ or $t_{\max}$ depending on whether they fall below or above the dynamic range.
141
141 (cała sekcja)
144 (cała sekcja)
128