Figure 1. Scanned document |
A part of the scanned document was scanned. Since the image was slightly tilted, it was corrected using GIMP. This correction is necessary for further steps to be done. The cropped image was converted to grayscale. These are shown Figure 2.
Figure 2. Cropped image and its grayscale format
To remove the horizontal lines, a filter was applied. The Fourier transform of the image was first obtained. The filter was done by blocking the high frequencies in the Fourier transform.The FT of the image was then multiplied to the filter. Figure 3 shows the Fourier transform and the filter that was made (using GIMP).
Figure 3. Fourier transform of the cropped image and the filter used
The resulting image is shown in Figure 4. It can be seen that the horizontal lines are now removed. However, the lines that intersected with the letters were also removed. Thus, the letters seems to be cut into half. :( Figure 5 shows the binarized image.
Figure 4. The resulting filtered image |
Figure 5. Binarized image |
Figure 6. Dilated image |
Figure 7. Skeletonized(thinned) image |
It can be seen that some of the letter can be easily read now. :)
Reference:
[1] . Maricor Soriano, "A8 - Applications of Morphological Operations 1 of 3: Preprocessing Text", Applied Physics 186 Manual, 2012
Self-evaluation:
I admit I submitted this later due to SPP and my own research and experiments, but I would still give myself a 10. I was able to produce the necessary output and I was able to use concepts learned from previous activities.
No comments:
Post a Comment