Diploma work: Monthly report 8/97
Review August 1997
I did the following things during the last month:
Two examples: |
Detected Linestrokes in July: |
|
|
|
Original image masked: |
|
|
|
Resulting Linestrokes: |
|
|
|
Detected Dark Pixels: |
|
|
Preview September 1997
My goals for September 1997 are:
- The treating of Otsu still needs some improvement. The
problem is at the moment sometimes the limit of the
allowed variance and sometimes the resulting threshold is
too bright.
- Some character show problem to be detected well. E.g. the
letter "w" is problematic. I write this letter
like two combined "v". But after the first
downstroke of the "v" I start the upstroke to
finish the "v". But during this upstroke the
pen covers the new written ink. This ink is only detected
during the downstroke of the second "v" and is
nearly the same as the downstroke of the 2nd
"v". This results in a wrong order of the
written line segments of this letter. I dont see a
solution for this problem at the moment, because I think
Im missing data to detect and solve this problem.
Ill have to investigate it.
- The interpolation routine needs some improvements. At the
moment only gaps of 1 or 2 positions can be fixed. I will
try to fix gaps that can be larger. But there will be a
limit of the interpolated distance per missing position.
- My program has still a memory leak. Its smaller
than before but still has to be fixed.
- As soon as the memory leak is fixed Ill check
sequences of frames that represent a full line of written
text.
- If the above showes satisfiying results Id like to
check a slightly new approach. Id like to analyse
the greylevels of a single pixel or a set of pixels over
all the frames. In the optimal the greylevels should be
white until the ink covers this pixel and then stay
black. If this pixel is coverd by the pen in the meantime
there will be a short disturbance of the graph.
Longer time goals with less priority:
- I will capture some more sequencies with pens that have
different ink colors (red, green, grey) and with other
writers than myself.
- Analyse the influence of the ETHZ input interface to my
detection process and my output interface.
- Think about the automatic getting of the mask after the
writer completed a line of text.
- Read about projective geometry to be able to reproject
the captured image into a normalized view of the paper.
With this I should be able to detect the position of the
paper and normalize into the point of view of a scanner
or a graphics tablet.
zurück zur Uebersicht der
Diplomarbeit
Last modified 23.12.1999 00:18:51 by Thomas von Siebenthal