Black Highlighter — April 22nd, 2019
In this session, I figured out how to take the data representation of the text rectangles that were detected last time and represent them visually. A number of issues made this more difficult than first anticipated, though.
First, the coordinates that VNTextObservation
contains aren’t pixel-based coordinates; they’re percentages of the total image size. This is pretty easy to fix, we just multiply each point by the image size. However, it’s also in a “flipped” coordinate system. Instead of the top-left origin point used by UIKit, these coordinates use a lower-left origin point. We have to appropriately translate vertical positions before converting them to pixel coordinates.
The other issue that I ran into is that the image view we display the image in scales the image down to fit on screen. However, the visualization view I created doesn’t do the same scaling (yet). As such, the rectangles that the view draws to show detected text doesn’t quite line up with the text in the image. I’ll have to make the visualization view draw at the same scale as the image view if I want things to look right.
Commits Made
- Translate text observations to rectangles in image coordinates
- Create visualization view to draw detected text
Tickets Closed
Tickets Created
None.
Project Stats
- Sessions Completed
- 8
- Days Since Start
- 22
- Issues Closed
- 9
- Issues Open
- 24
- Percent Complete
- 27.2%
Replay
Watch this session on YouTube: