This answer assumes that you have ruled working on the DOM level with selectors.
So if you want to do visual testing, I see several options:
(1) If you are not tied to Selenium, use the free Kantu web testing tool, which works visually, just the way you want it to work. It has built-in OCR. At the very least, you can use this tool test if the visual testing approach itself is the right one for your test case – before spending much time to get Selenium to work this way.
(3) Maybe you do not need OCR, but “only” image recognition? In this case Selenium + Sikuli is an option.