'Extracting Hebrew text from image in python
I want to extract Hebrew text from an image.
I've tried using pytesseract, but it gets some letters confused (for example ' instead of י or נ instead of כ)
I tried doing some manipulations on the image (such as resizing, removing noise and binarization) which helped a little but still got many mistakes.
I've spent hours searching for better text extraction tools but couldn't find.
So here's my question:
A) Is there a tool I can use that I might have missed?
B) If not, what are the steps to creating my own?
Thanks in advance, Amichai
Solution 1:[1]
Did you try HebHTR? It is based on tf1 which is a challenge to integrate.
About tesseract, check out this tutorial
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|---|
| Solution 1 | YScharf |
