Extracting Hebrew text from an image in Python

I want to extract Hebrew text from an image.

I've tried using pytesseract, but it confuses some letters (for example, ' instead of י, or נ instead of כ).

I tried preprocessing the image (resizing, removing noise and binarization), which helped a little, but I still got many mistakes.
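For reference, the preprocessing described above could look roughly like the sketch below. It is not the exact code I used. It assumes Pillow for loading, resizing and median-filter denoising, and uses a small pure-Python Otsu threshold for the binarization step. The names `otsu_threshold`, `binarize` and `preprocess_for_ocr`, the 2x scale factor and the 3x3 filter size are illustrative choices, not tuned values.

```python
def otsu_threshold(pixels):
    """Pick the gray level (0-255) that best separates dark from light pixels."""
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    w_bg = 0    # number of pixels at or below the candidate threshold
    sum_bg = 0  # sum of their gray levels
    for t in range(256):
        w_bg += hist[t]
        sum_bg += t * hist[t]
        w_fg = total - w_bg
        if w_bg == 0 or w_fg == 0:
            continue  # one class is empty, so there is nothing to separate
        mean_bg = sum_bg / w_bg
        mean_fg = (sum_all - sum_bg) / w_fg
        # Between-class variance (up to a constant factor).
        var_between = w_bg * w_fg * (mean_bg - mean_fg) ** 2
        if var_between > best_var:
            best_var, best_t = var_between, t
    return best_t


def binarize(pixels, threshold):
    """Map every pixel to pure black (0) or pure white (255)."""
    return [255 if p > threshold else 0 for p in pixels]


def preprocess_for_ocr(path, scale=2):
    """Grayscale, upscale, denoise and binarize an image file (assumes Pillow)."""
    from PIL import Image, ImageFilter  # imported lazily; requires Pillow

    img = Image.open(path).convert("L")                    # 8-bit grayscale
    img = img.resize((img.width * scale, img.height * scale), Image.LANCZOS)
    img = img.filter(ImageFilter.MedianFilter(3))          # remove speckle noise
    t = otsu_threshold(img.tobytes())                      # one byte per pixel in "L" mode
    return img.point(lambda p: 255 if p > t else 0)        # black text on white
```

The result of `preprocess_for_ocr("page.png")` (a hypothetical file name) is a Pillow image that can be passed directly to pytesseract.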

I've spent hours searching for better text extraction tools but couldn't find any.

So here are my questions:

A) Is there a tool I can use that I might have missed?

B) If not, what are the steps to creating my own?

Thanks in advance, Amichai



Solution 1:[1]

Did you try HebHTR? It is based on TensorFlow 1 (tf1), which makes it a challenge to integrate.

About tesseract, check out this tutorial
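The tutorial itself is not reproduced here. As a rough sketch of the general idea, calling Tesseract with its Hebrew model from Python might look like the code below. It assumes that pytesseract and the Tesseract binary are installed and that the `heb` traineddata file is available. The helper names `build_tesseract_config` and `ocr_hebrew` are made up for illustration, and the `--psm 6` (single uniform block of text) and `--oem 1` (LSTM engine) values are starting points to experiment with, not recommendations from the tutorial.

```python
def build_tesseract_config(psm=6, oem=1):
    """Build the extra command-line flags passed to Tesseract."""
    # --oem 1: LSTM engine only; --psm 6: treat the image as one block of text.
    return f"--oem {oem} --psm {psm}"


def ocr_hebrew(image, psm=6):
    """Run Tesseract on a PIL image (or file path) using the Hebrew model."""
    import pytesseract  # imported lazily; needs the Tesseract binary on PATH

    # Fail early with a clear message if the Hebrew model is missing.
    if "heb" not in pytesseract.get_languages(config=""):
        raise RuntimeError("Tesseract has no 'heb' traineddata installed")

    return pytesseract.image_to_string(
        image, lang="heb", config=build_tesseract_config(psm)
    )
```

If single words or single lines are being recognized rather than paragraphs, trying other page segmentation modes (for example `psm=7` for one text line) may change which letters get confused.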

Sources

This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.

Source: Stack Overflow

[1] Solution 1: YScharf