The problem of combining object detection and object recognition is a difficult problem, as both object detection and object recognition methods must be combined. That's why in this repo we want to challenge ourselves with an application that can both detect text and recognize detected text areas. In addition, to make the problem more challenging, we also use a machine translation model.
In this application we use CRAFT for text detection, TRBA model for text recognition and Transformer model for text translation from English to Vietnamese.
git clone this repo
Before that, let create a folder named models in Translate-text-from-image, you can download all file in this folder.
cd Translate-text-from-image
pip install -r requirements.txt
python/python3 my_api.py
After running all code above, you can try it by your browser. Let put http://localhost:8000/text
in your browser.
git clone this repo
cd Translate-text-from-image
docker build -t image_name
docker run -it --name container_name --network=host --ipc=host -p 8000:80 image_name
Where image_name and container_name are the name of the image and the name of the container you want to give them
To use the application, you can directly transmit the English text that needs to be translated into Vietnamese into the text field on the left, then press the "Translate" button, the translated text will return as Vietnamese on the right ( red area).
To use this mode, you need to select a photo from your computer using the "Choose file" button. After the photo is uploaded, press the Translate button and wait for the returned result to be the text translated from the transferred image.