Translate-from-image

Introduction

The problem of combining object detection and object recognition is a difficult problem, as both object detection and object recognition methods must be combined. That's why in this repo we want to challenge ourselves with an application that can both detect text and recognize detected text areas. In addition, to make the problem more challenging, we also use a machine translation model.

In this application we use CRAFT for text detection, TRBA model for text recognition and Transformer model for text translation from English to Vietnamese.

Run demo

To run this repo, follow these steps if you don't want to use docker:

git clone this repo

Before that, let create a folder named models in Translate-text-from-image, you can download all file in this folder.

cd Translate-text-from-image
pip install -r requirements.txt
python/python3 my_api.py

After running all code above, you can try it by your browser. Let put http://localhost:8000/text in your browser.

To run this repo use docker:

git clone this repo
cd Translate-text-from-image
docker build -t image_name
docker run -it --name container_name --network=host --ipc=host -p 8000:80 image_name

Where image_name and container_name are the name of the image and the name of the container you want to give them

Usage

Translate from text

To use the application, you can directly transmit the English text that needs to be translated into Vietnamese into the text field on the left, then press the "Translate" button, the translated text will return as Vietnamese on the right ( red area).