Skip to content

ellerypan/MANGA-TRANSLATOR

Repository files navigation

Manga Translator

This is the model pipeline in our self-designed application--"Mlator", which is aimed to help manga fans and publishers to overcome the language barrie and lower the cost of translation respectively.

Workflow

This example image is from <<Q.E.D.iff-proven end-11>> Episode 1 © Motohiro Katou.

Bubble Detection

First, train a object detection model that helps us locate the text in the bubble. Here we thanks to Manga109 providing us with large amount of high quality annotated dataset.As the following image shows, the identified areas are marked with orange bounding boxes, and content in the box would be processed by the next step.

Screen Shot 2019-05-07 at 12 54 04 AM

Optical Character Recognition

Next we use a state-of-the-art OCR engine to parse the image segment we identified in step 1 into machine-readable text. Besides, a few tricks are needed to help the model parse vertically-oriented Japanese text and stylized comic fonts.

Screen Shot 2019-05-07 at 12 54 16 AM

Translation

All the extracted Japanese text is translated to English. This is a crucial stage in the process, since a quality translation is what allows readers to enjoy the results.

Screen Shot 2019-05-07 at 12 54 28 AM

Text Removal

If we simply use the bounding boxes as our translated text background, some of the boxes would leak beyond the bounds of the bubble, which make the page uncomfortable to read. It would be the best if the bubble is used for background, that's why we need to remove the original text.

Screen Shot 2019-05-07 at 12 54 44 AM

Placement

Finally, the English text is broken up into lines of an appropriate length and resized to comfortably fit their corresponding speech bubble. At this point, the comics are translated and ready for reading!

Screen Shot 2019-05-07 at 12 54 51 AM

Instructions (Work in progress)

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published