Browsing by Subject "Text recognition, Vision Transformer (ViT), You Only Look Once (YOLO), Generative Adversarial Network (GAN), multilingual-text recognition."
JavaScript is disabled for your browser. Some features of this site may not work without it.
Browsing by Subject "Text recognition, Vision Transformer (ViT), You Only Look Once (YOLO), Generative Adversarial Network (GAN), multilingual-text recognition."
Multilingual image-based text recognition is
a tough problem with several practical applications. This
work suggests an integrated ViT-YOLO model which
integrates the strengths of the Vision Transformer (ViT)
and ...