Search Results for author: Ling Fu

Found 4 papers, 3 papers with code

Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering

1 code implementation • 21 May 2024 • Hiba Maryam, Ling Fu, Jiajun Song, Tajrian ABM Shafayet, Qidi Luo, Xiang Bai, Yuliang Liu

The development of Urdu scene text detection, recognition, and Visual Question Answering (VQA) technologies is crucial for advancing accessibility, information retrieval, and linguistic diversity in digital content, facilitating better understanding and interaction with Urdu-language visual data.

Information Retrieval, Question Answering +4

The First Swahili Language Scene Text Detection and Recognition Dataset

1 code implementation • 19 May 2024 • Fadila Wendigoundi Douamba, Jianjun Song, Ling Fu, Yuliang Liu, Xiang Bai

We propose a comprehensive dataset of Swahili scene text images and evaluate the dataset on different scene text detection and recognition models.

Information Retrieval, Scene Text Detection +2

Enhancing Scene Text Detectors with Realistic Text Image Synthesis Using Diffusion Models

no code implementations • 28 Nov 2023 • Ling Fu, Zijie Wu, Yingying Zhu, Yuliang Liu, Xiang Bai

We contend that one main limitation of existing generation methods is the insufficient integration of foreground text with the background.

Image Generation, Scene Text Detection +1

Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition

1 code implementation • 31 Jul 2022 • Xudong Xie, Ling Fu, Zhifei Zhang, Zhaowen Wang, Xiang Bai

Thirdly, we utilize a Transformer to learn image-level global features and to model the global relationships among the corner points, with the assistance of a corner-query cross-attention mechanism.

Scene Text Recognition
