Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation

Skew estimation is one of the vital tasks in document processing systems, especially for scanned document images, because its performance impacts subsequent steps directly. Over the years, an enormous number of researches focus on this challenging problem in the rise of digitization age. In this research, we first propose a novel skew estimation method that extracts the dominant skew angle of the given document image by applying an Adaptive Radial Projection on the 2D Discrete Fourier Magnitude spectrum. Second, we introduce a high quality skew estimation dataset DISE-2021 to assess the performance of different estimators. Finally, we provide comprehensive analyses that focus on multiple improvement aspects of Fourier-based methods. Our results show that the proposed method is robust, reliable, and outperforms all compared methods. Data and code are available at https://github.com/phamquiluan/jdeskew.

PDF Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Document Image Skew Estimation DISE 2021 Dataset PypiDeskew Percentage correct 0.2 # 2
Document Image Skew Estimation DISE 2021 Dataset JDeskew Percentage correct 0.86 # 1

Methods