A dataset for Text Detection, Optical Character Recognition, Spatial Layout Analysis and Form Understanding.