Empirical learning methods for digitized document recognition: an integrated approach to inductive generalization