Automatic extraction of logical web lists