Anomaly Detection and Repair for Accurate Predictions in Geo-distributed Big Data