Data integrity vs. inference accuracy in large AIS datasets