Geocoding in Python: A Complete Guide
When dealing with large datasets for machine learning, have you ever come across an address column that looks like this? Location data can be very messy and difficult to process. It is difficult to encode addresses, since they are of very high cardinality. If you try to encode a column like this with a technique like one-hot encoding, it will lead to high dimensionality, and your machine learning model might not perform well. The easiest way to overcome this problem is to geocode these columns. Geocoding is the process of converting addresses into geographical coordinates.
Nov-19-2022, 12:36:03 GMT