Uber relies on accurate driver location data to power dispatch, ETA estimation, and marketplace monitoring. In practice, GPS pings can be incomplete, delayed, or malformed, so analysts need a clear SQL-based approach to handling missing values.
Explain how you would handle missing values in a driver location dataset stored in PostgreSQL. Your answer should cover:
NULL latitude/longitude, empty strings in source fields, or obviously invalid coordinates.The interviewer is not looking for advanced modeling or geospatial interpolation. Focus on practical SQL data manipulation: profiling the data, standardizing missing values, creating quality flags, and explaining trade-offs between excluding bad rows and filling defaults.