fromMedium
2 days agoDay 6-Sessionization of Web Logs using Time Difference | Apache Spark Interview Problem.
To accurately assign session IDs based on user activity, we need to understand the time difference between consecutive actions. If the difference exceeds 30 minutes, a new session starts.
Data science