Spanning Web Logs

If a session extends from one Web log into another, the data collected for that session from the first Web log is incomplete. In this case, the Clickstream Sessionize transformation cannot determine whether the session is complete, and it marks the record with a 2 in the Exit point column of the output record. The incomplete session data is held in the permanent library, which is set in the Permanent library path field on the Options tab of the Clickstream Sessionize transformation. Once the following day's data is captured in another Web log, the session data is matched up and collected.

For example, if the cutoff for the Web log is at midnight, but a user clicks through the Web site from 11:30 p.m. until 12:30 a.m., then the session information is split across two Web logs. The data from the first day is held in the permanent library as incomplete until it is matched with the second Web log. For this reason, it is important to manage the contents of the permanent library properly between runs. See Best Practices for the Clickstream Sessionize Transformation.
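The carry-over behavior described above can be illustrated with a small sketch. This is not the code of the Clickstream Sessionize transformation. It is a simplified Python model written under stated assumptions: sessions are split on a 30-minute inactivity timeout, a session counts as incomplete when its last click falls within that timeout of the end of the log, and a plain dictionary stands in for the permanent library. The names sessionize_log, SESSION_TIMEOUT, and carried_over are hypothetical.

```python
from datetime import datetime, timedelta

# Assumed inactivity timeout; the real transformation's setting may differ.
SESSION_TIMEOUT = timedelta(minutes=30)


def sessionize_log(records, carried_over, log_end):
    """Group (visitor_id, timestamp) clicks from one Web log into sessions.

    carried_over maps visitor_id to the clicks of a session that was still
    open at the end of the previous log (a stand-in for the permanent library).
    Returns (complete_sessions, still_open), where still_open must be kept
    and passed in when the next log is processed.
    """
    # Start from the sessions held over from the previous run.
    open_sessions = {v: list(clicks) for v, clicks in carried_over.items()}
    complete = []

    for visitor, ts in sorted(records, key=lambda r: r[1]):
        clicks = open_sessions.get(visitor)
        if clicks and ts - clicks[-1] > SESSION_TIMEOUT:
            # The gap is too long: close the old session, start a new one.
            complete.append((visitor, clicks))
            clicks = None
        if clicks is None:
            clicks = []
            open_sessions[visitor] = clicks
        clicks.append(ts)

    still_open = {}
    for visitor, clicks in open_sessions.items():
        if log_end - clicks[-1] > SESSION_TIMEOUT:
            complete.append((visitor, clicks))
        else:
            # Too close to the cutoff to tell: hold it for the next log.
            still_open[visitor] = clicks
    return complete, still_open


# Day 1 log, cut off at midnight. Visitor u1 is still clicking at 11:55 p.m.
day1 = [
    ("u2", datetime(2024, 1, 1, 10, 0)),
    ("u2", datetime(2024, 1, 1, 10, 5)),
    ("u1", datetime(2024, 1, 1, 23, 30)),
    ("u1", datetime(2024, 1, 1, 23, 45)),
    ("u1", datetime(2024, 1, 1, 23, 55)),
]
done1, held1 = sessionize_log(day1, {}, datetime(2024, 1, 2, 0, 0))

# Day 2 log. The held clicks for u1 are matched with the new clicks.
day2 = [
    ("u1", datetime(2024, 1, 2, 0, 10)),
    ("u1", datetime(2024, 1, 2, 0, 30)),
]
done2, held2 = sessionize_log(day2, held1, datetime(2024, 1, 3, 0, 0))
```

In this model, the first run completes only the session for u2 and holds the three clicks for u1. The second run joins them with the two clicks after midnight into a single five-click session. If the held data were cleared or altered between the two runs, the second run would see the after-midnight clicks as a new, shorter session, which is the reason the contents of the permanent library need to be managed between runs.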