The Clickstream Similarity node applies a modified ‘Markov Chain’ algorithm to calculate the probability that a Customer who views the If Product will, at some later point, also view the Then Product. The algorithm gives greater weight to the Correlation Probability when the Then Product is viewed soon after the If Product. It also gives greater weight to Products viewed towards the end of the Customer’s journey through the website than to Products viewed at the beginning. These later Products are more likely to represent the Customer’s true interest than the earlier Products the Customer discovers when first entering the website.
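The exact weighting used by the node is not documented here, so the following Python sketch only illustrates the two ideas described above; it is not the node's algorithm. The function name `weighted_follow_scores`, the `gap_decay` parameter, and the linear "lateness" weight are all assumptions made for the example.

```python
from collections import defaultdict


def weighted_follow_scores(sessions, gap_decay=0.5):
    """Illustrative If -> Then scores from ordered product views.

    sessions: list of journeys, each a list of product ids in view order.
    gap_decay: hypothetical factor (0..1) that shrinks the weight for every
               extra view between the If Product and the Then Product.
    """
    pair_total = defaultdict(float)   # (if, then) -> summed weight
    if_sessions = defaultdict(int)    # if -> number of journeys containing it

    for views in sessions:
        n = len(views)
        best = {}  # strongest weight per (if, then) pair within this journey
        for i, if_product in enumerate(views):
            for j in range(i + 1, n):
                then_product = views[j]
                if then_product == if_product:
                    continue
                recency = gap_decay ** (j - i - 1)  # 1.0 when viewed next
                lateness = (j + 1) / n              # 1.0 for the last view
                key = (if_product, then_product)
                best[key] = max(best.get(key, 0.0), recency * lateness)
        for product in set(views):
            if_sessions[product] += 1
        for key, weight in best.items():
            pair_total[key] += weight

    # Normalise by how many journeys contained the If Product at all.
    return {key: total / if_sessions[key[0]] for key, total in pair_total.items()}


sessions = [["A", "B", "C"], ["A", "C"]]
scores = weighted_follow_scores(sessions)
print(scores[("A", "C")])  # 0.75
```

In this toy data, `C` scores higher than `B` as a Then Product for `A` because it follows `A` in both journeys and is always the final view.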

*This Premium Node is not available as part of the free Community Edition. Premium Nodes help clean and connect real-world data to Market Simulations, and provide advanced Market Science analysis. Note that these descriptions are often deliberately vague.*

# Clickstream Similarity

The Clickstream Similarity node converts a Clickstream Log File into an Output Product Similarity Rankings table. Downstream nodes can use this table to generate a Product Correlation Matrix, and it can then be used to generate a WTP Matrix containing the Willingness To Pay (WTP) of each Customer for each Product in a Market.

## Inputs

#### Product Clickstream

Lists each Customer who visited the website alongside the Products that Customer viewed, in the order visited.
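As a rough illustration of this input, the sketch below groups raw log rows into one ordered list of Products per Customer. The row layout `(customer_id, view_order, product_id)` and the function name `group_views_by_customer` are hypothetical; the node's actual column names are not specified here.

```python
from collections import defaultdict


def group_views_by_customer(rows):
    """Group (customer_id, view_order, product_id) rows into ordered journeys."""
    journeys = defaultdict(list)
    # Sort by customer, then by view order, so each list follows the visit order.
    for customer, _, product in sorted(rows, key=lambda r: (r[0], r[1])):
        journeys[customer].append(product)
    return dict(journeys)


rows = [("c1", 2, "B"), ("c2", 1, "A"), ("c1", 1, "A"), ("c1", 3, "C")]
print(group_views_by_customer(rows))
# {'c1': ['A', 'B', 'C'], 'c2': ['A']}
```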

#### Product Array

Optional detail concerning each Product found within the ‘Input Product Clickstream’.

## Node

#### Configuration

The user can set the number of ranked ‘Then Products’ returned for each ‘If Product’ in the Output Product Similarity Rankings.
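The effect of this setting can be sketched as a top-N cut per If Product. The example below assumes similarity scores are already available as a dictionary keyed by `(if_product, then_product)`; that layout, the function name `rank_then_products`, and the `top_n` parameter name are illustrative assumptions rather than the node's real interface.

```python
from collections import defaultdict


def rank_then_products(scores, top_n=3):
    """Return (if_product, rank, then_product, score) rows, best score first."""
    by_if = defaultdict(list)
    for (if_product, then_product), score in scores.items():
        by_if[if_product].append((then_product, score))

    rows = []
    for if_product in sorted(by_if):
        # Highest score first; ties broken alphabetically for stable output.
        ranked = sorted(by_if[if_product], key=lambda pair: (-pair[1], pair[0]))
        for rank, (then_product, score) in enumerate(ranked[:top_n], start=1):
            rows.append((if_product, rank, then_product, score))
    return rows


scores = {("A", "B"): 0.2, ("A", "C"): 0.7, ("A", "D"): 0.5, ("B", "A"): 0.9}
print(rank_then_products(scores, top_n=2))
# [('A', 1, 'C', 0.7), ('A', 2, 'D', 0.5), ('B', 1, 'A', 0.9)]
```

With `top_n=2`, the lowest-scoring Then Product for `A` is dropped, while `B` keeps its single entry.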