It is now common knowledge that advertisers have many methods to collect user data, but here's how they identify their "super-users". Super-users are individuals that are likely to buy a significant amount of offered products and services.
This step-by-step process of finding super-users is called "Look Alike Modelling".
- The user base is first split into superusers and normal users
- Advertisers would then send a list of the superusers’ device identifiers (obtained through browser fingerprinting) and send a list of the normal users’ device identifiers to a data scientist.
- This list of device identifiers is then analyzed to understand the characteristics of superusers as compared to other users.
- To distinguish between the two lists of data on different users, a data scientist would build a “feature vector”. A feature vector in this scenario, is a vector of numerical values that represent certain features that one list of users would have versus the other list. Features that feature vectors would contain could be time zones, devices, and even age.
- These feature vectors are then used to make new sets of data that differentiate between the features of super users and normal users.
- Feature vectors then take individual users that are configured to optimize the superuser are then put into a new data set.
- This new, expanded data set’s users are compared to the feature vectors to make and implement even more efficient feature vectors.