Full session (30 minutes)
Data
Day 1 | 13:20-13:50 | A4
Measuring unique users in a billion-user network is hard: accurate counting is space-consuming and not easily distributable.
In this talk I will describe HyperLogLog, a probabilistic cardinality estimation algorithm and data structure, and how we used it to provide breakdowns of our billion-user reach.
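
For readers unfamiliar with the algorithm, here is a minimal Python sketch of the general HyperLogLog idea. It is not the implementation discussed in the talk. The class name `HyperLogLog`, the precision parameter `p`, the choice of SHA-1 as the hash, and the `user-…` identifiers are all assumptions made for illustration.

```python
import hashlib
import math


class HyperLogLog:
    """Illustrative HyperLogLog sketch with 2**p registers (hypothetical class)."""

    def __init__(self, p=12):
        if not 7 <= p <= 16:
            raise ValueError("this sketch assumes 7 <= p <= 16")
        self.p = p
        self.m = 1 << p                  # number of registers
        self.registers = [0] * self.m

    def add(self, item):
        # Assumption: the first 8 bytes of SHA-1 serve as a 64-bit hash.
        digest = hashlib.sha1(str(item).encode("utf-8")).digest()
        h = int.from_bytes(digest[:8], "big")
        width = 64 - self.p
        idx = h >> width                 # top p bits pick a register
        rest = h & ((1 << width) - 1)    # remaining bits feed the rank
        # Rank = 1-based position of the leftmost 1-bit in `rest`.
        rank = width - rest.bit_length() + 1
        if rank > self.registers[idx]:
            self.registers[idx] = rank

    def merge(self, other):
        # The union of two sketches is the register-wise maximum.
        if self.p != other.p:
            raise ValueError("precision mismatch")
        self.registers = [max(a, b) for a, b in zip(self.registers, other.registers)]

    def count(self):
        m = self.m
        alpha = 0.7213 / (1 + 1.079 / m)  # bias constant, valid for m >= 128
        raw = alpha * m * m / sum(2.0 ** -r for r in self.registers)
        zeros = self.registers.count(0)
        if raw <= 2.5 * m and zeros:
            # Small-range correction: fall back to linear counting.
            return m * math.log(m / zeros)
        return raw


# Two shards count disjoint halves of the users, then merge.
shard_a, shard_b = HyperLogLog(), HyperLogLog()
for i in range(50_000):
    shard_a.add(f"user-{i}")
for i in range(50_000, 100_000):
    shard_b.add(f"user-{i}")
shard_a.merge(shard_b)
print(round(shard_a.count()))  # an estimate near 100000
```

With `p=12` the sketch keeps 4096 small registers regardless of how many users it sees, and the expected relative error is roughly 1.04 / sqrt(4096), about 1.6%. Because merging is just a per-register maximum, sketches built on separate machines can be combined without revisiting the raw data.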