GOLIATH: Geographies of Lifestyle, Activity, Transport and Health

GOLIATH: Geographies of Lifestyle, Activity, Transport and Health

Consumer data collected by commercial providers have huge potential for various research purposes, but can be challenging to access as they are often held in secure environments. Secure handling of these datasets is crucial, as consumer data contains sensitive attributes (e.g. address) or commercially sensitive data (e.g. they have been purchased or contain licenced information). There is a trade-off between privacy and the utility of data. These new data sources could be utilised to understand phenomena better that are hard to capture through traditional data sources (Rango & Vespe, 2017).

This project aims to enable easier access to consumer/commercial data for research purposes, thereby making data available for a broader range of academic users. Taking securely held consumer datasets within the Consumer Data Research Centre (CDRC), the objective is to produce non-disclosive and aggregated versions of the data whilst maintaining the unique characteristics and value of consumer data. Various privacy preservation methods, including aggregation techniques and more advanced microsimulation and machine learning, will be applied to create synthetic versions of these data. Visualisations will be used to showcase the utility of non-disclosive datasets for research purposes.

 

References

Rango, M., & Vespe, M. (2017). Big Data and alternative data sources on migration: From case-studies to policy support – Summary report. European Commission, Joint Research Centre.