Extracting complements and substitutes from sales data: a network perspective

Abstract The complementarity and substitutability between products are essential concepts in retail and marketing. Qualitatively, two products are said to be substitutable if a customer can replace one product by the other, while they are complementary if they tend to be bought together. In this art...

Full description

Bibliographic Details
Main Authors: Yu Tian, Sebastian Lautz, Alisdair O. G. Wallis, Renaud Lambiotte
Format: Article
Language:English
Published: SpringerOpen 2021-08-01
Series:EPJ Data Science
Subjects:
Online Access:https://doi.org/10.1140/epjds/s13688-021-00297-4
id doaj-b282a35d556d40b99cbb2d6d0f86fffc
record_format Article
spelling doaj-b282a35d556d40b99cbb2d6d0f86fffc2021-08-29T11:45:17ZengSpringerOpenEPJ Data Science2193-11272021-08-0110112710.1140/epjds/s13688-021-00297-4Extracting complements and substitutes from sales data: a network perspectiveYu Tian0Sebastian Lautz1Alisdair O. G. Wallis2Renaud Lambiotte3Mathematical Institute, University of OxfordTesco PLC, Tesco House, Shire ParkTesco PLC, Tesco House, Shire ParkMathematical Institute, University of OxfordAbstract The complementarity and substitutability between products are essential concepts in retail and marketing. Qualitatively, two products are said to be substitutable if a customer can replace one product by the other, while they are complementary if they tend to be bought together. In this article, we take a network perspective to help automatically identify complements and substitutes from sales transaction data. Starting from a bipartite product-purchase network representation, with both transaction nodes and product nodes, we develop appropriate null models to infer significant relations, either complements or substitutes, between products, and design measures based on random walks to quantify their importance. The resulting unipartite networks between products are then analysed with community detection methods, in order to find groups of similar products for the different types of relationships. The results are validated by combining observations from a real-world basket dataset with the existing product hierarchy, as well as a large-scale flavour compound and recipe dataset.https://doi.org/10.1140/epjds/s13688-021-00297-4Product relationshipsNetwork modellingRole extractionSales dataMarket basket analysis
collection DOAJ
language English
format Article
sources DOAJ
author Yu Tian
Sebastian Lautz
Alisdair O. G. Wallis
Renaud Lambiotte
spellingShingle Yu Tian
Sebastian Lautz
Alisdair O. G. Wallis
Renaud Lambiotte
Extracting complements and substitutes from sales data: a network perspective
EPJ Data Science
Product relationships
Network modelling
Role extraction
Sales data
Market basket analysis
author_facet Yu Tian
Sebastian Lautz
Alisdair O. G. Wallis
Renaud Lambiotte
author_sort Yu Tian
title Extracting complements and substitutes from sales data: a network perspective
title_short Extracting complements and substitutes from sales data: a network perspective
title_full Extracting complements and substitutes from sales data: a network perspective
title_fullStr Extracting complements and substitutes from sales data: a network perspective
title_full_unstemmed Extracting complements and substitutes from sales data: a network perspective
title_sort extracting complements and substitutes from sales data: a network perspective
publisher SpringerOpen
series EPJ Data Science
issn 2193-1127
publishDate 2021-08-01
description Abstract The complementarity and substitutability between products are essential concepts in retail and marketing. Qualitatively, two products are said to be substitutable if a customer can replace one product by the other, while they are complementary if they tend to be bought together. In this article, we take a network perspective to help automatically identify complements and substitutes from sales transaction data. Starting from a bipartite product-purchase network representation, with both transaction nodes and product nodes, we develop appropriate null models to infer significant relations, either complements or substitutes, between products, and design measures based on random walks to quantify their importance. The resulting unipartite networks between products are then analysed with community detection methods, in order to find groups of similar products for the different types of relationships. The results are validated by combining observations from a real-world basket dataset with the existing product hierarchy, as well as a large-scale flavour compound and recipe dataset.
topic Product relationships
Network modelling
Role extraction
Sales data
Market basket analysis
url https://doi.org/10.1140/epjds/s13688-021-00297-4
work_keys_str_mv AT yutian extractingcomplementsandsubstitutesfromsalesdataanetworkperspective
AT sebastianlautz extractingcomplementsandsubstitutesfromsalesdataanetworkperspective
AT alisdairogwallis extractingcomplementsandsubstitutesfromsalesdataanetworkperspective
AT renaudlambiotte extractingcomplementsandsubstitutesfromsalesdataanetworkperspective
_version_ 1721186493928570880