Extracting complements and substitutes from sales data: a network perspective
Abstract The complementarity and substitutability between products are essential concepts in retail and marketing. Qualitatively, two products are said to be substitutable if a customer can replace one product by the other, while they are complementary if they tend to be bought together. In this art...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2021-08-01
|
Series: | EPJ Data Science |
Subjects: | |
Online Access: | https://doi.org/10.1140/epjds/s13688-021-00297-4 |
id |
doaj-b282a35d556d40b99cbb2d6d0f86fffc |
---|---|
record_format |
Article |
spelling |
doaj-b282a35d556d40b99cbb2d6d0f86fffc2021-08-29T11:45:17ZengSpringerOpenEPJ Data Science2193-11272021-08-0110112710.1140/epjds/s13688-021-00297-4Extracting complements and substitutes from sales data: a network perspectiveYu Tian0Sebastian Lautz1Alisdair O. G. Wallis2Renaud Lambiotte3Mathematical Institute, University of OxfordTesco PLC, Tesco House, Shire ParkTesco PLC, Tesco House, Shire ParkMathematical Institute, University of OxfordAbstract The complementarity and substitutability between products are essential concepts in retail and marketing. Qualitatively, two products are said to be substitutable if a customer can replace one product by the other, while they are complementary if they tend to be bought together. In this article, we take a network perspective to help automatically identify complements and substitutes from sales transaction data. Starting from a bipartite product-purchase network representation, with both transaction nodes and product nodes, we develop appropriate null models to infer significant relations, either complements or substitutes, between products, and design measures based on random walks to quantify their importance. The resulting unipartite networks between products are then analysed with community detection methods, in order to find groups of similar products for the different types of relationships. The results are validated by combining observations from a real-world basket dataset with the existing product hierarchy, as well as a large-scale flavour compound and recipe dataset.https://doi.org/10.1140/epjds/s13688-021-00297-4Product relationshipsNetwork modellingRole extractionSales dataMarket basket analysis |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yu Tian Sebastian Lautz Alisdair O. G. Wallis Renaud Lambiotte |
spellingShingle |
Yu Tian Sebastian Lautz Alisdair O. G. Wallis Renaud Lambiotte Extracting complements and substitutes from sales data: a network perspective EPJ Data Science Product relationships Network modelling Role extraction Sales data Market basket analysis |
author_facet |
Yu Tian Sebastian Lautz Alisdair O. G. Wallis Renaud Lambiotte |
author_sort |
Yu Tian |
title |
Extracting complements and substitutes from sales data: a network perspective |
title_short |
Extracting complements and substitutes from sales data: a network perspective |
title_full |
Extracting complements and substitutes from sales data: a network perspective |
title_fullStr |
Extracting complements and substitutes from sales data: a network perspective |
title_full_unstemmed |
Extracting complements and substitutes from sales data: a network perspective |
title_sort |
extracting complements and substitutes from sales data: a network perspective |
publisher |
SpringerOpen |
series |
EPJ Data Science |
issn |
2193-1127 |
publishDate |
2021-08-01 |
description |
Abstract The complementarity and substitutability between products are essential concepts in retail and marketing. Qualitatively, two products are said to be substitutable if a customer can replace one product by the other, while they are complementary if they tend to be bought together. In this article, we take a network perspective to help automatically identify complements and substitutes from sales transaction data. Starting from a bipartite product-purchase network representation, with both transaction nodes and product nodes, we develop appropriate null models to infer significant relations, either complements or substitutes, between products, and design measures based on random walks to quantify their importance. The resulting unipartite networks between products are then analysed with community detection methods, in order to find groups of similar products for the different types of relationships. The results are validated by combining observations from a real-world basket dataset with the existing product hierarchy, as well as a large-scale flavour compound and recipe dataset. |
topic |
Product relationships Network modelling Role extraction Sales data Market basket analysis |
url |
https://doi.org/10.1140/epjds/s13688-021-00297-4 |
work_keys_str_mv |
AT yutian extractingcomplementsandsubstitutesfromsalesdataanetworkperspective AT sebastianlautz extractingcomplementsandsubstitutesfromsalesdataanetworkperspective AT alisdairogwallis extractingcomplementsandsubstitutesfromsalesdataanetworkperspective AT renaudlambiotte extractingcomplementsandsubstitutesfromsalesdataanetworkperspective |
_version_ |
1721186493928570880 |