Explainable Machine Learning Exploiting News and Domain-Specific Lexicon for Stock Market Forecasting

In this manuscript, we propose a Machine Learning approach to tackle a binary classification problem whose goal is to predict the magnitude (high or low) of future stock price variations for individual companies of the S&P 500 index. Sets of lexicons are generated from globally published art...

Full description

Bibliographic Details
Main Authors:	Salvatore M. Carta, Sergio Consoli, Luca Piras, Alessandro Sebastian Podda, Diego Reforgiato Recupero
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Stock market forecasting machine learning natural language processing financial technology explainable artificial intelligence
Online Access:	https://ieeexplore.ieee.org/document/9355141/

id	doaj-311240e90475458b9f614f4cb404a16f
record_format	Article
spelling	doaj-311240e90475458b9f614f4cb404a16f2021-03-30T15:05:53ZengIEEEIEEE Access2169-35362021-01-019301933020510.1109/ACCESS.2021.30599609355141Explainable Machine Learning Exploiting News and Domain-Specific Lexicon for Stock Market ForecastingSalvatore M. Carta0https://orcid.org/0000-0001-9481-511XSergio Consoli1https://orcid.org/0000-0001-7357-5858Luca Piras2Alessandro Sebastian Podda3https://orcid.org/0000-0002-7862-8362Diego Reforgiato Recupero4https://orcid.org/0000-0001-8646-6183Department of Mathematics and Computer Science, University of Cagliari, Cagliari, ItalyEuropean Commission, Joint Research Centre (DG-JRC), Ispra, ItalyDepartment of Mathematics and Computer Science, University of Cagliari, Cagliari, ItalyDepartment of Mathematics and Computer Science, University of Cagliari, Cagliari, ItalyDepartment of Mathematics and Computer Science, University of Cagliari, Cagliari, ItalyIn this manuscript, we propose a Machine Learning approach to tackle a binary classification problem whose goal is to predict the magnitude (high or low) of future stock price variations for individual companies of the S&P 500 index. Sets of lexicons are generated from globally published articles with the goal of identifying the most impactful words on the market in a specific time interval and within a certain business sector. A feature engineering process is then performed out of the generated lexicons, and the obtained features are fed to a Decision Tree classifier. The predicted label (high or low) represents the underlying company's stock price variation on the next day, being either higher or lower than a certain threshold. The performance evaluation we have carried out through a walk-forward strategy, and against a set of solid baselines, shows that our approach clearly outperforms the competitors. Moreover, the devised Artificial Intelligence (AI) approach is explainable, in the sense that we analyze the white-box behind the classifier and provide a set of explanations on the obtained results.https://ieeexplore.ieee.org/document/9355141/Stock market forecastingmachine learningnatural language processingfinancial technologyexplainable artificial intelligence
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Salvatore M. Carta Sergio Consoli Luca Piras Alessandro Sebastian Podda Diego Reforgiato Recupero
spellingShingle	Salvatore M. Carta Sergio Consoli Luca Piras Alessandro Sebastian Podda Diego Reforgiato Recupero Explainable Machine Learning Exploiting News and Domain-Specific Lexicon for Stock Market Forecasting IEEE Access Stock market forecasting machine learning natural language processing financial technology explainable artificial intelligence
author_facet	Salvatore M. Carta Sergio Consoli Luca Piras Alessandro Sebastian Podda Diego Reforgiato Recupero
author_sort	Salvatore M. Carta
title	Explainable Machine Learning Exploiting News and Domain-Specific Lexicon for Stock Market Forecasting
title_short	Explainable Machine Learning Exploiting News and Domain-Specific Lexicon for Stock Market Forecasting
title_full	Explainable Machine Learning Exploiting News and Domain-Specific Lexicon for Stock Market Forecasting
title_fullStr	Explainable Machine Learning Exploiting News and Domain-Specific Lexicon for Stock Market Forecasting
title_full_unstemmed	Explainable Machine Learning Exploiting News and Domain-Specific Lexicon for Stock Market Forecasting
title_sort	explainable machine learning exploiting news and domain-specific lexicon for stock market forecasting
publisher	IEEE
series	IEEE Access
issn	2169-3536
publishDate	2021-01-01
description	In this manuscript, we propose a Machine Learning approach to tackle a binary classification problem whose goal is to predict the magnitude (high or low) of future stock price variations for individual companies of the S&P 500 index. Sets of lexicons are generated from globally published articles with the goal of identifying the most impactful words on the market in a specific time interval and within a certain business sector. A feature engineering process is then performed out of the generated lexicons, and the obtained features are fed to a Decision Tree classifier. The predicted label (high or low) represents the underlying company's stock price variation on the next day, being either higher or lower than a certain threshold. The performance evaluation we have carried out through a walk-forward strategy, and against a set of solid baselines, shows that our approach clearly outperforms the competitors. Moreover, the devised Artificial Intelligence (AI) approach is explainable, in the sense that we analyze the white-box behind the classifier and provide a set of explanations on the obtained results.
topic	Stock market forecasting machine learning natural language processing financial technology explainable artificial intelligence
url	https://ieeexplore.ieee.org/document/9355141/
work_keys_str_mv	AT salvatoremcarta explainablemachinelearningexploitingnewsanddomainspecificlexiconforstockmarketforecasting AT sergioconsoli explainablemachinelearningexploitingnewsanddomainspecificlexiconforstockmarketforecasting AT lucapiras explainablemachinelearningexploitingnewsanddomainspecificlexiconforstockmarketforecasting AT alessandrosebastianpodda explainablemachinelearningexploitingnewsanddomainspecificlexiconforstockmarketforecasting AT diegoreforgiatorecupero explainablemachinelearningexploitingnewsanddomainspecificlexiconforstockmarketforecasting
_version_	1724180077645660160

Explainable Machine Learning Exploiting News and Domain-Specific Lexicon for Stock Market Forecasting

Similar Items