A Commodity Classification Framework Based on Machine Learning for Analysis of Trade Declaration

Text, voice, images and videos can express some intentions and facts in daily life. By understanding these contents, people can identify and analyze some behaviors. This paper focuses on the commodity trade declaration process and identifies the commodity categories based on text information on cust...

Full description

Bibliographic Details
Main Authors: Mingshu He, Xiaojuan Wang, Chundong Zou, Bingying Dai, Lei Jin
Format: Article
Language:English
Published: MDPI AG 2021-05-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/13/6/964
id doaj-ff0db75f976149518b72619b32d9af02
record_format Article
spelling doaj-ff0db75f976149518b72619b32d9af022021-06-01T01:33:17ZengMDPI AGSymmetry2073-89942021-05-011396496410.3390/sym13060964A Commodity Classification Framework Based on Machine Learning for Analysis of Trade DeclarationMingshu He0Xiaojuan Wang1Chundong Zou2Bingying Dai3Lei Jin4School of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Electronic Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaDepartment of Statistics, Colorado State University, Fort Collins, CO 80523, USASchool of Computer Science, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaText, voice, images and videos can express some intentions and facts in daily life. By understanding these contents, people can identify and analyze some behaviors. This paper focuses on the commodity trade declaration process and identifies the commodity categories based on text information on customs declarations. Although the technology of text recognition is mature in many application fields, there are few studies on the classification and recognition of customs declaration goods. In this paper, we proposed a classification framework based on machine learning (ML) models for commodity trade declaration that reaches a high rate of accuracy. This paper also proposed a symmetrical decision fusion method for this task based on convolutional neural network (CNN) and transformer. The experimental results show that the fusion model can make up for the shortcomings of the two original models and some improvements have been made. In the two datasets used in this paper, the accuracy can reach 88% and 99%, respectively. To promote the development of study of customs declaration business and Chinese text recognition, we also exposed the proprietary datasets used in this study.https://www.mdpi.com/2073-8994/13/6/964trade declarationmachine learningtext classificationsymmetrical decision fusionharmonized System
collection DOAJ
language English
format Article
sources DOAJ
author Mingshu He
Xiaojuan Wang
Chundong Zou
Bingying Dai
Lei Jin
spellingShingle Mingshu He
Xiaojuan Wang
Chundong Zou
Bingying Dai
Lei Jin
A Commodity Classification Framework Based on Machine Learning for Analysis of Trade Declaration
Symmetry
trade declaration
machine learning
text classification
symmetrical decision fusion
harmonized System
author_facet Mingshu He
Xiaojuan Wang
Chundong Zou
Bingying Dai
Lei Jin
author_sort Mingshu He
title A Commodity Classification Framework Based on Machine Learning for Analysis of Trade Declaration
title_short A Commodity Classification Framework Based on Machine Learning for Analysis of Trade Declaration
title_full A Commodity Classification Framework Based on Machine Learning for Analysis of Trade Declaration
title_fullStr A Commodity Classification Framework Based on Machine Learning for Analysis of Trade Declaration
title_full_unstemmed A Commodity Classification Framework Based on Machine Learning for Analysis of Trade Declaration
title_sort commodity classification framework based on machine learning for analysis of trade declaration
publisher MDPI AG
series Symmetry
issn 2073-8994
publishDate 2021-05-01
description Text, voice, images and videos can express some intentions and facts in daily life. By understanding these contents, people can identify and analyze some behaviors. This paper focuses on the commodity trade declaration process and identifies the commodity categories based on text information on customs declarations. Although the technology of text recognition is mature in many application fields, there are few studies on the classification and recognition of customs declaration goods. In this paper, we proposed a classification framework based on machine learning (ML) models for commodity trade declaration that reaches a high rate of accuracy. This paper also proposed a symmetrical decision fusion method for this task based on convolutional neural network (CNN) and transformer. The experimental results show that the fusion model can make up for the shortcomings of the two original models and some improvements have been made. In the two datasets used in this paper, the accuracy can reach 88% and 99%, respectively. To promote the development of study of customs declaration business and Chinese text recognition, we also exposed the proprietary datasets used in this study.
topic trade declaration
machine learning
text classification
symmetrical decision fusion
harmonized System
url https://www.mdpi.com/2073-8994/13/6/964
work_keys_str_mv AT mingshuhe acommodityclassificationframeworkbasedonmachinelearningforanalysisoftradedeclaration
AT xiaojuanwang acommodityclassificationframeworkbasedonmachinelearningforanalysisoftradedeclaration
AT chundongzou acommodityclassificationframeworkbasedonmachinelearningforanalysisoftradedeclaration
AT bingyingdai acommodityclassificationframeworkbasedonmachinelearningforanalysisoftradedeclaration
AT leijin acommodityclassificationframeworkbasedonmachinelearningforanalysisoftradedeclaration
AT mingshuhe commodityclassificationframeworkbasedonmachinelearningforanalysisoftradedeclaration
AT xiaojuanwang commodityclassificationframeworkbasedonmachinelearningforanalysisoftradedeclaration
AT chundongzou commodityclassificationframeworkbasedonmachinelearningforanalysisoftradedeclaration
AT bingyingdai commodityclassificationframeworkbasedonmachinelearningforanalysisoftradedeclaration
AT leijin commodityclassificationframeworkbasedonmachinelearningforanalysisoftradedeclaration
_version_ 1721412182210510848