K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM

ABSTRACT Y-rank can present faults when dealing with non-linear problems. A methodology is proposed to improve the selection of data in situations where y-rank is fragile. The proposed alternative, called k-rank, consists of splitting the data set into clusters using the k-means algorithm, and then...

Full description

Bibliographic Details
Main Authors: Pedro V. J. L. Santos, Lucas Ranzan, Marcelo Farenzena, Jorge O. Trierweiler
Format: Article
Language:English
Published: Brazilian Society of Chemical Engineering
Series:Brazilian Journal of Chemical Engineering
Subjects:
Online Access:http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0104-66322019000100409&lng=en&tlng=en
id doaj-6e8405461897449da08840737e1d47bd
record_format Article
spelling doaj-6e8405461897449da08840737e1d47bd2020-11-24T23:52:09ZengBrazilian Society of Chemical EngineeringBrazilian Journal of Chemical Engineering1678-438336140941910.1590/0104-6632.20190361s20170455S0104-66322019000100409K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEMPedro V. J. L. SantosLucas RanzanMarcelo FarenzenaJorge O. TrierweilerABSTRACT Y-rank can present faults when dealing with non-linear problems. A methodology is proposed to improve the selection of data in situations where y-rank is fragile. The proposed alternative, called k-rank, consists of splitting the data set into clusters using the k-means algorithm, and then apply y-rank to the generated clusters. Models were calibrated and tested with subsets split by y-rank and k-rank. For the Heating Tank case study, in 59% of the simulations, models calibrated with k-rank subsets achieved better results. For the Propylene / Propane Separation Unit case, when dealing with a small number of sample points, the y-rank models had errors almost three times higher than the k-rank models for the test subset, meaning that the fitted model could not deal properly with new unseen data. The proposed methodology was successful in splitting the data, especially in cases with a limited amount of samples.http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0104-66322019000100409&lng=en&tlng=enSplitting dataK-meansSystematic samplingMultiple solutions
collection DOAJ
language English
format Article
sources DOAJ
author Pedro V. J. L. Santos
Lucas Ranzan
Marcelo Farenzena
Jorge O. Trierweiler
spellingShingle Pedro V. J. L. Santos
Lucas Ranzan
Marcelo Farenzena
Jorge O. Trierweiler
K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM
Brazilian Journal of Chemical Engineering
Splitting data
K-means
Systematic sampling
Multiple solutions
author_facet Pedro V. J. L. Santos
Lucas Ranzan
Marcelo Farenzena
Jorge O. Trierweiler
author_sort Pedro V. J. L. Santos
title K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM
title_short K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM
title_full K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM
title_fullStr K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM
title_full_unstemmed K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM
title_sort k-rank: an evolution of y-rank for multiple solutions problem
publisher Brazilian Society of Chemical Engineering
series Brazilian Journal of Chemical Engineering
issn 1678-4383
description ABSTRACT Y-rank can present faults when dealing with non-linear problems. A methodology is proposed to improve the selection of data in situations where y-rank is fragile. The proposed alternative, called k-rank, consists of splitting the data set into clusters using the k-means algorithm, and then apply y-rank to the generated clusters. Models were calibrated and tested with subsets split by y-rank and k-rank. For the Heating Tank case study, in 59% of the simulations, models calibrated with k-rank subsets achieved better results. For the Propylene / Propane Separation Unit case, when dealing with a small number of sample points, the y-rank models had errors almost three times higher than the k-rank models for the test subset, meaning that the fitted model could not deal properly with new unseen data. The proposed methodology was successful in splitting the data, especially in cases with a limited amount of samples.
topic Splitting data
K-means
Systematic sampling
Multiple solutions
url http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0104-66322019000100409&lng=en&tlng=en
work_keys_str_mv AT pedrovjlsantos krankanevolutionofyrankformultiplesolutionsproblem
AT lucasranzan krankanevolutionofyrankformultiplesolutionsproblem
AT marcelofarenzena krankanevolutionofyrankformultiplesolutionsproblem
AT jorgeotrierweiler krankanevolutionofyrankformultiplesolutionsproblem
_version_ 1725474631409729536