K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM
ABSTRACT Y-rank can present faults when dealing with non-linear problems. A methodology is proposed to improve the selection of data in situations where y-rank is fragile. The proposed alternative, called k-rank, consists of splitting the data set into clusters using the k-means algorithm, and then...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Brazilian Society of Chemical Engineering
|
Series: | Brazilian Journal of Chemical Engineering |
Subjects: | |
Online Access: | http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0104-66322019000100409&lng=en&tlng=en |
id |
doaj-6e8405461897449da08840737e1d47bd |
---|---|
record_format |
Article |
spelling |
doaj-6e8405461897449da08840737e1d47bd2020-11-24T23:52:09ZengBrazilian Society of Chemical EngineeringBrazilian Journal of Chemical Engineering1678-438336140941910.1590/0104-6632.20190361s20170455S0104-66322019000100409K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEMPedro V. J. L. SantosLucas RanzanMarcelo FarenzenaJorge O. TrierweilerABSTRACT Y-rank can present faults when dealing with non-linear problems. A methodology is proposed to improve the selection of data in situations where y-rank is fragile. The proposed alternative, called k-rank, consists of splitting the data set into clusters using the k-means algorithm, and then apply y-rank to the generated clusters. Models were calibrated and tested with subsets split by y-rank and k-rank. For the Heating Tank case study, in 59% of the simulations, models calibrated with k-rank subsets achieved better results. For the Propylene / Propane Separation Unit case, when dealing with a small number of sample points, the y-rank models had errors almost three times higher than the k-rank models for the test subset, meaning that the fitted model could not deal properly with new unseen data. The proposed methodology was successful in splitting the data, especially in cases with a limited amount of samples.http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0104-66322019000100409&lng=en&tlng=enSplitting dataK-meansSystematic samplingMultiple solutions |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Pedro V. J. L. Santos Lucas Ranzan Marcelo Farenzena Jorge O. Trierweiler |
spellingShingle |
Pedro V. J. L. Santos Lucas Ranzan Marcelo Farenzena Jorge O. Trierweiler K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM Brazilian Journal of Chemical Engineering Splitting data K-means Systematic sampling Multiple solutions |
author_facet |
Pedro V. J. L. Santos Lucas Ranzan Marcelo Farenzena Jorge O. Trierweiler |
author_sort |
Pedro V. J. L. Santos |
title |
K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM |
title_short |
K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM |
title_full |
K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM |
title_fullStr |
K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM |
title_full_unstemmed |
K-RANK: AN EVOLUTION OF Y-RANK FOR MULTIPLE SOLUTIONS PROBLEM |
title_sort |
k-rank: an evolution of y-rank for multiple solutions problem |
publisher |
Brazilian Society of Chemical Engineering |
series |
Brazilian Journal of Chemical Engineering |
issn |
1678-4383 |
description |
ABSTRACT Y-rank can present faults when dealing with non-linear problems. A methodology is proposed to improve the selection of data in situations where y-rank is fragile. The proposed alternative, called k-rank, consists of splitting the data set into clusters using the k-means algorithm, and then apply y-rank to the generated clusters. Models were calibrated and tested with subsets split by y-rank and k-rank. For the Heating Tank case study, in 59% of the simulations, models calibrated with k-rank subsets achieved better results. For the Propylene / Propane Separation Unit case, when dealing with a small number of sample points, the y-rank models had errors almost three times higher than the k-rank models for the test subset, meaning that the fitted model could not deal properly with new unseen data. The proposed methodology was successful in splitting the data, especially in cases with a limited amount of samples. |
topic |
Splitting data K-means Systematic sampling Multiple solutions |
url |
http://www.scielo.br/scielo.php?script=sci_arttext&pid=S0104-66322019000100409&lng=en&tlng=en |
work_keys_str_mv |
AT pedrovjlsantos krankanevolutionofyrankformultiplesolutionsproblem AT lucasranzan krankanevolutionofyrankformultiplesolutionsproblem AT marcelofarenzena krankanevolutionofyrankformultiplesolutionsproblem AT jorgeotrierweiler krankanevolutionofyrankformultiplesolutionsproblem |
_version_ |
1725474631409729536 |