A distribution-free multi-factorial profiler for harvesting information from high-density screenings.

Data screening is an indispensable phase in initiating the scientific discovery process. Fractional factorial designs offer quick and economical options for engineering highly-dense structured datasets. Maximum information content is harvested when a selected fractional factorial scheme is driven to...

Full description

Bibliographic Details
Main Author: George J Besseris
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC3756950?pdf=render
id doaj-bea975eb58584fbf87996d4ac61b6bb1
record_format Article
spelling doaj-bea975eb58584fbf87996d4ac61b6bb12020-11-24T21:12:24ZengPublic Library of Science (PLoS)PLoS ONE1932-62032013-01-0188e7327510.1371/journal.pone.0073275A distribution-free multi-factorial profiler for harvesting information from high-density screenings.George J BesserisData screening is an indispensable phase in initiating the scientific discovery process. Fractional factorial designs offer quick and economical options for engineering highly-dense structured datasets. Maximum information content is harvested when a selected fractional factorial scheme is driven to saturation while data gathering is suppressed to no replication. A novel multi-factorial profiler is presented that allows screening of saturated-unreplicated designs by decomposing the examined response to its constituent contributions. Partial effects are sliced off systematically from the investigated response to form individual contrasts using simple robust measures. By isolating each time the disturbance attributed solely to a single controlling factor, the Wilcoxon-Mann-Whitney rank stochastics are employed to assign significance. We demonstrate that the proposed profiler possesses its own self-checking mechanism for detecting a potential influence due to fluctuations attributed to the remaining unexplainable error. Main benefits of the method are: 1) easy to grasp, 2) well-explained test-power properties, 3) distribution-free, 4) sparsity-free, 5) calibration-free, 6) simulation-free, 7) easy to implement, and 8) expanded usability to any type and size of multi-factorial screening designs. The method is elucidated with a benchmarked profiling effort for a water filtration process.http://europepmc.org/articles/PMC3756950?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author George J Besseris
spellingShingle George J Besseris
A distribution-free multi-factorial profiler for harvesting information from high-density screenings.
PLoS ONE
author_facet George J Besseris
author_sort George J Besseris
title A distribution-free multi-factorial profiler for harvesting information from high-density screenings.
title_short A distribution-free multi-factorial profiler for harvesting information from high-density screenings.
title_full A distribution-free multi-factorial profiler for harvesting information from high-density screenings.
title_fullStr A distribution-free multi-factorial profiler for harvesting information from high-density screenings.
title_full_unstemmed A distribution-free multi-factorial profiler for harvesting information from high-density screenings.
title_sort distribution-free multi-factorial profiler for harvesting information from high-density screenings.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2013-01-01
description Data screening is an indispensable phase in initiating the scientific discovery process. Fractional factorial designs offer quick and economical options for engineering highly-dense structured datasets. Maximum information content is harvested when a selected fractional factorial scheme is driven to saturation while data gathering is suppressed to no replication. A novel multi-factorial profiler is presented that allows screening of saturated-unreplicated designs by decomposing the examined response to its constituent contributions. Partial effects are sliced off systematically from the investigated response to form individual contrasts using simple robust measures. By isolating each time the disturbance attributed solely to a single controlling factor, the Wilcoxon-Mann-Whitney rank stochastics are employed to assign significance. We demonstrate that the proposed profiler possesses its own self-checking mechanism for detecting a potential influence due to fluctuations attributed to the remaining unexplainable error. Main benefits of the method are: 1) easy to grasp, 2) well-explained test-power properties, 3) distribution-free, 4) sparsity-free, 5) calibration-free, 6) simulation-free, 7) easy to implement, and 8) expanded usability to any type and size of multi-factorial screening designs. The method is elucidated with a benchmarked profiling effort for a water filtration process.
url http://europepmc.org/articles/PMC3756950?pdf=render
work_keys_str_mv AT georgejbesseris adistributionfreemultifactorialprofilerforharvestinginformationfromhighdensityscreenings
AT georgejbesseris distributionfreemultifactorialprofilerforharvestinginformationfromhighdensityscreenings
_version_ 1716751067188822016