Auto-tunable GPU BLAS
In this paper, we present our implementation of an Auto tuning system, written in C++, which incorporate the use of OpenCL kernels. We deploy this approach on different GPU architectures, evaluating the performance of the approach. Our main focus is to easily generate tuned code, that would otherwis...
Main Author: | Lien, Geir Josten |
---|---|
Format: | Others |
Language: | English |
Published: |
Norges teknisk-naturvitenskapelige universitet, Institutt for datateknikk og informasjonsvitenskap
2012
|
Subjects: | |
Online Access: | http://urn.kb.se/resolve?urn=urn:nbn:no:ntnu:diva-18411 |
Similar Items
-
Techniques and Tools for Optimizing Codes on Modern Architectures: : A Low-Level Approach
by: Jensen, Rune Erlend
Published: (2009) -
Evolusjon av feil-tolerante digitale kretser ved bruk av en beregningsklynge
by: Martinsen, May Linda
Published: (2007) -
BSPlab - experiment manager (BEM)
by: Klepaker, Erlend Søreide
Published: (2006) -
Rekonfigurerbar maskinvare som applikasjonsakselerator ved søk i DNA
by: Gulbrandsen, Per Andreas
Published: (2007) -
Threats to Bitcoin Software
by: Kateraas, Christian H
Published: (2014)