Using graphics processor on B-Spline finite element analysis
碩士 === 國立成功大學 === 機械工程學系碩博士班 === 95 === This paper discusses how to use the graphics processor on B-Spline finite element analysis and studies the performance analysis. The basic linear algebraic arithmetic operations on the graphics processor, such as vector inner product, vector-vector addition an...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2007
|
Online Access: | http://ndltd.ncl.edu.tw/handle/58206200551958860054 |
id |
ndltd-TW-095NCKU5490024 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-095NCKU54900242015-10-13T14:16:09Z http://ndltd.ncl.edu.tw/handle/58206200551958860054 Using graphics processor on B-Spline finite element analysis 使用圖形處理器於B-Spline有限元素分析 Wu-Yung Chen 陳武勇 碩士 國立成功大學 機械工程學系碩博士班 95 This paper discusses how to use the graphics processor on B-Spline finite element analysis and studies the performance analysis. The basic linear algebraic arithmetic operations on the graphics processor, such as vector inner product, vector-vector addition and multiplication, full matrix-vector multiplication, full matrix-matrix multiplication and sparse matrix-vector multiplication are studied in this paper. The performances are improved when executed by different threads and compared with the CUBLAS(Basic Linear Algebraic Subprogram)functions. Beside, the data of sparse matrix’s non zero elements are sorted to gain the performance when executed sparse matrix-vector multiplication. Finally, I used the iterative method to solve a B-Spline finite element problem. The Jacobi precondition conjugate gradient method is used in the iterative method. At the conclusion of finite element problem, the accuracy of the single precision floating point operation in graphics processor can’t absolutely converge to the result when compared with central processing unit that used the double precision floating point operation. The errors are about 0.04~2 %. The graphics processor used is the GeForce 8800 GTX, which is introduced in the beginning of this year by Nvidia company. GeForce 8800 GTX includes 128 stream processors and operates on SIMD(Single Instruction Multiple Data)parallel model. The peak performance can reach about 350Gflops and the bandwidth of data transfer is 86.5GB/s. The GeForce 8800 GTX has a new architecture called CUDA(Compute Unified Device Architecture) that is integrated and independent of GeForce 8800 GTX stream processor’s calculation capability. When compared with the traditional graphics processor, the GeForce 8800 GTX has less limitation to use the 128 stream processors. Shi-Pin Ho 何旭彬 2007 學位論文 ; thesis 66 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 機械工程學系碩博士班 === 95 === This paper discusses how to use the graphics processor on B-Spline finite element analysis and studies the performance analysis. The basic linear algebraic arithmetic operations on the graphics processor, such as vector inner product, vector-vector addition and multiplication, full matrix-vector multiplication, full matrix-matrix multiplication and sparse matrix-vector multiplication are studied in this paper. The performances are improved when executed by different threads and compared with the CUBLAS(Basic Linear Algebraic Subprogram)functions. Beside, the data of sparse matrix’s non zero elements are sorted to gain the performance when executed sparse matrix-vector multiplication. Finally, I used the iterative method to solve a B-Spline finite element problem. The Jacobi precondition conjugate gradient method is used in the iterative method. At the conclusion of finite element problem, the accuracy of the single precision floating point operation in graphics processor can’t absolutely converge to the result when compared with central processing unit that used the double precision floating point operation. The errors are about 0.04~2 %.
The graphics processor used is the GeForce 8800 GTX, which is introduced in the beginning of this year by Nvidia company. GeForce 8800 GTX includes 128 stream processors and operates on SIMD(Single Instruction Multiple Data)parallel model. The peak performance can reach about 350Gflops and the bandwidth of data transfer is 86.5GB/s. The GeForce 8800 GTX has a new architecture called CUDA(Compute Unified Device Architecture) that is integrated and independent of GeForce 8800 GTX stream processor’s calculation capability. When compared with the traditional graphics processor, the GeForce 8800 GTX has less limitation to use the 128 stream processors.
|
author2 |
Shi-Pin Ho |
author_facet |
Shi-Pin Ho Wu-Yung Chen 陳武勇 |
author |
Wu-Yung Chen 陳武勇 |
spellingShingle |
Wu-Yung Chen 陳武勇 Using graphics processor on B-Spline finite element analysis |
author_sort |
Wu-Yung Chen |
title |
Using graphics processor on B-Spline finite element analysis |
title_short |
Using graphics processor on B-Spline finite element analysis |
title_full |
Using graphics processor on B-Spline finite element analysis |
title_fullStr |
Using graphics processor on B-Spline finite element analysis |
title_full_unstemmed |
Using graphics processor on B-Spline finite element analysis |
title_sort |
using graphics processor on b-spline finite element analysis |
publishDate |
2007 |
url |
http://ndltd.ncl.edu.tw/handle/58206200551958860054 |
work_keys_str_mv |
AT wuyungchen usinggraphicsprocessoronbsplinefiniteelementanalysis AT chénwǔyǒng usinggraphicsprocessoronbsplinefiniteelementanalysis AT wuyungchen shǐyòngtúxíngchùlǐqìyúbsplineyǒuxiànyuánsùfēnxī AT chénwǔyǒng shǐyòngtúxíngchùlǐqìyúbsplineyǒuxiànyuánsùfēnxī |
_version_ |
1717750508992593920 |