Parallel computing / GPGPU / Numerical linear algebra / Computational science / Programming paradigms / OpenCL / General-purpose computing on graphics processing units / Basic Linear Algebra Subprograms / Automatic vectorization / Compute kernel / Kernel / Matrix multiplication algorithm
Date: 2016-01-27 04:16:13

Writing a performance-portable matrix multiplication


Source URL: www.des.udc.es


File Size: 445.49 KB


Similar Documents

Computing / Concurrent computing / Parallel computing / Computer programming / GPGPU / Application programming interfaces / Graphics hardware / Video cards / OpenCL / General-purpose computing on graphics processing units / Compute kernel / Stream processing

A Portable High-Productivity Approach to Program Heterogeneous Systems

DocID: 1rqSu

Computing / Video cards / Computer architecture / Graphics hardware / GPGPU / Parallel computing / General-purpose computing on graphics processing units / Graphics processing unit / Stream processing / Shader / GeForce / Compute kernel

technology from seed Stream-based concurrent computational models and programming tools Leonel Sousa

DocID: 1rhpR

Computing / Parallel computing / Computer architecture / GPGPU / Graphics hardware / Video cards / Computer engineering / Computer graphics / General-purpose computing on graphics processing units / Shader / Graphics processing unit / Compute kernel

INSTITUTO SUPERIOR TÉCNICO FCT Universidade Técnica de Lisboa

DocID: 1r2zS

GPGPU / Computer architecture / Computing / Graphics hardware / Parallel computing / Computer engineering / Graphics processing unit / Virtual reality / Compute kernel / General-purpose computing on graphics processing units / CUDA Pinned memory

spcl.inf.ethz.ch @spcl_eth Polly-ACC: Transparent Compilation to Heterogeneous Hardware Tobias Grosser, Torsten Hoefler

DocID: 1r29t

Computing / Computer architecture / GPGPU / Concurrent computing / Parallel computing / Graphics hardware / Video cards / Nvidia / General-purpose computing on graphics processing units / Graphics processing unit / CUDA / Compute kernel

TECHNISCHE UNIVERSITÄT MÜNCHEN Data access optimized applications on the GPU using NVIDIA CUDA

DocID: 1r1p5