Parallel computing / GPGPU / Numerical linear algebra / Computational science / Programming paradigms / OpenCL / General-purpose computing on graphics processing units / Basic Linear Algebra Subprograms / Automatic vectorization / Compute kernel / Kernel / Matrix multiplication algorithm
Date: 2016-01-27 04:16:13

Writing a performance-portable matrix multiplication

Source URL: www.des.udc.es

File Size: 445.49 KB

Similar Documents

Parallel computing / GPGPU / Numerical linear algebra / Computational science / Programming paradigms / OpenCL / General-purpose computing on graphics processing units / Basic Linear Algebra Subprograms / Automatic vectorization / Compute kernel / Kernel / Matrix multiplication algorithm

Writing a performance-portable matrix multiplication

DocID: 1p8i5 - View Document

Computer programming / Automatic parallelization / Dynamic recompilation / Binary translation / Vectorization / Profiling / Loop unwinding / Pin / Multithreading / Computing / Parallel computing / Compiler optimizations

Journal of Instruction-Level Parallelism. Submitted 6/07; published 6/08. Dynamic Parallelization and Vectorization of Binary Executables on Hierarchical Platforms

DocID: 1fArT - View Document

Compiler construction / Software engineering / Automatic parallelization / Loop optimization / Vectorization / Code generation / Parallel computing / Schedule / Polytope model / Compiler optimizations / Computing / Programming language theory

Transparent Parallelization of Binary Code. Benoît Pradelle, Alain Ketterlin, Philippe Clauss

DocID: 17wGi - View Document

Programming language theory / Loop optimization / Static single assignment form / Polytope model / Automatic parallelization / GNU Compiler Collection / Vectorization / Loop tiling / Fortran / Computing / Compiler optimizations / Software engineering

Optimization opportunities based on the polyhedral model in GRAPHITE: How much impact has GRAPHITE already? Tobias Grosser, University of Passau

DocID: 14Zbg - View Document

Software engineering / OpenMP / Loop scheduling / Fortran / For loop / Vectorization / Barrier / Automatic parallelization tool / Computing / Computer programming / Parallel computing

The Thermoflow60 Finite-Element Program. Ulrich Wepler 1, Dieter an Mey 2, Thomas Haarmann 3, Wolfgang Koschel 4. 1) German Aerospace Center (DLRCenter for Computing and Communication, Aachen University (RWTH)

DocID: 13wiM - View Document