Writing a performance-portable matrix multiplication

First Page		Document Content
Date: 2016-01-27 04:16:13 Parallel computing GPGPU Numerical linear algebra Computational science Programming paradigms OpenCL General-purpose computing on graphics processing units Basic Linear Algebra Subprograms Automatic vectorization Compute kernel Kernel Matrix multiplication algorithm		Writing a performance-portable matrix multiplication Add to Reading List Source URL: www.des.udc.es Download Document from Source Website File Size: 445,49 KB Share Document on Facebook

	A Portable High-Productivity Approach to Program Heterogeneous Systems DocID: 1rqSu - View Document
	technology from seed Stream-based concurrent computational models and programming tools Leonel Sousa DocID: 1rhpR - View Document
	INSTITUTO SUPERIOR TÉCNICO FCT Universidade Técnica de Lisboa DocID: 1r2zS - View Document
	spcl.inf.ethz.ch @spcl_eth Polly-ACC: Transparent Compilation to Heterogeneous Hardware Tobias Grosser, Torsten Hoefler DocID: 1r29t - View Document
	¨ MUNCHEN ¨ TECHNISCHE UNIVERSITAT Data access optimized applications on the GPU using NVIDIA CUDA DocID: 1r1p5 - View Document