<--- Back to Details
First PageDocument Content
Algebra / Mathematics / Mathematical analysis / Roofline model / Software optimization / Software testing / Matrix theory / Matrix / Pi
Date: 2015-11-23 13:58:39
Algebra
Mathematics
Mathematical analysis
Roofline model
Software optimization
Software testing
Matrix theory
Matrix
Pi

spcl.inf.ethz.ch @spcl_eth TIMO SCHNEIDER <> DPHPC Recitation Session

Add to Reading List

Source URL: spcl.inf.ethz.ch

Download Document from Source Website

File Size: 522,45 KB

Share Document on Facebook

Similar Documents

FLOPS / Floating point / Computing / Roofline model / Speedup / Xeon / Computer programming / Parallel computing / Software engineering

BOPS, Not FLOPS! A New Metric, Measuring Tool, and Roofline Performance Model For Datacenter Computing Chen Zheng ICT,CAS

DocID: 1xVt0 - View Document

1 Cache-aware Roofline model: Upgrading the loft Aleksandar Ilic, Frederico Pratas, and Leonel Sousa INESC-ID/IST, Technical University of Lisbon, Portugal {ilic,fcpp,las}@inesc-id.pt

DocID: 1rBXE - View Document

Computing / Concurrent computing / Parallel computing / Computer programming / OpenMP / Roofline model / Multi-core processor / Manycore processor / Thread / Benchmark / CUDA / Data parallelism

Roofline Model Toolkit: A Practical Tool for Architectural and Program Analysis Yu Jung Lo, Samuel Williams, Brian Van Straalen, Terry J. Ligocki, Matthew J. Cordery, Nicholas J. Wright, Mary W. Hall, and Leonid Oliker U

DocID: 1rrNN - View Document

Algebra / Computing / Mathematics / Software optimization / Parallel computing / Roofline model / Software testing / Matrix multiplication / NC / Matrix / Lookup table / Matrix multiplication algorithm

Design of Parallel and High Performance Computing HS 2013 Markus P¨ uschel, Torsten Hoefler Department of Computer Science ETH Zurich

DocID: 1rlc8 - View Document

Compiler optimizations / Computing / Software engineering / Software / Loop nest optimization / Stencil code / Roofline model / Stencil / Program optimization / Common subexpression elimination / CPU cache / Scalable locality

Auto-tuning the 27-point Stencil for Multicore Kaushik Datta2 , Samuel Williams1 , Vasily Volkov2 , Jonathan Carter1 , Leonid Oliker1 , John Shalf1 , and Katherine Yelick1 1 CRD/NERSC, Lawrence Berkeley National Laborat

DocID: 1r4gA - View Document