CUDA Application Design and Development. (Record no. 918403)

001 - CONTROL NUMBER
control field EBL806486
006 - FIXED-LENGTH DATA ELEMENTS--ADDITIONAL MATERIAL CHARACTERISTICS
additional material characteristics m d
007 - PHYSICAL DESCRIPTION FIXED FIELD--GENERAL INFORMATION
fixed length control field cr -n---------
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 151219s2014||||||| s|||||||||||eng|d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number 9780123884329
Terms of availability 89.91 (UA),59.94 (1U)
035 ## - SYSTEM CONTROL NUMBER
System control number (AU-PeEL)806486
035 ## - SYSTEM CONTROL NUMBER
System control number (OCoLC)763159035
040 ## - CATALOGING SOURCE
Original cataloging agency AU-PeEL
Language of cataloging eng
Transcribing agency AU-PeEL
Modifying agency AU-PeEL
050 #4 - LIBRARY OF CONGRESS CALL NUMBER
Classification number QA76.76.A65 -- F37 2012eb
082 00 - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 005.3
090 ## - LOCALLY ASSIGNED LC-TYPE CALL NUMBER (OCLC); LOCAL CALL NUMBER (OCLC)
Classification number (OCLC) (R) ; Classification number, CALL (RLIN) (NR) QA76.76.A65 -- F37 2012eb
100 1# - MAIN ENTRY--PERSONAL NAME
Personal name Farber, Rob.
245 10 - TITLE STATEMENT
Title CUDA Application Design and Development.
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Place of publication, distribution, etc St. Louis :
Name of publisher, distributor, etc Elsevier Science,
Date of publication, distribution, etc 2014.
300 ## - PHYSICAL DESCRIPTION
Extent 1 online resource (337 p.)
490 0# - SERIES STATEMENT
Series statement eBooks on Demand
505 0# - FORMATTED CONTENTS NOTE
Formatted contents note Front Cover; CUDA Application Design and Development; Copyright; Dedication; Table of Contents; Foreword; Preface; 1 First Programs and How to Think in CUDA; Source Code and Wiki; Distinguishing CUDA from Conventional Programming with a Simple Example; Choosing a CUDA API; Some Basic CUDA Concepts; Understanding Our First Runtime Kernel; Three Rules of GPGPU Programming; Rule 1: Get the Data on the GPU and Keep It There; Rule 2: Give the GPGPU Enough Work to Do; Rule 3: Focus on Data Reuse within the GPGPU to Avoid Memory Bandwidth Limitations; Big-O Considerations and Data Transfers
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note CUDA and Amdahl's LawData and Task Parallelism; Hybrid Execution: Using Both CPU and GPU Resources; Regression Testing and Accuracy; Silent Errors; Introduction to Debugging; UNIX Debugging; NVIDIA's cuda-gdb Debugger; The CUDA Memory Checker; Use cuda-gdb with the UNIX ddd Interface; Windows Debugging with Parallel Nsight; Summary; 2 CUDA for Machine Learning and Optimization; Modeling and Simulation; Fitting Parameterized Models; Nelder-Mead Method; Levenberg-Marquardt Method; Algorithmic Speedups; Machine Learning and Neural Networks; XOR: An Important Nonlinear Machine-Learning Problem
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note An Example Objective FunctionA Complete Functor for Multiple GPU Devices and the Host Processors; Brief Discussion of a Complete Nelder-Mead Optimization Code; Performance Results on XOR; Performance Discussion; Summary; The C++ Nelder-Mead Template; 3 The CUDA Tool Suite: Profiling a PCA/NLPCA Functor; PCA and NLPCA; Autoencoders; An Example Functor for PCA Analysis; An Example Functor for NLPCA Analysis; Obtaining Basic Profile Information; Gprof: A Common UNIX Profiler; The NVIDIA Visual Profiler: Computeprof; Parallel Nsight for Microsoft Visual Studio; The Nsight Timeline Analysis
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note The NVTX Tracing LibraryScaling Behavior of the CUDA API; Tuning and Analysis Utilities (TAU); Summary; 4 The CUDA Execution Model; GPU Architecture Overview; Thread Scheduling: Orchestrating Performance and Parallelism via the Execution Configuration; Relevant computeprof Values for a Warp; Warp Divergence; Guidelines for Warp Divergence; Relevant computeprof Values for Warp Divergence; Warp Scheduling and TLP; Relevant computeprof Values for Occupancy; ILP: Higher Performance at Lower Occupancy; ILP Hides Arithmetic Latency; ILP Hides Data Latency; ILP in the Future
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note Relevant computeprof Values for Instruction RatesLittle's Law; CUDA Tools to Identify Limiting Factors; The nvcc Compiler; Launch Bounds; The Disassembler; PTX Kernels; GPU Emulators; Summary; 5 CUDA Memory; The CUDA Memory Hierarchy; GPU Memory; L2 Cache; Relevant computeprof Values for the L2 Cache; L1 Cache; Relevant computeprof Values for the L1 Cache; CUDA Memory Types; Registers; Local memory; Relevant computeprof Values for Local Memory Cache; Shared Memory; Relevant computeprof Values for Shared Memory; Constant Memory; Texture Memory; Relevant computeprof Values for Texture Memory
505 8# - FORMATTED CONTENTS NOTE
Formatted contents note Global Memory
520 ## - SUMMARY, ETC.
Summary, etc As the computer industry retools to leverage massively parallel graphics processing units (GPUs), this book is designed to meet the needs of working software developers who need to understand GPU programming with CUDA and increase efficiency in their projects. <i>CUDA Application Design and Development</i> starts with an introduction to parallel computing concepts for readers with no previous parallel experience, and focuses on issues of immediate importance to working software developers: achieving high performance, maintaining competitiveness, analyzing CUDA benefits versus costs, and deter
588 ## -
-- Description based upon print version of record.
650 #4 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Application software -- Development.
650 #4 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Computer architecture.
650 #4 - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Parallel programming (Computer science).
655 #0 - INDEX TERM--GENRE/FORM
Genre/form data or focus term Electronic books.
776 08 - ADDITIONAL PHYSICAL FORM ENTRY
Display text Print version:
Main entry heading Farber, Rob
Title CUDA Application Design and Development
Place, publisher, and date of publication St. Louis : Elsevier Science,c2014
International Standard Book Number 9780123884268
856 40 - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier <a href="http://uttyler.eblib.com/patron/FullRecord.aspx?p=806486">http://uttyler.eblib.com/patron/FullRecord.aspx?p=806486</a>
Link text Click here to view this ebook.
901 ## - LOCAL DATA ELEMENT A, LDA (RLIN)
Platform EBL
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Koha item type Electronic Book
Source of classification or shelving scheme
Holdings
Withdrawn status Lost item Source of classification or shelving scheme Damaged status Not for loan Permanent Location Current Location Shelving location Date acquired Full call number Barcode Date last seen Uniform Resource Identifier Price effective from Koha item type
          UT Tyler Online UT Tyler Online Online 2015-12-21 QA76.76.A65 -- F37 2012eb EBL806486 2015-12-21 http://uttyler.eblib.com/patron/FullRecord.aspx?p=806486 2015-12-21 Electronic Book