عنوان فارسی مقاله: عملکرد و مقایسه انرژی FPGA ها، GPU ها، و چند هسته ای ها برای برنامه های کاربردی پنجره لغزان
عنوان انگلیسی مقاله:
فهرست مطالب
A Performance and Energy Comparison of FPGAs, GPUs, and Multicores for Sliding-Window Applications
Introduction
Case Study: Sliding Window
Sliding Window Applications
App 1: Sum of Absolute Differences (SAD)
App 2: 2D Convolution
App 3: Correntropy
Devices Targeted
FPGA Architecture
Window Generator
FPGA Architecture
FPGA Datapaths
FPGA Datapaths Cont.
GPU CUDA Framework
GPU CUDA Framework Cont.
GPU Implementations
CPU OpenCL Implementations
Experimental Setup
Application Case StudiesSum of Absolute Differences
Application Case Studies2D Convolution
Application Case StudiesCorrentropy
Speedup
Single Chip Implementations
Energy Comparison
Future Work
Conclusion
بخشی از مقاله
GPU Implementations
SAD: each thread computes SAD between kernel and the 4 windows in its Macro Block
2D Convolution: like SAD, but with multiply-accumulate
2D FFT Convolution: used CUFFT to implement frequency domain version
Correntropy: adds Gaussian lookup table to SAD, computes max values in parallel post processing
کلمات کلیدی:
A Tradeoff Analysis of FPGAs, GPUs, and Multicores for Sliding ...https://www.researchgate.net/.../273897505_A_Tradeoff_Analysis_of_FPGAs_GPUs_an...The results show that, for large input sizes, FPGAs can achieve speedups of up to 5.6x and 58x compared to GPUs and multicore CPUs, respectively, while also ...A Tradeoff Analysis of FPGAs, GPUs, and Multicores for Sliding ... - DOIshttps://doi.org/10.1145/2659000Mar 6, 2015 - Jeremy Fowers , Greg Brown , Patrick Cooke , Greg Stitt, A performance and energy comparison of FPGAs, GPUs, and multicores for ...[PDF]FPGA-GPU-CPU Heterogenous Architecture for Real-time - UCSD CSEcseweb.ucsd.edu/~kastner/papers/fpt12-rt_om.pdfby P Meng - Cited by 15 - Related articlesThis represents a 273× speed up over a multi-core CPU implementation. ..... energy comparison of FPGAs, GPUs, and multicores for sliding-window applications ...Separation Logic for High-level Synthesishttps://books.google.com/books?isbn=3319532227Felix Winterstein - 2017 - Technology & EngineeringS.T. Fleming, D.B. Thomas, F. Winterstein, FPGAs and Parallel Architectures for ... A performance and energy comparison of fpgas, gpus, and multicores for ...dblp: Patrick Cookedblp.uni-trier.de › PersonsJan 5, 2017 - A comparison of correntropy-based feature tracking on FPGAs and GPUs. ... comparison of FPGAs, GPUs, and multicores for sliding-window ...