|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 44 occurrences of 42 keywords
|
|
|
Results
Found 62 publication records. Showing 62 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
129 | Liang Gu, Xiaoming Li |
Performance modeling for DFT algorithms in FFTW. |
ICS |
2009 |
DBLP DOI BibTeX RDF |
dft, fftw, model, performance |
98 | Rich Vuduc, James Demmel |
Code Generators for Automatic Tuning of Numerical Kernels: Experiences with FFTW. |
SAIG |
2000 |
DBLP DOI BibTeX RDF |
|
65 | A. M. Alkindi, Darren J. Kerbyson, Efstathios Papaefstathiou, Graham R. Nudd |
Run-Time Optimization Using Dynamic Performance Prediction. |
HPCN |
2000 |
DBLP DOI BibTeX RDF |
Performance Optimisation, Dynamic Performance Prediction, Application Steering, FFTW, Performance Modeling |
64 | Oleg Kiselyov, Walid Taha |
Relating FFTW and Split-Radix. |
ICESS |
2004 |
DBLP DOI BibTeX RDF |
|
34 | Anne C. Elster, Jan Christian Meyer |
A super-efficient adaptable bit-reversal algorithm for multithreaded architectures. |
IPDPS |
2009 |
DBLP DOI BibTeX RDF |
|
34 | Maria Eleftheriou, Blake G. Fitch, Aleksandr Rayshubskiy, T. J. Christopher Ward, Robert S. Germain |
Performance Measurements of the 3D FFT on the Blue Gene/L Supercomputer. |
Euro-Par |
2005 |
DBLP DOI BibTeX RDF |
|
34 | Franz Franchetti, Yevgen Voronenko, Markus Püschel |
Formal loop merging for signal transforms. |
PLDI |
2005 |
DBLP DOI BibTeX RDF |
linear signal transform, domain-specific language, DFT, discrete Fourier transform, loop optimization, automatic performance tuning |
34 | Xiaoming Li 0004, María Jesús Garzarán, David A. Padua |
A Dynamically Tuned Sorting Library. |
CGO |
2004 |
DBLP DOI BibTeX RDF |
|
34 | Franz Franchetti, Stefan Kral, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber |
Automatically Tuned FFTs for BlueGene/L's Double FPU. |
VECPAR |
2004 |
DBLP DOI BibTeX RDF |
|
34 | Matteo Frigo |
A Fast Fourier Transform Compiler. |
PLDI |
1999 |
DBLP DOI BibTeX RDF |
|
31 | Thiab R. Taha, Xiangming Xu |
Parallel Split-Step Fourier Methods for the Coupled Nonlinear Schro"dinger Type Equations. |
J. Supercomput. |
2005 |
DBLP DOI BibTeX RDF |
split-step method, FFTW, parallel algorithms, NLS |
30 | Rafael L. Delgado, Sebastian Steinbeißer, Michael Strickland, Johannes Heinrich Weber |
The relativistic Schrödinger equation through FFTW 3: An extension of quantumfdtd. |
Comput. Phys. Commun. |
2022 |
DBLP DOI BibTeX RDF |
|
30 | Larry T. Pileggi, Siyuan Chen, Keshav Harisrikanth, Guanglin Xu, Ken Mai, Franz Franchetti |
A High Throughput Hardware Accelerator for FFTW Codelets: A First Look. |
HPEC |
2022 |
DBLP DOI BibTeX RDF |
|
30 | S. Biplab Raut |
Performance speedup of Quantum Espresso using optimized AOCL-FFTW. |
HPEC |
2022 |
DBLP DOI BibTeX RDF |
|
30 | Oleg Gradov |
Honeysuckle flower: multiscale nervature detection using FFTW-based sofytware. |
|
2020 |
DOI RDF |
|
30 | Oleg Gradov |
Tilia stem: detection of radial and circular pecularities using FFTW-based software. |
|
2020 |
DOI RDF |
|
30 | Oleg Gradov |
Volvox sp. (Volvox L., 1758) Pfeffer's osmometry: FFTW-based LUCAS-like real time approach. |
|
2020 |
DOI RDF |
|
30 | Oleg Gradov |
Epidermis of allium: FFTW-based regulometry case. |
|
2020 |
DOI RDF |
|
30 | Oleg Gradov |
Fern stem: detection of radial and circular pecularities using FFTW-based software. |
|
2020 |
DOI RDF |
|
30 | Oleg Gradov |
Cusurbits stem: detection of radial and circular pecularities using FFTW-based software. |
|
2020 |
DOI RDF |
|
30 | Santiago Mislata Valero, Federico Silla |
Using Remote Accelerators to Improve the Performance of the FFTW Library. |
HPCC/SmartCity/DSS |
2016 |
DBLP DOI BibTeX RDF |
|
30 | Santiago Mislata Valero, Federico Silla |
On the Execution of Computationally Intensive CPU-Based Libraries on Remote Accelerators for Increasing Performance: Early Experience with the OpenBLAS and FFTW Libraries. |
CLUSTER |
2015 |
DBLP DOI BibTeX RDF |
|
30 | Michael Pippig |
PFFT: An Extension of FFTW to Massively Parallel Architectures. |
SIAM J. Sci. Comput. |
2013 |
DBLP DOI BibTeX RDF |
|
30 | Massimiliano Guarrasi, Ning Li, Sandro Frigio, Andrew P. J. Emerson, Giovanni Erbacci |
Testing and Implementing Some New Algorithms Using the FFTW Library on Massively Parallel Supercomputers. |
PARCO |
2013 |
DBLP DOI BibTeX RDF |
|
30 | David A. Padua |
FFTW. |
Encyclopedia of Parallel Computing |
2011 |
DBLP DOI BibTeX RDF |
|
30 | Teng Ma, Aurélien Bouteiller, George Bosilca, Jack J. Dongarra |
Impact of Kernel-Assisted MPI Communication over Scientific Applications: CPMD and FFTW. |
EuroMPI |
2011 |
DBLP DOI BibTeX RDF |
|
30 | Peter Koval, J. D. Talman |
Update of spherical Bessel transform: FFTW and OpenMP. |
Comput. Phys. Commun. |
2010 |
DBLP DOI BibTeX RDF |
|
30 | Michael Pippig |
An Efficient and Flexible Parallel FFT Implementation Based on FFTW. |
CHPC |
2010 |
DBLP DOI BibTeX RDF |
|
30 | Liang Gu, Xiaoming Li |
DFT Performance Prediction in FFTW. |
LCPC |
2009 |
DBLP DOI BibTeX RDF |
|
30 | Matteo Frigo, Steven G. Johnson |
FFTW: an adaptive software architecture for the FFT. |
ICASSP |
1998 |
DBLP DOI BibTeX RDF |
|
17 | Liang Gu, Xiaoming Li, Jakob Siegel |
An empirically tuned 2D and 3D FFT library on CUDA GPU. |
ICS |
2010 |
DBLP DOI BibTeX RDF |
2D FFT, 3D FFT, empirical tuning, library generation, GPU, CUDA |
17 | Yifeng Chen, Xiang Cui, Hong Mei 0001 |
Large-scale FFT on GPU clusters. |
ICS |
2010 |
DBLP DOI BibTeX RDF |
GPU clusters, array dimensions, FFT |
17 | Mainak Chaudhuri |
PageNUCA: Selected policies for page-grain locality management in large shared chip-multiprocessor caches. |
HPCA |
2009 |
DBLP DOI BibTeX RDF |
|
17 | Yan Li, Li Zhao, Haibo Lin, Alex C. Chow, Jeffrey R. Diamond |
A performance model for Fast Fourier Transform. |
IPDPS |
2009 |
DBLP DOI BibTeX RDF |
|
17 | Yuanrui Zhang, Mahmut T. Kandemir, Nikos Pitsianis, Xiaobai Sun |
Exploring parallelization strategies for NUFFT data translation. |
EMSOFT |
2009 |
DBLP DOI BibTeX RDF |
geometric tiling, non-uniform fft, parallelization, gridding, code generation |
17 | Piotr Wendykier, James G. Nagy |
Large-Scale Image Deblurring in Java. |
ICCS (1) |
2008 |
DBLP DOI BibTeX RDF |
|
17 | Alexandre Duchateau, Albert Sidelnik, María Jesús Garzarán, David A. Padua |
P-Ray: A Software Suite for Multi-core Architecture Characterization. |
LCPC |
2008 |
DBLP DOI BibTeX RDF |
|
17 | Arif Mahmood, Sohaib Khan |
Early Termination Algorithms for Correlation Coefficient Based Block Matching. |
ICIP (2) |
2007 |
DBLP DOI BibTeX RDF |
|
17 | David A. Bader, Virat Agarwal |
FFTC: Fastest Fourier Transform for the IBM Cell Broadband Engine. |
HiPC |
2007 |
DBLP DOI BibTeX RDF |
|
17 | V. Paul Pauca, Todd C. Torgersen, Y. Abraham, J. Schmitt, R. Harris |
Automating the development of quantum computational software. |
ACM Southeast Regional Conference |
2007 |
DBLP DOI BibTeX RDF |
molecular dynamics, compiler design |
17 | Daniel A. Orozco, Liping Xue, Murat Bolat, Xiaoming Li, Guang R. Gao |
Experience of Optimizing FFT on Intel Architectures. |
IPDPS |
2007 |
DBLP DOI BibTeX RDF |
|
17 | Ayaz Ali, S. Lennart Johnsson, Jaspal Subhlok |
Scheduling FFT computation on SMP and multicore systems. |
ICS |
2007 |
DBLP DOI BibTeX RDF |
fast Fourier transform, shared memory, multicore, automatic parallelization, automatic performance tuning |
17 | Sung-Chul Han, Franz Franchetti, Markus Püschel |
Program generation for the all-pairs shortest path problem. |
PACT |
2006 |
DBLP DOI BibTeX RDF |
Floyd-Warshall algorithm, SIMD vectorization, empirical search, tiling, blocking |
17 | Stefan Kral, Markus Triska, Christoph W. Ueberhuber |
Compiler Technology for Blue Gene Systems. |
Euro-Par |
2006 |
DBLP DOI BibTeX RDF |
|
17 | Masayo Haneda, Peter M. W. Knijnenburg, Harry A. G. Wijshoff |
On the impact of data input sets on statistical compiler tuning. |
IPDPS |
2006 |
DBLP DOI BibTeX RDF |
|
17 | Andreas Bonelli, Franz Franchetti, Juergen Lorenz, Markus Püschel, Christoph W. Ueberhuber |
Automatic Performance Optimization of the Discrete Fourier Transform on Distributed Memory Computers. |
ISPA |
2006 |
DBLP DOI BibTeX RDF |
|
17 | Frederik Eaton |
Statically typed linear algebra in Haskell. |
Haskell |
2006 |
DBLP DOI BibTeX RDF |
higher-rank polymorphism, template Haskell, linear algebra, staging, existential types |
17 | Xiaoming Li 0004, María Jesús Garzarán, David A. Padua |
Optimizing Sorting with Genetic Algorithms. |
CGO |
2005 |
DBLP DOI BibTeX RDF |
|
17 | Bryson R. Payne, Saeid Belkasim, G. Scott Owen, Michael Weeks, Ying Zhu 0001 |
Accelerated 2D Image Processing on GPUs. |
International Conference on Computational Science (2) |
2005 |
DBLP DOI BibTeX RDF |
|
17 | Arkady Epshteyn, María Jesús Garzarán, Gerald DeJong, David A. Padua, Gang Ren 0002, Xiaoming Li 0004, Kamen Yotov, Keshav Pingali |
Analytic Models and Empirical Search: A Hybrid Approach to Code Optimization. |
LCPC |
2005 |
DBLP DOI BibTeX RDF |
|
17 | Xiaoming Li 0004, María Jesús Garzarán |
Optimizing Matrix Multiplication with a Classifier Learning System. |
LCPC |
2005 |
DBLP DOI BibTeX RDF |
|
17 | Kamen Yotov, Sandra Jackson, Tyler Steele, Keshav Pingali, Paul Stodghill |
Automatic Measurement of Instruction Cache Capacity. |
LCPC |
2005 |
DBLP DOI BibTeX RDF |
|
17 | Sébastien Donadio, James C. Brodman, Thomas Roeder, Kamen Yotov, Denis Barthou, Albert Cohen 0001, María Jesús Garzarán, David A. Padua, Keshav Pingali |
A Language for the Compact Representation of Multiple Program Versions. |
LCPC |
2005 |
DBLP DOI BibTeX RDF |
|
17 | Oleg Kiselyov, Kedar N. Swadi, Walid Taha |
A methodology for generating verified combinatorial circuits. |
EMSOFT |
2004 |
DBLP DOI BibTeX RDF |
abstract interpretation, multi-stage programming |
17 | Anna Derezinska |
Estimating Dependability of Parallel FFT Application using Fault Injection. |
PARELEC |
2004 |
DBLP DOI BibTeX RDF |
|
17 | Aili Li, Klaus Mueller 0001, Thomas Ernst 0001 |
Methods for Efficient, High Quality Volume Resampling in the Frequency Domain. |
IEEE Visualization |
2004 |
DBLP DOI BibTeX RDF |
filters, Fourier Transform, resampling |
17 | Stefan Kral, Franz Franchetti, Juergen Lorenz, Christoph W. Ueberhuber |
SIMD Vectorization of Straight Line FFT Code. |
Euro-Par |
2003 |
DBLP DOI BibTeX RDF |
|
17 | Marios S. Pattichis, Ruhai Zhou, Balaji Raman 0002 |
New algorithms for computing directional discrete Fourier transforms. |
ICIP (3) |
2001 |
DBLP DOI BibTeX RDF |
|
17 | Gavin J. Pringle, Steven P. Booth, Hugh M. P. Couchman, Frazer R. Pearce, Alan D. Simpson |
Towards a Portable, Fast Parallel AP3M-SPH Code: HYDRA_MPI. |
PVM/MPI |
2001 |
DBLP DOI BibTeX RDF |
|
17 | Jianxin Xiong, Jeremy R. Johnson, Robert W. Johnson, David A. Padua |
SPL: A Language and Compiler for DSP Algorithms. |
PLDI |
2001 |
DBLP DOI BibTeX RDF |
C, FORTRAN |
17 | Kang Su Gatlin, Larry Carter |
Faster FFTs via Architecture-Cognizance. |
IEEE PACT |
2000 |
DBLP DOI BibTeX RDF |
cache, feedback, memory hierarchy, compiler optimization, associativity, divide-and-conquer, ILP, runtime systems, registers, TLB |
17 | Lixin Zhang 0002, Venkata K. Pingali, Bharat Chandramouli, John B. Carter |
Memory System Support for Dynamic Cache Line Assembly. |
Intelligent Memory Systems |
2000 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #62 of 62 (100 per page; Change: )
|
|