FacetedDBLP

Searching for DGEMM with no syntactic query expansion in all metadata.

Publication years (Num. hits)
2000-2015 (17) 2017-2023 (9)
Publication types (Num. hits)
article(4) inproceedings(22)
Venues (Conferences, Journals, ...)
SC(3) CoRR(2) ICPADS(2) ICPP(2) PARA(2) PDP(2) Euro-Par(1) GPGPU(1) HiPC(1) ICCS(1) ICS(1) IEEE Micro(1) IPDPS(1) ISC(1) J. Comput.(1) PPoPP(1) More (+10 of total 19)

Results
Found 26 publication records. Showing 26 according to the selection in the facets
Hits | Authors | Title | Venue | Year | Author keywords
77 | John A. Gunnels, Fred G. Gustavson, Keshav Pingali, Kamen Yotov | Is Cache-Oblivious DGEMM Viable? | PARA | 2006 |
70 | David S. Wise, Jeremy D. Frens, Yuhong Gu, Gregory A. Alexander | Language support for Morton-order matrices. | PPoPP | 2001 | paging, quadtrees
63 | David Rohr, Matthias Bach, Matthias Kretz, Volker Lindenstruth | Multi-GPU DGEMM and High Performance Linpack on Highly Energy-Efficient Clusters. | IEEE Micro | 2011 | multi-GPU, HPL, High Performance Linpack, DGEMM, double-precision general matrix multiply, GPGPU, system architecture, Green IT, heterogeneous (hybrid) systems
56 | Rolf Rabenseifner, Sunil R. Tiyyagura, Matthias S. Müller | Network Bandwidth Measurements and Ratio Analysis with the HPC Challenge Benchmark Suite (HPCC). | PVM/MPI | 2005 | HPCC, HPL, DGEMM, PTRANS, FFTE, benchmarking, STREAM, latency, effective bandwidth, network bandwidth, Linpack
54 | Daniel Hackenberg, Robert Schöne, Wolfgang E. Nagel, Stefan Pflüger | Optimizing OpenMP Parallelized DGEMM Calls on SGI Altix 3700. | Euro-Par | 2006 |
47 | Stéphane Zuckerman, Marc Pérache, William Jalby | Fine Tuning Matrix Multiplications on Multicore. | HiPC | 2008 | multicore, cache coherency, BLAS
30 | Hiroyuki Ootomo, Katsuhisa Ozaki, Rio Yokota | DGEMM on Integer Matrix Multiplication Unit. | CoRR | 2023 |
30 | Pedro Valero-Lara, Ian Jorquera, Frank Liu 0001, Jeffrey S. Vetter | Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores. | SC Workshops | 2023 |
30 | Jialin Li, Huang Ye, Shaobo Tian, Xinyuan Li, Jian Zhang 0070 | A Fine-grained Prefetching Scheme for DGEMM Kernels on GPU with Auto-tuning Compatibility. | IPDPS | 2022 |
30 | Yi Wei, Lin Deng, Sizheng Sun, Sisi Li, Li Shen 0007 | DGEMM Optimization Oriented to ARM SVE Instruction Set Architecture. | ICPADS | 2022 |
30 | Daichi Mukunoki, Katsuhisa Ozaki, Takeshi Ogita, Toshiyuki Imamura | DGEMM Using Tensor Cores, and Its Accurate and Reproducible Versions. | ISC | 2020 |
30 | Tom Cornebize, Arnaud Legrand | DGEMM performance is data-dependent. | CoRR | 2019 |
30 | Pedro Valero-Lara, Ivan Martínez-Pérez, Sergi Mateo, Raül Sirvent, Vicenç Beltran 0001, Xavier Martorell, Jesús Labarta | Variable Batched DGEMM. | PDP | 2018 |
30 | John D. McCalpin | HPL and DGEMM performance variability on the Xeon Platinum 8160 processor. | SC | 2018 |
30 | Lijuan Jiang, Chao Yang 0002, Yulong Ao, Wanwang Yin, Wenjing Ma, Qiao Sun, Fangfang Liu, Rongfen Lin, Peng Zhang | Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor. | ICPP | 2017 |
30 | David Rohr, Volker Lindenstruth | A Flexible and Portable Large-Scale DGEMM Library for Linpack on Next-Generation Multi-GPU Systems. | PDP | 2015 |
30 | Hao Jiang 0001, Feng Wang, Kuan Li, Canqun Yang, Kejia Zhao, Chun Huang | Implementation of an Accurate and Efficient Compensated DGEMM for 64-bit ARMv8 Multi-Core Processors. | ICPADS | 2015 |
30 | Feng Wang, Hao Jiang 0001, Ke Zuo, Xing Su, Jingling Xue, Canqun Yang | Design and Implementation of a Highly Efficient DGEMM for 64-Bit ARMv8 Multi-core Processors. | ICPP | 2015 |
30 | Pawel Gepner, Victor Gamayunov, David L. Fraser, Eric Houdard, Ludovic Sauge, Damien Déclat, Mathieu Dubois | Evaluation of DGEMM Implementation on Intel Xeon Phi Coprocessor. | J. Comput. | 2014 |
30 | Pawel Gepner, Victor Gamayunov, David L. Fraser | Effective Implementation of DGEMM on Modern Multicore CPU. | ICCS | 2012 |
30 | Gideon Nimako, Ekow J. Otoo, Daniel Ohene-Kwofie | Cache-sensitive MapReduce DGEMM algorithms for shared memory architectures. | SAICSIT | 2012 |
30 | Jiajia Li 0001, Xingjian Li 0002, Guangming Tan, Mingyu Chen 0001, Ninghui Sun | An optimized large-scale hybrid DGEMM design for CPUs and ATI GPUs. | ICS | 2012 |
30 | Guangming Tan, Linchuan Li, Sean Triechle, Everett H. Phillips, Yungang Bao, Ninghui Sun | Fast implementation of DGEMM on Fermi GPU. | SC | 2011 |
23 | Akira Nukada, Satoshi Matsuoka | Auto-tuning 3-D FFT library for CUDA GPUs. | SC | 2009 |
23 | Massimiliano Fatica | Accelerating linpack with CUDA on heterogenous clusters. | GPGPU | 2009 |
23 | Fred G. Gustavson, Isak Jonsson | High Performance Cholesky Factorization via Blocking and Recursion That Uses Minimal Storage. | PARA | 2000 | packed format, level 3 BLAS parallelism, recursive algorithm, Cholesky factorization, recursive data structure
Displaying result #1 - #26 of 26
Maintained by L3S.
Previously maintained by Jörg Diederich.
Based upon DBLP by Michael Ley.
Data released under the ODC-BY 1.0 license