Results
Found 168 publication records. Showing 168 according to the selection in the facets
Hits |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
85 | Michael A. Bender, Gerth Stølting Brodal, Rolf Fagerberg, Riko Jacob, Elias Vicari |
Optimal sparse matrix dense vector multiplication in the I/O-model. |
SPAA |
2007 |
DBLP DOI BibTeX RDF |
I/O-model, sparse matrix dense vector multiplication, lower bound, external memory algorithms |
64 | E. Yuan, Yunquan Zhang, Xiangzheng Sun |
Memory Access Complexity Analysis of SpMV in RAM (h) Model. |
HPCC |
2008 |
DBLP DOI BibTeX RDF |
|
64 | Vasileios Karakasis, Georgios I. Goumas, Nectarios Koziris |
Exploring the effect of block shapes on the performance of sparse kernels. |
IPDPS |
2009 |
DBLP DOI BibTeX RDF |
|
64 | Vasileios Karakasis, Georgios I. Goumas, Nectarios Koziris |
A Comparative Study of Blocking Storage Methods for Sparse Matrices on Multicore Architectures. |
CSE (1) |
2009 |
DBLP DOI BibTeX RDF |
|
64 | Li Wang 0027, Xuejun Yang, Guibin Wang, Xiaobo Yan, Yu Deng 0001, Jing Du 0002, Ying Zhang 0032, Tao Tang 0001, Kun Zeng |
Implementation and Optimization of Sparse Matrix-Vector Multiplication on Imagine Stream Processor. |
ISPA |
2007 |
DBLP DOI BibTeX RDF |
stream processor, Imagine, Sparse Matrix-Vector Multiplication |
64 | Leonid Oliker, Xiaoye S. Li, Gerd Heber, Rupak Biswas |
Ordering Unstructured Meshes for Sparse Matrix Computations on Leading Parallel Systems. |
IPDPS Workshops |
2000 |
DBLP DOI BibTeX RDF |
|
43 | Mina Ashoury, Mohammad Loni, Farshad Khunjush, Masoud Daneshtalab |
Auto-SpMV: Automated Optimizing SpMV Kernels on GPU. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
42 | Lixia Liu, Zhiyuan Li 0001 |
A compiler-automated array compression scheme for optimizing memory intensive programs. |
ICS |
2010 |
DBLP DOI BibTeX RDF |
adaptive code selection, bandwidth consumption reduction, compiler implementation, memory intensive programs, compression |
42 | JeeWhan Choi, Amik Singh, Richard W. Vuduc |
Model-driven autotuning of sparse matrix-vector multiply on GPUs. |
PPoPP |
2010 |
DBLP DOI BibTeX RDF |
performance modeling, gpu, sparse matrix-vector multiplication |
42 | Tilman Küstner, Josef Weidendorfer, Jasmine Schirmer, Tobias Klug, Carsten Trinitis, Sybille Ziegler |
Parallel MLEM on Multicore Architectures. |
ICCS (1) |
2009 |
DBLP DOI BibTeX RDF |
|
42 | Nathan Bell, Michael Garland |
Implementing sparse matrix-vector multiplication on throughput-oriented processors. |
SC |
2009 |
DBLP DOI BibTeX RDF |
|
42 | Seyong Lee, Rudolf Eigenmann |
Adaptive runtime tuning of parallel sparse matrix-vector multiplication on distributed memory systems. |
ICS |
2008 |
DBLP DOI BibTeX RDF |
runtime tuning, sparse matrix, process mapping |
42 | Samuel Williams 0001, Leonid Oliker, Richard W. Vuduc, John Shalf, Katherine A. Yelick, James Demmel |
Optimization of sparse matrix-vector multiplication on emerging multicore platforms. |
SC |
2007 |
DBLP DOI BibTeX RDF |
|
42 | Benjamin C. Lee, Richard W. Vuduc, James Demmel, Katherine A. Yelick |
Performance Models for Evaluation and Automatic Tuning of Symmetric Sparse Matrix-Vector Multiply. |
ICPP |
2004 |
DBLP DOI BibTeX RDF |
|
31 | Mehmet Belgin, Godmar Back, Calvin J. Ribbens |
Pattern-based sparse matrix representation for memory-efficient SMVM kernels. |
ICS |
2009 |
DBLP DOI BibTeX RDF |
bcoo, bcsr, matrix splitting, pbr, smvm, spmv, prefetching, vectorization, memory bandwidth, iterative solvers, sparse computations, csr |
31 | Samuel Williams 0001, John Shalf, Leonid Oliker, Shoaib Kamil 0001, Parry Husbands, Katherine A. Yelick |
Scientific Computing Kernels on the Cell Processor. |
Int. J. Parallel Program. |
2007 |
DBLP DOI BibTeX RDF |
GEMM, SpMV, three level memory, FFT, sparse matrix, Cell processor, Stencil |
31 | Samuel Williams 0001, John Shalf, Leonid Oliker, Shoaib Kamil 0001, Parry Husbands, Katherine A. Yelick |
The potential of the cell processor for scientific computing. |
Conf. Computing Frontiers |
2006 |
DBLP DOI BibTeX RDF |
GEMM, SpMV, three level memory, FFT, sparse matrix, cell processor, stencil |
21 | Utpal Kiran, Deepak Sharma 0001, Sachin Singh Gautam |
An efficient framework for matrix-free SpMV computation on GPU for elastoplastic problems. |
Math. Comput. Simul. |
2024 |
DBLP DOI BibTeX RDF |
|
21 | Evann Regnault, Bérenger Bramas |
SPC5: An efficient SpMV framework vectorized using ARM SVE and x86 AVX-512. |
Comput. Sci. Inf. Syst. |
2024 |
DBLP DOI BibTeX RDF |
|
21 | Mayez Al-Mouhamed, Lutfi A. Firdaus, Ayaz H. Khan, Nazeeruddin Mohammad |
SpMV and BiCG-Stab sparse solver on Multi-GPUs for reservoir simulation. |
Multim. Tools Appl. |
2024 |
DBLP DOI BibTeX RDF |
|
21 | Jianhua Gao, Weixing Ji, Jie Liu, Yizhuo Wang, Feng Shi 0009 |
Revisiting thread configuration of SpMV kernels on GPU: A machine learning based approach. |
J. Parallel Distributed Comput. |
2024 |
DBLP DOI BibTeX RDF |
|
21 | Manoj B. Rajashekar, Xingyu Tian, Zhenman Fang |
HiSpMV: Hybrid Row Distribution and Vector Buffering for Imbalanced SpMV Acceleration on FPGAs. |
FPGA |
2024 |
DBLP DOI BibTeX RDF |
|
21 | Panagiotis Mpakos, Ioanna Tasou, Chloe Alverti, Panagiotis Miliadis, Pavlos Malakonakis, Dimitris Theodoropoulos, Georgios I. Goumas, Dionisios N. Pnevmatikatos, Nectarios Koziris |
Open-Source SpMV Multiplication Hardware Accelerator for FPGA-Based HPC Systems. |
ARC |
2024 |
DBLP DOI BibTeX RDF |
|
21 | Whijin Kim, Hana Kim, Jihye Lee, Hyunji Kim, Ji-Hoon Kim |
Multi-Mode SpMV Accelerator for Transprecision PageRank With Real-World Graphs. |
IEEE Access |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Chi Zhang, Paul Scheffler, Thomas Benz, Matteo Perotti, Luca Benini |
Near-Memory Parallel Indexing and Coalescing: Enabling Highly Efficient Indirect Access for SpMV. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Christodoulos Stylianou, Mark Klaisoongnoen, Ricardo Jesus, Nick Brown 0002, Michèle Weiland |
Morpheus unleashed: Fast cross-platform SpMV on emerging architectures. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Evann Regnault, Bérenger Bramas |
SPC5: an efficient SpMV framework vectorized using ARM SVE and x86 AVX-512. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Jens Saak, Jonas Schulze |
Diagonally-Addressed Matrix Nicknack: How to improve SpMV performance. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Panagiotis Mpakos, Dimitrios Galanopoulos, Petros Anastasiadis, Nikela Papadopoulou, Nectarios Koziris, Georgios I. Goumas |
Feature-based SpMV Performance Analysis on Contemporary Devices. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Sardar Usman, Rashid Mehmood, Iyad A. Katib, Aiiad Albeshri, Saleh M. Altowaijri |
ZAKI: A Smart Method and Tool for Automatic Performance Optimization of Parallel SpMV Computations on Distributed Memory Machines. |
Mob. Networks Appl. |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Siyi Hu, Makiko Ito, Takahide Yoshikawa, Yuan He 0002, Hiroshi Nakamura, Masaaki Kondo |
Adaptive Lossy Data Compression Extended Architecture for Memory Bandwidth Conservation in SpMV. |
IEICE Trans. Inf. Syst. |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Bowen Liu, Dajiang Liu |
Towards High-Bandwidth-Utilization SpMV on FPGAs via Partial Vector Duplication. |
ASP-DAC |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Alexandre Rodrigues 0006, Leonel Sousa, Aleksandar Ilic |
Performance Modelling-Driven Optimization of RISC-V Hardware for Efficient SpMV. |
ISC Workshops |
2023 |
DBLP DOI BibTeX RDF |
|
21 | José Oliver 0002, Carlos Álvarez 0001, Teresa Cervero, Xavier Martorell, John D. Davis, Eduard Ayguadé |
Accelerating SpMV on FPGAs Through Block-Row Compress: A Task-Based Approach. |
FPL |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Dimitrios Galanopoulos, Panagiotis Mpakos, Petros Anastasiadis, Nectarios Koziris, Georgios I. Goumas |
Invited paper: An Artificial Matrix Generator for Multi-platform SpMV Performance Analysis. |
IPDPS Workshops |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Panagiotis Mpakos, Dimitrios Galanopoulos, Petros Anastasiadis, Nikela Papadopoulou, Nectarios Koziris, Georgios I. Goumas |
Feature-based SpMV Performance Analysis on Contemporary Devices. |
IPDPS |
2023 |
DBLP DOI BibTeX RDF |
|
21 | José Oliver 0002, Carlos Álvarez 0001, Teresa Cervero, Xavier Martorell, John D. Davis, Eduard Ayguadé |
b8c: SpMV accelerator implementation leveraging high memory bandwidth. |
FCCM |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Deshun Bi, Xiaowen Tian, Shengguo Li, Dezun Dong |
Efficiently Running SpMV on Multi-Core DSPs for Block Sparse Matrix. |
ICPADS |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Jingshan Pan, Lei Xiao, Min Tian, Li Wang, Chaochao Yang, Renjiang Chen, Zenghui Ren, Anjun Liu, Guanghui Zhu |
hsSpMV: A Heterogeneous and SPM-aggregated SpMV for SW26010-Pro many-core processor. |
CCGrid |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Dheeraj Ramchandani, Bahar Asgari, Hyesoon Kim |
Spica: Exploring FPGA Optimizations to Enable an Efficient SpMV Implementation for Computations at Edge. |
EDGE |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Jiafan Jiang, Jianqiang Huang, Haodong Bian |
GTLB: A Load-Balanced SpMV Computation Method on GPU. |
HP3C |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Gelin Fu, Tian Xia 0008, Shaoru Qu, Zhongpei Luo, Shuyu Li, Pengyu Cheng, Runfan Guo, Yitong Ding, Pengju Ren |
PrSpMV: An Efficient Predictable Kernel for SpMV. |
ICCD |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Jihu Guo, Jie Liu 0002, Qinglin Wang, Xiaoxiong Zhu |
Optimizing CSR-Based SpMV on a New MIMD Architecture Pezy-SC3s. |
ICA3PP (2) |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Deshun Bi, Shengguo Li, Yichen Zhang, Xiaojian Yang, Dezun Dong |
Efficiently Running SpMV on Multi-core DSPs for Banded Matrix. |
ICA3PP (5) |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Genshen Chu, Yuanjie He, Lingyu Dong, Zhezhao Ding, Dandan Chen, He Bai 0005, Xuesong Wang, Changjun Hu |
Efficient Algorithm Design of Optimizing SpMV on GPU. |
HPDC |
2023 |
DBLP DOI BibTeX RDF |
|
21 | Huanyu Cui, Nianbin Wang, Yuhua Wang, Qilong Han, Yuezhu Xu |
An effective SPMV based on block strategy and hybrid compression on GPU. |
J. Supercomput. |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Huachen Tan |
An asynchronous and parallel row-wise compressed SpMV kernel on heterogeneous CPU-GPU architectures. |
Int. J. Embed. Syst. |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Zhen Du, Jiajia Li 0001, Yinshan Wang, Xueqi Li 0001, Guangming Tan, Ninghui Sun |
AlphaSparse: Generating High Performance SpMV Codes Directly from Sparse Matrices. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Thaha Mohammed 0001, Rashid Mehmood |
Performance Enhancement Strategies for Sparse Matrix-Vector Multiplication (SpMV) and Iterative Linear Solvers. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Chong Chen |
Explicit caching HYB: a new high-performance SpMV framework on GPGPU. |
CoRR |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Yixiao Du, Yuwei Hu, Zhongchun Zhou, Zhiru Zhang |
High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS: A Case Study on SpMV. |
FPGA |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Tao Li, Li Shen 0007, Shangshang Yao |
A High-performance SpMV Accelerator on HBM-equipped FPGAs. |
HPCC/DSS/SmartCity/DependSys |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Guoqing Xiao 0001, Tao Zhou, Yuedan Chen, Yikun Hu, Kenli Li 0001 |
DTSpMV: An Adaptive SpMV Framework for Graph Analysis on GPUs. |
HPCC/DSS/SmartCity/DependSys |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Weicai Ye, Chenghuan Huang, Jiasheng Huang, Jiajun Li, Yao Lu 0007, Ying Jiang |
An Integral-equation-oriented Vectorized SpMV Algorithm and its Application on CT Imaging Reconstruction. |
IPDPS |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Han D. Tran, Milinda Fernando, Kumar Saurabh, Baskar Ganapathysubramanian, Robert M. Kirby, Hari Sundar |
A scalable adaptive-matrix SPMV for heterogeneous architectures. |
IPDPS |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Erhan Tezcan, Tugba Torun, Fahrican Kosar, Kamer Kaya, Didem Unat |
Mixed and Multi-Precision SpMV for GPUs with Row-wise Precision Selection. |
SBAC-PAD |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Xin You, Changxi Liu, Hailong Yang, Pengbo Wang, Zhongzhi Luan, Depei Qian |
Vectorizing SpMV by Exploiting Dynamic Regular Patterns. |
ICPP |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Zhen Du, Jiajia Li 0001, Yinshan Wang, Xueqi Li 0001, Guangming Tan, Ninghui Sun |
AlphaSparse: Generating High Performance SpMV Codes Directly from Sparse Matrices. |
SC |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Siyi Hu, Makiko Ito, Takahide Yoshikawa, Yuan He 0002, Masaaki Kondo |
Memory Bandwidth Conservation for SpMV Kernels Through Adaptive Lossy Data Compression. |
PDCAT |
2022 |
DBLP DOI BibTeX RDF |
|
21 | Edoardo Coronado-Barrientos, Mario Antonioletti, Antonio J. García-Loureiro |
A new AXT format for an efficient SpMV product using AVX-512 instructions and CUDA. |
Adv. Eng. Softw. |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Thaha Mohammed 0001, Aiiad Albeshri, Iyad A. Katib, Rashid Mehmood |
DIESEL: A novel deep learning-based tool for SpMV computations and solving sparse linear equation systems. |
J. Supercomput. |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Ernesto Dufrechou, Pablo Ezzatti, Enrique S. Quintana-Ortí |
Selecting optimal SpMV realizations for GPUs via machine learning. |
Int. J. High Perform. Comput. Appl. |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Guoqing Xiao 0001, Kenli Li 0001, Yuedan Chen, Wangquan He, Albert Y. Zomaya, Tao Li 0006 |
CASpMV: A Customized and Accelerative SpMV Framework for the Sunway TaihuLight. |
IEEE Trans. Parallel Distributed Syst. |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Min Li, Yulong Ao, Chao Yang 0002 |
Adaptive SpMV/SpMSpV on GPUs for Input Vectors of Varied Sparsity. |
IEEE Trans. Parallel Distributed Syst. |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Yufeng Zhang 0001, Wangdong Yang, Kenli Li 0001, Dahai Tang, Keqin Li 0001 |
Performance analysis and optimization for SpMV based on aligned storage formats on an ARM processor. |
J. Parallel Distributed Comput. |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Alberto Parravicini, Luca Giuseppe Cellamare, Marco Siracusa, Marco Domenico Santambrogio |
Scaling up HBM Efficiency of Top-K SpMV for Approximate Embedding Similarity on FPGAs. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
21 | Guillermo Oyarzun, Daniel Peyrolon, Carlos Álvarez 0001, Xavier Martorell |
An FPGA cached sparse matrix vector product (SpMV) for unstructured computational fluid dynamics simulations. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
21 | Takeshi Fukaya, Koki Ishida, Akie Miura, Takeshi Iwashita, Hiroshi Nakashima |
Accelerating the SpMV kernel on standard CPUs by exploiting the partially diagonal structures. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
21 | Christie L. Alappat, Nils Meyer, Jan Laukemann, Thomas Gruber, Georg Hager, Gerhard Wellein, Tilo Wettig |
ECM modeling and performance tuning of SpMV and Lattice QCD on A64FX. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
21 | Haodong Bian, Jianqiang Huang, Lingbin Liu, Dongqiang Huang, Xiaoying Wang 0002 |
ALBUS: A method for efficiently processing SpMV using SIMD and Load balancing. |
Future Gener. Comput. Syst. |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Haodong Bian, Jianqiang Huang, Runting Dong, Yuluo Guo, Lingbin Liu, Dongqiang Huang, Xiaoying Wang 0002 |
A simple and efficient storage format for SIMD-accelerated SpMV. |
Clust. Comput. |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Muhammad Hannan Khan, Osman Hassan, Shahid Khan 0002 |
Accelerating SpMV Multiplication in Probabilistic Model Checkers Using GPUs. |
ICTAC |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Naveen Namashivavam, Sanyam Mehta, Pen-Chung Yew |
Variable-Sized Blocks for Locality-Aware SpMV. |
CGO |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Alberto Parravicini, Francesco Sgherzi, Marco D. Santambrogio |
A reduced-precision streaming SpMV architecture for Personalized PageRank on FPGA. |
ASP-DAC |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Jianhua Gao, Weixing Ji, Jie Liu, Senhao Shao, Yizhuo Wang, Feng Shi 0009 |
AMF-CSR: Adaptive Multi-Row Folding of CSR for SpMV on GPU. |
ICPADS |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Mohsen Koohi Esfahani, Peter Kilpatrick, Hans Vandierendonck |
Exploiting in-Hub Temporal Locality in SpMV-based Graph Processing. |
ICPP |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Xiang Fei, Youhui Zhang |
Regu2D: Accelerating Vectorization of SpMV on Intel Processors through 2D-partitioning and Regular Arrangement. |
ICPP |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Constantino Gómez, Filippo Mantovani, Erich Focht, Marc Casas |
Efficiently running SpMV on long vector architectures. |
PPoPP |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Stephen L. Olivier, Nathan D. Ellingwood, Jonathan W. Berry, Daniel M. Dunlavy |
Performance Portability of an SpMV Kernel Across Scientific Computing and Data Science Applications. |
HPEC |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Mahesh Mahadurkar, Nalesh Sivanandan, S. Kala |
Hardware Acceleration of SpMV Multiplier for Deep Learning. |
VDAT |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Federico Favaro, Juan P. Oliver, Pablo Ezzatti |
Unleashing the computational power of FPGAs to efficiently perform SPMV operation. |
SCCC |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Siying Feng, Jiawen Sun, Subhankar Pal, Xin He, Kuba Kaszyk, Dong-Hyeon Park, John Magnus Morton, Trevor N. Mudge, Murray Cole, Michael F. P. O'Boyle, Chaitali Chakrabarti, Ronald G. Dreslinski |
CoSPARSE: A Software and Hardware Reconfigurable SpMV Framework for Graph Analytics. |
DAC |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Chenyang Li, Tian Xia 0008, Wenzhe Zhao, Nanning Zheng 0001, Pengju Ren |
SpV8: Pursuing Optimal Vectorization and Regular Computation Pattern in SpMV. |
DAC |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Alberto Parravicini, Luca Giuseppe Cellamare, Marco Siracusa, Marco D. Santambrogio |
Scaling up HBM Efficiency of Top-K SpMV for Approximate Embedding Similarity on FPGAs. |
DAC |
2021 |
DBLP DOI BibTeX RDF |
|
21 | Guoqing Xiao 0001, Yuedan Chen, Chubo Liu, Xu Zhou |
ahSpMV: An Autotuning Hybrid Computing Scheme for SpMV on the Sunway Architecture. |
IEEE Internet Things J. |
2020 |
DBLP DOI BibTeX RDF |
|
21 | Akrem Benatia, Weixing Ji, Yizhuo Wang, Feng Shi 0009 |
Sparse matrix partitioning for optimizing SpMV on CPU-GPU heterogeneous platforms. |
Int. J. High Perform. Comput. Appl. |
2020 |
DBLP DOI BibTeX RDF |
|
21 | Weijie Zhou, Yue Zhao 0011, Xipeng Shen, Wang Chen |
Enabling Runtime SpMV Format Selection through an Overhead Conscious Method. |
IEEE Trans. Parallel Distributed Syst. |
2020 |
DBLP DOI BibTeX RDF |
|
21 | Mohammad Almasri, Walid A. Abu-Sufah |
CCF: An efficient SpMV storage format for AVX512 platforms. |
Parallel Comput. |
2020 |
DBLP DOI BibTeX RDF |
|
21 | Alberto Parravicini, Francesco Sgherzi, Marco D. Santambrogio |
A reduced-precision streaming SpMV architecture for Personalized PageRank on FPGA. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
21 | Min Li, Yulong Ao, Chao Yang 0002 |
Adaptive SpMV/SpMSpV on GPUs for Input Vectors of Varied Sparsity. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
21 | Sandra Catalán, Tetsuzo Usui, Leonel Toledo, Xavier Martorell, Jesús Labarta, Pedro Valero-Lara |
Towards an Auto-Tuned and Task-Based SpMV (LASs Library). |
IWOMP |
2020 |
DBLP DOI BibTeX RDF |
|
21 | Brian A. Page, Peter M. Kogge |
Scalability of Sparse Matrix Dense Vector Multiply (SpMV) on a Migrating Thread Architecture. |
IPDPS Workshops |
2020 |
DBLP DOI BibTeX RDF |
|
21 | Haodong Bian, Jianqiang Huang, Runting Dong, Lingbin Liu, Xiaoying Wang 0002 |
CSR2: A New Format for SIMD-accelerated SpMV. |
CCGRID |
2020 |
DBLP DOI BibTeX RDF |
|
21 | Reo Furuhata, Minglu Zhao, Mulya Agung, Ryusuke Egawa, Hiroyuki Takizawa |
Improving the Accuracy in SpMV Implementation Selection with Machine Learning. |
CANDAR (Workshops) |
2020 |
DBLP DOI BibTeX RDF |
|
21 | Serif Yesil, Azin Heidarshenas, Adam Morrison 0001, Josep Torrellas |
Speeding up SpMV for power-law graph analytics by enhancing locality & vectorization. |
SC |
2020 |
DBLP DOI BibTeX RDF |
|
21 | Haodong Bian, Gangsheng Li, Linbing Liu, Dongqiang Huang, Runting Dong, Jianqiang Huang |
Research on Accelerating the Performance of SpMV Based on AVX2 Instruction Set. |
HPCCT/BDAI |
2020 |
DBLP DOI BibTeX RDF |
|
21 | Edoardo Coronado-Barrientos, Guillermo Indalecio Fernández, Antonio J. García-Loureiro |
AXC: A new format to perform the SpMV oriented to Intel Xeon Phi architecture in OpenCL. |
Concurr. Comput. Pract. Exp. |
2019 |
DBLP DOI BibTeX RDF |
|
21 | Sardar Usman, Rashid Mehmood, Iyad A. Katib, Aiiad Albeshri |
ZAKI+: A Machine Learning Based Process Mapping Tool for SpMV Computations on Distributed Memory Architectures. |
IEEE Access |
2019 |
DBLP DOI BibTeX RDF |
|
21 | Fazle Sadi, Joe Sweeney, Tze Meng Low, James C. Hoe, Larry T. Pileggi, Franz Franchetti |
Efficient SpMV Operation for Large and Highly Sparse Matrices using Scalable Multi-way Merge Parallelization. |
MICRO |
2019 |
DBLP DOI BibTeX RDF |
|
21 | Yuedan Chen, Guoqing Xiao 0001, Zheng Xiao, Wangdong Yang |
hpSpMV: A Heterogeneous Parallel Computing Scheme for SpMV on the Sunway TaihuLight Supercomputer. |
HPCC/SmartCity/DSS |
2019 |
DBLP DOI BibTeX RDF |
|
Displaying results #1 - #100 of 168.