|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
The graphs summarize 108 occurrences of 76 keywords
|
|
|
Results
Found 163 publication records. Showing 163 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
151 | Satish Kharat, Rajeev Mishra, Ranadip Das, Srikanth Vishwanathan |
Migration of software partition in UNIX system. |
Bangalore Compute Conf. |
2008 |
DBLP DOI BibTeX RDF |
|
143 | Joshua Hursey, Timothy Mattox, Andrew Lumsdaine |
Interconnect agnostic checkpoint/restart in open MPI. |
HPDC |
2009 |
DBLP DOI BibTeX RDF |
checkpoint coordination protocol, fault tolerance, MPI, shared memory, rollback-recovery, infiniband, myrinet, high speed interconnect, checkpoint/restart |
127 | Oren Laadan, Dan B. Phung, Jason Nieh |
Transparent Checkpoint-Restart of Distributed Applications on Commodity Clusters. |
CLUSTER |
2005 |
DBLP DOI BibTeX RDF |
|
124 | Chao Wang 0056, Frank Mueller 0001, Christian Engelmann, Stephen L. Scott |
A Job Pause Service under LAM/MPI+BLCR for Transparent Fault Tolerance. |
IPDPS |
2007 |
DBLP DOI BibTeX RDF |
|
123 | Oreste Villa, Sriram Krishnamoorthy, Jarek Nieplocha, David M. Brown Jr. |
Scalable transparent checkpoint-restart of global address space applications on virtual machines over infiniband. |
Conf. Computing Frontiers |
2009 |
DBLP DOI BibTeX RDF |
checkpoint-restart, virtual machines, infiniband, global address space |
108 | G. John Janakiraman, Jose Renato Santos, Dinesh Subhraveti, Yoshio Turner |
Cruz: Application-Transparent Distributed Checkpoint-Restart on Standard Operating Systems. |
DSN |
2005 |
DBLP DOI BibTeX RDF |
|
107 | Yudan Liu, Raja Nassar, Chokchai Leangsuksun, Nichamon Naksinehaboon, Mihaela Paun, Stephen L. Scott |
An optimal checkpoint/restart model for a large scale high performance computing system. |
IPDPS |
2008 |
DBLP DOI BibTeX RDF |
|
107 | Yudan Liu, Raja Nassar, Chokchai Leangsuksun, Nichamon Naksinehaboon, Mihaela Paun, Stephen L. Scott |
A reliability-aware approach for an optimal checkpoint/restart model in HPC environments. |
CLUSTER |
2007 |
DBLP DOI BibTeX RDF |
|
105 | Gengbin Zheng, Lixia Shi, Laxmikant V. Kalé |
FTC-Charm++: an in-memory checkpoint-based fault tolerant runtime for Charm++ and MPI. |
CLUSTER |
2004 |
DBLP DOI BibTeX RDF |
|
101 | Jiannong Cao 0001, Yinghao Li, Minyi Guo |
Process Migration for MPI Applications based on Coordinated Checkpoint. |
ICPADS (1) |
2005 |
DBLP DOI BibTeX RDF |
MPI, process migration, checkpoint/restart, coordinated checkpoint |
95 | Gracjan Jankowski, József Kovács, Norbert Meyer, Radoslaw Januszewski, Rafal Mikolajczak |
Towards Checkpointing Grid Architecture. |
PPAM |
2005 |
DBLP DOI BibTeX RDF |
|
89 | Joshua Hursey, Jeffrey M. Squyres, Timothy Mattox, Andrew Lumsdaine |
The Design and Implementation of Checkpoint/Restart Process Fault Tolerance for Open MPI. |
IPDPS |
2007 |
DBLP DOI BibTeX RDF |
|
89 | R. Badrinath, R. Krishnakumar, R. K. Palanivel Rajan |
Virtualization aware job schedulers for checkpoint-restart. |
ICPADS |
2007 |
DBLP DOI BibTeX RDF |
|
70 | Qi Gao 0004, Weikuan Yu, Wei Huang 0003, Dhabaleswar K. Panda 0001 |
Application-Transparent Checkpoint/Restart for MPI Programs over InfiniBand. |
ICPP |
2006 |
DBLP DOI BibTeX RDF |
|
70 | José Carlos Sancho, Fabrizio Petrini, Kei Davis, Roberto Gioiosa, Song Jiang 0001 |
Current Practice and a Direction Forward in Checkpoint/Restart Implementations for Fault Tolerance. |
IPDPS |
2005 |
DBLP DOI BibTeX RDF |
|
70 | Kalman Z. Meth, William G. Tuel Jr. |
Parallel Checkpoint/Restart without Message Logging. |
ICPP Workshops |
2000 |
DBLP DOI BibTeX RDF |
|
67 | Xiangyong Ouyang, Raghunath Rajachandrasekar, Xavier Besseron, Hao Wang 0002, Jian Huang 0006, Dhabaleswar K. Panda 0001 |
CRFS: A Lightweight User-Level Filesystem for Generic Checkpoint/Restart. |
ICPP |
2011 |
DBLP DOI BibTeX RDF |
checkpoint-restart, userspace filesystem, write aggregation |
66 | Jason Ansel, Kapil Arya, Gene Cooperman |
DMTCP: Transparent checkpointing for cluster computations and the desktop. |
IPDPS |
2009 |
DBLP DOI BibTeX RDF |
|
60 | Geoffroy Vallée, Renaud Lottiaux, David Margery, Christine Morin |
Ghost Process: a Sound Basis to Implement Process Duplication, Migration and Checkpoint/Restart in Linux Clusters. |
ISPDC |
2005 |
DBLP DOI BibTeX RDF |
process virtualization, distributed system, operating system, Linux cluster, single system image |
60 | Yudan Liu, Chokchai Leangsuksun, Hertong Song, Stephen L. Scott |
Reliability-aware Checkpoint/Restart Scheme: A Performability Trade-off. |
CLUSTER |
2005 |
DBLP DOI BibTeX RDF |
|
58 | Adnan Agbaria, Roy Friedman |
Virtual Machine Based Heterogeneous Checkpointing. |
IPDPS |
2002 |
DBLP DOI BibTeX RDF |
|
54 | Stephen L. Scott, Christian Engelmann, Geoffroy Vallée, Thomas J. Naughton, Anand Tikotekar, George Ostrouchov, Chokchai Leangsuksun, Nichamon Naksinehaboon, Raja Nassar, Mihaela Paun, Frank Mueller 0001, Chao Wang 0056, Arun Babu Nagarajan, Jyothish Varma |
A tunable holistic resiliency approach for high-performance computing systems. |
PPoPP |
2009 |
DBLP DOI BibTeX RDF |
preemptive migration, fault tolerance, high-performance computing, resilience, checkpoint/restart |
54 | Shaya Potter, Jason Nieh |
WebPod: persistent Web browsing sessions with pocketable storage devices. |
WWW |
2005 |
DBLP DOI BibTeX RDF |
portable storage, virtualization, web browsing, process migration, checkpoint/restart |
54 | Adnan Agbaria, Roy Friedman |
Starfish: Fault-Tolerant Dynamic MPI Programs on Clusters of Workstations. |
Clust. Comput. |
2003 |
DBLP DOI BibTeX RDF |
fault-tolerance, distributed system, MPI, high performance, checkpoint/restart |
54 | Andrea Clematis, Vittoria Gianuzzi |
CPVM - Extending PVM for Consistent Checkpointing. |
PDP |
1996 |
DBLP DOI BibTeX RDF |
CPVM, consistent checkpointing, global checkpoint-restart algorithms, job-swapping, parallel programming, software tools, concurrency control, migration, deadlocks, termination, software fault tolerance, software fault-tolerance, software libraries, software library, PVM, Parallel Virtual Machine, software portability, nonblocking |
54 | Vittoria Gianuzzi, F. Merani |
Using PVM to implement a distributed dependable simulation system. |
PDP |
1995 |
DBLP DOI BibTeX RDF |
distributed dependable simulation system, PVM routines, fault tolerant mechanisms, checkpoint-restart mechanism, distributed algorithms, distributed algorithms, fault tolerant computing, message passing, synchronisation, simulations modelling, Virtual Time, high speed interconnection |
48 | Yawei Li, Zhiling Lan |
FREM: A Fast Restart Mechanism for General Checkpoint/Restart. |
IEEE Trans. Computers |
2011 |
DBLP DOI BibTeX RDF |
|
48 | Kathryn M. Mohror, Adam Moody, Bronis R. de Supinski |
Asynchronous checkpoint migration with MRNet in the Scalable Checkpoint / Restart Library. |
DSN Workshops |
2012 |
DBLP DOI BibTeX RDF |
|
47 | Arun Babu Nagarajan, Frank Mueller 0001, Christian Engelmann, Stephen L. Scott |
Proactive fault tolerance for HPC with Xen virtualization. |
ICS |
2007 |
DBLP DOI BibTeX RDF |
proactive fault tolerance, virtualization, high-performance computing |
47 | Matthieu Fertre, Christine Morin |
Extending a Cluster SSI OS for Transparently Checkpointing Message-Passing Parallel Application. |
ISPAN |
2005 |
DBLP DOI BibTeX RDF |
global coordination, checkpointing, parallel application, single system image |
38 | Limor Fix, Orna Grumberg, Amnon Heyman, Tamir Heyman, Assaf Schuster |
Verifying Very Large Industrial Circuits Using 100 Processes and Beyond. |
ATVA |
2005 |
DBLP DOI BibTeX RDF |
|
35 | Borja Sotomayor, Kate Keahey, Ian T. Foster |
Combining batch execution and leasing using virtual machines. |
HPDC |
2008 |
DBLP DOI BibTeX RDF |
resource leasing, virtual machine overhead, virtual workspaces, virtual machines, resource management, advance reservations, batch processing, checkpoint/restart, backfilling |
35 | Ricardo Marcelín-Jiménez, Sergio Rajsbaum, Brett Stevens |
Cyclic Storage for Fault-Tolerant Distributed Executions. |
IEEE Trans. Parallel Distributed Syst. |
2006 |
DBLP DOI BibTeX RDF |
storage/repositories, network repositories/data mining/backup, fault-tolerance, distributed systems, distributed applications, checkpoint/restart, Load balancing and task assignment |
35 | Tatsuya Ozaki, Tadashi Dohi, Hiroyuki Okamura, Naoto Kaio |
Distribution-Free Checkpoint Placement Algorithms Based on Min-Max Principle. |
IEEE Trans. Dependable Secur. Comput. |
2006 |
DBLP DOI BibTeX RDF |
incomplete failure information, performance evaluation, fault-tolerance, maintenance, high availability, modeling and prediction, Checkpoint/restart |
35 | Rohit Fernandes, Keshav Pingali, Paul Stodghill |
Mobile MPI programs in computational grids. |
PPoPP |
2006 |
DBLP DOI BibTeX RDF |
over-decomposition, portable checkpointing, grid computing, MPI, heterogeneity, checkpoint/restart, application-level checkpointing |
35 | Lorenzo Alvisi, Keith Marzullo |
Message Logging: Pessimistic, Optimistic, Causal, and Optimal. |
IEEE Trans. Software Eng. |
1998 |
DBLP DOI BibTeX RDF |
pessimistic protocols, checkpoint-restart protocols, resilient processes, specification of fault-tolerance techniques, Message logging, optimistic protocols |
35 | Raymond A. Lorie |
Physical Integrity in a Large Segmented Database. |
ACM Trans. Database Syst. |
1977 |
DBLP DOI BibTeX RDF |
checkpoint-restart, database, recovery, storage management |
32 | Lei Wang 0042, Jiyuan Liu 0007, Qiang He 0001 |
Concept Drift-Based Checkpoint-Restart for Edge Services Rejuvenation. |
IEEE Trans. Serv. Comput. |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Akira Nukada, Taichiro Suzuki, Satoshi Matsuoka |
Efficient checkpoint/Restart of CUDA applications. |
Parallel Comput. |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Basma Abdel Azeem, Manal Helal |
Performance Evaluation of Checkpoint/Restart Techniques. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Yao Xu, Leonid Belyaev, Twinkle Jain, Derek Schafer, Anthony Skjellum, Gene Cooperman |
Implementation-Oblivious Transparent Checkpoint-Restart for MPI. |
CoRR |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Yao Xu, Leonid Belyaev, Twinkle Jain, Derek Schafer, Anthony Skjellum, Gene Cooperman |
Implementation-Oblivious Transparent Checkpoint-Restart for MPI. |
SC Workshops |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Niklas Eiling, Stefan Lankes, Antonello Monti |
Checkpoint/Restart for CUDA Kernels. |
SC Workshops |
2023 |
DBLP DOI BibTeX RDF |
|
32 | Niklas Eiling, Jonas Baude, Stefan Lankes, Antonello Monti |
Cricket: A virtualization layer for distributed execution of CUDA applications with checkpoint/restart support. |
Concurr. Comput. Pract. Exp. |
2022 |
DBLP DOI BibTeX RDF |
|
32 | Genlang Chen, Jiajian Zhang, Zufang Zhu, Hao Wang, Hai Jiang 0003, Chaoyi Pang |
CRAC: An automatic assistant compiler of checkpoint/restart for OpenCL program. |
Concurr. Comput. Pract. Exp. |
2022 |
DBLP DOI BibTeX RDF |
|
32 | Genlang Chen, Jiajian Zhang, Zufang Zhu, Qiangqiang Jiang, Hai Jiang 0003, Chaoyi Pang |
CRState: checkpoint/restart of OpenCL program for in-kernel applications. |
J. Supercomput. |
2021 |
DBLP DOI BibTeX RDF |
|
32 | Guangye Chen, Luis Chacón, Truong B. Nguyen |
An unsupervised machine-learning checkpoint-restart algorithm using Gaussian mixtures for particle-in-cell simulations. |
J. Comput. Phys. |
2021 |
DBLP DOI BibTeX RDF |
|
32 | Anthony Skjellum, Derek Schafer |
Checkpoint-Restart Libraries Must Become More Fault Tolerant. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
32 | Guangye Chen, Luis Chacón, Truong B. Nguyen |
An unsupervised machine-learning checkpoint-restart algorithm using Gaussian mixtures for particle-in-cell simulations. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
32 | Prashant Singh Chouhan, Gregory Price, Gene Cooperman |
An Architecture for Exploiting Native User-Land Checkpoint-Restart to Improve Fuzzing. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
32 | Rajeev Jain, Klaus Weide, Saurabh Chawdhary, Thomas Klostermann |
Checkpoint/Restart for Lagrangian particle mesh with AMR in community code FLASH-X. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
32 | Kfir Zvi, Gal Oren 0001 |
Optimized Memoryless Fair-Share HPC Resources Scheduling using Transparent Checkpoint-Restart Preemption. |
CoRR |
2021 |
DBLP BibTeX RDF |
|
32 | Masoud Gholami, Florian Schintke |
Combining XOR and Partner Checkpointing for Resilient Multilevel Checkpoint/Restart. |
IPDPS |
2021 |
DBLP DOI BibTeX RDF |
|
32 | Shashank Gugnani, Tianxi Li, Xiaoyi Lu |
NVMe-CR: A Scalable Ephemeral Storage Runtime for Checkpoint/Restart with NVMe-over-Fabrics. |
IPDPS |
2021 |
DBLP DOI BibTeX RDF |
|
32 | Konstantinos Parasyris, Giorgis Georgakoudis, Leonardo Bautista-Gomez, Ignacio Laguna |
Co-Designing Multi-Level Checkpoint Restart for MPI Applications. |
CCGRID |
2021 |
DBLP DOI BibTeX RDF |
|
32 | Twinkle Jain, Gene Cooperman |
CRAC: Checkpoint-Restart Architecture for CUDA with Streams and UVM. |
CoRR |
2020 |
DBLP BibTeX RDF |
|
32 | Tonmoy Dey, Kento Sato, Bogdan Nicolae, Jian Guo, Jens Domke, Weikuan Yu, Franck Cappello, Kathryn M. Mohror |
Optimizing Asynchronous Multi-Level Checkpoint/Restart Configurations with Machine Learning. |
IPDPS Workshops |
2020 |
DBLP DOI BibTeX RDF |
|
32 | Konstantinos Parasyris, Kai Keller, Leonardo Bautista-Gomez, Osman S. Unsal |
Checkpoint Restart Support for Heterogeneous HPC Applications. |
CCGRID |
2020 |
DBLP DOI BibTeX RDF |
|
32 | Twinkle Jain, Gene Cooperman |
CRAC: checkpoint-restart architecture for CUDA with streams and UVM. |
SC |
2020 |
DBLP DOI BibTeX RDF |
|
32 | Arif Ahmed 0001, Apoorve Mohan, Gene Cooperman, Guillaume Pierre |
Docker Container Deployment in Distributed Fog Infrastructures with Checkpoint/Restart. |
MobileCloud |
2020 |
DBLP DOI BibTeX RDF |
|
32 | Marina Morán, Javier Aldo Balladini, Dolores Rexachs, Emilio Luque |
Prediction of Energy Consumption by Checkpoint/Restart in HPC. |
IEEE Access |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Manuel Aurelio Rodriguez Pascual, Jiajun Cao, José A. Moríñigo, Gene Cooperman, Rafael Mayo-García |
Job migration in HPC clusters by means of checkpoint/restart. |
J. Supercomput. |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Faisal Shahzad 0001, Jonas Thies, Moritz Kreutzer, Thomas Zeiser, Georg Hager, Gerhard Wellein |
CRAFT: A Library for Easier Application-Level Checkpoint/Restart and Automatic Fault Tolerance. |
IEEE Trans. Parallel Distributed Syst. |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Julien Adam, Maxime Kermarquer, Jean-Baptiste Besnard, Leonardo Bautista-Gomez, Marc Pérache, Patrick Carribault, Julien Jaeger, Allen D. Malony, Sameer Shende |
Checkpoint/restart approaches for a thread-based MPI runtime. |
Parallel Comput. |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Julien Adam, Maxime Kermarquer, Jean-Baptiste Besnard, Leonardo Bautista-Gomez, Marc Pérache, Patrick Carribault, Julien Jaeger, Allen D. Malony, Sameer Shende |
Checkpoint/restart approaches for a thread-based MPI runtime. |
CoRR |
2019 |
DBLP BibTeX RDF |
|
32 | Jumpol Yaothanee, Kasidit Chanchio |
An In-Memory Checkpoint-Restart Mechanism for a Cluster of Virtual Machines. |
JCSSE |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Genlang Chen, Jiajian Zhang, Zufang Zhu, Chaoyan Zhu, Hai Jiang 0003, Chaoyi Pang |
CRAC: An Automatic Assistant Compiler of Checkpoint/Restart for OpenCL Program. |
ICDS |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Genlang Chen, Jiajian Zhang, Qiuru Lin, Hai Jiang 0003, Chaoyi Pang |
CRState: In-Kernel Checkpoint/Restart of OpenCL Program Execution on GPU. |
ICPADS |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Jialing Zhang, Xiaoyan Zhuo, Aekyeung Moon, Hang Liu, Seung Woo Son 0001 |
Efficient Encoding and Reconstruction of HPC Datasets for Checkpoint/Restart. |
MSST |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Masoud Gholami Estahbanati, Florian Schintke |
Multilevel Checkpoint/Restart for Large Computational Jobs on Distributed Computing Resources. |
SRDS |
2019 |
DBLP DOI BibTeX RDF |
|
32 | Thouraya Louati, Heithem Abbes, Christophe Cérin, Mohamed Jemni |
LXCloud-CR: Towards LinuX Containers Distributed Hash Table based Checkpoint-Restart. |
J. Parallel Distributed Comput. |
2018 |
DBLP DOI BibTeX RDF |
|
32 | Gregory Michael Price |
DMTCP Checkpoint/Restart of MPI Programs via Proxies. |
CoRR |
2018 |
DBLP BibTeX RDF |
|
32 | Rohan Garg 0001, Apoorve Mohan, Michael B. Sullivan 0001, Gene Cooperman |
CRUM: Checkpoint-Restart Support for CUDA's Unified Memory. |
CoRR |
2018 |
DBLP BibTeX RDF |
|
32 | Adrian Bazaga, Michal Pitonák |
Performance Evaluation of an Algorithm-based Asynchronous Checkpoint-Restart Fault Tolerant Application Using Mixed MPI/GPI-2. |
CoRR |
2018 |
DBLP BibTeX RDF |
|
32 | Julien Adam, Jean-Baptiste Besnard, Allen D. Malony, Sameer Shende, Marc Pérache, Patrick Carribault, Julien Jaeger |
Transparent High-Speed Network Checkpoint/Restart in MPI. |
EuroMPI |
2018 |
DBLP DOI BibTeX RDF |
|
32 | Chayawat Pechwises, Kasidit Chanchio |
A Transparent Hypervisor-level Checkpoint-Restart Mechanism for a Cluster of Virtual Machines. |
JCSSE |
2018 |
DBLP DOI BibTeX RDF |
|
32 | Rohan Garg 0001, Apoorve Mohan, Michael B. Sullivan 0001, Gene Cooperman |
CRUM: Checkpoint-Restart Support for CUDA's Unified Memory. |
CLUSTER |
2018 |
DBLP DOI BibTeX RDF |
|
32 | Faisal Shahzad 0001, Jonas Thies, Moritz Kreutzer, Thomas Zeiser, Georg Hager, Gerhard Wellein |
CRAFT: A library for easier application-level Checkpoint/Restart and Automatic Fault Tolerance. |
CoRR |
2017 |
DBLP BibTeX RDF |
|
32 | Kiril Dichev, Herbert Jordan, Konstantinos Tovletoglou, Thomas Heller, Dimitrios S. Nikolopoulos, Georgios Karakonstantis, Charles Gillan |
Dependency-Aware Rollback and Checkpoint-Restart for Distributed Task-Based Runtimes. |
CoRR |
2017 |
DBLP BibTeX RDF |
|
32 | Marcos Maronas, Sergi Mateo, Vicenç Beltran 0001, Eduard Ayguadé |
A Directive-Based Approach to Perform Persistent Checkpoint/Restart. |
HPCS |
2017 |
DBLP DOI BibTeX RDF |
|
32 | Abhinav Agrawal 0003, Gabriel H. Loh, James Tuck 0001 |
Leveraging near data processing for high-performance checkpoint/restart. |
SC |
2017 |
DBLP DOI BibTeX RDF |
|
32 | Behnam Pourghassemi, Aparna Chandramowlishwaran |
cudaCR: An In-Kernel Application-Level Checkpoint/Restart Scheme for CUDA-Enabled GPUs. |
CLUSTER |
2017 |
DBLP DOI BibTeX RDF |
|
32 | Juri Schmidt |
Accelerating checkpoint/restart application performance in large-scale systems with network attached memory. |
|
2017 |
RDF |
|
32 | Behnam Pourghassemi |
cudaCR: An In-kernel Application-level Checkpoint/Restart Scheme for CUDA Applications. |
|
2017 |
RDF |
|
32 | Jiajun Cao, Kapil Arya, Rohan Garg 0001, L. Shawn Matott, Dhabaleswar K. Panda 0001, Hari Subramoni, Jérôme Vienne, Gene Cooperman |
System-level Scalable Checkpoint-Restart for Petascale Computing. |
CoRR |
2016 |
DBLP BibTeX RDF |
|
32 | Zhengyu Chen, Jianhua Sun 0002, Hao Chen 0002 |
Optimizing Checkpoint Restart with Data Deduplication. |
Sci. Program. |
2016 |
DBLP DOI BibTeX RDF |
|
32 | Jiajun Cao, Kapil Arya, Rohan Garg 0001, L. Shawn Matott, Dhabaleswar K. Panda 0001, Hari Subramoni, Jérôme Vienne, Gene Cooperman |
System-Level Scalable Checkpoint-Restart for Petascale Computing. |
ICPADS |
2016 |
DBLP DOI BibTeX RDF |
|
32 | Scott Levy, Kurt B. Ferreira |
An Examination of the Impact of Failure Distribution on Coordinated Checkpoint/Restart. |
FTXS@HPDC |
2016 |
DBLP DOI BibTeX RDF |
|
32 | Javier Arias Moreno, Osman S. Unsal, Jesús Labarta, Adrián Cristal |
NanoCheckpoints: A Task-Based Asynchronous Dataflow Framework for Efficient and Scalable Checkpoint/Restart. |
PDP |
2015 |
DBLP DOI BibTeX RDF |
|
32 | Naoto Sasaki, Kento Sato, Toshio Endo, Satoshi Matsuoka |
Exploration of Lossy Compression for Application-Level Checkpoint/Restart. |
IPDPS |
2015 |
DBLP DOI BibTeX RDF |
|
32 | Jiajun Cao, Gregory Kerr, Kapil Arya, Gene Cooperman |
Transparent checkpoint-restart over infiniband. |
HPDC |
2014 |
DBLP DOI BibTeX RDF |
|
32 | Ajay Saini, Arash Rezaei, Frank Mueller 0001, Paul Hargrove, Eric Roman |
Affinity-aware checkpoint restart. |
Middleware |
2014 |
DBLP DOI BibTeX RDF |
|
32 | Cristiano Giuffrida, Calin Iorgulescu, Andrew S. Tanenbaum |
Mutable checkpoint-restart: automating live update for generic server programs. |
Middleware |
2014 |
DBLP DOI BibTeX RDF |
|
32 | Nosayba El-Sayed, Bianca Schroeder |
Checkpoint/restart in practice: When 'simple is better'. |
CLUSTER |
2014 |
DBLP DOI BibTeX RDF |
|
32 | Faisal Shahzad 0001, Markus Wittmann, Moritz Kreutzer, Thomas Zeiser, Georg Hager, Gerhard Wellein |
A Survey of Checkpoint/Restart Techniques on Distributed Memory Systems. |
Parallel Process. Lett. |
2013 |
DBLP DOI BibTeX RDF |
|
32 | Ifeanyi P. Egwutuoha, David Levy 0001, Bran Selic, Shiping Chen 0001 |
A survey of fault tolerance mechanisms and checkpoint/restart implementations for high performance computing systems. |
J. Supercomput. |
2013 |
DBLP DOI BibTeX RDF |
|
32 | Bogdan Nicolae, Franck Cappello |
BlobCR: Virtual disk based checkpoint-restart for HPC applications on IaaS clouds. |
J. Parallel Distributed Comput. |
2013 |
DBLP DOI BibTeX RDF |
|
32 | Jiajun Cao, Kapil Arya, Gene Cooperman |
Transparent Checkpoint-Restart over InfiniBand. |
CoRR |
2013 |
DBLP BibTeX RDF |
|
32 | Kapil Arya, Gene Cooperman, Andrea Dotti, Peter Elmer |
Use of Checkpoint-Restart for Complex HEP Software on Traditional Architectures and Intel MIC. |
CoRR |
2013 |
DBLP BibTeX RDF |
|
32 | Samaneh Kazemi, Rohan Garg 0001, Gene Cooperman |
Transparent Checkpoint-Restart for Hardware-Accelerated 3D Graphics. |
CoRR |
2013 |
DBLP BibTeX RDF |
|
Displaying result #1 - #100 of 163 (100 per page; Change: ) Pages: [ 1][ 2][ >>] |
|