|
|
Venues (Conferences, Journals, ...)
|
|
GrowBag graphs for keyword ? (Num. hits/coverage)
Group by:
No Growbag Graphs found.
|
|
|
Results
Found 37 publication records. Showing 37 according to the selection in the facets
Hits ?▲ |
Authors |
Title |
Venue |
Year |
Link |
Author keywords |
1 | Bo Fang, Siva Kumar Sastry Hari, Timothy Tsai 0002, Xinyi Li, Ganesh Gopalakrishnan, Ignacio Laguna, Kevin J. Barker, Ang Li 0006 |
Towards Precision-Aware Fault Tolerance Approaches for Mixed-Precision Applications. |
FTXS@SC |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Aurelien Bouteiller, George Bosilca |
Implicit Actions and Non-blocking Failure Recovery with MPI. |
FTXS@SC |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Chris Egersdoerfer, Di Zhang, Dong Dai 0001 |
ClusterLog: Clustering Logs for Effeftxsctive Log-based Anomaly Detection. |
FTXS@SC |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Yehonatan Fridman, Yaniv Snir, Harel Levin, Danny Hendler, Hagit Attiya, Gal Oren 0001 |
Recovery of Distributed Iterative Solvers for Linear Systems Using Non-Volatile RAM. |
FTXS@SC |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Lukas Hübner, Demian Hespe, Peter Sanders 0001, Alexandros Stamatakis |
ReStore: In-Memory REplicated STORagE for Rapid Recovery in Fault-Tolerant Algorithms. |
FTXS@SC |
2022 |
DBLP DOI BibTeX RDF |
|
1 | |
12th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, FTXS@SC 2022, Dallas, TX, USA, November 13-18, 2022 |
FTXS@SC |
2022 |
DBLP DOI BibTeX RDF |
|
1 | Nathan DeBardeleben, Tom Burr, Stephen Penton, Craig Walker, Josip Loncaric, William M. Jones |
Statistical Framework for Two-Party Acceptance Testing of HPC Systems for Reliability. |
FTXS@SC |
2021 |
DBLP DOI BibTeX RDF |
|
1 | Zheng Miao, Jon C. Calhoun, Rong Ge 0002 |
Relaxed Replication for Energy Efficient and Resilient GPU Computing. |
FTXS@SC |
2021 |
DBLP DOI BibTeX RDF |
|
1 | Trokon Johnson, Herman Lam |
Incorporating Fault-Tolerance Awareness into System-Level Modeling and Simulation. |
FTXS@SC |
2021 |
DBLP DOI BibTeX RDF |
|
1 | |
11th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, FTXS@SC 2021, St. Louis, MO, USA, November 14, 2021 |
FTXS@SC |
2021 |
DBLP DOI BibTeX RDF |
|
1 | Yehonatan Fridman, Yaniv Snir, Matan Rusanovsky, Kfir Zvi, Harel Levin, Danny Hendler, Hagit Attiya, Gal Oren 0001 |
Assessing the Use Cases of Persistent Memory in High-Performance Scientific Computing. |
FTXS@SC |
2021 |
DBLP DOI BibTeX RDF |
|
1 | Philipp Samfass, Tobias Weinzierl, Anne Reinarz, Michael Bader |
Doubt and Redundancy Kill Soft Errors - Towards Detection and Correction of Silent Data Corruption in Task-based Numerical Software. |
FTXS@SC |
2021 |
DBLP DOI BibTeX RDF |
|
1 | Romain Lion, Samuel Thibault |
From tasks graphs to asynchronous distributed checkpointing with local restart. |
FTXS@SC |
2020 |
DBLP DOI BibTeX RDF |
|
1 | Hemanth Kolla, Jackson R. Mayo, Keita Teranishi, Robert C. Armstrong |
Improving Scalability of Silent-Error Resilience for Message-Passing Solvers via Local Recovery and Asynchrony. |
FTXS@SC |
2020 |
DBLP DOI BibTeX RDF |
|
1 | |
10th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, FTXS@SC 2020, Atlanta, GA, USA, November 11, 2020 |
FTXS@SC |
2020 |
DBLP DOI BibTeX RDF |
|
1 | Mohit Kumar, Christian Engelmann |
Models for Resilience Design Patterns. |
FTXS@SC |
2020 |
DBLP DOI BibTeX RDF |
|
1 | Md Abdullah Shahneous Bari, Debasmita Basu, Wenbin Lu, Tony Curtis, Barbara M. Chapman |
Checkpointing OpenSHMEM Programs Using Compiler Analysis. |
FTXS@SC |
2020 |
DBLP DOI BibTeX RDF |
|
1 | Scott Levy |
Message from the Workshop Chair. |
FTXS@SC |
2020 |
DBLP DOI BibTeX RDF |
|
1 | Carlos Pachajoa, Robert Ernstbrunner, Wilfried N. Gansterer |
A Generic Strategy for Node-Failure Resilience for Certain Iterative Linear Algebra Methods. |
FTXS@SC |
2020 |
DBLP DOI BibTeX RDF |
|
1 | Nikunj Gupta, Jackson R. Mayo, Adrian S. Lemoine, Hartmut Kaiser |
Towards Distributed Software Resilience in Asynchronous Many- Task Programming Models. |
FTXS@SC |
2020 |
DBLP DOI BibTeX RDF |
|
1 | Piyush Sao, Christian Engelmann, Srinivas Eswar, Oded Green, Richard W. Vuduc |
Self-stabilizing Connected Components. |
FTXS@SC |
2019 |
DBLP DOI BibTeX RDF |
|
1 | Carlos Pachajoa, Christina Pacher, Wilfried N. Gansterer |
Node-Failure-Resistant Preconditioned Conjugate Gradient Method without Replacement Nodes. |
FTXS@SC |
2019 |
DBLP DOI BibTeX RDF |
|
1 | Chun-Kai Chang, Guanpeng Li, Mattan Erez |
Evaluating Compiler IR-Level Selective Instruction Duplication with Realistic Hardware Errors. |
FTXS@SC |
2019 |
DBLP DOI BibTeX RDF |
|
1 | |
9th IEEE/ACM Workshop on Fault Tolerance for HPC at eXtreme Scale, FTXS@SC 2019, Denver, CO, USA, November 22, 2019 |
FTXS@SC |
2019 |
DBLP BibTeX RDF |
|
1 | Einar Horn, Dakota Fulp, Jon Calhoun 0001, Luke N. Olson |
FaultSight: A Fault Analysis Tool for HPC Researchers. |
FTXS@SC |
2019 |
DBLP DOI BibTeX RDF |
|
1 | Nuria Losada, Aurélien Bouteiller, George Bosilca |
Asynchronous Receiver-Driven Replay for Local Rollback of MPI Applications. |
FTXS@SC |
2019 |
DBLP DOI BibTeX RDF |
|
1 | Tyler Coy, Xuechen Zhang 0001 |
Enforcing Crash Consistency of Scientific Applications in Non-Volatile Main Memory Systems. |
FTXS@SC |
2019 |
DBLP DOI BibTeX RDF |
|
1 | Nuria Losada, Leonardo Bautista-Gomez, Kai Keller, Osman S. Unsal |
Towards Ad Hoc Recovery for Soft Errors. |
FTXS@SC |
2018 |
DBLP DOI BibTeX RDF |
|
1 | |
IEEE/ACM 8th Workshop on Fault Tolerance for HPC at eXtreme Scale, FTXS@SC 2018, Dallas, TX, USA, November 16, 2018 |
FTXS@SC |
2018 |
DBLP BibTeX RDF |
|
1 | Yawei Hui, Byung-Hoon Park, Christian Engelmann |
A Comprehensive Informative Metric for Analyzing HPC System Status Using the LogSCAN Platform. |
FTXS@SC |
2018 |
DBLP DOI BibTeX RDF |
|
1 | Marc Platini, Thomas Ropars, Benoit Pelletier, Noel De Palma |
CPU Overheating Characterization in HPC Systems: A Case Study. |
FTXS@SC |
2018 |
DBLP DOI BibTeX RDF |
|
1 | Anne Reinarz, Jean-Mathieu Gallard, Michael Bader |
Influence of A-Posteriori Subcell Limiting on Fault Frequency in Higher-Order DG Schemes. |
FTXS@SC |
2018 |
DBLP DOI BibTeX RDF |
|
1 | Carlos Pachajoa, Markus Levonyak, Wilfried N. Gansterer |
Extending and Evaluating Fault-Tolerant Preconditioned Conjugate Gradient Methods. |
FTXS@SC |
2018 |
DBLP DOI BibTeX RDF |
|
1 | Rizwan A. Ashraf, Christian Engelmann |
Analyzing the Impact of System Reliability Events on Applications in the Titan Supercomputer. |
FTXS@SC |
2018 |
DBLP DOI BibTeX RDF |
|
1 | Neil Agarwal, Hugh Greenberg, Sean Blanchard, Nathan DeBardeleben |
SaNSA - The Supercomputer and Node State Architecture. |
FTXS@SC |
2018 |
DBLP DOI BibTeX RDF |
|
1 | Alexandra Poulos, Dylan Wallace, Robert Robey, Laura Monroe, Vanessa Job, Sean Blanchard, William M. Jones, Nathan DeBardeleben |
Improving Application Resilience by Extending Error Correction with Contextual Information. |
FTXS@SC |
2018 |
DBLP DOI BibTeX RDF |
|
1 | Felix Loh, Kewal K. Saluja, Parameswaran Ramanathan |
Fault Tolerant Cholesky Factorization on GPUs. |
FTXS@SC |
2018 |
DBLP DOI BibTeX RDF |
|
Displaying result #1 - #37 of 37 (100 per page; Change: )
|
|