    • Analyzing a Five-year Failure Record of a Leadership-class Supercomputer 

      Rojas, Elvis; Meneses, Esteban; Jones, Terry; Maxwell, Don (Institute of Electrical and Electronics Engineers, Incorporated (IEEE), 2019-10-18)
      Extreme-scale computing systems are required to solve some of the grand challenges in science and technology. From astrophysics to molecular biology, supercomputers are an essential tool to accelerate scientific discovery. ...
    • Understanding failures through the lifetime of a top-level supercomputer 

      Rojas, Elvis; Meneses, Esteban; Jones, Terry; Maxwell, Don (Academic Press Inc., 2021-04-20)
      High performance computing systems are required to solve grand challenges in many scientific disciplines. These systems assemble many components to be powerful enough for solving extremely complex problems. An inherent ...