ONLINE PERFORMANCE OBSERVATION FOR HPC APPLICATIONS

dc.contributor.advisorMalony, Allen
dc.contributor.authorYokelson, Dewi
dc.date.accessioned2024-08-07T21:49:25Z
dc.date.available2024-08-07T21:49:25Z
dc.date.issued2024-08-07
dc.description.abstractThe exascale computing era is providing faster and more powerful systems for advanced HPC applications. However, it is increasingly challenging for programmers to utilize the range of hardware resources that make up these platforms to their fullest extent. Enabling larger, faster, and more diversified simulations requires performance monitoring tools that can integrate seamlessly with applications and operate efficiently in all desired configurations. In addition to critical computational bottlenecks, data movement and I/O performance issues are also important to monitor as data can quickly grow to terabytes and beyond. Thus, a major challenge in high-performance computing is maximizing the performance of many diverse simulations on expensive, energy consuming, and heterogeneous hardware. Furthermore, the landscape of scientific simulations is changing to include increasingly diverse and complex systems, such as coupled applications and workflows. This creates additional considerations in the performance analysis space, where dependencies and task scheduling can play a larger role. This dissertation presents an approach to addressing these issues, wherein we enable performance observability during runtime for different applications and workflows running on heterogeneous architectures. The framework we have created to support this valuable functionality is called Service-based Observability, Monitoring, and Analytics (SOMA). We show how it addresses diverse application and workflow needs across systems, while supporting many useful performance monitoring capabilities with reasonable overhead.en_US
dc.identifier.urihttps://hdl.handle.net/1794/29769
dc.language.isoen_US
dc.publisherUniversity of Oregon
dc.rightsAll Rights Reserved.
dc.subjecthigh performance computingen_US
dc.subjectperformance monitoringen_US
dc.titleONLINE PERFORMANCE OBSERVATION FOR HPC APPLICATIONS
dc.typeElectronic Thesis or Dissertation
thesis.degree.disciplineDepartment of Computer Science
thesis.degree.grantorUniversity of Oregon
thesis.degree.leveldoctoral
thesis.degree.namePh.D.

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Yokelson_oregon_0171A_13882.pdf
Size:
6.15 MB
Format:
Adobe Portable Document Format