From facility to application sensor data: modular, continuous and holistic monitoring with DCDB A Netti, M Müller, A Auweter, C Guillen, M Ott, D Tafani, M Schulz Proceedings of the International Conference for High Performance Computing …, 2019 | 39 | 2019 |
DCDB wintermute: Enabling online and holistic operational data analytics on HPC systems A Netti, M Müller, C Guillen, M Ott, D Tafani, G Ozer, M Schulz Proceedings of the 29th International Symposium on High-Performance Parallel …, 2020 | 36 | 2020 |
A machine learning approach to online fault classification in HPC systems A Netti, Z Kiziltan, O Babaoglu, A Sîrbu, A Bartolini, A Borghesi Future Generation Computer Systems 110, 1009-1022, 2020 | 31 | 2020 |
FINJ: A fault injection tool for HPC systems A Netti, Z Kiziltan, O Babaoglu, A Sîrbu, A Bartolini, A Borghesi Euro-Par 2018: Parallel Processing Workshops: Euro-Par 2018 International …, 2019 | 20 | 2019 |
A conceptual framework for HPC operational data analytics A Netti, W Shin, M Ott, T Wilde, N Bates 2021 IEEE International Conference on Cluster Computing (CLUSTER), 596-603, 2021 | 19 | 2021 |
Towards a predictive energy model for HPC runtime systems using supervised learning G Ozer, S Garg, N Davoudi, G Poerwawinata, M Maiterth, A Netti, D Tafani Euro-Par 2019: Parallel Processing Workshops: Euro-Par 2019 International …, 2020 | 16 | 2020 |
AccaSim: a customizable workload management simulator for job dispatching research in HPC systems C Galleguillos, Z Kiziltan, A Netti, R Soto Cluster Computing 23 (1), 107-122, 2020 | 15 | 2020 |
Characterizing HPC performance variation with monitoring and unsupervised learning G Ozer, A Netti, D Tafani, M Schulz High Performance Computing: ISC High Performance 2020 International …, 2020 | 14 | 2020 |
Online fault classification in HPC systems through machine learning A Netti, Z Kiziltan, O Babaoglu, A Sîrbu, A Bartolini, A Borghesi Euro-Par 2019: Parallel Processing: 25th International Conference on …, 2019 | 12 | 2019 |
Correlation-wise smoothing: Lightweight knowledge extraction for HPC monitoring data A Netti, D Tafani, M Ott, M Schulz 2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021 | 11 | 2021 |
AccaSim: an HPC simulator for workload management C Galleguillos, Z Kiziltan, A Netti High Performance Computing: 4th Latin American Conference, CARLA 2017 …, 2018 | 11 | 2018 |
FPGA implementation of a real-time filter and sum beamformer for acoustic antenna A Netti, G Diodati, F Camastra, V Quaranta INTER-NOISE and NOISE-CON Congress and Conference Proceedings 250 (3), 3458-3469, 2015 | 11 | 2015 |
Operational data analytics in practice: experiences from design to deployment in production HPC environments A Netti, M Ott, C Guillen, D Tafani, M Schulz Parallel Computing 113, 102950, 2022 | 10 | 2022 |
Heterogeneity-aware resource allocation in HPC systems A Netti, C Galleguillos, Z Kiziltan, A Sîrbu, O Babaoglu High Performance Computing: 33rd International Conference, ISC High …, 2018 | 10 | 2018 |
HPC hardware design reliability benchmarking with HDFIT P Omland, A Netti, Y Peng, A Baldovin, M Paulitsch, G Espinosa, J Parra, ... IEEE Transactions on Parallel and Distributed Systems 34 (3), 995-1006, 2023 | 5 | 2023 |
Holistic and Portable Operational Data Analytics on Production HPC Systems A Netti Technische Universität München, 2022 | 3 | 2022 |
Energy-efficient runtime in HPC systems with machine learning G Ozer, S Garg, G Poerwawinata, N Davoudi, MDT LRZ, M Maiterth, ... Technical University of Munich, Data Innovation Lab, 2019 | 3 | 2019 |
Mixed precision support in HPC applications: What about reliability? A Netti, Y Peng, P Omland, M Paulitsch, J Parra, G Espinosa, U Agarwal, ... Journal of Parallel and Distributed Computing 181, 104746, 2023 | 2 | 2023 |
DEEP-EST T Moschny, C Clauß, ZU Huda, S Krempel, A Netti, M Nuessle, ... | | 2021 |
DEEP-EST N Eicker, T Moschny, C Clauß, ZU Huda, A Netti, M Nuessle, M Ott, ... Update (v2.0), 2019 | | 2019 |