Publications

(2024). Celerity-RSim - Porting Light Propagation Simulation to Accelerator Clusters using a High-Level API. HLPP 2024.

(2024). Achieving a deeper understanding of user-related influences on artificial lighting energy demand using high performance computing. BSA 2024.

(2024). The Role of Force Fields and Water Models in Protein Folding and Unfolding Dynamics. Journal of Chemical Theory and Computation.

DOI

(2023). Domain-specific energy modeling for drug discovery and magnetohydrodynamics applications. SC 2023 Workshops.

PDF DOI

(2023). High-resolution simulations of LS 5039. Astronomy & Astrophysics.

PDF DOI

(2023). Tunable and Portable Extreme-Scale Drug Discovery Platform at Exascale: the LIGATE Approach. CF 2023.

PDF DOI

(2023). An Asynchronous Dataflow-Driven Execution Model For Distributed Accelerator Computing. CCGrid 2023.

PDF DOI

(2022). User's needs influencing HPC technologies. PRACE Technical Report.

PDF DOI

(2021). Multi-GPU Room Response Simulation with Hardware Raytracing and Domain-specific Compression. CCPE.

PDF DOI

(2021). The Cluster Coffer: Teaching HPC on the Road. JPDC.

PDF DOI

(2021). Porting Real-World Applications to GPU Clusters: A Celerity and Cronos Case Study. IEEE eScience 2021.

PDF DOI

(2021). State-of-the-Art and Trends for Computing and Interconnect Network. PRACE Technical Report.

PDF DOI

(2020). AllScale API. Computing and Informatics.

PDF DOI

(2020). The AllScale Framework Architecture. Parallel Computing.

PDF DOI

(2020). AllScale toolchain pilot applications: PDE based solvers using a parallel development environment. Computer Physics Communications.

PDF DOI

(2019). The AllScale API. 2019 15th International Conference on eScience (eScience).

PDF DOI

(2019). Multi-Objective region-Aware optimization of parallel programs. Parallel Computing.

PDF DOI

(2018). Towards Automatic Compiler-assisted Performance and Energy Modeling for Message Passing Parallel Programs. ARCS Workshop 2018; 31th International Conference on Architecture of Computing Systems.

(2018). TOEP: Threshold Oriented Energy Prediction Mechanism for MPI-OpenMP Hybrid Applications. 2018 Eleventh International Conference on Contemporary Computing (IC3).

(2018). The AllScale Runtime Application Model. 2018 IEEE International Conference on Cluster Computing (CLUSTER).

PDF DOI

(2018). Exploring the Semantic Gap in Compiling Embedded DSLs. Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation.

PDF DOI

(2018). D6. 9--Installation, Integration and Deployment of the AllScale Environment and Pilot Applications.

(2018). CELERITY: Towards an Effective Programming Interface for GPU Clusters. 2018 26th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

PDF

(2018). A Taxonomy of Task-Based Technologies for High-Performance Computing. Parallel Processing and Applied Mathematics.

PDF DOI

(2018). A taxonomy of task-based parallel programming technologies for high-performance computing. The Journal of Supercomputing.

PDF DOI

(2018). A localised data assimilation framework within the ‘AllScale’ parallel development environment. OCEANS 2018 MTS/IEEE Charleston.

(2017). A Region-Aware Multi-Objective Auto-Tuner for Parallel Programs. 2017 46th International Conference on Parallel Processing Workshops (ICPPW).

PDF DOI

(2016). A particle-in-cell method for automatic load-balancing with the allscale environment. The Exascale applications & Software conference (EASC2016).

(2015). On the Potential of Significance-Driven Execution for Energy-Aware HPC. Comput. Sci..

PDF DOI

(2015). Energy Prediction of OpenMP Applications Using Random Forest Modeling Approach. 2015 IEEE International Parallel and Distributed Processing Symposium Workshop.

PDF DOI

(2014). Multi-Objective Auto-Tuning with Insieme: Optimization and Trade-Off Analysis for Time, Energy and Resource Usage. Euro-Par 2014 Parallel Processing.

PDF DOI

(2014). Modeling CPU Energy Consumption of HPC Applications on the IBM POWER7. 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

PDF DOI

(2012). Low-Latency Collectives for the Intel SCC. 2012 IEEE International Conference on Cluster Computing.

PDF DOI

(2012). A Multi-Objective Auto-Tuning Framework for Parallel Codes. Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis.

PDF DOI

(2011). Performance Analysis and Benchmarking of the Intel SCC. 2011 IEEE International Conference on Cluster Computing.

PDF DOI

(2009). A split connection PEP in ns-2.

PDF

(2009). Why Is This Web Page Coming Up so Slow? Investigating the Loss of SYN Packets. NETWORKING 2009.

PDF DOI