Publications

Journal article Conference paper Book chapter Thesis Poster

2026 #

  • [J12]
    A. Mulone, D. Medić, I. Colonnelli, and M. Aldinucci, “A formal framework for fault tolerance in hybrid scientific workflows,” Future Generation Computer Systems, vol. 176, p. 108188, 2026.
    Abstract
    In large-scale distributed systems, failures are routine events whose occurrences increase with the number of computational tasks and execution locations. The advantage of representing an application as a workflow is the possibility of exploiting Workflow Management System (WMS) features such as portability, scalability, and, crucially, reliability. Among these, reliability is essential for ensuring robust execution in dynamic and failure-prone environments. In recent years, the emergence of hybrid workflows has posed new and intriguing challenges by increasing the possibility of distributing computations involving heterogeneous and independent environments. Consequently, the number of possible points of failure during the execution increased, creating a need for sophisticated fault tolerance mechanisms capable of addressing the specific requirements of hybrid systems. This work introduces a formal framework for a fault tolerance mechanism in hybrid workflows, enabling failure recovery through a rollback approach. The framework is rigorously defined by adapting and extending an existing workflow semantics tailored for hybrid execution. Our method leverages provenance data from workflow execution up to the point of failure, and creates a recovery workflow that spans multiple infrastructures. The rollback approach provides a robust and reliable strategy to ensure resilience against step failures and potential data loss. We then implement this mechanism in the StreamFlow WMS, and evaluate it using two case studies: the 1000 Genomes workflow and a synthetic workflow featuring iterative patterns. Experiments showcase the conceptual validity of our approach and assess the overhead introduced by the mechanism, including data availability checks.
    BibTeX
    @article{2026:fgcs:mulone,
      title        = {A formal framework for fault tolerance in hybrid scientific workflows},
      author       = {Alberto Mulone and
                      Doriana Medi\'{c} and
                      Iacopo Colonnelli and
                      Marco Aldinucci},
      journal      = {Future Generation Computer Systems},
      year         = {2026},
      volume       = {176},
      pages        = {108188},
      doi          = {10.1016/j.future.2025.108188},
      issn         = {0167-739X},
    }
    
    DOI PDF
  • [J11]
    M. E. Santimaria, I. Colonnelli, B. Cantalupo, M. Torquati, D. Medić, N. Tuccari, E. Sciacca, and M. Aldinucci, “Dynamic transparent streaming in file-based workflows with CAPIO,” Future Generation Computer Systems, p. 108159, 2026.
    Abstract
    Advances in big data and the growth in complexity of modern applications highlight the necessity for optimizing workflow executions on different levels, such as hybrid workflow executions, automatic optimization of data movements, and efficient use of IO. Following this line, streaming features are the desired capabilities for file-based workflows as they can reduce overall execution times. Expanding workflows with streaming capabilities usually requires rewriting the application, which is time-consuming and requires deep knowledge of the application. With this work, we introduce the Cross-Application Programmable IO (CAPIO) methodology, of which the stack is composed of two parts: the CAPIO-CL coordination language and the CAPIO middleware (which implements the semantics expressed by the CAPIO-CL coordination language). The CAPIO-CL coordination language annotates synchronization semantics between files produced and consumed by workflow steps. At the same time, the CAPIO middleware improves the performance of file-based workflows, leveraging the information provided by the CAPIO-CL language while not having to change (recompile) the code of the original workflow steps. By design, the CAPIO middleware supports multiple backends and can be extended to support more. It is dynamic, and it supports dynamic job scheduling. Benchmarks, done on both microbenchmarks and real-life workflows, prove that with CAPIO, it is possible to reduce the workflow execution time by up to 50
    BibTeX
    @article{2026:fgcs:santimaria,
      title        = {Dynamic transparent streaming in file-based workflows with {CAPIO}},
      author       = {Santimaria, Marco Edoardo and
                      Colonnelli, Iacopo and
                      Cantalupo, Barbara and
                      Torquati, Massimo and
                      Medi\'{c}, Doriana and
                      Tuccari, Nicola and
                      Sciacca, Eva and
                      Aldinucci, Marco},
      journal      = {Future Generation Computer Systems},
      year         = {2026},
      pages        = {108159},
      doi          = {10.1016/j.future.2025.108159},
      issn         = {0167-739X},
      publisher    = {Elsevier},
    }
    
    DOI PDF
  • [J10]
    L. Brescia, I. Colonnelli, R. Birke, V. Schiavoni, P. Felber, and M. Aldinucci, “A comprehensive performance evaluation of TEEs for confidential DNA alignment,” Future Generation Computer Systems, p. 108031, 2026.
    Abstract
    Data confidentiality is crucial when processing sensitive information, often limiting user interactions and shared computing services like the cloud. While Trusted Execution Environments (TEEs) offer a means to ensure privacy in untrusted environments, they frequently introduce significant computational overhead. DNA alignment, a key step in bioinformatics workflows, is privacy-sensitive and computationally intensive. Given its parallelizable nature, it is a compelling case study for evaluating the performance impact and scalability of various TEEs. This study assesses three TEEs – Intel SGX, Intel TDX, and AMD SEV-SNP – by evaluating their overhead through real-world bioinformatics workloads and system-level microbenchmarks. Our evaluation shows that SGX-based solutions incur substantial overhead, particularly for small workloads, with slowdowns ranging from 283
    BibTeX
    @article{2026:fgcs:brescia,
      title        = {A comprehensive performance evaluation of {TEEs} for confidential {DNA} alignment},
      author       = {Lorenzo Brescia and
                      Iacopo Colonnelli and
                      Robert Birke and
                      Valerio Schiavoni and
                      Pascal Felber and
                      Marco Aldinucci},
      journal      = {Future Generation Computer Systems},
      year         = {2026},
      pages        = {108031},
      doi          = {10.1016/j.future.2025.108031},
      issn         = {0167-739X},
    }
    
    DOI PDF
  • [J9]
    F. Suter, T. Coleman, İ. Altintaş, R. M. Badia, B. Balis, K. Chard, I. Colonnelli, E. Deelman, P. Di Tommaso, T. Fahringer, C. Goble, S. Jha, D. S. Katz, J. Köster, U. Leser, K. Mehta, H. Oliver, J. Peterson, G. Pizzi, L. Pottier, R. Sirvent, E. Suchyta, D. Thain, S. R. Wilkinson, J. M. Wozniak, and R. Ferreira da Silva, “A terminology for scientific workflow systems,” Future Generation Computer Systems, vol. 174, p. 107974, 2026.
    Abstract
    The term “scientific workflow” has evolved over the last two decades to encompass a broad range of compositions of interdependent compute tasks and data movements. It has also become an umbrella term for processing in modern scientific applications. Today, many scientific applications can be considered as workflows made of multiple dependent steps, and hundreds of workflow systems have been developed to manage and run these scientific workflows. However, no turnkey solution has emerged from the field to address the diversity of scientific processes and the infrastructure on which they are supposed to be implemented. Instead, new research problems requiring the execution of scientific workflows with some novel feature often lead to the development of an entirely new workflow system. A direct consequence of this situation is that many existing workflow management systems (WMSs) share some salient features, offer similar functionalities, and can manage the same categories of workflows but at the same time also have some distinct capabilities that can be important for specific applications. This situation makes researchers who develop workflows face the complex question of selecting a WMS. This selection can be driven by technical considerations, to find the system that is the most appropriate for their application and for the computing and storage resources available to them, or other factors such as reputation, adoption, strong community support, or long-term sustainability. To address this problem, a group of WMS developers and practitioners joined their efforts to produce a community-based terminology of WMSs. This paper summarizes their findings and introduces this new terminology to characterize WMSs. This terminology is composed of fives axes: workflow structure and characteristics, composition, orchestration, data management, and metadata capture. Each axis comprises several concepts that capture the prominent features of WMSs. Based on this terminology, this paper also presents a classification of 23 existing WMSs according to the proposed axes and terms.
    BibTeX
    @article{2026:fgcs:suter,
      title        = {A terminology for scientific workflow systems},
      author       = {Fr\'{e}d\'{e}ric Suter and
                      Tain\~{a} Coleman and
                      \.{I}lkay Altinta\c{s} and
                      Rosa M. Badia and
                      Bartosz Balis and
                      Kyle Chard and
                      Iacopo Colonnelli and
                      Ewa Deelman and
                      Paolo {Di Tommaso} and
                      Thomas Fahringer and
                      Carole Goble and
                      Shantenu Jha and
                      Daniel S. Katz and
                      Johannes K\"{o}ster and
                      Ulf Leser and
                      Kshitij Mehta and
                      Hilary Oliver and
                      J.-Luc Peterson and
                      Giovanni Pizzi and
                      Lo\"{i}c Pottier and
                      Ra\"{u}l Sirvent and
                      Eric Suchyta and
                      Douglas Thain and
                      Sean R. Wilkinson and
                      Justin M. Wozniak and
                      Rafael {Ferreira da Silva}},
      journal      = {Future Generation Computer Systems},
      year         = {2026},
      volume       = {174},
      pages        = {107974},
      doi          = {10.1016/j.future.2025.107974},
      issn         = {0167-739X},
    }
    
    DOI PDF

2025 #

  • [J8]
    M. E. Santimaria, A. R. Martinelli, I. Colonnelli, B. Cantalupo, M. Torquati, and M. Aldinucci, “CAPIO-CL: The CAPIO coordination language,” International Journal of Parallel Programming, vol. 53, p. 10, 2025.
    Abstract
    The performance bottleneck in file-based workflows remains a pressing issue in the realm of I/O-based workflows. To address this challenge, a novel annotation language has been developed. CAPIO-CL is positioned as an innovative I/O coordination language, enabling users to annotate data dependencies within file-based workflows with synchronization semantics pertinent to the involved files and directories. Through the information provided by the language, optimization opportunities arise in streaming and preemptive data movement. This paper serves to illustrate the semantics and syntax enabling CAPIO-CL to enhance the performance of in situ workflows without necessitating the rewriting or modification of the original workflow application steps. Finally, an analysis of CAPIO-CL is provided, taking into consideration both language expressiveness and application performance enhancement.
    BibTeX
    @article{2025:ijpp:santimaria,
      title        = {{CAPIO-CL}: The {CAPIO} Coordination Language},
      author       = {Santimaria, Marco Edoardo and
                      Martinelli, Alberto Riccardo and
                      Colonnelli, Iacopo and
                      Cantalupo, Barbara and
                      Torquati, Massimo and
                      Aldinucci, Marco},
      journal      = {International Journal of Parallel Programming},
      year         = {2025},
      volume       = {53},
      number       = {2},
      pages        = {10},
      doi          = {10.1007/s10766-025-00789-0},
      isbn         = {1573-7640},
      id           = {Santimaria2025},
    }
    
    DOI PDF
  • [C29]
    S. G. Contaldo, L. Bosio, J. E. H. Rea, E. L. Perottino, S. Rabellino, M. Aldinucci, M. Beccuti, and I. Colonnelli, “BookedSlurm: Meeting user needs for advanced resource reservations in Slurm,” in IEEE International Conference on eScience, eScience 2025, Chicago, IL, USA: IEEE, Sep. 2025, pp. 250–259.
    Abstract
    Modern scientific discovery is frequently backed by largescale scientific experiments, which cannot prescind from the unparalleled computational capabilities provided by highperformance computing systems. However, large data centers usually prioritize system efficiency over user accessibility, posing challenges for researchers without advanced computer science expertise. This work introduces BookedSlurm, a secure and userfocused extension of the Slurm workload manager, aiming to democratize HPC access across interdisciplinary research domains. BookedSlurm enables a partially decentralized regulation of finegrained advanced resource reservations through a novel creditbased framework, ensuring fair and predictable access to computing resources. Its modular architecture leverages dedicated microservices to manage reservations, credit handling, and accounting. These components are exposed through a secure REST API and an intuitive webbased dashboard, enhancing system usability for novice and expert users. While the dashboard simplifies interactions for nonspecialists, advanced users can directly access agentlevel APIs for more complex and automated operations. The effectiveness of BookedSlurm is validated on a realworld bioinformatics use case from the SUSMIRRI.IT project, showcasing its ability to enhance usability, optimize job scheduling, and streamline execution workflows.
    BibTeX
    @inproceedings{2025:escience:gepiro-contaldo,
      title        = {{BookedSlurm}: meeting user needs for advanced resource reservations
                      in {Slurm}},
      author       = {Sandro Gepiro Contaldo and
                      Lorenzo Bosio and
                      Janneth Estefania Hoyos Rea and
                      Elisa Li Perottino and
                      Sergio Rabellino and
                      Marco Aldinucci and
                      Marco Beccuti and
                      Iacopo Colonnelli},
      booktitle    = {{IEEE} International Conference on eScience, eScience 2025},
      year         = {2025},
      pages        = {250--259},
      month        = sep,
      doi          = {10.1109/ESCIENCE65000.2025.00037},
      publisher    = {{IEEE}},
      location     = {Chicago, IL, USA},
    }
    
    DOI PDF
  • [C28]
    N. Shukla, A. Romeo, C. Caravita, L. Riha, O. Vysocky, P. Strakos, M. Jaros, J. Barbosa, R. Vavrı́k, A. Mignone, M. Rossazza, S. Truzzi, V. Berta, I. Colonnelli, D. Medic, E. Boella, D. Gregori, E. Sciacca, L. Tornatore, G. Taffoni, P. J. Deka, F. Bacchini, R. Wilhelm, G. Doulis, K. Pierre, L. Rezzolla, T. Colman, B. Commerçon, O. Bouizi, M. Kuhn, E. Raffin, M. Sergent, R. Wissing, G. Marin, K. Dolag, G. S. Karademir, G. Perna, M. Zanotti, and S. Trujillo-Gomez, “EuroHPC SPACE CoE: Redesigning scalable parallel astrophysical codes for exascale. Invited paper,” in Proceedings of the 22nd ACM International Conference on Computing Frontiers: Workshops and Special Sessions, CF 2025, Cagliari, Italy: ACM, May 2025, pp. 177–184.
    Abstract
    High Performance Computing (HPC) based simulations are crucial in Astrophysics & Cosmology (A&C), helping scientists investigate and understand complex astrophysical phenomena. Taking advantage of exascale computing capabilities is essential for these efforts. However, the unprecedented architectural complexity of exascale systems impacts legacy codes. The SPACE Centre of Excellence (CoE) aims to re-engineer key astrophysical codes to tackle new computational challenges by adopting innovative programming paradigms and software (SW) solutions. SPACE brings together scientists, code developers, HPC experts, hardware (HW) manufacturers, and SW developers. This collaboration enhances exascale A C applications, promoting the use of exascale and post-exascale computing capabilities. Additionally, SPACE addresses high-performance data analysis for the massive data outputs from exascale simulations and modern observations, using machine learning (ML) and visualisation tools. The project facilitates application deployment across platforms by focusing on code repositories and data sharing, integrating European astrophysical communities around exascale computing with standardised SW and data protocols.
    BibTeX
    @inproceedings{2025:cf:shukla,
      title        = {{EuroHPC} {SPACE} {CoE}: Redesigning Scalable Parallel Astrophysical Codes
                      for Exascale. Invited Paper},
      author       = {Nitin Shukla and
                      Alessandro Romeo and
                      Caterina Caravita and
                      Lubomir Riha and
                      Ondrej Vysocky and
                      Petr Strakos and
                      Milan Jaros and
                      Jo{\~{a}}o Barbosa and
                      Radim Vavr{\'{\i}}k and
                      Andrea Mignone and
                      Marco Rossazza and
                      Stefano Truzzi and
                      Vittoria Berta and
                      Iacopo Colonnelli and
                      Doriana Medic and
                      Elisabetta Boella and
                      Daniele Gregori and
                      Eva Sciacca and
                      Luca Tornatore and
                      Giuliano Taffoni and
                      Pranab J. Deka and
                      Fabio Bacchini and
                      Rostislav{-}Paul Wilhelm and
                      Georgios Doulis and
                      Khalil Pierre and
                      Luciano Rezzolla and
                      Tine Colman and
                      Beno{\^{\i}}t Commer{\c{c}}on and
                      Othman Bouizi and
                      Matthieu Kuhn and
                      Erwan Raffin and
                      Marc Sergent and
                      Robert Wissing and
                      Guillermo Marin and
                      Klaus Dolag and
                      Geray S. Karademir and
                      Gino Perna and
                      Marisa Zanotti and
                      Sebastian Trujillo{-}Gomez},
      booktitle    = {Proceedings of the 22nd {ACM} International Conference on Computing
                   Frontiers: Workshops and Special Sessions, {CF} 2025},
      year         = {2025},
      pages        = {177--184},
      month        = may,
      doi          = {10.1145/3706594.3728892},
      publisher    = {{ACM}},
      location     = {Cagliari, Italy},
    }
    
    DOI PDF
  • [C27]
    L. Brescia, I. Colonnelli, V. Schiavoni, P. Felber, and M. Aldinucci, “End-to-end confidentiality with sev-snp leveraging in-memory storage,” in 2025 IEEE European Symposium on Security and Privacy Workshops (EuroS&PW), Los Alamitos, CA, USA: IEEE Computer Society, 2025, pp. 414–421.
    Abstract
    Confidential computing ensures data in-use protection in untrusted cloud environments, yet securing data atrest typically relies on Full Disk Encryption (FDE), which imposes significant performance overhead. This work proposes an alternative in-memory storage approach that eliminates FDE by leveraging SEV-SNP confidential virtual machines (CVMs). Our framework extends SNPGuard, an open-source platform for booting and attesting SEV-SNP VMs, to manage workload execution using temporary file systems (tmpfs), inherently secured by CVM memory encryption. By enabling seamless deployment of Docker based applications, our approach improves runtime and throughput by 20
    BibTeX
    @inproceedings{2025:eurospw:brescia,
      title        = {End-To-End Confidentiality with Sev-Snp Leveraging in-Memory Storage},
      author       = {Brescia, Lorenzo and
                      Colonnelli, Iacopo and
                      Schiavoni, Valerio and
                      Felber, Pascal and
                      Aldinucci, Marco},
      booktitle    = {2025 IEEE European Symposium on Security and Privacy Workshops (EuroS\&PW)},
      year         = {2025},
      pages        = {414-421},
      doi          = {10.1109/EuroSPW67616.2025.00054},
      publisher    = {IEEE Computer Society},
      address      = {Los Alamitos, CA, USA},
    }
    
    DOI PDF
  • [C26]
    M. Imbrosciano, E. Sciacca, F. Vitello, L. Pelonero, F. Franchina, U. Becciani, I. Colonnelli, and D. Medić, “The cloud-HPC infrastructure for hazard mapping and vulnerability monitoring (HaMMon),” in 2025 33rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP): IEEE, 2025, pp. 309–316.
    Abstract
    The HaMMon project is the outcome of an industrial partnership that includes many Italian research institutions and private companies. It is led by UnipolSai and Leitha, and funded by the ICSC, the Italian National Research Center for High Performance Computing, Big Data and Quantum Computing.The ambition of HaMMon is to build a flexible and scalable platform to analyze the hydrogeological and atmospheric balance of the Italian territory. The project aims to expand the current knowledge in hazard mapping, monitoring, and forecasting from an industrial perspective by leveraging innovative technologies and the interdisciplinary activities carried out by the ICSC.In this work, we present the cloud-HPC infrastructure deployed in the High-Performance Computing for Artificial Intelligence (HPC4AI) green data center of the University of Turin which supports the testing and development of HaMMon’s applications and services. We describe the current activities and preliminary results related to the integration of Photogrammetry techniques, Data Visualization and Artificial Intelligence technologies, applied on aerial images, to assess extreme natural events and evaluate their impact on risk-exposed assets.
    BibTeX
    @inproceedings{2025:pdp:imbrosciano,
      title        = {The Cloud-{HPC} infrastructure for Hazard Mapping and vulnerability Monitoring ({HaMMon})},
      author       = {Mauro Imbrosciano and
                      Eva Sciacca and
                      Fabio Vitello and
                      Leonardo Pelonero and
                      Francesco Franchina and
                      Ugo Becciani and
                      Iacopo Colonnelli and
                      Doriana Medi\'{c}},
      booktitle    = {2025 33rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)},
      year         = {2025},
      pages        = {309--316},
      doi          = {10.1109/PDP66500.2025.00050},
      issn         = {2377-5750},
      publisher    = {IEEE},
    }
    
    DOI PDF
  • [C25]
    N. Tuccari, E. Sciacca, F. Vitello, I. Colonnelli, Y. Becerra, E. S. Cintero, G. Marin, M. Jaros, L. Riha, P. Strakos, S. Trujillo-Gomez, E. Tramontana, and R. Wissing, “High performance visualization for astrophysics and cosmology,” in 2025 33rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP): IEEE, 2025, pp. 443–450.
    Abstract
    Modern Astrophysics and Cosmology (A&C) projects produce immense data volumes, necessitating advanced software tools for data access, storage, and analysis. Visualization Interface for the Virtual Observatory (VisIVO) is one such tool enabling multi-dimensional data analysis and knowledge discovery across complex astrophysical datasets. Leveraging containerization and virtualization, VisIVO has been deployed on various distributed computing platforms. Additionally, Blender, an open-source 3D suite, provides robust tools for rendering and processing volumetric data, making it suitable for visualizing complex datasets. At the SPACE Center of Excellence these tools are being adapted for high-performance visualization of cosmological simulations performed with GADGET and ChaNGa on pre-exascale systems. However, implementing high-performance visualization on diverse HPC platforms presents several challenges, including hardware and software compatibility, data management, scalability, performance portability, and efficient resource allocation. This paper outlines strategies to integrate VisIVO with workflow frameworks and streaming platforms to address these challenges. Workflow frameworks enhance portability, scheduling, and reproducibility of visualization workflows on pre-exascale systems used in A&C simulations. We also discuss the use of streaming platforms to enable concurrent (i.e. in-situ) analysis and visualization of simulations, reducing the need to store full simulation data by leveraging distributed databases that stream the output data in real time. Lastly, we present an adaptation of Blender to handle large-scale particle-based astrophysical data, offering high-quality visualization with interactive exploration capabilities.
    BibTeX
    @inproceedings{2025:pdp:tuccari,
      title        = {High Performance Visualization for Astrophysics and Cosmology},
      author       = {Nicola Tuccari and
                      Eva Sciacca and
                      Fabio Vitello and
                      Iacopo Colonnelli and
                      Yolanda Becerra and
                      Enric Sosa Cintero and
                      Guillermo Marin and
                      Milan Jaros and
                      Lubomir Riha and
                      Petr Strakos and
                      Sebastian Trujillo-Gomez and
                      Emiliano Tramontana and
                      Robert Wissing},
      booktitle    = {2025 33rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)},
      year         = {2025},
      pages        = {443--450},
      doi          = {10.1109/PDP66500.2025.00069},
      issn         = {2377-5750},
      publisher    = {IEEE},
    }
    
    DOI PDF
  • [C24]
    P. Taborsky, I. Colonnelli, K. Kurowski, R. Sarma, N. H. Pontoppidan, B. Jansı́k, N. S. Detlefsen, J. E. Pedersen, R. Larsen, and L. K. Hansen, “Towards a european HPC/AI ecosystem: A community-driven report,” in Procedia Computer Science, Amsterdam, Netherlands: Elsevier, 2025, pp. 140–149.
    Abstract
    The rapid advancements in AI and Machine Learning necessitate a robust computational infrastructure to support cutting-edge research and industrial applications. From the academic and industrial AI community perspective, voiced in the recent ELISE project, the European AI platform is recommended to center around the EuroHPC growing ecosystem. It should be user-driven, easily accessible, powerful, and compliant with European regulations. AI-optimized and dedicated supercomputers for the European AI community are also coming, in addition to upgrading partitions of existing EuroHPC systems to ’AI enabled’ stage. Related calls have been initiated in September 2024. Further, conventional EuroHPC systems are suggested to be extended with quantum computing, edge AI, and neuromorphic computing to cater to AI models deployed on network edge devices and sustainability in the long run. The challenges are presented in three case studies, ranging from training Transformers on HPC to LLMs trained federally across three different Euro HPC systems to recent results on hybrid classical-quantum application. This paper concludes with case studies results-informed next steps believed to benefit AI practitioners and the broader AI community.
    BibTeX
    @article{2025:eurohpc:taborsky,
      title        = {Towards a European {HPC}/{AI} ecosystem: a community-driven report},
      author       = {Petr Taborsky and
                      Iacopo Colonnelli and
                      Krzysztof Kurowski and
                      Rakesh Sarma and
                      Niels Henrik Pontoppidan and
                      Branislav Jans{\'\i}k and
                      Nicki Skafte Detlefsen and
                      Jens Egholm Pedersen and
                      Rasmus Larsen and
                      Lars Kai Hansen},
      booktitle    = {Proceedings of the Second EuroHPC user day},
      journal      = {Procedia Computer Science},
      year         = {2025},
      volume       = {255},
      pages        = {140--149},
      doi          = {10.1016/j.procs.2025.02.269},
      issn         = {1877-0509},
      publisher    = {Elsevier},
      address      = {Amsterdam, Netherlands},
    }
    
    DOI PDF
  • [C23]
    R. Rocco, S. Rizzo, M. Barbieri, G. Bettonte, E. Boella, F. Ganz, S. Iserte, A. J. Peña, P. Sandås, A. Scionti, O. Terzo, C. Vercellino, G. Vitali, P. Viviani, J. Frassineti, S. Marzella, D. Ottaviani, I. Colonnelli, and D. Gregori, “Dynamic solutions for hybrid quantum-HPC resource allocation,” in 2025 IEEE International Conference on Quantum Computing and Engineering (QCE), Albuquerque, NM, USA: IEEE, Sep. 2025, pp. 34–40.
    Abstract
    The integration of quantum computers within classical High-Performance Computing (HPC) infrastructures is receiving increasing attention, with the former expected to serve as accelerators for specific computational tasks. However, combining HPC and quantum computers presents significant technical challenges, including resource allocation. This paper presents a novel malleability-based approach, alongside a workflow-based strategy, to optimize resource utilization in hybrid HPC-quantum workloads. With both these approaches, we can release classical resources when computations are offloaded to the quantum computer and reallocate them once quantum processing is complete. Our experiments with a hybrid HPC-quantum use case show the benefits of dynamic allocation, highlighting the potential of those solutions.
    BibTeX
    @inproceedings{2025:qce:rocco,
      title        = {Dynamic Solutions for Hybrid Quantum-{HPC} Resource Allocation},
      author       = {Roberto Rocco and
                      Simone Rizzo and
                      Matteo Barbieri and
                      Gabriella Bettonte and
                      Elisabetta Boella and
                      Fulvio Ganz and
                      Sergio Iserte and
                      Antonio J. Pe{\~{n}}a and
                      Petter Sand{\aa}s and
                      Alberto Scionti and
                      Olivier Terzo and
                      Chiara Vercellino and
                      Giacomo Vitali and
                      Paolo Viviani and
                      Jonathan Frassineti and
                      Sara Marzella and
                      Daniele Ottaviani and
                      Iacopo Colonnelli and
                      Daniele Gregori},
      booktitle    = {2025 {IEEE} International Conference on Quantum Computing and Engineering (QCE)},
      year         = {2025},
      pages        = {34--40},
      month        = sep,
      doi          = {10.1109/QCE65121.2025.10289},
      publisher    = {{IEEE}},
      location     = {Albuquerque, NM, USA},
    }
    
    DOI PDF
  • [C22]
    M. E. Santimaria, R. Filgueira, D. Medić, I. Colonnelli, and M. Aldinucci, “Overcoming dynamic I/O boundaries: A double-sided streaming methodology with dispel4py and CAPIO,” in Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC Workshops '25), St Louis, MO, USA: ACM, Nov. 2025, pp. 2269–2280.
    Abstract
    This work introduces a novel double-sided streaming methodology that combines control-plane and data-plane streaming. Our goal is to implement the long-advocated separation of concerns in workflow orchestration without introducing artificial boundaries in their execution. Our approach is exemplified by the integration of control-plane streaming provided by dispel4py and the transparent data-plane streaming provided by CAPIO. Our integration eliminates file synchronization barriers without requiring modifications to existing workflow logic. To support this, we extend CAPIO with a new commit rule that allows streaming over dynamically generated file sets, enabling hybrid workflows that blend in-memory dataflows with file-based communication. We validate our approach using a real-world seismic cross-correlation workflow, achieving performance improvements between 23
    BibTeX
    @inproceedings{2025:scw:santimaria,
      title        = {Overcoming Dynamic {I/O} Boundaries: a Double-Sided Streaming Methodology with dispel4py and {CAPIO}},
      author       = {Santimaria, Marco Edoardo and
                      Filgueira, Rosa and
                      Medi{\'c}, Doriana and
                      Colonnelli, Iacopo and
                      Aldinucci, Marco},
      booktitle    = {Workshops of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC Workshops '25)},
      year         = {2025},
      pages        = {2269–-2280},
      month        = nov,
      doi          = {10.1145/3731599.3767577},
      publisher    = {ACM},
      location     = {St Louis, MO, USA},
    }
    
    DOI PDF

2024 #

  • [J7]
    S. Leo, M. R. Crusoe, L. Rodríguez-Navas, R. Sirvent, A. Kanitz, P. D. Geest, R. Wittner, L. Pireddu, D. Garijo, J. M. Fernández, I. Colonnelli, M. Gallo, T. Ohta, H. Suetake, S. Capella-Gutierrez, R. Wit, B. P. Kinoshita, and S. Soiland-Reyes, “Recording provenance of workflow runs with RO-Crate,” PLoS ONE, vol. 19, pp. 1–35, 2024.
    Abstract
    Recording the provenance of scientific computation results is key to the support of traceability, reproducibility and quality assessment of data products. Several data models have been explored to address this need, providing representations of workflow plans and their executions as well as means of packaging the resulting information for archiving and sharing. However, existing approaches tend to lack interoperable adoption across workflow management systems. In this work we present Workflow Run RO-Crate, an extension of RO-Crate (Research Object Crate) and Schema.org to capture the provenance of the execution of computational workflows at different levels of granularity and bundle together all their associated objects (inputs, outputs, code, etc.). The model is supported by a diverse, open community that runs regular meetings, discussing development, maintenance and adoption aspects. Workflow Run RO-Crate is already implemented by several workflow management systems, allowing interoperable comparisons between workflow runs from heterogeneous systems. We describe the model, its alignment to standards such as W3C PROV, and its implementation in six workflow systems. Finally, we illustrate the application of Workflow Run RO-Crate in two use cases of machine learning in the digital image analysis domain.
    BibTeX
    @article{2024:plos-one:leo,
      title        = {Recording provenance of workflow runs with {RO-Crate}},
      author       = {Simone Leo and
                      Michael R. Crusoe and
                      Laura Rodr\'{i}guez-Navas and
                      Ra\"{u}l Sirvent and
                      Alexander Kanitz and
                      Paul De Geest and
                      Rudolf Wittner and
                      Luca Pireddu and
                      Daniel Garijo and
                      Jos\'{e} M. Fern\'{a}ndez and
                      Iacopo Colonnelli and
                      Matej Gallo and
                      Tazro Ohta and
                      Hirotaka Suetake and
                      Salvador Capella-Gutierrez and
                      Renske de Wit and
                      Bruno P. Kinoshita and
                      Stian Soiland-Reyes},
      journal      = {PLoS ONE},
      year         = {2024},
      volume       = {19},
      number       = {9},
      pages        = {1--35},
      doi          = {10.1371/journal.pone.0309210},
      publisher    = {Public Library of Science},
    }
    
    DOI PDF
  • [C21]
    L. Brescia, I. Colonnelli, and M. Aldinucci, “Performance analysis on DNA alignment workload with intel SGX multithreading,” in Proceedings of BigHPC2024: Special Track on Big Data and High-Performance Computing, co-located with the 3rd Italian Conference on Big Data and Data Science, ITADATA2024: CEUR-WS.org, 2024.
    Abstract
    Data confidentiality is a critical issue in the digital age, impacting interactions between users and public services and between scientific computing organizations and Cloud and HPC providers. Performance in parallel computing is essential, yet techniques for establishing Trusted Execution Environments (TEEs) to ensure privacy in remote environments often negatively impact execution time. This paper aims to analyze the performance of a parallel bioinformatics workload for DNA alignment (Bowtie2) executed within the confidential enclaves of Intel SGX processors. The results provide encouraging insights regarding the feasibility of using SGX-based TEEs for parallel computing on large datasets. The findings indicate that, under conditions of high parallelization and with twice as many threads, workloads executed within SGX enclaves perform, on average, 15% faster than non-confidential execution. This empirical demonstration supports the potential of SGX-based TEEs to effectively balance the need for privacy with the demands of high-performance computing.
    BibTeX
    @inproceedings{2024:itadata:brescia,
      title        = {Performance Analysis on {DNA} Alignment Workload with Intel {SGX} Multithreading},
      author       = {Brescia, Lorenzo and
                      Colonnelli, Iacopo and
                      Aldinucci, Marco},
      booktitle    = {Proceedings of BigHPC2024: Special Track on Big Data and High-Performance Computing, co-located with the 3\textsuperscript{rd} Italian Conference on Big Data and Data Science, ITADATA2024},
      year         = {2024},
      volume       = {3785},
      publisher    = {CEUR-WS.org},
      series       = {{CEUR} Workshop Proceedings},
    }
    
    PDF
  • [C20]
    I. Colonnelli, D. Medić, A. Mulone, V. Bono, L. Padovani, and M. Aldinucci, “Introducing SWIRL: An intermediate representation language for scientific workflows,” in Formal Methods. FM 2024, Milano, Italy: Springer Nature Switzerland, Sep. 2024, pp. 226–244.
    Abstract
    In the ever-evolving landscape of scientific computing, properly supporting the modularity and complexity of modern scientific applications requires new approaches to workflow execution, like seamless interoperability between different workflow systems, distributed-by-design workflow models, and automatic optimisation of data movements. In order to address this need, this article introduces SWIRL, an intermediate representation language for scientific workflows. In contrast with other product-agnostic workflow languages, SWIRL is not designed for human interaction but to serve as a low-level compilation target for distributed workflow execution plans. The main advantages of SWIRL semantics are low-level primitives based on the send/receive programming model and a formal framework ensuring the consistency of the semantics and the specification of translating workflow models represented by Directed Acyclic Graphs (DAGs) into SWIRL workflow descriptions. Additionally, SWIRL offers rewriting rules designed to optimise execution traces, accompanied by corresponding equivalence. An open-source SWIRL compiler toolchain has been developed using the ANTLR Python3 bindings.
    BibTeX
    @inproceedings{2024:fm:colonnelli,
      title        = {Introducing {SWIRL}: An Intermediate Representation Language for Scientific Workflows},
      author       = {Iacopo Colonnelli and
                      Doriana Medi\'{c} and
                      Alberto Mulone and
                      Viviana Bono and
                      Luca Padovani and
                      Marco Aldinucci},
      booktitle    = {Formal Methods. FM 2024},
      year         = {2024},
      volume       = {14933},
      pages        = {226--244},
      month        = sep,
      doi          = {10.1007/978-3-031-71162-6_12},
      publisher    = {Springer Nature Switzerland},
      address      = {Milano, Italy},
      series       = {Lecture Notes in Computer Science},
    }
    
    DOI PDF
  • [C19]
    I. Colonnelli, R. Birke, G. Malenza, G. Mittone, A. Mulone, J. Galjaard, L. Y. Chen, S. Bassini, G. Scipione, J. Martinovič, V. Vondrák, and M. Aldinucci, “Cross-facility federated learning,” in Procedia Computer Science, Bruxelles, Belgium: Elsevier, 2024, pp. 3–12.
    Abstract
    In a decade, AI frontier research transitioned from the researcher’s workstation to thousands of high-end hardware-accelerated compute nodes. This rapid evolution shows no signs of slowing down in the foreseeable future. While top cloud providers may be able to keep pace with this growth rate, obtaining and efficiently exploiting computing resources at that scale is a daunting challenge for universities and SMEs. This work introduces the Cross-Facility Federated Learning (XFFL) framework to bridge this compute divide, extending the opportunity to efficiently exploit multiple independent data centres for extreme-scale deep learning tasks to data scientists and domain experts. XFFL relies on hybrid workflow abstractions to decouple tasks from environment-specific technicalities, reducing complexity and enhancing reusability. In addition, Federated Learning (FL) algorithms eliminate the need to move large amounts of data between different facilities, reducing time-to-solution and preserving data privacy. The XFFL approach is empirically evaluated by training a full LLaMAv2 7B instance on two facilities of the EuroHPC JU, showing how the increased computing power completely compensates for the additional overhead introduced by two data centres.
    BibTeX
    @article{2024:eurohpc:colonnelli,
      title        = {Cross-Facility Federated Learning},
      author       = {Iacopo Colonnelli and
                      Robert Birke and
                      Giulio Malenza and
                      Gianluca Mittone and
                      Alberto Mulone and
                      Jeroen Galjaard and
                      Lydia Y. Chen and
                      Sanzio Bassini and
                      Gabriella Scipione and
                      Jan Martinovi\v{c} and
                      Vit Vondr\'{a}k and
                      Marco Aldinucci},
      booktitle    = {Proceedings of the First EuroHPC user day},
      journal      = {Procedia Computer Science},
      year         = {2024},
      volume       = {240},
      pages        = {3--12},
      doi          = {10.1016/j.procs.2024.07.003},
      issn         = {1877-0509},
      publisher    = {Elsevier},
      address      = {Bruxelles, Belgium},
    }
    
    DOI PDF
  • [C18]
    M. E. Santimaria, S. Fonio, G. Malenza, I. Colonnelli, and M. Aldinucci, “Benchmarking parallelization models through Karmarkar interior-point method,” in Proc. of 32nd Euromicro intl. Conference on Parallel, Distributed and Network-based Processing (PDP), New York City, NYC, USA: IEEE, Mar. 2024, pp. 1–8.
    Abstract
    Optimization problems are one of the main focus of scientific research. Their computational-intensive nature makes them prone to be parallelized with consistent improvements in performance. This paper sheds light on different parallel models for accelerating Karmarkar’s Interior-point method. To do so, we assess parallelization strategies for individual operations within the aforementioned Karmarkar’s algorithm using OpenMP, GPU acceleration with CUDA, and the recent Parallel Standard C++ Linear Algebra library (PSTL) executing both on GPU and CPU. Our different implementations yield interesting benchmark results that show the optimal approach for parallelizing interior point algorithms for general Linear Programming (LP) problems. In addition, we propose a more theoretical perspective of the parallelization of this algorithm, with a detailed study of our OpenMP implementation, showing the limits of optimizing the single operations
    BibTeX
    @inproceedings{2024:pdp:santimaria,
      title        = {Benchmarking Parallelization Models through {Karmarkar} Interior-point method},
      author       = {Santimaria, Marco Edoardo and
                      Fonio, Samuele and
                      Malenza, Giulio and
                      Colonnelli, Iacopo and
                      Aldinucci, Marco},
      booktitle    = {Proc. of 32nd Euromicro intl. Conference on Parallel, Distributed and Network-based Processing (PDP)},
      year         = {2024},
      pages        = {1-8},
      month        = mar,
      doi          = {10.1109/PDP62718.2024.00010},
      issn         = {2377-5750},
      isbn         = {979-8-3503-6307-4},
      publisher    = {IEEE},
      address      = {New York City, NYC, USA},
      location     = {Dublin, Ireland},
    }
    
    DOI PDF
  • [C17]
    B. Casella, I. Colonnelli, G. Mittone, R. Birke, W. Riviera, A. Sciarappa, C. Cavazzoni, and M. Aldinucci, “A performance analysis for confidential federated learning,” in Proceedings of the 2024 Deep Learning Security and Privacy Workshop, IEEE Symposium on Security and Privacy 2024, San Francisco, CA, May 2024.
    Abstract
    Federated Learning (FL) has emerged as a solution to preserve data privacy by keeping the data locally on each participant’s device. However, FL alone is still vulnerable to attacks that can cause privacy leaks. Therefore, it becomes necessary to take additional security measures at the cost of increasing runtimes. The Trusted Execution Environment (TEE) approach promises to offer the highest degree of security during execution. However, TEEs suffer from memory limits which prevent safe end-to-end FL training of modern deep models. State-of- the-art approaches limit secure training to selected layers, failing to avert the full spectrum of attacks or adopt layer-wise training affecting model performance. We benchmark the usage of a library OS (LibOS) to run the full, unmodified end-to-end FL training inside the TEE. We extensively evaluate and model the overhead of the different security mechanisms needed to protect the data and model during computation (TEE), communication (TLS), and storage (disk encryption). The obtained results across three datasets and two models demonstrate that LibOSes are a viable way to seamlessly inject security into FL with limited overhead (at most 2x), offering valuable guidance for researchers and developers aiming to apply FL in data-security-focused contexts.
    BibTeX
    @inproceedings{2024:spw:casella,
      title        = {A Performance Analysis for Confidential Federated Learning},
      author       = {Casella, Bruno and
                      Colonnelli, Iacopo and
                      Mittone, Gianluca and
                      Birke, Robert and
                      Riviera, Walter and
                      Sciarappa, Antonio and
                      Cavazzoni, Carlo and
                      Aldinucci, Marco},
      booktitle    = {Proceedings of the 2024 Deep Learning Security and Privacy Workshop, IEEE Symposium on Security and Privacy 2024},
      year         = {2024},
      month        = may,
      doi          = {10.1109/SPW63631.2024.00009},
      location     = {San Francisco, CA},
    }
    
    DOI PDF

2023 #

  • [C16]
    I. Colonnelli, “Workflow models for heterogeneous distributed systems,” in Proceedings of the 2nd Italian Conference on Big Data and Data Science (ITADATA 2023), Naples, Italy: CEUR-WS.org, Sep. 2023.
    Abstract
    This article introduces a novel hybrid workflow abstraction that injects topology awareness directly into the definition of a distributed workflow model. In particular, the article briefly discusses the advantages brought by this approach to the design and orchestration of large-scale data-oriented workflows, the current level of support from state-of-the-art workflow systems, and some future research directions.
    BibTeX
    @inproceedings{2023:itadata:colonnelli,
      title        = {Workflow Models for Heterogeneous Distributed Systems},
      author       = {Iacopo Colonnelli},
      booktitle    = {Proceedings of the 2nd Italian Conference on Big Data and Data Science ({ITADATA} 2023)},
      year         = {2023},
      volume       = {3606},
      month        = sep,
      publisher    = {CEUR-WS.org},
      location     = {Naples, Italy},
      series       = {{CEUR} Workshop Proceedings},
    }
    
    PDF
  • [C15]
    A. R. Martinelli, M. Torquati, M. Aldinucci, I. Colonnelli, and B. Cantalupo, “CAPIO: A middleware for transparent I/O streaming in data-intensive workflows,” in 2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics (HiPC), Goa, India: IEEE, Dec. 2023.
    Abstract
    With the increasing amount of digital data available for analysis and simulation, the class of I/O-intensive HPC workflows is fated to quickly expand, further exacerbating the performance gap between computing, memory, and storage technologies. This paper introduces CAPIO (Cross-Application Programmable I/O), a middleware capable of injecting I/O streaming capabilities into file-based workflows, improving the computation-I/O overlap without the need to change the application code. The contribution is twofold: 1) at design time, a new I/O coordination language allows users to annotate workflow data dependencies with synchronization semantics; 2) at run time, a user-space middleware automatically and transparently to the user turns a workflow batch execution into a streaming execution according to the semantics expressed in the configuration file. CAPIO has been tested on synthetic benchmarks simulating typical workflow I/O patterns and two real-world workflows. Experiments show that CAPIO reduces the execution time by 10% to 66% for data-intensive workflows that use the file system as a communication medium.
    BibTeX
    @inproceedings{2023:hipc:martinelli,
      title        = {{CAPIO}: a Middleware for Transparent {I/O} Streaming in Data-Intensive Workflows},
      author       = {Alberto Riccardo Martinelli and
                      Massimo Torquati and
                      Marco Aldinucci and
                      Iacopo Colonnelli and
                      Barbara Cantalupo},
      booktitle    = {2023 IEEE 30th International Conference on High Performance Computing, Data, and Analytics (HiPC)},
      year         = {2023},
      month        = dec,
      doi          = {10.1109/HiPC58850.2023.00031},
      publisher    = {{IEEE}},
      address      = {Goa, India},
    }
    
    DOI PDF
  • [C14]
    M. Aldinucci, E. M. Baralis, V. Cardellini, I. Colonnelli, M. Danelutto, S. Decherchi, G. D. Modica, L. Ferrucci, M. Gribaudo, F. Iannone, M. Lapegna, D. Medic, G. Muscianisi, F. Righetti, E. Sciacca, N. Tonellotto, M. Tortonesi, P. Trunfio, and T. Vardanega, “A systematic mapping study of Italian research on workflows,” in Proceedings of the SC '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, SC-W 2023, Denver, CO, USA: ACM, Nov. 2023, pp. 2065–2076.
    Abstract
    An entire ecosystem of methodologies and tools revolves around scientific workflow management. They cover crucial non-functional requirements that standard workflow models fail to target, such as interactive execution, energy efficiency, performance portability, Big Data management, and intelligent orchestration in the Computing Continuum. Characterizing and monitoring this ecosystem is crucial to develop an informed view of current and future research directions. This work conducts a systematic mapping study of the Italian workflow research community, collecting and analyzing 25 tools and 10 applications from several scientific domains in the context of the “National Research Centre for HPC, Big Data, and Quantum Computing” (ICSC). The study aims to outline the main current research directions and determine how they address the critical needs of modern scientific applications. The findings highlight a variegated research ecosystem of tools, with a prominent interest in advanced workflow orchestration and still immature but promising efforts toward energy efficiency.
    BibTeX
    @inproceedings{2023:scw:aldinucci,
      title        = {A Systematic Mapping Study of {Italian} Research on Workflows},
      author       = {Marco Aldinucci and
                      Elena Maria Baralis and
                      Valeria Cardellini and
                      Iacopo Colonnelli and
                      Marco Danelutto and
                      Sergio Decherchi and
                      Giuseppe Di Modica and
                      Luca Ferrucci and
                      Marco Gribaudo and
                      Francesco Iannone and
                      Marco Lapegna and
                      Doriana Medic and
                      Giuseppa Muscianisi and
                      Francesca Righetti and
                      Eva Sciacca and
                      Nicola Tonellotto and
                      Mauro Tortonesi and
                      Paolo Trunfio and
                      Tullio Vardanega},
      booktitle    = {Proceedings of the {SC} '23 Workshops of The International Conference on High Performance Computing, Network, Storage, and Analysis, {SC-W} 2023},
      year         = {2023},
      pages        = {2065--2076},
      month        = nov,
      doi          = {10.1145/3624062.3624285},
      publisher    = {{ACM}},
      address      = {Denver, CO, USA},
    }
    
    DOI PDF
  • [C13]
    W. Fornaciari, F. Reghenzani, F. Terraneo, D. Baroffio, C. Metra, M. Omana, J. E. R. Condia, M. S. Reorda, R. Birke, I. Colonnelli, G. Mittone, M. Aldinucci, G. Mencagli, F. Iannone, F. Palombi, G. Zummo, D. Cesarini, and F. Tesser, “RISC-V-based platforms for HPC: Analyzing non-functional properties for future HPC and Big-Data clusters,” in Embedded Computer Systems: Architectures, Modeling, and Simulation - 23rd International Conference, SAMOS 2023, Samos, Greece, 2023.
    Abstract
    High-PerformanceComputing(HPC)haveevolvedtobeused to perform simulations of systems where physical experimentation is pro- hibitively impractical, expensive, or dangerous. This paper provides a general overview and showcases the analysis of non-functional properties in RISC-V-based platforms for HPCs. In particular, our analyses target the evaluation of power and energy control, thermal management, and reliability assessment of promising systems, structures, and technologies devised for current and future generation of HPC machines. The main set of design methodologies and technologies developed within the activ- ities of the Future and HPC Big Data spoke of the National Centre of HPC, Big Data and Quantum Computing project are described along with the description of the testbed for experimenting two-phase cooling approaches.
    BibTeX
    @inproceedings{2023:samos:fornaciari,
      title        = {{RISC-V}-based Platforms for {HPC}: Analyzing Non-functional Properties for Future {HPC} and {Big-Data} Clusters},
      author       = {William Fornaciari and
                      Federico Reghenzani and
                      Federico Terraneo and
                      Davide Baroffio and
                      Cecilia Metra and
                      Martin Omana and
                      Josie E. Rodriguez Condia and
                      Matteo Sonza Reorda and
                      Robert Birke and
                      Iacopo Colonnelli and
                      Gianluca Mittone and
                      Marco Aldinucci and
                      Gabriele Mencagli and
                      Francesco Iannone and
                      Filippo Palombi and
                      Giuseppe Zummo and
                      Daniele Cesarini and
                      Federico Tesser},
      booktitle    = {{Embedded Computer Systems: Architectures, Modeling, and Simulation - 23rd International Conference, {SAMOS} 2023}},
      year         = {2023},
      doi          = {10.1007/978-3-031-46077-7_26},
      address      = {Samos, Greece},
    }
    
    DOI PDF
  • [C12]
    G. Mittone, W. Riviera, I. Colonnelli, R. Birke, and M. Aldinucci, “Model-agnostic federated learning,” in Euro-Par 2023: Parallel Processing, Limassol, Cyprus: Computer Science Department, University of Torino; Springer, Aug. 2023, pp. 383–396.
    Abstract
    Since its debut in 2016, Federated Learning (FL) has been tied to the inner workings of Deep Neural Networks (DNNs). On the one hand, this allowed its development and widespread use as DNNs proliferated. On the other hand, it neglected all those scenarios in which using DNNs is not possible or advantageous. The fact that most current FL frameworks only allow training DNNs reinforces this problem. To address the lack of FL solutions for non-DNN-based use cases, we propose MAFL (Model-Agnostic Federated Learning). MAFL marries a model-agnostic FL algorithm, AdaBoost.F, with an open industry-grade FL framework: Intel OpenFL. MAFL is the first FL system not tied to any specific type of machine learning model, allowing exploration of FL scenarios beyond DNNs and trees. We test MAFL from multiple points of view, assessing its correctness, flexibility and scaling properties up to 64 nodes. We optimised the base software achieving a 5.5x speedup on a standard FL scenario. MAFL is compatible with x86-64, ARM-v8, Power and RISC-V.
    BibTeX
    @inproceedings{2023:euro-par:mittone,
      title        = {Model-Agnostic Federated Learning},
      author       = {Mittone, Gianluca and
                      Riviera, Walter and
                      Colonnelli, Iacopo and
                      Birke, Robert and
                      Aldinucci, Marco},
      booktitle    = {Euro-Par 2023: Parallel Processing},
      year         = {2023},
      volume       = {14100},
      pages        = {383--396},
      month        = aug,
      doi          = {10.1007/978-3-031-39698-4_26},
      publisher    = {{Springer}},
      address      = {Limassol, Cyprus},
      institution  = {Computer Science Department, University of Torino},
    }
    
    DOI PDF
  • [C11]
    G. Mittone, N. Tonci, R. Birke, I. Colonnelli, D. Medić, A. Bartolini, R. Esposito, E. Parisi, F. Beneventi, M. Polato, M. Torquati, L. Benini, and M. Aldinucci, “Experimenting with emerging RISC-V systems for decentralised machine learning,” in 20th ACM International Conference on Computing Frontiers (CF '23), Bologna, Italy: Computer Science Department, University of Torino; ACM, May 2023.
    Abstract
    Decentralised Machine Learning (DML) enables collaborative machine learning without centralised input data. Federated Learning (FL) and Edge Inference are examples of DML. While tools for DML (especially FL) are starting to flourish, many are not flexible and portable enough to experiment with novel systems (e.g., RISC-V), non-fully connected topologies, and asynchronous collaboration schemes. We overcome these limitations via a domain-specific language allowing to map DML schemes to an underlying middleware, i.e. the FastFlow parallel programming library. We experiment with it by generating different working DML schemes on two emerging architectures (ARM-v8, RISC-V) and the x86-64 platform. We characterise the performance and energy efficiency of the presented schemes and systems. As a byproduct, we introduce a RISC-V porting of the PyTorch framework, the first publicly available to our knowledge.
    BibTeX
    @inproceedings{2023:cf:mittone,
      title        = {Experimenting with Emerging {RISC-V} Systems for Decentralised Machine Learning},
      author       = {Mittone, Gianluca and
                      Tonci, Nicol{\`o} and
                      Birke, Robert and
                      Colonnelli, Iacopo and
                      Medi\'{c}, Doriana and
                      Bartolini, Andrea and
                      Esposito, Roberto and
                      Parisi, Emanuele and
                      Beneventi, Francesco and
                      Polato, Mirko and
                      Torquati, Massimo and
                      Benini, Luca and
                      Aldinucci, Marco},
      booktitle    = {20th {ACM} International Conference on Computing Frontiers ({CF} '23)},
      year         = {2023},
      month        = may,
      doi          = {10.1145/3587135.3592211},
      isbn         = {979-8-4007-0140-5/23/05},
      publisher    = {{ACM}},
      address      = {Bologna, Italy},
      institution  = {Computer Science Department, University of Torino},
    }
    
    DOI PDF
  • [C10]
    I. Colonnelli, B. Casella, G. Mittone, Y. Arfat, B. Cantalupo, R. Esposito, A. R. Martinelli, D. Medić, and M. Aldinucci, “Federated learning meets HPC and cloud,” in Astrophysics and Space Science Proceedings, Catania, Italy: Springer, 2023, pp. 193–199.
    Abstract
    HPC and AI are fated to meet for several reasons. This article will discuss some of them and argue why this will happen through the set of methods and technologies that underpin cloud computing. As a paradigmatic example, we present a new federated learning system that collaboratively trains a deep learning model in different supercomputing centers. The system is based on the StreamFlow workflow manager designed for hybrid cloud-HPC infrastructures.
    BibTeX
    @inproceedings{2023:ml4astro:colonnelli,
      title        = {Federated Learning meets {HPC} and cloud},
      author       = {Iacopo Colonnelli and
                      Bruno Casella and
                      Gianluca Mittone and
                      Yasir Arfat and
                      Barbara Cantalupo and
                      Roberto Esposito and
                      Alberto Riccardo Martinelli and
                      Doriana Medi\'{c} and
                      Marco Aldinucci},
      booktitle    = {Astrophysics and Space Science Proceedings},
      year         = {2023},
      volume       = {60},
      pages        = {193--199},
      doi          = {10.1007/978-3-031-34167-0_39},
      isbn         = {978-3-031-34167-0},
      publisher    = {Springer},
      address      = {Catania, Italy},
    }
    
    DOI PDF
  • [C9]
    Y. Arfat, G. Mittone, I. Colonnelli, F. D’Ascenzo, R. Esposito, and M. Aldinucci, “Pooling critical datasets with federated learning,” in 31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2023, Napoli, Italy: IEEE, 2023, pp. 329–337.
    Abstract
    Federated Learning (FL) is becoming popular in different industrial sectors where data access is critical for security, privacy and the economic value of data itself. Unlike traditional machine learning, where all the data must be globally gathered for analysis, FL makes it possible to extract knowledge from data distributed across different organizations that can be coupled with different Machine Learning paradigms. In this work, we replicate, using Federated Learning, the analysis of a pooled dataset (with AdaBoost) that has been used to define the PRAISE score, which is today among the most accurate scores to evaluate the risk of a second acute myocardial infarction. We show that thanks to the extended-OpenFL framework, which implements AdaBoost.F, we can train a federated PRAISE model that exhibits comparable accuracy and recall as the centralised model. We achieved F1 and F2 scores which are consistently comparable to the PRAISE score study of a 16- parties federation but within an order of magnitude less time.
    BibTeX
    @inproceedings{2023:pdp:arfat,
      title        = {Pooling critical datasets with Federated Learning},
      author       = {Yasir Arfat and
                      Gianluca Mittone and
                      Iacopo Colonnelli and
                      Fabrizio D'Ascenzo and
                      Roberto Esposito and
                      Marco Aldinucci},
      booktitle    = {31st Euromicro International Conference on Parallel, Distributed and Network-Based Processing, {PDP} 2023},
      year         = {2023},
      pages        = {329--337},
      doi          = {10.1109/PDP59025.2023.00057},
      publisher    = {IEEE},
      address      = {Napoli, Italy},
    }
    
    DOI PDF
  • [B3]
    S. G. Contaldo, L. Alessandri, I. Colonnelli, M. Beccuti, and M. Aldinucci, “Bringing cell subpopulation discovery on a cloud-HPC using rCASC and StreamFlow,” in Single Cell Transcriptomics: Methods and Protocols, New York, NY: Springer US, 2023, pp. 337–345.
    Abstract
    The idea behind novel single-cell RNA sequencing (scRNA-seq) pipelines is to isolate single cells through microfluidic approaches and generate sequencing libraries in which the transcripts are tagged to track their cell of origin. Modern scRNA-seq platforms are capable of analyzing up to many thousands of cells in each run. Then, combined with massive high-throughput sequencing producing billions of reads, scRNA-seq allows the assessment of fundamental biological properties of cell populations and biological systems at unprecedented resolution.
    BibTeX
    @inbook{2023:book:contaldo,
      title        = {Bringing Cell Subpopulation Discovery on a Cloud-{HPC} Using {rCASC} and {StreamFlow}},
      author       = {Contaldo, Sandro Gepiro and
                      Alessandri, Luca and
                      Colonnelli, Iacopo and
                      Beccuti, Marco and
                      Aldinucci, Marco},
      booktitle    = {Single Cell Transcriptomics: Methods and Protocols},
      year         = {2023},
      pages        = {337--345},
      doi          = {10.1007/978-1-0716-2756-3_17},
      isbn         = {978-1-0716-2756-3},
      publisher    = {Springer {US}},
      address      = {New York, NY},
    }
    
    DOI PDF
  • [P2]
    I. Colonnelli, R. Birke, and M. Aldinucci, “Experimenting with PyTorch on RISC-V,” in RISC-V Summit Europe 2023, Barcelona, Spain, Jun. 2023.
    Abstract
    RISC-V is an emerging instruction set architecture. Its modular and extensible open-source royalty-free design is increasingly attracting interest from both research and industry. Nowadays, different RISC-V-based boards can be bought off the shelf. However, software availability is equivalently vital in guaranteeing the RISC-V ecosystem’s success. Here we contribute with the first publicly available port of PyTorch. PyTorch is one of the most popular Deep Learning libraries available today. As such, it is a crucial enabler in running state-of-the-art AI applications on RISC-V-based systems and a first step towards a fully democratic end-to-end codesign process.
    BibTeX
    @inproceedings{2023:risc-v:colonnelli,
      title        = {Experimenting with {PyTorch} on {RISC-V}},
      author       = {Iacopo Colonnelli and
                      Robert Birke and
                      Marco Aldinucci},
      booktitle    = {{RISC-V Summit Europe 2023}},
      year         = {2023},
      month        = jun,
      address      = {Barcelona, Spain},
    }
    
    PDF

2022 #

  • [J6]
    G. Agosta, M. Aldinucci, C. Alvarez, R. Ammendola, Y. Arfat, O. Beaumont, M. Bernaschi, A. Biagioni, T. Boccali, B. Bramas, C. Brandolese, B. Cantalupo, M. Carrozzo, D. Cattaneo, A. Celestini, M. Celino, I. Colonnelli, P. Cretaro, P. D’Ambra, M. Danelutto, R. Esposito, L. Eyraud-Dubois, A. Filgueras, W. Fornaciari, O. Frezza, A. Galimberti, F. Giacomini, B. Goglin, D. Gregori, A. Guermouche, F. Iannone, M. Kulczewski, F. Lo Cicero, A. Lonardo, A. R. Martinelli, M. Martinelli, X. Martorell, G. Massari, S. Montangero, G. Mittone, R. Namyst, A. Oleksiak, P. Palazzari, P. S. Paolucci, F. Reghenzani, C. Rossi, S. Saponara, F. Simula, F. Terraneo, S. Thibault, M. Torquati, M. Turisini, P. Vicini, M. Vidal, D. Zoni, and G. Zummo, “Towards EXtreme scale technologies and accelerators for euROhpc hw/sw supercomputing applications for exascale: The TEXTAROSSA approach,” Microprocessors and Microsystems, vol. 95, p. 104679, 2022.
    Abstract
    In the near future, Exascale systems will need to bridge three technology gaps to achieve high performance while remaining under tight power constraints: energy efficiency and thermal control; extreme computation efficiency via HW acceleration and new arithmetic; methods and tools for seamless integration of reconfigurable accelerators in heterogeneous HPC multi-node platforms. TEXTAROSSA addresses these gaps through a co-design approach to heterogeneous HPC solutions, supported by the integration and extension of HW and SW IPs, programming models, and tools derived from European research.
    BibTeX
    @article{2022:micpro:agosta,
      title        = {Towards EXtreme scale technologies and accelerators for euROhpc hw/Sw supercomputing applications for exascale: The {TEXTAROSSA} approach},
      author       = {Giovanni Agosta and
                      Marco Aldinucci and
                      Carlos Alvarez and
                      Roberto Ammendola and
                      Yasir Arfat and
                      Olivier Beaumont and
                      Massimo Bernaschi and
                      Andrea Biagioni and
                      Tommaso Boccali and
                      Berenger Bramas and
                      Carlo Brandolese and
                      Barbara Cantalupo and
                      Mauro Carrozzo and
                      Daniele Cattaneo and
                      Alessandro Celestini and
                      Massimo Celino and
                      Iacopo Colonnelli and
                      Paolo Cretaro and
                      Pasqua D'Ambra and
                      Marco Danelutto and
                      Roberto Esposito and
                      Lionel Eyraud-Dubois and
                      Antonio Filgueras and
                      William Fornaciari and
                      Ottorino Frezza and
                      Andrea Galimberti and
                      Francesco Giacomini and
                      Brice Goglin and
                      Daniele Gregori and
                      Abdou Guermouche and
                      Francesco Iannone and
                      Michal Kulczewski and
                      Francesca {Lo Cicero} and
                      Alessandro Lonardo and
                      Alberto R. Martinelli and
                      Michele Martinelli and
                      Xavier Martorell and
                      Giuseppe Massari and
                      Simone Montangero and
                      Gianluca Mittone and
                      Raymond Namyst and
                      Ariel Oleksiak and
                      Paolo Palazzari and
                      Pier Stanislao Paolucci and
                      Federico Reghenzani and
                      Cristian Rossi and
                      Sergio Saponara and
                      Francesco Simula and
                      Federico Terraneo and
                      Samuel Thibault and
                      Massimo Torquati and
                      Matteo Turisini and
                      Piero Vicini and
                      Miquel Vidal and
                      Davide Zoni and
                      Giuseppe Zummo},
      journal      = {Microprocessors and Microsystems},
      year         = {2022},
      volume       = {95},
      pages        = {104679},
      doi          = {10.1016/j.micpro.2022.104679},
      issn         = {0141-9331},
    }
    
    DOI PDF
  • [J5]
    I. Colonnelli, M. Aldinucci, B. Cantalupo, L. Padovani, S. Rabellino, C. Spampinato, R. Morelli, R. Di Carlo, N. Magini, and C. Cavazzoni, “Distributed workflows with Jupyter,” Future Generation Computer Systems, vol. 128, pp. 282–298, 2022.
    FGCS Fall 2022 Editors’ Choice
    Abstract
    The designers of a new coordination interface enacting complex workflows have to tackle a dichotomy: choosing a language-independent or language-dependent approach. Language-independent approaches decouple workflow models from the host code’s business logic and advocate portability. Language-dependent approaches foster flexibility and performance by adopting the same host language for business and coordination code. Jupyter Notebooks, with their capability to describe both imperative and declarative code in a unique format, allow taking the best of the two approaches, maintaining a clear separation between application and coordination layers but still providing a unified interface to both aspects. We advocate the Jupyter Notebooks’ potential to express complex distributed workflows, identifying the general requirements for a Jupyter-based Workflow Management System (WMS) and introducing a proof-of-concept portable implementation working on hybrid Cloud-HPC infrastructures. As a byproduct, we extended the vanilla IPython kernel with workflow-based parallel and distributed execution capabilities. The proposed Jupyter-workflow (Jw) system is evaluated on common scenarios for High Performance Computing (HPC) and Cloud, showing its potential in lowering the barriers between prototypical Notebooks and production-ready implementations.
    BibTeX
    @article{2022:fgcs:colonnelli,
      title        = {Distributed workflows with {Jupyter}},
      author       = {Iacopo Colonnelli and
                      Marco Aldinucci and
                      Barbara Cantalupo and
                      Luca Padovani and
                      Sergio Rabellino and
                      Concetto Spampinato and
                      Roberto Morelli and
                      Rosario {Di Carlo} and
                      Nicol{\`o} Magini and
                      Carlo Cavazzoni},
      journal      = {Future Generation Computer Systems},
      year         = {2022},
      volume       = {128},
      pages        = {282--298},
      doi          = {10.1016/j.future.2021.10.007},
      issn         = {0167-739X},
    }
    
    DOI PDF
  • [C8]
    I. Colonnelli and M. Aldinucci, “Hybrid workflows for large - scale scientific applications,” in Sixth EAGE High Performance Computing Workshop, Milano, Italy: European Association of Geoscientists & Engineers, Sep. 2022, pp. 1–5.
    Abstract
    Large-scale scientific applications are facing an irrevrsible transition from monolithic, high-performance oriented codes to modular and polyglot deployments of specialised (micro-)services. The reasons behind this transition are many: coupling of standard solvers with Deep Learning techniques, offloading of data analysis and visualisation to Cloud, and the advent of specialised hardware accelerators. Topology-aware Workflow Management Systems (WMSs) play a crucial role. In particular, topology-awareness allows an explicit mapping of workflow steps onto heterogeneous locations, allowing automated executions on top of hybrid architectures (e.g., cloud+HPC or classical+quantum). Plus, topology-aware WMSs can offer nonfunctional requirements OOTB, e.g. components’ life-cycle orchestration, secure and efficient data transfers, fault tolerance, and cross-cluster execution of urgent workloads. Augmenting interactive Jupyter Notebooks with distributed workflow capabilities allows domain experts to prototype and scale applications using the same technological stack, while relying on a feature-rich and user-friendly web interface. This abstract will showcase how these general methodologies can be applied to a typical geoscience simulation pipeline based on the Full Wavefront Inversion (FWI) technique. In particular, a prototypical Jupyter Notebook will be executed interactively on Cloud. Preliminary data analyses and post-processing will be executed locally, while the computationally demanding optimisation loop will be scheduled on a remote HPC cluster.
    BibTeX
    @inproceedings{2022:eage-hpc:colonnelli,
      title        = {Hybrid Workflows For Large - Scale Scientific Applications},
      author       = {Iacopo Colonnelli and
                      Marco Aldinucci},
      booktitle    = {Sixth {EAGE} High Performance Computing Workshop},
      year         = {2022},
      pages        = {1--5},
      month        = sep,
      doi          = {10.3997/2214-4609.2022615029},
      issn         = {2214-4609},
      publisher    = {{European Association of Geoscientists \& Engineers }},
      address      = {Milano, Italy},
    }
    
    DOI PDF
  • [B2]
    M. Aldinucci, D. Atienza, F. Bolelli, M. Caballero, I. Colonnelli, J. Flich, J. A. Gómez, D. González, C. Grana, M. Grangetto, S. Leo, P. López, D. Oniga, R. Paredes, L. Pireddu, E. Quiñones, T. Silva, E. Tartaglione, and M. Zapater, “The DeepHealth toolkit: A key european free and open-source software for deep learning and computer vision ready to exploit heterogeneous HPC and Cloud architectures,” in Technologies and Applications for Big Data Value, Cham: Springer International Publishing, 2022, pp. 183–202.
    Abstract
    At the present time, we are immersed in the convergence between Big Data, High-Performance Computing and Artificial Intelligence. Technological progress in these three areas has accelerated in recent years, forcing different players like software companies and stakeholders to move quickly. The European Union is dedicating a lot of resources to maintain its relevant position in this scenario, funding projects to implement large-scale pilot testbeds that combine the latest advances in Artificial Intelligence, High-Performance Computing, Cloud and Big Data technologies. The DeepHealth project is an example focused on the health sector whose main outcome is the DeepHealth toolkit, a European unified framework that offers deep learning and computer vision capabilities, completely adapted to exploit underlying heterogeneous High-Performance Computing, Big Data and cloud architectures, and ready to be integrated into any software platform to facilitate the development and deployment of new applications for specific problems in any sector. This toolkit is intended to be one of the European contributions to the field of AI. This chapter introduces the toolkit with its main components and complementary tools, providing a clear view to facilitate and encourage its adoption and wide use by the European community of developers of AI-based solutions and data scientists working in the healthcare sector and others.
    BibTeX
    @incollection{2022:book:aldinucci,
      title        = {The {DeepHealth} Toolkit: A Key European Free and Open-Source Software for Deep Learning and Computer Vision Ready to Exploit Heterogeneous {HPC} and {C}loud Architectures},
      author       = {Marco Aldinucci and
                      David Atienza and
                      Federico Bolelli and
                      M\'{o}nica Caballero and
                      Iacopo Colonnelli and
                      Jos\'{e} Flich and
                      Jon Ander G\'{o}mez and
                      David Gonz\'{a}lez and
                      Costantino Grana and
                      Marco Grangetto and
                      Simone Leo and
                      Pedro L\'{o}pez and
                      Dana Oniga and
                      Roberto Paredes and
                      Luca Pireddu and
                      Eduardo Qui\~{n}ones and
                      Tatiana Silva and
                      Enzo Tartaglione and
                      Marina Zapater},
      booktitle    = {Technologies and Applications for Big Data Value},
      year         = {2022},
      pages        = {183--202},
      doi          = {10.1007/978-3-030-78307-5_9},
      isbn         = {978-3-030-78307-5},
      publisher    = {Springer International Publishing},
      address      = {Cham},
      chapter      = {9},
    }
    
    DOI PDF
  • [B1]
    E. Quiñones, J. Perales, J. Ejarque, A. Badouh, S. Marco, F. Auzanneau, F. Galea, D. González, J. R. Hervás, T. Silva, I. Colonnelli, B. Cantalupo, M. Aldinucci, E. Tartaglione, R. Tornero, J. Flich, J. M. Martinez, D. Rodriguez, I. Catalán, J. Garcia, and C. Hernández, “The DeepHealth HPC infrastructure: Leveraging heterogenous HPC and cloud computing infrastructures for IA-based medical solutions,” in HPC, Big Data, and AI Convergence Towards Exascale: Challenge and Vision, Boca Raton, Florida: CRC Press, 2022, pp. 191–216.
    Abstract
    This chapter presents the DeepHealth HPC toolkit for an efficient execution of deep learning (DL) medical application into HPC and cloud-computing infrastructures, featuring many-core, GPU, and FPGA acceleration devices. The toolkit offers to the European Computer Vision Library and the European Distributed Deep Learning Library (EDDL), developed in the DeepHealth project as well, the mechanisms to distribute and parallelize DL operations on HPC and cloud infrastructures in a fully transparent way. The toolkit implements workflow managers used to orchestrate HPC workloads for an efficient parallelization of EDDL training operations on HPC and cloud infrastructures, and includes the parallel programming models for an efficient execution EDDL inference and training operations on many-core, GPUs and FPGAs acceleration devices.
    BibTeX
    @incollection{2022:book:quinones,
      title        = {The {DeepHealth} {HPC} Infrastructure: Leveraging Heterogenous {HPC} and Cloud Computing Infrastructures for {IA}-based Medical Solutions},
      author       = {Eduardo Qui\~{n}ones and
                      Jesus Perales and
                      Jorge Ejarque and
                      Asaf Badouh and
                      Santiago Marco and
                      Fabrice Auzanneau and
                      Fran\c{c}ois Galea and
                      David Gonz\'{a}lez and
                      Jos\'{e} Ram\'{o}n Herv\'{a}s and
                      Tatiana Silva and
                      Iacopo Colonnelli and
                      Barbara Cantalupo and
                      Marco Aldinucci and
                      Enzo Tartaglione and
                      Rafael Tornero and
                      Jos\'{e} Flich and
                      Jose Maria Martinez and
                      David Rodriguez and
                      Izan Catal\'{a}n and
                      Jorge Garcia and
                      Carles Hern\'{a}ndez},
      booktitle    = {{HPC}, Big Data, and {AI} Convergence Towards Exascale: Challenge and Vision},
      year         = {2022},
      pages        = {191--216},
      doi          = {10.1201/9781003176664},
      isbn         = {978-1-0320-0984-1},
      publisher    = {{CRC} Press},
      address      = {Boca Raton, Florida},
      chapter      = {10},
    }
    
    DOI PDF
  • [T2]
    I. Colonnelli, “Workflow models for heterogeneous distributed systems,” Ph.D. dissertation, University of Turin, Turin, Italy, May 2022.
    ITADATA 2023 Best PhD Thesis
    Abstract
    This article introduces a novel hybrid workflow abstraction that injects topology awareness directly into the definition of a distributed workflow model. In particular, the article briefly discusses the advantages brought by this approach to the design and orchestration of large-scale data-oriented workflows, the current level of support from state-of-the-art workflow systems, and some future research directions.
    BibTeX
    @phdthesis{2022:thesis:colonnelli,
      title        = {Workflow models for heterogeneous distributed systems},
      author       = {Colonnelli, Iacopo},
      school       = {University of Turin},
      year         = {2022},
      month        = may,
      doi          = {10.5281/zenodo.7135483},
      publisher    = {Zenodo},
      address      = {Turin, Italy},
    }
    
    DOI

2021 #

  • [J4]
    O. D. Filippo, J. Kang, F. Bruno, J. Han, A. Saglietto, H. Yang, G. Patti, K. Park, R. Parma, H. Kim, L. D. Luca, H. Gwon, M. Iannaccone, W. J. Chun, G. Smolka, S. Hur, E. Cerrato, S. H. Han, C. Mario, Y. B. Song, J. Escaned, K. H. Choi, G. Helft, J. Doh, A. T. Giachet, S. Hong, S. Muscoli, C. Nam, G. Gallone, D. Capodanno, D. Trabattoni, Y. Imori, V. Dusi, B. Cortese, A. Montefusco, F. Conrotto, I. Colonnelli, I. Sheiban, G. M. Ferrari, B. Koo, and F. D’Ascenzo, “Benefit of extended dual antiplatelet therapy duration in acute coronary syndrome patients treated with drug eluting stents for coronary bifurcation lesions (from the BIFURCAT registry),” The American Journal of Cardiology, 2021.
    Abstract
    Optimal dual antiplatelet therapy (DAPT) duration for patients undergoing percutaneous coronary intervention (PCI) for coronary bifurcations is an unmet issue. The BIFURCAT registry was obtained by merging two registries on coronary bifurcations. Three groups were compared in a two-by-two fashion: short-term DAPT (< 6 months), intermediate-term DAPT (6-12 months) and extended DAPT (>12 months). Major adverse cardiac events (MACE) (a composite of all-cause death, myocardial infarction (MI), target-lesion revascularization and stent thrombosis) were the primary endpoint. Single components of MACE were the secondary endpoints. Events were appraised according to the clinical presentation: chronic coronary syndrome (CCS) versus acute coronary syndrome (ACS). 5537 patients (3231 ACS, 2306 CCS) were included. After a median follow-up of 2.1 years (IQR 0.9-2.2), extended DAPT was associated with a lower incidence of MACE compared with intermediate-term DAPT (2.8% versus 3.4%, adjusted HR 0.23 [0.1-0.54], p <0.001), driven by a reduction of all-cause death in the ACS cohort. In the CCS cohort, an extended DAPT strategy was not associated with a reduced risk of MACE. In conclusion, among real-world patients receiving PCI for coronary bifurcation, an extended DAPT strategy was associated with a reduction of MACE in ACS but not in CCS patients.
    BibTeX
    @article{2021:ajc:de-filippo,
      title        = {Benefit of Extended Dual Antiplatelet Therapy Duration in Acute Coronary Syndrome Patients Treated with Drug Eluting Stents for Coronary Bifurcation Lesions (from the {BIFURCAT} Registry)},
      author       = {Ovidio De Filippo and
                      Jeehoon Kang and
                      Francesco Bruno and
                      Jung-Kyu Han and
                      Andrea Saglietto and
                      Han-Mo Yang and
                      Giuseppe Patti and
                      Kyung-Woo Park and
                      Radoslaw Parma and
                      Hyo-Soo Kim and
                      Leonardo De Luca and
                      Hyeon-Cheol Gwon and
                      Mario Iannaccone and
                      Woo Jung Chun and
                      Grzegorz Smolka and
                      Seung-Ho Hur and
                      Enrico Cerrato and
                      Seung Hwan Han and
                      Carlo di Mario and
                      Young Bin Song and
                      Javier Escaned and
                      Ki Hong Choi and
                      Gerard Helft and
                      Joon-Hyung Doh and
                      Alessandra Truffa Giachet and
                      Soon-Jun Hong and
                      Saverio Muscoli and
                      Chang-Wook Nam and
                      Guglielmo Gallone and
                      Davide Capodanno and
                      Daniela Trabattoni and
                      Yoichi Imori and
                      Veronica Dusi and
                      Bernardo Cortese and
                      Antonio Montefusco and
                      Federico Conrotto and
                      Iacopo Colonnelli and
                      Imad Sheiban and
                      Gaetano Maria de Ferrari and
                      Bon-Kwon Koo and
                      Fabrizio D'Ascenzo},
      journal      = {The American Journal of Cardiology},
      year         = {2021},
      doi          = {10.1016/j.amjcard.2021.07.005},
      issn         = {0002-9149},
    }
    
    DOI PDF
  • [J3]
    M. Aldinucci, V. Cesare, I. Colonnelli, A. R. Martinelli, G. Mittone, B. Cantalupo, C. Cavazzoni, and M. Drocco, “Practical parallelization of scientific applications with OpenMP, OpenACC and MPI,” Journal of Parallel and Distributed Computing, vol. 157, pp. 13–29, 2021.
    Abstract
    This work aims at distilling a systematic methodology to modernize existing sequential scientific codes with a little re-designing effort, turning an old codebase into modern code, i.e., parallel and robust code. We propose a semi-automatic methodology to parallelize scientific applications designed with a purely sequential programming mindset, possibly using global variables, aliasing, random number generators, and stateful functions. We demonstrate that the same methodology works for the parallelization in the shared memory model (via OpenMP), message passing model (via MPI), and General Purpose Computing on GPU model (via OpenACC). The method is demonstrated parallelizing four real-world sequential codes in the domain of physics and material science. The methodology itself has been distilled in collaboration with MSc students of the Parallel Computing course at the University of Torino, that applied it for the first time to the project works that they presented for the final exam of the course. Every year the course hosts some special lectures from industry representatives, who present how they use parallel computing and offer codes to be parallelizeda.
    BibTeX
    @article{2021:jpdc:aldinucci,
      title        = {Practical Parallelization of Scientific Applications with {OpenMP, OpenACC and MPI}},
      author       = {Aldinucci, Marco and
                      Cesare, Valentina and
                      Colonnelli, Iacopo and
                      Martinelli, Alberto Riccardo and
                      Mittone, Gianluca and
                      Cantalupo, Barbara and
                      Cavazzoni, Carlo and
                      Drocco, Maurizio},
      journal      = {Journal of Parallel and Distributed Computing},
      year         = {2021},
      volume       = {157},
      pages        = {13--29},
      doi          = {10.1016/j.jpdc.2021.05.017},
    }
    
    DOI PDF
  • [J2]
    F. D’Ascenzo, O. De Filippo, G. Gallone, G. Mittone, M. A. Deriu, M. Iannaccone, A. Ariza-Solé, C. Liebetrau, S. Manzano-Fernández, G. Quadri, T. Kinnaird, G. Campo, J. P. Simao Henriques, J. M. Hughes, A. Dominguez-Rodriguez, M. Aldinucci, U. Morbiducci, G. Patti, S. Raposeiras-Roubin, E. Abu-Assi, G. M. De Ferrari, F. Piroli, A. Saglietto, F. Conrotto, P. Omedé, A. Montefusco, M. Pennone, F. Bruno, P. P. Bocchino, G. Boccuzzi, E. Cerrato, F. Varbella, M. Sperti, S. B. Wilton, L. Velicki, I. Xanthopoulou, A. Cequier, A. Iniguez-Romo, I. Munoz Pousa, M. Cespon Fernandez, B. Caneiro Queija, R. Cobas-Paz, A. Lopez-Cuenca, A. Garay, P. F. Blanco, A. Rognoni, G. Biondi Zoccai, S. Biscaglia, I. Nunez-Gil, T. Fujii, A. Durante, X. Song, T. Kawaji, D. Alexopoulos, Z. Huczek, J. R. Gonzalez Juanatey, S. Nie, M. Kawashiri, I. Colonnelli, B. Cantalupo, R. Esposito, S. Leonardi, W. Grosso Marra, A. Chieffo, U. Michelucci, D. Piga, M. Malavolta, S. Gili, M. Mennuni, C. Montalto, L. Oltrona Visconti, and Y. Arfat, “Machine learning-based prediction of adverse events following an acute coronary syndrome (PRAISE): A modelling study of pooled datasets,” The Lancet, vol. 397, pp. 199–207, 2021.
    Abstract
    Background The accuracy of current prediction tools for ischaemic and bleeding events after an acute coronary syndrome (ACS) remains insufficient for individualised patient management strategies. We developed a machine learning-based risk stratification model to predict all-cause death, recurrent acute myocardial infarction, and major bleeding after ACS. Methods Different machine learning models for the prediction of 1-year post-discharge all-cause death, myocardial infarction, and major bleeding (defined as Bleeding Academic Research Consortium type 3 or 5) were trained on a cohort of 19826 adult patients with ACS (split into a training cohort [80
    BibTeX
    @article{2021:lancet:dascenzo,
      title        = {Machine learning-based prediction of adverse events following an acute coronary syndrome {(PRAISE)}: a modelling study of pooled datasets},
      author       = {Fabrizio D'Ascenzo and
                      Ovidio {De Filippo} and
                      Guglielmo Gallone and
                      Gianluca Mittone and
                      Marco Agostino Deriu and
                      Mario Iannaccone and
                      Albert Ariza-Sol\'e and
                      Christoph Liebetrau and
                      Sergio Manzano-Fern\'andez and
                      Giorgio Quadri and
                      Tim Kinnaird and
                      Gianluca Campo and
                      Jose Paulo {Simao Henriques} and
                      James M Hughes and
                      Alberto Dominguez-Rodriguez and
                      Marco Aldinucci and
                      Umberto Morbiducci and
                      Giuseppe Patti and
                      Sergio Raposeiras-Roubin and
                      Emad Abu-Assi and
                      Gaetano Maria {De Ferrari} and
                      Francesco Piroli and
                      Andrea Saglietto and
                      Federico Conrotto and
                      Pierluigi Omed\'e and
                      Antonio Montefusco and
                      Mauro Pennone and
                      Francesco Bruno and
                      Pier Paolo Bocchino and
                      Giacomo Boccuzzi and
                      Enrico Cerrato and
                      Ferdinando Varbella and
                      Michela Sperti and
                      Stephen B. Wilton and
                      Lazar Velicki and
                      Ioanna Xanthopoulou and
                      Angel Cequier and
                      Andres Iniguez-Romo and
                      Isabel {Munoz Pousa} and
                      Maria {Cespon Fernandez} and
                      Berenice {Caneiro Queija} and
                      Rafael Cobas-Paz and
                      Angel Lopez-Cuenca and
                      Alberto Garay and
                      Pedro Flores Blanco and
                      Andrea Rognoni and
                      Giuseppe {Biondi Zoccai} and
                      Simone Biscaglia and
                      Ivan Nunez-Gil and
                      Toshiharu Fujii and
                      Alessandro Durante and
                      Xiantao Song and
                      Tetsuma Kawaji and
                      Dimitrios Alexopoulos and
                      Zenon Huczek and
                      Jose Ramon {Gonzalez Juanatey} and
                      Shao-Ping Nie and
                      Masa-aki Kawashiri and
                      Iacopo Colonnelli and
                      Barbara Cantalupo and
                      Roberto Esposito and
                      Sergio Leonardi and
                      Walter {Grosso Marra} and
                      Alaide Chieffo and
                      Umberto Michelucci and
                      Dario Piga and
                      Marta Malavolta and
                      Sebastiano Gili and
                      Marco Mennuni and
                      Claudio Montalto and
                      Luigi {Oltrona Visconti} and
                      Yasir Arfat},
      journal      = {The Lancet},
      year         = {2021},
      volume       = {397},
      number       = {10270},
      pages        = {199--207},
      doi          = {10.1016/S0140-6736(20)32519-8},
      issn         = {0140-6736},
    }
    
    DOI PDF
  • [J1]
    I. Colonnelli, B. Cantalupo, I. Merelli, and M. Aldinucci, “StreamFlow: Cross-breeding cloud with HPC,” IEEE Transactions on Emerging Topics in Computing, vol. 9, pp. 1723–1737, 2021.
    Abstract
    Workflows are among the most commonly used tools in a variety of execution environments. Many of them target a specific environment; few of them make it possible to execute an entire workflow in different environments, e.g. Kubernetes and batch clusters. We present a novel approach to workflow execution, called StreamFlow, that complements the workflow graph with the declarative description of potentially complex execution environments, and that makes it possible the execution onto multiple sites not sharing a common data space. StreamFlow is then exemplified on a novel bioinformatics pipeline for single cell transcriptomic data analysis workflow.
    BibTeX
    @article{2021:tetc:colonnelli,
      title        = {{StreamFlow}: cross-breeding cloud with {HPC}},
      author       = {Iacopo Colonnelli and
                      Barbara Cantalupo and
                      Ivan Merelli and
                      Marco Aldinucci},
      journal      = {{IEEE} {T}ransactions on {E}merging {T}opics in {C}omputing},
      year         = {2021},
      volume       = {9},
      number       = {4},
      pages        = {1723--1737},
      doi          = {10.1109/TETC.2020.3019202},
    }
    
    DOI PDF
  • [C7]
    G. Agosta, W. Fornaciari, A. Galimberti, G. Massari, F. Reghenzani, F. Terraneo, D. Zoni, C. Brandolese, M. Celino, F. Iannone, P. Palazzari, G. Zummo, M. Bernaschi, P. D’Ambra, S. Saponara, M. Danelutto, M. Torquati, M. Aldinucci, Y. Arfat, B. Cantalupo, I. Colonnelli, R. Esposito, A. R. Martinelli, G. Mittone, O. Beaumont, B. Bramas, L. Eyraud-Dubois, B. Goglin, A. Guermouche, R. Namyst, S. Thibault, A. Filgueras, M. Vidal, C. Alvarez, X. Martorell, A. Oleksiak, M. Kulczewski, A. Lonardo, P. Vicini, F. L. Cicero, F. Simula, A. Biagioni, P. Cretaro, O. Frezza, P. S. Paolucci, M. Turisini, F. Giacomini, T. Boccali, S. Montangero, and R. Ammendola, “TEXTAROSSA: Towards EXtreme scale technologies and accelerators for euROhpc hw/sw supercomputing applications for exascale,” in Proc. of the 24th Euromicro Conference on Digital System Design (DSD), Palermo, Italy: IEEE, Aug. 2021.
    Abstract
    To achieve high performance and high energy efficiency on near-future exascale computing systems, three key technology gaps needs to be bridged. These gaps include: energy efficiency and thermal control; extreme computation efficiency via HW acceleration and new arithmetics; methods and tools for seamless integration of reconfigurable accelerators in heterogeneous HPC multi-node platforms. TEXTAROSSA aims at tackling this gap through a co-design approach to heterogeneous HPC solutions, supported by the integration and extension of HW and SW IPs, programming models and tools derived from European research.
    BibTeX
    @inproceedings{2021:dsd:agosta,
      title        = {{TEXTAROSSA}: Towards EXtreme scale Technologies and Accelerators for euROhpc hw/Sw Supercomputing Applications for exascale},
      author       = {Giovanni Agosta and
                      William Fornaciari and
                      Andrea Galimberti and
                      Giuseppe Massari and
                      Federico Reghenzani and
                      Federico Terraneo and
                      Davide Zoni and
                      Carlo Brandolese and
                      Massimo Celino and
                      Francesco Iannone and
                      Paolo Palazzari and
                      Giuseppe Zummo and
                      Massimo Bernaschi and
                      Pasqua D'Ambra and
                      Sergio Saponara and
                      Marco Danelutto and
                      Massimo Torquati and
                      Marco Aldinucci and
                      Yasir Arfat and
                      Barbara Cantalupo and
                      Iacopo Colonnelli and
                      Roberto Esposito and
                      Alberto Riccardo Martinelli and
                      Gianluca Mittone and
                      Olivier Beaumont and
                      Berenger Bramas and
                      Lionel Eyraud-Dubois and
                      Brice Goglin and
                      Abdou Guermouche and
                      Raymond Namyst and
                      Samuel Thibault and
                      Antonio Filgueras and
                      Miquel Vidal and
                      Carlos Alvarez and
                      Xavier Martorell and
                      Ariel Oleksiak and
                      Michal Kulczewski and
                      Alessandro Lonardo and
                      Piero Vicini and
                      Francesco Lo Cicero and
                      Francesco Simula and
                      Andrea Biagioni and
                      Paolo Cretaro and
                      Ottorino Frezza and
                      Pier Stanislao Paolucci and
                      Matteo Turisini and
                      Francesco Giacomini and
                      Tommaso Boccali and
                      Simone Montangero and
                      Roberto Ammendola},
      booktitle    = {Proc. of the 24th Euromicro Conference on Digital System Design ({DSD})},
      year         = {2021},
      month        = aug,
      doi          = {10.1109/DSD53832.2021.00051},
      publisher    = {IEEE},
      address      = {Palermo, Italy},
    }
    
    DOI PDF
  • [C6]
    M. Aldinucci, V. Cesare, I. Colonnelli, A. R. Martinelli, G. Mittone, and B. Cantalupo, “Practical parallelizazion of a Laplace solver with MPI,” in ENEA CRESCO in the fight against COVID-19: ENEA, 2021, pp. 21–24.
    Abstract
    This work exposes a practical methodology for the semi-automatic parallelization of existing code. We show how a scientific sequential code can be parallelized through our approach. The obtained parallel code is only slightly different from the starting sequential one, providing an example of how little re-designing our methodology involves. The performance of the parallelized code, executed on the CRESCO6 cluster, is then exposed and discussed. We also believe in the educational value of this approach and suggest its use as a teaching device for students.
    BibTeX
    @inproceedings{2021:enea:aldinucci,
      title        = {Practical Parallelizazion of a {Laplace} Solver with {MPI}},
      author       = {Aldinucci, Marco and
                      Cesare, Valentina and
                      Colonnelli, Iacopo and
                      Martinelli, Alberto Riccardo and
                      Mittone, Gianluca and
                      Cantalupo, Barbara},
      booktitle    = {ENEA CRESCO in the fight against COVID-19},
      year         = {2021},
      pages        = {21--24},
      publisher    = {ENEA},
    }
    
  • [C5]
    I. Colonnelli, B. Cantalupo, C. Spampinato, M. Pennisi, and M. Aldinucci, “Bringing AI pipelines onto cloud-HPC: Setting a baseline for accuracy of COVID-19 diagnosis,” in ENEA CRESCO in the fight against COVID-19: ENEA, 2021.
    Abstract
    HPC is an enabling platform for AI. The introduction of AI workloads in the HPC applications basket has non-trivial consequences both on the way of designing AI applications and on the way of providing HPC computing. This is the leitmotif of the convergence between HPC and AI. The formalized definition of AI pipelines is one of the milestones of HPC-AI convergence. If well conducted, it allows, on the one hand, to obtain portable and scalable applications. On the other hand, it is crucial for the reproducibility of scientific pipelines. In this work, we advocate the StreamFlow Workflow Management System as a crucial ingredient to define a parametric pipeline, called “CLAIRE COVID-19 Universal Pipeline”, which is able to explore the optimization space of methods to classify COVID-19 lung lesions from CT scans, compare them for accuracy, and therefore set a performance baseline. The universal pipeline automatizes the training of many different Deep Neural Networks (DNNs) and many different hyperparameters. It, therefore, requires a massive computing power, which is found in traditional HPC infrastructure thanks to the portability-by-design of pipelines designed with StreamFlow. Using the universal pipeline, we identified a DNN reaching over 90% accuracy in detecting COVID-19 lesions in CT scans.
    BibTeX
    @inproceedings{2021:enea:colonnelli,
      title        = {Bringing AI pipelines onto cloud-{HPC}: setting a baseline for accuracy of {COVID-19} diagnosis},
      author       = {Colonnelli, Iacopo and
                      Cantalupo, Barbara and
                      Spampinato, Concetto and
                      Pennisi, Matteo and
                      Aldinucci, Marco},
      booktitle    = {ENEA CRESCO in the fight against COVID-19},
      year         = {2021},
      doi          = {10.5281/zenodo.5151511},
      publisher    = {ENEA},
    }
    
    DOI PDF
  • [C4]
    I. Colonnelli, B. Cantalupo, R. Esposito, M. Pennisi, C. Spampinato, and M. Aldinucci, “HPC Application Cloudification: The StreamFlow Toolkit,” in 12th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and 10th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms (PARMA-DITAM 2021), Dagstuhl, Germany: Schloss Dagstuhl – Leibniz-Zentrum für Informatik, 2021, pp. 5:1–5:13.
    Abstract
    Finding an effective way to improve accessibility to High-Performance Computing facilities, still anchored to SSH-based remote shells and queue-based job submission mechanisms, is an open problem in computer science. This work advocates a cloudification of HPC applications through a cluster-as-accelerator pattern, where computationally demanding portions of the main execution flow hosted on a Cloud Finding an effective way to improve accessibility to High-Performance Computing facilities, still anchored to SSH-based remote shells and queue-based job submission mechanisms, is an open problem in computer science. This work advocates a cloudification of HPC applications through a cluster-as-accelerator pattern, where computationally demanding portions of the main execution flow hosted on a Cloud infrastructure can be offloaded to HPC environments to speed them up. We introduce StreamFlow, a novel Workflow Management System that supports such a design pattern and makes it possible to run the steps of a standard workflow model on independent processing elements with no shared storage. We validated the proposed approach’s effectiveness on the CLAIRE COVID-19 universal pipeline, i.e. a reproducible workflow capable of automating the comparison of (possibly all) state-of-the-art pipelines for the diagnosis of COVID-19 interstitial pneumonia from CT scans images based on Deep Neural Networks (DNNs).
    BibTeX
    @inproceedings{2021:parma-ditam:colonnelli,
      title        = {{HPC Application Cloudification: The StreamFlow Toolkit}},
      author       = {Colonnelli, Iacopo and
                      Cantalupo, Barbara and
                      Esposito, Roberto and
                      Pennisi, Matteo and
                      Spampinato, Concetto and
                      Aldinucci, Marco},
      booktitle    = {12th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and 10th Workshop on Design Tools and Architectures for Multicore Embedded Computing Platforms (PARMA-DITAM 2021)},
      year         = {2021},
      volume       = {88},
      pages        = {5:1--5:13},
      doi          = {10.4230/OASIcs.PARMA-DITAM.2021.5},
      issn         = {2190-6807},
      isbn         = {978-3-95977-181-8},
      publisher    = {Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
      address      = {Dagstuhl, Germany},
      series       = {Open Access Series in Informatics (OASIcs)},
    }
    
    DOI PDF

2020 #

  • [C3]
    V. Cesare, I. Colonnelli, and M. Aldinucci, “Practical parallelization of scientific applications,” in Proc. of 28th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), Västerås, Sweden: IEEE, 2020, pp. 376–384.
    Abstract
    This work aims at distilling a systematic methodology to modernize existing sequential scientific codes with a limited re-designing effort, turning an old codebase into modern code, i.e., parallel and robust code. We propose an automatable methodology to parallelize scientific applications designed with a purely sequential programming mindset, thus possibly using global variables, aliasing, random number generators, and stateful functions. We demonstrate the methodology by way of an astrophysical application, where we model at the same time the kinematic profiles of 30 disk galaxies with a Monte Carlo Markov Chain (MCMC), which is sequential by definition. The parallel code exhibits a 12 times speedup on a 48-core platform.
    BibTeX
    @inproceedings{2020:pdp:cesare,
      title        = {Practical Parallelization of Scientific Applications},
      author       = {Valentina Cesare and
                      Iacopo Colonnelli and
                      Marco Aldinucci},
      booktitle    = {Proc. of 28th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP)},
      year         = {2020},
      pages        = {376--384},
      doi          = {10.1109/PDP50117.2020.00064},
      publisher    = {IEEE},
      address      = {V{\"a}ster{\aa}s, Sweden},
    }
    
    DOI PDF

2019 #

  • [C2]
    P. Viviani, M. Drocco, D. Baccega, I. Colonnelli, and M. Aldinucci, “Deep learning at scale,” in Proc. of 27th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), Pavia, Italy: IEEE, 2019, pp. 124–131.
    Abstract
    This work presents a novel approach to distributed training of deep neural networks (DNNs) that aims to overcome the issues related to mainstream approaches to data parallel training. Established techniques for data parallel training are discussed from both a parallel computing and deep learning perspective, then a different approach is presented that is meant to allow DNN training to scale while retaining good convergence properties. Moreover, an experimental implementation is presented as well as some preliminary results.
    BibTeX
    @inproceedings{2019:pdp:viviani,
      title        = {Deep Learning at Scale},
      author       = {Paolo Viviani and
                      Maurizio Drocco and
                      Daniele Baccega and
                      Iacopo Colonnelli and
                      Marco Aldinucci},
      booktitle    = {Proc. of 27th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP)},
      year         = {2019},
      pages        = {124--131},
      doi          = {10.1109/EMPDP.2019.8671552},
      publisher    = {IEEE},
      address      = {Pavia, Italy},
    }
    
    DOI PDF
  • [C1]
    M. Drocco, P. Viviani, I. Colonnelli, M. Aldinucci, and M. Grangetto, “Accelerating spectral graph analysis through wavefronts of linear algebra operations,” in Proc. of 27th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP), Pavia, Italy: IEEE, 2019, pp. 9–16.
    Abstract
    The wavefront pattern captures the unfolding of a parallel computation in which data elements are laid out as a logical multidimensional grid and the dependency graph favours a diagonal sweep across the grid. In the emerging area of spectral graph analysis, the computing often consists in a wavefront running over a tiled matrix, involving expensive linear algebra kernels. While these applications might benefit from parallel heterogeneous platforms (multi-core with GPUs),programming wavefront applications directly with high-performance linear algebra libraries yields code that is complex to write and optimize for the specific application. We advocate a methodology based on two abstractions (linear algebra and parallel pattern-based run-time), that allows to develop portable, self-configuring, and easy-to-profile code on hybrid platforms.
    BibTeX
    @inproceedings{2019:pdp:drocco,
      title        = {Accelerating spectral graph analysis through wavefronts of linear algebra operations},
      author       = {Maurizio Drocco and
                      Paolo Viviani and
                      Iacopo Colonnelli and
                      Marco Aldinucci and
                      Marco Grangetto},
      booktitle    = {Proc. of 27th Euromicro Intl. Conference on Parallel Distributed and network-based Processing (PDP)},
      year         = {2019},
      pages        = {9--16},
      doi          = {10.1109/EMPDP.2019.8671640},
      publisher    = {IEEE},
      address      = {Pavia, Italy},
      date-modified = {2024-08-18 14:45:22 +0200},
    }
    
    DOI PDF
  • [P1]
    I. Colonnelli and M. Aldinucci, “A model-based approach to scientific workflows,” in Advanced Computer Architecture and Compilation for High-performance Embedded Systems (ACACES 2019), Fiuggi, Italy, Jul. 2019.
    BibTeX
    @inproceedings{2019:acaces:colonnelli,
      title        = {A model-based approach to scientific workflows},
      author       = {Iacopo Colonnelli and
                      Marco Aldinucci},
      booktitle    = {Advanced Computer Architecture and Compilation for High-performance Embedded Systems (ACACES 2019)},
      year         = {2019},
      month        = jul,
      address      = {Fiuggi, Italy},
    }
    

2017 #

  • [T1]
    I. Colonnelli, “Design of an high-performance tracking algorithm optimised for the inner tracking system of the ALICE experiment,” M.S. thesis, Polytechnic University of Turin, Turin, Italy, 2017.
    BibTeX
    @mastersthesis{2017:thesis:colonnelli,
      title        = {Design of an high-performance tracking algorithm optimised for the Inner Tracking System of the {ALICE} experiment},
      author       = {Colonnelli, Iacopo},
      school       = {Polytechnic University of Turin},
      year         = {2017},
      address      = {Turin, Italy},
    }