OATAO - Open Archive Toulouse Archive Ouverte Open Access Week

Spatio-temporal thermal-aware scheduling for homogeneous high-performance computing datacenters

Sun, Hongyang and Stolf, Patricia and Pierson, Jean-Marc Spatio-temporal thermal-aware scheduling for homogeneous high-performance computing datacenters. (2017) Future Generation Computer Systems, 71. 157-170. ISSN 0167-739X

(Document in English)

PDF (Author's version) - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader

Official URL: https://doi.org/10.1016/j.future.2017.02.005


Datacenters have become an important part of today's computing infrastructure. Recent studies have shown the increasing importance of thermal considerations to achieve effective resource management. In this paper, we study thermal-aware scheduling for homogeneous high-performance computing (HPC) datacenters under a thermal model that captures both spatial and temporal correlations of the temperature evolution. We propose an online scheduling heuristic to minimize the makespan for a set of HPC applications subject to a thermal constraint. The heuristic leverages the novel notion of thermal-aware load to perform both job assignment and thermal management. To respect the temperature constraint, which is governed by a complex spatio-temporal thermal correlation, dynamic voltage and frequency scaling (DVFS) is used to regulate the job executions during runtime while dynamically balancing the loads of the servers to improve makespan. Extensive simulations are conducted based on an experimentally validated datacenter configuration and realistic parameter settings. The results show improved performance of the proposed heuristic compared to existing solutions in the literature, and demonstrate the importance of both spatial and temporal considerations. In contrast to some scheduling problems, where DVFS introduces performance-energy tradeoffs, our findings reveal the benefit of applying DVFS with both performance and energy gains in the context of spatio-temporal thermal-aware scheduling.

Item Type:Article
Additional Information:Thanks to Elsevier editor. This papers appears in volume 71, Future Generation Computer Systems ISSN 0167-739X The original PDF is available at: https://www.sciencedirect.com/science/article/pii/S0167739X17301966
HAL Id:hal-01740033
Audience (journal):International peer-reviewed journal
Uncontrolled Keywords:
Institution:French research institutions > Centre National de la Recherche Scientifique - CNRS (FRANCE)
Other partners > Ecole Normale Supérieure de Lyon - ENS de Lyon (FRANCE)
Université de Toulouse > Institut National Polytechnique de Toulouse - Toulouse INP (FRANCE)
French research institutions > Institut National de la Recherche en Informatique et en Automatique - INRIA (FRANCE)
Université de Toulouse > Université Toulouse III - Paul Sabatier - UT3 (FRANCE)
Université de Toulouse > Université Toulouse - Jean Jaurès - UT2J (FRANCE)
Université de Toulouse > Université Toulouse 1 Capitole - UT1 (FRANCE)
Laboratory name:
European Commission under contract 288701 - LABEX MILYON (ANR-10-LABX-0070) of Université de Lyon - French National Research Agency (ANR)
Deposited On:16 Mar 2018 14:09

Repository Staff Only: item control page