论文标题
MPI+OpenMP的任务可扩展性用于人脑的多形式模拟
MPI+OpenMP Tasking Scalability for Multi-Morphology Simulations of the Human Brain
论文作者
论文摘要
人脑的行为模拟是当今最雄心勃勃的挑战之一,没有重要的应用。我们可以在美国,欧洲和日本找到许多不同的举措,试图实现这一挑战性目标。在这项工作中,我们专注于最重要的欧洲倡议(人脑项目),以及该项目中开发的一种模型。该工具通过计算神经元形态上的电压来模拟神经网络中触发的尖峰,这是当今最精确的模拟器之一。在目前的工作中,我们已经评估了在此框架之上的MPI+OpenMP任务的使用。我们证明,即使计算每个节点相对较低的工作负载(神经元数),这种方法也能够实现良好的缩放。我们的目标之一包括实现高度可扩展的实现,还包括开发具有高度抽象的工具,而不会使用\ emph {MPI+OpenMP}任务而失去控制和性能。这项工作的主要动机是评估对多形式神经网络的尖端模拟。在其中完全不同的大量神经元的模拟是一个重要的挑战。实际上,在多形式模拟中,我们发现节点之间的重要不平衡,这主要是由于神经元的差异,这会导致可用资源的重要利用。在这项工作中,作者提出并评估了处理此问题的机制,并大大减少了这种模拟的时间。
The simulation of the behavior of the human brain is one of the most ambitious challenges today with a non-end of important applications. We can find many different initiatives in the USA, Europe and Japan which attempt to achieve such a challenging target. In this work, we focus on the most important European initiative (the Human Brain Project) and on one of the models developed in this project. This tool simulates the spikes triggered in a neural network by computing the voltage capacitance on the neurons' morphology, being one of the most precise simulators today. In the present work, we have evaluated the use of MPI+OpenMP tasking on top of this framework. We prove that this approach is able to achieve a good scaling even when computing a relatively low workload (number of neurons) per node. One of our targets consists of achieving not only a highly scalable implementation, but also to develop a tool with a high degree of abstraction without losing control and performance by using \emph{MPI+OpenMP} tasking. The main motivation of this work is the evaluation of this cutting-edge simulation on multi-morphology neural networks. The simulation of a high number of neurons, which are completely different among them, is an important challenge. In fact, in the multi-morphology simulations, we find an important unbalancing between the nodes, mainly due to the differences in the neurons, which causes an important under-utilization of the available resources. In this work, the authors present and evaluate mechanisms to deal with this and reduce the time of this kind of simulations considerably.
