用户名: 密码: 验证码:
Design and Preliminary Evaluation of Omni OpenACC Compiler for Massive MIMD Processor PEZY-SC
详细信息    查看全文
  • 关键词:PEZY ; SC ; OpenACC ; Compiler
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2016
  • 出版时间:2016
  • 年:2016
  • 卷:9903
  • 期:1
  • 页码:293-305
  • 全文大小:1,134 KB
  • 参考文献:1.The green500. http://​www.​green500.​org
    2.Khronos Group. OpenCL - The open standard for parallel programming of heterogeneous systems. https://​www.​khronos.​org/​opencl/​
    3.OpenACC-Standard.org. OpenACC Home. http://​www.​openacc.​org
    4.Tabuchi, A., Nakao, M., Sato, M.: A source-to-source OpenACC compiler for CUDA. In: an Mey, D., et al. (eds.) Euro-Par 2013. LNCS, vol. 8374, pp. 178–187. Springer, Heidelberg (2014)CrossRef
    5.NASA Advanced Supercomputing Division. NAS Parallel Benchmarks. http://​www.​nas.​nasa.​gov/​publications/​npb.​html
    6.Reyes, R., López-Rodríguez, I., Fumero, J.J., de Sande, F.: accULL : an OpenACC implementation with CUDA and OpenCL support. In: Kaklamanis, C., Papatheodorou, T., Spirakis, P.G. (eds.) Euro-Par 2012. LNCS, vol. 7484, pp. 871–882. Springer, Heidelberg (2012)CrossRef
    7.Tian, X., Xu, R., Yan, Y., Yun, Z., Chandrasekaran, S., Chapman, B.: Compiling a high-level directive-based programming model for GPGPUs. In: Caṣcaval, C., Montesinos-Ortego, P. (eds.) LCPC 2013. LNCS, vol. 8664, pp. 105–120. Springer, Heidelberg (2014)
    8.Lee, S., Vetter, J.S.: Openarc: open accelerator research compiler for directive-based, efficient heterogeneous computing. In: Proceedings of the 23rd International Symposium on High-performance Parallel and Distributed Computing, HPDC 2014, New York, NY, USA, pp. 115–120. ACM (2014)
    9.University of Delaware and LLNL. RoseACC. http://​roseacc.​org/​
    10.GCC. OpenACC - GCC Wiki. https://​gcc.​gnu.​org/​wiki/​OpenACC
    11.RIKEN AICS and University of Tsukuba. Omni Compiler Project. http://​omni-compiler.​org
    12.Warren, M.S., Salmon, J.K., Becker, D.J., Goda, M.P., Sterling, T., Winckelmans, W.: Pentium pro inside: I. a treecode at 430 gigaflops on asci red, ii. price/performance of $50/mflop on loki and hyglac. In: ACM/IEEE 1997 Conference on Supercomputing, p. 61, November 1997
  • 作者单位:Akihiro Tabuchi (16)
    Yasuyuki Kimura (17)
    Sunao Torii (17)
    Hideo Matsufuru (18)
    Tadashi Ishikawa (18)
    Taisuke Boku (16) (19)
    Mitsuhisa Sato (16) (20)

    16. Graduate School of Systems and Information Engineering, University of Tsukuba, Tsukuba, Japan
    17. ExaScaler Inc., Tokyo, Japan
    18. Computing Research Center, High Energy Accelerator Research Organization (KEK), Tsukuba, Japan
    19. Center for Computational Sciences, University of Tsukuba, Tsukuba, Japan
    20. RIKEN Advanced Institute for Computational Science, Kobe, Japan
  • 丛书名:OpenMP: Memory, Devices, and Tasks
  • ISBN:978-3-319-45550-1
  • 刊物类别:Computer Science
  • 刊物主题:Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1611-3349
  • 卷排序:9903
文摘
PEZY-SC is a novel massive Multiple Instruction Multiple Data (MIMD) processor used as an accelerator and characterized by high power efficiency. OpenACC is a standard directive-based programming model for accelerators, and programmers can concisely offload data and computation to the accelerators. In this paper, we present the design and preliminary implementation of an OpenACC compiler for a PEZY-SC. Our compiler translates C code with OpenACC directives to the corresponding PZCL code, which is the programming environment for PEZY-SC. The evaluation shows that the performance of the OpenACC version achieves over 98 % at N-body and up to 88 % at NAS Parallel Benchmarks CG than that of the PZCL version. In addition, we examined optimization techniques such as kernel merging and explicit context switching to exploit the PEZY-SC MIMD architecture, which differs from the single instruction multiple data graphics processing units. We found these optimizations useful in improving the performance and will be implemented in the future release.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700