Parallel implementation of the direct simulation monte carlo method for shared memory architectures

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Scopus citations

Abstract

Parallel implementation of a three-dimensional direct simulation Monte Carlo (DSMC) code is detailed that employs shared memory system using Open Multi-Processing (OpenMP). Several techniques to optimize the serial implementation of the DSMC method are also discussed. The synchronizations in OpenMP, as well as the related critical sections, have been identified as major factors that impact the OpenMP parallel performance. Methods to remove such barriers in the OpenMP implementation of the DSMC method are presented. For dual-core and quad-core systems, speedups of 1.99 and 3.74, respectively, are obtained for the OpenMP implementation. It is also reported that memory fetching and data communication within the same node but across sockets needs further improvement in order to achieve acceptable scalability for clusters of multi-socket, shared-memory architectures.

Original languageEnglish (US)
Title of host publication48th AIAA Aerospace Sciences Meeting Including the New Horizons Forum and Aerospace Exposition
StatePublished - Dec 13 2010
Event48th AIAA Aerospace Sciences Meeting Including the New Horizons Forum and Aerospace Exposition - Orlando, FL, United States
Duration: Jan 4 2010Jan 7 2010

Publication series

Name48th AIAA Aerospace Sciences Meeting Including the New Horizons Forum and Aerospace Exposition

Other

Other48th AIAA Aerospace Sciences Meeting Including the New Horizons Forum and Aerospace Exposition
CountryUnited States
CityOrlando, FL
Period1/4/101/7/10

Fingerprint Dive into the research topics of 'Parallel implementation of the direct simulation monte carlo method for shared memory architectures'. Together they form a unique fingerprint.

Cite this