Skip to content

Latest commit

 

History

History

Example1

Profiling mpi4py (MPI for Python)

The below instructions are intended to help you get started with TAU on the FAS cluster for profiling mpy4py applications.

Quick Start

TAU is a sophisticated, full-featured toolkit. Here we discuss only a subset of TAU's features at a very "basic level". Users will need to consult the official TAU documentation to learn more.

Profiling

The easiest and fastest way to profile a mpi4py application is to use the tau_python command. It automatically instruments your executable at run time, and requires no modifications to Python source code.

  • Setup up your TAU environment on the FAS cluster
$ module load gcc/9.3.0-fasrc01 openmpi/4.0.2-fasrc01 tau/2.29-fasrc01
  • Set up your conda environment and install mpi4py through pip. NOTE: mpi4py needs to be compiled with the same flavor / version of MPI used to build TAU, e.g.,
$ conda create -n python3_tau python numpy pip wheel
$ source activate python_tau
(python_tau) $ pip install mpi4py
  • Run the mpy4py application. In this example we use the code mpi4py_test.py from here.

Example batch-job submission script:

#!/bin/bash
#SBATCH -J mpi4py_test
#SBATCH -o mpi4py_test.out
#SBATCH -e mpi4py_test.err
#SBATCH -p test
#SBATCH -n 4
#SBATCH -t 30
#SBATCH --mem-per-cpu=4000

# Set up environment
module load gcc/9.3.0-fasrc01 openmpi/4.0.2-fasrc01 tau/2.29-fasrc01
source activate python_tau

# Run program
srun -n 4 --mpi=pmi2 tau_python mpi4py_test.py

Submit the batch-job submission script:

sbatch run.sbatch
  • Following completion of your job, you will have a set of files named profile.#.* where # denotes the MPI rank. Viewing these files is discussed in the Output section below.

Output

TAU profiling output consists of a set of files named profile.X.Y.Z where:
X = MPI rank number
Y = context
Z = thread number

pprof
To get a quick, text based summary of your job's profile data, the TAU pprof utility can be used. By default, it will process all of the profile.* files in the current directory and produce a report showing profile data for each rank/context/thread. An example for one MPI rank is shown below:

$ pprof
Reading Profile files in profile.*

NODE 0;CONTEXT 0;THREAD 0:
---------------------------------------------------------------------------------------
%Time    Exclusive    Inclusive       #Call      #Subrs  Inclusive Name
              msec   total msec                          usec/call
---------------------------------------------------------------------------------------
100.0        3,006        4,361           1           6    4361739 .TAU application
 37.7            2        1,645           5         178     329147 
 29.7        0.483        1,294           5           5     258810 exec
 28.3        0.756        1,236          10          60     123609 _find_and_load
 28.3        0.477        1,234          10          30     123452 _find_and_load_unlocked
 26.7        0.699        1,166           7          42     166640 _load_unlocked
 25.9        0.245        1,131          13          13      87039 _call_with_frames_removed
 25.9         0.16        1,131          17          23      66553 _handle_fromlist
 25.9        0.031        1,131           4           3     282819 __import__
 25.4        0.265        1,107           7          14     158151 exec_module
 16.5            2          720           3          10     240298 exec_dynamic
 13.6        0.438          592           7          25      84595 module_from_spec
 13.5        0.041          590           7           6      84360 create_module
 13.5          590          590           3           1     196787 create_dynamic
  8.6          375          375           1           0     375869 MPI_Init_thread()
  4.4            1          193          60         474       3230 find_spec
  3.5        0.738          154           4          32      38684 get_code
  3.5          120          151           4           4      37960 get_data
  2.5          110          110          57           0       1938 stat
  2.3        0.644           98           8          72      12305 _find_spec
  2.2        0.851           98          15         173       6538 _get_spec
  1.7        0.085           74          51          51       1451 _path_stat
  1.7        0.117           73          10          10       7353 _path_isfile
  1.7        0.039           73          11          11       6675 _path_is_mode_type
  1.4        0.875           60           1          21      60705 find_module
  1.1        0.114           46           1           2      46765 Finalize
  1.1           46           46           1           0      46650 MPI_Finalize()
  0.8        0.228           36           6           7       6133 isfile
  0.7           31           31           5           1       6232 read
  0.5           21           21           2           1      10820 open
  0.4        0.123           18           1           3      18649 _fill_cache
  0.4           18           18           1           0      18522 listdir
  0.2        0.113            9           4           4       2426 Barrier
  0.2            9            9           4           0       2398 MPI_Barrier()
  0.1        0.678            6         178         456         35 
  0.1        0.625            5         104         133         53 _compile
  0.1        0.285            5         110         207         48 match
  0.1         0.32            4           1           7       4095 compile
  0.0        0.415            2           1           4       2072 parse
  0.0        0.206            1           1           3       1689 _code
  0.0          0.2            1           1           5       1538 _parse_sub
  0.0        0.236            1           4          16        380 _compile_bytecode
  0.0            1            1           1          53       1331 _parse
  0.0        0.556            1         173         346          8 _path_join
  0.0            1            1           4           0        315 loads
  0.0            1            1           8           8        150 __build_class__
  0.0        0.494            1           7          66        170 _init_module_attrs
  0.0        0.289            1          41          44         26 __enter__
  0.0        0.527        0.912           1          13        912 _compile_info
  0.0         0.28        0.882          43           2         21 _path_importer_cache
  0.0        0.546        0.689          49          26         14 __init__
  0.0        0.131        0.595           1           1        595 _path_hooks
  0.0        0.134        0.594          11           7         54 cached
  0.0        0.415        0.593          10          30         59 _get_module_lock
  0.0        0.222        0.578           1           6        578 detect_encoding
  0.0        0.297        0.533          41          48         13 __exit__
  0.0        0.266        0.523           3          29        174 _optimize_charset
  0.0        0.441        0.477           7          28         68 join
  0.0        0.204        0.464           1           2        464 path_hook_for_FileFinder
  0.0        0.138         0.46           7          14         66 _get_cached
  0.0        0.174        0.387           8          40         48 cache_from_source
  0.0        0.199        0.325           4           8         81 _classify_pyc
  0.0        0.314         0.32           1           3        320 get_suffixes
  0.0        0.281        0.281           2           0        140 compile
  0.0        0.116         0.24           1           3        240 __and__
  0.0        0.214        0.234           1           3        234 _mk_bitmap
  0.0        0.208        0.223           2           4        112 find_cookie
  0.0        0.163        0.186           7          14         27 spec_from_file_location
  0.0        0.166        0.175          10          10         18 acquire
  0.0        0.173        0.173         354           0          0 rstrip
  0.0        0.164        0.164         181           0          1 _verbose_message
  0.0        0.159        0.159         105           0          2 match
  0.0        0.121        0.144           7          28         21 any
  0.0        0.127        0.141          12          12         12 _r_long
  0.0        0.138        0.138         176           0          1 isinstance
  0.0        0.126        0.137           8          16         17 _path_split
  0.0        0.135        0.135         181           0          1 join
  0.0        0.125         0.13           2           2         65 _uniq
  0.0        0.104        0.127           2           2         64 read_or_stop
  0.0        0.124        0.124         164           0          1 
  0.0        0.118        0.123           2           5         62 getwidth
  0.0        0.117        0.122           2           2         61 __call__
  0.0        0.117        0.122           5           8         24 extend
  0.0        0.116         0.12           4           4         30 _check_name_wrapper
  0.0        0.112        0.113           1           1        113 Get_size
  0.0        0.106        0.109           1           1        109 Get_processor_name
  0.0        0.104        0.108           1           1        108 decode
  0.0        0.104        0.105           1           1        105 _get_literal_prefix
  0.0        0.101        0.102           1           1        102 Get_rank
  0.0        0.094        0.094          69           0          1 hasattr
  0.0        0.075        0.075          42           0          2 getattr
  0.0        0.013         0.07           4           4         18 path_stats
  0.0        0.059        0.067          10          10          7 release
  0.0        0.066        0.066           1           0         66 _Unpickler
  0.0         0.04        0.066          10          30          7 cb
  0.0        0.056        0.056           1           0         56 dir
  0.0        0.055        0.055          69           0          1 rpartition
  0.0        0.019        0.041          13          13          3 get
  0.0         0.02         0.04           4           8         10 _validate_timestamp_pyc
  0.0        0.036        0.036          44           0          1 acquire_lock
  0.0        0.033        0.036          51           2          1 len
  0.0        0.035        0.035          44           0          1 release_lock
  0.0        0.031        0.031          17           0          2 __next
  0.0        0.031        0.031          42           0          1 append
  0.0        0.029        0.029           2           0         14 print
  0.0         0.02        0.028          10          11          3 __getitem__
  0.0        0.028        0.028          36           0          1 _relax_case
  0.0        0.023        0.027           7           6          4 parent
  0.0        0.004        0.024           1           1         24 _path_isdir
  0.0        0.023        0.023           1           0         23 _Pickler
  0.0        0.023        0.023           2           0         12 readline
  0.0        0.022        0.022           8           0          3 is_builtin
  0.0        0.013        0.021           3          11          7 _compile_charset
  0.0        0.019        0.019           4           0          5 _fix_co_filename
  0.0        0.019        0.019          21           0          1 get
  0.0        0.019        0.019           4           0          5 update
  0.0        0.012        0.018           7           7          3 _get_sep
  0.0        0.017        0.017          20           0          1 allocate_lock
  0.0        0.017        0.017          20           0          1 get_ident
  0.0        0.016        0.016          22           0          1 fspath
  0.0        0.015        0.015          17           0          1 endswith
  0.0        0.014        0.014          12           0          1 from_bytes
  0.0        0.014        0.014          10           0          1 startswith
  0.0        0.013        0.013          11           0          1 find
  0.0        0.012        0.012           4           0          3 _new_module
  0.0        0.012        0.012           9           0          1 is_frozen
  0.0        0.004        0.011           1           2         11 _simple
  0.0        0.009        0.011           4           4          3 tell
  0.0        0.006        0.009           3           3          3 append
  0.0        0.005        0.008           4           4          2 __len__
  0.0        0.007        0.007           3           0          2 MPI_Finalized()
  0.0        0.007        0.007           2           0          4 format
  0.0        0.007        0.007           1           0          7 getcwd
  0.0        0.007        0.007           7           0          1 has_location
  0.0        0.005        0.007           2           2          4 isstring
  0.0        0.007        0.007           5           0          1 min
  0.0        0.005        0.006           1           1          6 _get_charset_prefix
  0.0        0.006        0.006           7           0          1 ord
  0.0        0.005        0.005           1           0          5 _Framer
  0.0        0.005        0.005           2           0          2 __new__
  0.0        0.004        0.005           1           1          5 fix_flags
  0.0        0.005        0.005           2           0          2 fromkeys
  0.0        0.004        0.005           2           2          2 groups
  0.0        0.005        0.005           1           0          5 new_module
  0.0        0.004        0.004           4           0          1 get_filename
  0.0        0.004        0.004           3           0          1 items
  0.0        0.004        0.004           1           0          4 utf_8_decode
  0.0        0.003        0.003           2           0          2 MPI_Comm_set_errhandler()
  0.0        0.003        0.003           1           0          3 MPI_Get_processor_name()
  0.0        0.003        0.003           1           0          3 _Unframer
  0.0        0.003        0.003           2           0          2 decode
  0.0        0.003        0.003           1           0          3 translate
  0.0        0.002        0.002           1           0          2 PickleError
  0.0        0.002        0.002           1           0          2 PicklingError
  0.0        0.002        0.002           1           0          2 __setitem__
  0.0        0.002        0.002           2           0          1 _get_iscased
  0.0        0.002        0.002           1           0          2 setattr
  0.0        0.001        0.001           1           0          1 MPI_Comm_rank()
  0.0        0.001        0.001           1           0          1 MPI_Comm_size()
  0.0        0.001        0.001           1           0          1 S_ISREG
  0.0        0.001        0.001           1           0          1 UnpicklingError
  0.0        0.001        0.001           1           0          1 _Stop

NODE 1;CONTEXT 0;THREAD 0:
---------------------------------------------------------------------------------------
%Time    Exclusive    Inclusive       #Call      #Subrs  Inclusive Name
              msec   total msec                          usec/call
---------------------------------------------------------------------------------------
100.0        3,004        4,358           1           6    4358039 .TAU application
 37.7            2        1,643           5         177     328699 
 29.6        0.492        1,291           5           5     258346 exec
 28.4        0.762        1,235          10          60     123574 _find_and_load
 28.3        0.499        1,234          10          30     123414 _find_and_load_unlocked
 27.0        0.727        1,174           7          42     167845 _load_unlocked
 25.8        0.255        1,122          13          13      86384 _call_with_frames_removed
 25.8        0.149        1,122          17          23      66052 _handle_fromlist
 25.8        0.021        1,122           4           3     280690 __import__
 25.6        0.255        1,115           7          14     159424 exec_module
 16.5            2          720           3          10     240298 exec_dynamic
 13.6        0.452          591           7          25      84560 module_from_spec
 13.5        0.033          590           7           6      84318 create_module
 13.5          590          590           3           1     196686 create_dynamic
  8.6          375          375           1           0     375865 MPI_Init_thread()
  4.0            1          175          60         474       2926 find_spec
  3.7         0.76          163           4          32      40807 get_code
  3.7          129          160           4           4      40097 get_data
  2.5          109          109          57           0       1921 stat
  2.1        0.669           89           8          72      11194 _find_spec
  2.0        0.851           89          15         173       5942 _get_spec
  1.7        0.094           73          51          51       1434 _path_stat
  1.7        0.123           72          10          10       7262 _path_isfile
  1.7        0.036           72          11          11       6593 _path_is_mode_type
  1.4        0.907           61           1          21      61020 find_module
  1.0        0.113           44           1           2      44816 Finalize
  1.0           44           44           1           0      44702 MPI_Finalize()
  0.8        0.234           36           6           7       6116 isfile
  0.7           30           31           5           1       6215 read
  0.5           21           21           2           1      10982 open
  0.2         0.11           10           1           3      10140 _fill_cache
  0.2           10           10           1           0      10027 listdir
  0.2        0.121            9           4           4       2423 Barrier
  0.2            9            9           4           0       2392 MPI_Barrier()
  0.1         0.66            6         178         456         35 
  0.1        0.634            5         104         133         54 _compile
  0.1        0.282            5         110         207         50 match
  0.1        0.328            4           1           7       4229 compile
  0.0        0.431            2           1           4       2134 parse
  0.0        0.218            1           1           3       1754 _code
  0.0        0.217            1           1           5       1581 _parse_sub
  0.0        0.251            1           4          16        378 _compile_bytecode
  0.0            1            1           1          53       1358 _parse
  0.0        0.577            1         173         346          8 _path_join
  0.0            1            1           4           0        308 loads
  0.0            1            1           8           8        153 __build_class__
  0.0        0.518            1           7          66        175 _init_module_attrs
  0.0        0.293            1          41          44         27 __enter__
  0.0        0.547         0.95           1          13        950 _compile_info
  0.0        0.285        0.901          43           2         21 _path_importer_cache
  0.0        0.557        0.704          49          26         14 __init__
  0.0        0.417        0.614          10          30         61 _get_module_lock
  0.0        0.129        0.609           1           1        609 _path_hooks
  0.0         0.13        0.597          11           7         54 cached
  0.0        0.229        0.584           1           6        584 detect_encoding
  0.0        0.307        0.561          41          48         14 __exit__
  0.0        0.279        0.547           3          29        182 _optimize_charset
  0.0        0.492        0.525           7          28         75 join
  0.0        0.208         0.48           1           2        480 path_hook_for_FileFinder
  0.0        0.139        0.467           7          14         67 _get_cached
  0.0         0.45         0.45          69           0          7 hasattr
  0.0        0.167        0.386           8          40         48 cache_from_source
  0.0        0.323        0.327           1           3        327 get_suffixes
  0.0        0.284        0.284           2           0        142 compile
  0.0        0.131        0.259           4           8         65 _classify_pyc
  0.0        0.124        0.247           1           3        247 __and__
  0.0        0.223        0.245           1           3        245 _mk_bitmap
  0.0        0.205         0.22           2           4        110 find_cookie
  0.0        0.188        0.188         354           0          1 rstrip
  0.0        0.161        0.188           7          14         27 spec_from_file_location
  0.0        0.174        0.184          10          10         18 acquire
  0.0        0.167        0.167         181           0          1 _verbose_message
  0.0        0.162        0.162         105           0          2 match
  0.0        0.132        0.151           7          28         22 any
  0.0        0.144        0.144         176           0          1 isinstance
  0.0        0.127        0.143           8          16         18 _path_split
  0.0        0.128        0.143          12          12         12 _r_long
  0.0         0.14         0.14         181           0          1 join
  0.0        0.109         0.13           2           2         65 read_or_stop
  0.0        0.121        0.126           2           2         63 _uniq
  0.0        0.115        0.124           5           8         25 extend
  0.0        0.115        0.121           2           5         60 getwidth
  0.0        0.113         0.12           2           2         60 __call__
  0.0        0.117         0.12           4           4         30 _check_name_wrapper
  0.0        0.118        0.118         164           0          1 
  0.0        0.113        0.116           1           1        116 Get_processor_name
  0.0        0.112        0.113           1           1        113 Get_size
  0.0         0.11        0.111           1           1        111 Get_rank
  0.0        0.104        0.108           1           1        108 decode
  0.0        0.104        0.105           1           1        105 _get_literal_prefix
  0.0        0.086        0.086          42           0          2 getattr
  0.0        0.067        0.074          10          10          7 release
  0.0        0.047        0.068          10          30          7 cb
  0.0        0.011        0.068           4           4         17 path_stats
  0.0        0.062        0.062           1           0         62 _Unpickler
  0.0        0.052        0.052          69           0          1 rpartition
  0.0        0.047        0.047           1           0         47 dir
  0.0        0.044        0.044          44           0          1 release_lock
  0.0        0.023        0.043          13          13          3 get
  0.0         0.02         0.04           4           8         10 _validate_timestamp_pyc
  0.0        0.037         0.04          51           2          1 len
  0.0        0.036        0.036          42           0          1 append
  0.0        0.004        0.033           1           1         33 _path_isdir
  0.0        0.033        0.033          44           0          1 acquire_lock
  0.0        0.015         0.03           3          11         10 _compile_charset
  0.0        0.024        0.028           7           6          4 parent
  0.0        0.019        0.027          10          11          3 __getitem__
  0.0        0.027        0.027          17           0          2 __next
  0.0        0.023        0.023          36           0          1 _relax_case
  0.0        0.022        0.022           8           0          3 is_builtin
  0.0        0.021        0.021           1           0         21 _Pickler
  0.0        0.021        0.021          17           0          1 endswith
  0.0        0.021        0.021          22           0          1 fspath
  0.0        0.021        0.021           2           0         10 readline
  0.0        0.019        0.019          20           0          1 allocate_lock
  0.0        0.019        0.019           4           0          5 update
  0.0        0.018        0.018           4           0          4 _fix_co_filename
  0.0        0.018        0.018          21           0          1 get
  0.0        0.017        0.017          20           0          1 get_ident
  0.0        0.015        0.015          12           0          1 from_bytes
  0.0        0.008        0.014           7           7          2 _get_sep
  0.0        0.014        0.014           4           0          4 _new_module
  0.0        0.013        0.013          10           0          1 startswith
  0.0        0.012        0.012          11           0          1 find
  0.0        0.012        0.012           9           0          1 is_frozen
  0.0        0.012        0.012           1           0         12 print
  0.0        0.004        0.011           1           2         11 _simple
  0.0        0.008        0.011           4           4          3 tell
  0.0        0.005        0.009           4           4          2 __len__
  0.0        0.006        0.008           3           3          3 append
  0.0        0.008        0.008           2           0          4 format
  0.0        0.007        0.007           3           0          2 MPI_Finalized()
  0.0        0.007        0.007           2           0          4 __new__
  0.0        0.007        0.007           1           0          7 getcwd
  0.0        0.007        0.007           5           0          1 min
  0.0        0.005        0.006           1           1          6 _get_charset_prefix
  0.0        0.004        0.006           2           2          3 isstring
  0.0        0.005        0.005           1           0          5 _Framer
  0.0        0.005        0.005           1           1          5 fix_flags
  0.0        0.005        0.005           2           0          2 fromkeys
  0.0        0.005        0.005           7           0          1 has_location
  0.0        0.005        0.005           1           0          5 new_module
  0.0        0.005        0.005           7           0          1 ord
  0.0        0.004        0.004           1           0          4 _Unframer
  0.0        0.002        0.004           2           2          2 groups
  0.0        0.004        0.004           1           0          4 translate
  0.0        0.004        0.004           1           0          4 utf_8_decode
  0.0        0.003        0.003           2           0          2 MPI_Comm_set_errhandler()
  0.0        0.003        0.003           1           0          3 MPI_Get_processor_name()
  0.0        0.003        0.003           2           0          2 decode
  0.0        0.003        0.003           4           0          1 get_filename
  0.0        0.003        0.003           1           0          3 setattr
  0.0        0.002        0.002           1           0          2 PickleError
  0.0        0.002        0.002           1           0          2 S_ISREG
  0.0        0.002        0.002           1           0          2 _Stop
  0.0        0.002        0.002           1           0          2 __setitem__
  0.0        0.002        0.002           2           0          1 _get_iscased
  0.0        0.002        0.002           3           0          1 items
  0.0        0.001        0.001           1           0          1 MPI_Comm_rank()
  0.0        0.001        0.001           1           0          1 MPI_Comm_size()
  0.0        0.001        0.001           1           0          1 PicklingError
  0.0        0.001        0.001           1           0          1 UnpicklingError

NODE 2;CONTEXT 0;THREAD 0:
---------------------------------------------------------------------------------------
%Time    Exclusive    Inclusive       #Call      #Subrs  Inclusive Name
              msec   total msec                          usec/call
---------------------------------------------------------------------------------------
100.0        3,012        4,357           1           6    4357507 .TAU application
 37.7            2        1,644           5         177     328914 
 29.7        0.498        1,292           5           5     258552 exec
 28.4         0.76        1,235          10          60     123598 _find_and_load
 28.3        0.494        1,234          10          30     123441 _find_and_load_unlocked
 26.8        0.726        1,166           7          42     166628 _load_unlocked
 26.0        0.241        1,131          13          13      87041 _call_with_frames_removed
 26.0        0.152        1,131          17          23      66554 _handle_fromlist
 26.0         0.02        1,131           4           3     282824 __import__
 25.3        0.248        1,102           7          14     157438 exec_module
 16.5            2          720           3          10     240296 exec_dynamic
 13.4        0.437          583           7          25      83362 module_from_spec
 13.4        0.041          581           7           6      83124 create_module
 13.3          581          581           3           1     193896 create_dynamic
  8.7          379          379           1           0     379953 MPI_Init_thread()
  4.5            1          195          60         474       3259 find_spec
  3.6        0.712          158           4          32      39614 get_code
  3.6          133          155           4           4      38922 get_data
  2.4          102          102          57           0       1799 stat
  2.3        0.652           99           8          72      12418 _find_spec
  2.3        0.845           98          15         173       6596 _get_spec
  1.7        0.084           74          51          51       1469 _path_stat
  1.7        0.121           74          10          10       7444 _path_isfile
  1.7        0.035           74          11          11       6759 _path_is_mode_type
  1.2         0.87           51           1          21      51943 find_module
  1.0        0.116           45           1           2      45592 Finalize
  1.0           45           45           1           0      45475 MPI_Finalize()
  0.6         0.23           27           6           7       4662 isfile
  0.5           22           22           5           1       4441 read
  0.5           21           21           2           1      10868 open
  0.4        0.123           18           1           3      18667 _fill_cache
  0.4           18           18           1           0      18541 listdir
  0.2        0.112            9           4           4       2431 Barrier
  0.2            9            9           4           0       2403 MPI_Barrier()
  0.1        0.669            6         178         456         34 
  0.1        0.597            5         104         133         51 _compile
  0.1        0.277            5         110         207         47 match
  0.1        0.316            4           1           7       4009 compile
  0.0        0.399            2           1           4       2032 parse
  0.0        0.195            1           1           3       1647 _code
  0.0        0.214            1           1           5       1516 _parse_sub
  0.0        0.232            1           4          16        374 _compile_bytecode
  0.0        0.582            1         173         346          8 _path_join
  0.0            1            1           1          53       1295 _parse
  0.0            1            1           4           0        310 loads
  0.0        0.491            1           7          66        173 _init_module_attrs
  0.0            1            1           8           8        146 __build_class__
  0.0        0.272            1          41          44         26 __enter__
  0.0        0.513        0.889           1          13        889 _compile_info
  0.0         0.28        0.887          43           2         21 _path_importer_cache
  0.0        0.555        0.702          49          26         14 __init__
  0.0        0.128        0.616          11           7         56 cached
  0.0        0.132        0.599           1           1        599 _path_hooks
  0.0        0.409        0.596          10          30         60 _get_module_lock
  0.0        0.219        0.574           1           6        574 detect_encoding
  0.0        0.293        0.532          41          48         13 __exit__
  0.0        0.265        0.519           3          29        173 _optimize_charset
  0.0        0.136        0.488           7          14         70 _get_cached
  0.0        0.205        0.467           1           2        467 path_hook_for_FileFinder
  0.0        0.428        0.459           7          28         66 join
  0.0        0.168        0.416           8          40         52 cache_from_source
  0.0        0.311        0.316           1           3        316 get_suffixes
  0.0        0.283        0.283           2           0        142 compile
  0.0        0.135        0.259           4           8         65 _classify_pyc
  0.0        0.213        0.234           1           3        234 _mk_bitmap
  0.0        0.108        0.224           1           3        224 __and__
  0.0        0.203        0.219           2           4        110 find_cookie
  0.0        0.157        0.185           7          14         26 spec_from_file_location
  0.0        0.176        0.183          10          10         18 acquire
  0.0        0.181        0.181         354           0          1 rstrip
  0.0        0.169        0.169         181           0          1 _verbose_message
  0.0        0.156        0.156         105           0          1 match
  0.0        0.124        0.145           7          28         21 any
  0.0        0.143        0.143         176           0          1 isinstance
  0.0        0.122        0.138           8          16         17 _path_split
  0.0        0.125        0.138          12          12         12 _r_long
  0.0        0.109        0.131           2           2         66 read_or_stop
  0.0         0.13         0.13         181           0          1 join
  0.0        0.118        0.127           5           8         25 extend
  0.0        0.117        0.123           2           5         62 getwidth
  0.0        0.112        0.117           2           2         58 _uniq
  0.0        0.113        0.116           4           4         29 _check_name_wrapper
  0.0        0.108        0.113           2           2         56 __call__
  0.0        0.109        0.112           1           1        112 Get_processor_name
  0.0         0.11        0.111           1           1        111 Get_size
  0.0        0.106         0.11           1           1        110 decode
  0.0        0.108        0.108         164           0          1 
  0.0        0.104        0.105           1           1        105 Get_rank
  0.0        0.098        0.099           1           1         99 _get_literal_prefix
  0.0        0.098        0.098          69           0          1 hasattr
  0.0        0.083        0.083          42           0          2 getattr
  0.0        0.063        0.072          10          10          7 release
  0.0        0.043        0.066          10          30          7 cb
  0.0        0.011        0.065           4           4         16 path_stats
  0.0        0.059        0.059           1           0         59 _Unpickler
  0.0        0.053        0.053          69           0          1 rpartition
  0.0        0.047        0.047           1           0         47 dir
  0.0        0.021        0.043          13          13          3 get
  0.0        0.039        0.042          51           2          1 len
  0.0         0.02        0.038           4           8         10 _validate_timestamp_pyc
  0.0        0.033        0.033          44           0          1 acquire_lock
  0.0        0.019         0.03          10          11          3 __getitem__
  0.0         0.03         0.03          17           0          2 __next
  0.0         0.03         0.03          44           0          1 release_lock
  0.0        0.029        0.029           8           0          4 is_builtin
  0.0         0.02        0.028           7           6          4 parent
  0.0        0.026        0.026          36           0          1 _relax_case
  0.0        0.026        0.026          42           0          1 append
  0.0        0.003        0.024           1           1         24 _path_isdir
  0.0        0.013        0.022           3          11          7 _compile_charset
  0.0        0.022        0.022           2           0         11 readline
  0.0        0.021        0.021           1           0         21 _Pickler
  0.0         0.02         0.02          21           0          1 get
  0.0        0.019        0.019           4           0          5 _fix_co_filename
  0.0        0.019        0.019          22           0          1 fspath
  0.0        0.018        0.018          20           0          1 allocate_lock
  0.0        0.017        0.017          17           0          1 endswith
  0.0        0.017        0.017           4           0          4 update
  0.0        0.016        0.016          20           0          1 get_ident
  0.0        0.009        0.015           7           7          2 _get_sep
  0.0        0.015        0.015           9           0          2 is_frozen
  0.0        0.013        0.013          12           0          1 from_bytes
  0.0        0.012        0.012           4           0          3 _new_module
  0.0        0.012        0.012           1           0         12 print
  0.0        0.003        0.011           1           2         11 _simple
  0.0        0.011        0.011          11           0          1 find
  0.0         0.01         0.01          10           0          1 startswith
  0.0        0.006         0.01           4           4          2 tell
  0.0        0.006        0.009           4           4          2 __len__
  0.0        0.007        0.008           3           3          3 append
  0.0        0.008        0.008           2           0          4 format
  0.0        0.008        0.008           1           0          8 getcwd
  0.0        0.005        0.007           2           2          4 isstring
  0.0        0.007        0.007           5           0          1 min
  0.0        0.006        0.006           3           0          2 MPI_Finalized()
  0.0        0.005        0.006           1           1          6 _get_charset_prefix
  0.0        0.005        0.005           1           0          5 _Framer
  0.0        0.005        0.005           2           0          2 __new__
  0.0        0.004        0.005           1           1          5 fix_flags
  0.0        0.005        0.005           2           0          2 fromkeys
  0.0        0.005        0.005           2           2          2 groups
  0.0        0.005        0.005           1           0          5 new_module
  0.0        0.005        0.005           7           0          1 ord
  0.0        0.004        0.004           1           0          4 _Unframer
  0.0        0.004        0.004           7           0          1 has_location
  0.0        0.004        0.004           1           0          4 utf_8_decode
  0.0        0.003        0.003           1           0          3 MPI_Get_processor_name()
  0.0        0.003        0.003           2           0          2 decode
  0.0        0.003        0.003           4           0          1 get_filename
  0.0        0.003        0.003           3           0          1 items
  0.0        0.003        0.003           1           0          3 translate
  0.0        0.002        0.002           2           0          1 MPI_Comm_set_errhandler()
  0.0        0.002        0.002           1           0          2 PickleError
  0.0        0.002        0.002           1           0          2 _Stop
  0.0        0.002        0.002           1           0          2 __setitem__
  0.0        0.002        0.002           2           0          1 _get_iscased
  0.0        0.002        0.002           1           0          2 setattr
  0.0        0.001        0.001           1           0          1 MPI_Comm_rank()
  0.0        0.001        0.001           1           0          1 MPI_Comm_size()
  0.0        0.001        0.001           1           0          1 PicklingError
  0.0        0.001        0.001           1           0          1 S_ISREG
  0.0        0.001        0.001           1           0          1 UnpicklingError

NODE 3;CONTEXT 0;THREAD 0:
---------------------------------------------------------------------------------------
%Time    Exclusive    Inclusive       #Call      #Subrs  Inclusive Name
              msec   total msec                          usec/call
---------------------------------------------------------------------------------------
100.0        3,005        4,358           1           6    4358487 .TAU application
 37.5            2        1,632           5         177     326532 
 29.6         0.51        1,291           5           5     258266 exec
 28.4        0.765        1,235          10          60     123584 _find_and_load
 28.3        0.507        1,234          10          30     123422 _find_and_load_unlocked
 26.8        0.742        1,166           7          42     166636 _load_unlocked
 26.0        0.255        1,131          13          13      87034 _call_with_frames_removed
 26.0         0.15        1,131          17          23      66549 _handle_fromlist
 26.0         0.02        1,131           4           3     282801 __import__
 25.4        0.261        1,107           7          14     158178 exec_module
 16.5            2          720           3          10     240296 exec_dynamic
 13.3        0.458          581           7          25      83063 module_from_spec
 13.3        0.041          579           7           6      82821 create_module
 13.3          579          579           3           1     193189 create_dynamic
  8.6          375          375           1           0     375886 MPI_Init_thread()
  4.5            1          194          60         474       3235 find_spec
  3.8        0.736          164           4          32      41155 get_code
  3.7          131          161           4           4      40444 get_data
  2.5          110          110          57           0       1930 stat
  2.3        0.667           98           8          72      12332 _find_spec
  2.3        0.867           98          15         173       6551 _get_spec
  1.7         0.09           73          51          51       1443 _path_stat
  1.7        0.146           73          10          10       7316 _path_isfile
  1.7        0.034           73          11          11       6639 _path_is_mode_type
  1.4        0.914           60           1          21      60908 find_module
  1.0        0.114           44           1           2      44158 Finalize
  1.0           44           44           1           0      44043 MPI_Finalize()
  0.8        0.236           36           6           7       6120 isfile
  0.7           30           30           5           1       6158 read
  0.5           21           21           2           1      10922 open
  0.4        0.119           18           1           3      18560 _fill_cache
  0.4           18           18           1           0      18438 listdir
  0.2        0.114            9           4           4       2456 Barrier
  0.2            9            9           4           0       2428 MPI_Barrier()
  0.2            1            6         178         456         39 
  0.1        0.632            5         104         133         54 _compile
  0.1        0.275            5         110         207         50 match
  0.1        0.337            4           1           7       4252 compile
  0.0        0.438            2           1           4       2124 parse
  0.0        0.582            1         173         346         11 _path_join
  0.0        0.228            1           1           3       1778 _code
  0.0        0.219            1           1           5       1567 _parse_sub
  0.0        0.241            1           4          16        379 _compile_bytecode
  0.0            1            1           1          53       1341 _parse
  0.0            1            1           4           0        311 loads
  0.0            1            1           8           8        153 __build_class__
  0.0        0.509            1           7          66        174 _init_module_attrs
  0.0        0.294            1          41          44         27 __enter__
  0.0        0.561         0.96           1          13        960 _compile_info
  0.0        0.284        0.917          43           2         21 _path_importer_cache
  0.0        0.587        0.738          49          26         15 __init__
  0.0        0.137        0.626           1           1        626 _path_hooks
  0.0        0.417        0.607          10          30         61 _get_module_lock
  0.0        0.138        0.604          11           7         55 cached
  0.0         0.23        0.587           1           6        587 detect_encoding
  0.0        0.307        0.551          41          48         13 __exit__
  0.0        0.273        0.544           3          29        181 _optimize_charset
  0.0        0.476        0.506           7          28         72 join
  0.0        0.214        0.489           1           2        489 path_hook_for_FileFinder
  0.0        0.144        0.466           7          14         67 _get_cached
  0.0        0.169        0.386           8          40         48 cache_from_source
  0.0        0.316        0.321           1           3        321 get_suffixes
  0.0        0.286        0.286           2           0        143 compile
  0.0        0.136        0.264           4           8         66 _classify_pyc
  0.0        0.127        0.251           1           3        251 __and__
  0.0        0.226        0.247           1           3        247 _mk_bitmap
  0.0        0.202         0.22           2           4        110 find_cookie
  0.0         0.18        0.188          10          10         19 acquire
  0.0         0.16        0.186           7          14         27 spec_from_file_location
  0.0        0.183        0.183         354           0          1 rstrip
  0.0        0.177        0.177         105           0          2 match
  0.0        0.176        0.176         181           0          1 _verbose_message
  0.0        0.148        0.164          12          12         14 _r_long
  0.0        0.152        0.152         176           0          1 isinstance
  0.0        0.127        0.148           7          28         21 any
  0.0        0.128        0.143           8          16         18 _path_split
  0.0        0.131        0.131         181           0          1 join
  0.0        0.107        0.131           2           2         66 read_or_stop
  0.0        0.124         0.13           2           5         65 getwidth
  0.0        0.121        0.128           5           8         26 extend
  0.0        0.121        0.127           2           2         64 _uniq
  0.0        0.118        0.122           4           4         30 _check_name_wrapper
  0.0        0.116        0.121           2           2         60 __call__
  0.0        0.112        0.116           1           1        116 Get_processor_name
  0.0        0.114        0.115           1           1        115 Get_size
  0.0        0.112        0.113           1           1        113 Get_rank
  0.0        0.109         0.11           1           1        110 _get_literal_prefix
  0.0        0.105        0.105         164           0          1 
  0.0        0.099        0.103           1           1        103 decode
  0.0        0.101        0.101          69           0          1 hasattr
  0.0        0.079        0.079          42           0          2 getattr
  0.0        0.061         0.07          10          10          7 release
  0.0        0.044        0.065          10          30          6 cb
  0.0        0.011        0.064           4           4         16 path_stats
  0.0        0.062        0.062           1           0         62 _Unpickler
  0.0         0.02         0.06           4           8         15 _validate_timestamp_pyc
  0.0        0.047        0.047           1           0         47 dir
  0.0        0.046        0.046          69           0          1 rpartition
  0.0         0.04        0.043          51           2          1 len
  0.0         0.02        0.041          13          13          3 get
  0.0        0.041        0.041          44           0          1 release_lock
  0.0        0.019         0.03          10          11          3 __getitem__
  0.0        0.029        0.029          17           0          2 __next
  0.0        0.025        0.029           7           6          4 parent
  0.0        0.028        0.028          44           0          1 acquire_lock
  0.0        0.003        0.024           1           1         24 _path_isdir
  0.0        0.024        0.024           8           0          3 is_builtin
  0.0        0.024        0.024           2           0         12 readline
  0.0        0.023        0.023           1           0         23 _Pickler
  0.0        0.018        0.023           3          11          8 _compile_charset
  0.0        0.022        0.022           4           0          6 _fix_co_filename
  0.0         0.02         0.02          36           0          1 _relax_case
  0.0         0.02         0.02          20           0          1 allocate_lock
  0.0        0.018        0.018          42           0          0 append
  0.0        0.018        0.018          22           0          1 fspath
  0.0        0.017        0.017          17           0          1 endswith
  0.0        0.017        0.017          21           0          1 get
  0.0        0.017        0.017          20           0          1 get_ident
  0.0        0.017        0.017           4           0          4 update
  0.0        0.016        0.016           4           0          4 _new_module
  0.0        0.016        0.016          12           0          1 from_bytes
  0.0        0.009        0.014           7           7          2 _get_sep
  0.0        0.014        0.014           9           0          2 is_frozen
  0.0        0.013        0.013          11           0          1 find
  0.0        0.004        0.012           1           2         12 _simple
  0.0        0.012        0.012           1           0         12 print
  0.0        0.012        0.012          10           0          1 startswith
  0.0        0.007         0.01           4           4          2 tell
  0.0        0.008        0.009           3           3          3 append
  0.0        0.009        0.009           2           0          4 format
  0.0        0.006        0.008           4           4          2 __len__
  0.0        0.007        0.007           3           0          2 MPI_Finalized()
  0.0        0.007        0.007           1           0          7 getcwd
  0.0        0.007        0.007           5           0          1 min
  0.0        0.006        0.006           2           0          3 fromkeys
  0.0        0.004        0.006           2           2          3 isstring
  0.0        0.005        0.005           1           0          5 _Framer
  0.0        0.005        0.005           2           0          2 __new__
  0.0        0.004        0.005           1           1          5 _get_charset_prefix
  0.0        0.004        0.005           1           1          5 fix_flags
  0.0        0.005        0.005           7           0          1 has_location
  0.0        0.005        0.005           1           0          5 new_module
  0.0        0.004        0.004           1           0          4 MPI_Get_processor_name()
  0.0        0.004        0.004           1           0          4 _Unframer
  0.0        0.004        0.004           2           0          2 decode
  0.0        0.004        0.004           4           0          1 get_filename
  0.0        0.002        0.004           2           2          2 groups
  0.0        0.004        0.004           7           0          1 ord
  0.0        0.004        0.004           1           0          4 utf_8_decode
  0.0        0.003        0.003           3           0          1 items
  0.0        0.003        0.003           1           0          3 translate
  0.0        0.002        0.002           2           0          1 MPI_Comm_set_errhandler()
  0.0        0.002        0.002           1           0          2 PickleError
  0.0        0.002        0.002           1           0          2 _Stop
  0.0        0.002        0.002           1           0          2 __setitem__
  0.0        0.002        0.002           2           0          1 _get_iscased
  0.0        0.002        0.002           1           0          2 setattr
  0.0        0.001        0.001           1           0          1 MPI_Comm_rank()
  0.0        0.001        0.001           1           0          1 MPI_Comm_size()
  0.0        0.001        0.001           1           0          1 PicklingError
  0.0        0.001        0.001           1           0          1 S_ISREG
  0.0        0.001        0.001           1           0          1 UnpicklingError

FUNCTION SUMMARY (total):
---------------------------------------------------------------------------------------
%Time    Exclusive    Inclusive       #Call      #Subrs  Inclusive Name
              msec   total msec                          usec/call
---------------------------------------------------------------------------------------
100.0       12,029       17,435           4          24    4358943 .TAU application
 37.7            8        6,566          20         709     328323 
 29.7            1        5,169          20          20     258493 exec
 28.4            3        4,943          40         240     123591 _find_and_load
 28.3            1        4,937          40         120     123432 _find_and_load_unlocked
 26.8            2        4,674          28         168     166937 _load_unlocked
 25.9        0.996        4,517          52          52      86875 _call_with_frames_removed
 25.9        0.611        4,517          68          92      66427 _handle_fromlist
 25.9        0.092        4,516          16          12     282284 __import__
 25.4            1        4,432          28          56     158298 exec_module
 16.5            8        2,883          12          40     240297 exec_dynamic
 13.5            1        2,349          28         100      83895 module_from_spec
 13.4        0.156        2,342          28          24      83656 create_module
 13.4        2,341        2,341          12           4     195139 create_dynamic
  8.6        1,507        1,507           4           0     376893 MPI_Init_thread()
  4.4            6          758         240        1896       3162 find_spec
  3.7            2          641          16         128      40065 get_code
  3.6          515          629          16          16      39356 get_data
  2.5          432          432         228           0       1897 stat
  2.2            2          385          32         288      12062 _find_spec
  2.2            3          384          60         692       6407 _get_spec
  1.7        0.353          295         204         204       1449 _path_stat
  1.7        0.507          293          40          40       7344 _path_isfile
  1.7        0.144          293          44          44       6667 _path_is_mode_type
  1.3            3          234           4          84      58644 find_module
  1.0        0.457          181           4           8      45333 Finalize
  1.0          180          180           4           0      45218 MPI_Finalize()
  0.8        0.928          138          24          28       5758 isfile
  0.7          114          115          20           4       5761 read
  0.5           86           87           8           4      10898 open
  0.4        0.475           66           4          12      16504 _fill_cache
  0.4           65           65           4           0      16382 listdir
  0.2         0.46           38          16          16       2434 Barrier
  0.2           38           38          16           0       2406 MPI_Barrier()
  0.1            3           25         712        1824         36 
  0.1            2           22         416         532         53 _compile
  0.1            1           21         440         828         49 match
  0.1            1           16           4          28       4146 compile
  0.0            1            8           4          16       2090 parse
  0.0        0.847            6           4          12       1717 _code
  0.0         0.85            6           4          20       1550 _parse_sub
  0.0         0.96            6          16          64        378 _compile_bytecode
  0.0            2            5         692        1384          8 _path_join
  0.0            4            5           4         212       1331 _parse
  0.0            4            4          16           0        311 loads
  0.0            2            4          28         264        173 _init_module_attrs
  0.0            4            4          32          32        151 __build_class__
  0.0            1            4         164         176         26 __enter__
  0.0            2            3           4          52        928 _compile_info
  0.0            1            3         172           8         21 _path_importer_cache
  0.0            2            2         196         104         14 __init__
  0.0        0.529            2           4           4        607 _path_hooks
  0.0         0.53            2          44          28         55 cached
  0.0            1            2          40         120         60 _get_module_lock
  0.0          0.9            2           4          24        581 detect_encoding
  0.0            1            2         164         192         13 __exit__
  0.0            1            2          12         116        178 _optimize_charset
  0.0            1            1          28         112         70 join
  0.0        0.831            1           4           8        475 path_hook_for_FileFinder
  0.0        0.557            1          28          56         67 _get_cached
  0.0        0.678            1          32         160         49 cache_from_source
  0.0            1            1           4          12        321 get_suffixes
  0.0            1            1           8           0        142 compile
  0.0        0.601            1          16          32         69 _classify_pyc
  0.0        0.475        0.962           4          12        240 __and__
  0.0        0.876         0.96           4          12        240 _mk_bitmap
  0.0        0.818        0.882           8          16        110 find_cookie
  0.0        0.641        0.745          28          56         27 spec_from_file_location
  0.0        0.743        0.743         276           0          3 hasattr
  0.0        0.696         0.73          40          40         18 acquire
  0.0        0.725        0.725        1416           0          1 rstrip
  0.0        0.676        0.676         724           0          1 _verbose_message
  0.0        0.654        0.654         420           0          2 match
  0.0        0.504        0.588          28         112         21 any
  0.0        0.528        0.586          48          48         12 _r_long
  0.0        0.577        0.577         704           0          1 isinstance
  0.0        0.503        0.561          32          64         18 _path_split
  0.0        0.536        0.536         724           0          1 join
  0.0        0.429        0.519           8           8         65 read_or_stop
  0.0        0.471        0.501          20          32         25 extend
  0.0        0.479          0.5           8           8         62 _uniq
  0.0        0.474        0.497           8          20         62 getwidth
  0.0        0.464        0.478          16          16         30 _check_name_wrapper
  0.0        0.454        0.476           8           8         60 __call__
  0.0        0.455        0.455         656           0          1 
  0.0         0.44        0.453           4           4        113 Get_processor_name
  0.0        0.448        0.452           4           4        113 Get_size
  0.0        0.427        0.431           4           4        108 Get_rank
  0.0        0.413        0.429           4           4        107 decode
  0.0        0.415        0.419           4           4        105 _get_literal_prefix
  0.0        0.323        0.323         168           0          2 getattr
  0.0         0.25        0.283          40          40          7 release
  0.0        0.046        0.267          16          16         17 path_stats
  0.0        0.174        0.265          40         120          7 cb
  0.0        0.249        0.249           4           0         62 _Unpickler
  0.0        0.206        0.206         276           0          1 rpartition
  0.0        0.197        0.197           4           0         49 dir
  0.0         0.08        0.178          16          32         11 _validate_timestamp_pyc
  0.0        0.083        0.168          52          52          3 get
  0.0        0.149        0.161         204           8          1 len
  0.0         0.15         0.15         176           0          1 release_lock
  0.0         0.13         0.13         176           0          1 acquire_lock
  0.0        0.117        0.117          68           0          2 __next
  0.0        0.077        0.115          40          44          3 __getitem__
  0.0        0.092        0.112          28          24          4 parent
  0.0        0.111        0.111         168           0          1 append
  0.0        0.014        0.105           4           4         26 _path_isdir
  0.0        0.097        0.097         144           0          1 _relax_case
  0.0        0.097        0.097          32           0          3 is_builtin
  0.0        0.059        0.096          12          44          8 _compile_charset
  0.0         0.09         0.09           8           0         11 readline
  0.0        0.088        0.088           4           0         22 _Pickler
  0.0        0.078        0.078          16           0          5 _fix_co_filename
  0.0        0.074        0.074          80           0          1 allocate_lock
  0.0        0.074        0.074          88           0          1 fspath
  0.0        0.074        0.074          84           0          1 get
  0.0        0.072        0.072          16           0          4 update
  0.0         0.07         0.07          68           0          1 endswith
  0.0        0.067        0.067          80           0          1 get_ident
  0.0        0.065        0.065           5           0         13 print
  0.0        0.038        0.061          28          28          2 _get_sep
  0.0        0.058        0.058          48           0          1 from_bytes
  0.0        0.054        0.054          16           0          3 _new_module
  0.0        0.053        0.053          36           0          1 is_frozen
  0.0        0.049        0.049          44           0          1 find
  0.0        0.049        0.049          40           0          1 startswith
  0.0        0.015        0.045           4           8         11 _simple
  0.0         0.03        0.042          16          16          3 tell
  0.0        0.022        0.034          16          16          2 __len__
  0.0        0.027        0.034          12          12          3 append
  0.0        0.032        0.032           8           0          4 format
  0.0        0.029        0.029           4           0          7 getcwd
  0.0        0.028        0.028          20           0          1 min
  0.0        0.027        0.027          12           0          2 MPI_Finalized()
  0.0        0.018        0.026           8           8          3 isstring
  0.0        0.019        0.023           4           4          6 _get_charset_prefix
  0.0        0.022        0.022           8           0          3 __new__
  0.0        0.021        0.021           8           0          3 fromkeys
  0.0        0.021        0.021          28           0          1 has_location
  0.0         0.02         0.02           4           0          5 _Framer
  0.0        0.017         0.02           4           4          5 fix_flags
  0.0         0.02         0.02           4           0          5 new_module
  0.0         0.02         0.02          28           0          1 ord
  0.0        0.013        0.018           8           8          2 groups
  0.0        0.016        0.016           4           0          4 utf_8_decode
  0.0        0.015        0.015           4           0          4 _Unframer
  0.0        0.014        0.014          16           0          1 get_filename
  0.0        0.013        0.013           4           0          3 MPI_Get_processor_name()
  0.0        0.013        0.013           8           0          2 decode
  0.0        0.013        0.013           4           0          3 translate
  0.0        0.012        0.012          12           0          1 items
  0.0         0.01         0.01           8           0          1 MPI_Comm_set_errhandler()
  0.0        0.009        0.009           4           0          2 setattr
  0.0        0.008        0.008           4           0          2 PickleError
  0.0        0.008        0.008           4           0          2 __setitem__
  0.0        0.008        0.008           8           0          1 _get_iscased
  0.0        0.007        0.007           4           0          2 _Stop
  0.0        0.005        0.005           4           0          1 PicklingError
  0.0        0.005        0.005           4           0          1 S_ISREG
  0.0        0.004        0.004           4           0          1 MPI_Comm_rank()
  0.0        0.004        0.004           4           0          1 MPI_Comm_size()
  0.0        0.004        0.004           4           0          1 UnpicklingError

FUNCTION SUMMARY (mean):
---------------------------------------------------------------------------------------
%Time    Exclusive    Inclusive       #Call      #Subrs  Inclusive Name
              msec   total msec                          usec/call
---------------------------------------------------------------------------------------
100.0        3,007        4,358           1           6    4358943 .TAU application
 37.7            2        1,641           5      177.25     328323 
 29.7        0.496        1,292           5           5     258493 exec
 28.4        0.761        1,235          10          60     123591 _find_and_load
 28.3        0.494        1,234          10          30     123432 _find_and_load_unlocked
 26.8        0.724        1,168           7          42     166937 _load_unlocked
 25.9        0.249        1,129          13          13      86875 _call_with_frames_removed
 25.9        0.153        1,129          17          23      66427 _handle_fromlist
 25.9        0.023        1,129           4           3     282284 __import__
 25.4        0.257        1,108           7          14     158298 exec_module
 16.5            2          720           3          10     240297 exec_dynamic
 13.5        0.446          587           7          25      83895 module_from_spec
 13.4        0.039          585           7           6      83656 create_module
 13.4          585          585           3           1     195139 create_dynamic
  8.6          376          376           1           0     376893 MPI_Init_thread()
  4.4            1          189          60         474       3162 find_spec
  3.7        0.737          160           4          32      40065 get_code
  3.6          128          157           4           4      39356 get_data
  2.5          108          108          57           0       1897 stat
  2.2        0.658           96           8          72      12062 _find_spec
  2.2        0.854           96          15         173       6407 _get_spec
  1.7       0.0882           73          51          51       1449 _path_stat
  1.7        0.127           73          10          10       7344 _path_isfile
  1.7        0.036           73          11          11       6667 _path_is_mode_type
  1.3        0.891           58           1          21      58644 find_module
  1.0        0.114           45           1           2      45333 Finalize
  1.0           45           45           1           0      45218 MPI_Finalize()
  0.8        0.232           34           6           7       5758 isfile
  0.7           28           28           5           1       5761 read
  0.5           21           21           2           1      10898 open
  0.4        0.119           16           1           3      16504 _fill_cache
  0.4           16           16           1           0      16382 listdir
  0.2        0.115            9           4           4       2434 Barrier
  0.2            9            9           4           0       2406 MPI_Barrier()
  0.1        0.802            6         178         456         36 
  0.1        0.622            5         104         133         53 _compile
  0.1         0.28            5         110         207         49 match
  0.1        0.325            4           1           7       4146 compile
  0.0        0.421            2           1           4       2090 parse
  0.0        0.212            1           1           3       1717 _code
  0.0        0.212            1           1           5       1550 _parse_sub
  0.0         0.24            1           4          16        378 _compile_bytecode
  0.0        0.574            1         173         346          8 _path_join
  0.0            1            1           1          53       1331 _parse
  0.0            1            1           4           0        311 loads
  0.0        0.503            1           7          66        173 _init_module_attrs
  0.0            1            1           8           8        151 __build_class__
  0.0        0.287            1          41          44         26 __enter__
  0.0        0.537        0.928           1          13        928 _compile_info
  0.0        0.282        0.897          43           2         21 _path_importer_cache
  0.0        0.561        0.708          49          26         14 __init__
  0.0        0.132        0.607           1           1        607 _path_hooks
  0.0        0.133        0.603          11           7         55 cached
  0.0        0.414        0.603          10          30         60 _get_module_lock
  0.0        0.225        0.581           1           6        581 detect_encoding
  0.0        0.301        0.544          41          48         13 __exit__
  0.0        0.271        0.533           3          29        178 _optimize_charset
  0.0        0.459        0.492           7          28         70 join
  0.0        0.208        0.475           1           2        475 path_hook_for_FileFinder
  0.0        0.139         0.47           7          14         67 _get_cached
  0.0         0.17        0.394           8          40         49 cache_from_source
  0.0        0.316        0.321           1           3        321 get_suffixes
  0.0        0.283        0.283           2           0        142 compile
  0.0         0.15        0.277           4           8         69 _classify_pyc
  0.0        0.119         0.24           1           3        240 __and__
  0.0        0.219         0.24           1           3        240 _mk_bitmap
  0.0        0.204        0.221           2           4        110 find_cookie
  0.0         0.16        0.186           7          14         27 spec_from_file_location
  0.0        0.186        0.186          69           0          3 hasattr
  0.0        0.174        0.182          10          10         18 acquire
  0.0        0.181        0.181         354           0          1 rstrip
  0.0        0.169        0.169         181           0          1 _verbose_message
  0.0        0.164        0.164         105           0          2 match
  0.0        0.126        0.147           7          28         21 any
  0.0        0.132        0.146          12          12         12 _r_long
  0.0        0.144        0.144         176           0          1 isinstance
  0.0        0.126         0.14           8          16         18 _path_split
  0.0        0.134        0.134         181           0          1 join
  0.0        0.107         0.13           2           2         65 read_or_stop
  0.0        0.118        0.125           5           8         25 extend
  0.0         0.12        0.125           2           2         62 _uniq
  0.0        0.118        0.124           2           5         62 getwidth
  0.0        0.116        0.119           4           4         30 _check_name_wrapper
  0.0        0.114        0.119           2           2         60 __call__
  0.0        0.114        0.114         164           0          1 
  0.0         0.11        0.113           1           1        113 Get_processor_name
  0.0        0.112        0.113           1           1        113 Get_size
  0.0        0.107        0.108           1           1        108 Get_rank
  0.0        0.103        0.107           1           1        107 decode
  0.0        0.104        0.105           1           1        105 _get_literal_prefix
  0.0       0.0808       0.0808          42           0          2 getattr
  0.0       0.0625       0.0707          10          10          7 release
  0.0       0.0115       0.0668           4           4         17 path_stats
  0.0       0.0435       0.0663          10          30          7 cb
  0.0       0.0622       0.0622           1           0         62 _Unpickler
  0.0       0.0515       0.0515          69           0          1 rpartition
  0.0       0.0493       0.0493           1           0         49 dir
  0.0         0.02       0.0445           4           8         11 _validate_timestamp_pyc
  0.0       0.0208        0.042          13          13          3 get
  0.0       0.0372       0.0403          51           2          1 len
  0.0       0.0375       0.0375          44           0          1 release_lock
  0.0       0.0325       0.0325          44           0          1 acquire_lock
  0.0       0.0293       0.0293          17           0          2 __next
  0.0       0.0192       0.0288          10          11          3 __getitem__
  0.0        0.023        0.028           7           6          4 parent
  0.0       0.0278       0.0278          42           0          1 append
  0.0       0.0035       0.0262           1           1         26 _path_isdir
  0.0       0.0243       0.0243          36           0          1 _relax_case
  0.0       0.0243       0.0243           8           0          3 is_builtin
  0.0       0.0147        0.024           3          11          8 _compile_charset
  0.0       0.0225       0.0225           2           0         11 readline
  0.0        0.022        0.022           1           0         22 _Pickler
  0.0       0.0195       0.0195           4           0          5 _fix_co_filename
  0.0       0.0185       0.0185          20           0          1 allocate_lock
  0.0       0.0185       0.0185          22           0          1 fspath
  0.0       0.0185       0.0185          21           0          1 get
  0.0        0.018        0.018           4           0          4 update
  0.0       0.0175       0.0175          17           0          1 endswith
  0.0       0.0168       0.0168          20           0          1 get_ident
  0.0       0.0163       0.0163        1.25           0         13 print
  0.0       0.0095       0.0152           7           7          2 _get_sep
  0.0       0.0145       0.0145          12           0          1 from_bytes
  0.0       0.0135       0.0135           4           0          3 _new_module
  0.0       0.0132       0.0132           9           0          1 is_frozen
  0.0       0.0123       0.0123          11           0          1 find
  0.0       0.0123       0.0123          10           0          1 startswith
  0.0      0.00375       0.0112           1           2         11 _simple
  0.0       0.0075       0.0105           4           4          3 tell
  0.0       0.0055       0.0085           4           4          2 __len__
  0.0      0.00675       0.0085           3           3          3 append
  0.0        0.008        0.008           2           0          4 format
  0.0      0.00725      0.00725           1           0          7 getcwd
  0.0        0.007        0.007           5           0          1 min
  0.0      0.00675      0.00675           3           0          2 MPI_Finalized()
  0.0       0.0045       0.0065           2           2          3 isstring
  0.0      0.00475      0.00575           1           1          6 _get_charset_prefix
  0.0       0.0055       0.0055           2           0          3 __new__
  0.0      0.00525      0.00525           2           0          3 fromkeys
  0.0      0.00525      0.00525           7           0          1 has_location
  0.0        0.005        0.005           1           0          5 _Framer
  0.0      0.00425        0.005           1           1          5 fix_flags
  0.0        0.005        0.005           1           0          5 new_module
  0.0        0.005        0.005           7           0          1 ord
  0.0      0.00325       0.0045           2           2          2 groups
  0.0        0.004        0.004           1           0          4 utf_8_decode
  0.0      0.00375      0.00375           1           0          4 _Unframer
  0.0       0.0035       0.0035           4           0          1 get_filename
  0.0      0.00325      0.00325           1           0          3 MPI_Get_processor_name()
  0.0      0.00325      0.00325           2           0          2 decode
  0.0      0.00325      0.00325           1           0          3 translate
  0.0        0.003        0.003           3           0          1 items
  0.0       0.0025       0.0025           2           0          1 MPI_Comm_set_errhandler()
  0.0      0.00225      0.00225           1           0          2 setattr
  0.0        0.002        0.002           1           0          2 PickleError
  0.0        0.002        0.002           1           0          2 __setitem__
  0.0        0.002        0.002           2           0          1 _get_iscased
  0.0      0.00175      0.00175           1           0          2 _Stop
  0.0      0.00125      0.00125           1           0          1 PicklingError
  0.0      0.00125      0.00125           1           0          1 S_ISREG
  0.0        0.001        0.001           1           0          1 MPI_Comm_rank()
  0.0        0.001        0.001           1           0          1 MPI_Comm_size()
  0.0        0.001        0.001           1           0          1 UnpicklingError

ParaProf
ParaProf is TAU's graphical profile analysis utility. To use ParaProf:

(1) Start a GUI session through FAS VDI portal.

(2) Go to the directory containing the profile.* files.

(3) If you have not already done so, load the TAU environment and then issue the paraprof command:

$ module load gcc/9.3.0-fasrc01 openmpi/4.0.2-fasrc01 tau/2.29-fasrc01
$ paraprof

(4) A manager window and profile summary window will appear. Left/right clicking on items in the summary window allows you to view different types of information, as do the various menu selections.

ParaProf includes many features for diving more deeply into your application's behavior - see the TAU Documentation for details. A few representative screenshots are provided below:

tau01

tau02

tau03

tau04

tau05

tau07