Benchmarks¶
pyshmem includes benchmark-marked tests for both round-trip IO and matrix-vector multiply pipeline workloads.
Running benchmarks locally¶
CPU benchmark smoke tests:
pytest -m "cpu and benchmark" -q -s
GPU benchmark smoke tests:
pytest tests/test_benchmark.py -m "gpu and benchmark" -q -s
GPU pipeline shapes¶
Two GPU MVM benchmark shapes are included:
host-upload pipeline: the vector is produced in NumPy and uploaded each iteration
device-resident pipeline: the vector is produced directly on GPU each iteration
Environment knobs¶
pyshmem_MVM_DIMpyshmem_CPU_MVM_ITERATIONSpyshmem_GPU_MVM_ITERATIONSpyshmem_GPU_DEVICE_MVM_DIMpyshmem_GPU_DEVICE_MVM_ITERATIONSpyshmem_GPU_DEVICE_MVM_WARMUP_ITERATIONS
See the project README for the current measured results captured on the primary development machine.