Skip to content

Commit

Permalink
add a10 (mosaicml#3336)
Browse files Browse the repository at this point in the history
  • Loading branch information
mvpatel2000 authored May 28, 2024
1 parent 1c1f36e commit e90afa4
Showing 1 changed file with 11 additions and 0 deletions.
11 changes: 11 additions & 0 deletions composer/callbacks/speed_monitor.py
Original file line number Diff line number Diff line change
Expand Up @@ -58,6 +58,15 @@
'bf16': 312e12,
'amp_bf16': 312e12,
},
# source: https://www.nvidia.com/content/dam/en-zz/Solutions/Data-Center/a10/pdf/a10-datasheet.pdf
'a10': {
'fp32': 31.2e12,
'tf32': 62.5e12,
'fp16': 125e12,
'amp_fp16': 125e12,
'bf16': 125e12,
'amp_bf16': 125e12,
},
# source: https://images.nvidia.com/content/technologies/volta/pdf/volta-v100-datasheet-update-us-1165301-r5.pdf
'v100-sxm': {
'fp64': 7.8e12,
Expand Down Expand Up @@ -113,6 +122,8 @@ def get_gpu_flops_available(state: State):
device_name = 'h100-pcie'
elif 'a100' in device_name:
device_name = 'a100'
elif 'a10' in device_name:
device_name = 'a10'
elif 'v100-sxm' in device_name:
device_name = 'v100-sxm'
elif 'v100-pcie' in device_name:
Expand Down

0 comments on commit e90afa4

Please sign in to comment.