Calculating the FLOPS of your NVIDIA gpu

This is the formula for calculating the peak FLOPS on your cuda enabled gpu. (as uses by NVIDIA) This is useful as a comparison when evaluating how many FLOPS your cuda kernel is achieving, and how much you can hope to gain through optimization. You can get the numbers needed from the NVIDIA control panel’s system information option.

If your card is pre-fermi:
Processor clock*cuda cores*3

If your card is fermi based: (or higher?)
Processor clock*cuda cores*2

Original Source here.

One thought on “Calculating the FLOPS of your NVIDIA gpu

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s