Calculating the FLOPS of your NVIDIA gpu

This is the formula for calculating the peak FLOPS on your cuda enabled gpu. (as uses by NVIDIA) This is useful as a comparison when evaluating how many FLOPS your cuda kernel is achieving, and how much you can hope to gain through optimization. You can get the numbers needed from the NVIDIA control panel’s system information option.

If your card is pre-fermi:
Processor clock*cuda cores*3

If your card is fermi based: (or higher?)
Processor clock*cuda cores*2

Original Source here.

Advertisements

One thought on “Calculating the FLOPS of your NVIDIA gpu”

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s