9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Discussion about Intel CPUs and overclocking. Need help with that new Intel processor? Not sure which one is right for you? Like to void your warranty? This is the place for you! Please keep the topic on Intel Processors!
Post Reply
graysky
Legit Aficionado
Legit Aficionado
Posts: 83
Joined: Wed Nov 09, 2005 3:03 pm

9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Post by graysky »

What is a better overclock?

Good question. Most people believe that a higher FSB and lower multiplier are better since this maximizes the bandwidth on the FSB. Or is a low bus rate and higher multiplier better? Or is there no difference? I looked at three different settings on my Q6600:

9x333 = 3.0 GHz (DRAM was 667 MHz)
8x375 = 3.0 GHz (DRAM was 750 MHz)
7x428 = 3.0 GHz (DRAM was 856 MHz)

The DRAM:CPU ratio was 1:1 for each test and the voltage and timings were held constant; voltage was 2.25V and timings were 4-4-4-12-4-20-10-10-10-11.

After the same experiments, at each of these settings, I concluded that there is no difference for real world applications. If you use a synthetic benchmark, like Sandra, you will see faster memory reads/writes, etc. with the higher FSB values -- so what. These high FSB settings are great if all you do with your machine is run synthetic benchmarks. But the higher FSB values come at the cost of higher voltages for the board which equate to higher temps.

I think that FSB bandwidth is simply not the bottle neck in a modern system... at least when starting at 333. Perhaps you would see a difference if starting slower. In other words, a 333 MHz FSB quad pumped to 1333 MHz is more than sufficient for today’s applications; when I increased it to 375 MHz (1500 MHz quad pumped) I saw no real-world change; same result when I pushed it up to 428 MHz (1712 MHz quad pumped). Don’t believe me? Read this thread wherein x264.exe (a video encoder) is used at different FSB and multiplier values. Have a close look at the 3rd table in that thread and note the FPS (frames per second) numbers are nearly identical for a chip clocked at the same clockrate with different FSB speeds. This was found to be true of C2Q as well as C2D chips.

You can do a similar test for yourself with applications you commonly use on your machine. Time them with a stop watch if the application doesn’t report its own benchmarks like x264 does.

Some "Real-World" Application Based Tests

Three different 3.0 GHz settings on a Q6600 system were tested with some apps including: lameenc, super pi, x264, winrar, and the trial version of photoshop. Here are the details:

Test O/C 1: 9x333 = 3.0 GHz
Image

Test O/C 2: 8x375 = 3.0 GHz
Image

Test O/C 3: 7x428 = 3.0 GHz
Image

Result: I could not measure a difference between a FSB of 333 MHz, 375 MHz, or 428 MHz using these application based, "real-world" benchmarks.

Since 428 MHz is about 28 % faster than 333 MHz, you’d think that if the FSB was indeed the bottle neck, the higher values would have given faster results. I believe that the bottleneck for most apps is the hard drive.

Description of Experiments and Raw Data

Lame version 3.97 – Encoded the same test file (about 60 MB wav) with these commandline options:

Code: Select all

lame -V 2 --vbr-new test.wav
(which is equivalent to the old –-alt-preset fast standard) a total of 10 times and averaged play/CPU data as the benchmark.

Super Pi version 1.1 – Ran both the 1M and 2M tests and compared the reported total number of seconds to calculate as the benchmark.

x264 version 0.54.620 – Ran a 2-pass encode on the same MPEG-2 (480x480 DVD source) file twice and averaged the FPS1 and FPS2 numbers as the benchmark. In case you’re wondering, here is the commandline options for this encode, pass1:

Code: Select all

x264 --pass 1 --bitrate 1000 --stats "C:\work\test-NEW.stats" --bframes 3 --b-pyramid --direct auto --subme 1 --analyse none --vbv-maxrate 25000 --me dia --merange 12 --threads auto --thread-input --progress --no-psnr --no-ssim --output NUL "C:\work\test-NEW.avs"
And for pass2:

Code: Select all

x264 --pass 2 --bitrate 1000 --stats "C:\work\test-NEW.stats" --ref 3 --bframes 3 --b-pyramid --weightb --direct auto --subme 6 --trellis 1 --analyse all  --8x8dct --vbv-maxrate 25000 --me umh --merange 12 --threads auto --thread-input --progress --no-psnr --no-ssim --output "C:\work\test-NEW.264" "C:\work\test-NEW.avs"
The input avisynth script was:

Code: Select all

global MeGUI_darx = 4
global MeGUI_dary = 3
DGDecode_mpeg2source("C:\work\test-new.d2v")
AssumeTFF()
Telecide(guide=1,post=2,vthresh=35) # IVTC
Decimate(quality=3) # remove dup. frames
crop( 2, 0, -10, -4)
Spline36Resize(640,480) # Spline36 (Neutral)
RAR version 2.63 – Had rar run my standard backup batch file which generated about 0.98 G of rars (1,896 files totally). Here is the commandline I used:

Code: Select all

rar a -u -m0 -md2048 -v51200 -rv5 -msjpg;mp3;tif;avi;zip;rar;gpg;jpg  "e:\Backups\Backup.rar" @list.txt
where list.txt a list of all the dirs I want it to back up. I timed how long it took to complete with a stop watch. I ran the backup twice and averaged it as the benchmark.

Trial of Photoshop CS3 – I used the batch function in PSCS3 to batch bicubic resize 10.1 MP to 0.7 MP (3872x2592 --> 1024x685), then applied an unsharpen mask (60 %, 0.8 px radius, threshold 12), and finally saved as quality 8 jpg. In total, 57 jpg files were used in the batch. I timed how long it took to complete two runs, and averaged them together as the benchmark.

Here are the raw data if you care to see them:
Image

I just read the FSB1333 Intel Processors & New 2007 CPU Charts article over at TH.com and am happy to see that the testers over there have drawn the same conclusion that I have about fixed final core speeds with higher and higher FSB speeds: faster FSB speeds w/ a C2Q/C2D don't equate to faster real-world benchmarks.

Have a look at page 8 from their article comparing the "old" 1066 MHz FSB to the "new" 1333 MHz FSB chips: average gain <1 %.
Blacklash
Legit User
Legit User
Posts: 24
Joined: Thu Feb 22, 2007 1:01 am

Re: 9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Post by Blacklash »

Wish I could remember where I results akin to these. Basically, I've seen data that suggests a Core chip at its native multi will be faster than one lower to the same said multi due to issues with the strap|chipset. IE 8x400 on an E6400 should be slightly faster than 8x400 on an E6600 that has had it's multi lowered. I always tend to run my chips native multi. If I want 8x I buy 8x. Your option for that with a Quad core would be an X3210.

http://www.allstarshop.com/shop/product ... VXXPQN8ND5

Supposedly it is less stressful overclocking in this manner as well.
User avatar
Major_A
Legit Extremist
Legit Extremist
Posts: 3793
Joined: Tue May 15, 2007 2:11 pm
Location: Houston, TX

Re: 9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Post by Major_A »

A faster CPU will out perform overclocked RAM any day of the week. Unless you want to show us some [sarcasm}useful SANDRA/Everest RAM tests[/sarcasm]. This is why you didn't notice any "real world" improvements. Max out the CPU then play around with the RAM.

Forgive my ignorance but what program did you use to encode h.264? Does that program utilize all the CPU cores? I use Nero Recode all the time to take TV recordings and put it on my iPod. It pegs both cores on my computer and is uses h.264. If you don't own it you can download a trial of Nero 7 and see if your results are the same with your different settings.

Example....
Image
User avatar
DMB2000uk
Site Admin
Site Admin
Posts: 7095
Joined: Mon Jul 18, 2005 5:36 pm
Location: UK

Re: 9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Post by DMB2000uk »

Maybe we need someone to do this test who has an unlinked memory board?

Good effort though gray.

Dan
Image (<- Clickable)
User avatar
Bio-Hazard
Legit Extremist
Legit Extremist
Posts: 2302
Joined: Thu May 06, 2004 9:48 pm
Location: Back Woods Of MO.

Re: 9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Post by Bio-Hazard »

Sounds like a good idea as I've notice some differances in performance depending on what speed I clock my ram to. Super Pi times alone improve by as much as 1.5 seconds with the core staying at the same speed.
User avatar
Major_A
Legit Extremist
Legit Extremist
Posts: 3793
Joined: Tue May 15, 2007 2:11 pm
Location: Houston, TX

Re: 9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Post by Major_A »

SuperPi is really RAM sensitive. If I'm overclocking and want to check for RAM stability I run SuperPi. If you get a rounding error your RAM isn't stable.
User avatar
Bio-Hazard
Legit Extremist
Legit Extremist
Posts: 2302
Joined: Thu May 06, 2004 9:48 pm
Location: Back Woods Of MO.

Re: 9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Post by Bio-Hazard »

Exactly my point, there are other programs out there that can take advantage of increased ram speeds (unlinked) as well. That's why it'd be nice to see all the same tests ran on both types of systems................ :mrgreen:
User avatar
Major_A
Legit Extremist
Legit Extremist
Posts: 3793
Joined: Tue May 15, 2007 2:11 pm
Location: Houston, TX

Re: 9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Post by Major_A »

Sorry just woke up, still had goop in my eyes.
User avatar
Kougar
Legit Extremist
Legit Extremist
Posts: 251
Joined: Mon Feb 12, 2007 12:59 pm
Location: Texas

Re: 9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Post by Kougar »

Increasing the RAM speed will help a little, but even fast RAM is not anything a person would ever notice if still using a 3Ghz Core 2 Duo. Even in a game like Supreme Commander that will eat the entire 2Gb by itself a person wouldn't know if they were running 800MHz or 1066MHz RAM just be feel.

Raising the FSB does come at a penalty for the chipset. Think of the chipset as a mini-CPU, it has a internal bus speed and a multiplier/strap. The strap setting is akin to a RAM divider, that it will use to adjust it's own clockrates to stay in sync with the system. Or in this case during overclocking. The higher the FSB, the looser the MCH strap has to be set to remain stable. There are at least 13 strap levels I know about... stock speed is usually a strap of 3 for instance. A 400FSB could result in a strap of 6, 500FSB a strap of 9, and so on. Ingoring the CPU, any minor gains in performance from a higher FSB would be offset by the minor penalty from a higher strap. A strap of 11 will have a significant impact on memory performance sensitive applications compared to say a strap of 4, for the same reasons a 1:1 synced RAM divider brings a small 2-5% performance boost over any other divider or asynchronous/unlinked setting. It would be enough to add one or two seconds to a SuperPi 1M time, or show some significant differences in any synthetic memory bandwidth benchmark.

Memset3.3 beta2 is a nifty program for Intel chipsets that will let you view and even adjust the MCH strap. Anyone that does care enough about performance to the point they tweak their RAM subtimings might be interested in trying it. On a good board with a tuned BIOS the user will likely only be able to tighten the strap by a level or two at best. Any minor change by 1-3 strap levels would only be about as noticeable to the end user as the difference between 800Mhz and 1066Mhz RAM, but it's there for those who want it. Or for P35 Gigabyte users like myself, whom until the most recent BETA bios release were all running MCHbar straps of "11" at all speeds. :roll:
Core i7 920 @ 4.2GHz 1.36v
Gigabyte GA-X58-UD5
Under Water
graysky
Legit Aficionado
Legit Aficionado
Posts: 83
Joined: Wed Nov 09, 2005 3:03 pm

Re: 9x333, 8x375, or 7x428 on a Q6600 - Which is faster?

Post by graysky »

Major_A wrote:Forgive my ignorance but what program did you use to encode h.264? Does that program utilize all the CPU cores?
The software is x264.exe but I use MeGUI as a frontend. Yes, x264 maxes out all 4 cores (on the 2nd pass encode).

To speak to the whole SuperPi thing. I shouldn't have included it here since I did want to look at real world apps. It is true that you can get 2-5 % faster times with higher and higher FSB values in SuperPi... but you can only see these slight increases when calculating the really huge # of decimal places. Bottom line is for "real computing," you won't see a difference.
http://encoding.n3.net <--- for all your DVD and audio CD backup needs!

Image
Post Reply