Tom's Hardware Wants You: CPU Tests For 2011
We're looking for a few good suggestions as we revamp our processor testing suite for 2011. Is there something you'd like to see heading into the new year? Here's your chance to make your voice heard!
We’ve had a lot of fun ramping up audience interaction here on Tom’s Hardware—from the contests (it seems like there is always at least one running) to the forums-driven Best Configs section going live this week to the comments and emails we get from every single story that gets published. Keep it coming—and we’ll do the same.
In the meantime, we’re working on our benchmark suites for 2011 (yes, there will be several) and want your input each step of the way.
Currently, I think we have a reasonable mix of gaming-, productivity-, media-, and synthetic-based tests in our automated CPU software package. But it’s hard to know if we’re giving you the performance results you most want to see in the software you most commonly use unless you speak up and let us know.
Now, keep in mind that we’d prefer to use free/trial/open source software, making reproducibility an option for as many of our readers as possible. If it turns out we’re able to lean heavily on widely available titles, it might even be possible to make the suite downloadable, allowing you to run the same tests we run for quick and easy comparison. Understandably, benchmarks that still require a license probably won’t be part of that package.
Here’s our current list of metrics:
- Apple iTunes 10.0.1
- TMPG 4.7 with DivX 6.9.2 and Xvid 1.2.2
- MainConcept Reference 2.0
- HandBrake 0.9.4
- Autodesk 3ds Max 2010
- WinRAR 3.92
- 7-Zip 4.65
- Adobe Photoshop CS5
- AVG Anti-Virus 11.0
- 3DMark Vantage
- PCMark Vantage
- SiSoftware Sandra 2010
- Metro 2033
- Just Cause 2
- Call of Duty: Modern Warfare 2
- DiRT 2
Occasionally you’ll see us swap out a game, exclude AVG, or add something like Fritz, but that’s the lineup I’m using as of…well, now. So, give us some suggestions on what you’d like to see in 2011. Bear in mind that the apps we use will likely be the ones most applicable to the largest number of people, though there is room for a handful of more fringe tests, too.
As we revamp other aspects of testing—from graphics card measurements to workstation and mobile suites, I’ll ask for your input first, as our team in Germany sets out to automate as much of what we do as possible.
Thanks again for the input. I’ll be stopping by the comments section here, of course. And if you want to reach me directly, I swear I’m trying to get better about interacting with Twitter. I’m even planning to give away some hardware there...
Chris Angelini
Managing Editor, Tom’s Hardware
- The iPad is Coming to Walmart This Friday
- Nvidia Branded Video Cards Only Sold in Best Buy
- A Nintendo 64 Modded Inteo An HTPC
- Hitachi Ships Industry's Fastest 10K RPM HDD
- Tom's Hardware is Now Available on Kindle!
- These Are ASRock's Favorite Motherboard Ideas
- Next StarCraft 2 Chapter May Come In Early 2012
- Gearbox Wants To Honor Old DNF Pre-Orders
- Ballmer: China Business Piracy A 'Real Problem'
- Intel Reports Record $11.1 Billion Revenue
- 64 GB Coming to SD Cards, USB Drives
- WD: 1TB of Digital Content Per House By 2014
- Intel Praises iPad; Making Atom Secret 4G Weapon
- Foxconn Raising Prices to Avoid Further Suicides
- Intel Promises Big Performance With Sandy Bridge
- DNF 3x The Size of MW2; DN3D Remake OK'd
- bModo's Touchscreen Tabet Does Win7, Atom
- It's the World's 1st Gaming Keystroke Keyboard





How about a folding at home test on the CPU / GPU? People build farms for FAH or similar distributed projects.
David
Dump iTunes until they introduce multi-threading.
would love to comment, but you don't read uk comments so I won't bother. Ta for the thought though.
what about adding the boot / shutdown time of the system and the launch time of applications ?
that is user experience accurate as well imo
The games list is missing a good RTS title. SC2 is a natural choice since it has been proven to make CPU's work quite hard (and is extremely popular, so there's a good reference base).
I'd also like to see Microsoft Security Essentials as an alternative to AVG.
what about adding the boot / shutdown time of the system and the launch time of applications ?that is user experience accurate as well imo
Far more storage dependent.
Well I really would like to see real-life situations like multitasking. I often multitask on my computer, and a test set up as follows would be relevant for me:
Start up a game (Battlefield Bad Company 2, or something like that)in windowed mode
Have an mp3, Flac, other music-file playing in the background
and then start to unzip a VERY big file
Time the unzipping and show the result.
Our Cpu's get more and more cores, so multitasking gets more and more relevant in my opinion.
The test must not be held back by a slow harddisk, so the game could be running on one disk while the music file and unzipping could be reading/writing to/from another.
Real Time Strategy games stress the CPU far far more than most other games. As a result this is one situation where CPU performance is vital (video encoding, for example, just takes longer). Not enough CPU grunt has the potential ruin your experience of a game as the fps number plummet into single digits. As a result, it makes sense to include number for games like Supreme Commander 2, where there might be hundreds of units on screen at once, as well large numbers of physics calculations. If you want to really test CPU performance, this would be a great thing to include (PS, not Starcraft, its not taxing enough)
I'd just like to see people no longer comparing apples to oranges as even this site has been guilty of in the past. Comparing Phenom II architecture to i7 architecture is the same as comparing Core2Quad architecture to i7 architecture. It's stupid, useless and misleading. In addition, stop using Futuremark since it has been proven that it optimizes better for any CPU identifying itself as an Intel. Once that was proven, sites should have stopped using it.
I'm still wondering how much software is out there that doesn't run optimally on AMD CPUs thanks to Intel's compilers. If it's widespread, is this going to detrimentally affect Bulldozer?
The games list is missing a good RTS title. SC2 is a natural choice since it has been proven to make CPU's work quite hard (and is extremely popular, so there's a good reference base).I'd also like to see Microsoft Security Essentials as an alternative to AVG.
And while we're at it, MalwareBytes Anti-Malware as an alternative...? LOVE that program.
That list seems fine. But could you chuck your test render for 3DS max 2010 on the benchmark page so we can download said scene and render it to compare CPUs to our CPUs.
I'd just like to see people no longer comparing apples to oranges as even this site has been guilty of in the past. Comparing Phenom II architecture to i7 architecture is the same as comparing Core2Quad architecture to i7 architecture. It's stupid, useless and misleading. In addition, stop using Futuremark since it has been proven that it optimizes better for any CPU identifying itself as an Intel. Once that was proven, sites should have stopped using it.
As for the comment on Futurmark I agree.
But for the CPU list I completely disagree. I'm still running a Core2Quad, and when I eventually upgrade, I'd like to see how my previous CPU compares to whatever CPU I have settled on.
I would like to see workstation performance reviews, like Xeon and Opteron based pc:s. There are number of free trial licenses for CAD-programs, but unfortunately programs like MATHLAB requires license, but I think it is so essential in scientific world that it shouldn't matter. Also FEM (finite element method) calculations needs significant number crushing capabilities, some free softwares exists atleast in linux world, but some commercial software should be included. FEM is the standard solver in mechanics and physics so it is in interest of many. Besides, I havent seen anyone doing performance reviews on scientific and industrial cases, so it would be great merit for TOM's.
We need to get civ 5 in there for a game. As can be seen form numerous forums the game kills systems during turns.
Not totally on topic but seems a good place to put in a request!.
It would be good to see a rolling (e.g. repeat lab every 6months say) hi-spec engineering/scientific parallel cluster chart (best performance with CPU boxes and best price per bang).
A good solid benchmark would be using OpenFoam CFD 1.7.1 freeware. A 50 million cell, steady state, turbulent, incompressible model would make a good solid test case metric to benchmark on multi CPU's.
It would be nice to see the equivalent of alt-tabbing from a particularly demanding process (i.e. a game) to desktop. While to some extent that's more a RAM, hdd and process-specific issue, its still something that people do fairly often and also tends to vary widely with the system. How to make an effective benchmark out of it, I don't know, but its an interesting point of comparison nonetheless.
It would also be nice to see more benchmarks in general that deal with running several different applications at once, like a web browser, a music player, copying a large file, encoding a video, running a virus scan or similar operation, and playing a game all at once. Sure, so many things at once is a fairly extreme case, but it should provide an interesting comparison of processors taxed nearly to their limits in a somewhat realistic environment (at least an environment that is more familiar than the many synthetic benchmark tests).
Linux benchmarks. And i love to see some benchmarks running inside a VM (i.e. compressing 10gb inside a Ubuntu guest in VirtualBox).
Lightroom!
I think it would be a good idea to consider hyper-threading on and off separately, where appropriate.
It would be useful to know how much better (or worse!) a processor with (e.g.) 4 physical cores performs with 8 logical cores, as opposed to 5 logical cores.
This would be of particular interest to overclockers, as the cpu overclock limit is reduced when HT is on (obviously).
I suspect that applications that are not heavily multi-threaded will respond negatively to hyper-threading, so it would be useful for users to be able to weigh up the different options (i.e. HT on or off) when looking at your benchmarks.
^ typo: I meant "4 logical cores", not 5! Whoops.
There's something that I've been wanting to see in the mass-benchmark comparisons for a long time:
There are so many processors included in the comparison that the charts can be a bit overwheming. I think that there should be an alternative way to view this information that is more meaningful. Here is my suggestion:
Exploit the fact that there are many processors that are effectively the same thing (e.g. i7-920 and i7-940), i.e. the same architecture, clocked at different frequencies. So, why not display a graph with cpu frequency on the x-axis, score on the y-axis, and have one line/curve for each architecture, instead of for each cpu. Imagine it - this would reduce the number of lines to a minimum, and would make it obvious to the user which architectures are better for each test. Furthermore, it would make it more obvious whether or not cpu frequency has a significant effect on that particular benchmark.
This could be expanded by overclocking (or underclocking) the processors within some reasonable range (e.g. 2Ghz-4Ghz, or whatever the max of that particular architecture is) in order to extrapolate the results, in an overclocker-friendly fashion, so that people will be informed of the expected benefits (if any) of running their cpu at a particular frequency. This would result in each of the stock cpus being a point on a continuous line on the graph.
In the past, when researching which cpu to get, I have personally implemented this at home in a spreadsheet (using your charts as reference), and the results are fantastic, and very informative. Please consider implementing this as a additional feature, as I think people would find it very useful. To my knowledge, no-one else on the web does this at current.
OK, maybe this doesn't really fall into the category, but there is a hardware test I would love to read about in Tom's Hardware.
It would actually be a comparison between "typical" pc configurations from the last, let's say, 20 years. But the point is not finding how faster an i5 is to a Pentium, but answer the question whether modern systems are faster in daily use, than older ones used to be.
And to be more specific, being a Pc uses for 14 years now, I am not so sure my PC or my laptop is actually faster than my first PC (Pentium 133Mhz) was.
Why? Well, in my Pentium, Doom and Doom 2 worked great, but in my Athlon XP, Doom 3 was almost unplayable, unless lowering it's graphics quality. What is more, I really think Windows 95 loaded faster on my old Pentium, than Windows Vista do on my Core 2 Duo. Not to mention that internet browsing doesn't look that faster on a DSL line than on my PSTN, ten or so years ago.
So, to sum things up, yeah, everybody knows that a modern multi-core system with 4GB of RAM is a lot faster than a 486 with 4MB of RAM, but the software we use also gets heavier and cpus and gpus try to cope up with it, maybe more than they used to.
My idea is to assemble "typical" systems of the past, like 386, 486, Pentiums and "modern" systems, like Core2 series, i series, load them with the corresponding software (Windows, games, other software) and benchmark them! Did Windows uses to load faster in the past? Did Doom I and Duke Nukem used to have a better framerate, at full graphics, than Crysis or Halo do today, at medium detail? Those are some questions I would like to see answered!
Thanks for your time,
Kostas Karakostas
I'd like to see Malwarebytes Antimalware in the suite of test products... oh and the internet security comparison as well...