Download the Tom's Hardware App from the App Store
The reference for current tech news
Yes No

The Architecture in Detail

by

Like Nvidia, AMD has chosen to build on its previous architecture rather than starting from scratch. It’s very much the same as that of the R600, which had already been re-used for the RV670.

Radeon HD 4850

SIMD cores

The architecture initially introduced with the Xenos, which is the same GPU used in the Xbox 360, is based on a group of SIMD arrays. The Xenos had three SIMD arrays, and the R600 and RV670 have four. The RV770 goes much further with ten.

Radeon HD 4850

As you’ve deduced, each SIMD array contains 80 ALUs since the GPU has 800 ALUs. That’s true, but it’s a slightly simplified view of reality. In practice, the 80 ALUs are not independent of each other. They’re grouped together in five-way VLIW units – 16 units per SIMD array. This organization implies certain restrictions on the instructions executed; each of the five instructions of a VLIW bundle has to be independent from the others. It’s up to the compiler to find enough independent instructions to saturate the ALUs – unlike the G80, which uses a more "hardware" solution.

Here’s an example to illustrate what we just described:

  • I1 FADD R1, R1, 3.14
  • I2 FMUL R2, R1, 1.41
  • I3 FMAD R3, R0, 0.5, 0.5

In this case, Instructions 1 and 3 can share the same bundle, but not Instruction 2, which depends on the result of Instruction 1. If the compiler can’t find enough operations in its window of instructions, it has to fill the bundle with NOP instructions that don’t do anything, thus reducing the chip’s performance. What all that adds up to in the present case is that Nvidia ALUs will hit their peak performance more often because they’re less dependent on the underlying code; but the down side is that they’re much more costly in terms of transistors. AMD’s units depend strongly on the compiler’s performance (the compiler that’s “internal” to the driver, which reorganizes the assembler instructions generated by the HLSL), but AMD can afford to include a much larger number on a die that’s still significantly smaller.

Radeon HD 4850

The VLIW units themselves haven’t been heavily reworked; there are four units capable of executing a FMAD or an integer addition and a special unit capable of executing either a FMAD or an integer multiplication, or a transcendental function (sine, cosine, log, exp, etc.) The only real improvement is to bit-shifting operations in integers, which can now be handled by any of the five units, whereas on the 2900/3800 only the special unit could perform these operations. Rather than make them more powerful, AMD has concentrated on optimizing them in order to reduce their size on the die to be able to fit more of them on the device.

Share:
6
Comments
Read more
X
Submit

Comments
Read the comments on the forums
TechSupport 25/06/2008 13:43
Hide
-0+

Bottom line as i see it, this is a Smashing card!
Prices on the UK market put a HD4850 at £120 (ebuyer.com) where as the GTX260 is £300! considering the overal performance difference between the two, i'd get 2 HD4850's in Crossfire mode and still be cheaper than a GTX260.

Once again AMD/ATI have produced a card that isnt amazing on performance, but bang for your buck its probably the best card on the market at those prices

david__t 25/06/2008 13:54
Hide
-0+

The very fact that, yet again, we have a whole page dedicated to noise is a sure sign that these manufacturers still don't have all of the priorities covered. How many years ago was it that THG posted that funny video showing an Nvidia card as a leaf blower & barbecue? - and still we have to put up with substandard cooling solutions. I haven't bought a graphics card for years now without getting an aftermarket heatsink to go with it - thank god for companies like Zalman & Thermalright who are sparing our ears!!

americanbrian 25/06/2008 19:37
Hide
-0+

I have seen on the reg that the 4870 more than doubles the performance of a 3870. If this is true and they hit a ~$300 price tag than AMD/ATI are back baby.

Apparently they use 160W at load but beat the 3870 on performance/W by a long way.

Can't wait to see it. Even though I know that this site rakes in more from intel/nvidia.

QueueMan 26/06/2008 08:26
Hide
-0+

http://www.pcper.com/images/review [...] 48-bar.jpg

Dual 4870s can take on a 280 no sweat and push for serious performance, and they're supposedly near similar price. By itself the 4870 takes on the $100USD more; 260. I smell price drops.

JohnVulcan 28/06/2008 08:55
Hide
-0+

Thanks to Tom’s Hardware for including the FSX test in these results. It’s interesting to see how differently the cards line up with FSX. No other game produces similar results or shows up the new technology in a different light. To any serious simmer it is the AA/AF performance on big screens that is crucial and this shows the biggest differences and biggest insights. Also it was good to see that my card (an overclocked 8800 GTX) is still top of the pile!

Anonymous 29/06/2008 13:49
Hide
-0+

lets just hope that this brings in enough profits to help the CPU market, because if you look at the info on the intel nahalem, it looks as though they have alot more potential, but they dont want to release that potential just yet, where as, if AMD release a competitor then we might just see the nahalems full potential!!

Best offers

Newsletters


OK