The Architecture in Detail
Like Nvidia, AMD has chosen to build on its previous architecture rather than starting from scratch. It’s very much the same as that of the R600, which had already been re-used for the RV670.
SIMD cores
The architecture initially introduced with the Xenos, which is the same GPU used in the Xbox 360, is based on a group of SIMD arrays. The Xenos had three SIMD arrays, and the R600 and RV670 have four. The RV770 goes much further with ten.
As you’ve deduced, each SIMD array contains 80 ALUs since the GPU has 800 ALUs. That’s true, but it’s a slightly simplified view of reality. In practice, the 80 ALUs are not independent of each other. They’re grouped together in five-way VLIW units – 16 units per SIMD array. This organization implies certain restrictions on the instructions executed; each of the five instructions of a VLIW bundle has to be independent from the others. It’s up to the compiler to find enough independent instructions to saturate the ALUs – unlike the G80, which uses a more "hardware" solution.
Here’s an example to illustrate what we just described:
- I1 FADD R1, R1, 3.14
- I2 FMUL R2, R1, 1.41
- I3 FMAD R3, R0, 0.5, 0.5
In this case, Instructions 1 and 3 can share the same bundle, but not Instruction 2, which depends on the result of Instruction 1. If the compiler can’t find enough operations in its window of instructions, it has to fill the bundle with NOP instructions that don’t do anything, thus reducing the chip’s performance. What all that adds up to in the present case is that Nvidia ALUs will hit their peak performance more often because they’re less dependent on the underlying code; but the down side is that they’re much more costly in terms of transistors. AMD’s units depend strongly on the compiler’s performance (the compiler that’s “internal” to the driver, which reorganizes the assembler instructions generated by the HLSL), but AMD can afford to include a much larger number on a die that’s still significantly smaller.
The VLIW units themselves haven’t been heavily reworked; there are four units capable of executing a FMAD or an integer addition and a special unit capable of executing either a FMAD or an integer multiplication, or a transcendental function (sine, cosine, log, exp, etc.) The only real improvement is to bit-shifting operations in integers, which can now be handled by any of the five units, whereas on the 2900/3800 only the special unit could perform these operations. Rather than make them more powerful, AMD has concentrated on optimizing them in order to reduce their size on the die to be able to fit more of them on the device.



Bottom line as i see it, this is a Smashing card!
Prices on the UK market put a HD4850 at £120 (ebuyer.com) where as the GTX260 is £300! considering the overal performance difference between the two, i'd get 2 HD4850's in Crossfire mode and still be cheaper than a GTX260.
Once again AMD/ATI have produced a card that isnt amazing on performance, but bang for your buck its probably the best card on the market at those prices
The very fact that, yet again, we have a whole page dedicated to noise is a sure sign that these manufacturers still don't have all of the priorities covered. How many years ago was it that THG posted that funny video showing an Nvidia card as a leaf blower & barbecue? - and still we have to put up with substandard cooling solutions. I haven't bought a graphics card for years now without getting an aftermarket heatsink to go with it - thank god for companies like Zalman & Thermalright who are sparing our ears!!
I have seen on the reg that the 4870 more than doubles the performance of a 3870. If this is true and they hit a ~$300 price tag than AMD/ATI are back baby.
Apparently they use 160W at load but beat the 3870 on performance/W by a long way.
Can't wait to see it. Even though I know that this site rakes in more from intel/nvidia.
http://www.pcper.com/images/review [...] 48-bar.jpg
Dual 4870s can take on a 280 no sweat and push for serious performance, and they're supposedly near similar price. By itself the 4870 takes on the $100USD more; 260. I smell price drops.
Thanks to Tom’s Hardware for including the FSX test in these results. It’s interesting to see how differently the cards line up with FSX. No other game produces similar results or shows up the new technology in a different light. To any serious simmer it is the AA/AF performance on big screens that is crucial and this shows the biggest differences and biggest insights. Also it was good to see that my card (an overclocked 8800 GTX) is still top of the pile!
lets just hope that this brings in enough profits to help the CPU market, because if you look at the info on the intel nahalem, it looks as though they have alot more potential, but they dont want to release that potential just yet, where as, if AMD release a competitor then we might just see the nahalems full potential!!