Download the Tom's Hardware App from the App Store
The reference for current tech news
Yes No

Local and Global Data Share

by

Radeon HD 4850

With the RV770, the AMD engineers didn’t stop at optimizing their architecture to only slightly increase the die real-estate— they also borrowed a few good ideas from the competition. The G80 had introduced a small, 16-KB memory area per multiprocessor that’s entirely under the programmer’s control, unlike a cache. This memory area, accessible in CUDA applications, can share data among threads. AMD has introduced its version of this with the RV770. It’s called Local Data Share and is exactly the same size as its competitor’s Shared Memory. It also plays a similar role by enabling GPGPU applications to share data among several threads. The RV770 goes even further, with another memory area (also 16 KB) called Global Data Share to enable communication among SIMD arrays.

Texture units

While the ALUs haven’t undergone a major modification, the texture units have been completely redesigned. The goal was obvious – as with the rest of the GPU, it was to increase performance significantly while maintaining as small a die area as possible. The engineers set fairly ambitious goals, aiming for an increase of 70% in performance for an equivalent die area. To do that, they focused their efforts largely on the texture cache. The bandwidth of the L1 texture cache was increased to 480 GB/s.

Radeon HD 4850

But that’s not all; the L1 cache that was shared by all the SIMD arrays has been broken down into 10 cache memories, one per SIMD array, and each contains only data exclusive to the corresponding SIMD array. Shared data are now stored in an L2 cache, which has also been completely redesigned, now having a bandwidth 384 GB/s to the L1 cache. In order to reduce latency, this L2 cache has been positioned near the memory controllers. Let’s see what the results of these improvements are in practice:

Radeon-HD4850 Geforce-9800-GTX+ GTX-260 GTX-280 Grafikkarten

Compared to its direct competitor, the 9800 GTX, the Radeon HD 4850 showed first-rate performance with single and dual texturing, while not giving up any performance in terms of raw fill rate – which is to be expected considering the 40 texture units for 16 ROPs (to simplify, “2.5 texture units per pixel,” as they used to say in another era). On the other hand, with triple and quad texturing, the RV770, logically enough, can’t compete with the G92’s 64 texture units (the equivalent of “4 texture units per pixel”); but in all cases the RV770 proved to be closer to its theoretical performance than its competitor.

Share:
6
Comments
Read more
X
Submit

Comments
Read the comments on the forums
TechSupport 25/06/2008 13:43
Hide
-0+

Bottom line as i see it, this is a Smashing card!
Prices on the UK market put a HD4850 at £120 (ebuyer.com) where as the GTX260 is £300! considering the overal performance difference between the two, i'd get 2 HD4850's in Crossfire mode and still be cheaper than a GTX260.

Once again AMD/ATI have produced a card that isnt amazing on performance, but bang for your buck its probably the best card on the market at those prices

david__t 25/06/2008 13:54
Hide
-0+

The very fact that, yet again, we have a whole page dedicated to noise is a sure sign that these manufacturers still don't have all of the priorities covered. How many years ago was it that THG posted that funny video showing an Nvidia card as a leaf blower & barbecue? - and still we have to put up with substandard cooling solutions. I haven't bought a graphics card for years now without getting an aftermarket heatsink to go with it - thank god for companies like Zalman & Thermalright who are sparing our ears!!

americanbrian 25/06/2008 19:37
Hide
-0+

I have seen on the reg that the 4870 more than doubles the performance of a 3870. If this is true and they hit a ~$300 price tag than AMD/ATI are back baby.

Apparently they use 160W at load but beat the 3870 on performance/W by a long way.

Can't wait to see it. Even though I know that this site rakes in more from intel/nvidia.

QueueMan 26/06/2008 08:26
Hide
-0+

http://www.pcper.com/images/review [...] 48-bar.jpg

Dual 4870s can take on a 280 no sweat and push for serious performance, and they're supposedly near similar price. By itself the 4870 takes on the $100USD more; 260. I smell price drops.

JohnVulcan 28/06/2008 08:55
Hide
-0+

Thanks to Tom’s Hardware for including the FSX test in these results. It’s interesting to see how differently the cards line up with FSX. No other game produces similar results or shows up the new technology in a different light. To any serious simmer it is the AA/AF performance on big screens that is crucial and this shows the biggest differences and biggest insights. Also it was good to see that my card (an overclocked 8800 GTX) is still top of the pile!

Anonymous 29/06/2008 13:49
Hide
-0+

lets just hope that this brings in enough profits to help the CPU market, because if you look at the info on the intel nahalem, it looks as though they have alot more potential, but they dont want to release that potential just yet, where as, if AMD release a competitor then we might just see the nahalems full potential!!

Best offers

Newsletters


OK