The AMD A8-3500M APU Review: Llano Is Unleashed

The West-Side CPU

The CPU portion of Llano’s die is based on the Stars architecture we know from the original Phenom, living on in today’s Athlon II and Phenom II processors. There are a few notable differences and improvements, though. The most obvious is AMD’s transition from 45 nm manufacturing to a 32 nm node. So, right out of the gate, we expect certain power- and thermal-oriented advantages compared to the same piece of logic manufactured at 45 nm. We don’t have that here, of course. Adding the Fusion-oriented components just about doubles Llano’s transistor count compared to Deneb, adding up to 1.45 billion. 

Bear in mind that Sandy Bridge-based CPUs already employ a mature 32 nm process, and Intel is on the verge of making a shift to 22 nm with the Ivy Bridge die shrink later this year. AMD knows that Intel has a manufacturing technology advantage and is looking to compensate in other ways.

AMD claims that Llano’s processing cores demonstrate an average 6% improvement in successfully executed instructions per clock (IPC) versus previous Stars-based offerings. In some cases there will be zero improvement and in others the new core might show as much as a 15% increase. There are two main reasons for this on-average performance improvement: more L2 cache and a better hardware prefetcher.

As you likely noticed, the 6 MB shared L3 CPU cache is gone entirely, a loss AMD hopes will be mitigated by a doubling of the exclusive L2 cache from 2 MB to 4 MB, totalling 1 MB of dedicated cache per core. This might sound like a dire compromise at first, but keep in mind that although a large L3 on the back of the northbridge allows sharing of another high-speed repository between cores, it also increases latency and presents a problem for power management. Increasing the L2 might sacrifice a little overall scalability, but it helps augment multithreading operations and allows more granular control of the chip’s power management. Since this was one of the main goals of Llano’s design team, the change made sense. And although the L2 and L3 cache configurations are changed, each core has the same 64 KB L1 instruction and data cache (128 KB total L1) as the 45 nm Phenom II and Athlon II families.

AMD also put a lot of work into its hardware prefetcher. Traditionally, a hardware stride prefetcher looks at specific memory instructions over time, and if the address loaded differs by a constant stride, the prefetcher may decide that it’s beneficial to put that instruction into the cache. The problem with this approach is that many programs can make it difficult to extrapolate patterns because of spurious instructions that distract the algorithm. AMD enhanced the prefetcher intelligence with Instruction Pointer (IP)-based prefetching. IP understands the instructions accessing the memory and uses that intelligence to find patterns, improving its ability to find strides.

Aside from this, the buffer sizes are larger. The reorder buffer is about 20% larger and the load/store buffers are doubled in size. We’re also told that the hardware multiplier is better, but AMD won’t disclose exactly what it improved.

When all is said and done, the 6% average IPC increase over the Phenom II represents a modest gain, if it’s quantifiable at all in testing. It seems that improving CPU performance was not a major target for the design team. And again, this makes well-enough sense to us with the first Bulldozer-based CPU (Zambezi) in the wings.

Power management takes precedence, which is smart since Llano needs to share its thermal ceiling between the CPU and GPU. While the aging Stars architecture might be sufficient to show what the GPU side can do today, this APU’s successor replaces Stars with Bulldozer-derived silicon. That part may be what manages to challenge Intel on the processor side. AMD’s Rick Bergman showed off a Trinity-based APU at this year’s Computex and said the first silicon was showing up at the lab. Hopefully the company isn’t late with that part.

Create a new thread in the UK Article comments forum about this subject
This thread is closed for comments
6 comments
Comment from the forums
    Your comment
  • Jay_83
    Hmm. Mixed feelings about this one. One the one hand, it's less impressive than I wished for. On the other, it's obvious AMD has done some good work here, in more than one direction.
    Ultimately, We'll have to wait and see what Trinity has to offer to give the final word. Here's hoping Bulldozer throws a serious punch. I'm sure we'd all like to see AMD back in the game, and not just in notebook space.
  • aje21
    If you want us to look at this as the end of entry level discrete GPUs for light gaming, how about comparing it to the GPU running at its real speed? Once you have a real price for an APU+M/B you can put it up against a similar spec/price CPU+GPU+M/B to see how they compare.
    If the transcoding performance was worth having (and supported by Windows 7 in Media Center) then this could be an HTPC route...
  • daglesj
    Chaps, you are wasting your time commenting on the UK version of the article. All the comments are over on the US version.

    Why they cant pull the 'shared' article comments together I dont know.
  • Anonymous
    Seems like the Turbo CORE was simply turned off... Perhaps it needs more than enabling it in the BIOS. Like in case of Cool'n'Quiet, that you has had to enable in Windows, as well?
  • silverblue
    ajaraSeems like the Turbo CORE was simply turned off... Perhaps it needs more than enabling it in the BIOS. Like in case of Cool'n'Quiet, that you has had to enable in Windows, as well?

    Yeah, by selecting Minimal Power Management in XP or Balanced in Vista/7.
  • wild9
    daglesjChaps, you are wasting your time commenting on the UK version of the article. All the comments are over on the US version. Why they cant pull the 'shared' article comments together I dont know.


    I've often thought the same..I wish all the UK and US comments could be combined. I discovered the duality by accident, and also noticed that the UK version of the article has 4 comments, whereas the US had 4 pages. I am grateful for all this tech news..I just wish the comments section could be combined. Also, if they could send the spammers to Mars..that would be a nice bonus.