Altered code blocks

Ad

There’s been no change made to the executions units. So we find the same 9 units that appeared in the K7 and then in the K8: 3 ALU, 3 AGU (Address Generation Unit) and the 3 units of the FPU: FADD, FMULT and FSTORE.

AMD K10 architecture

The 3 units are symmetrical, apart from 3 operations: multiplication is systematically carried out in the ALU0 and the two new instructions introduced with the Barcelona, LZCNT and POPCNT are carried out by a dedicated unit on the same port as the ALU2.

The main modification to the calculating units is on division, latency is now a variable function depending on the size of the data. On the K8 the latency was systematic, 42 cycles in 32 bits and 74 cycles in 64 bits, with Barcelona, this latency is composed of a fixed 23 cycles and another variable (depending on the number of significant bytes in the absolute value of the dividend).

The calculating units have also undergone alteration. The big new feature isn’t really all that new, Intel already uses it in its Core 2 architecture: the units are now capable of operating on 128 bytes data instead of 64 (80 to be precise) bytes.

Until now, an instruction (either of two floating in double precision or in four simple precisions) was decoded in two packets, each packets has to pass through the 64 bit. With SSE128 these instructions are decoded in one micro-op and only take one pass in the 128 bit additioner.

Apart from the peak performance, which is multiplied by two (AMD announced a practical gain of 85% on calculations), the fact that less packets are generated is good news overall for execution in disorder. It neatly avoids saturation of buffers that allow instructions to be reordered.

Quad-Core AMD K10 Barcelona Opteron Phenom

AMD has also taken the opportunity to move charging operations from the FSTORE unit. From now on the K10 can carry out two 128-bit charges instead of two 64-bit charges and on top of this the FSTORE unit is free to carry out another operation (essentially MOVs).


Ad
Talkback
JeanLuc 18/09/2007 03:00
Hide
-0+
JeanLuc

Why is the wording under the pictures in French?

MrRimmer 19/09/2007 11:31
Hide
-0+
MrRimmer

It looks like the editor either hasn't been doing his/her job properly or is not a fluent English speaker. There are at least half a dozen spelling errors in this article, and the grammar is somewhat less than perfect!
Apart from that, an interesting read.

Fragula 19/09/2007 12:00
Hide
-0+
Fragula

Re: "AMD K10: The Architecture of the Revival?"

Article compares apples and oranges. :-(

i.e. It would be fair to compare the memory architecture of Coppermine vs. Thunderbird, as an example of where AMD /romped/ ahead.

Go back to Tomshardwares own archives and compare those memory architectures.

Or as another example, compare Katmai with the original Slot-A Athlon K75.

Where's the definitive great chart of all (x86) CPUs gone? Where are the archives?? What happened to the once-great tomshardware.com????

Cheers!

Fragz.

Comments are closed on this page.
Google Ads
Ad