Re: Throughput of PXOR

by "Maarten Kronenburg" <spamtrap@[EMAIL PROTECTED] > Apr 14, 2008 at 02:00 PM

"James Van Buskirk" wrote in message:
> > In your loop the 4th xorps is dependent on the 1st xorps etc.
> > In throughput they should probably be all independent,
> > but I don't know how to do that.
> > Probably it's somewhere in Agner Fog's documents.
>
> Nope.  Both AF & Intel say latency 1 cycle, throughput 3 per cycle.
> Following that, we should in principle be able to issue three
> mutually independent xorps instructions in cycle 1, three mutually
> independent instructions in cycle 2 (but dependent on the operations
> in cycle 1 because their latency period has elapsed), 3 in cycle 3
> and so on.  If I could find out why not in the docs, I wouldn't
> be asking here.  In Intel 64 and IA-32 Architectures Optimization
> Reference Manual, section 3.5.2.1, ROB read port stalls are
> discussed, but they aren't relevant here because all operands
> will be in flight because they were modified within the last two
> instructions.
>
That they should be independent is in Agner Fog's optimizing_assembly.pdf.
But as mentioned, I don't know how to measure it; perhaps ask Agner
himself.
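
For illustration, the idealized reasoning in the quoted paragraph (latency 1 cycle, throughput 3 per cycle) can be sketched as a toy issue model in Python. The names `schedule` and `total_cycles` and the dependency-list encoding are made up for this sketch. The model deliberately ignores decode, register-read and execution-port limits, so it only shows what the documented numbers predict, not what real hardware does.

```python
from collections import defaultdict

def schedule(deps, latency=1, width=3):
    """Greedy scheduler for an idealized core (hypothetical model).

    deps[i] lists the earlier instructions whose results instruction i reads.
    Returns the cycle in which each instruction issues.
    """
    issued_per_cycle = defaultdict(int)
    issue = []
    for sources in deps:
        # Earliest cycle at which every source result is available.
        cycle = max((issue[s] + latency for s in sources), default=0)
        # Slide forward until a cycle with a free issue slot is found.
        while issued_per_cycle[cycle] >= width:
            cycle += 1
        issued_per_cycle[cycle] += 1
        issue.append(cycle)
    return issue

def total_cycles(deps, latency=1, width=3):
    """Cycles until the last result is available."""
    issue = schedule(deps, latency, width)
    return max(issue) + latency if issue else 0

n = 12  # number of xorps instructions in the modelled sequence

# Every instruction reads the result of the one before it.
one_chain = [[i - 1] if i >= 1 else [] for i in range(n)]
# The 4th depends on the 1st, the 5th on the 2nd, and so on.
three_chains = [[i - 3] if i >= 3 else [] for i in range(n)]
# No instruction reads another's result.
independent = [[] for _ in range(n)]

for name, deps in [("one chain", one_chain),
                   ("three chains", three_chains),
                   ("independent", independent)]:
    print(name, total_cycles(deps), "cycles")
```

In this model the loop where the 4th xorps depends on the 1st still reaches 3 per cycle, the same as fully independent instructions; only a single serial chain drops to 1 per cycle. Any shortfall measured on real hardware would therefore have to come from something the model leaves out.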
 




4 posts in topic:
Throughput of PXOR - "James Van Buskirk", 2008-04-12 20:53:39
Re: Throughput of PXOR - "Maarten Kronenburg", 2008-04-13 23:42:51
Re: Throughput of PXOR - "James Van Buskirk", 2008-04-14 02:00:55
Re: Throughput of PXOR - "Maarten Kronenburg", 2008-04-14 14:00:24