Re: Command Queue Depth (was asymmetric/Symmetric)

To: Matt Wakeley <matt_wakeley@agilent.com>
Subject: Re: Command Queue Depth (was asymmetric/Symmetric)
From: Nafea Bishara <nafea@galileo.co.il>
Date: Tue, 12 Sep 2000 01:17:58 +0100
CC: ips <ips@ece.cmu.edu>
Content-Transfer-Encoding: 7bit
Content-Type: text/plain; charset=us-ascii
Organization: Galileo Technology
References: <B7E2A2967AF7D211995B00805FA7E4DF01FE0844@milcmsgc.qntm.com> <39B8781F.CAA9B067@agilent.com>
Sender: owner-ips@ece.cmu.edu


    Hi Matt

    regarding Ethernet Switches, i can tell you from my experience in architecting
wide range of those: For a Gigabit switch, the latency of a 2KByte packet is 16us
in today arch and going below 10us in next generation.

    note the following:

- the numbers given there is from last-bit in to first-bit out, so, it is different
than the "normal" latency which is first-bit in to first-bit out

- whether the switch is acting on Layer-2 or Layer-3 does not matter since all
switching decision done on the fly

- Ethernet switch latency depends on packet length due to store and forward
behaviour.

- In Our devices,  priority is given based on the TCP/UDP connection, so iSCSI
connection always get higher priority than other data ( FTP, HTTP ), and thus,
experience the lowest latency possible ( 16us for 2KByte )

    -Nafea

Matt Wakeley wrote:

> Jim McGrath wrote:
>
> > I agree that BB credit has nothing to do with commands per se, but
> > illustrates the problem we have had with deciding on policies for the
> > distributed allocation of device resources over multiple initiators.  In my
> > postscript I noted that the same problems have arisen on command queue (how
> > many queue slots do you get), with similarly no satisfactory solution.
>
> So it sounds to me like this distribution of (command) resources across
> initiators is a T10 SCSI issue, and should be solved there, not by each
> individual transport (yesterday FC - which didn't solve it, today iSCSI,
> tomorrow IB or whatever).
>
> You've lost me on the following two paragraphs... Ethernet doesn't require
> credits to send frames - it's a ship and pray model.  I don't know what the
> latency through ethernet switches is, but I'd hope it wasn't in the ms range.
> Finally, a "usefull" FC-AL will have many devices on it (say 20 drives in a
> JBOD) and the latency of the elastic store really starts adding up.  So I still
> contend that FC-AL is not a low latency medium.
>
> -Matt
>
> > On FC-AL, the latency to get an initial credit is typically measured in us
> > for a couple of reasons.  First, much of that logic has been automated in
> > the interface hardware (indeed, the major source of delay is typically the
> > elasticity buffer, which is not store and forward and so has a very low
> > latency compared to many switches or routers).  Second, the distances are
> > very small (e.g. hundreds of meters), so both transmission time and the
> > opportunity for intervening devices to increase latency is lower than in the
> > general internet world.
> >
> > So generally 2 credits (of 2Kbyte frames) is enough to cover the latency and
> > get you into streaming.  If the latency of the system was measured in ms,
> > then a transport on a Gibt wire would require more like 50 initial credits
> > (or more) to cover the latency.  Unless we are designing for a low latency
> > environment for the exchange of credits (like those where FC-AL are used),
> > then we probably need to allocate so much buffer space that it becomes
> > difficult to promise initial credits to a lot of potential initiators.
> >
> > Jim
> >
> > PS general Fibre Channel (e.g. with switches and the like) is a bit
> > different.
> >
> > -----Original Message-----
> > From: Matt Wakeley [mailto:matt_wakeley@agilent.com]
> > Sent: Wednesday, September 06, 2000 8:23 PM
> > To: ips
> > Subject: Re: Command Queue Depth (was asymmetric/Symmetric)
> >
> > Jim,
> >
> > I agree that FC has tried (unsuccessfully) to address this command queue
> > allocation problem.
> >
> > However, the "login bb credit" mechanism in FC does not address the command
> > queue depth issue at all.  BB credit is used to receive commands and/or
> > data,
> > and the target has no clue in advance what is coming.  BB credit is just
> > there
> > to ensure that there is a lowest layer buffer available to receive the FC
> > frame
> > (as opposed to dropping it on the floor if there is no "mac" buffer, like
> > ethernet does).  It does not mean the command queue has any room for the
> > frame.
> >
> > At one time, there was a big push to have "data" credits and "command"
> > credits
> > to take care of this problem, but it couldn't be made to work and be
> > "backwards
> > compatible".
> >
> > > The issue of buffer space allocation for multiple initiators has a long
> > and
> > > troubled history in SCSI.  We have never been able to come up with a good
> > > answer.
> > >
> > > Fibre Channel tried to fix this with the notion of "login BB credit" -
> > when
> > > you login you get a minimum number of credits you are always guaranteed
> > when
> > > you start data transfers.  The problem with this is that storage devices
> > had
> > > no realistic ability to discriminate between initiators or to change the
> > > login BB credit.  In addition, the expectation is that all possible
> > > initiators would get these credits on login.  So storage devices vendors
> > > have played it safe and kept this number low (at 0 until recently, now
> > > around 2).  For iSCSI the number of initial credits you need to "prime the
> > > pump" until normal data flow is established is probably large (given the
> > > latencies are higher than in Fibre Channel, especially FC-AL), and the
> >
> > How is the latency low on FC-AL?? given that you need to arbitrate and win
> > the
> > loop, then receive bb credit, before you can send anything?
> >
> > -Matt
> >
> > >
> > > number of potential initiators larger than in Fibre Channel, making this a
> > > whole lot worse for the storage device.
> > >
> > > As soon as we allow the devices to start adjusting these credits, then you
> > > have the protocol problem of making sure people know when their credits
> > are
> > > adjusted and the policy problem of how, who, and when to adjust the
> > credits.
> > > Changing everyone's credit when you add a new initiator can get into a
> > > notification nightmare, although it is "fair."  Any policy brings up all
> > > sorts of nasty issues regarding fairness vs efficient use of the
> > > transmission media.
> > >
> > > Jim
> > >
> > > Note: the same problem has plagued other attempts to allocate device
> > > resources between multiple initiators, like command queue space.  In
> > general
> > > policies with respect to multiple initiators are not really standard in
> > the
> > > SCSI world.
> > >
> > > -----Original Message-----
> > > From: Matt Wakeley [mailto:matt_wakeley@agilent.com]
> > > Sent: Wednesday, September 06, 2000 2:56 PM
> > > To: ips
> > > Subject: Re: Command Queue Depth (was asymmetric/Symmetric)
> > >
> > > Joshua Tseng wrote:
> > >
> > > > James,
> > > >
> > > > I agree with others that there may be an issue with the
> > > > command windowing mechanism in the existing iSCSI spec.  It is like
> > > > "TCP in reverse", in that the target determines the size of the window,
> > > and
> > > > not the initiator as in TCP.  Rather, I believe that everything that
> > this
> > > > windowing mechanism is attempting to achieve can be more easily obtained
> > > > by having the target communicate its buffer size to the initiator at
> > > > iSCSI login.  It should be the role of the initiator to determine how
> > > > many commands to put in flight simultaneously, given this input on
> > > available
> > > > buffer size from the target.
> > >
> > > As more initiators connect to a target, it may need to scale back the
> > amount
> > > of
> > > this buffering it has allocated to each previously logged in initiator (to
> > > prevent rejecting new logins).
> > >
> > > >
> > > >
> > > > As far as multiple initiators, could this not be resolved by the target
> > > > refusing additional logins beyond the number of initiators it can safely
> > > > support?  Not being a storage expert, this is my best guess/suggestion
> > > > at how to do it.
> > >
> > > I believe John already answered this...
> > >
> > > -Matt

--

             Nafea
                \\|//
                (o o)
~~~~~~~~~~~~oOOo~(_)~oOOo~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

     _/_/_/_/
   _/_/_/_/_/_/
 _/_/_/_/_/_/_/_/
 _/_/_/_/_/_/_/_/
_/_/_/_/            Nafea Bishara
_/_/_/              Director, Product definition
_/_/_/    _/_/_/_/  Galileo Technology Ltd.
_/_/_/      _/_/_/  Email     -  nafea@galileo.co.il
 _/_/_/     _/_/_/  Snail Mail-  D.N. Misgav 20184, Moshav Manof, ISRAEL.
 _/_/_/_/_/_/_/_/   Tel       -  Manof + 972 4 9999555 ext. 0
   _/_/_/_/_/_/                  Haifa + 972 4 8225046 ext. 417
     _/_/_/_/                    Mobile+ 972 54 995417
                    FAX       -  + 972 4 8326420

 Check our Web site: http://www.galileoT.com
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

References:
- RE: Command Queue Depth (was asymmetric/Symmetric)
  - From: Jim McGrath <Jim.McGrath@quantum.com>
- Re: Command Queue Depth (was asymmetric/Symmetric)
  - From: "Matt Wakeley" <matt_wakeley@agilent.com>

Prev by Date: RE: Data in SCSI Response or SCSI Data
Next by Date: Re: Symmetric vs Asymmetric
Prev by thread: Re: Command Queue Depth (was asymmetric/Symmetric)
Next by thread: Re: Command Queue Depth (was asymmetric/Symmetric)
Index(es):
- Date
- Thread

Home

Last updated: Tue Sep 04 01:07:22 2001
6315 messages in chronological order