|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] Re: iSCSI ERT: data SACK/replay buffer/"semi-transport"
Mallikarjun,
You are right. Too much travel and jet lag. The reason we made SACK and
status recovery
practically a MUST is that without them we are bound to have only session
drop as an alternative.
If the target does not keep any information after it has sent out status it
can't even retry a command. And if it can retry a command it should be
able to do SACK.
But perhaps there is a place in the market for the kind of devices Somesh
is suggesting that do all recovery at SCSI level (and that can't copy a
terabyte of data without a session drop).
If that is true (which I doubt) we can make SNACK support optional.
Julo
"Mallikarjun C." <cbm@rose.hp.com> on 05/04/2001 04:09:46
Please respond to cbm@rose.hp.com
To: ips@ece.cmu.edu
cc:
Subject: Re: iSCSI ERT: data SACK/replay buffer/"semi-transport"
>
>Santosh,
>
>I can't find the place where this is stated. SNACK as a PDU type is
>mandated. But it can be rejected outright.
Sorry, you agreed that status SACK is mandatory in ERT forum last
week in response to my comments. Has there been a change in your opinion?
Attached is the email in a long email thread (issue 3) where you agreed
to make this explicit in rev06.
--
Mallikarjun
Mallikarjun Chadalapaka
Networked Storage Architecture
Network Storage Solutions Organization
MS 5668 Hewlett-Packard, Roseville.
cbm@rose.hp.com
>1.2.2.2 show explicitely that SACK can be rejected. We can add a protocol
>specific parameter in the target Logical Unit Control Page (non-setable)
by
>which the target will indicate support for SNACK.
>
>Julo
Santosh Rao <santoshr@cup.hp.com> on 04/04/2001 23:53:32
Please respond to Santosh Rao <santoshr@cup.hp.com>
To: Julian Satran/Haifa/IBM@IBMIL
cc: Jon Hall <jhall@emc.com>, ips@ece.cmu.edu
Subject: Re: iSCSI ERT: data SACK/replay buffer/"semi-transport"
julian_satran@il.ibm.com wrote:
>
> Jon,
>
> Inexpensive implementation are always free to do away with recovery. That
> si true for targets too.
That's not the interpretation one gets from reading the spec and prior
discussions on this list. Per the spec, support for Status SACK is
mandatory while support for data SACK is optional.
IOW, targets MUST retains state information to satisfy a potential
status SACK request.
- Santosh
------------------------------------------------------------------------------
From julian_satran@il.ibm.com Tue Mar 27 05:16:54 PST 2001
Received: from mailhub.rose.hp.com (mailhub.rose.hp.com [15.96.64.24]) by
core.rose.hp.com with ESMTP (8.8.6 (PHNE_14041)/8.8.6 SMKit7.02) id
FAA26277 for <cbm@core.rose.hp.com>; Tue, 27 Mar 2001 05:16:52 -0800 (PST)
From: julian_satran@il.ibm.com
Received: from atlrel2.hp.com (atlrel2.hp.com [15.10.184.10]) by
mailhub.rose.hp.com with ESMTP (8.7.1/8.7.3 SMKit7.02) id FAA10600 for
<cbm@rose.hp.com>; Tue, 27 Mar 2001 05:15:51 -0800 (PST)
Received: from d06lmsgate.uk.ibm.COM (d06lmsgate.uk.ibm.com [195.212.29.1])
by atlrel2.hp.com (Postfix) with ESMTP id AFC11120
for <cbm@rose.hp.com>; Tue, 27 Mar 2001 08:15:49 -0500 (EST)
Received: from d12relay01.de.ibm.com (d12relay01.de.ibm.com [9.165.215.22])
by d06lmsgate.uk.ibm.COM (1.0.0) with ESMTP id NAA50078;
Tue, 27 Mar 2001 13:55:50 +0100
Received: from d12mta02.de.ibm.com (d12mta02_cs0 [9.165.222.253])
by d12relay01.de.ibm.com (8.8.8m3/NCO v4.95) with SMTP id PAA174358;
Tue, 27 Mar 2001 15:09:58 +0200
Received: by d12mta02.de.ibm.com(Lotus SMTP MTA v4.6.5 (863.2 5-20-1999))
id C1256A1C.00483A46 ; Tue, 27 Mar 2001 15:08:55 +0200
X-Lotus-FromDomain: IBMIL@IBMDE
To: cbm@rose.hp.com
Cc: someshg@yahoo.com, steph@cs.uchicago.edu, hufferd@us.ibm.com,
cbm@rose.hp.com, ldalleore@snapserver.com,
Venkat Rangan <venkat@rhapsodynetworks.com>, Black_David@emc.com
Message-ID: <C1256A1C.0048399C.00@d12mta02.de.ibm.com>
Date: Tue, 27 Mar 2001 15:12:01 +0200
Subject: Re: iSCSI ERT: error recovery comments
Mime-Version: 1.0
Content-type: text/plain; charset=us-ascii
Content-Disposition: inline
Status: RO
Comments in text. Thanks Julo
"Mallikarjun C." <cbm@rose.hp.com> on 27/03/2001 01:41:48
Please respond to cbm@rose.hp.com
To: cbm@rose.hp.com, someshg@yahoo.com, steph@cs.uchicago.edu, Julian
Satran/Haifa/IBM@IBMIL, John Hufferd/San Jose/IBM@IBMUS
cc: Black_David@emc.com
Subject: iSCSI ERT: error recovery comments
Hi Julian and the Team,
Here are some comments on error recovery issues. I hope these will
be addressed soon. Thanks.
1. The draft should clearly state that if a target doesn't support
retry (replay in my previous memo's terminology), it must not silently
accept a command with retry bit and re-do the I/O.
2. Consequent to the above -
- Clarification required on section 6.7.1, page 83, last para.
Please confirm and clarify in the draft: If the target sends
a response with an iSCSI error response of "SACK-rejected" that
implicitly terminates the task - no retries are allowed. If the
target sends a Reject PDU with "Data SACK Reject" code, the task
stays open and the initiator may try to recover using SACK/retry.
+++ I will clarify
it will read:
An iSCSI target MAY reject a data-SNACK and terminate the command with
an iSCSI command response of SNACK rejected. In this case, the task is
terminated and no future action is expected at target and initiator.
Alternatively, an iSCSI target MAY reject a data-SNACK with a reject
response of data SNACK rejected. In this case the task is still open and
may be recovered using the retry.
+++
- On a data digest error on a data PDU without the F-bit, the draft
states that the target must wait for a data PDU with the F-bit
(per section 6.2), then a command termination is signalled with
a Reject PDU! I like the formulation in 2.4.2 better. I
strongly
recommend that similarly, the target shall send a SCSI Response
with a iSCSI response of "delivery subsystem failure". In
general,
I suggest that anytime a target terminates a task internally, it
must generate a SCSI Response PDU with an appropriate response
code.
+++ It reads now:
When a target receives an iSCSI PDU with a header digest error or a
payload digest error in an iSCSI PDU, it MUST answer with a Reject iSCSI
PDU with a Reason-code of Header-Digest-error or Data-Digest-Error and
discard the offending PDU. If the error is a Data-Digest-Error in a
Data-PDU, the target MUST either request retransmission with a R2T or
answer with a command response PDU with a response-code of
delivery-subsystem-failure and abort the task. If the target is
answering with an error in the command response PDU it must wait for the
target to receive all the data (signaled by a Data PDU with the final
bit Set for all outstanding R2Ts) the command response PDU.
++++
3. While the following is implied in different sections, it is not
obvious. Please clarify the following in the draft - "Status SACK
support is mandatory, whereas data SACK support is not."
+++ will do in 2.16.1 +++
4. The general policy of retry should be that all ordered commands
shall support retry bit, since the loss of an ordered command
creates a hole in target scoreboarding and stalls the target
pipeline. Retry hopefully can plug the hole quickly to avoid this.
5. As a fallout of the above comment, Retry bit must be supported
for Text Commands.
+++ I have added the X-bit. The reason I did no earlier was that I could
not foresee
a case in which the command is not idempotent - I can allways be resent -
but I guess it is cleaner with the X +++
6. Section 2.20, page 71 on Reject must specify if a retry of the operation
is allowed for each Reject PDU reason code. Lack of specification could
lead to interoperability issues down the road with "retry wars" raging
between heterogeneous implementations (ex., target rejects the retry bit,
initiator retries the "retry" bit,....).
+++ the part now reads:
The reject Reason is coded as follows:
1 - Format Error
2 - Header Digest Error
3 - Data (payload) Digest Error
4 - Data-SNACK Reject
5 - Command Retry Reject
15 - Full Feature Phase Command before login
Some of the reject reasons terminate or prevent the creation of a
task at the target and no retry is possible in those cases. Format
error for a command, Command Retry Reject and Full Feature Phase
Command before login are in this category.
7. NOP-OUT does not require CmdSNs. Why make it an ordered command
and run the risk of a digest error on it leading to a hole in
command ordering?
+++ the reason I wanted it ordered is to check the whole command path - but
you may try to convince me that it is not a good idea +++
--
Mallikarjun
Mallikarjun Chadalapaka
Networked Storage Architecture
Network Storage Solutions Organization
MS 5668 Hewlett-Packard, Roseville.
cbm@rose.hp.com
Home Last updated: Tue Sep 04 01:05:10 2001 6315 messages in chronological order |