|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] RE: iSCSI: DAP Retry commentsRegarding Retry, it's not about the command executing twice. Below is a rehash from previous emails of the issues with Retry: ***** Example scenario: 1. tape locate command is issued with a 10 second timer 2. tape command is dropped at the target due to a header digest error 3. having seen no response for the command after an iSCSI initiator determined timeout value, the initiator decides to retry the command This example leaves a 2 second window for the response to the second command to arrive before a ULP abort is sent. A mechanism to determine whether or not the command arrived at the target would be beneficial for the retry functionality to be useful. The mechanism should be initiated early enough in the ULP timeout window to allow the iSCSI retried command the opportunity to complete. Some options: A. If the initiator issues a NOP-Out(immed=1), the target can send back the expected CmdSN. B. The target, upon a header digest error, sends back a reject or NOP-In with the expected CmdSN. This would provide an indication to the initiator that something happened and trigger the command retry, still hopefully within the ULP timeout to allow for sucessful command completion. C. Texting stating that an iSCSI initiator should (only) perform a command retry when sufficient time remains for the command to completed. But, this leads one down the path of device type specific behavior. D. Remove the command retry functionality. I have yet to see it actually being used. The use of command retry is really only applicable when header digests are being used. Given TCP, probability of header digest usage and occurance, and existing ULP tools for error detection and recovery, my preference is to remove it. **** My point is the following need to occur for the Retry functionality to work: 1. header digest must be enabled 2. a header digest must be detected 3. the retried command must be issued early enough in the ULP timeout window to allow completion I don't see the Retry functionality being useful for disk devices and questionable for tape. Using a well-engineered (TCP/IP) network along with hopefully available SCSI level tools is a better solution. But I don't want to hold up the spec over this matter either, we just won't use it. Regarding SNACK, I don't really have a problem with it other than again a well-engineered network should mitigate the need. Regarding connection allegiance reassignment, the functionality is a bit more useful (if one supports more than 1 connection per session), provided the reassignment completes in time to allow the command to complete within the ULP timeout. Regarding legacy tape devices, I expect them to be front ended by a gateway. These devices will most likely not support any type of transport level error detection and recovery making them incompatible/problem childs with respect to the various iSCSI error recovery mechanisms. Bottom line is engineer your network well and leave the error detection and recovery to the SCSI level and above...dap > -----Original Message----- > From: Black_David@emc.com [mailto:Black_David@emc.com] > Sent: Tuesday, July 09, 2002 1:51 AM > To: dap@cisco.com; ips@ece.cmu.edu > Subject: iSCSI: DAP Retry comments > > > > T p 103 6.1.1 Usage of Retry and 6.7 SCSI Timeouts: the semantics of > Retry > > remain broken rendering it useless for tape operation. SCSI level error > > detection and recovery is the preferred mechanism. Refer to previous > emails > > sent via the IPS reflector regarding this matter. > > Can you provide more information? Command retry *never* results in > the command executing twice - both the original command and the retry > have the same CmdSN, so the second one is dropped as a duplicate if > the first one was received correctly. 6.1.1 is very clear that retry > MUST NOT be used if the command was received successfully (acknowledged > by ExpCmdSN), and if it is used, the retried command PDU is silently > dropped. > iSCSI's ordered delivery requirement avoids the situation in which a > dropped command causes subsequent commands to mis-execute - if none > of the commands are marked for immediate delivery, iSCSI will stop > at the "hole" created by the dropped command, and wait for the retry > to plug the hole. > > > T p 128 8.6 Considerations for State-dependent devices: last > paragraph: > > don't agree with the statement that error recovery at the iSCSI level > > (specifically Retry in its current state) is advisable. Retry > at the SCSI > > level is feasible and is not difficult (i.e., READ POSITION and LOCATE > > commands). This paragraph should be removed. > > Two questions: > - What about the SNACK and allegiance change mechanisms? > - What about the "legacy" tape devices (e.g., as discussed in London) > that presumably don't implement those commands? I believe this > text was originally intended to address this class of devices. > > Thanks, > --David > --------------------------------------------------- > David L. Black, Senior Technologist > EMC Corporation, 42 South St., Hopkinton, MA 01748 > +1 (508) 249-6449 FAX: +1 (508) 497-8018 > black_david@emc.com Mobile: +1 (978) 394-7754 > ---------------------------------------------------
Home Last updated: Tue Jul 09 20:18:44 2002 11219 messages in chronological order |