Re: iSCSI draft 02: digests

To: ips@ece.cmu.edu
Subject: Re: iSCSI draft 02: digests
From: julian_satran@il.ibm.com
Date: Tue, 5 Dec 2000 20:02:58 +0200
Content-Disposition: inline
Content-type: text/plain; charset=us-ascii
Sender: owner-ips@ece.cmu.edu

Randall,

The one point you are not considering is the weakness of the TCP checksum
and bad equipment on the way.  I personally experienced good FTPs ending at
my machine in bad files.
Resending on an error undetected by TCP solves this kind of problem.

The bad think is that you can't relay on the iSCSI header when a header
digest is bad so you can't explicitly ask for retransmit. And if it is rare
enough dropping connection and reconnecting won't harm. I am still
considering what to do about data.

Julo

"Randall R. Stewart" <randall@stewart.chicago.il.us> on 05/12/2000 18:17:39

Please respond to "Randall R. Stewart" <randall@stewart.chicago.il.us>

To:   Mark Bakke <mbakke@cisco.com>
cc:   Julian Satran/Haifa/IBM@IBMIL, ips@ece.cmu.edu
Subject:  Re: iSCSI draft 02: digests

Mark/Julian:

Mark Bakke wrote:
>
> Yes, the connection should not be recovered, but the iSCSI session
> can be.

I have thought on this for a bit, and it seems to me one must
have a look at what went wrong if a digest fails and yet TCP
still delivered the packet. Was it TCP's fault? Well in some
ways one could answer yes... since what happened is a sequence
of bit errors was somewhere introduced to the IP packets that
caused:

A) TCP's (and IP's) checksum to still pass
and
B) The stronger digest protection detected the error.

Now then, the question is how can we fix the problem?
I think that you must get a new copy of the bad segment
to the receiver... Here in lies the question, what will
make it so we can do so:

Will restarting the TCP connection (or keeping the same
one for that matter), make any difference? I don't think
so... it was something in the network that caused this
error to occur, if it is a fluke then retransmitting the
packet on the same or a new connection will not make
a bit of difference ... since it is the network that
corrupted the packet. If it is not a fluke random chance, then
you have a more serious network (or TCP stack) problem and
all the restarts in the world are not going to make the packet
go through since the network or stack will just keep re-corrupting
the packet...

Bottom line is I am not convinced reseting the connection will
gain you anything.. I don't think it will hurt .. but I don't
see you gaining anything...

R

>
> --
> Mark
>
> julian_satran@il.ibm.com wrote:
> >
> > Mark,
> >
> > I also gave it some more thought.
> >
> > Since a digest failure is a transport failure that went undetected by
TCP
> > dropping and restarting a connection won't do us to much good - if we
use
> > the same link we may end up having some more.
> >
> > We should treat them as iSCSI failures and have iSCSI restart the
command
> > without restarting the connection.
> >
> > Regards,
> > Julo
> >
> > Mark Bakke <mbakke@cisco.com> on 05/12/2000 00:18:28
> >
> > Please respond to Mark Bakke <mbakke@cisco.com>
> >
> > To:   Julian Satran/Haifa/IBM@IBMIL
> > cc:   ips@ece.cmu.edu
> > Subject:  Re: iSCSI draft 02: digests
> >
> > Julian-
> >
> > Here's what we had in mind for recovering from digest/CRC failures:
> >
> > 1. If the digest failure is on a command, status, or iSCSI header,
> >    this means that a length field could be corrupted.  This should
> >    not happen often, but it may be possible to re-send the command
> >    if both the initiator and target can do session recovery as in
> >    the iSCSI spec.  In any case, the connection should be terminated,
> >    and a new one built in its place.  If session recovery is supported
> >    and is successful, the missing iSCSI PDU(s) during and after the
> >    digest failure are re-send, re-responded, and no harm done.  If
> >    session recovery fails, the upper SCSI layer must receive the
> >    failure, and do whatever recovery is necessary.  In any case, the
> >    old connection should not be used after the failure.
> >
> > 2. If the digest failure is on a SCSI data block, iSCSI length fields
> >    are not affected, so there may be a possible way to resend the
> >    data.  However, doing this is probably not worthwhile, so I think
> >    that in the data digest case, the same recovery as in (1) should
> >    be used.
> >
> > --
> > Mark
> >
> > julian_satran@il.ibm.com wrote:
> > >
> > > Like on a data failure on any bus. Raise a check condition and end
the
> > > command with an error but let it go up to
> > > the normal end.  I will spec it.
> > >
> > > Thanks,
> > > Julo
> > >
> > > Matt Wakeley <matt_wakeley@agilent.com> on 29/11/2000 01:39:22
> > >
> > > Please respond to Matt Wakeley <matt_wakeley@agilent.com>
> > >
> > > To:   ips@ece.cmu.edu
> > > cc:
> > > Subject:  iSCSI draft 02: digests
> > >
> > > In appendix A is a (brief) description of the iSCSI header and data
> > > digests.
> > >
> > > What is the expected behavior if there is a digest failure?  Just
throw
> > the
> > > PDU away?
> > >
> > > -Matt
> >
> > --
> > Mark A. Bakke
> > Cisco Systems
> > mbakke@cisco.com
> > 763.398.1054
>
> --
> Mark A. Bakke
> Cisco Systems
> mbakke@cisco.com
> 763.398.1054

--
Randall R. Stewart
randall@stewart.chicago.il.us or rrs@cisco.com
815-342-5222 (cell) 815-477-2127 (work)

Prev by Date: A question on Zero Copy
Next by Date: Re: iSCSI draft 02: digests
Prev by thread: Re: iSCSI draft 02: digests
Next by thread: Re: iSCSI draft 02: digests
Index(es):
- Date
- Thread

Home

Last updated: Tue Sep 04 01:06:11 2001
6315 messages in chronological order