Updated: 24-SEP-2003 (Use your browsers' Reload button to ensure you're viewing the most recent version)
VAXF11X04_072 VAX V7.2 F11AACP/F11BXQP/F11CACP/F11DACP ECO Summary
*OpenVMS] VAXF11X04_072 VAX V7.2 F11AACP/F11BXQP/F11CACP/F11DACP ECO Summary
New Kit Date: 23-SEP-2002
Modification Date: Not Applicable
Modification Type: NEW KIT
Copyright (c) Compaq Computer Corporation 2001,2002. All rights reserved.
OP/SYS: OpenVMS VAX
COMPONENTS: F11AACP
F11BXQP
F11CACP
F11DACP
SOURCE: Compaq Computer Corporation
ECO INFORMATION:
ECO Kit Name: VAXF11X04_072
ECO Kits Superseded by This ECO Kit: VAXF11X03_072
ECO Kit Approximate Size: 576 Blocks
Kit Applies To: OpenVMS VAX V7.2
System/Cluster Reboot Necessary: Yes
Rolling Re-boot Supported: Yes
Installation Rating: INSTALL_1
1 - To be installed on all systems running
the listed version(s) of OpenVMS.
Kit Dependencies:
The following remedial kit(s) must be installed BEFORE
installation of this kit:
VAXUPDATE01_072
In order to receive all the corrections listed in this
kit, the following remedial kits should also be installed:
VAXODS1_01_072
ECO KIT SUMMARY:
An ECO kit exists for F11AACP ,F11BXQP,F11CACP and F11DACP on OpenVMS VAX V7.2. This kit
addresses the following problems:
PROBLEMS ADDRESSED IN VAXF11X04_072 KIT
o If multiple processes on the same node are attempting to
obtain or convert the same lock, which is being mastered on
another node, it is possible to get an SS$_CVTUNGRANT error in
addition to an SS$_IVLOCKID error during this race. The
problem arises because the new, faster lock manager, adds
timing considerations that were not previously present. The
code now traps this error and retries the lock conversion
attempt.
Images Affected:[SYS$LDR]F11BXQP.EXE
o A system can crash with a MAPCNTZER bugcheck, in XQP routine
MAKE_POINTER, when trying to expand or create a new
PAGEFILE.SYS.
Crash Dump Summary Information:
------------------------------
Bugcheck Type: MAPCNTZER, Attempted to generate
zero length map pointer
Current Image: DSA0:[SYS0.SYSCOMMON.][SYSEXE]SYSGEN.EXE
Failing PC: FFFFFFFF.A89F6B98 MAKE_POINTER_C+00044
Failing PS: 30000000.00000000
Module: F11BXQP (Link Date/Time:
24-AUG-2000 07:20:45.47)
Offset: 00026B98
This problem occurs in the XQP only on volumes where the
cluster size exceeds 256 and the volume is so fragmented that
the file in question would need to expand beyond the first
file header. The fix for this problem will result in a
SS$_HEADERFULL error for SYSGEN CREATE instead the bugcheck
encountered.
Images Affected:[SYS$LDR]F11BXQP.EXE
o The system can crash with a UNXSIGNAL bugcheck in routine
ACL_BUILDACL. The exception occurs because the XQP tries to
access a null FCB (which was loaded from a previous FCB that
had FCB$L_EXFCB = 0).
Crash Dump Summary
------------------
Bugcheck Type: UNXSIGNAL, Unexpected signal name in ACP
Current Process: BATCH_488
Failing PS: 20000000.00000000
Module: F11BXQP (Link Date/Time:
24-AUG-2000 07:20:45.47)
Offset: 00052DC4
Images Affected:[SYS$LDR]F11BXQP.EXE
o When using SYS$CHECK_ACCESS against a file that resides on a
CD-ROM the process disappears and the accounting record shows
a final status of ACCVIO. If BUGCHECK_FATAL or SYSTEM_CHECK
are set, the system bugchecks with SSRVEXCEPT in
SYS$$CHECK_ACCESS. The same program, when used against a file
that is not on a CD-ROM, works as expected.
The problem occurs because SYS$$CHECK_ACCESS calls
FIL$OSR_CLONE_PROFILE. FIL$OSR_CLONE_PROFILE does a $QIO
against the file for IO$_ACPCONTROL, with a FIB control
function of FIB$C_CLONE_FIL_PROFILE. This $QIO returns good
status even though this control function is not implemented in
ACPCONTROL.B32. So, routine FIL$OSR_CLONE_PROFILE returns
SS$_NORMAL and the returned ORB is null (which leads to the
exception).
Also, it was found that the embedded ORB associated with the
file (FCB) was not being initialized in the case of an XAR not
being supplied.
Images Affected:[SYSEXE]F11CACP.EXE
[SYSEXE]F11DACP.EXE
PROBLEMS ADDRESSED IN VAXF11X03_072 KIT:
o The XQP bugchecks with XQPERR in routine MAKE_DIRINDX in
module RDBLOK, 'All the index buffers are active'. There is
DFO IO$M_MOVEFILE activity on the node at the time of the
crash.
Crashdump Summary Information:
------------------------------
Bugcheck Type: XQPERR, Error detected by file system XQP
Current Process: BATCH_1170
Current Image: $2$DUA0:[SYS8.SYSCOMMON.]
[SYSEXE]PSPA$ADVISOR.EXE;14
Failing PC: FFFFFFFF.8017DCDC MAKE_DIRINDX_C+0032C
Failing PS: 30000000.00000000
Module: F11BXQP (Link Date/Time:
18-JUL-2000 13:27:34.38)
Offset: 00001CDC
Images Affected: [SYS$LDR]F11BXQP.EXE
o During the DISMOUNT of a bound volume set, the XQP signals an
XQPERR bugcheck in CHECK_DISMOUNT after a $DEQ against
RVT$L_STRUCLKID fails with "Cannot dequeue a lock with
sublocks".
Images Affected: [SYS$LDR]F11BXQP.EXE
o The XQP for trusted processes/threads is supposed to suppress
the audit messages that may emanate from their actions. These
audit messages are not being suppressed.
Images Affected: [SYS$LDR]F11BXQP.EXE
Problems Addressed in VAXF11X21_072:
o A BADATTRIB error can occur from BACKUP and other applications
that use ATR$C_ASCNAME.
Images Affected:
- [SYSEXE]F11AACP.EXE
- [SYSEXE]F11AACP.STB
o An XQPERR bugcheck can occur when the volume allocation lock
becomes invalid and the process is extending/re-validating the
INDEXF.SYS file.
Images Affected:
- [SYSEXE]F11BXQP.EXE
- [SYSEXE]F11BXQP.STB
o An exception, which leads to an INVEXCEPTN bugcheck, can occur
in XQP routine INS_LIMBO or TRIM_LIMBO.
Images Affected:
- [SYSEXE]F11BXQP.EXE
- [SYSEXE]F11BXQP.STB
o The XQP bugchecks in routine MARK_INCOMPLETE with NOTWCBWCB, a
corrupted WCB list. Other bugchecks could occur, depending
upon when the WCB queue corruption is detected.
Images Affected: [SYSEXE]F11BXQP.EXE
o MOUNT routine START_ACP bugchecks with NOTUCBRVT.
Images Affected:
- [SYSEXE]F11BXQP.EXE
- [SYSEXE]F11BXQP.STB
o If a directory command is executed for a directory file that
is greater than 127 blocks, the UCB$x_QLEN counter gets
incremented, but may not get decremented.
If an I/O past the highwater mark is started exactly at
%x7FFFFF blocks from the file size, a false SS$_ENDOFFILE
would be reported.
Images Affected:
- [SYSEXE]F11BXQP.EXE
- [SYSEXE]F11BXQP.STB
o A process hangs and may not be deleteable when writing to a
sequential file. The process may have a "lost" I/O
outstanding. In such cases, the I/O may be on the FCB (file
control block) HWM (high water mark) wait queue waiting for
other I/Os.
Images Affected:
- [SYSEXE]F11BXQP.EXE
- [SYSEXE]F11BXQP.STB
o Separator pages for print jobs, which are created via COPY to
a spooled device, do not include the complete file specification.
Images Affected:
- [SYSEXE]F11BXQP.EXE
- [SYSEXE]F11BXQP.STB
o The XQP bugchecks with an XQPERR in routine RES_SEQ_MISMATCH,
'Found a stale referenced or non directory FCB (file control
block) in FCB queue'.
Images Affected:
- [SYSEXE]F11BXQP.EXE
- [SYSEXE]F11BXQP.STB
o The XQP bugchecks with an XQPERR in routine MAKE_DEACCESS,
'deaccess conversion failed'.
Images Affected:
- [SYSEXE]F11BXQP.EXE
- [SYSEXE]F11BXQP.STB
o An XQPERR bugcheck occurs in UPDATE_INDX of multi-header
directories.
Images Affected:
- [SYS$LDR]F11BXQP.EXE
- [SYS$LDR]F11BXQP.STB
o A paged pool may become overused or exhausted when many
shuffles occur on a directory with multiple headers. This is
due to an occurrence of a large ACL (access control list)
chain.
Images Affected:
- [SYSEXE]F11BXQP.EXE
- [SYSEXE]F11BXQP.STB
Problems Addressed in VAXF11X01_072:
o A READATTR/FID_TO_SPEC mechanism, such as a data collector
process running on the same volume as a defragger competing
for the same data, both processes try to delete the
'primary_fcb' used to get the information in question. In
both of these circumstances, the reference count on the FCB
has not been bumped up so both accesses appear to allow the
deletion. This results in a NOTFCBFCB Bugcheck.
Images Affected: [SYS$LDR]F11BXQP.EXE
o If a process attempts to mount a bound volume set (BVS)
and all the members of the BVS are not present, an attempt
to lock the volume for REBUILDing the meta-data on the volume
will fail. However, the blocking lock (F11B$b) is left
with the process.
Image(s) Affected: [SYS$LDR]F11BXQP.EXE
o An XQPERR bugcheck in LOCKERS can occur when the retry
limit on the F11B$x lock is reached. This happens when
the owner of the $x lock is running at a high process priority
and there are a number of processes in a clustered system that
are also trying to validate this lock but at a lower process
priority. The high priority process never really gives up the
locks long enough to let the low process priority processes
to continue and either validate or release the $v lock.
To avoid this situation, after (every) 256 attempts, the
process with the most retry iterations is stalled for a short
period to allow other processes to complete their accesses to
the lock.
Image(s) Affected: [SYS$LDR]F11BXQP.EXE
o After releasing the current process' IPL/Fork lock, a system
can crash with a SPLACQERR bugcheck
Image(s) Affected: [SYS$LDR]F11BXQP.EXE
o When writing a large number of log files in rapid succession
to a very large directory, the directory file becomes "corrupt"
and DUMP/DIRECTORY displays a block similar to the following:
Virtual block number 3574 (00000DF6), 512 (0200) bytes
0000 Directory Entry:
0000 Size: 508
0002 Version limit: 32767
0004 Type: 0 (FID)
0005 Name count: 24
0006 Name: COSLR1201_01_JUPICC2.LIS
001E Version: 7859 FID: (40993,5,0)
0026 Version: 7858 FID: (40990,1,0)
002E Version: 7857 FID: (40988,3,0)
...
01E6 Version: 7802 FID: (40455,1,0)
01EE Version: 7801 FID: (40454,1,0)
01F6 Version: 32767 FID: (16744447,65535,0)
01FE End of records (-1)
Image(s) Affected: [SYS$LDR]F11BXQP.EXE
o Exhaustion of non-paged pool with FCBs and CFCBs as the
primary data structures. The output from a SHOW
MEMORY/CACHE/FULL command may put odd values in the files
retained field:
$ SHOW MEMORY /CACHE/FULL
System Memory Resources on 18-NOV-1999 14:13:52.13
Virtual I/O Cache
Total Size (pages) 2098 Read IO Count 13359
Free Pages 0 Read Hit Count 6130
Pages in Use 2098 Read Hit Rate 45%
Maximum Size (SPTEs) 3140177 Write IO Count 16874
Files Retained ****** IO Bypassing the Cache 536
Non-paged pool exhaustion starts when a file create or access
causes the file to become part of the VCC cache and be held on
the XQP limbo queue. A short time later, if this file is
renamed and as part of that rename operation the file is
removed from the limbo queue, the accounting for that file
is not correct. If the file is later deleted, the count
of the limbo items goes down incorrectly and eventually goes
negative. This then allows the limbo list to grow very large.
This can be seen using SDA where the value of EXE$GL_LIMBOLEN
does not match the number of queue elements in EXE$GQ_LIMBOQ.
For example:
SDA> EX EXE$GL_LIMBOLEN
EXE$GL_LIMBOLEN: 00000061 "a..."
SDA> VALIDATE QUEUE EXE$GQ_LIMBOQ
Queue is complete, total of 195 elements in the queue
SDA> EVAL 61
Hex = 00000061 Decimal = 97
SDA>
Images Affected: [SYS$LDR]F11BXQP.EXE
o Under the following circumstances:
1. A directory with multiple headers (from a large ACL for
example) is deleted on one node (A) in a cluster and
2. the directory had been previously accessed on another node
(B) in the cluster
The files created with the previously deleted headers in step
1 would show up on node B with the error:
%SYSTEM-F-NOSUCHFILE, no such file
Image(s) Affected: [SYS$LDR]F11BXQP.EXE
o Fix the internal GETTIM function so that it properly formats
the 64 bit internal time to an RSX-11 compatible ASCII string.
RSX date string contain only a 2 character year field (ASCII).
In order to accommodate the year 2000, this ASCII encoding
will now include the value of decades (since 1900) into the
string. This means that for the years 1900 through 1999, the
date string will appear as 00 to 99. For the years 2000 into
the years 39xx, the string will show as ':0', ';0' etc.
For years prior to the year 1900, dates are encoded as binary
zeros, which is interpreted as 'no date'
Image(s) Affected:
- [SYS$LDR]F11BXQP.EXE
- [SYSEXE]F11AACP.EXE
RELATED ARTICLES:
Detailed articles describing the problems listed above may exist in
the OPENVMS database. To view these articles, open the appropriate
product database and perform a query using either of the following
search strings: 'VAXF11X04_072' or 'VAXF11X'.
ECO KIT ORDERING INSTRUCTIONS:
If after an evaluation you wish to obtain this kit, request it
electronically using the appropriate Advanced Electronic Services
(AES) Service Tool. If you are not familiar with how to request
kits electronically, open the DIA, WIS or DSNLINK database and
review the article entitled:
[AES] How To Electronically Request ECO Kits Using Service Tools
INSTALLATION NOTES:
This kit requires a system reboot. Compaq strongly recommends that
a reboot is performed immediately after kit installation to avoid
system instability
If you have other nodes in your OpenVMS cluster, they must also be
rebooted in order to make use of the new image(s). If it is not
possible or convenient to reboot the entire cluster at this time, a
rolling re-boot may be performed.
All trademarks are the property of their respective owners.
==========================================================================
| Table of Kit Image Information |
+----------------------------+----------+-----------------+--------------+
| | Overall | Image File | Image Link |
| Image Name | Checksum | Identification | Date/Time |
+----------------------------+----------+-----------------+--------------+
| F11AACP.EXE |%X52F7B090| X-2 | 20-DEC-1999 |
| | | 10:18:04.48 |
+----------------------------+----------+-----------------+--------------+
| F11BXQP.EXE |%X119BC97C| XQP V7.2R-0004 | 23-JUL-2002 |
| | | 15:04:08.46 |
+----------------------------+----------+-----------------+--------------+
| F11CACP.EXE |%XE200D487| X-8 | 4-JUN-2002 |
| | | 20:52:46.70 |
+----------------------------+----------+-----------------+--------------+
| F11DACP.EXE |%X20F8EF29| X-8 | 4-JUN-2002 |
| | | 20:53:10.59 |
+----------------------------+----------+-----------------+--------------+
|