2100 server will not boot

From: Rudolf Gabler (rug@usm.uni-muenchen.de)
Date: Tue May 13 2003 - 11:13:57 EDT


Hi managers,

We are in the middle of a rolling upgrade of a cluster from V5.1A to
V5.1B PL 1, and one node will not boot the V5.1B genvmunix (see the log
below). I saved the V5.1A genvmunix kernel, which still boots on this
node. The node is an:

 AlphaServer 2100 5/300

I didn't find anything in the release notes. Any hints?

Best regards,

Rudi Gabler

Log:
P00>>>boot -fi genvmunix
(boot dka200.2.0.1.0 -file genvmunix -flags A)
block 0 of dka200.2.0.1.0 is a valid boot block
reading 19 blocks from dka200.2.0.1.0
bootstrap code read in
base = 200000, image_start = 0, image_bytes = 2600
initializing HWRPB at 2000
initializing page table at 1fff0000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code

UNIX boot - Wednesday August 01, 2001

Loading genvmunix ...
Loading at 0xffffffff00000000

Sizes:
text = 10695680
data = 2774592
bss = 4569248
Starting at 0xffffffff000133e0

bcm: DEGXA driver V1.0.12 NUMA lanlog
failed configuring ev7_ocla subsystem
Alpha boot: available memory from 0x2760000 to 0x1ffee000
Compaq Tru64 UNIX P5.1B (Rev. 173); Tue Dec 17 15:49:27 EST 2002
physical memory = 512.00 megabytes.
available memory = 472.55 megabytes.
using 1883 buffers containing 14.71 megabytes of memory
Master cpu at slot 0
Starting secondary cpu 2
Firmware revision: 5.3
PALcode: UNIX version 1.22
AlphaServer 2100 5/300
Firmware revision: 5.3
PALcode: UNIX version 1.22
ibus0 at nexus
cpu 0 EV-5 4mb b-cache
cpu 2 EV-5 4mb b-cache
gpc0 at ibus0
pci0 (primary bus:0) at ibus0 slot -1
tu2: DECchip 21040: Revision: 2.3
tu2 at pci0 slot 0
tu2: DEC TULIP (10Mbps) Ethernet Interface, hardware address: 08-00-2B-E5-E1-89
tu2: console mode: selecting 10BaseT (UTP) port: half duplex
Loading SIOP: script 1000000, reg 81441000, data 427b0000
scsi4 at psiop0 slot 0 rad 0
eisa0 at pci0
ace0 at eisa0
ace1 at eisa0
lp0 at eisa0
fdi0 at eisa0
fd0 at fdi0 unit 0
gpc1 not probed
qvision0: CMPQ Qvision 1024/E SVGA
qvision0 at eisa0
emx4 at pci0 slot 6
KGPSA-CA : Driver Rev 2.06 : F/W Rev 3.81A4(2.01A0) : wwn 1000-0000-c924-bf07
emx4: Using console topology setting of : Fabric
scsi5 at emx4 slot 0 rad 0
tu3: DECchip 21140: Revision: 2.0
tu3: auto negotiation capable device
tu3 at pci0 slot 7
tu3: DEC TULIP (10/100) Ethernet Interface, hardware address: 00-00-F8-04-7B-1D
tu3: auto negotiation off: selecting 100BaseTX (UTP) port: half duplex
mchan1: Module revision = 34
mchan1: jumpered as HUB configuration
mchan1 at pci0 slot 8
Created FRU table binary error log packet
kernel console: ace0
dli: configured
NetRAIN configured.
Random number generator configured.
TruCluster Server V5.1B (Rev. 1029); 12/17/02 14:28
TNC kproc_creator_daemon: Initialized and Ready
clubase: configured
Configuring RDG to use Memory Channel
ics_hl: Configuring memory channel as transport.
icsnet: configured
drd configured 0
drd_config_thread: Found 1 previously unknown local devices
kch: configured
dlm: configured
Starting CFS daemons
Registering CFS Services
Initializing CFSREC ICS Service
Registering CFSMSFS remote syscall interface
Registering CMS Services
rm slave: mchan1, hubslot = 1, phys_rail 0 (size 512 MB)
rm slave: log_rail 0 (size 512 MB), phys_rail 0 (mchan1)
ics_mct: icsinfo set for node 1
ics_mct: Declaring this node up 1
ics_mct: icsinfo set for node 2
ics_mct: icsinfo set for node 3
CNX MGR: Join operation complete
CNX MGR: membership configuration index: 91 (47 additions, 44 removals)
CNX MGR: quorum (re)gained, (re)starting cluster operations.
Joining versw kch set.
CNX MGR: Node coma 1 incarn 0xe4c0 csid 0x150003 has been added to the cluster
ics_mct: Declaring this node up 3
CNX MGR: Node eridani 3 incarn 0x494ac csid 0x70001 has been added to the cluster
ics_mct: Declaring this node up 2
CNX MGR: Node virgo 2 incarn 0x19bd8 csid 0x140002 has been added to the cluster
dlm: resuming lock activity
kch: resuming activity
cam_logger: SCSI event packet
cam_logger: bus 4
psiop_hardintr
Bus reset detected
cam_logger: SCSI event packet
cam_logger: bus 4
psiop_hardintr
Bus reset detected
clsm: incoming CNX data: '
a'
clsm: checking for peer configurations
Waiting for cluster mount to complete
clsm: configuration synchronized using peer data
clsm: initialized
cam_logger: SCSI event packet
cam_logger: bus 4 target 2 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 4 target 2 lun 0
ss_perform_timeout
timeout on disconnected request
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 4 target 2 lun 0
ss_perform_timeout
Reached max abort count, scheduled bus reset
Active CCB at time of error
cam_logger: SCSI event packet
cam_logger: bus 4
psiop_hardintr
Bus reset detected
panic (cpu 2): cfs_mountboot_local: PFS mountroot failed
syncing disks... done
drd: Clean Shutdown

DUMP: Warning: no disk available for dump.

DUMP: first crash dump failed: attempting memory dump...
DUMP: compressing 88184KB into 444935KB memory...
DUMP: Starting Address Ending Address Size(MB)
DUMP: ------------------ ------------------ --------
DUMP: 0xfffffc001fc8e000 - 0xfffffc001ffedfef 3.3 (indicator)
DUMP: Writing data..... [5MB]
DUMP: crash dump complete.
halted CPU 2

halted CPU 0

halt code = 5
HALT instruction executed
PC = ffffffff003649f0

Working version:
resetting all I/O buses
P00>>>boot -fi genvmunix.v51a
(boot dka200.2.0.1.0 -file genvmunix.v51a -flags A)
block 0 of dka200.2.0.1.0 is a valid boot block
reading 19 blocks from dka200.2.0.1.0
bootstrap code read in
base = 200000, image_start = 0, image_bytes = 2600
initializing HWRPB at 2000
initializing page table at 1fff0000
initializing machine state
setting affinity to the primary CPU
jumping to bootstrap code

UNIX boot - Wednesday August 01, 2001

Loading genvmunix.v51a ...
Loading at 0xffffffff00000000

Sizes:
text = 9919616
data = 2466848
bss = 4421680
Starting at 0xffffffff00011940

sysconfigtab: attribute parallel_edt_scan not in subsystem io
Alpha boot: available memory from 0x2294000 to 0x1ffee000
Compaq Tru64 UNIX P5.1A (Rev. 304); Mon May 13 10:02:07 EDT 2002
physical memory = 512.00 megabytes.
available memory = 477.35 megabytes.
using 1888 buffers containing 14.75 megabytes of memory
Master cpu at slot 0
Starting secondary cpu 2
Firmware revision: 5.3
PALcode: UNIX version 1.22
AlphaServer 2100 5/300
Firmware revision: 5.3
PALcode: UNIX version 1.22
ibus0 at nexus
cpu 0 EV-5 4mb b-cache
cpu 2 EV-5 4mb b-cache
gpc0 at ibus0
pci0 (primary bus:0) at ibus0 slot -1
tu2: DECchip 21040: Revision: 2.3
tu2 at pci0 slot 0
tu2: DEC TULIP (10Mbps) Ethernet Interface, hardware address: 08-00-2B-E5-E1-89
tu2: console mode: selecting 10BaseT (UTP) port: half duplex
Loading SIOP: script 1000000, reg 81441000, data 4229c000
scsi0 at psiop0 slot 0 rad 0
eisa0 at pci0
ace0 at eisa0
ace1 at eisa0
lp0 at eisa0
fdi0 at eisa0
fd0 at fdi0 unit 0
gpc1 not probed
qvision0: CMPQ Qvision 1024/E SVGA
qvision0 at eisa0
emx4 at pci0 slot 6
KGPSA-CA : Driver Rev 2.02 : F/W Rev 3.81A4(2.01A0) : wwn 1000-0000-c924-bf07
emx4: Using console topology setting of : Fabric
scsi1 at emx4 slot 0 rad 0
tu3: DECchip 21140: Revision: 2.0
tu3: auto negotiation capable device
tu3 at pci0 slot 7
tu3: DEC TULIP (10/100) Ethernet Interface, hardware address: 00-00-F8-04-7B-1D
tu3: auto negotiation off: selecting 100BaseTX (UTP) port: half duplex
mchan1: Module revision = 34
mchan1: jumpered as HUB configuration
mchan1 at pci0 slot 8
Created FRU table binary error log packet
kernel console: ace0
dli: configured
NetRAIN configured.
TruCluster Server V5.1A (Rev. 1312); 05/13/02 08:11
clubase: configured
TNC kproc_creator_daemon: Initialized and Ready
Configuring RDG to use Memory Channel
ics_hl: Configuring memory channel as transport.
icsnet: configured
drd configured 0
kch: configured
dlm: configured
Starting CFS daemons
Registering CFS Services
Initializing CFSREC ICS Service
Registering CFSMSFS remote syscall interface
Registering CMS Services
rm slave: mchan1, hubslot = 1, phys_rail 0 (size 512 MB)
rm slave: log_rail 0 (size 512 MB), phys_rail 0 (mchan1)
ics_mct: icsinfo set for node 1
ics_mct: Declaring this node up 1
ics_mct: icsinfo set for node 3
ics_mct: icsinfo set for node 2
CNX MGR: Join operation complete
CNX MGR: membership configuration index: 95 (49 additions, 46 removals)
CNX MGR: quorum (re)gained, (re)starting cluster operations.
Joining versw kch set.
CNX MGR: Node coma 1 incarn 0x2dfd0 csid 0x170003 has been added to the cluster
ics_mct: Declaring this node up 3
CNX MGR: Node eridani 3 incarn 0x494ac csid 0x70001 has been added to the cluster
ics_mct: Declaring this node up 2
CNX MGR: Node virgo 2 incarn 0x19bd8 csid 0x140002 has been added to the cluster
dlm: resuming lock activity
kch: resuming activity
cam_logger: SCSI event packet
cam_logger: bus 0 target 255 lun 255
psiop_hardintr
Bus reset detected
cam_logger: SCSI event packet
cam_logger: bus 0 target 255 lun 255
psiop_hardintr
Bus reset detected
clsm: incoming CNX data: '
a'
clsm: checking for peer configurations
cam_logger: SCSI event packet
cam_logger: bus 0 target 255 lun 255
psiop_hardintr
Bus reset detected
clsm: configuration synchronized using peer data
clsm: initialized
Waiting for cluster mount to complete
vm_swap_init: swap is set to lazy (over commitment) mode
Error could not open file /vmunix.sym
kloadsrv: Error 2: ldr_kernel_bootstrap("/genvmunix.v51a") failed, return -2
CMS: Joining deferred filesystem sets
Checking device naming:
    Passed.
dsfmgr: NOTE: updating kernel basenames for system at /
    scp kevm tty00 tty01 lp0 floppy2 tape4 mc1 dmapi scp2 dsk11 dsk12 dsk13 dsk6
Mounting / (root)
user_cfg_pt: reconfigured
root_mounted_rw: reconfigured
Mounting /cluster/members/member1/boot_partition (boot filesystem)
user_cfg_pt: reconfigured
root_mounted_rw: reconfigured
user_cfg_pt: reconfigured
dsfmgr: NOTE: updating kernel basenames for system at /
starting LSM
Checking local filesystems
Mounting local filesystems ....
(etc...)
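
For anyone comparing the two console logs above, here is a minimal sketch
of one way to list the lines that appear only in the failing boot. The
script and the names normalize and only_in_first are illustrative
assumptions, not a Tru64 UNIX tool. It masks hex values and SCSI bus
numbers so that plain renumbering (scsi4 versus scsi0) is not reported as
a difference; remove that masking if the renumbering itself is of
interest. The sample strings are short excerpts from the logs above.

```python
import re


def normalize(line):
    """Mask values that vary between boots: hex numbers and SCSI bus indices."""
    line = line.strip()
    line = re.sub(r"0x[0-9a-fA-F]+", "0xN", line)
    line = re.sub(r"\b(scsi|bus )\d+", r"\1N", line)
    return line


def only_in_first(first, second):
    """Return normalized lines of `first` that never occur in `second`."""
    seen = {normalize(l) for l in second.splitlines()}
    result = []
    for l in first.splitlines():
        n = normalize(l)
        # Skip blank lines, lines also present in the other log, and repeats.
        if n and n not in seen and n not in result:
            result.append(n)
    return result


# Short excerpts from the two logs, used here only as sample input.
failing = """scsi4 at psiop0 slot 0 rad 0
failed configuring ev7_ocla subsystem
cam_logger: bus 4
Bus reset detected
panic (cpu 2): cfs_mountboot_local: PFS mountroot failed"""

working = """scsi0 at psiop0 slot 0 rad 0
cam_logger: bus 0 target 255 lun 255
Bus reset detected"""

unique = only_in_first(failing, working)
for line in unique:
    print(line)
```

On the full logs, the same approach should surface the ss_perform_timeout
entries for bus 4 target 2 lun 0 and the panic line, which have no
counterpart in the V5.1A boot.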