Fire V210 reboots "PCI bus 1 error(s)!"

From: Mauricio Colina (mcolina@adexus.cl)
Date: Mon Nov 24 2003 - 16:25:38 EST


Hello Gurus :

we have a v210 with the following features :

  Model: Fire V210
  ------
  Hostname: pelu
  Hostid: 8345a48d
  Release: 5.8
  Kernel architecture: sun4u
  Application architecture: sparc
  Hardware provider: Sun_Microsystems
  Domain:
  Kernel version: SunOS 5.8 Generic 108528-23 Jun 2003
  MB-Model: 'SUNW,375-3150'
  OBP 4.8.2 2003/03/27 13:22

  ------ Print Diag (prtdiag -v) native ------

  System Configuration: Sun Microsystems sun4u Sun Fire V210
  System clock frequency: 167 MHZ
  Memory size: 4GB

  ==================================== CPUs
  ====================================
                        E$ CPU CPU Temperature
  Fan
         CPU Freq Size Impl. Mask Die Ambient Speed
  Unit
         --- -------- ---------- ------ ---- -------- -------- -----
  ----
       MB/P0 1002 MHz 1MB US-IIIi 2.3 - -
       MB/P1 1002 MHz 1MB US-IIIi 2.3 - -

  ================================= IO Devices
  =================================
       Bus Freq
  Brd Type MHz Slot Name Model
  --- ---- ---- ---------- ----------------------------
  --------------------
   0 pci 66 2 network-pci14e4,1648.108e.16+
   0 pci 66 2 network-pci14e4,1648.108e.16+
   0 pci 66 2 scsi-pci1000,21.1000.1000.1 +
   0 pci 66 2 scsi-pci1000,21.1000.1000.1 +
   0 pci 66 2 network-pci14e4,1648.108e.16+
   0 pci 66 2 network-pci14e4,1648.108e.16+
   0 pci 33 7 isa/serial-su16550 (serial)
   0 pci 33 7 isa/serial-su16550 (serial)
   0 pci 33 7 isa/rmc-comm-rmc_comm (seria+
   0 pci 33 13 ide-pci10b9,5229.c4 (ide)

  ============================ Memory Configuration
  ============================
  Segment Table:
  -----------------------------------------------------------------------
  Base Address Size Interleave Factor Contains
  -----------------------------------------------------------------------
  0x0 2GB 4 BankIDs 0,1,2,3
  0x1000000000 2GB 4 BankIDs 16,17,18,19

  Bank Table:
  -----------------------------------------------------------
             Physical Location
  ID ControllerID GroupID Size Interleave Way
  -----------------------------------------------------------
  0 0 0 512MB 0,1,2,3
  1 0 1 512MB
  2 0 1 512MB
  3 0 0 512MB
  16 1 0 512MB 0,1,2,3
  17 1 1 512MB
  18 1 1 512MB
  19 1 0 512MB

  Memory Module Groups:
  --------------------------------------------------
  ControllerID GroupID Labels
  --------------------------------------------------
  0 0 MB/P0/B0/D0,MB/P0/B0/D1
  0 1 MB/P0/B1/D0,MB/P0/B1/D1

  Memory Module Groups:
  --------------------------------------------------
  ControllerID GroupID Labels
  --------------------------------------------------
  1 0 MB/P1/B0/D0,MB/P1/B0/D1
  1 1 MB/P1/B1/D0,MB/P1/B1/D1

  ============================ Environmental Status
  ============================
  Fan Speeds:
  ---------------------------------------
  Location Sensor Speed
  ---------------------------------------
  MB/P0/F0 RS 16875 rpm
  MB/P0/F1 RS 16463 rpm
  MB/P1/F0 RS 16875 rpm
  MB/P1/F1 RS 16875 rpm
  F0 RS 9310 rpm
  F1 RS 9375 rpm
  F2 RS 9574 rpm
  PS0 FF_FAN okay
  F3 RS 9375 rpm
  --------------------------------------------------
  Led State:
  --------------------------------------------------
  Location Led State Color
  --------------------------------------------------
  MB ACT on green
  MB SERVICE off amber
  MB LOCATE off white
  PS0 ACT on green
  PS0 SERVICE off amber
  PS0 OK2RM off blue
  HDD0 SERVICE off amber
  HDD0 OK2RM off blue
  HDD1 SERVICE off amber
  HDD1 OK2RM off blue
  ---------------------------------------------------------------
  Temperature sensors:
  ---------------------------------------------------------------
  Location Sensor Temperature Lo LoWarn HiWarn Hi Status
  ---------------------------------------------------------------
  MB T_ENC 24C -3C 5C 40C 48C okay
  MB/P0 T_CORE 55C - - 110C 115C okay
  MB/P1 T_CORE 52C - - 110C 115C okay
  PS0 FF_OT - - - - - okay
  ----------------------------------------------------------------------
  Voltage sensors:
  ----------------------------------------------------------------------
  Location Sensor Voltage Lo LoWarn HiWarn Hi Status
  ----------------------------------------------------------------------
  MB V_VTT 1.31V - 1.17V 1.43V - okay
  MB V_GBE_+2V5 2.51V - 2.25V 2.75V - okay
  MB V_GBE_CORE 1.20V - 1.08V 1.32V - okay
  MB V_VCCTM 2.55V - 2.25V 2.75V - okay
  MB V_+2V5 2.61V - 2.34V 2.86V - okay
  MB V_+1V5 1.51V - 1.35V 1.65V - okay
  MB/BAT V_BAT 2.97V - 2.70V - - okay
  MB/P0 V_CORE 1.45V - 1.26V 1.54V - okay
  MB/P1 V_CORE 1.45V - 1.26V 1.54V - okay
  PS0 FF_UV - - - - - okay
  PS0 FF_OV - - - - - okay
  PS0 P_PWR - - - - - okay
  ----------------------------------------------------------------------
  Current sensors:
  ----------------------------------------------------------------------
  Location Sensor Current Lo LoWarn HiWarn Hi Status
  ----------------------------------------------------------------------
  MB FF_SCSI - - - - - okay
  PS0 FF_OC - - - - - okay
  -------------------------
  Board Status:
  -------------------------
  Location Status
  -------------------------
  MB/SC okay
  PS0 okay
  HDD0 present
  HDD1 present

  ================================ HW Revisions
  ================================
  ASIC Revisions:
  ---------------
  pci: Rev 4
  pci: Rev 4
  pci: Rev 4
  pci: Rev 4

  System PROM revisions:
  ----------------------
  OBP 4.8.2 2003/03/27 13:22 Sun Fire V210/V240
  OBDIAG 4.8.2 2003/03/27 13:23

  the problem basically is that machine is reboots constant :

Nov 16 19:23:43 pelu pcisch: [ID 370704 kern.info] PCI-device: usb@a, ohci0
Nov 16 19:23:43 pelu genunix: [ID 936769 kern.info] ohci0 is
/pci@1e,600000/usb@a
Nov 16 19:23:50 pelu pcisch: [ID 831440 kern.warning] WARNING: pcisch-0:
PCI fault log start:
Nov 16 19:23:50 pelu pcisch: [ID 303176 kern.notice] PCI SERR
Nov 16 19:23:50 pelu pcisch: [ID 917854 kern.notice] pcisch-0: PBM
AFSR=0x0.00000000
Nov 16 19:23:50 pelu pcisch: [ID 120591 kern.notice] dwordmask=0 bytemask=0
Nov 16 19:23:50 pelu pcisch: [ID 607383 kern.notice] pcisch-0: PCI primary
error (0):
Nov 16 19:23:50 pelu pcisch: [ID 259679 kern.notice] pcisch-0: PCI
secondary error (0):
Nov 16 19:23:50 pelu pcisch: [ID 467665 kern.notice] pcisch-0: PBM AFAR
0.00000000:
Nov 16 19:23:50 pelu pcisch: [ID 127741 kern.warning] WARNING: pcisch0: PCI
config space CSR=0x2a0
Nov 16 19:23:50 pelu pcisch: [ID 141464 kern.notice] pcisch-0: PCI fault
log end.
Nov 16 19:23:50 pelu unix: [ID 836849 kern.notice]
Nov 16 19:23:50 pelu ^Mpanic[cpu1]/thread=2a1001d1d20:
Nov 16 19:23:50 pelu unix: [ID 578303 kern.notice] pcisch-0: PCI bus 1
error(s)!
Nov 16 19:23:50 pelu unix: [ID 100000 kern.notice]
Nov 16 19:23:50 pelu genunix: [ID 723222 kern.notice] 000002a1001cbea0
pcisch:pbm_error_intr+164 (30000157e10, 7f3, 30000179350, 3, 30000157e10,
1)
Nov 16 19:23:50 pelu genunix: [ID 179002 kern.notice] %l0-3:
000003000007b810 0000000000004000 0000000000001fff 000002a10000fd20
Nov 16 19:23:50 pelu %l4-7: 0000000000000000 00000300001c9048
0000000010490ba8 0000000000000001
Nov 16 19:23:50 pelu genunix: [ID 723222 kern.notice] 000002a1001cbf50
unix:current_thread+44 (8, 30001ed2460, 0, 0, 300001b1000, 300001e3e60)
Nov 16 19:23:50 pelu genunix: [ID 179002 kern.notice] %l0-3:
00000000100074b0 000002a1001d1061 000000000000000e 0000000000000016
Nov 16 19:23:50 pelu %l4-7: 0000030001b2ea98 0000000000000016
0000000000000004 000002a1001d1910
Nov 16 19:23:50 pelu genunix: [ID 723222 kern.notice] 000002a1001d19b0
bge:bge_factotum_link_check+c (300001e5000, 300001e5000, 20, 1,
30001eb37f8, 300001629e0)
Nov 16 19:23:50 pelu genunix: [ID 179002 kern.notice] %l0-3:
0000000078047334 0000030000065748 0000002058701050 0000030001eb37f8
Nov 16 19:23:50 pelu %l4-7: 0000030000187088 0000000000000000
0000000000000000 0000030001eb37f8
Nov 16 19:23:50 pelu genunix: [ID 723222 kern.notice] 000002a1001d1a60
bge:bge_chip_factotum+5c (300001e6f6c, 300001e6e60, 300001e5000, 0, 0, 0)
Nov 16 19:23:50 pelu genunix: [ID 179002 kern.notice] %l0-3:
0000000078050888 0000000000000001 0000030001b84ca0 0000030000187748
Nov 16 19:23:50 pelu %l4-7: 0000030000187848 0000000000000000
0000000000000004 000002a1001d1a70
Nov 16 19:23:50 pelu unix: [ID 100000 kern.notice]
Nov 16 19:23:50 pelu genunix: [ID 672855 kern.notice] syncing file
systems...
Nov 16 19:23:50 pelu genunix: [ID 733762 kern.notice] 16
Nov 16 19:23:50 pelu genunix: [ID 733762 kern.notice] 9
Nov 16 19:23:50 pelu genunix: [ID 904073 kern.notice] done
Nov 16 19:23:50 pelu genunix: [ID 353387 kern.notice] dumping to
/dev/dsk/c1t0d0s1, offset 65536
Nov 16 19:23:50 pelu genunix: [ID 409368 kern.notice] ^M100% done: 27864
pages dumped, compression ratio 8.02,
Nov 16 19:23:50 pelu genunix: [ID 851671 kern.notice] dump succeeded
Nov 16 19:30:29 pelu genunix: [ID 540533 kern.notice] ^MSunOS Release 5.8
Version Generic_108528-23 64-bit
Nov 16 19:30:29 pelu genunix: [ID 913632 kern.notice] Copyright 1983-2003
Sun Microsystems, Inc. All rights reserved.
Nov 16 19:30:29 pelu genunix: [ID 678236 kern.info] Ethernet address =
0:3:ba:45:a4:8d
Nov 16 19:30:29 pelu unix: [ID 389951 kern.info] mem = 4194304K
(0x100000000)
Nov 16 19:30:29 pelu unix: [ID 930857 kern.info] avail mem = 4116856832
Nov 16 19:30:29 pelu rootnex: [ID 466748 kern.info] root nexus = Sun Fire
V210
Nov 16 19:30:29 pelu rootnex: [ID 349649 kern.info] pcisch2 at root: SAFARI
0x1c 0x600000
Nov 16 19:30:29 pelu genunix: [ID 936769 kern.info] pcisch2 is
/pci@1c,600000

  What's the problem please?

  Regards,
  Mauricio.
_______________________________________________
sunmanagers mailing list
sunmanagers@sunmanagers.org
http://www.sunmanagers.org/mailman/listinfo/sunmanagers



This archive was generated by hypermail 2.1.7 : Wed Apr 09 2008 - 23:27:33 EDT