From: Joachim Andreas Tingvold
To: Jochen Ulrich
CC: "alice-hlt-cluster-admin (ALICE HLT Cluster Admin List)"
Subject: Re: SysMES detected BMC errors
Date: Thu, 23 Jun 2011 12:50:07 +0200
Message-ID: <9F692E51-FB3A-4784-858E-EA53A5F10F3F@cern.ch>
In-Reply-To: <25B2CFAAEF514F4CB8B2481D800BE07D@firebat>

On Thu, Jun 23, 2011, at 10:49:35AM GMT+02:00, Jochen Ulrich wrote:

> Yes, it is a false-positive but it's not the SysMES monitor that
> produces it but it's the BMC itself.
> The SysMES monitor simply reads the BMC log and sends every
> "critical" or "non-recoverable" message it finds in the log.

Still, couldn't it just poll the sensor directly when SysMES
triggers on the "critical" and "non-recoverable" messages in the log?
So something like this?

Log entry triggers SysMES -> Wait 1 minute -> Poll sensors directly,
check values -> Wait 1 minute -> Poll sensors directly, check values
-> If still below threshold, send alarm

-- 
Joachim
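
The flow proposed above could be sketched roughly as follows, in Python. This is only an illustration under assumptions: `confirm_alarm`, `read_sensor`, `threshold`, `checks` and `delay_s` are hypothetical names and not part of SysMES, and how the sensor is actually polled is left to the caller. The sketch also assumes that a single reading back at or above the threshold is enough to treat the log entry as a false positive.

```python
import time
from typing import Callable


def confirm_alarm(
    read_sensor: Callable[[], float],   # hypothetical: polls the sensor directly
    threshold: float,                   # hypothetical: lower limit for the reading
    checks: int = 2,                    # two re-polls, as in the flow above
    delay_s: float = 60.0,              # "wait 1 minute" before each poll
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Return True only if every delayed re-poll is still below threshold."""
    for _ in range(checks):
        sleep(delay_s)
        if read_sensor() >= threshold:
            # Assumption: a recovered reading means the log entry was spurious.
            return False
    return True
```

The monitor would presumably call something like this once per "critical" or "non-recoverable" log entry and send the alarm only when it returns True.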