How to configure SCOM for Exchange 2010
There are 2 types of alerts. Monitoring based alerts, these are for services and are in one of 2 states, good or bad (green or red). The second are rule based alerts, these are based on events, number of times it repeats which can indicate on going issues.
1: Responding to an un-known alert
a) Close it, see if it comes back (ie: is this a 1 time occurance)
b) Research, what is the root cause, what is triggering the alert? Eg: service, event ID, perfmon
- Can it be fixed? Write KB before closing alert
- Can it be fixed and Optimised? Ie: increase threshold.
- look at performance data, baseline vs Spike
- If it is a spike, increase the threshold above spike
- If Baseline is changing (ie: gradually increasing), incestigate root cause.
- If the alert can be ignored/override, add company information to explain override.
2: Responding to a known Alert
· Apply fix in Knowledge base
· Reset health state if is it a monitor based alert
· Close it if it is a rule based alert, re-calculate health.
How to in SCOM
1: Override a monitor/rule state (enable/disable)
Never select Disable.
2: Override a monitor/rule setting (eg: frequency, threshold)
a) Click on “Authoring”>Management Pack Objects> Monitors. Ensure you are not bound by scope, click on the “X” that may pop up at the top.
b) In the look for field, type (for example) msftesql
c) Observe that the processor time >75% is a configurable rule, in this example we want to increase the threshold. To configure, Right click on “KHI: msftesql”>Overrides>override the monitor> For all objects of class: content indexing service.
d) At the next screen, select frequency by checking the box, change the over5ride value from 300 to 600 (for this example). Select the management pack to save to as “Exchange Server 2010 MP Customizations. Click on Edit and put in some information about why this override is being put in place. Click apply.
3: Create a new monitor/rule to capture, by examples a new EVENT ID.
In this example we are going to create a rule to alert us every time Event 15004 is logged.
a) Go to the Authoring Section, Expand Management Pack objects and right click on rules> Create New Rule
b) In the next window, select NT event log (alert). Under the Management Pack drop down menu, select Exchange server 2010 MP Customizations.
c) Type in the Rule name, Next to Rule Target, Click Select
d) In the next window, under “Look For” type, Exchange and in the list, select “Microsoft Exchange 2010 server” and select “View All Targets” (this will alert you whenever event ID 15004 is present on any exchange server). Click OK
e) At the next screen, ensure Application is selected, click Next
f) Type in the Event ID number (in this case 15004), and the event Source, click on the dialogue box and select MSExchange Transport. Click Next.
g) For this alert we want it to be a high priority, and critical severity, select each from the drop down menu, then click Create. Finish.
4: Create a new view in SCOM
a) You can only do this on a folder that is not locked. So for example, you want to create a new view for all closed alerts in SCOM.
b) Right click on Monitoring> Exchange Server 2010 MP Customizations. Right click, New> Alert View.
c) Under name, type, “Closed Alerts”. Under the Criteria Tab> Show Data Related to, select “Microsoft Exchange 2010 All entities Group. Under “Select Conditions”, choose “with specific resolution state” then click on the criteria descript below and select “Closed (255)”
d) On the display tab, check “Path”, “Last modified” and “Repeat Count” check boxes in addition to the default checked boxes. Under sort columns by, first one select “Last Modified” and sort descending. Then group items by, select “Severity” sort Descending
e) Observe new alert view
5: Generate/view performance data to create a baseline
a) To find a performance counter, Open Monitoring> Exchange 2010 > Performance
b) For this example we are going to search on alert: “Transport latency impacted – 99th percentile of messages not meeting SLA over last 30 min - Red (>90). In the look for field, select “Items by text search”, type in Latency for this example in order to find out counter.
c) Select the counter by checking the appropriate box. Right click> select time range, on the graph to change the time range to get a clearer picture of whether the counter was a momentary spike or requires threshold review.
d) Select 7 days as the time range option, click OK.
e) Observe that this case is a spike and there is no further action required.
e) If it is a repetative spike, then you want to increase the threshold of that alert to above the highest figure it reached
6: Generate reports
7: Add Company Knowledge
a) Need Microsoft office & visual studio 2005 for Microsoft office system installed in order to edit this. http://technet.microsoft.com/en-us/library/cc974492.aspx
b) Log on to the computer with an account that is a member of the Operations Manager Authors role for the Operations Manager 2007 management group.
c) Click Authoring to open the Authoring workspace.
d) Locate the monitor or rule to be documented.
e) Click Properties under Actions, or right-click the monitor name and select Properties from the shortcut menu.
f) Click the Company Knowledge tab.
g) In the Management pack section, select a management pack in which to save the company knowledge. As a best practice, you should not save changes to the Default Management Pack.
h) Click Edit to launch Microsoft Office Word.
i) Add or edit text as desired.
a. The company knowledge tab displays only the sections of the Word document with custom text.
j) On the File menu, click Save to save your changes.
k) Return to the company knowledge tab and click Save, and then click Close. This will close both the properties dialog box and Word.
8: How to determine if an alert is Rule or Monitor based.
a) In this example we want to disable an alert “Realtime Scan Enabled State Monitor” which is a ForeFront based alert.
You can search for this alert under the alerts tab and under Alert Monitor, it will tell you if it is a rule or monitor, in this case it is a monitor.
Normally you would go to Authoring>Monitors and in the Look for: “realtime scan” and run a search and place your overrides there but you can also click on the Alert monitor (as above) and it will immediately bring you all the options to configure the alert.
b) In this case we want to turn off this alert. Click on overrides tab>override> for all objects of class.
c) Give this an override value of False, place a comment in the edit field, Select “Microsoft Forefront Protections 2010 MP Customizations” for this case.
9: Enable Event Collection for Synthetic Transaction Rules (Do not do this, this is just for testing purposes)
Enable Event Collection for Synthetic Transaction Rules
The Exchange 2010 Management Pack uses synthetic transactions, for example, running the Test-MapiConnectivity, Test-OwaConnectivity, and other commands, to scan your Exchange organization for basic connection responses and to test simple operations such as signing in to a mailbox. Whether these tests succeed or fail, their output is useful for investigating the state of the Exchange environment. However, because there is a large amount of output for each task, the event output isn't stored by default. The views for these tests in the Operations Console are populated if you enable the event collection rules for each respective test. For more information about synthetic transactions, see Monitoring by Using Synthetic Transactions in the System Center Operations Manager 2007 R2 documentation.
When you enable these collection rules, make sure you have sufficient disk space to accommodate the additional data. Each task creates from 4 to 12 event messages every time it runs. By default, each test runs every five minutes.
To enable the event collection rules for synthetic transaction output:
1. In System Center Operations Manager 2007, click Authoring.
2. In the Authoring pane, expand Management Pack Objects, and then click Rules.
3. In the Rules pane, click Change Scope.
4. In the Scope Management Pack Target(s) by object dialog box, in the Look for box, type "Exchange Server 2010."
5. Click View all targets.
6. Click Select All if it’s not disabled (it is only disabled when all rows are already selected).
7. Click OK to close the dialog box.
8. After the rules have loaded, type "Script event collection" in the Look for box near the top of the console.
9. For each test task that you would like to enable, perform the following steps:
a. Right-click the rule and select Overrides > Override the Rule > For all objects of class:<class name>.
b. Select the Override check box.
c. Set the override value to True.
d. Click OK.
It will take some time for the overrides to be picked up by the agents and for events to appear in the built-in views.