mom: missing data in top 100 mailboxes by size...

ran into a problem today where an administrator told me that their "top 100 mailboxes by size" report stopped working. now, to define the boundaries of "stopped working", let's establish a couple of things:
  1. report works for other servers
  2. report worked a month ago
i generally follow these basic rules to troubleshoot a report issue:
  1. check if the report contains data
  2. check if data exists in the console
  3. check if data exists in the reporting tables
so going on the boundaries of "stopped working", we start with step 1. i opened up the report to verify that what the person was telling me matched what i was seeing. in this case, it actually did. :o i switched the report to a different server and data came up. i switched it back to the problem server and no data came back. perplexing!
on to step 2. to make this easier, i created some custom performance views in the console on the object "mcexchdg", breaking it down into "mailbox mb" and "mailbox message count". both views work but again, same problem: no data is posted for the problem server. at this point, we know the rules must be working because other servers are posting data. this is definitely an agent-side problem then... right?
since the person reported that this report used to work for this same server, we go to step 3. i ran a little query that looked like this, just to see what was there:
select top 10 * from sdkperformanceview where computername = 'problemserver' and performanceobjectname = 'mcexchdg'
i get back counters for mcexchdg but not the expected "mailbox mb" counter! hmmm. so apparently the data never did exist for this server. at least now we know the thing never worked for this server, rather than something having changed along the way. to figure out what was happening, instead of being a good administrator and googling it, potentially finding results that could have saved me hours, i decided to run the script "exchange 2003 - collect mailbox statistics" by hand to troubleshoot. to do that, i made the following changes to it:
  • replace all createevent entries with wscript.echo
  • comment out on error resume next
  • use a local reference to the computer name instead of scriptcontext.targetnetbioscomputer
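the three edits above could also be applied mechanically. here's a minimal python sketch of that idea, not the method used in this post. it assumes the event calls are statements that begin a line with createevent, and that the script references scriptcontext.targetnetbioscomputer literally. the function name prepare_for_console and the server name in the usage note are made up for illustration.

```python
import re

# assumption: event calls are statements that start a line with "CreateEvent"
CREATE_EVENT = re.compile(r"^(\s*)CreateEvent\b", re.IGNORECASE)
# matches an uncommented "On Error Resume Next" statement
ON_ERROR = re.compile(r"^(\s*)(On\s+Error\s+Resume\s+Next\b.*)$", re.IGNORECASE)
# the property the script normally uses to learn which server it targets
TARGET = re.compile(r"ScriptContext\.TargetNetbiosComputer\b", re.IGNORECASE)


def prepare_for_console(source: str, computer_name: str) -> str:
    """Return a copy of a VBScript source adjusted for running by hand."""
    out = []
    for line in source.splitlines():
        # 1. echo to the console instead of creating an event
        line = CREATE_EVENT.sub(r"\1WScript.Echo", line)
        # 2. comment out error suppression so failures actually surface
        line = ON_ERROR.sub(r"\1' \2", line)
        # 3. swap the agent-supplied computer name for a quoted literal
        line = TARGET.sub(lambda m: '"' + computer_name + '"', line)
        out.append(line)
    return "\n".join(out)
```

calling prepare_for_console(script_text, "EXCH01") returns the modified script text, which you would save to a new .vbs file and leave the original alone. it only touches lines that match those three patterns, so calls written as "call createevent(...)" would still need a manual edit.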
now i figured when i ran it, i'd get errors. i did! i had to use an interactive cmd shell to make this work since the script demands to run as "localsystem". after that, i had to run through it for a few iterations: find where it failed, comment out the failing part, and move on. so what was the end result? you have to have the servernameMOM accounts. in this case, it turned out we did... however, someone had inadvertently renamed the servernameMOM account!
