it was last tuesday when we encountered a major (major?) trouble in our system. in the middle of that dull afternoon, all plant operations went kaput. the usual noise that we hear from the huge equipment was replaced with unbelievable silence and everyone literally wondered what went wrong.
after a quick analysis, we were able to trace the problem back to our control system. and since it fell on my area of expertise, it was up to me come up with the best solution at the fastest available time.
we were able to resume operations an hour after. then, i had a short meeting with the area managers and discussed with them the cause of the problem and the steps undertaken to correct it. however, since most of the items i was talking about were technical in nature, they requested me to draft a report "in layman's terms" for them to understand the situation better.
when i opened my mailbox the next morning, i found this reply from one of the area managers.
Thanks John Stan.
In the possible cause of the problem, is an error that occurs in the ECS through NOE A link as it switches to link NOE B the link is busy. What will happen to this error/s and how will it be resetted? Does this will not causes congestion of errors then later will give traffic and busy signal to the system and will cause again communication error if ever another switching is required? Please give us your insight on this.
i was at a loss for words.