GRID – Processing Network Management Data

The Problem

Customer processing of network management data take 23+ hours to complete for a 24 hour daya of collection. Any processing disruption results in run times exceeding 24 hours and results in full data sets being lost.

Situation Assessment

Long processing times caused by numerous serial CPU-intensive processes.
Occasional data errors not properly handled, causing processing to ‘hang’ and causing processing to exceed the critical 24-hour window.
No capacity to add additional network management data sets of interest.

Our Solution

Re-engineer serial processes into 43 parallel routines that run independently of each other.
Partitioning means that, in the now-unlikely event of a stalled process, only that small data set is affected.
Add fault detection and handling to significantly reduce the possibility of stalled processes.
Implementation uses low-cost, open-source, high-throughput processing GRID running on hardened LINUX OS and X86 blade hardware.

Benefit

Problem of dropped data sets completely eliminated.
Solution processes 15x the amount of data in 1/12 the time – an almost 200x efficiency improvement.
Solution is economically scalable and inherently less expensive to maintain.
Data collection can be expanded to additional network elements with existing solution.