GRID – Processing Network Management Data
The Problem
Customer processing of network management data takes 23+ hours to complete for a 24-hour day of collection. Any processing disruption pushes run times past 24 hours, and full data sets are lost.
Situation Assessment
- Long processing times caused by numerous serial CPU-intensive processes.
- Occasional data errors are not properly handled, causing processing to ‘hang’ and exceed the critical 24-hour window.
- No capacity to add further network management data sets of interest.
Our Solution
- Re-engineer serial processes into 43 parallel routines that run independently of each other.
- Partitioning means that, in the now-unlikely event of a stalled process, only that small data set is affected.
- Add fault detection and handling to significantly reduce the possibility of stalled processes.
- Implementation uses a low-cost, open-source, high-throughput processing GRID running on a hardened Linux OS and x86 blade hardware.
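The partitioning idea in the list above can be illustrated with a minimal sketch. It is not the customer implementation: the function names, the partition keys, and the "sum the samples" workload are hypothetical, and a thread pool stands in for the grid nodes so the example stays self-contained. CPU-bound work in practice would run in separate processes or on separate machines. The sketch covers isolation of data errors only, not detection of stalled processes.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def process_partition(records):
    """Hypothetical per-partition work: sum numeric samples, rejecting bad data."""
    total = 0.0
    for value in records:
        if not isinstance(value, (int, float)):
            # Fail fast on a bad record instead of hanging on it.
            raise ValueError(f"bad record: {value!r}")
        total += value
    return total


def run_partitions(partitions, max_workers=4):
    """Run each partition independently; a failure affects only that partition."""
    results, failures = {}, {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = {
            pool.submit(process_partition, records): name
            for name, records in partitions.items()
        }
        for future in as_completed(futures):
            name = futures[future]
            try:
                results[name] = future.result()
            except Exception as exc:
                # Record the failure and keep going with the other partitions.
                failures[name] = str(exc)
    return results, failures


if __name__ == "__main__":
    # Hypothetical data: one partition contains a corrupt record.
    partitions = {
        "router-a": [1, 2, 3],
        "router-b": [4, "corrupt", 6],
        "router-c": [7.5],
    }
    results, failures = run_partitions(partitions)
    print("completed:", sorted(results))
    print("failed:", sorted(failures))
```

In this sketch the corrupt record costs only the one partition that contains it; the remaining partitions still complete and report their results.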
Benefit
- Problem of dropped data sets completely eliminated.
- Solution processes 15x the amount of data in 1/12 the time, about a 180x improvement in throughput (15 × 12).
- Solution is economically scalable and inherently less expensive to maintain.
- Data collection can be expanded to additional network elements with the existing solution.
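The combined efficiency figure in the Benefit list follows from multiplying the two quoted factors. The short calculation below works it through. The variable names are illustrative, and applying the 1/12 factor to a 23-hour baseline is an assumption, since the source states the original run time only as "23+ hours".

```python
from fractions import Fraction

data_factor = 15                 # 15x the amount of data (from the Benefit list)
time_factor = Fraction(1, 12)    # completed in 1/12 the time (from the Benefit list)

# Throughput gain = more data divided by less time.
throughput_gain = data_factor / time_factor
print(throughput_gain)  # 180

# Assumption: the 1/12 factor applies to a 23-hour baseline run.
baseline_hours = 23
new_hours = baseline_hours * time_factor
print(float(new_hours))
```

Under that assumption the daily run would finish in under two hours, which leaves most of the 24-hour window as headroom.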