Publication Details
Issue: Vol 7, No 3 (2026)
Pages: 37-44
ISSN: 2660-5317

Abstract

Growing data volumes across industries demand high-performance solutions for Big Data processing. Traditional computing techniques handle huge datasets inefficiently because of processing and memory constraints. This work applies distributed AI algorithms on HPC platforms to improve Big Data processing performance. Distributed Random Forest and Deep Neural Network models were evaluated on multi-core CPUs and GPU clusters. Memory optimization and cache reuse were employed to minimize data access latency. Experiments on synthetic healthcare and financial datasets show marked improvements in processing time, prediction accuracy, and power consumption. The results demonstrate the efficacy of combining distributed AI strategies with HPC for scalable, high-performance Big Data analysis.

Keywords
Distributed AI, High-Performance Computing (HPC), Big Data processing, Apache Spark, GPU acceleration, Random Forest, Deep Neural Networks, energy consumption, scalability