CFP last date
20 May 2025
Reseach Article

A Novel Proximity-based Sorting Algorithm for Real-Time Numerical Data Streams and Big Data Applications

by Japheth Kodua Wiredu, Edward Yellakuor Baagyere, Callistus Ireneous Nakpih, Iven Aabaah
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 71
Year of Publication: 2025
Authors: Japheth Kodua Wiredu, Edward Yellakuor Baagyere, Callistus Ireneous Nakpih, Iven Aabaah
10.5120/ijca2025924567

Japheth Kodua Wiredu, Edward Yellakuor Baagyere, Callistus Ireneous Nakpih, Iven Aabaah . A Novel Proximity-based Sorting Algorithm for Real-Time Numerical Data Streams and Big Data Applications. International Journal of Computer Applications. 186, 71 ( Mar 2025), 1-10. DOI=10.5120/ijca2025924567

@article{ 10.5120/ijca2025924567,
author = { Japheth Kodua Wiredu, Edward Yellakuor Baagyere, Callistus Ireneous Nakpih, Iven Aabaah },
title = { A Novel Proximity-based Sorting Algorithm for Real-Time Numerical Data Streams and Big Data Applications },
journal = { International Journal of Computer Applications },
issue_date = { Mar 2025 },
volume = { 186 },
number = { 71 },
month = { Mar },
year = { 2025 },
issn = { 0975-8887 },
pages = { 1-10 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number71/a-novel-proximity-based-sorting-algorithm-for-real-time-numerical-data-streams-and-big-data-applications/ },
doi = { 10.5120/ijca2025924567 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-03-06T21:09:24.880107+05:30
%A Japheth Kodua Wiredu
%A Edward Yellakuor Baagyere
%A Callistus Ireneous Nakpih
%A Iven Aabaah
%T A Novel Proximity-based Sorting Algorithm for Real-Time Numerical Data Streams and Big Data Applications
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 71
%P 1-10
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

As the volume of data generated and processed continues to grow exponentially, the demand for innovative and efficient sorting algorithms has become increasingly critical. Traditional sorting algorithms, while effective in certain scenarios, often struggle with the challenges posed by large-scale datasets, particularly in terms of memory usage and time complexity. This paper evaluates six primary sorting algorithms, identifying key limitations such as high memory consumption in merge sort and inefficiency in bubble sort for large datasets. To address these challenges, we introduce the Proximity-Based Pivot Sort (PBPS), an advanced sorting algorithm designed to optimize performance in real-time numerical data streams and big data applications. The proposed PBPS algorithm leverages proximity-based principles, such as absolute numerical difference, to group and sort similar elements efficiently, reducing unnecessary comparisons and computational overhead. By incorporating dynamic pivot selection (initially using the last element as the pivot, with plans to explore more advanced selection strategies in future work) and adaptive merging strategies, PBPS achieves significant improvements in both time complexity and memory efficiency. Experimental results demonstrate that PBPS outperforms traditional methods, including merge sort, radix sort, heap sort, and quicksort, particularly with datasets that exhibit a high degree of data locality. For instance, PBPS can process up to approximately 2000 sorted elements—twice the number managed by standard quicksort—while reducing execution time by up to 40% and improving memory efficiency by 30%. PBPS outperformed the other algorithms in tests measuring memory usage and execution time, making it a superior choice for handling large-scale datasets. The PBPS algorithm is particularly well-suited for real-time data processing and big data analytics, offering faster insights and streamlined data processing. Its ability to handle large-scale datasets with minimal latency makes it a valuable tool for applications such as financial trading, IoT sensor networks, and real-time analytics. By addressing the limitations of traditional sorting methods, PBPS represents a significant advancement in sorting algorithm design, providing a more efficient solution for modern data-intensive environments.

References
  1. Sarker, I. H. (2021). Machine learning: Algorithms, realworld applications and research directions. SN computer science, 2(3), 160.
  2. Mohammadi, M., Al-Fuqaha, A., Sorour, S., & Guizani, M. (2018). Deep learning for IoT big data and streaming analytics: A survey. IEEE Communications Surveys & Tutorials, 20(4), 2923-2960.
  3. Mohamed, A., Najafabadi, M. K., Wah, Y. B., Zaman, E. A. K., & Maskat, R. (2020). The state of the art and taxonomy of big data analytics: view from new big data framework. Artificial intelligence review, 53, 989-1037. Artificial intelligence review, 53, 989-1037.
  4. Ros, ca, C. M., & C˘arbureanu, M. (2024, March). A Comparative Analysis of Sorting Algorithms for Large-Scale Data: Performance Metrics and Language Efficiency. In International Conference on Emerging Trends and Technologies on Intelligent Systems (pp. 99-113). Singapore: Springer Nature Singapore.
  5. Ben Jmaa, Y., Ben Atitallah, R., Duvivier, D., & Ben Jemaa, M. (2019). A comparative study of sorting algorithms with FPGA acceleration by high level synthesis. Computaci´on y Sistemas, 23(1), 213-230.
  6. Lee, I., & Lee, K. (2015). The Internet of Things (IoT): Applications, investments, and challenges for enterprises. Business horizons, 58(4), 431-440.
  7. Alam, M. A., Nabil, A. R., Mintoo, A. A., & Islam, A. (2024). Real-Time Analytics In Streaming Big Data: Techniques And Applications. Journal of Science and Engineering Research, 1(01), 104-122.
  8. Lee, K. S., Lee, S. R., Kim, Y., & Lee, C. G. (2017). Deep learning–based real-time query processing for wireless sensor network. International Journal of Distributed Sensor Networks, 13(5), 1550147717707896.
  9. Almusallam, N. Y., Tari, Z., Bertok, P., & Zomaya, A. Y. (2017). Dimensionality reduction for intrusion detection systems in multi-data streams—A review and proposal of unsupervised feature selection scheme.Emergent Computation: a Festschrift for Selim G. Akl, 467-487.
  10. De Assuncao, M. D., da Silva Veith, A., & Buyya, R. (2018). Distributed data stream processing and edge computing: A survey on resource elasticity and future directions. Journal of Network and Computer Applications, 103, 1-17.
  11. Fan, Y., Hu, Z., Fu, L., Cheng, Y., Wang, L., & Wang, Y. (2024, November). Research on Optimizing Real-Time Data Processing in High-Frequency Trading Algorithms using Machine Learning. In 2024 6th International Conference on Intelligent Control, Measurement and Signal Processing (ICMSP) (pp. 774-777). IEEE.
  12. Pillai, V. (2023). Integrating AI-Driven Techniques in Big Data Analytics: Enhancing Decision-Making in Financial Markets. International Journal of Engineering and Computer Science, 12(07), 10-18535.
  13. Aftab, A., Ali, M. A., Ghaffar, A., Shah, A. U. R., Ishfaq, H. M., & Shujaat, M. (2021). “Review on performance of quick sort algorithm”. International Journal of Computer Science and Information Security (IJCSIS), 19 (2).
  14. Alotaibi, A., Almutairi, A., & Kurdi, H. (2020). Onebyone (obo): “A fast sorting algorithm”. Procedia Computer Science, 175, 270–277.
  15. Klaib, M. F., Sara, M. R. A., & Hasan, M. (2020). “A parallel implementation of dual-pivot quick sort for computers with small number of processors”. Indonesia Journal on Computing (Indo-JC), 5 (2), 81–90.
  16. Gomez, J. M., Aponte, E.,&Isaacson, B. (2022). “An analysis of non-comparison based sorting algorithms”.
  17. Wiredu, J. K., Aabaah, I., & Acheampong, R. W. (2024). Optimizing Heap Sort for Repeated Values: A Modified Approach to Improve Efficiency in Duplicate-Heavy Data Sets. International Journal of Advanced Research in Computer Science, 15(6).
  18. Purnomo, R., Putra, T. D. (2023). Theoretical Analysis of Standard Selection Sort Algorithm. Sinkron: jurnal dan penelitian teknik informatika, 7(2), 666-673.
  19. Furat, F. G. (2016). “A comparative study of selection sort and insertion sort algorithms”. International Research Journal of Engineering and Technology (IRJET), 3 (12), 326–330.
  20. Murthy, P. (2020). Optimizing cloud resource allocation using advanced AI techniques: A comparative study of reinforcement learning and genetic algorithms in multi-cloud environments. World Journal of Advanced Research and Reviews, 2..
  21. Knebl, H. (2020). “Algorithms and data structures”.
  22. Sara, M. R. A. (2020). “Balanced linked list: Ball”. International Journal of Software En- gineering and Computer Systems, 6 (1), 52–63.
  23. Sharma, N. K., Zhao, C., Liu, M., Kannan, P. G., Kim, C., Krishnamurthy, A., & Sivara- man, A. (2020). “Programmable calendar queues for high-speed packet schedul- ing”. 17th USENIX Symposium on Networked Systems Design and Implementation (NSDI 20), 685–699.
  24. Thabit, D. K. (2021). “Review on performance of quick sort algorithm”. International Jour- nal of Computer Science and Information Security (IJCSIS), 19 (2).
  25. Wild, S. (2016). “Dual-pivot quicksort and beyond: Analysis of multiway partitioning and its practical potential” [Doctoral dissertation, Technische Universit at Kaiserslautern].
  26. Abuba, N. S., Baagyere, E. Y., Nakpih, C. I., & Wiredu, J. K. (2025). OptiFlexSort: A Hybrid Sorting Algorithm for Efficient Large-Scale Data Processing. Journal of Advances in Mathematics and Computer Science, 40(2), 67–81.. https://doi.org/10.9734/jamcs/2025/v40i21970
  27. Wiredu, J. K., Atiyire, B., Abuba, N. S., & Acheampong, R. W. (2024). Efficiency Analysis and Optimization Techniques for Base Conversion Algorithms in Computational Systems. International Journal of Innovative Science and Research Technology, 9, 235-244.
Index Terms

Computer Science
Information Sciences

Keywords

Big-O notation Big-data Applications Numerical Data Streams Real-Time Data Processing Sorting Algorithms Time Complexity Proximity-Based Pivot Sort