A comprehensive study and review of tuning the performance on database scalability in big data analytics.

التفاصيل البيبلوغرافية
العنوان:	A comprehensive study and review of tuning the performance on database scalability in big data analytics.
المؤلفون:	Sundarakumar, M.R., Mahadevan, G., Natchadalingam, R., Karthikeyan, G., Ashok, J., Manoharan, J. Samuel, Sathya, V., Velmurugadass, P.
المصدر:	Journal of Intelligent & Fuzzy Systems; 2023, Vol. 44 Issue 3, p5231-5255, 25p
مصطلحات موضوعية:	DATABASES, BIG data, DATA libraries, PYTHON programming language, BATCH processing, ELECTRONIC data processing
مستخلص:	In the modern era, digital data processing with a huge volume of data from the repository is challenging due to various data formats and the extraction techniques available. The accuracy levels and speed of the data processing on larger networks using modern tools have limitations for getting quick results. The major problem of data extraction on the repository is finding the data location and the dynamic changes in the existing data. Even though many researchers created different tools with algorithms for processing those data from the warehouse, it has not given accurate results and gives low latency. This output is due to a larger network of batch processing. The performance of the database scalability has to be tuned with the powerful distributed framework and programming languages for the latest real-time applications to process the huge datasets over the network. Data processing has been done in big data analytics using the modern tools HADOOP and SPARK effectively. Moreover, a recent programming language such as Python will provide solutions with the concepts of map reduction and erasure coding. But it has some challenges and limitations on a huge dataset at network clusters. This review paper deals with Hadoop and Spark features also their challenges and limitations over different criteria such as file size, file formats, and scheduling techniques. In this paper, a detailed survey of the challenges and limitations that occurred during the processing phase in big data analytics was discussed and provided solutions to that by selecting the languages and techniques using modern tools. This paper gives solutions to the research people who are working in big data analytics, for improving the speed of data processing with a proper algorithm over digital data in huge repositories. [ABSTRACT FROM AUTHOR]
	Copyright of Journal of Intelligent & Fuzzy Systems is the property of IOS Press and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات:	Complementary Index

الوصف
تدمد:	10641246
DOI:	10.3233/JIFS-223295