2,010
13
Research Paper, 2 pages (350 words)

Hadoop software for large amount of data

Its a platform managed under the apache software foundation and its an open source and its deal with big data with any data type structer semi structer or unstructers and give the result in very short time it allows to work with structured and unstructured data arrays of dimension from 10 to 100 gb and even more.

V. Burunova and its structer is a group of clusters or one each of them contains groups of nodes too and each cluster has two type of node name node and data node name node is a unique node on cluster and it knows any data block location on cluster and data node is the remining node in cluster and that have done by using a set of servers which called a cluster.

Hadoop has two layers cooperate together first layer is mapreduce and it task is divided data processing across multiple servers and the second one is hadoop distributed file system hdfs and its task is storing data on multiple clusters and these data are separated as a set of blocks. Hadoop make sure the work is correct on clusters and it can detect and retrieve any error orfailurefor one or more of connecting nodes and by this way hadoop efforts increasing in core processing and storage size and high availability. Hadoop is usually used in a large cluster or a public cloud service such as yahoo.

Facebook twitter and amazon hadeer mahmoud 2018 hadoops features: Scalable: Hadoop able to work with huge applications and it can run analyze store process distribute large amount of data across thousands of nodes and servers which handle thousands terabytes of data or more also it can add additional nodes to clusters and these servers work parallel. Hadoop better than traditional relational database systems because rdbms cant expand to deal with huge data.

Single write multiple read the data on cluster can be read from multiple source at the same time data avalibility: When data is sent to a data node that hadoop creates multiple copies of data on other nodes in the cluster to keep data available if there a failure on one of nodes on cluster.

Thank's for Your Vote!
Hadoop software for large amount of data. Page 1
Hadoop software for large amount of data. Page 2
Hadoop software for large amount of data. Page 3

This work, titled "Hadoop software for large amount of data" was written and willingly shared by a fellow student. This sample can be utilized as a research and reference resource to aid in the writing of your own work. Any use of the work that does not include an appropriate citation is banned.

If you are the owner of this work and don’t want it to be published on AssignBuster, request its removal.

Request Removal
Cite this Research Paper

References

AssignBuster. (2022) 'Hadoop software for large amount of data'. 15 September.

Reference

AssignBuster. (2022, September 15). Hadoop software for large amount of data. Retrieved from https://assignbuster.com/hadoop-software-for-large-amount-of-data/

References

AssignBuster. 2022. "Hadoop software for large amount of data." September 15, 2022. https://assignbuster.com/hadoop-software-for-large-amount-of-data/.

1. AssignBuster. "Hadoop software for large amount of data." September 15, 2022. https://assignbuster.com/hadoop-software-for-large-amount-of-data/.


Bibliography


AssignBuster. "Hadoop software for large amount of data." September 15, 2022. https://assignbuster.com/hadoop-software-for-large-amount-of-data/.

Work Cited

"Hadoop software for large amount of data." AssignBuster, 15 Sept. 2022, assignbuster.com/hadoop-software-for-large-amount-of-data/.

Get in Touch

Please, let us know if you have any ideas on improving Hadoop software for large amount of data, or our service. We will be happy to hear what you think: [email protected]