Primjena MapReduce algoritma na analizu nizova tekstualnih podataka

Volarić, Karolina (2014) Primjena MapReduce algoritma na analizu nizova tekstualnih podataka. Diploma thesis, Faculty of Science > Department of Mathematics.

[img] PDF
Restricted to Registered users only
Language: Croatian

Download (688kB) | Request a copy

Abstract

This thesis describes the operation and use of Apache Hadoop and its components. The most important component is MapReduce. To use MapReduce algorithm it is necessary to understand its mode of operation, and learn some of the rules for writing the algorithm as well as the use of the combiner function. In order to fully understand the concept of Hadoop, the following concepts are explained: Flume, Hive, HDFS and Oozie. Use of Hadoop and MapReduce is shown in the analysis of social network Twitter. Data were collected according to certain conditions using Apache Flume, then they were processed with Oozie-operation and queried using Hive.

Item Type: Thesis (Diploma thesis)
Supervisor: Grubišić, Luka
Date: 2014
Number of Pages: 37
Subjects: NATURAL SCIENCES > Mathematics
Divisions: Faculty of Science > Department of Mathematics
Depositing User: Iva Prah
Date Deposited: 01 Sep 2015 11:55
Last Modified: 01 Sep 2015 11:55
URI: http://digre.pmf.unizg.hr/id/eprint/4203

Actions (login required)

View Item View Item