Q3 - To Run A Basic Word Count MapReduce

This document provides a simple Word Count program in R as an example of how the MapReduce paradigm works. The program takes input text from a file, maps it by counting the occurrences of each word, reduces the counts, and outputs the results. While this example runs locally, a true MapReduce system would distribute the map and reduce steps across multiple nodes for processing large datasets.


Running a Word Count MapReduce program typically involves using a distributed

computing framework like Apache Hadoop. However, since you're asking for a basic
example, I'll provide a simple Word Count program in R, which won't be distributed
but will give you an idea of how the MapReduce paradigm works.

First, you need R installed on your system. If you haven't installed it yet, you can
download it from the official R website: [R Project](https://www.r-project.org/).

Here's a basic Word Count program in R:

```R
# Word Count MapReduce program in R

# Map step: split a line of text into words and emit (word, 1) pairs,
# represented as a named numeric vector (names = words, values = 1)
map <- function(text) {
  words <- unlist(strsplit(text, "\\s+"))
  words <- words[words != ""]  # drop empty tokens from repeated whitespace
  setNames(rep(1, length(words)), words)
}

# Reduce step: sum the counts emitted for a single word
# (tapply passes only the grouped values, so no key argument is needed)
reduce <- function(values) {
  sum(values)
}

# Read input text from a file and normalize case
input_file <- "input.txt"
text <- tolower(readLines(input_file))

# Map step: one call per line, then flatten the (word, 1) pairs
mapped_data <- unlist(lapply(text, map))

# Reduce step: group the pairs by word (their names) and sum the counts
result <- tapply(mapped_data, names(mapped_data), reduce)

# Print the word count
cat("Word Count:\n")
print(result)

# Save the result to a tab-separated output file
write.table(data.frame(word = names(result), count = result),
            file = "output.txt",
            quote = FALSE, row.names = FALSE, sep = "\t")
```

This is a basic example, and it assumes you have a file named `input.txt` in the
same directory with the text you want to analyze.
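For instance, `input.txt` (an illustrative example, not a required input) might contain a couple of lines like:

```
the quick brown fox
the lazy dog jumps over the lazy dog
```

Each line is mapped independently, which is what makes the map step easy to parallelize.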

To run this program:

1. Save the code to a file, e.g., `wordcount.R`.
2. Create an input file (`input.txt`) with the text you want to analyze.
3. Open R in your terminal or RStudio.
4. Run the script using `source("wordcount.R")`.

The word count result will be printed, and an output file (`output.txt`) will be
created with the word count information.

Please note that this example is for educational purposes and doesn't leverage the
parallel processing capabilities of a true MapReduce system. In a real distributed
environment, such as Apache Hadoop, the Map and Reduce steps would be executed
across multiple nodes to handle large-scale data.
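As a rough local illustration of how the map step parallelizes, R's built-in `parallel` package can fan the map calls out across CPU cores. This is only a sketch with a small inline input, not true Hadoop-style distribution, and it repeats the `map`/`reduce` functions so it runs stand-alone:

```R
library(parallel)

# Same map/reduce logic as above, repeated so this sketch is self-contained
map <- function(text) {
  words <- unlist(strsplit(tolower(text), "\\s+"))
  words <- words[words != ""]
  setNames(rep(1, length(words)), words)
}
reduce <- function(values) sum(values)

# A small inline input standing in for the file
text <- c("the quick brown fox", "the lazy dog")

# Fan the map calls out across local cores; forking is unavailable
# on Windows, so fall back to a single core there
cores <- if (.Platform$OS.type == "windows") 1L else detectCores()
mapped_data <- unlist(mclapply(text, map, mc.cores = cores))

# Group by word and reduce, exactly as in the serial version
result <- tapply(mapped_data, names(mapped_data), reduce)
print(result)
```

In a real framework this fan-out happens across machines rather than cores, and a shuffle phase routes all pairs for the same word to the same reducer, but the map/group/reduce shape is the same.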
