Map-Reduce Implementation, Using In-Map Aggregation and Other Features


COMP38120: Documents, Services and Data on the Web

Laboratory Exercise 1.3

Author: Cristiano Ruschel Marques Dias

Description

The indexing algorithm, implemented using the MapReduce architecture, allows whoever has access to the output data to make queries that take into account the position of each word in the document and the number of occurrences of each word in each document. The features implemented were:

Case Folding: A context-based capitalization algorithm was used to decide when to leave a word capitalized. Essentially, whenever a word is at the start of a sentence, it is presupposed that it normally would not be capitalized, and it is therefore lowercased. A lot of thought was given to this, especially to whether it would be worth implementing a full casing algorithm, given that in general not using case matching gives results that are good enough, and arguably as good. Since using it does not have a great impact on performance, and following the opinion given in [1], the algorithm uses it, but results are similar without it.
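As a rough stand-alone illustration of this heuristic (simplified from the caseFolding method in the source below; the class name CaseFoldDemo is hypothetical), a word is lowercased only when it begins a sentence:

```java
public class CaseFoldDemo {
    // Lowercase the first word of each sentence, on the assumption that its
    // capital letter comes from sentence position rather than a proper noun.
    static String caseFold(String text) {
        StringBuilder out = new StringBuilder();
        boolean sentenceStart = true;
        for (String word : text.split(" ")) {
            if (sentenceStart && !word.isEmpty()
                    && Character.isUpperCase(word.charAt(0))) {
                word = word.toLowerCase();
            }
            // A word ending in '.' closes the sentence, so the next word
            // is treated as a sentence start.
            sentenceStart = word.endsWith(".");
            out.append(word).append(" ");
        }
        return out.toString().trim();
    }
}
```

Note that proper nouns at sentence starts ("Bart", say) are also lowercased; the report accepts this trade-off since full case matching adds little.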

Punctuation Treatment: Instead of removing all punctuation, we trim the punctuation from the ends of words, since punctuation in the middle of a word sometimes has meaning, for example in the number 8.5 or in "in-mapper". To do this we also had to separately trim the reference tags generated by Wikipedia, due to their peculiar form [number]; mechanisms to modify the importance of words inside a reference in the document could easily be implemented from this point.
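A minimal stand-alone sketch of this end-trimming idea (simplified from the trimPunctuation and trimTags methods in the source below; the class name TrimDemo and the regex-based tag removal are illustrative assumptions):

```java
public class TrimDemo {
    // '-' is deliberately absent, so "in-mapper" survives intact.
    static final String PUNCT = "!\"#$%&'*+,./:;\\<=>?@[]^_`{|}~()";

    // Strip punctuation from both ends only, so internal punctuation
    // ("8.5", "in-mapper") is preserved. Returns null if nothing is left.
    static String trimPunctuation(String str) {
        int i = 0;
        while (i < str.length() && PUNCT.indexOf(str.charAt(i)) >= 0) i++;
        int j = str.length() - 1;
        while (j >= i && PUNCT.indexOf(str.charAt(j)) >= 0) j--;
        return (j < i) ? null : str.substring(i, j + 1);
    }

    // Drop Wikipedia-style reference tags of the form [number].
    static String trimTags(String str) {
        return str.replaceAll("\\[\\d+\\]", "");
    }
}
```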

Stop Words and Stemming: After the aforementioned steps, stop words (words that do not add information to the text) are removed using the algorithm provided. After this, words are stemmed, also using a provided algorithm.
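The order of these two filters can be sketched as follows. This is a toy version: the stop-word list and the naive suffix stripper below are stand-ins for the provided StopAnalyser and Stemmer classes, which are not reproduced here.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

public class FilterDemo {
    // Toy stop-word list standing in for the provided StopAnalyser.
    static final List<String> STOP = Arrays.asList("the", "a", "of", "and");

    static boolean isStopWord(String w) { return STOP.contains(w); }

    // Naive suffix stripper standing in for the provided Porter-style Stemmer.
    static String stem(String w) {
        if (w.endsWith("ing")) return w.substring(0, w.length() - 3);
        if (w.endsWith("s"))   return w.substring(0, w.length() - 1);
        return w;
    }

    static List<String> process(List<String> tokens) {
        return tokens.stream()
                     .filter(t -> !isStopWord(t))   // drop stop words first
                     .map(FilterDemo::stem)         // then stem the survivors
                     .collect(Collectors.toList());
    }
}
```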

In-Mapper Combining: The MapReduce pattern called in-mapper combining was implemented. This means that instead of being written directly into the context to be treated by the reducer, the key-value pairs of each mapper are pre-processed, so as to lessen the amount of information sent to the reducers and increase the overall speed of the MapReduce run. It was implemented in such a way that the pre-combined key-value pairs are written into the context as soon as the mapper finishes, or once the Map containing them has used too much memory (a constant value can be specified). It is similar to the last implementation found in [2], though no code was copied.
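Stripped of the Hadoop plumbing, the pattern reduces to buffering positions per token in an in-memory map and flushing in bulk. The sketch below shows the idea outside Hadoop (the class name and the emitted list standing in for context.write are illustrative assumptions; the flush threshold matches the constant used in the source):

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// In-mapper combining: instead of emitting one (token, position) pair per
// occurrence, positions are buffered per token and flushed in bulk,
// shrinking the data shuffled to the reducers.
public class InMapperCombinerDemo {
    static final int MAX_AGGREGATOR_SIZE = 300000; // flush threshold

    final Map<String, List<Integer>> aggregator = new HashMap<>();
    // Stands in for context.write in the real mapper.
    final List<Map.Entry<String, List<Integer>>> emitted = new ArrayList<>();

    void aggregate(String token, int position) {
        aggregator.computeIfAbsent(token, k -> new ArrayList<>()).add(position);
        if (aggregator.size() > MAX_AGGREGATOR_SIZE)
            dump(); // flush early if the buffer grows too large
    }

    // Called when the mapper finishes, and on overflow.
    void dump() {
        for (Map.Entry<String, List<Integer>> e : aggregator.entrySet())
            emitted.add(new AbstractMap.SimpleEntry<>(
                    e.getKey(), new ArrayList<>(e.getValue())));
        aggregator.clear();
    }
}
```

The size-bounded flush is what keeps the pattern from trading shuffle volume for unbounded mapper memory.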

Positional Indexing: The position of each occurrence of each token emitted by the mapper (a simplified version of a word resulting from the aforementioned operations) is kept and propagated to the output, so that queries can take the position of the word in the document into account.

Flagging of Important Items: The modifications needed to propagate the flagging of important items to the output were not made; therefore, even though this verification is performed at some points, the information is not sent to the output.

Performance
All the operations performed have a runtime complexity of O(n) in relation to the length of the input, which guarantees the speed and scalability of the algorithm implemented. The algorithm takes some time to run due to the overheads involved in the MapReduce architecture, though as the input grows the overhead becomes comparatively insignificant. The use of the in-mapper combining pattern helps avoid bottlenecks, such as the algorithm running slowly due to the excessive (and normally costly) memory operations that would be caused by the mappers sending an unnecessarily large amount of data to the reducers; this makes the algorithm more scalable. The pattern also does not overload the memory, and since the amount of memory used by the in-map combiner may be changed, the algorithm can be selectively tuned for different users or situations. The bottlenecks of the algorithm as implemented are the amount of memory on the machine (though it would need a really big input to have a real impact on performance) and the number of cores, since these limit the number of map and reduce operations that can be run in parallel.

Sample Output

Man(Bart_the_Fink.txt.gz,[101,1950])
Man(Bart_the_Mother.txt.gz,[178,2268])
Manhattan(Bart_the_Murderer.txt.gz,[492])
Marg(Bart_the_Murderer.txt.gz,[134,517,2199])
Marg(Bart_the_Genius.txt.gz,[372,402])
Marg(Bart_the_Fink.txt.gz,[130,460,639,1978])
Marg(Bart_the_General.txt.gz,[257,403])
Marg(Bart_the_Lover.txt.gz,[110,625,627,2480])
Mark(Bart_the_Murderer.txt.gz,[1760])
Marri(Bart_the_Murderer.txt.gz,[133,2198])
Marri(Bart_the_Lover.txt.gz,[109,1573,2479])
Marri(Bart_the_Fink.txt.gz,[1379])
Martin(Bart_the_Genius.txt.gz,[349,466,1034])
Martyn(Bart_the_Genius.txt.gz,[1257,1686])
Martyn(Bart_the_Fink.txt.gz,[1461,1619])
Martyn(Bart_the_Mother.txt.gz,[1681])
Martyn(Bart_the_Murderer.txt.gz,[1492,1850])
Martyn(Bart_the_Lover.txt.gz,[1864,2040])
Martyn(Bart_the_General.txt.gz,[860,1350])
Mason(Bart_the_Lover.txt.gz,[1632])
Massachusett(Bart_the_Mother.txt.gz,[1433])
Masterpiec(Bart_the_Genius.txt.gz,[1986])
Masterpiec(Bart_the_Fink.txt.gz,[1855])
Matt(Bart_the_Fink.txt.gz,[73,1704])
Matt(Bart_the_Genius.txt.gz,[27,71,874,1722,1789])
Matt(Bart_the_Lover.txt.gz,[53,2102])
Matt(Bart_the_Mother.txt.gz,[78,957,1966])
Matt(Bart_the_General.txt.gz,[27,39,974,1327])
Matt(Bart_the_Murderer.txt.gz,[78,1926])
Max(Bart_the_Mother.txt.gz,[153,2243])
Maximum(Bart_the_Mother.txt.gz,[167,2257])
Mayor(Bart_the_Mother.txt.gz,[135,858,2225])
McClure(Bart_the_Mother.txt.gz,[75,317,1529,1574])
McClure(Bart_the_Fink.txt.gz,[70,1195])
McClure(Bart_the_Murderer.txt.gz,[59])
Me(Bart_the_Mother.txt.gz,[186,2276])
Melissa(Bart_the_Genius.txt.gz,[1529])
Melros(Bart_the_Fink.txt.gz,[1371])

BasicInvertedIndex.java

/**
 * BasicInvertedIndex
 *
 * This MapReduce program should build an Inverted Index from a set of files.
 * Each token (the key) in a given file should reference the file it was found
 * in.
 *
 * The output of the program should look like this:
 * sometoken [file001, file002, ... ]
 *
 * @author Kristian Epps
 */
package uk.ac.man.cs.comp38120.exercise;

import java.io.*;
import java.util.*;
import java.util.regex.Pattern;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.conf.Configured;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.*;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.FileSplit;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.commons.cli.CommandLine;
import org.apache.commons.cli.CommandLineParser;
import org.apache.commons.cli.HelpFormatter;
import org.apache.commons.cli.OptionBuilder;
import org.apache.commons.cli.Options;
import org.apache.commons.cli.ParseException;
import org.apache.hadoop.util.Tool;
import org.apache.hadoop.util.ToolRunner;
import org.apache.log4j.Logger;

import uk.ac.man.cs.comp38120.io.array.ArrayListWritable;
import uk.ac.man.cs.comp38120.io.pair.PairOfStringFloat;
import uk.ac.man.cs.comp38120.io.pair.PairOfWritables;
import uk.ac.man.cs.comp38120.util.XParser;
import uk.ac.man.cs.comp38120.ir.StopAnalyser;
import uk.ac.man.cs.comp38120.ir.Stemmer;

import static java.lang.System.out;

public class BasicInvertedIndex extends Configured implements Tool
{
    private static final Logger LOG = Logger
            .getLogger(BasicInvertedIndex.class);

    public static class Map extends
            Mapper<Object, Text, Text, PairOfWritables<Text, ArrayListWritable<IntWritable>>>
    {
        // In-map aggregator map
        java.util.Map<String, ArrayListWritable<IntWritable>> aggregator;

        final int MAX_AGGREGATOR_SIZE = 300000;

        // Lazy initialization
        private java.util.Map<String, ArrayListWritable<IntWritable>> getAggregator()
        {
            if (aggregator == null)
                aggregator = new HashMap<String, ArrayListWritable<IntWritable>>();
            return aggregator;
        }

        // Writes into the context all the data in the aggregator map, then clears it
        private void dump(Context context) throws IOException, InterruptedException
        {
            Iterator<java.util.Map.Entry<String, ArrayListWritable<IntWritable>>> iter;
            iter = getAggregator().entrySet().iterator();

            while (iter.hasNext())
            {
                java.util.Map.Entry<String, ArrayListWritable<IntWritable>> aux =
                        iter.next();
                WORD.set(aux.getKey());
                context.write(WORD, new PairOfWritables<Text, ArrayListWritable<IntWritable>>(
                        INPUTFILE, aux.getValue()));
            }

            aggregator = null;
        }

        // Flushes the map should it use too much memory
        private void flush(Context context) throws IOException, InterruptedException
        {
            if (getAggregator().size() > MAX_AGGREGATOR_SIZE)
                dump(context);
        }

        // Adds the given information, to be written into the context, to the aggregator map
        private void aggregate(String token, int position, Context context)
                throws IOException, InterruptedException
        {
            if (getAggregator().containsKey(token))
            {
                ArrayListWritable<IntWritable> l = getAggregator().get(token);
                l.add(new IntWritable(position));
                getAggregator().put(token, l);
            }
            else
            {
                ArrayListWritable<IntWritable> l =
                        new ArrayListWritable<IntWritable>();
                l.add(new IntWritable(position));
                getAggregator().put(token, l);
            }

            flush(context);
        }

        // INPUTFILE holds the name of the current file
        private final static Text INPUTFILE = new Text();

        // TOKEN should be set to the current token rather than creating a
        // new Text object for each one
        @SuppressWarnings("unused")
        private final static Text TOKEN = new Text();

        // The StopAnalyser class helps remove stop words
        @SuppressWarnings("unused")
        private StopAnalyser stopAnalyser = new StopAnalyser();

        // The stem method wraps the functionality of the Stemmer
        // class, which trims extra characters from English words
        // Please refer to the Stemmer class for more comments
        @SuppressWarnings("unused")
        private String stem(String word)
        {
            Stemmer s = new Stemmer();

            // A char[] word is added to the stemmer with its length,
            // then stemmed
            s.add(word.toCharArray(), word.length());
            s.stem();

            // Return the stemmed char[] word as a string
            return s.toString();
        }

        // This method gets the name of the file the current Mapper is working
        // on
        @Override
        public void setup(Context context)
        {
            String inputFilePath = ((FileSplit) context.getInputSplit())
                    .getPath().toString();
            String[] pathComponents = inputFilePath.split("/");
            INPUTFILE.set(pathComponents[pathComponents.length - 1]);
        }

        // Lowercases words that are capitalized only because they start a sentence
        private String caseFolding(String text)
        {
            String result = new String(text);

            // For each sentence
            for (String sentence : text.split("\\."))
            {
                for (String word : sentence.split(" "))
                {
                    // Cleans the word of punctuation
                    String aux = trimPunctuation(word);

                    // Gets the first word that was not only punctuation
                    if (aux == null)
                        continue;
                    if (aux.length() <= 0)
                        continue;

                    // Makes it lower case
                    if (Character.isUpperCase(aux.codePointAt(0)))
                    {
                        // TODO
                        // IF NOT ACRONYM
                        result = result.replace(word, word.toLowerCase());
                    }

                    break;
                }
            }

            return result;
        }

        // Trims punctuation from the start and end of a string. Returns null if the
        // string is only punctuation, else returns the trimmed string
        private String trimPunctuation(String str)
        {
            if (str.length() == 0)
                return null;

            String punct = new String("!\"#$%&\'*+,./:'\\'<=>?@[]^_`{|}~()\t\n\f\r");

            // Removes punctuation and other symbols from the beginning and end of the string
            int i = 0;

            // Removes punctuation from the beginning
            while (i < str.length() && punct.contains(str.substring(i, i + 1)))
                i++;
            str = str.substring(i);

            if (str.length() == 0)
                return null;

            int j = str.length() - 1;

            // Removes punctuation from the end
            while (j > 0 && punct.contains(str.substring(j, j + 1)))
                j--;

            return str.substring(0, j + 1);
        }

        // Removes tags of the form [number], which occasionally remain after tokenization
        private String trimTags(String str)
        {
            if (str.length() == 0)
                return null;

            // Finds the opening bracket of a tag, if there is one
            int i;
            for (i = 0; i < str.length() && str.codePointAt(i) != '['; i++);

            if (i == str.length())
                return str;

            // Scans past the digits and closing bracket of the tag
            int j;
            for (j = i + 1; j < str.length() && (Character.isDigit(str.codePointAt(j))
                    || str.codePointAt(j) == ']'); j++);

            // Keeps the text before the tag and whatever follows it
            String tail = (j < str.length()) ? str.substring(j) : "";
            return str.substring(0, i) + tail;
        }

        private final static Text WORD = new Text();

        // This Mapper reads in a line, converts it to a set of tokens and
        // outputs each modified token with the position of its occurrence in
        // the document
        public void map(Object key, Text value, Context context)
                throws IOException, InterruptedException
        {
            String line = value.toString();

            // Tokenizes the text after case folding
            StringTokenizer itr = new StringTokenizer(caseFolding(line));

            for (int position = 0; itr.hasMoreTokens(); position++)
            {
                String str = itr.nextToken();

                // Trims the tags of the form [number]
                str = trimTags(str);

                // Trims punctuation
                str = trimPunctuation(str);

                // Does not add words that became null after being stripped of
                // punctuation
                if (str == null)
                    continue;

                // Disregards stop words
                if (StopAnalyser.isStopWord(str))
                    continue;

                // Stems words
                str = stem(str);

                // Combines this output with the other output given by this mapper,
                // implementing the in-map combining (in-map aggregation) pattern
                aggregate(str, position, context);
            }

            // Guarantees that no information remains without being forwarded
            // to the reducer
            dump(context);
        }
    }

    public static class Reduce extends Reducer<Text,
            PairOfWritables<Text, ArrayListWritable<IntWritable>>, Text,
            PairOfWritables<Text, ArrayListWritable<IntWritable>>>
    {
        private final static Text WORD = new Text();

        // This reduce job takes in a key and an iterable of (filename,
        // positions) pairs, merges the position lists per file and outputs
        // each merged pair along with the key
        public void reduce(
                Text key,
                Iterable<PairOfWritables<Text, ArrayListWritable<IntWritable>>> values,
                Context context) throws IOException, InterruptedException
        {
            Iterator<PairOfWritables<Text, ArrayListWritable<IntWritable>>> iter =
                    values.iterator();

            java.util.Map<Text, ArrayListWritable<IntWritable>> combine =
                    new HashMap<Text, ArrayListWritable<IntWritable>>();

            // For each value given by the mappers
            while (iter.hasNext())
            {
                PairOfWritables<Text, ArrayListWritable<IntWritable>> pair =
                        iter.next();

                // Copies are taken because Hadoop reuses the objects handed
                // out by the values iterator
                Text file = new Text(pair.getLeftElement());
                ArrayListWritable<IntWritable> positions =
                        new ArrayListWritable<IntWritable>();
                for (IntWritable position : pair.getRightElement())
                    positions.add(new IntWritable(position.get()));

                // Concatenates the position arrays of each document in which
                // this token appears
                if (!combine.containsKey(file))
                    combine.put(file, positions);
                else
                    combine.get(file).addAll(positions);
            }

            Iterator<java.util.Map.Entry<Text, ArrayListWritable<IntWritable>>> iter2 =
                    combine.entrySet().iterator();

            // Writes the output
            while (iter2.hasNext())
            {
                java.util.Map.Entry<Text, ArrayListWritable<IntWritable>> entry =
                        iter2.next();
                WORD.set(key);
                context.write(WORD, new PairOfWritables<Text, ArrayListWritable<IntWritable>>(
                        entry.getKey(), entry.getValue()));
            }
        }
    }

    // Let's create an object! :)
    public BasicInvertedIndex()
    {
    }

    // Variables to hold cmd line args
    private static final String INPUT = "input";
    private static final String OUTPUT = "output";
    private static final String NUM_REDUCERS = "numReducers";

    @SuppressWarnings({ "static-access" })
    public int run(String[] args) throws Exception
    {
        // Handle command line args
        Options options = new Options();

        options.addOption(OptionBuilder.withArgName("path").hasArg()
                .withDescription("input path").create(INPUT));
        options.addOption(OptionBuilder.withArgName("path").hasArg()
                .withDescription("output path").create(OUTPUT));
        options.addOption(OptionBuilder.withArgName("num").hasArg()
                .withDescription("number of reducers").create(NUM_REDUCERS));

        CommandLine cmdline = null;
        CommandLineParser parser = new XParser(true);

        try
        {
            cmdline = parser.parse(options, args);
        }
        catch (ParseException exp)
        {
            System.err.println("Error parsing command line: "
                    + exp.getMessage());
            System.err.println(cmdline);
            return -1;
        }

        // If we are missing the input or output flag, let the user know
        if (!cmdline.hasOption(INPUT) || !cmdline.hasOption(OUTPUT))
        {
            System.out.println("args: " + Arrays.toString(args));
            HelpFormatter formatter = new HelpFormatter();
            formatter.setWidth(120);
            formatter.printHelp(this.getClass().getName(), options);
            ToolRunner.printGenericCommandUsage(System.out);
            return -1;
        }

        // Create a new Map Reduce Job
        Configuration conf = new Configuration();
        Job job = new Job(conf);
        String inputPath = cmdline.getOptionValue(INPUT);
        String outputPath = cmdline.getOptionValue(OUTPUT);
        int reduceTasks = cmdline.hasOption(NUM_REDUCERS) ? Integer
                .parseInt(cmdline.getOptionValue(NUM_REDUCERS)) : 1;

        // Set the name of the Job and the class it is in
        job.setJobName("BasicInvertedIndex");
        job.setJarByClass(BasicInvertedIndex.class);
        job.setNumReduceTasks(reduceTasks);

        // Set the Mapper and Reducer class (no need for combiner here)
        job.setMapperClass(Map.class);
        job.setReducerClass(Reduce.class);

        // Set the Output Classes
        job.setMapOutputKeyClass(Text.class);
        job.setMapOutputValueClass(PairOfWritables.class);
        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(PairOfWritables.class);

        // Set the input and output file paths
        FileInputFormat.setInputPaths(job, new Path(inputPath));
        FileOutputFormat.setOutputPath(job, new Path(outputPath));

        // Time the job whilst it is running
        long startTime = System.currentTimeMillis();
        job.waitForCompletion(true);
        LOG.info("Job Finished in " + (System.currentTimeMillis() - startTime)
                / 1000.0 + " seconds");

        // Returning 0 lets everyone know the job was successful
        return 0;
    }

    public static void main(String[] args) throws Exception
    {
        ToolRunner.run(new BasicInvertedIndex(), args);
    }
}
