Muppet: MapReduce-Style Processing of Fast Data

Lam, Wang; Liu, Lu; Prasad, STS; Rajaraman, Anand; Vacheri, Zoheb; Doan, AnHai

Computer Science > Databases

arXiv:1208.4175 (cs)

[Submitted on 21 Aug 2012]

Title:Muppet: MapReduce-Style Processing of Fast Data

Authors:Wang Lam, Lu Liu, STS Prasad, Anand Rajaraman, Zoheb Vacheri, AnHai Doan

View PDF

Abstract:MapReduce has emerged as a popular method to process big data. In the past few years, however, not just big data, but fast data has also exploded in volume and availability. Examples of such data include sensor data streams, the Twitter Firehose, and Facebook updates. Numerous applications must process fast data. Can we provide a MapReduce-style framework so that developers can quickly write such applications and execute them over a cluster of machines, to achieve low latency and high scalability? In this paper we report on our investigation of this question, as carried out at Kosmix and WalmartLabs. We describe MapUpdate, a framework like MapReduce, but specifically developed for fast data. We describe Muppet, our implementation of MapUpdate. Throughout the description we highlight the key challenges, argue why MapReduce is not well suited to address them, and briefly describe our current solutions. Finally, we describe our experience and lessons learned with Muppet, which has been used extensively at Kosmix and WalmartLabs to power a broad range of applications in social media and e-commerce.

Comments:	VLDB2012
Subjects:	Databases (cs.DB)
Cite as:	arXiv:1208.4175 [cs.DB]
	(or arXiv:1208.4175v1 [cs.DB] for this version)
	https://doi.org/10.48550/arXiv.1208.4175
Journal reference:	Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 12, pp. 1814-1825 (2012)

Submission history

From: Wang Lam [view email] [via Ahmet Sacan as proxy]
[v1] Tue, 21 Aug 2012 02:53:58 UTC (228 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.DB

< prev | next >

new | recent | 2012-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Wang Lam
Lu Liu
STS Prasad
Anand Rajaraman
Zoheb Vacheri

…

export BibTeX citation

Computer Science > Databases

Title:Muppet: MapReduce-Style Processing of Fast Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Databases

Title:Muppet: MapReduce-Style Processing of Fast Data

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators