Mosharaf Chowdhury
UC Berkeley
  • Home
  • Publications
  • Academic
    • Coursework
  • Network Virtualization
    • Bibliography
    • Resources
  • Blog
    • Archives
Browse: Home / MapReduce

MapReduce

Dremel: Interactive Analysis of Web-Scale Datasets

By Mosharaf on October 10, 2011

Google, “Dremel: Interactive Analysis of Web-Scale Datasets,” VLDB, 2010. [PDF] Summary Dremel is Google’s interactive  ad hoc query system for analysis of read-only nested data. Unlike MapReduce, Dremel is aimed toward data exploration, monitoring, and debugging, where near real-time performance is of utmost importance. To achieve scalability and performance, Dremel builds upon three key ideas: ...

Continue reading →

Posted in Reviews | Tagged Dremel, MapReduce, UCB Cloud Computing Course F11 | Leave a response

Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks

By Mosharaf on October 1, 2011

Michael Isard, Mihai Budiu, Yuan Yu, Andrew Birrell, Dennis Fetterly, “Dryad: Distributed Data-Parallel Programs from Sequential Building Blocks,” EuroSys, 2007. [PDF] Summary Dryad is Microsoft’s answer to the MapReduce paradigm, albeit at a (slightly) lower level with greater flexibility. Like MapReduce, Dryad allows developers to think about what to do with the data, and Dryad ...

Continue reading →

Posted in Reviews | Tagged Birrell, Budiu, Dryad, Fetterly, Isard, MapReduce, UCB Cloud Computing Course F11, Yu | Leave a response

MapReduce: Simplified Data Processing on Large Clusters

By Mosharaf on September 28, 2011

Jeffrey Dean, Sanjay Ghemawat, “MapReduce: Simplified Data Processing on Large Clusters,” OSDI, 2004. [PDF] Summary MapReduce is a programming model and associated implementation for processing and generating large data sets in a parallel, fault-tolerant, distributed, and load-balanced manner. There are two main functions (both user provided) in this programming model. The map function takes an input ...

Continue reading →

Posted in Reviews | Tagged Dean, Ghemawat, Hadoop, MapReduce, UCB Cloud Computing Course F11 | Leave a response

Spark short paper has been accepted at HotCloud’10

By Mosharaf on May 8, 2010

An initial overview of our ongoing work on Spark, an iterative and interactive framework for cluster computing, has been accepted at HotCloud’10. I’ve been joined the project last February, while Matei has been working on it since last Fall. I will have uploaded the paper in the publications page. once we have taken care of ...

Continue reading →

Posted in Cloud Computing, Recent News, Research | Tagged data parallel systems, HotCloud, MapReduce, Spark | Leave a response

Copyright © 2012 Mosharaf Chowdhury.

Powered by WordPress and Hybrid.