Recent News
TAing for CS162: Operating Systems and Systems Programming
This semester I’m TAing for CS162: Operating Systems and Systems Programming taught by Anthony Joseph and Ion Stoica. Here in Berkeley, we call a TA a GSI (Graduate Student Instructor). This is my first time GSI appointment in Berkeley, and it comes after more than 2 years of break. The last time I TAed, I was in ...
Continue reading →
Spark has been accepted at NSDI’2012
Our paper “Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing” has been accepted at NSDI’2012. This is Matei‘s brainchild and a joint work of a lot of people including, but not limited to, TD, Ankur, Justin, Murphy, and professors Ion Stoica, Scott Shenker, and Michael Franklin. Unlike many other systems papers, Spark is ...
Continue reading →
Presented Orchestra at Yahoo! Research
Today I presented Orchestra at Yahoo! Research in Santa Clara. Ganesh hooked me up with Sriram Rao at Yahoo! Research, who kindly invited me to give a talk in front of their Cloud Platforms team. I received some very interesting feedback that will help us in further improving Orchestra design and extending its reach.
Back in Berkeley after a Summer in MSR Redmond
After a long Summer that included an internship at MSR Redmond/Bing, attending SIGCOMM in Toronto, and a trip to Bangladesh, I’m back in Berkeley again. Time to work! This summer I worked with Dave Maltz and Lijiang Fang at MSR/Bing on a datacenter-related problem.
Presented Orchestra at SIGCOMM’2011
I’m attending my second SIGCOMM and had the privilege of giving my first talk at the flagship networking conference. I presented Orchestra, which happened to be very well attended even though it was the last talk of the day at 6PM. I’d like to thank everyone for showing up and also for the lively Q/A ...
Continue reading →
Technical report on Spark is available Online
A technical report describing the key concepts behind Spark is available online. The abstract goes below: We present Resilient Distributed Datasets (RDDs), a distributed memory abstraction that allows programmers to perform in-memory computations on large clusters while retaining the fault tolerance of data flow models like MapReduce. RDDs are motivated by two types of applications ...
Continue reading →
Spark’s in the wild
We have been working on the Spark cluster computing framework for last couple of years. It has always been open source under the BSD license in github. But yesterday Matei declared official launch of the spark website (spark-project.org) and mailing lists along with its 0.2 release to everyone during the AMPLab summer retreat at Chaminede, ...
Continue reading →
Extended version of ViNEYard has been accepted in IEEE/ACM ToN
An extended, updated, and emended version of our ViNEYard paper in INFOCOM’09 has been accepted for publication in IEEE/ACM Transactions on Networking after yearlong multiple rounds of reviews. Since there is normally a long queue for actually getting an accepted ToN paper printed, its hard to tell when ours will officially be out there. I’d ...
Continue reading →
Presented Orchestra at LBNL
Today I presented Orchestra for the first time in front of a crowd outside our lab. Taghrid Samak kindly invited me at LBNL’s Computing Sciences Seminar after we caught up over lunch last week, after a year. She is currently a post-doc fellow with the Advance Computing for Science group. Overall, the talk went very ...
Continue reading →
Orchestra has been accepted at SIGCOMM’2011
Update: Camera-ready version of the paper should be can be found in the publications page very soon! Our paper “Managing Data Transfers in Computer Clusters with Orchestra” has been accepted at SIGCOMM’2011. This is a joint work with Matei, Justin, and professors Mike Jordan and Ion Stoica. The project started as part of Spark and ...
Continue reading →