Facebook Opensourced Presto

Facebook has open-sourced its data processing engine, ‘Presto’. Presto is a distributed query engine based on ANSI SQL. It is highly optimized and currently runs against more than 300 petabytes of data at Facebook, which probably makes it one of the largest big data processing systems in use. Presto is totally different from MapReduce: it processes data in memory. According to the details given in the Facebook newsletter and on the Presto website, it is 10 times faster than Hive on MapReduce. Hive also came from Facebook, so I expect Presto to beat Hive. Hive queries ultimately run as multiple MapReduce jobs, which takes more time.

From my point of view, the real competition may be between Cloudera Impala and Presto. Impala's performance on huge datasets in production environments is not known yet, because it is still a young technology from the Cloudera family, whereas Presto is already tested and running in production on huge datasets.

Another interesting fact about Presto is that we can deploy it on an existing Hadoop cluster and infrastructure, because Presto supports HDFS as its underlying data storage. It supports other storage systems as well, so it is flexible. Leading internet companies including Airbnb and Dropbox are using Presto. The Presto code and further details are available at this link
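To make the difference between the two execution styles a bit more concrete, here is a toy Python sketch. It is not Presto or Hive code; the data, the function names and the query (roughly SELECT country, SUM(clicks) ... WHERE clicks > 0 GROUP BY country) are all made up for illustration. The first function writes its intermediate result to disk between stages, the way chained MapReduce jobs do, and the second streams rows through the operators in memory.

```python
import json
import os
import tempfile

# Hypothetical sample data, invented for this illustration.
ROWS = [
    {"country": "IN", "clicks": 3},
    {"country": "US", "clicks": 5},
    {"country": "IN", "clicks": 4},
    {"country": "DE", "clicks": 0},
]


def staged_query(rows):
    """MapReduce-like: every stage writes its full output to disk
    before the next stage is allowed to start."""
    with tempfile.TemporaryDirectory() as tmp:
        stage1_path = os.path.join(tmp, "stage1.json")

        # Stage 1: filter, then materialize the result on disk.
        with open(stage1_path, "w") as f:
            json.dump([r for r in rows if r["clicks"] > 0], f)

        # Stage 2: read the intermediate file back and aggregate.
        with open(stage1_path) as f:
            filtered = json.load(f)
        totals = {}
        for r in filtered:
            totals[r["country"]] = totals.get(r["country"], 0) + r["clicks"]
        return totals


def pipelined_query(rows):
    """Pipelined: rows flow from the filter to the aggregation
    in memory, with no intermediate file."""
    filtered = (r for r in rows if r["clicks"] > 0)  # lazy generator
    totals = {}
    for r in filtered:
        totals[r["country"]] = totals.get(r["country"], 0) + r["clicks"]
    return totals


if __name__ == "__main__":
    print(staged_query(ROWS))
    print(pipelined_query(ROWS))
```

Both functions return the same totals. The only difference is the extra write and read between the stages, and that kind of overhead adds up when one query turns into several jobs over a huge dataset.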

I have deployed Presto and Impala on a small cluster of 8 nodes. I haven’t had enough time to explore Presto further yet, but I plan to do so in the coming days. 🙂

2 thoughts on “Facebook Opensourced Presto”

Florian Stompe says:

November 11, 2013 at 4:06 pm

I’m really excited to see your results on that topic. We have used Hive in numerous projects and would like to take the next step to improve performance, either with Impala or Presto. Let’s see which one is faster and more convenient to use.

Anoop Sam John says:

November 29, 2013 at 7:20 am

Please post the performance results when you have some. Good on you, Amal!

