Archive

Archive for June, 2013

Presto – Facebook´s Exabyte-Scale Query Engine

Presto is Facebook´s answer to Cloudera´s Impala, Hortonworks´ Stinger and Google´s Dremel. Presto is an ANSI-SQL compatible real-time data warehouse query engine so existing data tools should be working with it unlike Hive which needed special integration. Presto is in-memory and runs simple queries in few hundred milliseconds and complex queries in a few minutes. Ideal for interactive data warehousing. Unfortunately Presto will not be open sourced until later this year [probably fall], so the Big Data community will have to be patient.

Open Source real-time massive-scale data warehousing is likely to disrupt existing players like Teradata, Oracle, etc. who until recently were able to charge $100K per tera-byte…

Update: Facebook has open source Presto. You can now download it at http://prestodb.io/

If you want to make a Juju Charm of Presto please contact me…

Advertisements
%d bloggers like this: