
Presto is best at handling analytics workloads, and though Presto has added some features to handle insertions more efficiently, it shines when reading and federating data in a data warehouse or data lake.
Presto is an open source distributed SQL query engine for running high performance queries against various data sources ranging in size from gigabytes to petabytes.
Presto was designed and built from scratch in Java for interactive analytics as a replacement for Apache Hadoop/HDFS MapReduce jobs. It approaches the speed of commercial data warehouses while scaling up to the size of the largest organizations in the world.
Presto was originally developed at Facebook to scale to the data size and performance they needed to query their Hive-based data warehouse, but was expanded to connect to many other data sources over time.
PrestoDB vs PrestoSQL vs Trino
PrestoDB is the former name of the original version of Presto. It was developed by Eric Hwang, Dain Sundstrom, David Phillips, and Martin Traverso at Facebook. In 2018, they left Facebook and founded the Presto Software Foundation to ensure that the project would remain collaborative and independent. They named their new fork PrestoSQL, which was later renamed to Trino at the end of 2020. PrestoDB was renamed to Presto shortly after, so PrestoDB is now simply called Presto, and PrestoSQL is now Trino.
Presto and Trino share similar features and the same core code. However, ongoing development on Presto has been driven by Facebook, while development on Trino has been driven by companies like Starburst and AWS trying to serve a wide audience. This has made Trino more generally useful, and as explained below, it has benefitted from higher velocity development.
Related reading: What’s the difference between Prestodb, Presto SQL, and Trino
Is Trino better than Presto?
Since the fork in 2018, development on Trino has gone at roughly three times the velocity of development on Presto. It boasts additional connectors that aren’t in Presto, better performance across the vast majority of connectors, expanded SQL support, and is much better at handling batch ETL/ELT workloads.
Whether you’re considering Presto or Trino, the easiest way to start querying your data is with Starburst Galaxy, the simplest and quickest way to get running with SQL. If you don’t want to use Starburst, the Trino website provides tutorials on using it locally on Linux, via a Docker image, or with Kubernetes.