Release 0.272
Improve performance by enabling Cost based optimizations by default.
Add ability to stream data for partial aggregation instead of building hash tables.
This improves the performance of aggregation when the data is already ordered by the group-by keys. Streaming aggregation can be enabled with the session property or the
streaming-for-partial-aggregation-enabled
configuration property.Add an adaptive stage scheduling policy that switches to phased execution mode once a query’s stage count exceeds a configurable upper bound.
Add function to return a cryptographically secure random number.
Hive Changes
Fix integer overflow exception in Parquet writer when writing files larger than ~2 GB.
-
This can be disabled with the
file_splittable
session property or the configuration property. Add support for using Parquet page-level statistics to skip pages.
This feature can be enabled by setting the
hive.parquet-column-index-filter-enabled
configuration property.Add support for metadata-based listing and bootstrap for Hudi tables.
- Add a new parameter
timeZoneID
which will set the time zone used for the timestamp columns. (See #16680).
MongoDB Connector Changes
- Fix the spelling of the write concern option
JOURNAL_SAFE
for the property .
- Add support for concurrent insertion from the same Presto cluster or multiple Presto clusters which share the same Metastore.
Pinot Changes
- Add support for querying Pinot
JSON
type.