Release 0.213
Fix query execution failure or indefinite hang during grouped execution when all splits for any lifespan are completely filtered out.
Fix grouped execution to respect the configured concurrent lifespans per task. Previously, it always used a single lifespan per task.
Fix execution failure when using grouped execution with right or full outer joins where the right side is not partitioned on the join key.
Fix a scenario where too many rows are returned to clients in a single response.
Do not allow setting invalid property values with SET SESSION.
Disable stats calculator by default as it can cause a planning failure for certain complex queries. It can be enabled with the configuration property or the
enable_stats_calculator
session property.Avoid making guesses when estimating filters for joins. Previously, if nothing was known about the filter, a
0.9
coefficient was applied as a filter factor. Now, if nothing is known about a filter, the estimate will be unknown. A0.9
coefficient will be applied for all additional conjuncts if at least a single conjunct can be reasonably estimated.Improve
EXPLAIN ANALYZE
output by adding CPU time and enhancing accuracy of CPU fraction.Include stats and cost estimates in textual plans created on query completion.
Enhance
SHOW STATS
to supportIN
andBETWEEN
predicates in theWHERE
condition of theSELECT
clause.Remove transaction from explain plan for indexes joins.
Add
join-max-broadcast-table-size
configuration property andjoin_max_broadcast_table_size
session property to control the maximum estimated size of a table that can be broadcast when usingAUTOMATIC
join distribution type ().Add experimental config option
experimental.reserved-pool-enabled
to disable the reserved memory pool.Add
targetResultSize
query parameter to/v1/statement
endpoint to control response data size.
Geospatial Changes
Fix
ST_Distance()
function to returnNULL
if any of the inputs is an empty geometry as required by the SQL/MM specification.Add function to construct multi-point geometry from an array of points.
Add geometry_union() function to efficiently union arrays of geometries.
Add support for distributed spatial joins ().
- Allow running on a JVM from any vendor.
Web UI Changes
Remove legacy plan UI.
Add support for filtering queries by all error categories.
Add dialog to show errors refreshing data from coordinator.
Fix LZO and LZOP decompression to work with certain data compressed by Hadoop.
Fix potential out-of-bounds read for ZSTD on corrupted input.
Stop assuming no distinct values when column null fraction statistic is less than
1.0
.Treat as an absent null count for compatibility with statistics written by .
Preserve original exception for metastore network errors.
Preserve exceptions from Avro deserializer
Categorize text line length exceeded error.
Remove the old Parquet reader. The
hive.parquet-optimized-reader.enabled
configuration property andparquet_optimized_reader_enabled
session property no longer exist.Remove the
hive.parquet-predicate-pushdown.enabled
configuration property andparquet_predicate_pushdown_enabled
session property. Pushdown is always enabled now in the Parquet reader.Enable optimized ORC writer by default. It can be disabled using the
hive.orc.optimized-writer.enabled
configuration property or theorc_optimized_writer_enabled
session property.Use ORC file format as the default for new tables or partitions.
Add support for Avro tables where the Avro schema URL is an HDFS location.
Add
hive.parquet.writer.block-size
andhive.parquet.writer.page-size
configuration properties andparquet_writer_block_size
and session properties for tuning Parquet writer options.
Memory Connector Changes
- Improve table data size accounting.
Verifier Changes
- Rewrite queries in parallel when shadowing writes.