Latest in branch 8.3
8.3.3
Released 28 Jul 2022
(3 years ago)
SoftwareElasticsearch
Version8.3
Status
End of life
Initial release8.3.0
28 Jun 2022
(3 years ago)
Latest release8.3.3
28 Jul 2022
(3 years ago)
End of lifeUnavailable
Release noteshttps://www.elastic.co/guide/en/elasticsearch/reference/8.3/es-release-notes.html
Source codehttps://github.com/elastic/elasticsearch/tree/8.3
Documentationhttps://www.elastic.co/guide/en/elasticsearch/reference/8.3/
Downloadhttps://www.elastic.co/downloads/elasticsearch
Elasticsearch 8.3 ReleasesView full list

What Is New in Elasticsearch 8.3

Elasticsearch 8.3 brings significant enhancements across vector search, analytics, and data management. This release focuses on making complex operations more efficient and developer-friendly.

Category Key Updates
New Features ANN search, Vector tiles, Painless support for long and double, terms enrichment policy
Improvements Faster frozen tier, Better GeoSpatial aggregation performance, Async search node communication
Deprecations Deprecated REST API parameters for ML endpoints

How does vector search get better in 8.3?

The headline feature is the introduction of approximate nearest neighbor (ANN) search. This is a game-changer for performance on large-scale vector similarity operations.

ANN search uses the HNSW algorithm to find similar vectors without an exhaustive, costly scan. In practice, this means your k-NN queries return results orders of magnitude faster, especially on big datasets. You enable it by setting method to hnsw in your field mapping.

PUT my-index
{
  "mappings": {
    "properties": {
      "my_vector": {
        "type": "dense_vector",
        "dims": 3,
        "index": true,
        "similarity": "l2_norm",
        "index_options": {
          "type": "hnsw",
          "m": 16,
          "ef_construction": 100
        }
      }
    }
  }
}

What analytics upgrades should I know about?

Elasticsearch 8.3 supercharges geospatial and terms-based analytics. The new vector tiles API delivers map visualizations significantly faster than the previous geo_shape approach.

This matters because rendering complex maps for dashboards used to be a bottleneck. Now, you can serve high-resolution map layers directly from Elasticsearch without overloading your Kibana instance. Under the hood, it uses the Mapbox Vector Tile specification.

For terms enrichment, the new terms policy type lets you enrich incoming documents based on exact keyword matches. It's simpler and more efficient than the match policy for use cases where you don't need fuzzy matching.

Are there any scripting enhancements?

Yes, Painless scripting now supports the long and double field types for the field API. This closes a previous gap where you had to work with these types differently.

Before, you had to use doc['my_long_field'].value. Now, you can use the cleaner and more intuitive field('my_long_field') and field('my_double_field') methods directly in your scripts. It makes writing scripts for numeric calculations much more straightforward.

// Now you can do this:
def my_calc = field('price').get(0) * field('tax_rate').get(0);

// Instead of this:
def my_calc = doc['price'].value * doc['tax_rate'].value;

How is manageability improved?

Two key areas see major improvements: the frozen tier and async search. Searches on the frozen tier are now up to three times faster, drastically cutting the cost and time needed to query archived, read-only data.

For async search, the protocol for communication between coordinating nodes and the node holding the async search results has been optimized. This reduces overhead and makes checking the status of long-running searches more efficient across the cluster.

FAQ

Is the new ANN search a replacement for the exact k-NN search?
No, they serve different purposes. Approximate Nearest Neighbor (ANN) with HNSW is for high-speed, high-recall search on large datasets. Exact k-NN is for when you need 100% accuracy, regardless of performance cost. You choose the method that fits your accuracy vs. speed trade-off.

When should I use the new `terms` enrichment policy over `match`?
Use the terms policy when you need to enrich documents based on an exact key match, like a user ID or product SKU. Stick with the match policy if you need to enrich based on more complex, analyzed text where stemming and scoring are important.

Do I need to reindex to use the new vector tiles API?
No, the vector tiles API works with existing geo_point and geo_shape data. You can start using it immediately to render faster maps from your current geo-data.

What's the practical benefit of the frozen tier performance boost?
It makes querying cold data-like old logs or historical records-feel almost as fast as querying your hot tier. This means you can archive data more aggressively to save on storage costs without making it painful to access when needed for compliance or analysis.

Are the deprecated ML API parameters breaking changes?
No, they are only deprecated, not removed. Your existing code will continue to work, but you should plan to update it to use the new parameter names to ensure future compatibility. The deprecated parameters will be removed in a future major version.

Releases In Branch 8.3

VersionRelease date
8.3.328 Jul 2022
(3 years ago)
8.3.207 Jul 2022
(3 years ago)
8.3.130 Jun 2022
(3 years ago)
8.3.028 Jun 2022
(3 years ago)