-
Notifications
You must be signed in to change notification settings - Fork 25k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Object properties order not preserved in _source for fields with "type": "object", "enabled": false #119347
Comments
Just testing this as you described it, it doesn't reproduce. Unless you provide a reproduceable scenario (complete mapping and settings, exact document indexed and commands showing the mangled _source) I cannot confirm this as a bug. Our documentation provides a similar to what you described behavior for synthetic source otherwise _source should be as it was when the document was indexed. |
@astefan I just wanted to make sure that this is not considered normal. If you reopen this I can provide a working example. |
@astefan thank you, will update the issue later tonight. |
@astefan after a little digging, I have discovered the culprit: this only happens when the index mapping for _source has a "excludes" property. I have updated the issue with a reproducible example, you will see that after the last GET request, the properties of the "content" field object in the document source are in a seemingly random order. Please inform me if you want me to change the title to something more descriptive, now that I have narrowed down the cause of the issue. |
After further testing, it seems to happen only when the excludes property contains a wildcard 😃 |
And one final finding is that this affects not only the property order of the nested objects (like the one in the example), but rather of the whole source object and all nested objects. Therefore I think that we should rewrite the issue title and description from scratch, since it seems that it has nothing to do with enabled: false. |
Pinging @elastic/es-storage-engine (Team:StorageEngine) |
I don't think we attempt to provide any guarantees about preserving the order of fields in the source. Most of the time the source will be preserved because we don't bother parsing and rewriting the source unless we need to, but if we need to exclude field then we need to parse and rewrite it and that may introduce changes. Since fields in JSON are explicitly unordered 1, I don't think you should be relying on the source coming back with the same order as it was stored. Footnotes |
Well said, @tvernum. More so, there are (documented modifications) when synthetic source is used. In other words, it's not safe to assume that the I'm closing this as working as intended. |
Elasticsearch Version
8.15.2
Installed Plugins
analysis_icu
Java Version
bundled
OS Version
MacOS
Problem Description
According to the documentation (https://www.elastic.co/guide/en/elasticsearch/reference/current/enabled.html), fields mapped as "type": "object", "enabled": false are supposedly not processed/parsed by Elasticsearch and therefore one would expect that any stored objects in such fields would be stored in the document _source exactly as sent in the indexing request.
This does not seem to be the case when it comes to the order of their properties, since we have cases where such fields inside the document _source (retrieved using a plain GET document request, no searching involved) do not retain the original order.
Eg. a document indexed like this:
is returned like this right after it is indexed:
Is this normal/to be expected (ie. object serialization in source fields is not guaranteed to preserve properties order), or is it a bug?
Steps to Reproduce
Logs (if relevant)
No response
The text was updated successfully, but these errors were encountered: