Track bytes used by in-memory postings #129969
base: main
Conversation
Mostly copied from Nhat's implementation in elastic#121476
Pinging @elastic/es-storage-engine (Team:StorageEngine)
Looks good, Jordan. I do wonder a little bit about the potential overhead of TrackingPostingsInMemoryBytesCodec. Maybe check this quickly with esbench?
@@ -2778,7 +2779,7 @@ private IndexWriterConfig getIndexWriterConfig() {
     iwc.setMaxFullFlushMergeWaitMillis(-1);
     iwc.setSimilarity(engineConfig.getSimilarity());
     iwc.setRAMBufferSizeMB(engineConfig.getIndexingBufferSize().getMbFrac());
-    iwc.setCodec(engineConfig.getCodec());
+    iwc.setCodec(new TrackingPostingsInMemoryBytesCodec(engineConfig.getCodec()));
I wonder what the overhead is of always wrapping the codec in TrackingPostingsInMemoryBytesCodec. Maybe let's quickly run a benchmark? (elastic/logs?) Additionally, I wonder whether this should be done for stateless only.
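For illustration, a conditional wrap along these lines would limit tracking to stateless deployments (a sketch only; `isStateless` is a hypothetical flag, and the PR as posted wraps unconditionally):

```java
// Sketch: wrap the codec only when in-memory postings tracking is wanted,
// e.g. in stateless mode. `isStateless` is an assumed flag, not in this PR.
Codec codec = engineConfig.getCodec();
if (isStateless) {
    codec = new TrackingPostingsInMemoryBytesCodec(codec);
}
iwc.setCodec(codec);
```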
import java.io.IOException;
import java.util.function.IntConsumer;

public class TrackingPostingsInMemoryBytesCodec extends FilterCodec {
Maybe add class-level javadocs explaining the purpose of this class?
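Something along these lines could work as a starting point (suggested wording only, inferred from the PR description):

```java
/**
 * A {@link FilterCodec} that wraps the postings format to estimate how many
 * bytes the postings of a segment will occupy in JVM heap. It records the
 * maximum term length seen per field while terms are written and reports it
 * through a callback once the terms enum for that field is exhausted.
 */
```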
Terms terms = super.terms(field);
if (terms == null) {
    return terms;
}
int fieldNum = fieldInfos.fieldInfo(field).number;
return new TrackingLengthTerms(terms, len -> maxLengths.put(fieldNum, Math.max(maxLengths.getOrDefault(fieldNum, 0), len)));
I wonder whether we can do this instead:
Terms terms = super.terms(field);
// Only org.apache.lucene.codecs.lucene90.blocktree.FieldReader keeps min and max term in jvm heap,
// so only account for these cases:
if (terms instanceof FieldReader fieldReader) {
    int fieldNum = fieldInfos.fieldInfo(field).number;
    int length = fieldReader.getMin().length;
    length += fieldReader.getMax().length;
    maxLengths.put(fieldNum, length);
}
return terms;
This way there is way less wrapping. We only care about the min and max term, given that these are what gets loaded into JVM heap.
Scratch that idea. The implementation provided here is different: this gets invoked during indexing/merging, and during indexing the Terms implementation is FreqProxTermsWriterPerField. Invoking getMax() on it is potentially expensive, as it causes reading ahead to figure out which is the max term; these terms get read later via the terms enum.
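For illustration, a hybrid (a sketch only, not what the PR does) could take the cheap path when the reader is a flushed-segment FieldReader and keep the wrapping for the indexing path:

```java
Terms terms = super.terms(field);
if (terms == null) {
    return null;
}
int fieldNum = fieldInfos.fieldInfo(field).number;
if (terms instanceof FieldReader fieldReader) {
    // Flushed segment: min and max term are already materialized on heap.
    maxLengths.put(fieldNum, fieldReader.getMin().length + fieldReader.getMax().length);
    return terms;
}
// Indexing path (FreqProxTermsWriterPerField): getMax() would read ahead,
// so track term lengths lazily while the terms enum is consumed instead.
return new TrackingLengthTerms(terms, len -> maxLengths.put(fieldNum, Math.max(maxLengths.getOrDefault(fieldNum, 0), len)));
```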
public BytesRef next() throws IOException {
    final BytesRef term = super.next();
    if (term != null) {
        maxTermLength = Math.max(maxTermLength, term.length);
    } else {
        onFinish.accept(maxTermLength);
    }
    return term;
}
Given that we need to estimate the terms that get loaded into JVM heap, would the following be more accurate?
int prevTermLength = 0;

@Override
public BytesRef next() throws IOException {
    final BytesRef term = super.next();
    if (term == null) {
        maxTermLength += prevTermLength;
        onFinish.accept(maxTermLength);
        return term;
    }
    if (maxTermLength == 0) {
        maxTermLength = term.length;
    }
    prevTermLength = term.length;
    return term;
}
In the org.apache.lucene.codecs.lucene90.blocktree.FieldReader class, the lexicographically lowest and highest terms are kept around in JVM heap. The current code just keeps track of what the longest term is and reports that, which doesn't map to the minTerm and maxTerm in FieldReader?
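Concretely: for the terms `aa`, `bbb`, `cccc`, the current code reports max(2, 3, 4) = 4, while the suggestion above reports 2 + 4 = 6, the combined length of the min term (`aa`) and max term (`cccc`) that FieldReader would keep on heap.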
This patch adds a field totalPostingBytes to the ShardFields record that tracks the memory usage of the largest term, which may be stored in-memory by the postings FieldReader. Most of this was already done by @dnhatn in #121476, but was never merged.
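As a rough sketch of the accounting side (all names other than totalPostingBytes are assumptions, not taken from the PR), the per-field lengths reported by the codec's callback could be summed into the value the ShardFields record exposes:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Sketch under assumed names: the tracking codec reports each field's tracked
// term length through a callback; the sum is the shard-level totalPostingBytes.
class InMemoryPostingBytes {
    private final Map<Integer, Integer> maxLengths = new ConcurrentHashMap<>();

    // Called by the tracking codec when a field's terms enum is exhausted.
    void onFinish(int fieldNum, int maxTermLength) {
        maxLengths.merge(fieldNum, maxTermLength, Math::max);
    }

    // Aggregate estimate of term bytes pinned in JVM heap by this shard's postings.
    long totalPostingBytes() {
        return maxLengths.values().stream().mapToLong(Integer::longValue).sum();
    }
}
```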