Initial rewrite of MMapDirectory for JDK-17 preview (incubating) Panama APIs (>= JDK-17-ea-b25) #177

uschindler · 2021-06-08T11:12:48Z

INFO: This is a follow-up to #173: it is the same code base, but with the API changes from JDK 17 applied.

This is just a draft PR to give a first insight into the memory mapping improvements in JDK 17+.

Some background information: Starting with JDK 14, there is a new incubating module "jdk.incubator.foreign" that has a new, not yet stable API for accessing off-heap memory (later it will also support calling functions located in native libraries such as .so or .dll files through classical MethodHandles). This incubator module has gone through several versions:

first version: https://openjdk.java.net/jeps/370 (slow, very buggy, and thread-confined, making it unusable with Lucene)
second version: https://openjdk.java.net/jeps/383 (still thread-confined, but now allows transfer of "ownership" to other threads; this is still impossible to use with Lucene)
third version in JDK 16: https://openjdk.java.net/jeps/393 (this version includes "Support for shared segments"). This now allows us to safely use the same external mmapped memory from different threads and also unmap it! This was implemented in the previous pull request, "Initial rewrite of MMapDirectory for JDK-16 preview (incubating) Panama APIs (>= JDK-16-ea-b32)" #173.
fourth version in JDK 17, included in build 25: https://openjdk.java.net/jeps/412 (current version). This mainly changes the API around the scopes. Instead of having segments explicitly made "shared", we can assign them to a resource scope that controls their behaviour. The ResourceScope is created once for each IndexInput instance (not for clones) and owns all segments. When the ResourceScope is closed, all segments become invalid and we throw AlreadyClosedException.

This module more or less overcomes several problems:

The ByteBuffer API is limited to 32-bit addressing (in fact, MMapDirectory has to chunk files into 1 GiB portions).
There is no official way to unmap ByteBuffers when the file is no longer used. There is a way to use sun.misc.Unsafe to forcefully unmap segments, but any IndexInput accessing the file from another thread will then crash the JVM with SIGSEGV or SIGBUS. We learned to live with that and we happily apply the unsafe unmapping, but that's the main issue.

@uschindler had many discussions with the team at OpenJDK, and finally, with the third incubator, we have an API that works with Lucene. These were very fruitful discussions (thanks to @mcimadamore!).

With the third incubator we are now finally able to do some tests (especially of performance). As this is an incubating module, this PR first changes the build system a bit:

disable -Werror for :lucene:core
add the incubating module to the compiler arguments of :lucene:core and enable it for all test builds. This is important, as you also have to pass --add-modules jdk.incubator.foreign at runtime!

The code basically just modifies MMapDirectory to use long instead of int for the chunk size parameter. In addition, it adds MemorySegmentIndexInput, which is a copy of our ByteBufferIndexInput (still there, but unused) that uses MemorySegment instead of ByteBuffer behind the scenes. It works in exactly the same way; only the try/catch blocks for supporting EOFException or moving to another segment were rewritten.
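To make the "moving to another segment" step concrete, here is a minimal sketch of how an absolute file position can be split into a segment index and an offset within that segment once the chunk size is a long power of two. The class and method names are hypothetical and not taken from the PR; only the shift-and-mask idea is assumed.

```java
/** Hypothetical helper: splits an absolute file position into (segment index, offset). */
final class SegmentAddressing {
  private final int chunkSizePower;
  private final long chunkSizeMask;

  /** chunkSizePower = 34 corresponds to 16 GiB chunks (2^34 bytes). */
  SegmentAddressing(int chunkSizePower) {
    if (chunkSizePower < 1 || chunkSizePower > 62) {
      throw new IllegalArgumentException("chunkSizePower must be in [1, 62]");
    }
    this.chunkSizePower = chunkSizePower;
    this.chunkSizeMask = (1L << chunkSizePower) - 1L;
  }

  /** Index of the segment that contains the given absolute position. */
  int segmentIndex(long pos) {
    return Math.toIntExact(pos >>> chunkSizePower);
  }

  /** Offset of the given absolute position inside its segment. */
  long segmentOffset(long pos) {
    return pos & chunkSizeMask;
  }
}
```

With an int chunk size the offset would have to fit into 31 bits; keeping both the mask and the offset as long is what allows chunks larger than 2 GiB.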

It passes all tests and it looks like you can use it to read indexes. The default chunk size is now 16 GiB (you can raise or lower it as you like; the tests do this). Of course you can set it to Long.MAX_VALUE, in which case every index file is always mapped as one big memory mapping. My testing on Windows 10 has shown that this is not a good idea! Huge mappings fragment the address space over time, and as we can only use about 43 or 46 bits (depending on the OS), the fragmentation will kill you at some point. So 16 GiB looks like a good compromise: most files will be smaller than 6 GiB anyway (unless you optimize your index down to one huge segment). So for most Lucene installations, the number of mapped segments will equal the number of open files, which should make heavy Elasticsearch users very happy: the sysctl max_map_count may not need to be touched anymore.
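The effect of the chunk size on the number of mappings can be checked with simple arithmetic. The sketch below is illustrative only: the class name, the method names, and the example file sizes are assumptions, and an empty file is counted as needing no mapping.

```java
/** Hypothetical helper: counts how many memory mappings a set of files needs. */
final class MappingCount {
  /** ceil(fileLength / chunkSize); an empty file is counted as zero mappings here. */
  static long mappingsFor(long fileLength, long chunkSize) {
    if (fileLength < 0 || chunkSize <= 0) {
      throw new IllegalArgumentException("fileLength must be >= 0 and chunkSize > 0");
    }
    return fileLength == 0 ? 0 : (fileLength - 1) / chunkSize + 1;
  }

  /** Sum of the mappings needed for all given files. */
  static long totalMappings(long[] fileLengths, long chunkSize) {
    long total = 0;
    for (long length : fileLengths) {
      total += mappingsFor(length, chunkSize);
    }
    return total;
  }
}
```

For three example files of 512 MiB, 6 GiB, and 20 GiB, a 1 GiB chunk size needs 1 + 6 + 20 = 27 mappings, while a 16 GiB chunk size needs 1 + 1 + 2 = 4. Only the file larger than the chunk size still needs more than one mapping.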

In addition, this implements readLongs in a better way than @jpountz did (no caching or arbitrary objects). Nevertheless, the new MemorySegment API relies on final, unmodifiable classes, and copying memory from a MemorySegment to an on-heap Java array requires us to wrap that array in a MemorySegment each time (e.g. in readBytes() or readLELongs), so there may be some overhead due to short-lived object allocations (those are NOT reusable!). In short: in the future we should stop copying/loading our data onto the heap, and maybe throw away IndexInput completely and base our code fully on random access. The new foreign-vector APIs will in future also be written with MemorySegment in focus, so you can allocate a vector view on a MemorySegment and let the vectorizer work fully outside the Java heap, inside our mmapped files! :-)

It would be good if you could check out this branch and try it in production.
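The "wrap the array on every call" pattern can be illustrated with the stable java.nio API. This is an analogy, not the PR's MemorySegment code: the class and method names are hypothetical, and ByteBuffer stands in for MemorySegment so that the sketch runs without the incubator module.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

/** Hypothetical helper: bulk-decodes little-endian longs from a byte array. */
final class LittleEndianLongs {
  /**
   * Copies {@code count} little-endian longs from {@code src} (starting at byte
   * {@code srcOffset}) into {@code dst} (starting at index {@code dstOffset}).
   * Each call allocates short-lived wrapper objects (the ByteBuffer and its LongBuffer view).
   */
  static void copyLELongs(byte[] src, int srcOffset, long[] dst, int dstOffset, int count) {
    ByteBuffer.wrap(src, srcOffset, Math.multiplyExact(count, Long.BYTES))
        .order(ByteOrder.LITTLE_ENDIAN)
        .asLongBuffer()
        .get(dst, dstOffset, count);
  }
}
```

The wrappers here are cheap views over existing memory, but they are still allocated per call, which is the kind of overhead the paragraph above is concerned about.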

But be aware:

You need JDK 17 to compile and run with Gradle (set JAVA_HOME to it).
The lucene-core.jar will contain JDK 17 class files and requires JDK 17 to execute.
You also need to add --add-modules jdk.incubator.foreign to the command line of your Java program/Solr server/Elasticsearch server.

It would be good to get some benchmarks, especially by @rmuir or @mikemccand.

My plan is the following:

report any bugs or slowness, especially with Hotspot optimizations. The last time I talked to Maurizio, he talked about Hotspot not being able to fully optimize for-loops with long instead of int, so it may take some time until the full performance is there.
wait until the final version of project PANAMA-foreign goes into Java's Core Library (no module needed anymore)
add a MR-JAR for lucene-core.jar and compile the MemorySegmentIndexInput and maybe some helper classes with JDK 18/19 (hopefully?).
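For readers unfamiliar with the loop-shape issue mentioned in the first plan item, the sketch below shows the two forms side by side: a loop driven by a long counter, and the same work split into blocks that are walked with an int counter. The class name, method names, and block size are hypothetical, and no performance claim is made; whether one form is faster depends on the JVM version.

```java
/** Hypothetical illustration of long-indexed versus int-indexed loops. */
final class LoopShapes {
  private static final int BLOCK = 1 << 16; // arbitrary block size for the inner int loop

  /** Sums data[from..to) with a long loop counter. */
  static long sumWithLongIndex(byte[] data, long from, long to) {
    long sum = 0;
    for (long i = from; i < to; i++) {
      sum += data[(int) i];
    }
    return sum;
  }

  /** Same result, but the inner loop uses an int counter over bounded blocks. */
  static long sumWithIntBlocks(byte[] data, long from, long to) {
    long sum = 0;
    long pos = from;
    while (pos < to) {
      final int block = (int) Math.min(to - pos, BLOCK);
      final int base = (int) pos;
      for (int i = 0; i < block; i++) {
        sum += data[base + i];
      }
      pos += block;
    }
    return sum;
  }
}
```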

uschindler · 2021-08-03T08:41:48Z

It works with build 33; a Jenkins job tests it every hour: https://jenkins.thetaphi.de/job/Lucene-jdk17panama-Linux/ and https://jenkins.thetaphi.de/job/Lucene-jdk17panama-Windows/

Before build 25, the Panama API was still at the Java 16 state; in that case you need to use the Java 16 pull request.

In addition, make sure not to use a system-installed Gradle; use ./gradlew instead.

jbhateja · 2021-08-03T15:39:53Z

Thanks @uschindler for your help, I have removed my other comments from this patch to reduce the noise.

markrmiller · 2021-10-06T03:14:39Z

Just stumbled on this. I've also recently thought a bit about Lucene JMH benchmarks that are semi equivalent to lucene-util benchmarks. I've been spending some free time here and there trying to bring some JMH to lucene-util, mainly perfasm output and async profiler, for which I also have a rough patch to use in the tests waiting for some clean up. I have some rough working stuff, but there tends to be a lot of resulting noise in the outputs that I have to look at filtering out or targeting the capture more with a more invasive integration. More than once, I've thought, it would be nicer to just offer the same suite of benchmarks within JMH and capture the other things it offers as well.

As Robert says, it's no replacement for realistic high-level benchmarks in most cases, especially for Solr. I've used the same unit/integration test analogy, but it's fantastic for fast, reliable feedback with superb introspection tools built in.

Also, those lambdas are damn slow until hotspot takes care of them :) I saw that slowness way back and chased what the deal was for a while, I was ready to toss them out of my toolbox, and yeah, they just need time. Then they are fine, probably not great for places that don't run a lot.

JMH will actually run the profilers during warmup runs to warm them up, but then it dumps the data, so it's nice that even profiling just captures the warm iterations if you'd like. I don't like that it also includes setup, as it says, to be sure to catch the edges. I could do without that when you are constructing indexes or something to be queried, but if you set up larger stuff like lucene-util does, to have at hand already, that circumvents it.

markrmiller · 2021-10-06T03:39:34Z

and also enables strong physical isolation of indexing and searching JVMs which have very different resources requirements! OK

This is a real issue with impact that comes up. Indexing on the replica has a surprising impact on query performance if you believe the reports. Those used to adding a bunch of replicas just for read side fan out felt the difference if they tried to migrate.

uschindler · 2022-10-01T17:50:06Z

Closing this as the JDK 19 impl was merged (#912).

uschindler added 25 commits January 2, 2021 14:25

Initial state of new jdk-foreign MMAP API

190a853

Workaround to prevent incorrect test files from being executed (copie…

00d01a7

…d from ANT build)

Fix the remaining TODOs: make sure we unmap all segments if exception…

22c3c4b

…s occur! Remove useless slicing if aligned.

Cleanup code duplication mess exception handling and rename all remai…

f9ca335

…ning "buffer" to "segment"; also make the segments array final (curSegment == null when closed)

add missing ensureOpen() as NPE can't happen here

1a8a354

Cleanup messy duplicate methods

27fce4f

Add workaround for JDK-8259028

efcfccc

Make the JVM crush detector ready for heavy prime time!

8ee976a

Remove incorrect assert (won't work if page size is used like on linux)

fed48bd

Apply @dweiss improvement

50d9300

Merge branch 'master' into draft/jdk-foreign-mmap

8dd5d90

Add readLEFloats() introduced by LUCENE-9652 / apache#2175

0245d3f

Improve test to allow the following exception: "java.lang.IllegalStat…

ea188c1

…eException: Cannot close while another thread is accessing the segment"

Add a new interface to Lucene's core to mark classes which are wrappi…

01aca07

…ng objects to extend their functionality (like asserting in tests)

Split and rewrite getBytes() and remove useless try-with-resources (h…

60200e8

…eap segments don't need this)

Add static final boolean IS_LITTLE_ENDIAN and cleanup if statements

ba61072

Merge branch 'master' into draft/jdk-foreign-mmap

ec304ab

Remove hacks: JDK-16 EA b32 has now fixed the horrible bugs with zero…

5edcdf4

… length mappings and offsets

Merge branch 'master' into draft/jdk-foreign-mmap

b0eec7a

Improve close method to also null out the segments, so positional API…

7a3cf53

… can correctly throw AlreadyClosedEx; TODO: add a test

Merge branch 'master' into draft/jdk-foreign-mmap

d2c0be5

Merge branch 'draft/jdk-foreign-mmap' of ../lusolr into draft/jdk-for…

5542f2c

…eign-mmap

Update to little endian (LUCENE-9047)

e76bdc0

Apply spotless formatting

a7a4be2

Update to Java 17 (complete API change regarding resource/memory scop…

545efdf

…es); fix warnings

uschindler added the enhancement label Jun 8, 2021

uschindler self-assigned this Jun 8, 2021

uschindler marked this pull request as draft June 8, 2021 11:13

Ensure scope is released on error in IndexInput ctor

6d16a8d

uschindler mentioned this pull request Jun 8, 2021

Initial rewrite of MMapDirectory for JDK-16 preview (incubating) Panama APIs (>= JDK-16-ea-b32) #173

Closed

uschindler added 2 commits August 6, 2021 18:32

Merge branch 'main' of https://gitbox.apache.org/repos/asf/lucene int…

a89df56

…o draft/jdk-foreign-mmap-jdk17

Merge branch 'main' of https://gitbox.apache.org/repos/asf/lucene int…

195278c

…o draft/jdk-foreign-mmap-jdk17 # Conflicts: # lucene/core/src/java/org/apache/lucene/store/MMapDirectory.java

Merge branch 'main' of https://gitbox.apache.org/repos/asf/lucene int…

c12f412

…o draft/jdk-foreign-mmap-jdk17 # Conflicts: # gradle/testing/defaults-tests.gradle # lucene/core/src/java/org/apache/lucene/util/Unwrappable.java

This was referenced Nov 18, 2021

Consider contributing to the search benchmark game mikemccand/luceneutil#150

Open

introduce off-heap big arrays elastic/elasticsearch#73755

Draft

uschindler added 2 commits November 29, 2021 16:32

Merge branch 'apache:main' into draft/jdk-foreign-mmap-jdk17

0d3eace

Merge branch 'apache:main' into draft/jdk-foreign-mmap-jdk17

8a5cfd5

uschindler mentioned this pull request Dec 5, 2021

Initial rewrite of MMapDirectory for JDK-18 preview (incubating) Panama APIs (>= JDK-18-ea-b26) #518

Closed

uschindler and others added 7 commits December 20, 2021 13:16

Merge branch 'main' of https://gitbox.apache.org/repos/asf/lucene int…

8024974

…o draft/jdk-foreign-mmap-jdk17

Update to module system branch

895a15e

# Conflicts: # gradle/java/javac.gradle

Remove unneeded module

dfaef58

Merge branch 'draft/jdk-foreign-mmap-jdk17' of https://github.com/usc…

26755bf

…hindler/lucene into draft/jdk-foreign-mmap-jdk17

Merge branch 'main' of https://gitbox.apache.org/repos/asf/lucene int…

30bf7a4

…o draft/jdk-foreign-mmap-jdk17 # Conflicts: # lucene/core/src/java/org/apache/lucene/store/MMapDirectory.java # lucene/core/src/test/org/apache/lucene/store/TestMmapDirectory.java

fix formatting

85e3234

Fix test after modularization

024555b

uschindler force-pushed the draft/jdk-foreign-mmap-jdk17 branch from dda2e38 to 024555b Compare December 22, 2021 08:47

uschindler added 4 commits December 29, 2021 19:28

Merge branch 'main' into draft/jdk-foreign-mmap-jdk17

3c6209a

# Conflicts: # lucene/core/src/java/module-info.java # lucene/core/src/java/org/apache/lucene/store/MMapDirectory.java

Merge branch 'main' of https://gitbox.apache.org/repos/asf/lucene int…

851d9c4

…o draft/jdk-foreign-mmap-jdk17 # Conflicts: # lucene/core/src/java/org/apache/lucene/store/MMapDirectory.java # lucene/replicator/src/test/org/apache/lucene/replicator/nrt/TestStressNRTReplication.java

Cleanup for Module System and Java 17 and disable test again (merge e…

be714b0

…rror)

Merge branch 'main' of https://github.com/uschindler/lucene into draf…

03d5e42

…t/jdk-foreign-mmap-jdk17

This was referenced May 20, 2022

Outdated: Initial rewrite of MMapDirectory for JDK-19 preview Panama APIs (>= JDK-19-ea+23) #911

Closed

MR-JAR rewrite of MMapDirectory with JDK-19 preview Panama APIs (>= JDK-19-ea+23) #912

Merged

mocobeta removed the enhancement label Jul 15, 2022

uschindler closed this Oct 1, 2022

uschindler mentioned this pull request Dec 8, 2023

Move group-varint encoding/decoding logic to DataOutput/DataInput #12841

Merged
