In their "tiered mode", they put sampling instrumentation into the AOT-compiled native code, and if they detect a hotspot, they regenerate fully instrumented native code from the bytecode using the C1 (fast) JIT, which then lets the C2 JIT apply its full optimizations to the code as if AoT had never been involved.
Since the invention of tracing JITs, I've often wondered why languages don't package a compact serialized SSA form such as LLVM bitcode or SafeTSA together with functions stored as lists of pointers to space-optimized compilations of extended basic blocks (straight-line code), similar to how some Forth compilers generate threaded code. A threaded-code dispatcher over these straight-line segments of native code would have minimal overhead, and when a simple SIGPROF lightweight sampler detected a hotspot, a tracing version of the dispatcher could collect a trace and then generate native code from the visited traces using the stored SSA for the basic blocks.
In this way, they'd have a lightweight tracing JIT for re-optimizing native code.
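A toy sketch of the dispatch idea (my own illustration, not taken from any real system): model each precompiled straight-line segment as a callable unit and let the dispatcher simply walk the list. The loop below is roughly the per-segment overhead being described; a real system would thread through native code pointers rather than Java lambdas.

```java
public class ThreadedDemo {
    interface Segment { int run(int acc); }

    static int demo() {
        // Each entry stands in for a space-optimized compilation of one
        // straight-line extended basic block.
        Segment[] code = {
            acc -> acc + 2,   // segment 0
            acc -> acc * 3,   // segment 1
            acc -> acc - 1    // segment 2
        };
        int acc = 5;
        // The entire "threaded code" dispatcher: just step through segments.
        for (Segment s : code) acc = s.run(acc);
        return acc;
    }

    public static void main(String[] args) {
        System.out.println(demo()); // (5 + 2) * 3 - 1 = 20
    }
}
```

A tracing variant of this loop would additionally record which segments were visited, then hand that trace (plus the stored SSA) to a compiler.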
SDE didn't propose starting with SSA, but it could easily work with an SSA representation. SDE basically functions as a compression mechanism for a semantic IR, building a dictionary during compression/decompression in a way reminiscent of LZW. So instead of storing straight bytecode, you store a compact higher-level representation (which could very well be SSA) that is structured so that you can generate code while "decompressing" it, reusing generated code fragments as "templates" for later fragments.
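For the LZW analogy, here is a minimal sketch (my own toy, not SDE itself) of the key property: the dictionary is rebuilt in lockstep during decompression, which is what lets previously seen fragments be reused as templates for later ones.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class LzwSketch {
    // Classic LZW compression over a string of 8-bit characters.
    static List<Integer> compress(String input) {
        Map<String, Integer> dict = new HashMap<>();
        for (int i = 0; i < 256; i++) dict.put(String.valueOf((char) i), i);
        List<Integer> out = new ArrayList<>();
        String w = "";
        for (char c : input.toCharArray()) {
            String wc = w + c;
            if (dict.containsKey(wc)) {
                w = wc; // keep extending the current dictionary match
            } else {
                out.add(dict.get(w));
                dict.put(wc, dict.size()); // new entry learned on the fly
                w = String.valueOf(c);
            }
        }
        if (!w.isEmpty()) out.add(dict.get(w));
        return out;
    }

    public static void main(String[] args) {
        // Repeated substrings get replaced by dictionary references.
        System.out.println(compress("ABABABA"));
    }
}
```

In SDE the dictionary entries are program fragments rather than strings, so "decompression" can double as code generation.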
An implementation was built in Oberon: a compact tree representation (you could do a DAG with some adjustments) that mirrors your code-generation order, and it was used, e.g., to support PPC and M68k from the same "binaries" in MacOberon. The way it was structured makes retaining arbitrary higher-level structure of the programs very straightforward.
I keep wanting to do something with SDE, but life keeps intervening... I see it as a huge shame that more work didn't go into exploring that alternative to straight-up bytecode, but it simply had too little of a head start on Java, and I believe Franz moved to Java for his subsequent research on code generation.
[1] https://en.wikipedia.org/wiki/Semantic_dictionary_encoding
You may be interested to look further into Eclipse OMR, which is a generic VM used by IBM for many of their runtimes (including J9). The Testarossa JIT support landed last week, and although it doesn't support a bitcode form directly, there are optimisations that can be used to separate the static parts of a class from the dynamic parts, to facilitate loading. There is an IL for both the JIT and the interpreter to use.
I (and others) have noted that for more than a decade, it seems that Java would have been better off under IBM than under Sun/Oracle (SWT vs. Swing/AWT, jikes vs. javac, Jalapeno/JikesRVM vs. not much interesting research until Graal, etc.) It's really a shame IBM didn't buy up Sun's Java intellectual property at fire sale prices.
Commercial JDKs always offered AOT compilation, the problem is that people nowadays apparently don't buy compilers anymore unless forced to do so (e.g. embedded, consoles...).
Desktop Java also had many other problems, which can be summarised as "the JVM is its own OS". You can't write an application in Java that has a native look and feel. Or at least you couldn't for the first several significant years of its life and even now I don't think there's a good story for writing a simple native application. Meanwhile you could grab wxWidgets or Qt (and there goes your budget for a java compiler) and have a native-looking cross-platform application. Which very few did, because back then Mac OSX didn't exist, Apple were on their death bed and "Linux Desktop Environment" was even more of a joke than it is today.
So yeah, it didn't make any bit of sense to develop Java desktop apps given that you already had a large pool of proficient C++ developers, the only platform you cared about was Windows and Java GUI libraries insisted on reinventing their own look and feel. Oh and you could always just buy Delphi if you didn't want to suffer C++ (again, for a fraction of the price of a commercial Java compiler).
Nowadays people wrap a bunch of javascript in an electron instance, but this only happened after the web took off and nobody really looks at native desktop apps much. If this AOT work can give us fully contained native executables that we can distribute without having the user install Java and with significantly better performance than nodejs, maybe Java on the desktop can still happen.
Lest the title be changed:
AOT compilation is coming to Java 9 (java.net)
18 points by hittaruki 37 minutes ago https://www.youtube.com/watch?v=Xybzyv8qbOc
The project seems to have gone slower than I expected, perhaps because Chris Thalinger moved to Twitter. This implies it will be in Java 9 (in a limited fashion).
http://alblue.bandlem.com/2016/09/javaone-hotspot.html
The presentation wasn't recorded but there is a video recorded from a DocklandsLJC event which is on InfoQ:
https://www.infoq.com/presentations/hotspot-memory-data-stru...
That potentially includes the fully resolved types of objects (ie devirtualization), branch prediction (stronger than the CPU can do; for instance, if a value is only used inside a branch that's never taken, don't bother mutating it), data sizes (this "array" is only ever size 2, store it in registers), dead code elimination (keeps the compiled code small), and a whole bunch more fun stuff.
Stuff like this makes me nervous. Performance is already a complex topic, and stuff like this makes it even more complex. Unnecessarily so. If we were talking about a very high-level programming language (say, Prolog), you could argue that the expressiveness benefits outweigh the cost of the runtime system's complexity. But Java isn't even as expressive as C++, let alone Prolog.
> fully resolved types of objects (ie devirtualization)
C++ (and similar languages: D, Rust, etc.) and MLton (a Standard ML implementation) have been using monomorphization for ages, which is a compile-time analogue of devirtualization. Moreover, monomorphization has important advantages over devirtualization:
(0) It's completely predictable. You don't need to guess when it will happen. It happens iff the concrete type (and its relevant vtables, if necessary) can be determined at compile-time: https://blog.rust-lang.org/2015/05/11/traits.html
(1) It's always a sound optimization, so it doesn't have to be undone at runtime under any circumstances.
(2) It's relatively simple to implement. In fact, a compiler front-end can completely monomorphize a program before handing it over to the back-end for target code generation.
> if a value is only used inside a branch that's never taken, don't bother mutating it)
The best way to handle unreachable branches is to avoid creating them in the first place. With proper use of algebraic data types and pattern matching, unreachable branches can be kept to a minimum, or even outright eliminated in many cases.
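As a hedged illustration in modern Java (sealed types and switch patterns arrived years after this thread, so this is an anachronistic sketch of the same idea): a sealed hierarchy lets the compiler prove a switch exhaustive, so no unreachable default branch ever exists to be optimized away.

```java
// The sealed keyword closes the hierarchy: Circle and Square are the
// only possible Shapes, which the compiler can rely on.
sealed interface Shape permits Circle, Square {}
record Circle(double r) implements Shape {}
record Square(double s) implements Shape {}

public class ShapeDemo {
    static double area(Shape sh) {
        return switch (sh) {
            case Circle c -> Math.PI * c.r() * c.r();
            case Square q -> q.s() * q.s();
            // no default branch: the compiler knows the cases are exhaustive
        };
    }

    public static void main(String[] args) {
        System.out.println(area(new Square(3)));
    }
}
```

This is the static-analysis route to the same end the JIT reaches by profiling: dead branches never make it into the program at all.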
> data sizes (this "array" is only ever size 2, store it in registers)
C and similar languages natively handle statically sized arrays, so there's no need for runtime profiling and analysis just to determine that an array will always have size 2.
ML does something even better: you just use tuples (in this case, pairs), which reflect your intent much better than using arrays whose size has to be tested or guessed.
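Java's closest analogue today (a sketch using records, which arrived in Java 16, long after this thread): encode the size-2 structure in the type itself, so nothing has to be profiled or guessed at runtime.

```java
// A pair as a record: the "array is only ever size 2" fact lives in the
// type, where both the reader and the compiler can see it statically.
record Pair(int first, int second) {
    int sum() { return first + second; }
}

public class PairDemo {
    public static void main(String[] args) {
        Pair p = new Pair(1, 2);
        System.out.println(p.sum());
    }
}
```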
---
What I take away from this is that the JVM's supposedly “fancy” optimizations exist primarily to work around the Java language's lack of amenability to static analysis.
I always got the sense the world is waiting for a statically typed Python that compiles to native code with Go's CPU performance. I suppose Nim might fit that bill but a shame it doesn't have compatibility with Python's or even the extent of a language like Go's libraries. And if possible, an imperative language that interfaces with OTP.
And that said, I can see why Erlang/Elixir wouldn't make as much sense, or even work, with native AOT compilation due to its feature set (thinking of stuff like hot code reloading). But I've never grasped why Java or Python were better off with JITs or interpreters than AOT compilation. It seems a type system such as Go's is simple enough and allows for good gains in both CPU performance and memory usage. Add in that you don't need to install anything and there's less to think about when deploying, and it seems like a no-brainer. Please feel free to fill me in on this or tell me where I went wrong.
https://web.archive.org/web/20050420081440/http://java.sun.c...
When the appliance market didn't pan out, they went for web browsers and Java applets. Bytecodes were a feature because browsers didn't execute native code, and because they allowed for sandboxing to limit the attack surface.
Even when Java became more popular on the server than in the browser, the "write once, run everywhere" was considered a major feature: The same bytecode could be distributed everywhere; no need to maintain a heap of different build environments for different CPU architecture and OS combinations.
Abstracting the CPU has worked out pretty well for the Java platform. Look at how easy the 64-bit transition was for the Java world vs the C++ world. Visual Studio is still not a 64-bit app, and yet Java IDEs hardly even noticed the change. The transition on Linux was just a disaster zone; every distro came up with its own way of handling the incompatible flavours of each binary.
In addition, a simple JIT compiled instruction set makes on the fly code generation a lot easier in many cases and it's a common feature of Java frameworks. For instance the java.lang.reflect.Proxy feature is one I was using just the other day and it works by generating and loading bytecode at runtime. On the fly code generation is considered a black art for native apps and certainly extremely non portable, but is relatively commonplace and approachable in Java.
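For the curious, here is a minimal, self-contained example of java.lang.reflect.Proxy (a standard API; the Greeter interface is just an illustration). The JVM generates and loads a new class at runtime that implements the interface, routing every call to the handler.

```java
import java.lang.reflect.InvocationHandler;
import java.lang.reflect.Proxy;

interface Greeter { String greet(String name); }

public class ProxyDemo {
    static Greeter make() {
        // The handler receives every method call on the generated proxy.
        InvocationHandler handler = (proxy, method, args) ->
            "Hello, " + args[0] + " (via " + method.getName() + ")";
        // newProxyInstance generates bytecode for a Greeter implementation
        // and loads it on the fly.
        return (Greeter) Proxy.newProxyInstance(
            Greeter.class.getClassLoader(),
            new Class<?>[] { Greeter.class },
            handler);
    }

    public static void main(String[] args) {
        System.out.println(make().greet("world"));
    }
}
```

Doing the equivalent in a native AOT-compiled app means generating machine code into executable memory by hand, which is why it's considered a black art there.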
So they could keep the WORA story and still offer AOT as an option, which actually most commercial JDKs do.
Just Sun was against providing it at all on Java SE, but they actually supported it on Java Embedded.
Talking about AOT compilation at Sun was taboo, and I remember seeing a few forum discussions where former employees disclosed this.
Plenty of other platforms do support bytecodes, JIT and AOT on the same toolchain.
Java is old. It's seen a lot of CPU architectures come and go over the years. When it started out x86, SPARC and POWER, were important. Then it saw a mass migration from x86 to amd64 on the desktop and server side, and an explosion in the importance of ARM in mobiles (several flavours).
Along the way it's seen lots of smaller proprietary architectures come and go too, like the exotic DSP-oriented processors found in BluRay players and pre-smartphone phones and like the Azul Vega architecture that was specifically designed for executing business Java.
And don't forget that even amd64 is not a homogeneous architecture. It adds new CPU instructions pretty regularly and thus can be seen as a long line of compatible but different CPU architectures. Java apps transparently get support for all of them on the fly, without having to recompile the world. You see the benefit when you realise the size of Maven Central ... there are JARs out there that are still useful and good even a decade after they were compiled, yet they still get optimised to full speed using the latest CPU instructions no matter what kind of computer you use.
What costs java performance these days is not the quality of the JIT compilers or even the garbage collectors. It's the object layout that is not very cache-friendly. There is lots of pointer-chasing going on since there are no arrays-of-structs.
Valhalla[0] promises to improve the data-layout issue at some point in the future, while Graal may allow compiler writers to cram some more optimizations into the JITs.
I'm not aiming this just at you, but I think many people (node.js users in particular come to mind) don't realise just how good the JVM is, performance-wise. I'm not a great fan of Java the language, but the JVM is top class.
The primary thing people seem to like about Go is that it produces single native binaries. You can do that with Java too (I gave an example of Avian further up the thread), but people don't tend to bother because distributing a single JAR is not much harder and avoids any assumptions about what OS the recipient might have. Go users seem invariably to be writing programs for their own use and Go doesn't really "do" shared libraries, so they don't ever encounter the problem of distributing a binary of the wrong flavour because they don't distribute binaries at all.
By the way, in Java 8 there's a tool that produces Mac, Linux and Windows standalone packages and installers that don't depend on any system JVM. I've used it to distribute software successfully, although I had to make my own online update system for it. In Java 9 it's being extended quite a bit with the new "jlink" tool that does something similar to static linking ... the output of jlink is either a directory that's a standalone JRE image optimised and stripped to have only the modules your app needs, or you can combine it with the other tool to get a MacOS DMG (with an icon, code signing etc), Windows MSI/EXE (ditto), or a Linux DEB/RPM/tarball.
This isn't a single file at runtime of course, it's a single directory, but basically any complex native app will have data files and some sort of package too so that's not a big deal.
Most commercial JDKs do support AOT compilation to native code, and alongside Java library and eco-system, it definitely makes it more than a solid competitor to Go.
The problem is that free AOT compilers were never much of a match for the ones in commercial JDKs, and in this day and age most developers don't pay for compilers unless forced to do so.
So Java AOT compilers are usually only used by enterprise companies.
For Go, .go -> native
For Java, .java -> .class -> package .jar -> AOT native
For Go part I might be wrong, not working on Go professionally.
That sort of makes no sense. How can you incur a real performance hit if the uncompiled method is rarely called?
Of course, I'm not an expert on JVMs, so I wouldn't know whether their analysis is synchronous or asynchronous or a mix of both.
From what I could gather, this is the process one would follow to get native code:
.java -> javac -> .class (still cross-platform bytecode) -> jaotc -> .so native code
My general impression is that the design of classloaders is pretty actively hostile to making JVM startup fast.
AOT and JIT are not mutually exclusive. From the proposal itself:
> AOT libraries can be compiled in two modes:
> Non-tiered AOT compiled code behaves similarly to statically compiled C++ code in that no profiling information is collected and no JIT recompilations will happen.
> Tiered AOT compiled code does collect profiling information. The profiling done is the same as the simple profiling done by C1 methods compiled at Tier 2. If AOT methods hit the AOT invocation thresholds these methods are being recompiled by C1 at Tier 3 first in order to gather full profiling information. This is required for C2 JIT recompilations to be able to produce optimal code and reach peak application performance.
The other thing it adds is the backing of a giant, like Oracle, which can bring stability and peace of mind to some people, when deciding whether to adopt the technology or not.
- can ship something in time
- and that it will be generally available for developers (looking at how hard Oracle pushes their Java department to invent commercial features they can sell, I'm not sure about that)
Looking at it, I assume that this will go the way of GWT ... not starting from "how can we make Java a good citizen in this new ecosystem?", but "here we have 100% of Java, the JDK and the JVM ... how can we compile this with full fidelity into X?".
Some other JVMs (at least Azul's Zing) try to solve this by caching profiling information to speed up code generation.
https://www.youtube.com/watch?v=Xybzyv8qbOc
Basically they thought it'd de-opt too much. I'm not totally sure that's the case, but they'd be the experts on that.