If anything, OCX is an example of how not to do shared libraries.
And how can it be that a binary called "cmd/compile" has 170k symbols (that's like, global definitions, right?). Not that that's a huge number in terms of today's computing power, but how many millions of lines of source code does that correspond to?
Still, 1M relocations, or 34MB of "Reloc" objects, as indicated, shouldn't be a huge issue to process. Object files should have minimal parsing overhead. Is there any indication how long this takes to link? Shouldn't 1 or 2 secs be sufficient? (assuming 100s of MB/s for sequential disk read/write, < 1us to store each of the 170k symbols in a hashmap, and < 1us to look up each of the 1M relocations).
- I don't think mmap should be used if it can be avoided. It means giving up control over what parts of the file are loaded in memory. And from a semantic point of view, that memory region still must be treated differently, since on-disk and in-memory data structures are not the same.
No, not just global definitions. Closures need linking, too. But the linker is doing much more than linking function entry points. Many automatic (on the stack) variables need linking so the GC can (a) trace the object graph and (b) move them when resizing the stack. Likewise, type definitions require metadata generation for GC tracing. And then there's all the debugging data that needs to be generated, which basically involves everything.
As for stack variables and the GC: the stack is scanned exactly for all but the last frame (I believe), but conservatively for the last frame. This makes it much easier.
Types -- you're partially right. It is mostly all generated in the compiler, and deduped by the linker.
There's no reason Go's linker couldn't be much simpler, likely simpler even than a standard C linker. You might argue that Go will give up things like LTO, but Go designs out much of the LTO problem (not PGO) by the nature of how packages work, and the fact that there are no cyclic dependencies.
Overall, the Go linker could be quite simple. It just needs some rework is all. :)
[I've implemented a Scheme before, in C, for context, so I'm wondering if I'm reading what you wrote with common vocabulary.]
In Go the linker also generates DWARF, does dead-code elimination, and a bunch of other stuff described in the associated document.
> And how can it be that a binary called "cmd/compile" has 170k symbols (that's like, global definitions, right?). Not that that's a huge number in terms of today's computing power, but how many millions of lines of source code does that correspond to?
As described in the link each global function actually ends up producing 4 symbols.
> Shouldn't 1 or 2 secs be sufficient?
On my system linking cmd/compile takes 1.3 seconds. The problem is that object files get cached so if you have a hot cache (let's say you made a few changes inside a single package and then recompile) compiling takes almost no time and linking accounts for nearly 100% of the build time.
This of course only makes sense for development, where the object files are still available for GDB to load the debug symbols from, but it is a nice feature.
The MSVC linker has /DEBUG:FASTLINK as well.
Even if it doesn’t do that, fixing up various addresses can be a lot of work if the linker can make code shorter (e.g. by using short branches where possible), or if it tries to increase cache locality by changing function order.
What a great unit of measure :)
[1] Array of opcodes indexed with a program counter (pc) with a giant switch statement for executing each opcode. A simple stack for data implemented as an array and a stack pointer/index (sp). There are plenty of examples online, just make sure you actually implement things yourself, and only go to the examples to answer questions you arrive at yourself. If you understand loops and arrays implementing the VM is trivial. If you can parse a text file line-by-line and split words, implementing the assembler is trivial. Working out the linking might take some thinking, but that's the point. It's like one night of work, max, unless you really get sucked in ;)
[2] E.g. Harvard architecture vs. von Neumann architecture
I think there would be a market for a proprietary compiler and maybe an IDE to go with it — if the performance was better than the open source one. I think this is achievable because as good as Go's performance is now, there's still a lot of headroom. Google isn't exploiting modern CPUs very well, and the linker is not doing extensive LTO.
The biggest constraint is the blazing fast compile times. A compiler and toolchain that was able to take some time for optimization might deliver markedly better runtime performance.
There may be a market, but not one large enough to pay a few top-notch compiler/low-level systems hackers. I only know of one commercial Go IDE, and the constant refrain there is how a poor third-world developer could afford it. Though I feel the real issue is that developers raised on a diet of free software feel entitled to it. Paying for good software seems alien to them.
Maybe gocc?
In many ways Go seems like an excuse for Google to fund the continued development of Plan 9. Three of the five most influential people on the Go team (Ken Thompson, Rob Pike, and Russ Cox) were heavily involved in Plan 9. And it shows. Go's toolchain is a direct descendant of the Plan 9 toolchain; in fact, the Go language is really just an evolution of the special C dialect that Plan 9 used [1]. Indeed, for a while, the Go compiler was written in this special dialect of C, and so building Go required building a C compiler (!) that could compile this custom dialect of C, and using that to compile the Go compiler [2].
By all accounts, Plan 9 was an interesting research project, and seems well loved by those familiar with it. (I'm not personally familiar; it was well before my time.) But it never took off. What we ended up with is Linux, macOS, and Windows.
Go very much wants to be Plan 9. Sure, it's not a full-fledged operating system. But it's a linker, assembler, compiler, binutils, and scheduler. All it asks of the host system is memory management, networking, and filesystem support, and it will happily replace your system's DNS resolution with a pure Go version if you ask it to [3]. I wouldn't be surprised if Go ships its own TCP/IP stack someday [4].
This is, in my opinion, craziness. What other language ships its own assembler?! [5] To make matters worse, the assembly syntax is largely undocumented, and what is documented are the strange, unnecessary quirks, like
> Instructions, registers, and assembler directives are always in UPPER CASE to remind you that assembly programming is a fraught endeavor. (Exception: the g register renaming on ARM.)
> In the general case, the frame size is followed by an argument size, separated by a minus sign. (It's not a subtraction, just idiosyncratic syntax.)
> In Go object files and binaries, the full name of a symbol is the package path followed by a period and the symbol name: fmt.Printf or math/rand.Int. Because the assembler's parser treats period and slash as punctuation, those strings cannot be used directly as identifier names. Instead, the assembler allows the middle dot character U+00B7 and the division slash U+2215 in identifiers and rewrites them to plain period and slash.
The excuse for the custom toolchain has always been twofold, that a) LLVM is too slow, and fast compiles are one of Go's main features, and b) that the core team was too unfamiliar with GCC/LLVM, at least in the early days, and attempting to build Go on top of LLVM would have slowed the speed of innovation to a degree that Go might not exist [6].
I've always been skeptical of argument (b). After all, one of Go's creators literally won a Turing award, as this document not-so-subtly mentions. I'm quite sure they could have figured out how to build an LLVM frontend, given the desire. Rust, for example, is quite a bit more complicated than Go, and Mozilla's developers have had no trouble integrating with LLVM. I suspect the real reason was that hacking on the Plan 9 toolchain was more fun and more familiar—which is a very valid personal reason to work on something! But it doesn't mean it was the right strategic decision.
I will say that (a) is valid. I recently switched from writing Go to writing Rust, and I miss the compile times of Go desperately.
That said—and this is what I can't get past—the state of compilers would be much better off if the folks on the Go team had invested more in improving the compile and link times of LLVM or GCC. Every improvement to lld wouldn't just speed up compiles for Go; it would speed up compiles for C, C++, Swift, Rust, Fortran, Kotlin, and anything else with an LLVM frontend.
In the last year or so, the gollvm project [7] (which is exactly what you'd expect: a Go frontend for LLVM) has seen some very active development, and I'm following along excitedly. Unfortunately I still can't quite tell whether it's Than McIntosh's 20% time project or an actual staffed project at Google, albeit a small-time one. (There are really only two committers, Than and Cherry Zhang.) There are so many optimizations that will likely never be added to gc, like a register-based calling convention [8] and autovectorization, that you essentially get for "free" (i.e., with a bit of plumbing from the frontend) with a mature toolchain like LLVM.
There are not many folks who have the knowledge and expertise to work on compilers and linkers these days, and those that do can command high salaries. Google is in the luxurious position of being able to afford many dozens of these people. I just wish that someone with the power to effect change at Google would realize that the priorities are backwards. gccgo/gollvm are where the primary investment should be occurring, and the gc toolchain should be a side project that makes debug builds fast... not the production compiler, where the quality of the object code is the primary objective.
[0]: https://dave.cheney.net/2013/10/15/how-does-the-go-build-com...
[1]: http://doc.cat-v.org/plan_9/programming/c_programming_in_pla...
[2]: https://docs.google.com/document/d/1P3BLR31VA8cvLJLfMibSuTdw...
[3]: https://golang.org/pkg/net/
[4]: https://github.com/google/netstack
[5]: https://golang.org/doc/asm
[6]: https://golang.org/doc/faq#What_compiler_technology_is_used_...
It never would have happened. How do you motivate people whose principal frustration is the state of C++ to work on a large C++ codebase?
Heterogeneity is a huge benefit to any ecosystem. Improving existing things is great, but building new things is also very important. Go would simply not exist today if it were built on LLVM or GCC.
I agree that enthusiasm is important! And indeed, for the Go creators, their particular leanings might have been such that they couldn't get excited about building an LLVM/GCC frontend, and adapting the Plan 9 toolchain is literally the only way those three could have gotten Go off the ground. As a member of the Go team, you'd certainly know better than I.
But Go is long past a personal passion project. Go is over ten years old. Go likely has over a million developers [0]. Go 1.0 has been stable for about seven years, and the first meaningful changes to the language are just now being talked about. In my opinion, it is several years past due for Google to start investing seriously in a Go toolchain based on a mature compiler stack.
I realize the audacity of this claim and I don't make it lightly. But if I had the money to spend on a team of developers, I would spend it making llvm-as and lld fast enough and stable enough to be Go's assembler and linker, and abandon the custom Plan 9 ones.
> It never would have happened. How do you motivate people whose principal frustration is the state of C++ to work on a large C++ codebase?
Well, for one, once the language gets off the ground, you can write the frontend in the new language. Rustc manages to be almost entirely Rust, for example.
> Heterogeneity is a huge benefit to any ecosystem. Improving existing things is great, but building new things is also very important.
I agree, and I think Go is an interesting contribution to the P/L landscape—essentially it proved that stripping away a good deal of complexity (generics, inheritance, etc.) results in a very useful, highly productive language. But I don't think Go's custom assembler and linker are meaningfully contributing to the ecosystem. They're useful presently in that they improve Go developers' productivity with ultra-fast builds, but they're not suitable for use by anything but Go. Improvements to Go's linker and assembler benefit only Go. Improvements to lld or gold can benefit practically everyone using a compiled language.
Yeah the creators of go didn't come to praise C++ but to bury it. To kill C++ you need first to knock out gcc and LLVM.
As enneff alludes, Ken Thompson's antipathy towards C++ is well documented: https://bryanpendleton.blogspot.com/2009/12/coders-at-work-k...
Right, you need to specifically throw money and people at the problem of making LLVM faster, not just at LLVM in general. Neither Swift nor Objective-C have "fast compiles" as part of their pitch. Much of the work on LLVM goes into producing the highest quality object code possible, which is a goal often at odds with compiling quickly, and part of the reason the choose-your-optimization-level flag (-O) exists, though -O0 compiles are still not fast enough.
> In hindsight the Go team made the right call to use their own toolchain.
No, we don't have the benefit of hindsight yet. We don't know what could have been if the resources that had been spent on the Go toolchain had been spent on LLVM instead.
If five highly-qualified engineers spent five years trying to speed up gollvm compiles without success, we'd have strong evidence that something about LLVM prohibits the fast compiles that are possible with the gc toolchain. But that's not the situation.
The kinds of optimizations LLVM does are way beyond anything Go's toolchain does. Go doesn't even optimize passing function parameters in registers, let alone the advanced optimization techniques LLVM and GCC use.
As far as I'm aware, one major reason why Go reinvented so much was to save time and effort. They did whatever they could in order to bootstrap fast and efficiently, and they did so by cannibalizing Plan 9's toolchain, including the compiler and assembly language. The original "gc" toolchain (with inscrutable binary names like "6g" and "6a"), was written in C and came directly from Inferno. You can browse the original commit here [2]. That stuff has all been rewritten in Go.
A key attribute I and others have noticed about highly productive developers is that they tend to build an effective toolchain around themselves and bring it with them for new projects. Sometimes that stuff can become legacy baggage, but there's no denying that it's a good strategy.
Perhaps LLVM or GCC would also have been a good strategy. There are some arguments to the contrary. 11 years ago, when Go was started, LLVM wasn't nearly as mature as today. But look at the hurdles other projects like Rust have had to get over with LLVM. And a large part of Rust's compilation speed is apparently due to LLVM. So LLVM is not a magic bullet. Migrating Go today to LLVM would of course be a big, time-consuming zero-velocity project; you'd want to be really certain that the payoff would be worth the effort.
GCC is not an easy project to deal with, either. For decades, its internal intermediate representation was undocumented and intentionally obfuscated [2] to ensure FSF/GNU control over backends.
I do agree that Go has a certain bias towards a particular, idiosyncratic way of doing things, which is not always a positive.
I don't mean to imply that there's a conspiracy here. What I mean is that I think the Plan 9 heritage is clouding strategic decisions around the Go toolchain. What may have been the right decision to get a new language off the ground is not necessarily the right decision once the language is widely popular and stable.
> Migrating Go today to LLVM would of course be a big, time-consuming zero-velocity project; you'd want to be really certain that the payoff would be worth the effort.
A big project, yes, but it's happening! [3] If a few engineers working on gollvm for a year or two could improve gollvm to the point that it made the average Go program run 20% faster, I'd think that would absolutely be worth it to Google.
> GCC is not an easy project to deal with, either. For decades, its internal intermediate representation was undocumented and intentionally obfuscated to ensure FSF/GNU control over backends.
Very true in general, but the Go team has been lucky in that GCC core maintainer Ian Taylor has been a member since the early days. Gccgo has been a spec-conformant Go compiler since 2012 [2], and so nearly all of the GCC-integration bits have been in place for seven years. As a result, it's far more a matter of staffing the project so that the Go frontend's inliner, garbage collector, and escape analysis can reach parity with gc, rather than dealing with GCC/FSF politics.
> You can browse the original commit here. That stuff has all been rewritten in Go.
Yeah, I'm familiar with the lineage. I don't know that I'd say that it's been rewritten in Go, though, as the toolchain and runtime were converted fairly automatically with a "c2go" tool that Russ Cox wrote [0] around the Go 1.5 release. Have you looked at the resulting code much? Some of it has been rewritten to be idiomatic Go, but a lot of it is still very clearly C code that has been automatically translated [1]. See also the fact that go/src/runtime/runtime.go, go/src/runtime/runtime1.go, and go/src/runtime/runtime2.go all exist—a consequence of the fact that runtime.go, runtime.c, and runtime.h all existed in Go 1.4 [2].
[0]: https://github.com/rsc/c2go
[1]: https://github.com/golang/go/blob/7b294cdd8df0a9523010f6ffc8...
> There are so many optimizations that will likely never be added to gc, like a register-based calling convention.
The calling convention was marked as undefined as a first step toward changing it [1]. The change was introduced by the very author of this post.
Also, more aggressive inlining mitigates the slow calling convention.
More generally, I think it is a huge achievement for a PL to be self hosting. It makes its development easier as there is only one language to deeply know (plus assembly of course).
I don't think that's true. The build process used GCC to compile the Go and C compilers. The Plan-9-derived C compiler was used for compiling those parts of the runtime that were written in C back then and were supposed to follow the conventions of Go program code.
As you can tell from Inferno (or even Plan 9 from userspace), GCC can compile the Plan 9 C dialect, given the right options.
> What other language ships its own assembler?!
The majority of native code compilers I have seen include their own assembler. Many of them don't have a textual input format, but they are assemblers nonetheless.
This goal is entirely at odds with an unsafe, legacy-ridden C/C++ toolchain.
Maybe some of it can be fixed with a lot of engineering effort, but the fact remains that it is a bad idea to redo so much work every time.
Maybe the people who invented shared libraries had a point after all.
Of course the real problem is software and dependency bloat, but that is unlikely to ever get fixed.
Worse, this needs to happen at every executable invocation.