“This change deletes the C implementations of the Go compiler and assembler” (opens in new tab)

(github.com)

306 pointsosw11y ago126 comments

126 comments

So what is the bootstrap process going to be? Other than already have a Go compiler I mean. Or is it have a Go cross compiler?

Maybe it matters less, you used to always assume bootstrap from C but that more or less died with C++ based compilers, although you can do a multistage bootstrap from the last gcc before C++ still.

yiyus11y ago

It is explained in the design document: https://docs.google.com/document/d/1P3BLR31VA8cvLJLfMibSuTdw...

Basically, you start from the last C version, and every version is supposed to be able to compile the next one.

cperciva11y ago

So if you want to avoid trusting trust, you need to audit not only a C compiler and the source code for the Go compiler you plan on using, but also every past Go compiler as well?

sesutton11y ago

You could audit the Go source and then use the diverse double compiling technique[0] to verify that the binary you're using corresponds to that source code.

[0] http://www.dwheeler.com/trusting-trust/dissertation/html/whe...

1 more reply

f2f11y ago

You can't avoid trusting trust. "Ken was here" :)

2 more replies

justincormack11y ago

Ah ok, so there will be a pretty long chain from 1.2 eventually, but hopefully it will be part of the test suite...

enneff11y ago

You should always be able to build a Go 1.x compiler with just the 1.4 tool chain binaries. We have committed to sticking to the Go 1.4 language and libraries for the compiler tool chain.

1 more reply

jlouis11y ago

Usually you don't keep the chain. You just keep a working compiler. You can also bootstrap from another implementation of the language, e.g., gccgo.

A common trick is to keep a highly portable interpreted version of the target language and then use this for bootstrapping, but often you attack new architectures by cross-compilation instead. It all depends.

Also, it is common for self-hosting languages to require themselves to build.

ori_b11y ago

Either cross compile, or get the last C-based Go compiler, and use that to build the more recent Go releases.

cbd198411y ago

> So what is the bootstrap process going to be? Other than already have a Go compiler I mean. Or is it have a Go cross compiler?

What's the bootstrap process for the C compiler part of the compiler?

4ad11y ago

The is no C code in the compilers anymore.

1 more reply

burnte11y ago

But if one has access to the Go compiler source, is it not a reasonable guess that one would be able to access a go compiler binary from the same trusted source? I just don't see this as a problem since it is impossible to build a computer from the ground up without trusting a lot of software first.

dmm11y ago

Probably gccgo, which is included in gcc.

dmm11y ago

For the purposes of bootstrapping.

arcticbull11y ago

I still just don't understand why they insist on building their own toolchain. It just doesn't make sense to me.

When you set out to build a programming language, what is your objective? To create a sweet new optimizer? To create a sweet new assembler? A sweet new intermediate representation? AST? Of course not. You set out to change the way programmers tell computers what to do.

So why do this insist on duplicating: (1) An intermediate representation. (2) An optimizer. (3) An assembler. (4) A linker.

And they didn't innovate in any of those areas. All those problems were solved with LLVM (and to some more difficult to interact with extent GCC). So why solve them again?

It's like saying you want to build a new car to get from SF to LA and starting by building your own roads. Why would you not focus on what you bring to the table: A cool new [compiler] front-end language. Leave turning that into bits to someone who brings innovation to that space.

This is more of a genuine question.

mseepgood11y ago

> I still just don't understand why they insist on building their own toolchain. It just doesn't make sense to me.

To quote rsc from https://news.ycombinator.com/item?id=8817990:

"It's a small toolchain that we can keep in our heads and make arbitrary changes to, quickly and easily. Honestly, if we'd built on GCC or LLVM, we'd be moving so slowly I'd probably have left the project years ago."

"For example, no standard ABIs and toolchains supported segmented stacks; we had to build that, so it was going to be incompatible from day one. If step one had been "learn the GCC or LLVM toolchains well enough to add segmented stacks", I'm not sure we'd have gotten to step two."

bsdetector11y ago

Which is of course no answer at all.

Their own explanation for wasting hundreds of thousands of man-hours on a "quirky and flawed" separate compiler, linker, assembler, runtime, and tools is because they absolutely needed an implementation detail that is completely invisible to programs and which they are now replacing because it wasn't a good idea in the first place (segmented stacks). And it's worth writing out a 1000 word rationalization that doesn't bother even mention the reason that implementation was necessary in the first place, to better run on 32-bit machines. In 2010.

Or they say that they had to reinvent the entire wheel, axle, cart, and horse so that five years later they could start working on a decent garbage collector. Never mind that five years later other people did the 'too hard and too slow' work on LLVM that a decent garbage collector needs. What foresight, that.

That's not sense, that's people rationalizing away wasting years of their time doing something foolish and unnecessary.

crawshaw11y ago

The replacement to segmented stacks is copying stacks, which as far as my knowledge of LLVM takes me, would be very difficult to add. You need a stack map of pointers to successfully move pointed-to objects on the stack from the old region to the new.

There is a great deal of work going on in LLVM on this issue for precise GC of other languages, and (from the outside) it looks like more hours have been spent on it than on the entire Go toolchain. As Go developers don't have the resources or expertise to make such wide-ranging changes to LLVM, it would have blocked Go development.

GCC is similar. Those working on gccgo are trying to work out how to add precise GC and copying stacks. It is much more complex than it was on the gc toolchain.

There is great value in having a simple toolchain that is completely understood by the developers working on it. In fact, that very idea, that code you depend on should be readable and widely understandable, is one of the goals of Go. Applying the goal to the toolchain is a case of eating our own ideological dogfood.

1 more reply

mseepgood11y ago

Most of the toolchain already existed. When Ken Thompson started writing the Go compiler he based it on his Plan 9 C compiler implementation.

Mr_T_11y ago

LLVM is a C++ monstrosity that takes hours to compile. Other programming language projects have to maintain a "temporary" fork of LLVM to achieve their goals: https://github.com/rust-lang/llvm/tree/master

1 more reply

dsymonds11y ago

What's your counter-proposal?

If you're building a new language, you need a new AST. You can't represent Go source code in a C++ AST.

There are alternate compilers for Go, in the form of gccgo and llgo. But those are both very slow to build (compared to the Go tree that takes ~30s to build the compiler, linker, assembler and standard library). And the "gc" Go compiler runs a lot faster than gccgo (though it doesn't produce code that's as good), and compilation speed is a big part of Go's value proposition.

chimeracoder11y ago

> There are alternate compilers for Go, in the form of gccgo and llgo. But those are both very slow to build (compared to the Go tree that takes ~30s to build the compiler, linker, assembler and standard library).

For any non-Gophers reading this: I write Go as my primary language, and have for the past two and a half years. I just timed the respective compilation speeds on a handful of my larger projects using both gc and gccgo (and tested on a separate computer as well just for kicks).

gccgo was marginally slower, though not enough to be appreciable. In the case of two projects, gccgo was actually slightly faster. The Go compiler/linker/assembler/stdlib are probably larger and more complex than the most complex project on my local machine at the moment, but I think my projects are a reasonable barometer of what a typical Go programmer might expect to work with (as opposed to someone working on the Go language itself).

The more pressing issue as far as I'm concerned is that gccgo is on a different release schedule than gc (because it ships with the rest of the gcc collection). That's not to say it's not worth optimizing either compiler further when it comes to compilation speed, but it's important for people considering the two compilers to understand the sense of scale we're talking about - literally less than a second for most of my projects. Literally, the time it takes you to type 'go build' is probably more significant.

dsymonds11y ago

Thanks. That's good data. I haven't seen any measurements for a few years. It's good to see that gccgo has caught up. Which version of gc did you test?

Yes, the release schedule is another important reason for building our own toolchain. Being in control of one's destiny is often underrated.

1 more reply

arcticbull11y ago

I would never set out to build a language I wanted people to use and not build it as a front-end for LLVM. I don't want to write an optimizer or assembler.

I don't doubt for one second that llgo takes a longer time to compile. And in exchange for slower compile times you benefit from many PHDs worth of optimizations in LLVM. And every single target architecture they support.

It's easy to build something faster when it does less. I'll admit there's no blanket right answer to that tradeoff.

dsymonds11y ago

Yes, that's why there's both gc and gccgo (llgo came later). Apart from the rigour of having two independent compilers, they are seeking different tradeoffs. gc is very interested in running fast, and gccgo benefits from decades of work that have been put into gcc's various optimisations.

Does that answer your original statement that you didn't understand why we build our own toolchain?

1 more reply

pjmlp11y ago

Then you will be stuck with the C view of world of what a linker is supposed to do.

Just look at Modula-2 and Object Pascal toolchains as examples of compile speeds and incremental compilation features that could run circles around contemporanean C compilers.

Or the lack of proper module system, which requires linker help.

wbond11y ago

I was impressed by the toolchain when I first peaked at Go because it was dead simple to get up and running on any platform, especially Windows.

For gcc you have to deal with MinGW. Isn't LLVM just now getting to the point where it can build native Windows applications?

This is one area where I hope Rust makes progress. MinGW/Msys2 is just kind of gross stuff to deal with.

RayDonnelly11y ago

Care to explain what's gross about MSYS2/MinGW-w64? I'm genuinely interested in making it less gross.

infogulch11y ago

Want to make it less gross? Make it completely go away.

Installation of Go or Python is just like any other Windows install. You download an installer.exe or .msi, run it, and you're done. Things compile or run immediately, and you don't have to start using a "special" terminal just for it to work.

My experience with MinGW is very different. Especially for dependent languages. "Step 1: Install MinGW" what does that even MEAN?:

"Ok, I ran this installer, and it brought up the MinGW Installation Manager. Is it done? What am I supposed to do here? Which one do I choose? How do you even select a package? What even ARE packages? OK, so I select something then go the Package menu and select Mark for Installation. It's not doing anything. Is it done now? Close window. Nope that didn't work. Open it back up. Oh, so after marking a package I have to go to the Installation menu and choose Apply Changes. ..."

This actually happened to someone I was trying to help over the phone. Heaven forbid they get lost in All Packages and get confused by the dozens of packages each with half a dozen versions and each with three different, non-descriptive "classes".

Installation needs to be braindead simple. During installation it should show a list of extra languages that can be installed, where you can't uncheck 'base' (with an "Advanced Options" button in the corner that opens up the standard installation manager instead). It should set up any environment variables, including PATH, (and including restarting explorer to refresh the env) and it shouldn't require the use of any terminal other than cmd.exe (despite it being terrible).

If you're installing something else entirely that depends on MinGW, their installer should be able to bundle the MinGW installer, and it should install without having to make any choices. It should detect if MinGW is already installed and install packages there instead, still completely automated.

Make it go away.

1 more reply

wbond11y ago

Yes, I'd be happy to, but I'm not sure they are "solvable" issues, because they seem to be more of architectural mismatches.

Part of the issue with most software that uses MinGW is that it is written with posix-y operating system in mind. That is, operating systems that can very efficiently fork processes and quickly deal with many small files. Unfortunately, Windows does neither well. Process creation is slower, and NTFS is a very lock-happy filesystem.

Why do I consider this gross as a user? Things like Git that utilize msys are slow on Windows. As in, I notice the UI hanging. Things like autoconf are terribly slow on Windows due to all of the small processes that are created to detect the environment. Antivirus tools will lock files that are created and generally slow things down due to the nature of lots of quick-running processes creating and deleting small files.

These are just realities of most software written for non-Windows platforms. So whenever I see a program that requires MinGW, I'm always very hesitant to use it. The user experience tends to be terrible. I can still remember an issue trying to compile subversion on Windows using gcc and having it take well over an hour. Turns out with all of the processes being forked and temp files being created, the antivirus program was adding a delay to every command. After completely disabling antivirus it compiled in 15 minutes.

So, in one sense, this ins't a problem with MinGW or msys, but it typical of software that relies on it.

The other issues I have with them is that they don't integrate well with the native tools on Windows. For instance, Pageant is a good, graphical SSH agent on Windows. You have to mess around with environmental variables and plink and junk to get it so you don't have multiple formats of SSH keys on your machine. Trying to deal with SSH through bash and msys is not a user friendly experience. PuTTY is the gold standard of SSH clients on Windows.

Using msys/MinGW is like running X programs on OS X, Windows programs through Wine on Linux, or Java GUIs on any OS. It has enough strange warts and doesn't quite fit the feel of the rest of the OS.

That is where Go was awesome. I downloaded go and there were 3 exes on my machine. I ran "go.exe build source.go" and out popped an exe.

1 more reply

bdarnell11y ago

The Go team does want to innovate on the toolchain. A key factor in the design of Go is the belief that once a language is "good enough", developers are better served by a superior toolchain (and specifically faster compilation) than by a fancier language. They want to own the toolchain so they can optimize it for Go and make their own tradeoffs about speed versus features.

evmar11y ago

I read somewhere (but I can't think of the keywords to find it now) that they found the greater flexibility in owning their toolchain was worth the cost. For example they changed their data layout for GC purposes and changed the segmented stack approach over the course of their development and had they been tied to LLVM or gcc they'd have spent much of their time fighting against those implementations, or politicing to convince the maintainers to add additional complexity to their systems for an unproven langauge. (My example is weak because I am trying to retell their reasons and my recollection is vague.) I think they still haven't succeeded in bringing gcc up to par with their current approach.

pcwalton11y ago

LLVM supports precise GC now via the late safepoint placement infrastructure [1]. This infrastructure should be sufficient to support both the copying stacks and a precise GC.

This is a recent addition and did not exist at the time Go was created, however.

[1]: http://llvm.org/docs/Statepoints.html

seryoiupfurds11y ago

Are you thinking of this comment?

https://news.ycombinator.com/item?id=8817990

evmar11y ago

That's the one, thanks!

ngoldbaum11y ago

Wow, github doesn't handle big diffs well. Some sort of automatic pagination would really help.

TazeTSchnitzel11y ago

GitHub does cut it off beyond a certain point, but that cutoff point should be much, much earlier.

foz11y ago

Github handles it well, but our browsers don't. It would be nice if they loaded large diffs progressively, as you scroll.

bpicolo11y ago

Github is a website, it's it's job to make the browser handle it well.

cratermoon11y ago

I would make the case the "big" diffs are a problem. Unless there's a really good reason (and bad dependency management is not a good reason) then commits should be smaller and more logically related.

rcthompson11y ago

This is a merge commit, which means the diff is going to include all the changes on the branch being merged. Even if all the individual commits are small, a merge diff can still be very large.

serf11y ago

While I agree that a big diff isn't a good idea, I disagree with the notion that one should develop in such a way that makes Github (or whatever VCS you're using) work right.

I don't think working within the capabilities of the VCS you're using should ever be a priority for a software development effort; rather I think the VCS's priority should be to allow for their use within most contexts of software development. (the other way around)

kasabali11y ago

" This change deletes the C implementations of the Go compiler and assembler from the master branch." is logically related as it could be. Everybody has a different interpretation of it, I guess.

teraflop11y ago

It's a merge commit, so naturally it's going to have a huge diff even if the actual work was done in much smaller increments.

andrewchambers11y ago

Because he automatically translated all the code using a tool.

brandonwamboldt11y ago

Congrats to the Go team, but that link kills the browser....

davecheney11y ago

You can read the original commit on Gerrit, it's less explodey.

https://go-review.googlesource.com/#/c/5652/

Animats11y ago

Nice. That's a step forward. Another bit of legacy code bites the dust. Another step forward to the post-C world we need.

(If you want to compile with a different compiler as a check, there's an LLVM-based compiler for Go.)

gillianseed11y ago

Go is also supported in GCC, as GccGo.

bketelsen11y ago

RSC is awesome.

smegel11y ago

And the boy pulled up his bootstraps and became a man.

Vecrios11y ago

So, if I'm understanding this correctly, they are to re-write the Go compiler in Go, and compile it using the currently published compiler (i.e. 1.4)?

Could someone, kindly, explain how future versions would be built? Thanks!

dsymonds11y ago

Future versions will still be built with any current published compiler. There are binary releases for each major release, and it's not hard to avoid using new language features in the compiler, so building from source only requires the most recent binary release (at worst).

humbledrone11y ago

My understanding is that they wrote code that translated the C code for the original Go compiler into Go code. This translation wasn't fully general -- it made assumptions about how the C code was written -- but it allowed the port from C to Go to be very precise (i.e. bug for bug). So now that the Go compiler written in Go can compile Go, that's what they'll use going forward, and they will slowly work to make it into more idiomatic Go instead of machine-generated Go.

So to answer your question, this new Go-written-in-Go compiler will initially be compiled by the Go-written-in-C compiler. The output from that will be an executable Go-written-in-Go compiler, and _that_ will be used to compile itself in the future. I.e. Go compiler version 1.4 will be used to compile Go version 1.5 will be used to compile Go version 1.6...

Keep in mind that this is not at all unusual. The C compiler GCC has been compiled using older versions of GCC for a long time. Having a compiler compile itself is a sort of milestone that many languages aspire to as a way of showing that the language is "ready."

uxp11y ago

Its generally called "self-hosting" when a compiler can compile itself[1]. It was a pretty big deal when Clang became self-hosting[2] in 2010.

[1] https://en.wikipedia.org/wiki/Self-hosting

[2] http://blog.llvm.org/2010/02/clang-successfully-self-hosts.h...

Vecrios11y ago

Thank you both for your inputs.

tbolt11y ago

So this means the go compiler is completely written in go?

dsymonds11y ago

In source control, yes. There's not yet a stable release where that's the case though; Go 1.5 (due later this year) will be that release.

joeld4211y ago

congrats gophers! That's a big step for the language.

davidrusu11y ago

Anyone else seeing this post as the 1st and 2nd link on the front page of HN?

dang11y ago

Looking into it now. Edit: hopefully fixed now. Will edit this comment later when we figure out what happened.

Ok, we figured out what happened. A background process that is upgrading old stories to a new data format went rogue and made multiple copies of a few stories in memory. Apparently it agrees with some of you that HN could use more stories about Go.

Sorry for the error.

WestCoastJustin11y ago

In case is gets fixed, here's what I see, to help diagnose the bug [1]. Both posts point to https://github.com/golang/go/commit/b986f3e3b54499e63903405c..., have the same HN item id=9097404, but different comments counts.

[1] http://i.imgur.com/xATOXPb.png

jdoliner11y ago

This must be the new and improved "eventually consistent HN" that dang has been talking about.

rosser11y ago

Yes. With the same URL for both discussion and article, though with differing scores and comment counts.

quacker11y ago

I notice that one submission is "canonical" for the flag/unflag bit. You can flag one of the two submissions, and the "unflag" will show up on the other submission.

krapp11y ago

Maybe Hacker News is budding. Is it spring already?

hcarvalhoalves11y ago

It's a huge page. Maybe HN took a while to read the URL and the OP ended up double-posting.

davidrusu11y ago

But they both link to the same comments section

icebraining11y ago

And they seem to link to the same URL, which HN is supposed to prevent (for submissions created within a short period).

JoblessWonder11y ago

I'm seeing the same thing with this post as well: https://news.ycombinator.com/item?id=9096843

bshimmin11y ago

It's the smart quotes.

eric_h11y ago

Ha. It's always the smart quotes.

They've caused me so many headaches over the years it's amazing to me that they are still actively supported and implemented in any software. What value do they even bring?

#DeathToSmartQuotes

2 more replies

vezzy-fnord11y ago

The same is true for the "C# Edit and Continue and Make Object ID Improvements in CTP 6" story, presently at #25 and #27.

dmcginty11y ago

I'm also seeing duplicates of the 'Add "Magic" to Your Business' post on the front page.

icebraining11y ago

Yes! And with different points, too.

ramidarigaz11y ago

Looks like they diverged. Lots of duplicate comments.

Edit: Nope. All comments show on both.

0x011y ago

The comment links are in fact the same: https://news.ycombinator.com/item?id=9097404

pessimizer11y ago

Also, one is marked as having exactly double the number of comments and points as the other.

andrewchoi11y ago

Not for me anymore: http://imgur.com/XM8xmEp

pessimizer11y ago

When the double point/reply relationship ended, the comments sections also diverged. This is how it looked while the comments were still identical: http://tinypic.com/r/vn2u0i/8

girvo11y ago

I've seen it before, a couple weeks ago, but it disappeared rather rapidly last time.

JoshTheGeek11y ago

Yep, with different numbers of comments.

nicklovescode11y ago

yes

oswOP11y ago

The second post wasn't there for the first 30 minutes or so, it just appeared out of nowhere, no idea why.

pjmlp11y ago

Great news!

gresrun11y ago

Once you go Go, you never Go back!

bsummer411y ago

Then, clearly, the right path is to never go Go.

1 more reply

bcantrill11y ago

One does wonder if the register re-naming from their abstract (but misleading) names to their proper machine names (e.g., from "SP" to "R13") wasn't at all a reaction to the (in)famous polemic on the golang build chain.[1]

[1] http://dtrace.org/blogs/wesolows/2014/12/29/golang-is-trash/

rsc11y ago

SP, FP, and PC are all still there. What we did was make the conventions more uniform across all architectures. The rules for certain corner cases for when SP and PC were references to the virtual register and when they were references to the real register were inconsistent. As part of having a single assembly parser, we made the rules consistent, which meant eliminating some forms that were accepted on only a subset of systems, or that had different meanings on different systems.

I'm a little surprised you brought that post up to begin with. It completely misses the point, as I explained in my comment here at the time (https://news.ycombinator.com/item?id=8817990). When I wrote that response I also submitted a comment on the blog itself with a link to the HN comment. That blog comment has not yet been published. If you're going to keep sending around links to such an inflammatory blog post, could you also try to get my comment there approved?

Thanks.

davecheney11y ago

No, this was unrelated.

SP, PC and FP are virtual registers, from the POV of the assembler. On _some_ architecture those words have real meanings, like RSP on intel, but on others they are just conventions.

I don't think Keith's rage quit has had a measurable impact on the direction of Go or its toolchain.

comex11y ago

Aside from what the other reply to your comment said, using R13 and R15 is actually a move away from standard notation: even though those do correspond to SP and PC, the ARM architecture manual as well as all assembly code I've seen uses the special names for those registers.

pjc5011y ago

That's a classic of the "too rude and opinionated to salvage anything reasonable" genre right there.

davexunit11y ago

Here we go again. Another compiler that can't be bootstrapped from source code. It's a packaging nightmare. Another magic binary to trust not to have a Thompson virus.

chimeracoder11y ago

> Another compiler that can't be bootstrapped from source code.

It can be bootstrapped from source - it just needs to be bootstrapped either using gccgo[0], or using the 1.4 compiler (which is guaranteed to work for all 1.x compilers, not just 1.5)

> Another magic binary to trust not to have a Thompson virus.

"Reflections on Trusting Trust" gets posted on HN regularly, and it's an interesting exercise, but you are far more likely to have an exploit hiding in plain sight in a compiler compiled from source once than you are to have one that only appears after multiple iterated compilations.

It's a good concept for security experts and compiler developers to be aware of, but the likelihood is incredibly small.

Also, for what it's worth, "Trusting Trust" is over three decades old, and there have been numerous response to it in the interim, with lots of study. It's like saying "Your problem reduces to 3-SAT, and satisfiability is NP-hard, so you can't solve it', throwing your hands up, and leaving it at that. In reality, solving 3-SAT in the general case is NP-hard, but it is well-studied enough that, in practice, solving SAT/3-SAT is actually pretty easy most of the time. Some of these responses have even been posted elsewhere in this thread, though they're also pretty easy to find online as well.

[0] which is written in C++ - frankly, I'd be much more concerned about a single-compilation bug in any C++ code than I'd be about a multiple-compilation bug in Go.

nullc11y ago

http://www.dwheeler.com/trusting-trust/ < David A. Wheeler’s Page on Fully Countering Trusting Trust through Diverse Double-Compiling, for an example.

Though the diversity available for a go compiler written in go isn't very tremendous.

j / k navigate · click thread line to collapse

126 comments

justincormack11y ago

So what is the bootstrap process going to be? Other than already have a Go compiler I mean. Or is it have a Go cross compiler?

Maybe it matters less, you used to always assume bootstrap from C but that more or less died with C++ based compilers, although you can do a multistage bootstrap from the last gcc before C++ still.

yiyus11y ago

It is explained in the design document: https://docs.google.com/document/d/1P3BLR31VA8cvLJLfMibSuTdw...

Basically, you start from the last C version, and every version is supposed to be able to compile the next one.

cperciva11y ago

So if you want to avoid trusting trust, you need to audit not only a C compiler and the source code for the Go compiler you plan on using, but also every past Go compiler as well?

sesutton11y ago

You could audit the Go source and then use the diverse double compiling technique[0] to verify that the binary you're using corresponds to that source code.

[0] http://www.dwheeler.com/trusting-trust/dissertation/html/whe...

1 more reply

f2f11y ago

You can't avoid trusting trust. "Ken was here" :)

2 more replies

justincormack11y ago

Ah ok, so there will be a pretty long chain from 1.2 eventually, but hopefully it will be part of the test suite...

enneff11y ago

You should always be able to build a Go 1.x compiler with just the 1.4 tool chain binaries. We have committed to sticking to the Go 1.4 language and libraries for the compiler tool chain.

1 more reply

jlouis11y ago

Usually you don't keep the chain. You just keep a working compiler. You can also bootstrap from another implementation of the language, e.g., gccgo.

Also, it is common for self-hosting languages to require themselves to build.

ori_b11y ago

Either cross compile, or get the last C-based Go compiler, and use that to build the more recent Go releases.

cbd198411y ago

> So what is the bootstrap process going to be? Other than already have a Go compiler I mean. Or is it have a Go cross compiler?

What's the bootstrap process for the C compiler part of the compiler?

4ad11y ago

The is no C code in the compilers anymore.

1 more reply

burnte11y ago

dmm11y ago

Probably gccgo, which is included in gcc.

dmm11y ago

For the purposes of bootstrapping.

arcticbull11y ago

I still just don't understand why they insist on building their own toolchain. It just doesn't make sense to me.

So why do this insist on duplicating: (1) An intermediate representation. (2) An optimizer. (3) An assembler. (4) A linker.

And they didn't innovate in any of those areas. All those problems were solved with LLVM (and to some more difficult to interact with extent GCC). So why solve them again?

This is more of a genuine question.

mseepgood11y ago

> I still just don't understand why they insist on building their own toolchain. It just doesn't make sense to me.

To quote rsc from https://news.ycombinator.com/item?id=8817990:

bsdetector11y ago

Which is of course no answer at all.

That's not sense, that's people rationalizing away wasting years of their time doing something foolish and unnecessary.

crawshaw11y ago

GCC is similar. Those working on gccgo are trying to work out how to add precise GC and copying stacks. It is much more complex than it was on the gc toolchain.

1 more reply

mseepgood11y ago

Most of the toolchain already existed. When Ken Thompson started writing the Go compiler he based it on his Plan 9 C compiler implementation.

Mr_T_11y ago

1 more reply

dsymonds11y ago

What's your counter-proposal?

If you're building a new language, you need a new AST. You can't represent Go source code in a C++ AST.

chimeracoder11y ago

dsymonds11y ago

Thanks. That's good data. I haven't seen any measurements for a few years. It's good to see that gccgo has caught up. Which version of gc did you test?

Yes, the release schedule is another important reason for building our own toolchain. Being in control of one's destiny is often underrated.

1 more reply

arcticbull11y ago

I would never set out to build a language I wanted people to use and not build it as a front-end for LLVM. I don't want to write an optimizer or assembler.

It's easy to build something faster when it does less. I'll admit there's no blanket right answer to that tradeoff.

dsymonds11y ago

Does that answer your original statement that you didn't understand why we build our own toolchain?

1 more reply

pjmlp11y ago

Then you will be stuck with the C view of world of what a linker is supposed to do.

Just look at Modula-2 and Object Pascal toolchains as examples of compile speeds and incremental compilation features that could run circles around contemporanean C compilers.

Or the lack of proper module system, which requires linker help.

wbond11y ago

I was impressed by the toolchain when I first peaked at Go because it was dead simple to get up and running on any platform, especially Windows.

For gcc you have to deal with MinGW. Isn't LLVM just now getting to the point where it can build native Windows applications?

This is one area where I hope Rust makes progress. MinGW/Msys2 is just kind of gross stuff to deal with.

RayDonnelly11y ago

Care to explain what's gross about MSYS2/MinGW-w64? I'm genuinely interested in making it less gross.

infogulch11y ago

Want to make it less gross? Make it completely go away.

My experience with MinGW is very different. Especially for dependent languages. "Step 1: Install MinGW" what does that even MEAN?:

Make it go away.

1 more reply

wbond11y ago

Yes, I'd be happy to, but I'm not sure they are "solvable" issues, because they seem to be more of architectural mismatches.

So, in one sense, this ins't a problem with MinGW or msys, but it typical of software that relies on it.

Using msys/MinGW is like running X programs on OS X, Windows programs through Wine on Linux, or Java GUIs on any OS. It has enough strange warts and doesn't quite fit the feel of the rest of the OS.

That is where Go was awesome. I downloaded go and there were 3 exes on my machine. I ran "go.exe build source.go" and out popped an exe.

1 more reply

bdarnell11y ago

evmar11y ago

pcwalton11y ago

LLVM supports precise GC now via the late safepoint placement infrastructure [1]. This infrastructure should be sufficient to support both the copying stacks and a precise GC.

This is a recent addition and did not exist at the time Go was created, however.

[1]: http://llvm.org/docs/Statepoints.html

seryoiupfurds11y ago

Are you thinking of this comment?

https://news.ycombinator.com/item?id=8817990

evmar11y ago

That's the one, thanks!

ngoldbaum11y ago

Wow, github doesn't handle big diffs well. Some sort of automatic pagination would really help.

TazeTSchnitzel11y ago

GitHub does cut it off beyond a certain point, but that cutoff point should be much, much earlier.

foz11y ago

Github handles it well, but our browsers don't. It would be nice if they loaded large diffs progressively, as you scroll.

bpicolo11y ago

Github is a website, it's it's job to make the browser handle it well.

cratermoon11y ago

rcthompson11y ago

This is a merge commit, which means the diff is going to include all the changes on the branch being merged. Even if all the individual commits are small, a merge diff can still be very large.

serf11y ago

While I agree that a big diff isn't a good idea, I disagree with the notion that one should develop in such a way that makes Github (or whatever VCS you're using) work right.

kasabali11y ago

" This change deletes the C implementations of the Go compiler and assembler from the master branch." is logically related as it could be. Everybody has a different interpretation of it, I guess.

teraflop11y ago

It's a merge commit, so naturally it's going to have a huge diff even if the actual work was done in much smaller increments.

andrewchambers11y ago

Because he automatically translated all the code using a tool.

brandonwamboldt11y ago

Congrats to the Go team, but that link kills the browser....

davecheney11y ago

You can read the original commit on Gerrit, it's less explodey.

https://go-review.googlesource.com/#/c/5652/

Animats11y ago

Nice. That's a step forward. Another bit of legacy code bites the dust. Another step forward to the post-C world we need.

(If you want to compile with a different compiler as a check, there's an LLVM-based compiler for Go.)

gillianseed11y ago

Go is also supported in GCC, as GccGo.

bketelsen11y ago

RSC is awesome.

smegel11y ago

And the boy pulled up his bootstraps and became a man.

Vecrios11y ago

So, if I'm understanding this correctly, they are to re-write the Go compiler in Go, and compile it using the currently published compiler (i.e. 1.4)?

Could someone, kindly, explain how future versions would be built? Thanks!

dsymonds11y ago

humbledrone11y ago

uxp11y ago

Its generally called "self-hosting" when a compiler can compile itself[1]. It was a pretty big deal when Clang became self-hosting[2] in 2010.

[1] https://en.wikipedia.org/wiki/Self-hosting

[2] http://blog.llvm.org/2010/02/clang-successfully-self-hosts.h...

Vecrios11y ago

Thank you both for your inputs.

tbolt11y ago

So this means the go compiler is completely written in go?

dsymonds11y ago

In source control, yes. There's not yet a stable release where that's the case though; Go 1.5 (due later this year) will be that release.

joeld4211y ago

congrats gophers! That's a big step for the language.

davidrusu11y ago

Anyone else seeing this post as the 1st and 2nd link on the front page of HN?

dang11y ago

Looking into it now. Edit: hopefully fixed now. Will edit this comment later when we figure out what happened.

Sorry for the error.

WestCoastJustin11y ago

[1] http://i.imgur.com/xATOXPb.png

jdoliner11y ago

This must be the new and improved "eventually consistent HN" that dang has been talking about.

rosser11y ago

Yes. With the same URL for both discussion and article, though with differing scores and comment counts.

quacker11y ago

I notice that one submission is "canonical" for the flag/unflag bit. You can flag one of the two submissions, and the "unflag" will show up on the other submission.

krapp11y ago

Maybe Hacker News is budding. Is it spring already?

hcarvalhoalves11y ago

It's a huge page. Maybe HN took a while to read the URL and the OP ended up double-posting.

davidrusu11y ago

But they both link to the same comments section

icebraining11y ago

And they seem to link to the same URL, which HN is supposed to prevent (for submissions created within a short period).

JoblessWonder11y ago

I'm seeing the same thing with this post as well: https://news.ycombinator.com/item?id=9096843

bshimmin11y ago

It's the smart quotes.

eric_h11y ago

Ha. It's always the smart quotes.

They've caused me so many headaches over the years it's amazing to me that they are still actively supported and implemented in any software. What value do they even bring?

#DeathToSmartQuotes

2 more replies

vezzy-fnord11y ago

The same is true for the "C# Edit and Continue and Make Object ID Improvements in CTP 6" story, presently at #25 and #27.

dmcginty11y ago

I'm also seeing duplicates of the 'Add "Magic" to Your Business' post on the front page.

icebraining11y ago

Yes! And with different points, too.

ramidarigaz11y ago

Looks like they diverged. Lots of duplicate comments.

Edit: Nope. All comments show on both.

0x011y ago

The comment links are in fact the same: https://news.ycombinator.com/item?id=9097404

pessimizer11y ago

Also, one is marked as having exactly double the number of comments and points as the other.

andrewchoi11y ago

Not for me anymore: http://imgur.com/XM8xmEp

pessimizer11y ago

When the double point/reply relationship ended, the comments sections also diverged. This is how it looked while the comments were still identical: http://tinypic.com/r/vn2u0i/8

girvo11y ago

I've seen it before, a couple weeks ago, but it disappeared rather rapidly last time.

JoshTheGeek11y ago

Yep, with different numbers of comments.

nicklovescode11y ago

yes

oswOP11y ago

The second post wasn't there for the first 30 minutes or so, it just appeared out of nowhere, no idea why.

pjmlp11y ago

Great news!

gresrun11y ago

Once you go Go, you never Go back!

bsummer411y ago

Then, clearly, the right path is to never go Go.

1 more reply

bcantrill11y ago

[1] http://dtrace.org/blogs/wesolows/2014/12/29/golang-is-trash/

rsc11y ago

Thanks.

davecheney11y ago

No, this was unrelated.

SP, PC and FP are virtual registers, from the POV of the assembler. On _some_ architecture those words have real meanings, like RSP on intel, but on others they are just conventions.

I don't think Keith's rage quit has had a measurable impact on the direction of Go or its toolchain.

comex11y ago

pjc5011y ago

That's a classic of the "too rude and opinionated to salvage anything reasonable" genre right there.

davexunit11y ago

Here we go again. Another compiler that can't be bootstrapped from source code. It's a packaging nightmare. Another magic binary to trust not to have a Thompson virus.

chimeracoder11y ago

> Another compiler that can't be bootstrapped from source code.

It can be bootstrapped from source - it just needs to be bootstrapped either using gccgo[0], or using the 1.4 compiler (which is guaranteed to work for all 1.x compilers, not just 1.5)

> Another magic binary to trust not to have a Thompson virus.

It's a good concept for security experts and compiler developers to be aware of, but the likelihood is incredibly small.

[0] which is written in C++ - frankly, I'd be much more concerned about a single-compilation bug in any C++ code than I'd be about a multiple-compilation bug in Go.

nullc11y ago

http://www.dwheeler.com/trusting-trust/ < David A. Wheeler’s Page on Fully Countering Trusting Trust through Diverse Double-Compiling, for an example.

Though the diversity available for a go compiler written in go isn't very tremendous.

j / k navigate · click thread line to collapse