C Is Not a Low-Level Language (opens in new tab)

(queue.acm.org)

472 pointsjodooshi8y ago316 comments

316 comments

This article makes some valid points but is overall rather misleading I think. Almost all of the reasons given why C is "not a low-level language" also apply to x86/x64 assembly. Register renaming, cache hierarchies, out of order and speculative execution etc are not visible at the assembly / machine code level either on Intel or other mainstream CPU architectures like ARM or Power PC. If C is not a low level language then a low level language does not exist for modern CPUs and since all other languages ultimately compile down to the same instruction sets they all suffer from some of the same limitations.

It's really backwards compatibility of instruction sets / architectures that imposes most of these limitations. Processors that get around them to some degree like GPUs do so by abandoning some amount of backwards compatibility and/or general purpose functionality and that is in part why they haven't displaced general purpose CPUs for general purpose use.

ge0rg8y ago

I also had the initial impression that the article is misleading, but later on the author made the point that the C compiler is doing significant work to reorder / parallelize / optimize the code. I agree that x86/x64 is not a low-level language either, but even if it was, with the description the author provided, I'd agree with his point of C not being low-level.

Regarding cutting off backwards compatibility to improve the design, Intel's Itanium (affectionately called "Itanic") was a very progressive approach to shift the optimization work from the CPU (and the compiler) to just the compiler. I'm not sure what the reasons for its failing were, though.

pera8y ago

I'm not an expert, but I don't think the optimization phase should be really considered here: the same kind of pattern matching used by (e.g.) LLVM to find optimizable sequence of statements also could be used by any assembler. NASM for instance offer some level of optimizations, so I think optimization should only be considered when it's part of the language specification itself, like in Scheme.

IMO C is close to low-level because it's relatively easy to imagine the resulting unoptimized assembly given some piece of code (which is why some people jokes about C being a macro assembly).

Maybe this old debate should get an slight update... and this could be the starting point: Is modern x86 assembly still "low-level"? :)

3 more replies

skywhopper8y ago

There was an article on Hacker News recently that covered some of the reasons for Itanium's failure to realize its theoretical benefits. I'm not finding it now, but IIRC, the argument made was that predicting likely-parallelizable code is actually a lot harder to do at compile time, and that, like so many ultra-optimized systems, the real world works much differently and a messier, more random approach ultimately yields far better performance.

4 more replies

Avshalom8y ago

Part of it was the sort of giant blind spot that "C is a low level language" creates that made even Intel forget how many man-years of optimization were actually in C compilers and the attendant hubris that "oh we can whip up something that'll beat it between now and when we start shipping the chips"

colin_mccabe8y ago

I think people often overestimate how smart the compiler is. DJB had a slide deck about this: http://cr.yp.to/talks/2015.04.16/slides-djb-20150416-a4.pdf Compilers are really useful... but, if it really has to be fast, you still need to have a human in the loop.

I was also under the impression that there hasn't been much improvement in compiling C/C++ in a long time. It would be interesting to compare the performance of gcc from 15 years ago versus gcc today, on a real world piece of code. I suspect you wouldn't see much difference (aside from the changes in C dialect over time), and some added features in the new version. Has anyone run this experiment?

mattnewport8y ago

I think you can make a good case that the failure of the Itanium (and other similar attempts to un-hide some of this stuff like the IBM/Sony Cell used in the PlayStation 3) was precisely because they tried to shift optimization work from the CPU to the compiler / programmer.

1 more reply

wbl8y ago

The compiler wasn't smart enough. There is information available only at runtime that can be used to get out of order execution and superscalar execution, and the compiler doesn't see any of it. More effort on standard architectures paid off.

pcvarmint8y ago

Itanic failed because Intel/HP initially still used a front side bus memory architecture, which could not support the bandwidth necessary for peak performance on anything but matrix multiplication and other computations where most of the work is done in-cache. Then Opteron came around, with its faster memory, and Intel was suddenly thrown back into reality.

Itanic was also in-order (at least as far as dispatch), meaning anytime an instruction was stalled, so were all instructions in the same bundle or after it.

One "non-low-level" idea on Itanic, which never really panned out in practice, was for the assembler to automatically insert stop bits ;; marking assembly code "sequence points", instead of the programmer having to do it manually. But in practice, everyone did it manually, because they'd rather know how well their bundles were being used, and whether they could move instructions around in order get the full 3 instructions / bundle (6 instructions / clock).

And explicit stop bits did not provide any advantage to future hardware by marking explicit parallelism, because at every generation everyone was concerned about obtaining maximum performance on the current machine, which involved shuffling instructions into 6-instruction double-bundles, often at the expense of parallelism on future implementations (which never went beyond two bundles / clock).

SolarNet8y ago

Not sure if facetious, but it failed because shifting that work caused it to be slow and not backwards compatible.

PersistentCough8y ago

> the C compiler

You mean C compiler X has the feature of Y. There are lots of compilers and that's not part of the language.

2 more replies

kazinator8y ago

> Register renaming, cache hierarchies, out of order and speculative execution etc are not visible

That's different from C.

In the history of x86, most new optimizations have preserved the semantics of code. For instance, register renaming isn't blind; it identifies and resolves hazards.

In C, increasing optimization has broken existing programs.

C is like a really shitty machine architecture that doesn't detect errors. For instance, overflow doesn't wrap around and set a nice flag, or throw an exception; it's just "undefined". It's easy to make a program which appears to work, but is relying on behavior outside of the documentation, which will change.

Computer architectures were crappy like that in the beginning. The mainframe vendors smartened up because they couldn't sell a more expensive, faster machine to a customer if the customer's code relied on undocumented behaviors that no longer work on the new machine.

Then, early microprocessors in the 1970's and 80's repeated the pattern: poor exception handling and undocumented opcodes that did curious things (no "illegal instruction" trap).

gpderetta8y ago

Then again store-load reordering is visible across cores and undefined.

admax88q8y ago

> If C is not a low level language then a low level language does not exist for modern CPUs

I think that's a fair conclusion though, I don't think the article is misleading.

x86 assembly is a high level language. It's analogous to JVM bytecode. Modern x86 processors are more like a virtual machine for x86 bytecode.

magduf8y ago

>x86 assembly is a high level language. It's analogous to JVM bytecode.

If you take this position, then having the distinction between "low level" and "high level" languages becomes pointless, and we have no way to distinguish between languages like x86 assembly and C and languages like Python and Haskell. This is why we use the terms "low level" and "high level": some of these languages have a lower level of abstraction than others. The fact that it's not giving you a great idea of exactly what's happening in the transistors is irrelevant: "low" and "high" are relative terms, not absolute.

1 more reply

ninkendo8y ago

It's a fair conclusion if you're willing to accept that the phrase "low-level language" has approximately zero modern examples.

Or maybe we should relax the definition of "low-level language" a bit?

2 more replies

jrochkind18y ago

Hmm, what does "virtual machine" mean if the "virtual machine" is implemented in hardware?

And, is nothing but actual binary machine code a "low level language"? I guess it's the lowest, I don't _think_ you can go lower than that... but someone's probably gonna tell me I'm wrong.

1 more reply

mikepurvis8y ago

The JVM provides native primitives around memory and thread management which are not present in x86, but that's a matter of degree.

1 more reply

pvg8y ago

It's analogous to JVM bytecode.

That's only true in the way everything is analogous to everything.

2 more replies

rbanffy8y ago

Microcode is the new low level language. ;-)

But, to be fair, C is not that low level. In fact, when I first learned it, it was considered a high-level language because CPUs we used it with didn't have functions with parameters, only subroutine jumps.

C reaches into the realm of low-level languages because it allows you to arbitrarily read from and write to the "state" of the context you live in, but it also allows you to express constructs that have no counterpart even on the most complex CPU architectures (even if they have things that disagree fundamentally with C's point of view).

pjmlp8y ago

> Microcode is the new low level language. ;-)

Except for RISC, that has mostly always been the case, when we look back at all those mainframes and their research papers.

nerdponx8y ago

On the flipside, doesn't it allow more or less direct memory access?

1 more reply

Avshalom8y ago

>If C is not a low level language then a low level language does not exist for modern CPUs

correct, but that lack is not an argument for C being low level.

swsieber8y ago

Correct - it's an argument for broadening the title.

Calling out a specific language as not being something leads one to ask, "Well what is?". In this case, there is no qualifying alternative, so the title might as well be, "There is no low-level language for CPUs".

2 more replies

0xfaded8y ago

I played a little bit with gpgpu on the raspberry pi.

I’d imagine it’s relativly primitive compared to whatever shaders are compiled to on modern GPUs, but it was humbling to have to manage things like separate, per core, disjoint register files which can only be read 4 cycles after write. The cores are heterogeneous, so there is special hardware for exchanging register reads between cores if necessary.

hencoappel8y ago

What language do you use for GPGPU?

1 more reply

wmu8y ago

> Register renaming, cache hierarchies, out of order and speculative execution etc are not visible at the assembly / machine code level

Cache hierarchies are directly accessible with CLFLUSH, INV, WBINVD x86 instructions; we may count also PREFETCHx, but they call it "a hint". FENCE instructions touch even the multicore part of system.

Many low-level CPU concepts leak to higher layer. A bright example is false sharing, which may manifest even in Java or C# programs.

fixermark8y ago

x86 is also there on modern architectures to supply an abstraction that is increasingly divergent from the actual die hardware (but compatible with expectations of e.g. a C compiler that already has an output target for previous x86 hardware).

Modern Intel CPUs basically emulate x86; there are many layers of abstraction between individual opcodes and transistor switching.

kev0098y ago

By David's postulation, even the native assembly language for the CPU is not low level. See my other comment on the parent topic for justification.

ryao8y ago

Spend a week writing in assembly and you will never call C a low level language again.

munificent8y ago

I really really liked this article, and reading the comments here is blowing my mind. Did we read the same thing?

I think it's a strong insight that insight that chip designers and compiler vendors have spent person-millenia maintaining the illusion that we are targeting a PDP-11-like platform even while the platform has grown less and less like that. And, it turns out, with things like Spectre and the performance cost of cache misses, that abstraction layer is quite leaky in potentially disastrous ways.

But, at the same time, they have done such a good job of maintaining that illusion that we forget it isn't actually reality.

I like the title of the article because many programmers today do still think C is a close mapping to how chips work. If you happen to be one of the enlightening minority who know that hasn't been true for a while, that's great, but I don't think it's good to criticize the title based on that.

noobermin8y ago

As someone who does large scale computational work for a living, stuff like this is close to my heart. I often run into serious memory and run time constraints due to poorly written codes that have rather dumb understanding of the real underlying machine implementation that modern processors actually have rather than this imaginary PDP-11 that we've been brought up to believe.

I wonder how much I could save (and how many more sims I could run) if my codes were rewritten in a language that has an abstract system that is much more cleanly and simply translated to what the computer actually does in 2018.

Shikadi8y ago

Actually, the author's argument about PDP-11 is interesting because C would have never been considered a low level language back then, for any platform.

Wiki definition, also what I was taught in my first CS class:

"A low-level programming language is a programming language that provides little or no abstraction from a computer's instruction set architecture—commands or functions in the language map closely to processor instructions. Generally this refers to either machine code or assembly language."

The term is evolving to match the time, as shown by the author's interpretation already being higher level than the original intention despite the goal of preventing exactly that.

munificent8y ago

> Actually, the author's argument about PDP-11 is interesting because C would have never been considered a low level language back then, for any platform.

Sure, agreed, but I don't think it's super interesting that words evolve in meaning over time.

What I find strange about the comments here is that some people think the article's title is bad even though my experience is that many people today do think "C is a low level language" is a reasonable thing to say.

1 more reply

lowbloodsugar8y ago

Heh. Yeah. Was programming ARM assembly in 1988. C compiler was far too high level. Look how its always saving these registers to memory! Meanwhile I'm dropping into FIQ mode just for the banked R8-R14.

Now, sure, people say C is low level, and compared to Java it sure is. But it isn't low-level.

robochat8y ago

I also really liked the article and found it thought provoking. This is all way above my pay grade but I like to think that there is a more optimal cpu design and language pairing that we will eventually reach and it's fun to imagine what that might look like.

Obviously, it would be very hard to shift the incumbent model in reality. We just have to look at the lack of prosperity for the Itanium and Cell processors to see how hard it is to achieve success. But imagine if new computer languages had been created just for these processors. Commercially this would make little sense but it might be possible to create languages that fully used these processors yet retained simplicity for developers. Or maybe it isn't possible to beat the clarity of sequential instructions for human developers or maybe Out Of Order processing is the optimal algorithm. There are other changes coming too such as various replacements for DRAM that either integrate more closely with the CPU (such as 3d chips) [1,2] that by reducing the latency of main memory, could actually bring us back closer to the C model of the computer? or just change computing entirely...

[1] https://www.extremetech.com/computing/252007-mit-announces-b... [2] https://news.ycombinator.com/item?id=16894818

kartickv8y ago

It'll be an interesting exercise to ask the Clang folks to relax backward compatibility, this designing a new language, if it makes their compiler go faster.

That could be deployed as a new language, or adding features to existing ones, like value types in Java, or even compiler switches that relax some C rules for faster speed. Imagine -fpointers_cant_be_cast_to_ints or -freorder_struct_fields.

skybrian8y ago

It seems like the article is mostly useful for inspiring research; that is, most of us aren't the target audience.

I'm wondering what will happen as GPU's become more general-purpose. What's next after machine learning?

Would it be possible to make a machine where all code runs on a GPU? How would GPU's have to change to make that possible, and would it result in losing what makes them so useful? What would the OS and programming language look like?

antris8y ago

> It seems like the article is mostly useful for inspiring research; that is, most of us aren't the target audience.

As a group of professionals, it is highly beneficial for us to be interested in these things. People who design languages and compilers do it largely on what is perceived as being demanded, and us as programmers are the ones that create the demand for new languages.

To put in other words, if programmers aren't aware of what's going wrong with our current languages, they cannot express their need for new languages. So, there's less incentive for researchers to produce new ways of programming computers. It is much more tempting to "please the masses" in a way that causes this local-maximum problem. It's much more interesting to research problems that translate into mainstream use than academic things that nobody actually uses.

1 more reply

proverbialbunny8y ago

I think it is going the opposite direction, where cpus get more powerful graphics cards integrated more and more into them. This allows for matrix math to become a bit more standard in day to day programming.

However, in the opposite direction where a gpu becomes more like a cpu, if streams could do some level of limited branching without slowing the whole thing down, it opens the door to threading frameworks and design patterns where you write a loop in code, every thread gets it's own copy of memory, and on the threading front, it just kind of works for a lot of generic code.

Then if gpus added some sort of piped-like summation-like instruction, in the cases in a loop where variables need to be shared, they can still be added, subbed, mul, div, or mod, easily and quickly, allowing for what looks and acts like normal code today, but is actually threaded. That would kind of bring code back to where it is today.

Who knows? It's kind of fun to speculate about though.

1 more reply

kartickv8y ago

Or, conversely, could the iPhone 11 have 20 cores that are optimised not for latency but for power-efficient execution, such as how much you can do before the device runs out of battery? These would still be ARM, and backward-compatible with mainstream languages like Swift, so will have a low barrier to entry.

Maybe apps will be allowed to run longer in the background, if there are always extra CPU cores available that don't consume much power.

snarf218y ago

I think the bigger take away for me is that what constitutes low level has evolved over time. I still think C is low level because you have to manage your own memory and can play tricks with pointers and memory that other languages protect you from. Meaning that low level takes off a lot of the training wheels. The compiler still does what it can and optimizes things but you have a far greater ability to shoot yourself in the foot than in some other language. The arguments about what a "real" low level in these comments seem mostly pedantic.

munificent8y ago

> I think the bigger take away for me is that what constitutes low level has evolved over time.

Yeah, this is a good insight. The height of the overall stack has grown. The lowest low level is lower than it was in the 60s and the highest high level is higher. So we need more terms to cover that wider continuum.

zengid8y ago

I agree that this article hits pretty hard against a lot of assumptions about how our machines are working.

I also feel like the author is trying to say something about how imperative scalar (meaning 'operates on one datum at a time') languages are causing more trouble than they're worth. Sophie Wilson said something similar in her talk about the future of microprocessors [1]. This implies that declarative and functional semantics would be more amenable to parallelization, as the author mentions in the article, as well as allowing the compiler more freedom to deduce a suitable 'reordering' of operations that would better fit the memory access heuristics the machine is using.

[1] https://youtu.be/_9mzmvhwMqw?t=26m30s

jstimpfle8y ago

> cost of cache misses

How is the C memory model a leaky abstraction here? What better way do you suggest? Are we not fine coding sequential (in memory) datastructures in C?

munificent8y ago

C leads you to believe that memory access has uniform cost regardless of address. What is the perf cost of:

    *foo

Depending on what foo points to, and which memory you have previously read, the cost can vary by close to two orders of magnitude on many chips.

C does give you the ability to control those costs, but controlling how you lay out your data in memory and controlling imperatively in which order you access it. But the language doesn't show you those costs in any way.

1 more reply

pedasmith8y ago

How about a different example entirely? Memory nowadays is either CPU (and the GPU can access it) or GPU (and the CPU has a window into it). It's terribly important to use the right one: if a chunk of memory is mostly used by the CPU, it needs to be CPU memory, and if it's mostly used by the GPU, it needs to be GPU memory. But there's no good way to specify that.

You might argue that a modern computer is more like programming a tightly-bound, nonuniform multi-processor system. And I'd agree. But C doesn't much to help program such a thing.

1 more reply

United8578y ago

It's worth noting that chips that were designed for high-performance computing (e.g. the Cell) from the outset generally don't have silicon devoted to things like out of order execution, register renaming, etc. In this case, the bulk of the optimization logic does shift to the programmer (aided by the compiler).

The reason is that in these domains (e.g. game consoles, supercomputing), you know ahead of time the precise hardware characteristics of your target, you can assume it won't change, and can thus optimize specifically for that ahead of time.

This isn't true for "mass-market" software that needs to run across multiple devices, with many variants of a given architecture.

mattnewport8y ago

> The reason is that in these domains (e.g. game consoles, supercomputing), you know ahead of time the precise hardware characteristics of your target, you can assume it won't change, and can thus optimize specifically for that ahead of time.

Cell was a failure in large part because this proved to be less true / less relevant than its designers thought.

Source: many late nights / weekends trying to get PS3 launch titles performing well enough to ship.

scott_s8y ago

I did work in graduate school trying to make the Cell easier to program - basically, providing OpenMP-like abstractions that would take advantage of the SPEs. I've always been really curious: how much did your games take advantage of the SPEs? When did you send code to the SPEs versus using the GPU? Were you using libraries that helped managing the SPEs, or did you do all of it manually?

1 more reply

obl8y ago

The point of dynamic optimizations (such as ooo) is not only to hide implementation details (such as register file size) but very much to take advantage of dynamic opportunities that simply cannot be known statically. The optimal schedule can be very different depending on whether some load hit L1 vs L2 or even was forwarded from the store buffer.

There are some classes of very regular algorithm where you could probably predict everything (and handle the memory hierarchy) statically, such as GEMM, but it's not very common.

mattnewport8y ago

Yeah, this is a very important point that many people seem to be missing, including the authors of the original article it seems to me. It was certainly a big problem for performance of games on Cell in my experience.

umanwizard8y ago

The points made in the article are certainly valid, but C is low-level in an abstract sense: it is approximately the intersection of all mainstream languages.

I.e. if a feature exists in C, it probably exists in every language most programmers are familiar with. (I worded this statement carefully to exclude exotic languages like Haskell or Erlang).

Thus C, while not low-level relative to actual hardware, is low-level relative to programmers' mental model of programming. If this is what we mean, it's still true and useful to think of C as a low-level language.

That said, it's important to keep the distinction in mind -- statements like "C maps to machine operations in a straightforward way" have been categorically wrong for decades.

munificent8y ago

> if a feature exists in C, it probably exists in every language most programmers are familiar with.

I don't think that's true.

Off the top of my head, C has: array point decay, padding, bit fields, static types, stack allocated arrays, integers of various sizes, untagged enums, goto, labels, pointer arithmetic, setjmp/longjmp, static variables, void pointers, the C preprocessor.

Those features are all absent in many other languages and are totally foreign to users that only know those languages. A large part of C is exposing a model that memory is a freely-interpretable giant array of bytes. Most other languages today are memory safe and go out of their way to not expose that model.

typomatic8y ago

> I worded this statement carefully to exclude exotic languages like Haskell or Erlang

I suspect that your definition of "exotic" is exactly "not like C".

mda8y ago

Which is kinda true. Most popular languages are C like.

2 more replies

kazinator8y ago

Which languages have pointer arithmetic, longjmp, goto anywhere within a function, address-of operator, memcpy that is equivalent to assignment even for lexical variables, untagged unions, switch with fall-through in absence of explicit break and a textual/token-wise preprocessor?

pjmlp8y ago

BLISS, Modula-2, NEWP, Mesa,...?

Of course, some of those tricks are only allowed in SYSTEM/UNSAFE blocks on these languages.

1 more reply

saagarjha8y ago

One feature that many languages don't provide is the ability to have direct control over how aggregates are organized in memory.

fixermark8y ago

> Thus C, while not low-level relative to actual hardware, is low-level relative to programmers' mental model of programming

Programmers' mental model of programming is not a homogeneous set. I'm pretty comfortable in LabView, for example; a language that is extremely parallel (the entire program is composed of a graph of producer / consumer nodes and sequential operation, if desired, must be explicitly requested).

mkirklions8y ago

Its crazy that C has changed because of the way people used it.

Well crazy isnt the correct word... mainstream use has changed the future of an old language...

Rebelgecko8y ago

Going by their definition, I don't think there are any low level languages, at least on modern architectures. Even x86 assembly abstracts out a lot of what is going on within the CPU.

umanwizard8y ago

That doesn't mean the definition is useless -- rather than "C isn't a low-level language, as opposed to something else which is", the point might be "there exist no low-level languages according to most people's understanding of that term". Which is still an interesting and useful fact.

rbanffy8y ago

It also hides the fact C is just a couple notches above the absolute minimum most people would even consider - writing assembly code by hand - and is, effectively, the lowest most programmers will ever venture.

2 more replies

ModernMech8y ago

> Which is still an interesting and useful fact.

I think it just leads to quibbling over the boundary of low level, as is happening here.

I think it's just important to know that the definition changes over time relative to the state of the art. C was once considered high level. In the future, if programming languages evolve to a more natural language state, then sending serial instructions to the computer in a strange code will seem very low level to such programmers.

1 more reply

rbanffy8y ago

Assembly has been a fiction on most computers for a long time now. From the other side of the instruction decoder, they are more like VLIW machines than evolved 8080's.

fixermark8y ago

Correct. For low-level language, we may actually want to look more in the direction of HLSL or GLSL.

Rebelgecko8y ago

I haven't done any shader programming, but can we even say that about those languages? It might just be my own inexperience talking, but for anything more complicated than matrix multiplication the innards of a GPU seem just as opaque as a CPU.

1 more reply

ChuckMcM8y ago

I enjoyed reading this, mostly because it made me angry, then curious, then thoughtful all in one go.

Partly because I really like the PDP-11 architecture, and it's 'separated at birth' twin the 68K, it greatly influenced me in how I think about computation. I also believe that one of the reasons that the ATMega series of 8 bit micros were so popular was that they were more amenable to a C code generator than either the 8051 or PIC architectures were.

That said, computer languages are similar to spoken languages in that a concept you want to convey can be made more easily or less easily understood by the target by the nature of the vocabulary and structure available to you.

Many useful systems abstractions, queues, processes, memory maps, and schedulers are pretty easy to express in C, complex string manipulation, not so much.

What has endeared C to its early users was that it was a 'low constraint' language, much like perl, it historically has had a fairly loose policy about rules in order to allow for a wider variety of expression. I don't know if that makes it 'low' but it certainly helped it be versatile.

dahart8y ago

> A processor designed purely for speed, not for a compromise between speed and C support, would likely support large numbers of threads, have wide vector units, and have a much simpler memory model.

Sounds like a GPU?

> Running C code on such a system would be problematic, so, given the large amount of legacy C code in the world, it would not likely be a commercial success.

It seems like ATI & NVIDIA are doing okay, even with C & C++ kernels. GLSL and HLSL are both C-like. What is problematic?

tsomctl8y ago

C-like code that runs on GPUs is not even close to normal C, even though the syntax is similar. The way you layout your memory, schedule your threads, and add memory barriers is completely different. You are never going to take a piece of large C code written for a CPU and just run it directly on a GPU.

dahart8y ago

Huh, that’s weird, I run a C++ compiler directly on my GPU code. The only difference between CPU and GPU code at the function level is whether I tag it with a __global__ macro or not, and lots of functions compile and run for both CPU and GPU.

Memory layout, thread scheduling, and barriers are not features of the C language and have nothing to do with whether your C is “normal”. Those are part of the programming model of the device you’re using, and apply to all languages on that device. Normal C on an Arduino looks different than normal C on an Intel CPU which looks different than normal C on an NVIDIA GeForce.

1 more reply

rbanffy8y ago

> Sounds like a GPU?

Which reminds me I'd love to see a computer running exclusively from a GPU-like CPU.

And no, Xeon Phi's don't count. They are cool, but look too much like normal PCs.

dahart8y ago

Here’s one: https://en.m.wikipedia.org/wiki/Cray-1

They didn’t call it a GPU then, but the SIMD architecture is quite similar at a high level.

Larrabee was going to be a GPU-like CPU. https://en.m.wikipedia.org/wiki/Larrabee_(microarchitecture)

Here’s a more modern GPU based computer: https://www.nvidia.com/en-us/self-driving-cars/drive-platfor...

If you meant something that sits on your desktop and runs Linux, then yeah it’s uncommon but not unheard of to run it on a SIMD system. The trend is absolutely definitely going toward SIMD being used in general purpose computing. Even if you don’t want to count any of my examples, you will see the “normal” PC become more GPU-like in the future than it is today.

1 more reply

geokon8y ago

I've only done a bit of GPU kernel writing, but I always found it very .. unergonomic. Its like they mashed C to work in a context it wasn't meant for. Which is understandable since you want to encourage adoption, but I'd guess it's part of the motivation behind creating SPIR-V and allowing people to target other languages to the GPU

pjmlp8y ago

SPIR is a reaction to CUDA's adoption.

NVIdia always allowed multiple language on CUDA via PTX, with the offerings for C, C++ and Fortran coming from them, while some third parties had Haskell, .NET and Java support as well.

Yet another reasons why many weren't so keen in being stuck with OpenCL and C99.

ovao8y ago

To me the argument's akin to suggesting that Robert Wadlow wasn't tall, because giraffes are taller than Robert Wadlow.

When the spectrum of the context is unambiguous, that's not an argument for finding a way to make it ambiguous.

Sean17088y ago

I think that would be a fair point if the article was about whether or not we should call C a low-level language, but the article is actually about whether C maps cleanly onto what the machine actually does and what a machine might look like if we didn't have that expectation.

cryptonector8y ago

> The root cause of the Spectre and Meltdown vulnerabilities was that processor architects were trying to build not just fast processors, but fast processors that expose the same abstract machine as a PDP-11. [...]

This strikes me as a flavor of the VLIW+compilers-could-statically-do-more-of-the-work argument, though TFA does not mention VLIW architectures.

C or not, making compilers do more of the work is not trivial, it is not even simple, not even hard -- it's insanely difficult, at least for VLIW architectures, and it's insanely difficult whether we're using C or, say, Haskell. The only concession to make is that a Haskell compiler would have a lot more freedom than a C compiler, and a much more integrated view of the code to generate, but still, it'd be insanely hard to do all of the scheduling in the compiler. Moreover, the moment you share a CPU and its caches is the moment that static scheduling no longer works, and there is a lot of economic pressure to share resources.

There are reasons that this make-the-compilers-insanely-smart approach has failed.

It might be more likely to be successful now than 15 years ago, and it might be more successful if applied to Rust or Haskell or some such than C, but, honestly?, I just don't believe this will work anytime soon, and it's all academic anyways as long as the CPU architects keep churning out CPUs with hidden caches and speculative execution.

If you want this to be feasible, the first step is to make a CPU where you can turn off speculative execution and where there is no sharing between hardware threads. This could be an extension of existing CPUs.

A much more interesting approach might be to build asynchrony right into the CPUs and their ISAs. Suppose LOADs and STOREs were asynchronous, with an AWAIT-type instruction by which to implement micro event loops... then compilers could effectively do CPS conversion and automatically make your code locally async. This is feasible because CPS conversion is well-understood, but this is a far cry from the VLIW approach. Indeed, this is a lot simpler than the VLIW approach.

TFA mentions CMT and ULtraSPARC, and that's certainly a design direction, but note that it's one that makes C less of a problem anyways -- so maybe C isn't the problem...

Still, IMO TFA is right that C is a large part of the problem. Evented programs and libraries written in languages that insist on immutable data structures would help a great deal. Sharing even less across HW/SW threads (not even immutable data) would still be needed in order to eliminate the need for cache coherency, but just having immutable data would help reduce cache snooping overhead in actual programs. But the CPUs will continue to be von Neuman designs at heart.

kev0098y ago

The meta point from the article is that this is as much a hardware problem as it is a language or developer one. An arms race was waged to create CPUs that are very effective in running sequential programs; to the point that what they present to the program is a very much a facade and they hide an increasing great deal of internal implementation detail. By David's postulation, even the native assembly language for the CPU is not low level.

To drive this juxtaposition home, I'd point to PALcode on Alpha processors in which C (and others) can very much be a low level language. Very few commercial processors let you code at the microcode level.

The overarching premise is then brought home by GPU programming, which shows that you don't necessarily need to be writing at the ucode level if the ecosystem was built around how the modern hardware functioned.

scott_s8y ago

The author, David Chisnall, is a co-author on a related paper from PLDI 2016: "Into the Depths of C: Elaborating the De Facto Standards", https://news.ycombinator.com/item?id=11805377

favorited8y ago

He was also one of the earliest non-Apple contributors to Clang, was on the FreeBSD core team, and wrote the modern GNU Objective-C runtime implementation. His work on Objective-C in particular is prolific.

sizeofchar8y ago

Also, his book on Objective-C is the best one I read.

compiler-guy8y ago

There is an entire junkyard full of processors designed to run other languages well.

LISP machines in the 60s, Java machines in the 90s, many others.

For whatever reason, successful general purpose silicon has almost always followed a C-ish model.

It's also worth noting that Fortran runs quite well on C-ish style processors.

gpderetta8y ago

Exactly. While CPU designers c will certainly make sure they can run C code fast, it turns out that, for the last 40 years at least, the C model (sequential, procedural, mostly flat address space) is the most efficient to implement in hardware.

davidw8y ago

"C combines the power and performance of assembly language with the flexibility and ease-of-use of assembly language."

salgernon8y ago

It feels like the author really isn't talking so much about the limitations of C on modern architectures, but the architecture itself.

Possibly relevant is this (short?) discussion[1] from 2011 about a CPU more closely designed for functional programming.

[1] https://news.ycombinator.com/item?id=2645423

angry_octet8y ago

It is instructive to consider GPUs and their compilers. The death of OpenGL in favour of Vulcan has come about because OpenGL is unable to express low level constructs which are essential to achieving performance. GPU drivers are actually compilers that recompile shaders to efficient machine expressions.

Thus the fundamental limitation is that the processor has only a C ABI. If there were a vectorisation and parallel friendly ABI, then it would be possible to write high level language compilers for that. It should be possible for such an ABI to coexist with the traditional ASM/C ABI, with a mode switch for different processes.

angry_octet8y ago

s/vulcan/vulkan damn autocorrect.

arghwhat8y ago

It is correct that C is not really a low level language, but the points about how C limits the processor doesn't make much sense.

It uses UltraSPARC T1 and above processors as an example for a "better" processor "not made for C", but this argument makes no sense at all. The "unique" approach in the UltraSPARC T1 was to aim for many simple cores rather than few large cores.

This is simply about prioritizing silicon. Huge cores, many cores, small/cheap/simple/efficient die. Pick two. I'm sure Sun would have loved to cram huge caches in there, as it would benefit everything, but budgets, deadlines and target prices must be met.

Furthermore, the UltraSPARC T1 was designed to support existing C and Java applications (this was Sun, remember?), despite the claim that this was a processor "not designed for traditional C".

There are very few hardware features that one can add to a conventional CPU (which even includes things like the Mill architecture) that would not benefit C as well, and I cannot possibly imagine a feature that would benefit other languages that would be harmful to C. The example of loop count inference for use of ARM SVE being hard in C is particularly bad It is certainly no harder in the common use of a for loop than it is to deduce the length of an array on which a map function is applied.

I cannot imagine a single compromise done on a CPU as a result of conventional programming/C. That is, short of replacing the CPU with an entirely different device type, such as a GPU or FPGA.

dgreensp8y ago

The point is specifically about parallel vs sequential programs. Legacy C code is sequential, and the C model makes parallel programming very difficult.

I met a guy back in college, a PhD who went to work at Intel, who told me the same thing. In theory, the future of general purpose computing was tons of small cores. In practice, Intel's customers just wanted existing C code to keep running exponentially faster.

arghwhat8y ago

> Legacy C code is sequential, and the C model makes parallel programming very difficult.

Neither of these statements are true, unless "Legacy" refers to the early days of UNIX.

Tasks that parallelize poorly do not benefit of many small cores. This is usually a result of either dealing with a problem that does not parallelize, or just an implementation that does not parallelize (because of a poor design). Neither of these attributes are related to language choice.

An example of something that does not parallelize at all would be an AES256-CBC implementation. It doesn't matter what your tool is: Erlang, Haskell, Go, Rust, even VHDL. It cannot be parallelized or pipelined. INFLATE has a similar issue.

For such algorithms, the only way to increase throughput is to increase single-threaded performance. Increasing cores increase total capacity, but cannot increase throughput. For other tasks, synchronization costs of parallelization is too high. I work for a high performance network equipment manufacturer (100Gb/s+), and we are certainly limited by sequential performance. We have custom hardware in order to load balance data to different CPU sockets, as software based load distribution would be several orders of magnitude too slow. The CPU's just can't access memory fast enough, and many slower cores wouldn't help as they'd both be slower, and incur overheads.

Go and Erlang of course provide built-in language support for easy parallelism, while in C you need to pull in pthreads or a CSP library yourself, but the C model doesn't make parallel programming "very difficult", nor is C any more sequential by nature than Rust. It is also incorrect to assume that you can parallelize your way to performance. In reality, the "tons of small cores" is mostly just good at increasing total capacity, not throughput.

1 more reply

agumonkey8y ago

The thing is, most of the time you're reflecting at some logical level that will not be the "reality". The problem is that C programmer think that C === reality === performance. C has better (lower) constant factors but by no means better all the time.

DannyB28y ago

The sophistication of the compiler does not mean the language is high level.

The meaning of a high level language is to do with abstraction away from the hardware. C programmers often wince at languages that are highly abstracted away from the hardware. But those are what are "high level" languages. Especially languages that remove more and more of the mechanical bookkeeping of computation. Such as garbage collection (aka automatic memory management). Strong typing or automatic typing. Dynamic arrays and other collection structures. Unlimited length integers and possibly even big-decimal numbers of unlimited precision in principle. Symbols. Pattern matching. Lambda functions. Closures. Immutable data. Object programming. Functional programming. And more.

By comparison C looks pretty low level.

Now I'm not knocking C. If there were a perfect language, everyone would already be using it. Consider the Functional vs Object debate. (Or vi vs emacs, tabs vs spaces, etc) But all these languages have a place, or they would not have a widespread following. They all must be doing something right for some type of problem.

C is a low level language. And there is NOTHING wrong with that! It can be something to be proud of!

kiriakasis8y ago

One of the point of the article is that C is relatively high level by your definition.

Basically it says that the C abstract machine has very little in common with most existing processor.

moreover it makes the point that in the last decades of research for CPUs the focus was "make C go fast" wich ultimately cause meltdown.

thinkling8y ago

TLDR: C was close-to-the-metal on the PDP-11 but since then hardware has become more complex while exposing the same abstraction to the C programmer. That means that hardware features such as speculative execution and L1/L2 caching are invisible to the programmer. This was the cause of Spectre and Meltdown and it forces a lot of complexity into the compiler. GPUs achieve high performance in part because their programming model goes beyond C. Processors would be able to evolve if they weren't hamstrung by having to support C.

ajross8y ago

Yeah, but that's just reinventing the mistakes of VLIW all over again. Yes, CPUs have complicated behavior in a way that can't be captured by scalar imperative languages in a concise way. No, that doesn't mean that you can fix this with new abstractions.

The reason C won wasn't that it forced CPUs to adhere to its particular execution metaphor[1], but that it happened upon a metaphor that could be easily expressed and supported by CPUs as they evolved over decades of progress.

[1] Basically: byte-addressable memory in a single linear space, a high performance grows-down stack in that same memory space, two's complement arithmetic, and "unsurprising" cache coherence behavior. No, the last three aren't technically part of the language spec, but they're part of the model nonetheless and had successful architectures really diverged there I doubt C-like runtimes would have "won".

arghwhat8y ago

It is very important to emphasize that GPU's only "achieve high performance" in workloads tailored very specifically to their extremely limited architecture.

CPU's, on the other hand, are designed to be much more generic with decent performance for any task.

derefr8y ago

I wouldn't say this is precisely true. Look at Dolphin's "ubershaders" (https://dolphin-emu.org/blog/2017/07/30/ubershaders/): they're essentially a Turing machine, running on your GPU, used to emulate a GPU of another architecture... and yet this is still (much!) faster than doing the same on the CPU.

And there's nothing special about emulating a GPU on a GPU; you could emulate a CPU architecture just as easily, at a much higher level than you get from an FPGA, and so perhaps faster than you'd be able to get from today's FPGAs. And, if you're mapping GPU shader units 1:1 to VM schedulers, you'd also get a far higher degree of core parallelism than even a Xeon Phi-like architecture would give you. (The big limitation is that you'd be very limited in I/O bandwidth out to main memory; but each shader unit would be able to reserve its own small amount of VRAM texture space—i.e. NUMA memory—to work with.)

I'm still waiting for someone to port Erlang's BEAM VM to run on a GPU; it'd be a perfect fit. :)

umanwizard8y ago

I think it's more accurate to say that CPUs are high-performance for different tasks than GPUs. Simple code, very wide data workloads are atrociously slow on CPUs, and single-threaded heavily branching workloads are atrociously slow on GPUs. That doesn't mean that one is more limited than the other.

2 more replies

kevstev8y ago

I was with you until the last sentence: "Processors would be able to evolve if they weren't hamstrung by having to support C."

I don't think its fair or correct to say that C is the real issue. Recently there have been languages like erlang and support for more functional models that make concurrent code a lot easier to write. The first real consumer multicore processors were only released a bit over 10 years with Intel's Core 2 duo's. Of course SMP systems existed before that, Sun had them for years, but they were relatively niche. Still, Java, C++, C#, are all languages that produce much easier to maintain code if they are single threaded. Recent darlings like JS and Python are single threaded out of the box.

The large majority of languages in use today are not designed to be concurrent as a first principle. True multicore systems have been around for decades, software and mindshare is now starting to catch up and use tools that make concurrency easy.

SomeHacker448y ago

My Symbolics computers (running Symbolics Ivory processors) run Lisp really well, as well as C - they have a C compiler.

I have operational computers of a variety of architectures at home, including the oldest generations (6502, 680x0), Sparc, Symbolics, DEC Alpha, MIPS 32- and 64-bit, etc., and even an extremely rare (and unfortunately not-running) Multiflow, the granddaddy of VLIW.

My favorite part of the original article was the final section. I wish we had a modern CPU renassiance akin to what was going on in the 80s and 90s, but the market dominance of x64 and ARM seems to be squelching things, with optimizations to those architectures rather than novel new ones (with possibly novel new compiler technologies). 64-bit ARM was a nice little improvement, though.

jacquesm8y ago

> Recently there have been languages like erlang

Erlang is decades old. It's 32, only 16 years younger than C.

occamrazor8y ago

I don’t understand. CPUs do not support C, they support a specific instruction set. What stops them from having instructions for cache management, pipelining, speculative execution hints, etc?

coliveira8y ago

They do not support C officially, but every CPU designer knows that 99%+ of the code that matters is written in C. Therefore they design chips targeting this translation from C. What the authors want is a better lower level interface that would allow for modern processor features without the legacy of the features available to the PDP11.

1 more reply

mattnewport8y ago

Many of these features are very difficult to use in a useful way in a static context (e.g. at compile time) because the performance gains mostly come from taking advantage of dynamic context. Speculative execution and out of order execution for example are mostly useful because you don't know at compile time exactly what data your code is processing or what CPU it is running on, what function / context you are being called from, what is in cache and what isn't, etc.

The SPUs on the PlayStation 3 were an experiment in user managed caches and that proved to be a difficult thing to make effective use of even in games where you know more context than a lot of code can assume.

_fq4v8y ago

The Itanium processor did exactly this. Other than being a commercial flop, it was found to be quite difficult to actually get the compiler to generate good management instructions, and x86 was often able to beat out an itanium core at the same clock speed

1 more reply

zelos8y ago

Games console CPUs support those kind of instructions, don't they?

To some extent, didn't Intel go down this road with VLIW: trying to shift the burden of making code fast onto the compiler, instead of the CPU?

jackhack8y ago

Thanks for the TLDR.

But if that's the argument, then not even assembly is sufficient, as control over speculative branching and prefetch is only accessible via microcode in the CPU.

I think the argument is improperly framed. This is a discussion over public and private interface. The CPU is treated as a black box with a public interface (the x86+ instruction set). Precisely how those instructions are implemented (on chip microcode) is a private matter for the chip design team, which if correctly implemented, does not matter to the user, as the results should be correct and consistent. Obviously, a poor implementation can lead to Spectre or Meltdown. But for the most part the specific transistors & diodes used to sum a set of integers, or transfer a word from L2 to L3 cache, etc. shouldn't matter to us. If the compilers are relying on side effects to alter behavior of the internal implementation based on performance evidence, then that is a boundary violation.

C is low level. It remains "universal assembly language".

PeterisP8y ago

The argument implied in the article is that choosing a different public interface (breaking "C compatibility" and the imposted limitations) could bring a serious performance improvement.

While precisely how those instructions are implemented (on chip microcode) is a private matter for the chip design team, we do care how much resources it takes to implement these instructions, since if we can enable a more efficient implementation then we can get better price/performance.

1 more reply

umanwizard8y ago

You make good points - if we're just talking about semantics, then yes C is the closest portable language to x86 or arm and is low-level in that sense. But on the other hand, semantics is not always the only important thing: performance is sometimes important also, and there these low-level details matter. The architecture does its best to hide them from the user, but the abstraction is very leaky.

For example, when writing high-performance CPU-bound code it's usually important to keep in mind how wide cache lines are, but C doesn't expose this to the programmer in a natural way.

sytelus8y ago

Interesting tidbits from article:

A modern Intel processor has up to 180 instructions in flight at a time (in stark contrast to a sequential C abstract machine, which expects each operation to complete before the next one begins). A typical heuristic for C code is that there is a branch, on average, every seven instructions. If you wish to keep such a pipeline full from a single thread, then you must guess the targets of the next 25 branches.

The Clang compiler, including the relevant parts of LLVM, is around 2 million lines of code. Even just counting the analysis and transform passes required to make C run quickly adds up to almost 200,000 lines (excluding comments and blank lines).

anfilt8y ago

I hate the idea of "low-level". There is not really such a thing. You should be using a language suitable for the domain your working in.

Sadly, too many programming languages try to be the end all be all. C is language that is great for working at the system domain.

Ideally, we would have small minimalist languages for various problem domains. In reality maintaining and building high quality compilers is a lot work. Moreover, a lot of development will just pile together whatever works.

That aside, you could build a computer transistor by transistor, but it's probably more helpful to think at the logic gate level or even larger units. Heck even a transistor is just a of piece of silicon/germanium that behaves in a certain way.

So there are levels abstraction, but is an abstraction low-level? I think term probably came about to refer lower layers of abstraction that build what ever system your using. So unless your using something that nothing can be added upon. Everything, even what people would call high level can be low-level.

Heck, people call JS a high level language, but there are compilers that compile to JS. This makes a JS a lower level system that something else is built upon. This just again shows why I would say that low-level is often thrown around with connotation that is not exactly true.

judge20208y ago

Archive.is link, as the page loaded incredibly slow for me: http://archive.is/E9s70

plpot8y ago

I find this article insightful, but missing the points it tries to deliver.

What the article is very good at delivering is that current CPU's ISAs exports a model that doesn't exist in reality. Yes, we might call it PDP-11, although I miss that architecture dearly.

C was never meant to be a low level language. It was a way to map loosely to assembler and provide some higher level abstraction (functions, structures, unions) to write code that was more readable, and structured, than assembler. And yes, it is far from perfect. And yes, today is called a low level language with good reasons.

But this article is all about exposing the insanity that modern CPU have become, insanity that is the sacrifice to the altar of backward compatibility -- all CPU architecture that tried the path of not being compatible with older CPUs have died.

I am pretty sure that once we'll have an assembler that map closely to the microcode, or to the actual architecture of the internals of a modern, parallel, NUMA architecture, we will still need to have a C-like language that will introduce higher level features to help us ease writing of non-architecture dependent parts. And it will most probably be C.

rhacker8y ago

The article itself has 4 definitions or "attributes" for low-level languages that can be considered contradictory:

* "A programming language is low level when its programs require attention to the irrelevant."

* Low-level languages are "close to the metal," whereas high-level languages are closer to how humans think.

* One of the common attributes ascribed to low-level languages is that they're fast.

* One of the key attributes of a low-level language is that programmers can easily understand how the language's abstract machine maps to the underlying physical machine.

So basically the entire article's premise (the title) hinges on the last bullet- which can be contested. All the other mentioned attributes can be applied to Java, C, C#, C++. So failing the last bullet point doesn't apply to just C.

dgreensp8y ago

I think the author's point is that despite being perceived as low-level, C doesn't really differ from, say, Java on the last bullet.

In other words, a programmer who sits down and uses C and not Java might think, "I am being forced to pay attention to irrelevant things and think in unnatural ways, but that's because I am writing fast code using operations that map to operations done by the physical machine. In a higher-level language like Java, more of these details are out of my control because they are abstracted away by the language and handled by the compiler."

I think the article does a great job dismantling this point of view, and telling the story that C is not so different from Java, aside from being unsafe and ill-specified.

zkomp8y ago

Maybe true but I think the Java example is not that good. Java is still not that different from C. Java is more like a decendant to C and C++ - and to be honest both languages force you to pay attention to lots of irrelevant "low-level" detail, fictionally low-level since its not actually the machine but language itself (that is stuck in the PDP11 mental mode...)

Compared to something different like Erlang, Haskell, Lisp

1 more reply

Const-me8y ago

One reason for that is for many applications latency is much more critical than bandwidth. For PCs that’s input-to-screen latency, for servers that’s request-to-response. It’s possible to make multicore processors with simpler cores, design OS and language ecosystem around it, etc. Such tradeoffs will surely improve bandwidth, but will harm latency.

Another reason is most IO devices are inherently serial. Ethernet only has 4 pairs, and wifi adapters are usually connected by a single USB or PCIx lane. If a system has limited single threaded (i.e. serial, PDP11-like) performance, it gonna be hard to produce or consume these gbits/sec of data.

zwieback8y ago

Great article if you're willing to read past headlines. I would have liked to see a mention of small processors that are still hugely popular (microcontrollers, etc.) where C is still a good fit.

wglb8y ago

The article does not properly distinguish between C as a language and what the C compiler does with the C program. The logic of the article references what the compiler does.

The reasonable way to measure languages is to look at the abstractions present in the language. C has fewer abstractions than the other languages that we are familiar with. That is the reasonable definition of the level of a language.

favorited8y ago

That's exactly the author's point. The C that programmers write is remarkably far from what the compiler generates for modern hardware.

How do you propose measuring the number of abstractions? JavaScript has remarkably few built-in abstractions, but it's in no way "low-level" from a hardware perspective.

z3t48y ago

I wonder if it's easier for a compiler/cpu to optimize "async" code ? And I often find myself having an array in JavaScript that calls the same function on each item in the array, it would be nice if such cases would be made parallel, which I think is possible to do in C++. Is that ever gonna happen in JavaScript !?

Shikadi8y ago

Language evolves. C is certainly lower level than C# or JavaScript, so even if it no longer fits the definition created decades ago, I don't see a problem with the term evolving to match modern times. People say assembly language when they mean assembly language, (which others have argued isn't low level any more anyway) so using low level to describe a language closer to the hardware seems valid to me. It's interesting that the author argues C could be considered low level on the PDP-11, because by the old definition used back it definitely wouldn't be. That tells me the author's definition of low level is already an evolution of the original definition, so there's no reason the term can't evolve some more.

Wiki definition:

lmm8y ago

The whole point of the article is that by a definition like the one you quoted, modern C is not low-level, though PDP-11 era C was.

Shikadi8y ago

Except what I'm saying is that PDP-11 era C wasn't low level. It incedentally took advantage of some low level features, but that wasn't by design, and wouldn't have changed its classification as a high level language at the time anyway

richardwhiuk8y ago

That's not true - PDP-11 era C isn't either - if you run it on a modern processor. And it's doubtful it even was then.

1 more reply

hokus8y ago

http://web.archive.org/web/20180502001551/https://queue.acm....

qsdf381008y ago

"processors wishing to keep their execution units busy running C code" What? This is non sense, the processor is not running C code! The processor can only run machine code, regardless of the language used to write the source code.

monocasa8y ago

Eh, the past thirty years has had CPUs designed to run C. That's the whole point of RISC: the idea of 'let's just pare down the CPU to what actually gets compiled, and now we have less gates in the critical path and we can run our chips faster'.

qsdf381008y ago

hmmm, I think that CPUs instruction sets inherit mainly from the 8086, and https://en.wikipedia.org/wiki/Intel_8086 mentions few languages that had influence on the 8086 design, but don't mention C at all.

smadge8y ago

Makes me wonder if x86 could be extended to expose the underlying parrellelism. How much faster would my Prolog and Haskell programs run if all branches were executed simultaneously and only the successful path down my search tree returned?

tome8y ago

Probably not much faster, otherwise you would have just implemented that by hand.

sriku8y ago

I couldn't read the article, but based on the comments, would it change the way we use C whether we declared it a "low level language" or not?

Sean17088y ago

The article is actually about how closely C maps to what is actually run on the hardware and whether hardware would look significantly different today if people didn't expect C to map closely.

justicezyx8y ago

Statement of using adjective almost always is about defining the context.

burke8y ago

C is Not a True Scotsman

mar77i8y ago

Damn Scots! They ruined Scotland!

waynecochran8y ago

Ok, w/o dipping into machine code, show me a low level language. Any snippet of C-code is transparent in that you know roughly how it is going to be translated into machine code.

Someone8y ago

”Any snippet of C-code is transparent in that you know roughly how it is going to be translated into machine code.”

…for a definition of ‘roughly’ that has become significantly less precise over the past decades.

For example, there was a time where you could be reasonably sure every multiplication in your source code mapped to a multiplication instruction, but that time has long been gone. Constant folding, replacement of multiplications by shifts and loop hoisting aren’t exactly novel techniques.

umanwizard8y ago

> Any snippet of C-code is transparent in that you know roughly how it is going to be translated into machine code.

This isn't really true on a modern optimizing compiler.

phamilton8y ago

Precisely. As someone who's tried to duel a compiler for performance, -O3 has very little resemblance to anything I've ever written, and outperforms what I've written significantly.

waynecochran8y ago

You still have a good idea of what can get inlined, where loop unrolling can occurs, constant folding, etc.... if you have any assembly experience you still have a good idea of what machine could can be generated.

plpot8y ago

But this will never be true in any low level language. You can spend a lot of time optimizing any single line of code manually, but software will always be faster than you. And will optimize any code you write beyond recognition.

kwillets8y ago

    unsigned char r(unsigned char num) {
        return num % 10;
    }

https://godbolt.org/g/26HfQk

1ris8y ago

Or compile this with clang and -msse4.2 -O2

    // https://codereview.stackexchange.com/questions/38182
    // https://codereview.stackexchange.com/a/38184
    // Definition: Count number of 1's and 0's from integer     with bitwise operation
    // 2^32 = 4,294,967,296
    // unsigned int 32 bit
    #include<stdio.h>
    int CountOnesFromInteger(unsigned int);
    int main()
    {   unsigned int inputValue;
        short unsigned int onesOfValue;
        printf("Please Enter value (between 0 to 4,294,967,295) : ");
        scanf("%u",&inputValue);
        onesOfValue = CountOnesFromInteger(inputValue);
    
        printf("\nThe Number has \"%d\" 1's and \"%d\" 0's",onesOfValue,32-onesOfValue); }
    // Notice the popcnt
    int CountOnesFromInteger(unsigned int value) {
        int count;
        for (count = 0; value != 0; count++, value &= value-1);
        return count; }

1 more reply

waynecochran8y ago

Yes. this is doing to infamous division by 10 using bit shifts as you will find in hacker's delight: http://www.hackersdelight.org/divcMore.pdf.

You may notice that when you divide num by 10 you get a quotient q and a remainder r:

    num = q*10 + r

Once you get q, you can solve for r as

    r = num - q*10

So this is how you get r and q:

    q = (num >> 1) + (num >> 2);
    q = q + (q >> 4);
    q = q + (q >> 8);
    q = q + (q >> 16);
    q = q >> 3;
    r = num - q*10;
    q = q + ((r + 6) >> 4)

voila!

1 more reply

plpot8y ago

What you see there is the insane amount of complexity to create a high level feature of C: functions, plus compiler optimizations.

What I don't understand about this argument is that you are calling C a high-level language because of compiler optimizations. I can write code in assembler, or in LLVM SSA, and still use software to optimize it beyond recognition.

anonlastname8y ago

Hardware description languages like VHDL, maybe gpu shader languages like CUDA/HLSL

United8578y ago

HLSL isn't a low-level language, indeed HL stands for high level. It's not much different than CPU. The runtime compiles down to a standard bytecode, and the driver translates to the GPU's proprietary native code.

zrobotics8y ago

C-code compiled for a target like AVR/PIC would fit this article's definition of 'low level', its really the platform the code is compiled for.

anonlastname8y ago

VHDL and other hardware description languages

arghwhat8y ago

VHDL is as low-level for an FPGA as C is for a CPU.

VHDL is almost low-level for an ASIC, where you can implement logic more directly. But even then, VHDL is an abstraction.

anentropic8y ago

literally the whole point of the article was that this isn't true any more

sigjuice8y ago

Where does it say in the ISO C standard that C must be translated to assembly code or machine code of any sort?

EDIT: Various C interpreters exist

11thEarlOfMar8y ago

>403 Error - Access Forbidden We are sorry ... ... but we have temporarily restricted your access to the Digital Library. Your activity appears to be coming from some type of automated process. To ensure the availability of the Digital Library we can not allow these types of requests to continue. The restriction will be removed automatically once this activity stops.

We apologize for this inconvenience.

Please contact us with any questions or concerns regarding this matter: portal-feedback@hq.acm.org

The ACM Digital Library is published by the Association for Computing Machinery. Copyright � 2010 ACM, Inc.

suprfnk8y ago

Got that too. Interesingly enough, these automated processes were able to get through:

https://web.archive.org/web/20180501183242/https://queue.acm...

https://webcache.googleusercontent.com/search?q=cache:sClfdA...

nottorp8y ago

It's still on after 9 hours from the OP. I suppose someone in ACM management is obsessed with their intellectual property being stolen, including their public articles. Good luck to them.

klez8y ago

Could it be the influx of traffic with the same referral link that tripped some defense mechanism?

nickpsecurity8y ago

I emailed them about it. Hopefully it will be fixed soon. Just keep a tab or bookmark. :)

Animats8y ago

I got that, too. I'm using Firefox on Linux, with both Ghostery and Privacy Badger enabled. Not impressed with the ACM.

molteanu8y ago

Same here.

lowken108y ago

If the general public & tech community refers to C as a low level language then it is a low level language.

pjmlp8y ago

When a lie gets repeated enough times, eventually it becomes a fact.

sametmax8y ago

It's just practical. Otherwise how do you call java ? Or python ?

And what would be the benefit of changing those particular sementics ?

In french we define such article as "fucking a fly".

1 more reply

arseraptor8y ago

Ah yes, David Chisnall. Another Cambridge wannabe without a hope of tenure track who thinks he is cleverer than he really is and makes a bunch of trite points over and over hoping to get some attention -- not realising they've been made for over 20 years. Have an original thought David, and stop feeling smug. You're not.

dang8y ago

We've banned this account for repeatedly violating the site guidelines.

https://news.ycombinator.com/newsguidelines.html

julienfr1128y ago

Ok. But what are the alternatives that are not decade away ?

_pmf_8y ago

It's low level, but the level is not identical to the machine level.

retrogradeorbit8y ago

It's all relative. Lower level than what? Higher level than what? C is lower level than a huge number of other languages so I would feel comfortable calling it 'low level'.

Avshalom8y ago

Relative to dozens of years of "portable assembly" and "C makes you understand how a computer works" and "C is efficient because it maps to almost 1:1 with CPU operations" and a jillion of related claims.

jacquesm8y ago

Depending on your hardware that is still the case. There are plenty of embedded systems where these claims still hold. It's not really C that has changed (though the language has evolved a little bit), it's the hardware that changed and the implementation of the language.

1 more reply

emilfihlman8y ago

This article is completely clickbait.

C is low level. For example, with AVRs everything you do maps very clearly to what happens as opcodes.

It's like the author wants to blame C for whatever reason and conveniently forgets that C is also portable.

cestith8y ago

The author isn't blaming C. C has stayed largely the same. The author is saying that Intel and AMD have - unlike PIC, AVR, and such - hidden the machine from C so thoroughly that it's no longer a low-level language for that platform.

laythea8y ago

The title is not a well formed statement. It all depends on what you are used to. IE. If I write Java, C is low level. If I write assembler, C is high level.

skylyrac8y ago

This doesn't make any sense. This would mean that my C code compiled for a Cortex-M0 is low level, but for my x86 laptop is not. Or even more stupid, that the same assembly code running in an old 386 is low level, but for an i7 isn't.

Low level is about how close to talking to the CPU you are, not about how close to the silicon you are. The CPU is a black box and the programmer communicates with it. What that box does inside doesn't matter.

sigjuice8y ago

Also, various C interpreters exist where there is no explicit C —> assembly translation.

mar77i8y ago

As far as I understood what I do about C, is that most of C's here called "quirks" have actually been enablers for much of the portability and performance of modern platforms. Therefore I don't like "undefined behavior" and the like being criticised for being such a "hindrance". I hence doubt the author's familiarity with C is much beyond the basics, which kind of makes the case for why the author also had to namedrop Spectre and Meltdown, which were caused by the fact that later optimizations were unsound, ie. the Tomasulo algorithm.

The problematic with the article somewhat remind me of the problems with LCTHW, and that the author of LCTHW was unable to figure out what the deal was about had been admitted by themselves. https://zedshaw.com/2015/01/04/admitting-defeat-on-kr-in-lct... Sorry to re-repost this article again. I just somewhat perceive two variants same "smells" in both.

dikaiosune8y ago

from the article:

> David Chisnall is a researcher at the University of Cambridge, where he works on programming language design and implementation. He spent several years consulting in between finishing his Ph.D. and arriving at Cambridge, during which time he also wrote books on Xen and the Objective-C and Go programming languages, as well as numerous articles. He also contributes to the LLVM, Clang, FreeBSD, GNUstep, and Étoilé open-source projects, and he dances the Argentine tango.

If tango experience isn't enough to make his opinion credible, I imagine being an LLVM and Clang contributor are pretty good qualifications.

mar77i8y ago

I don't understand how someone who ended up working on a C compiler feels intimidated by the standard/-s such software needs to adhere to. And if such people work on a C compiler, we're not logically in for a ride of WTFs?

j / k navigate · click thread line to collapse

316 comments

mattnewport8y ago

ge0rg8y ago

pera8y ago

IMO C is close to low-level because it's relatively easy to imagine the resulting unoptimized assembly given some piece of code (which is why some people jokes about C being a macro assembly).

Maybe this old debate should get an slight update... and this could be the starting point: Is modern x86 assembly still "low-level"? :)

3 more replies

skywhopper8y ago

4 more replies

Avshalom8y ago

colin_mccabe8y ago

mattnewport8y ago

1 more reply

wbl8y ago

pcvarmint8y ago

Itanic was also in-order (at least as far as dispatch), meaning anytime an instruction was stalled, so were all instructions in the same bundle or after it.

SolarNet8y ago

Not sure if facetious, but it failed because shifting that work caused it to be slow and not backwards compatible.

PersistentCough8y ago

> the C compiler

You mean C compiler X has the feature of Y. There are lots of compilers and that's not part of the language.

2 more replies

kazinator8y ago

> Register renaming, cache hierarchies, out of order and speculative execution etc are not visible

That's different from C.

In the history of x86, most new optimizations have preserved the semantics of code. For instance, register renaming isn't blind; it identifies and resolves hazards.

In C, increasing optimization has broken existing programs.

Then, early microprocessors in the 1970's and 80's repeated the pattern: poor exception handling and undocumented opcodes that did curious things (no "illegal instruction" trap).

gpderetta8y ago

Then again store-load reordering is visible across cores and undefined.

admax88q8y ago

> If C is not a low level language then a low level language does not exist for modern CPUs

I think that's a fair conclusion though, I don't think the article is misleading.

x86 assembly is a high level language. It's analogous to JVM bytecode. Modern x86 processors are more like a virtual machine for x86 bytecode.

magduf8y ago

>x86 assembly is a high level language. It's analogous to JVM bytecode.

1 more reply

ninkendo8y ago

It's a fair conclusion if you're willing to accept that the phrase "low-level language" has approximately zero modern examples.

Or maybe we should relax the definition of "low-level language" a bit?

2 more replies

jrochkind18y ago

Hmm, what does "virtual machine" mean if the "virtual machine" is implemented in hardware?

And, is nothing but actual binary machine code a "low level language"? I guess it's the lowest, I don't _think_ you can go lower than that... but someone's probably gonna tell me I'm wrong.

1 more reply

mikepurvis8y ago

The JVM provides native primitives around memory and thread management which are not present in x86, but that's a matter of degree.

1 more reply

pvg8y ago

It's analogous to JVM bytecode.

That's only true in the way everything is analogous to everything.

2 more replies

rbanffy8y ago

Microcode is the new low level language. ;-)

pjmlp8y ago

> Microcode is the new low level language. ;-)

Except for RISC, that has mostly always been the case, when we look back at all those mainframes and their research papers.

nerdponx8y ago

On the flipside, doesn't it allow more or less direct memory access?

1 more reply

Avshalom8y ago

>If C is not a low level language then a low level language does not exist for modern CPUs

correct, but that lack is not an argument for C being low level.

swsieber8y ago

Correct - it's an argument for broadening the title.

2 more replies

0xfaded8y ago

I played a little bit with gpgpu on the raspberry pi.

hencoappel8y ago

What language do you use for GPGPU?

1 more reply

wmu8y ago

> Register renaming, cache hierarchies, out of order and speculative execution etc are not visible at the assembly / machine code level

Many low-level CPU concepts leak to higher layer. A bright example is false sharing, which may manifest even in Java or C# programs.

fixermark8y ago

Modern Intel CPUs basically emulate x86; there are many layers of abstraction between individual opcodes and transistor switching.

kev0098y ago

By David's postulation, even the native assembly language for the CPU is not low level. See my other comment on the parent topic for justification.

ryao8y ago

Spend a week writing in assembly and you will never call C a low level language again.

munificent8y ago

I really really liked this article, and reading the comments here is blowing my mind. Did we read the same thing?

But, at the same time, they have done such a good job of maintaining that illusion that we forget it isn't actually reality.

noobermin8y ago

Shikadi8y ago

Actually, the author's argument about PDP-11 is interesting because C would have never been considered a low level language back then, for any platform.

Wiki definition, also what I was taught in my first CS class:

The term is evolving to match the time, as shown by the author's interpretation already being higher level than the original intention despite the goal of preventing exactly that.

munificent8y ago

> Actually, the author's argument about PDP-11 is interesting because C would have never been considered a low level language back then, for any platform.

Sure, agreed, but I don't think it's super interesting that words evolve in meaning over time.

1 more reply

lowbloodsugar8y ago

Now, sure, people say C is low level, and compared to Java it sure is. But it isn't low-level.

robochat8y ago

[1] https://www.extremetech.com/computing/252007-mit-announces-b... [2] https://news.ycombinator.com/item?id=16894818

kartickv8y ago

It'll be an interesting exercise to ask the Clang folks to relax backward compatibility, this designing a new language, if it makes their compiler go faster.

skybrian8y ago

It seems like the article is mostly useful for inspiring research; that is, most of us aren't the target audience.

I'm wondering what will happen as GPU's become more general-purpose. What's next after machine learning?

antris8y ago

> It seems like the article is mostly useful for inspiring research; that is, most of us aren't the target audience.

1 more reply

proverbialbunny8y ago

Who knows? It's kind of fun to speculate about though.

1 more reply

kartickv8y ago

Maybe apps will be allowed to run longer in the background, if there are always extra CPU cores available that don't consume much power.

snarf218y ago

munificent8y ago

> I think the bigger take away for me is that what constitutes low level has evolved over time.

zengid8y ago

I agree that this article hits pretty hard against a lot of assumptions about how our machines are working.

[1] https://youtu.be/_9mzmvhwMqw?t=26m30s

jstimpfle8y ago

> cost of cache misses

How is the C memory model a leaky abstraction here? What better way do you suggest? Are we not fine coding sequential (in memory) datastructures in C?

munificent8y ago

C leads you to believe that memory access has uniform cost regardless of address. What is the perf cost of:

    *foo

Depending on what foo points to, and which memory you have previously read, the cost can vary by close to two orders of magnitude on many chips.

1 more reply

pedasmith8y ago

You might argue that a modern computer is more like programming a tightly-bound, nonuniform multi-processor system. And I'd agree. But C doesn't much to help program such a thing.

1 more reply

United8578y ago

This isn't true for "mass-market" software that needs to run across multiple devices, with many variants of a given architecture.

mattnewport8y ago

Cell was a failure in large part because this proved to be less true / less relevant than its designers thought.

Source: many late nights / weekends trying to get PS3 launch titles performing well enough to ship.

scott_s8y ago

1 more reply

obl8y ago

There are some classes of very regular algorithm where you could probably predict everything (and handle the memory hierarchy) statically, such as GEMM, but it's not very common.

mattnewport8y ago

umanwizard8y ago

The points made in the article are certainly valid, but C is low-level in an abstract sense: it is approximately the intersection of all mainstream languages.

I.e. if a feature exists in C, it probably exists in every language most programmers are familiar with. (I worded this statement carefully to exclude exotic languages like Haskell or Erlang).

That said, it's important to keep the distinction in mind -- statements like "C maps to machine operations in a straightforward way" have been categorically wrong for decades.

munificent8y ago

> if a feature exists in C, it probably exists in every language most programmers are familiar with.

I don't think that's true.

typomatic8y ago

> I worded this statement carefully to exclude exotic languages like Haskell or Erlang

I suspect that your definition of "exotic" is exactly "not like C".

mda8y ago

Which is kinda true. Most popular languages are C like.

2 more replies

kazinator8y ago

pjmlp8y ago

BLISS, Modula-2, NEWP, Mesa,...?

Of course, some of those tricks are only allowed in SYSTEM/UNSAFE blocks on these languages.

1 more reply

saagarjha8y ago

One feature that many languages don't provide is the ability to have direct control over how aggregates are organized in memory.

fixermark8y ago

> Thus C, while not low-level relative to actual hardware, is low-level relative to programmers' mental model of programming

mkirklions8y ago

Its crazy that C has changed because of the way people used it.

Well crazy isnt the correct word... mainstream use has changed the future of an old language...

Rebelgecko8y ago

Going by their definition, I don't think there are any low level languages, at least on modern architectures. Even x86 assembly abstracts out a lot of what is going on within the CPU.

umanwizard8y ago

rbanffy8y ago

2 more replies

ModernMech8y ago

> Which is still an interesting and useful fact.

I think it just leads to quibbling over the boundary of low level, as is happening here.

1 more reply

rbanffy8y ago

Assembly has been a fiction on most computers for a long time now. From the other side of the instruction decoder, they are more like VLIW machines than evolved 8080's.

fixermark8y ago

Correct. For low-level language, we may actually want to look more in the direction of HLSL or GLSL.

Rebelgecko8y ago

1 more reply

ChuckMcM8y ago

I enjoyed reading this, mostly because it made me angry, then curious, then thoughtful all in one go.

Many useful systems abstractions, queues, processes, memory maps, and schedulers are pretty easy to express in C, complex string manipulation, not so much.

dahart8y ago

> A processor designed purely for speed, not for a compromise between speed and C support, would likely support large numbers of threads, have wide vector units, and have a much simpler memory model.

Sounds like a GPU?

> Running C code on such a system would be problematic, so, given the large amount of legacy C code in the world, it would not likely be a commercial success.

It seems like ATI & NVIDIA are doing okay, even with C & C++ kernels. GLSL and HLSL are both C-like. What is problematic?

tsomctl8y ago

dahart8y ago

1 more reply

rbanffy8y ago

> Sounds like a GPU?

Which reminds me I'd love to see a computer running exclusively from a GPU-like CPU.

And no, Xeon Phi's don't count. They are cool, but look too much like normal PCs.

dahart8y ago

Here’s one: https://en.m.wikipedia.org/wiki/Cray-1

They didn’t call it a GPU then, but the SIMD architecture is quite similar at a high level.

Larrabee was going to be a GPU-like CPU. https://en.m.wikipedia.org/wiki/Larrabee_(microarchitecture)

Here’s a more modern GPU based computer: https://www.nvidia.com/en-us/self-driving-cars/drive-platfor...

1 more reply

geokon8y ago

pjmlp8y ago

SPIR is a reaction to CUDA's adoption.

NVIdia always allowed multiple language on CUDA via PTX, with the offerings for C, C++ and Fortran coming from them, while some third parties had Haskell, .NET and Java support as well.

Yet another reasons why many weren't so keen in being stuck with OpenCL and C99.

ovao8y ago

To me the argument's akin to suggesting that Robert Wadlow wasn't tall, because giraffes are taller than Robert Wadlow.

When the spectrum of the context is unambiguous, that's not an argument for finding a way to make it ambiguous.

Sean17088y ago

cryptonector8y ago

This strikes me as a flavor of the VLIW+compilers-could-statically-do-more-of-the-work argument, though TFA does not mention VLIW architectures.

There are reasons that this make-the-compilers-insanely-smart approach has failed.

TFA mentions CMT and ULtraSPARC, and that's certainly a design direction, but note that it's one that makes C less of a problem anyways -- so maybe C isn't the problem...

kev0098y ago

scott_s8y ago

The author, David Chisnall, is a co-author on a related paper from PLDI 2016: "Into the Depths of C: Elaborating the De Facto Standards", https://news.ycombinator.com/item?id=11805377

favorited8y ago

sizeofchar8y ago

Also, his book on Objective-C is the best one I read.

compiler-guy8y ago

There is an entire junkyard full of processors designed to run other languages well.

LISP machines in the 60s, Java machines in the 90s, many others.

For whatever reason, successful general purpose silicon has almost always followed a C-ish model.

It's also worth noting that Fortran runs quite well on C-ish style processors.

gpderetta8y ago

davidw8y ago

"C combines the power and performance of assembly language with the flexibility and ease-of-use of assembly language."

salgernon8y ago

It feels like the author really isn't talking so much about the limitations of C on modern architectures, but the architecture itself.

Possibly relevant is this (short?) discussion[1] from 2011 about a CPU more closely designed for functional programming.

[1] https://news.ycombinator.com/item?id=2645423

angry_octet8y ago

s/vulcan/vulkan damn autocorrect.

arghwhat8y ago

It is correct that C is not really a low level language, but the points about how C limits the processor doesn't make much sense.

Furthermore, the UltraSPARC T1 was designed to support existing C and Java applications (this was Sun, remember?), despite the claim that this was a processor "not designed for traditional C".

I cannot imagine a single compromise done on a CPU as a result of conventional programming/C. That is, short of replacing the CPU with an entirely different device type, such as a GPU or FPGA.

dgreensp8y ago

The point is specifically about parallel vs sequential programs. Legacy C code is sequential, and the C model makes parallel programming very difficult.

arghwhat8y ago

> Legacy C code is sequential, and the C model makes parallel programming very difficult.

Neither of these statements are true, unless "Legacy" refers to the early days of UNIX.

1 more reply

agumonkey8y ago

DannyB28y ago

The sophistication of the compiler does not mean the language is high level.

By comparison C looks pretty low level.

C is a low level language. And there is NOTHING wrong with that! It can be something to be proud of!

kiriakasis8y ago

One of the point of the article is that C is relatively high level by your definition.

Basically it says that the C abstract machine has very little in common with most existing processor.

moreover it makes the point that in the last decades of research for CPUs the focus was "make C go fast" wich ultimately cause meltdown.

thinkling8y ago

ajross8y ago

arghwhat8y ago

It is very important to emphasize that GPU's only "achieve high performance" in workloads tailored very specifically to their extremely limited architecture.

CPU's, on the other hand, are designed to be much more generic with decent performance for any task.

derefr8y ago

I'm still waiting for someone to port Erlang's BEAM VM to run on a GPU; it'd be a perfect fit. :)

umanwizard8y ago

2 more replies

kevstev8y ago

I was with you until the last sentence: "Processors would be able to evolve if they weren't hamstrung by having to support C."

SomeHacker448y ago

My Symbolics computers (running Symbolics Ivory processors) run Lisp really well, as well as C - they have a C compiler.

jacquesm8y ago

> Recently there have been languages like erlang

Erlang is decades old. It's 32, only 16 years younger than C.

occamrazor8y ago

I don’t understand. CPUs do not support C, they support a specific instruction set. What stops them from having instructions for cache management, pipelining, speculative execution hints, etc?

coliveira8y ago

1 more reply

mattnewport8y ago

_fq4v8y ago

1 more reply

zelos8y ago

Games console CPUs support those kind of instructions, don't they?

To some extent, didn't Intel go down this road with VLIW: trying to shift the burden of making code fast onto the compiler, instead of the CPU?

jackhack8y ago

Thanks for the TLDR.

But if that's the argument, then not even assembly is sufficient, as control over speculative branching and prefetch is only accessible via microcode in the CPU.

C is low level. It remains "universal assembly language".

PeterisP8y ago

The argument implied in the article is that choosing a different public interface (breaking "C compatibility" and the imposted limitations) could bring a serious performance improvement.

1 more reply

umanwizard8y ago

For example, when writing high-performance CPU-bound code it's usually important to keep in mind how wide cache lines are, but C doesn't expose this to the programmer in a natural way.

sytelus8y ago

Interesting tidbits from article:

anfilt8y ago

I hate the idea of "low-level". There is not really such a thing. You should be using a language suitable for the domain your working in.

Sadly, too many programming languages try to be the end all be all. C is language that is great for working at the system domain.

judge20208y ago

Archive.is link, as the page loaded incredibly slow for me: http://archive.is/E9s70

plpot8y ago

I find this article insightful, but missing the points it tries to deliver.

What the article is very good at delivering is that current CPU's ISAs exports a model that doesn't exist in reality. Yes, we might call it PDP-11, although I miss that architecture dearly.

rhacker8y ago

The article itself has 4 definitions or "attributes" for low-level languages that can be considered contradictory:

* "A programming language is low level when its programs require attention to the irrelevant."

* Low-level languages are "close to the metal," whereas high-level languages are closer to how humans think.

* One of the common attributes ascribed to low-level languages is that they're fast.

* One of the key attributes of a low-level language is that programmers can easily understand how the language's abstract machine maps to the underlying physical machine.

dgreensp8y ago

I think the author's point is that despite being perceived as low-level, C doesn't really differ from, say, Java on the last bullet.

I think the article does a great job dismantling this point of view, and telling the story that C is not so different from Java, aside from being unsafe and ill-specified.

zkomp8y ago

Compared to something different like Erlang, Haskell, Lisp

1 more reply

Const-me8y ago

zwieback8y ago

Great article if you're willing to read past headlines. I would have liked to see a mention of small processors that are still hugely popular (microcontrollers, etc.) where C is still a good fit.

wglb8y ago

The article does not properly distinguish between C as a language and what the C compiler does with the C program. The logic of the article references what the compiler does.

favorited8y ago

That's exactly the author's point. The C that programmers write is remarkably far from what the compiler generates for modern hardware.

How do you propose measuring the number of abstractions? JavaScript has remarkably few built-in abstractions, but it's in no way "low-level" from a hardware perspective.

z3t48y ago

Shikadi8y ago

Wiki definition:

lmm8y ago

The whole point of the article is that by a definition like the one you quoted, modern C is not low-level, though PDP-11 era C was.

Shikadi8y ago

richardwhiuk8y ago

That's not true - PDP-11 era C isn't either - if you run it on a modern processor. And it's doubtful it even was then.

1 more reply

hokus8y ago

http://web.archive.org/web/20180502001551/https://queue.acm....

qsdf381008y ago

monocasa8y ago

qsdf381008y ago

smadge8y ago

tome8y ago

Probably not much faster, otherwise you would have just implemented that by hand.

sriku8y ago

I couldn't read the article, but based on the comments, would it change the way we use C whether we declared it a "low level language" or not?

Sean17088y ago

The article is actually about how closely C maps to what is actually run on the hardware and whether hardware would look significantly different today if people didn't expect C to map closely.

justicezyx8y ago

Statement of using adjective almost always is about defining the context.

burke8y ago

C is Not a True Scotsman

mar77i8y ago

Damn Scots! They ruined Scotland!

waynecochran8y ago

Ok, w/o dipping into machine code, show me a low level language. Any snippet of C-code is transparent in that you know roughly how it is going to be translated into machine code.

Someone8y ago

”Any snippet of C-code is transparent in that you know roughly how it is going to be translated into machine code.”

…for a definition of ‘roughly’ that has become significantly less precise over the past decades.

umanwizard8y ago

> Any snippet of C-code is transparent in that you know roughly how it is going to be translated into machine code.

This isn't really true on a modern optimizing compiler.

phamilton8y ago

Precisely. As someone who's tried to duel a compiler for performance, -O3 has very little resemblance to anything I've ever written, and outperforms what I've written significantly.

waynecochran8y ago

plpot8y ago

kwillets8y ago

    unsigned char r(unsigned char num) {
        return num % 10;
    }

https://godbolt.org/g/26HfQk

1ris8y ago

Or compile this with clang and -msse4.2 -O2

    // https://codereview.stackexchange.com/questions/38182
    // https://codereview.stackexchange.com/a/38184
    // Definition: Count number of 1's and 0's from integer     with bitwise operation
    // 2^32 = 4,294,967,296
    // unsigned int 32 bit
    #include<stdio.h>
    int CountOnesFromInteger(unsigned int);
    int main()
    {   unsigned int inputValue;
        short unsigned int onesOfValue;
        printf("Please Enter value (between 0 to 4,294,967,295) : ");
        scanf("%u",&inputValue);
        onesOfValue = CountOnesFromInteger(inputValue);
    
        printf("\nThe Number has \"%d\" 1's and \"%d\" 0's",onesOfValue,32-onesOfValue); }
    // Notice the popcnt
    int CountOnesFromInteger(unsigned int value) {
        int count;
        for (count = 0; value != 0; count++, value &= value-1);
        return count; }

1 more reply

waynecochran8y ago

Yes. this is doing to infamous division by 10 using bit shifts as you will find in hacker's delight: http://www.hackersdelight.org/divcMore.pdf.

You may notice that when you divide num by 10 you get a quotient q and a remainder r:

    num = q*10 + r

Once you get q, you can solve for r as

    r = num - q*10

So this is how you get r and q:

    q = (num >> 1) + (num >> 2);
    q = q + (q >> 4);
    q = q + (q >> 8);
    q = q + (q >> 16);
    q = q >> 3;
    r = num - q*10;
    q = q + ((r + 6) >> 4)

voila!

1 more reply

plpot8y ago

What you see there is the insane amount of complexity to create a high level feature of C: functions, plus compiler optimizations.

anonlastname8y ago

Hardware description languages like VHDL, maybe gpu shader languages like CUDA/HLSL

United8578y ago

zrobotics8y ago

C-code compiled for a target like AVR/PIC would fit this article's definition of 'low level', its really the platform the code is compiled for.

anonlastname8y ago

VHDL and other hardware description languages

arghwhat8y ago

VHDL is as low-level for an FPGA as C is for a CPU.

VHDL is almost low-level for an ASIC, where you can implement logic more directly. But even then, VHDL is an abstraction.

anentropic8y ago

literally the whole point of the article was that this isn't true any more

sigjuice8y ago

Where does it say in the ISO C standard that C must be translated to assembly code or machine code of any sort?

EDIT: Various C interpreters exist

11thEarlOfMar8y ago

We apologize for this inconvenience.

Please contact us with any questions or concerns regarding this matter: portal-feedback@hq.acm.org

The ACM Digital Library is published by the Association for Computing Machinery. Copyright � 2010 ACM, Inc.

suprfnk8y ago

Got that too. Interesingly enough, these automated processes were able to get through:

https://web.archive.org/web/20180501183242/https://queue.acm...

https://webcache.googleusercontent.com/search?q=cache:sClfdA...

nottorp8y ago

It's still on after 9 hours from the OP. I suppose someone in ACM management is obsessed with their intellectual property being stolen, including their public articles. Good luck to them.

klez8y ago

Could it be the influx of traffic with the same referral link that tripped some defense mechanism?

nickpsecurity8y ago

I emailed them about it. Hopefully it will be fixed soon. Just keep a tab or bookmark. :)

Animats8y ago

I got that, too. I'm using Firefox on Linux, with both Ghostery and Privacy Badger enabled. Not impressed with the ACM.

molteanu8y ago

Same here.

lowken108y ago

If the general public & tech community refers to C as a low level language then it is a low level language.

pjmlp8y ago

When a lie gets repeated enough times, eventually it becomes a fact.

sametmax8y ago

It's just practical. Otherwise how do you call java ? Or python ?

And what would be the benefit of changing those particular sementics ?

In french we define such article as "fucking a fly".

1 more reply

arseraptor8y ago

dang8y ago

We've banned this account for repeatedly violating the site guidelines.

https://news.ycombinator.com/newsguidelines.html

julienfr1128y ago

Ok. But what are the alternatives that are not decade away ?

_pmf_8y ago

It's low level, but the level is not identical to the machine level.

retrogradeorbit8y ago

It's all relative. Lower level than what? Higher level than what? C is lower level than a huge number of other languages so I would feel comfortable calling it 'low level'.

Avshalom8y ago

jacquesm8y ago

1 more reply

emilfihlman8y ago

This article is completely clickbait.

C is low level. For example, with AVRs everything you do maps very clearly to what happens as opcodes.

It's like the author wants to blame C for whatever reason and conveniently forgets that C is also portable.

cestith8y ago

laythea8y ago

The title is not a well formed statement. It all depends on what you are used to. IE. If I write Java, C is low level. If I write assembler, C is high level.

skylyrac8y ago

sigjuice8y ago

Also, various C interpreters exist where there is no explicit C —> assembly translation.

mar77i8y ago

dikaiosune8y ago

from the article:

If tango experience isn't enough to make his opinion credible, I imagine being an LLVM and Clang contributor are pretty good qualifications.

mar77i8y ago

j / k navigate · click thread line to collapse