Cello – High Level C

(libcello.org)

251 points · thisisastopsign · 6y ago · 77 comments

kazinator6y ago· 3 in thread

The strategy in the GC for determining the stack top for hunting GC roots will not work on all architectures.

On aarch64, the address of a local dummy variable may be above a register save area in the stack frame, and thus the scan will miss some GC roots.

In TXR Lisp, I used to use a hacked constant on aarch64: STACK_TOP_EXTRA_WORDS. It wasn't large enough to straddle the area, and so operation on aarch64 was unreliable.

http://www.kylheku.com/cgit/txr/commit/?id=3aa731546c4691fac...

A good stack-top-getting trick occurred to me: call alloca for a small amount of memory and use that address. It has to be below everything; alloca cannot start allocating above some register save area in the frame, because then it would collide with it; alloca has to know the real stack top and work from there.

Since we need to scan registers, we use alloca for the size of the register file (e.g. setjmp jmp_buf), and put that there: kill two birds with one stone.

http://www.kylheku.com/cgit/txr/commit/?id=7d5f0b7e3613f8e8b...
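A minimal sketch of the alloca trick described above, assuming a downward-growing stack and a platform that provides `<alloca.h>`. The names `gc_stack_bottom`, `words_between` and `gc_scan_stack` are hypothetical and are not taken from TXR or Cello; a real collector would inspect each word in the range rather than count them.

```c
#include <alloca.h>   /* non-standard; provided by glibc and most Unix-likes */
#include <assert.h>
#include <setjmp.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical: set once near program start, e.g. to the address of a
   local variable in main. */
static void *gc_stack_bottom;

/* Hypothetical stand-in for a root scan: counts the words in [lo, hi). */
static size_t words_between(uintptr_t lo, uintptr_t hi)
{
    assert(lo <= hi);
    return (size_t)(hi - lo) / sizeof(void *);
}

/* Returns how many stack words a conservative scan would visit. */
static size_t gc_scan_stack(void)
{
    /* The alloca'd block should sit below everything else in this frame,
       including any register save area. */
    jmp_buf *regs = alloca(sizeof *regs);

    /* setjmp stores the callee-saved registers into that block, so the
       register contents fall inside the scanned range as well. */
    setjmp(*regs);

    return words_between((uintptr_t)regs, (uintptr_t)gc_stack_bottom);
}
```

Sizing the alloca request to hold a `jmp_buf` is what lets one block serve as both the stack-top marker and the register snapshot.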

naasking6y ago

> On aarch64, the address of a local dummy variable may be above a register save area in the stack frame

Then use two stack frames! Every problem can be solved by adding an additional level of indirection. ;-)

kazinator6y ago

In this case it won't help, because:

0. We are already in a frame that doesn't take any arguments of the "val" object type; how come that's not good enough?

1. The current stack frame is entered with a bunch of callee-saved registers, some of which contain GC roots.

2. The current stack frame's code saves some of them: those ones that it clobbers locally. It leaves others in their original registers.

3. Thus, if another stack frame is called, there are still some callee-saved registers, probably containing GC roots, and some of these will go into the area below the locals.

4. You might think that if we save all the necessary registers ourselves into the stack and then make another stack frame, we would be okay. But in fact, no. Because by the time we save registers, the compiler-generated function entry has already executed and saved some of those registers into the below-locals save area and clobbered them for its own use! So our snapshot possibly misses GC roots. The compiler-generated code always has "first dibs" at the incoming registers, to push them into the below-locals save area, thus kicking the GC roots farther up the stack.

z926y ago

I use argv[0] as stack head.

aerovistae6y ago· 9 in thread

Seen this posted here years ago. Now as then, my gut feeling is that anyone doing serious work in C would never use something like this. I feel like the fine grained low level control is exactly the reason they chose C in the first place, and they're not looking to escape from it or they would just choose a different language.

tyingq6y ago

"Why does this exist? I made Cello as a fun experiment to see what C looks like hacked to its limits. As well as being a powerful library and toolkit, it should be interesting to those who want to explore what is possible in C."

"Can it be used in Production? It might be better to try Cello out on a hobby project first. Cello does aim to be production ready, but because it is a hack it has its fair share of oddities"

It sounds like they don't intend for it to be anything other than an interesting case study.

mbreese6y ago

I found their FAQ to be refreshingly honest. This is in no way suited for large projects or where multiple people will be contributing. A case study sounds like a good description.

And the authors seem quite okay with that.

asveikau6y ago

Not only that, but last I looked into this library's code there was a lot of undefined behavior and general sloppiness that goes against good C practices, e.g. ignoring errors, casting all types to void * literally all the time, or treating char VLAs as structs without regard for alignment.

My sympathy and respect to the author, but they did not appear to learn C well before trying to "fix" it. It is kind of irresponsible, I think, to say it "aims to be production ready" and write it up as something other C neophytes may be interested in with some of these issues.
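To illustrate the alignment concern in general terms (this is not Cello's code): a plain char buffer is only guaranteed alignment 1, so casting it to a struct pointer can produce a misaligned access. A hedged sketch of one safer pattern follows, using C11 `alignas` plus `memcpy`; `struct header` and `roundtrip_size` are hypothetical names.

```c
#include <assert.h>
#include <stdalign.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical object header, for illustration only. */
struct header {
    void  *type;
    size_t size;
};

/* Stores a header in a raw byte buffer and reads it back without ever
   dereferencing the buffer through a struct pointer. */
static size_t roundtrip_size(size_t n)
{
    /* Explicit alignment request; a bare `unsigned char buf[...]` would
       only be guaranteed alignment 1. */
    alignas(max_align_t) unsigned char buf[sizeof(struct header)];
    struct header in = { NULL, n };
    struct header out;

    assert((uintptr_t)buf % alignof(struct header) == 0);

    memcpy(buf, &in, sizeof in);    /* no misaligned or type-punned store */
    memcpy(&out, buf, sizeof out);  /* no misaligned or type-punned load */
    return out.size;
}
```

Copying through `memcpy` also sidesteps the effective-type rules, which a direct cast of the buffer would not.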

shakna6y ago

> Not only that, but last I looked into this library's code there was a lot of undefined behavior

Neither GCC's nor Clang's sanitisers pick up any undefined behaviour - and it's been like that for at least the last few years I've looked at it.

As to ignoring errors, and ignoring alignment, I don't think I've ever seen anything like that in the project. I have seen several pull requests delayed so that they will.

Overall, for what it's doing, this is one of the cleaner codebases I've dealt with.

kazinator6y ago

The fine-grained control is just a statement away. When you're not using the $<ident>(...) macros, it's just normal C.

It's pretty much exactly the same as the low-level C techniques being instantly available to you in C++ or Objective-C.

ActorNightly6y ago

The biggest advantage for me in using C is that the syntax never changes.

When I go look at C code, I don't have to look up some annotation or new syntax that got introduced behind some abstraction that gets compiled in automatically after the library is pulled from the internet by whatever build system the project uses.

flukus6y ago

> I feel like the fine grained low level control is exactly the reason they chose C in the first place

That's not the only reason, there is also simplicity, static typing and performance. If you favor the latter two for whatever reason it can be used in places where you'd normally write a python/shell script or small program without too much extra effort (see https://github.com/RhysU/c99sh or suckless tools). Complexity is where Cello seems to fall down though; it seems to introduce much more complexity than just using plain C with a decent "standard" library like glib.

da_chicken6y ago

> That's not the only reason, there is also simplicity, static typing and performance.

I think the only meaningful benefit here is performance.

Simplicity is at best determined by the nature of the problem and at worst a completely subjective opinion for C.

Similarly, static typing is not usually something the programmer should care about that much. You need to know which paradigm your language uses, of course, but beyond that it does not matter all that much. IMX, you're more concerned with type safety, and C is not fully type safe like, say, Java is.

kungtotte6y ago

You could also just pick something like Nim (https://nim-lang.org) if you wanted to hit the intersection of low effort, script-style programming with the addition of types and performance.

analog316y ago· 1 in thread

This is just an amusing aside: C is the lowest level of an actual cello. ;-)

aneutron6y ago

I would have never gotten the wordplay at hand. Thanks! Very clever.

drongoking6y ago· 2 in thread

Isn't this sort of what Glib is getting at? Bringing higher level data structures and capabilities (extendable arrays, hash tables, heaps, etc.) into C.

https://developer.gnome.org/glib/stable/glib-data-types.html

You don't get Cello's macros, and it uses reference counting instead of invisible garbage collection, but you get a lot of fun high-level capabilities.

jopsen6y ago

And if you really want to go high-level wouldn't Vala be the language?

war10256y ago

Vala is actually a really pleasant way to interact with the various bits of the Gnome environment.

It's a shame, in my opinion, that it never really received widespread love.

I think all serious work on it stopped about a decade ago.

winrid6y ago· 3 in thread

There was a snippet I saw a while ago where someone made C look like Java for a joke, using macros. I wish I could find it to share here, it's great.

mar77i6y ago

The original bourne shell source was using mac.h ( https://minnie.tuhs.org/cgi-bin/utree.pl?file=V7/usr/src/cmd... ), a bunch of macros, according to wikipedia, "to give the C source code an ALGOL 68 flavor."

Here's the original source tree, if you're interested: https://minnie.tuhs.org/cgi-bin/utree.pl?file=V7/usr/src/cmd...
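For a flavor of the technique, here is a small sketch of ALGOL-style keyword macros over C control flow. The macro names are chosen in the spirit of that header and are an assumption, not a copy of the original mac.h; `sum_odd_up_to` is a hypothetical example function.

```c
#include <assert.h>

/* ALGOL-flavored keywords layered over C control flow (illustrative). */
#define IF    if (
#define THEN  ) {
#define ELSE  } else {
#define FI    }
#define WHILE while (
#define DO    ) {
#define OD    }

/* Sums the odd integers from 1 to n inclusive. */
static int sum_odd_up_to(int n)
{
    int i = 1, total = 0;

    assert(n >= 0);
    WHILE i <= n DO
        IF i % 2 THEN
            total += i;
        FI
        i++;
    OD
    return total;
}
```

After preprocessing, the body is an ordinary `while` loop containing an `if`, which is why such code still compiles as plain C.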

winrid6y ago

Here it is: https://stackoverflow.com/a/653028/981408

roywiggins6y ago

Could be worse:

http://oldhome.schmorp.de/marc/bournegol.html

gok6y ago· 1 in thread

Previously https://news.ycombinator.com/item?id=14091630

dang6y ago

Also 2015: https://news.ycombinator.com/item?id=10526159

2014: https://news.ycombinator.com/item?id=8799070

2013: https://news.ycombinator.com/item?id=6047576

kazinator6y ago

I'm left wondering about the iteration example that is also quoted in the home page:

https://github.com/orangeduck/Cello/blob/master/examples/ite...

Okay, so the vector is garbage-collectable once the function terminates ... but it has references to stack-allocated integers i0, i1 and i2. That leaves me wondering: won't the GC walk these and trample on stack memory that has been deallocated/reused?

(Maybe those integer values have a tag right in the val pointer that gets the GC to avoid dereferencing them.)
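The tagging idea speculated about above can be sketched as follows. This is a generic low-bit tagging scheme, not a description of how Cello actually represents values; `val`, `make_int`, `make_ptr` and `gc_should_trace` are hypothetical names.

```c
#include <assert.h>
#include <stdint.h>

/* A value word: either a small integer or an aligned pointer. */
typedef uintptr_t val;

#define TAG_MASK ((uintptr_t)1)
#define TAG_INT  ((uintptr_t)1)  /* low bit set: immediate integer */

/* Packs an integer into the word itself, so nothing is allocated. */
static val make_int(intptr_t n)
{
    return ((uintptr_t)n << 1) | TAG_INT;
}

/* Wraps a pointer; relies on objects being at least 2-byte aligned. */
static val make_ptr(void *p)
{
    assert(((uintptr_t)p & TAG_MASK) == 0);
    return (uintptr_t)p;
}

static int is_int(val v)
{
    return (v & TAG_MASK) == TAG_INT;
}

/* Recovers the integer. Right-shifting a negative value is
   implementation-defined in C; common compilers shift arithmetically. */
static intptr_t int_value(val v)
{
    return (intptr_t)v >> 1;
}

/* The collector only follows words that are not immediate integers. */
static int gc_should_trace(val v)
{
    return !is_int(v);
}
```

With such a scheme an integer never lives in memory the collector would follow, so there would be no dangling stack reference to walk.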

noncoml6y ago· 5 in thread

Just my opinion, don't mean to be inflammatory, but if the user has to know and manually manage stack vs heap objects, then I wouldn't call it "High Level" language.

pmiller26y ago

Then just allocate everything on the stack and use the optional garbage collector: http://libcello.org/learn/garbage-collection

jessaustin6y ago

Allocating on the stack doesn't need GC.

zozbot2346y ago

That raises a question, would you call C# a "High Level" language?

rubber_duck6y ago

In C# it's more relevant to understand semantics (ref/value type) than allocation details (unless you actually care about low level details for performance/interop)

nobleach6y ago

I'd say the fact that structs are stack-allocated, and you can slip right through years of development without even knowing that fact... yeah, it's pretty high-level. C# doesn't have `malloc`. .Net apps are managed, so all of those low-level things one has to/gets to do are abstracted away.

rs23296008n16y ago· 4 in thread

Didn't C++ start out as a set of hacks on C? Fairly sure it was originally a preprocessing stage ahead of an ordinary C compiler.

Raises the question of how usefully far you can make C twist using macros / preprocessor.

Candidates like Forth or Lisp seem possible. A few weekends at most. Might need to take a few liberties.

Python... Perhaps if you implement a less dynamic subset? Duck typing may trip you up. To what extent?

What about Elixir?

DonaldFisk6y ago

I wrote my own Lisp. The virtual machine, which is a stack machine, is written in C. The interpreter, which runs until the system compiles itself, is written in C, but makes heavy use of C macros. The rest of the code, including the compiler, is in Lisp.

Code in the interpreter is directly converted to byte code, e.g. the macro Car generates the virtual machine instruction Car, rather than executing the code for car. The alternative would have been to generate byte code by hand, which would have been error-prone. Here's the code for cons and let:

    Define("cons", 2)
      Local1 Local2 Cons Ret
    Termin

    DefineF("let")
      Local1 Car
      Local2 /* initialize new env */
      Prog(1)
      Ret

      Params(2)
      Until Local1 Null Do
        Local1 Caar /* var */
        Local1 Cadar Free12 Call("eval") /* val in old env */
        Local2 /* env */
        ACons
        SetLocal2 Pop /* update new env */
        PopLocal1
      Od
      Free11 Cdr Local2 Call("progn") /* use new env */
      Ret
    Termin

It is actually C, with heavy use of macros. But it can be read as Reverse Polish Lisp. It can also be thought of as a Lispy Forth.
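The macro definitions behind that listing are not shown, so the following is only a guess at how such a scheme could work: each word expands to a call that appends one opcode to a buffer. The `emit` helper, the opcode enum and `compile_cons` are all hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical opcode set covering only the words used below. */
enum op { OP_LOCAL1, OP_LOCAL2, OP_CONS, OP_RET };

static unsigned char code[64];  /* byte code buffer */
static size_t code_len;         /* number of bytes emitted so far */

/* Appends one instruction to the buffer. */
static void emit(enum op o)
{
    assert(code_len < sizeof code);
    code[code_len++] = (unsigned char)o;
}

/* Each word is a macro that emits its own instruction, so a sequence of
   words in the C source reads like postfix code. */
#define Local1 emit(OP_LOCAL1);
#define Local2 emit(OP_LOCAL2);
#define Cons   emit(OP_CONS);
#define Ret    emit(OP_RET);

/* Emits the body of "cons" and returns the number of bytes produced. */
static size_t compile_cons(void)
{
    code_len = 0;
    Local1 Local2 Cons Ret
    return code_len;
}
```

Under these assumptions the source line `Local1 Local2 Cons Ret` expands to four `emit` calls, which matches the idea of writing byte code without hand-assembling it.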

rs23296008n16y ago

Interesting approach. Maybe develop it further into a JIT arrangement or a library builder.

Brainf*ck is another classic.

carlmr6y ago

Nim transpiles to C. It's very cool since there are C compilers for almost every processor.

rs23296008n16y ago

Nim is very good. I'm using it as a glue language right now and it works out well.

scoutt6y ago

Thanks. I didn't know about this library. Interesting, but perhaps I am missing something... about stack "allocation":

  var i0 = $(Int, 5);

  int i0[5];

In both cases it doesn't need GC. What would be the reasons for redefining it? I wonder how it couples with local static variables.

shmerl6y ago

Is it using macros to achieve that?

self_awareness6y ago

For a different take for "better C", try Zig language, it looks pretty cool.

https://ziglang.org/

loeg6y ago

This gets reposted every couple years and it's still bad for all of the same reasons.

It's not higher level than C in the sense that you get any additional safety guarantees or real beneficial abstractions. If you are fine without the safety but want abstractions, use C++. If you want safety and abstractions, use Rust or Go or Zig. If you really want a transpile-to-C language, you've got Nim.

Finally, it's not good at being C; everything it does is poor practice and should be quickly recognized as such by experienced C developers, IMO. It's got no developer community and no real-world production consumers.

h0bzii6y ago· 2 in thread

What type of sorcery is this?

keyle6y ago

The good one, dark, very dark.

pmiller26y ago

Yeah, the foreach macro is particularly interesting to me in that respect. Quite a neat bit of sorcery.

FpUser6y ago

This is great. It made me smile.

tuczi6y ago· 4 in thread

Why not just C++?

macintux6y ago

Speaking for myself, I’ve never found C++’s complexity appealing, and it only seems to be getting worse over the last 20 years.

As the creator says, it’s not for production use. If I’m doing a side project, I’d give this a serious look.

zozbot2346y ago

> Speaking for myself, I’ve never found C++’s complexity appealing, and it only seems to be getting worse over the last 20 years.

True, but that's why we've got Rust these days. (Rust is actually more optimized than C, e.g. it will automatically reshuffle your structs to get rid of excess padding, and reference accesses will automatically take advantage of compiler-checked 'restrict' constraints, thus equalizing performance with e.g. FORTRAN.)
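The padding point can be seen from the C side, where field order is fixed by the declaration and any reordering has to be done by hand. A small sketch with hypothetical struct names follows; exact sizes depend on the ABI, though 12 and 8 bytes are typical on common platforms.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Fields in a padding-heavy order: the compiler pads after `a` and `c`
   so that `b` and the struct as a whole stay 4-byte aligned. */
struct declared_order {
    uint8_t  a;
    uint32_t b;
    uint8_t  c;
};

/* Same fields, largest first: the two small fields share one tail. */
struct sorted_order {
    uint32_t b;
    uint8_t  a;
    uint8_t  c;
};

/* Bytes saved per object by reordering the fields manually. */
static size_t padding_saved(void)
{
    assert(sizeof(struct sorted_order) <= sizeof(struct declared_order));
    return sizeof(struct declared_order) - sizeof(struct sorted_order);
}
```

C compilers must keep fields in declaration order, so this kind of saving only happens when the programmer rearranges the struct.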

loeg6y ago

Sure, but Cello's complexity turns me off for the same reasons as C++'s complexity. The question isn't "why not use C++?" in isolation, but instead, "why would you use Cello over C++?"

pmiller26y ago

I agree wholeheartedly with this. You can write high level code in C++, but the cost is a terribly complex language and standard library.

In contrast, the C language is fairly simple, except for a few twisty passages (pointer declaration syntax, anyone?). The standard library does leave something to be desired, but that's not that big of a deal given all the third party libraries out there.

It would be interesting to compare the same program written in straight C vs C++ vs Cello, both for developer experience issues (clarity, simplicity, etc.) and performance. I'll have to have a look at http://libcello.org/learn/benchmarks but this does really seem like something I'd like to use on a personal project someday.

einpoklum6y ago

This isn't a library, it's a sort-of-a-modification of C, it seems.

Well, for a non-C language with high-level abstractions that lets me use C code relatively seamlessly - I'm content with C++. Many complain about its complexity, but you can actually avoid a lot of that complexity in _your_ code using facilities with complex implementation but relatively easy use.

reanimus6y ago

Looks interesting, but I can't help but notice they're distributing their source tarball via that site, and it doesn't have HTTPS. I don't understand why projects don't have SSL certs these days, especially considering Let's Encrypt has automated it all and made it free.
