Lambda Operator in LaTeX

(commutative.xyz)

35 points · miguelmurca · 3y ago · 22 comments

zauguin · 3y ago

You don't have to define `\First` and `\Second` since these are predefined as `\@firstoftwo` and `\@secondoftwo`. Also the Unicode bytes are already active so setting their catcode is useless. Also redefining the first octet breaks LaTeX's UTF-8 handling and the else case forms an infinite loop. Changing the catcodes of `(` and `)` means that this command doesn't work in the arguments of other commands and it breaks other uses of `(` and `)` in the argument.

Instead you could do this as

  \documentclass{article}
  \usepackage{xparse}

  \NewDocumentCommand \LambdaCalc {u{.} r()} {%
    [arg:(#1) body:(#2)]
  }
  \DeclareUnicodeCharacter {03BB} {\LambdaCalc}

  \begin{document}

  λx.(2x)

  \end{document}

miguelmurca (OP) · 3y ago

Thanks, and ouch. I have some follow-up questions/comments, if you have the time:

> these are predefined as `\@firstoftwo` and `\@secondoftwo`

I do wish LaTeX kernel commands (which I'm assuming these are) were more widely documented. As it stands, it's pretty hard to keep track of what already exists. Is there a nice reference for those?

> Also the Unicode bytes are already active so setting their catcode is useless.

This is true for LaTeX and not TeX, correct? Originally, I'd `\expandafter\let\expandafter\@firstoct\@firstoftwoλ`, but I decided not to assume that that character was already active.

> Also redefining the first octet breaks LaTeX's UTF-8 handling...

How so? (If the else case wasn't broken)

>...and the else case forms an infinite loop.

If `\Firstλ` was not an active character, would this still be true? Since I store `\Firstλ` in `\lambda@first@oct` before it's declared an active character.

> and it breaks other uses of `(` and `)` in the argument.

This is not a concern for the DSL, but...

> Changing the catcodes of `(` and `)` means that this command doesn't work in the arguments of other commands

...this is. Thanks.

> Instead you could do this as

Damn :)

Thanks for the nice feedback. I suppose I should read up on xparse. In any case, I feel it's not pointless to try to achieve the same results with primitives, to get some idea of what's breaking when a given program doesn't compile (usually at that point the primitives surface).

zauguin · 3y ago

> I do wish LaTeX kernel commands (which I'm assuming these are) were more widely documented. As it stands, it's pretty hard to keep track of what already exists. Is there a nice reference for those?

Not really, the traditional commands are rather messy. Of course you can read source2e, but that's not really documentation. For new stuff it often makes sense to write the more programmy stuff in expl3 which is much better documented in interface3. (It contains these commands as `\use_i:nn` and `\use_ii:nn`)
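As a rough illustration of the two spellings mentioned above, here is a minimal sketch. It assumes a LaTeX kernel from 2020-10 or later, so that `\ExplSyntaxOn` and `\NewDocumentCommand` are available without loading packages. The four `\Pick...` names are hypothetical wrappers invented for this demo, not kernel commands.

```latex
\documentclass{article}

\makeatletter
% \@firstoftwo{A}{B} expands to A; \@secondoftwo{A}{B} expands to B.
% \PickFirst and \PickSecond are hypothetical names for this demo.
\newcommand*\PickFirst{\@firstoftwo}
\newcommand*\PickSecond{\@secondoftwo}
\makeatother

\ExplSyntaxOn
% The expl3 spellings of the same two selectors.
\NewDocumentCommand \PickFirstExpl  { m m } { \use_i:nn  {#1} {#2} }
\NewDocumentCommand \PickSecondExpl { m m } { \use_ii:nn {#1} {#2} }
\ExplSyntaxOff

\begin{document}
\PickFirst{one}{two} \PickSecond{one}{two}

\PickFirstExpl{one}{two} \PickSecondExpl{one}{two}
\end{document}
```

Both paragraphs should typeset the same text, since each pair of commands selects the same argument.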

> > Also the Unicode bytes are already active so setting their catcode is useless.
>
> This is true for LaTeX and not TeX, correct?

Right, this is LaTeX specific.

> > Also redefining the first octet breaks LaTeX's UTF-8 handling...
>
> How so? (If the else case wasn't broken)

LaTeX's definition of the first byte handles any valid UTF-8 continuation bytes, either by using the corresponding definitions or by printing a proper error. A replacement definition, even one that didn't trigger the active character again, would just print the two bytes: that gives no useful error message and probably prints two random characters from the font, and it completely ignores any definition made through LaTeX's mechanism for other code points starting with this byte.
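To make the shared first byte concrete, here is a small sketch, assuming pdfLaTeX with a 2018 or later kernel (where UTF-8 input is the default). λ (U+03BB) is encoded as the bytes CE BB and μ (U+03BC) as CE BC, so both go through the same first-octet handler. The bracketed replacement texts are placeholders chosen for this example.

```latex
\documentclass{article}

% Both code points start with the byte 0xCE. Declaring them through
% LaTeX's own mechanism leaves the first-octet handler intact, so the
% two definitions coexist.
\DeclareUnicodeCharacter{03BB}{[lambda]}
\DeclareUnicodeCharacter{03BC}{[mu]}

\begin{document}
λ and μ
\end{document}
```

A hand-written redefinition of the 0xCE byte that only knows about λ would no longer dispatch μ to its declared meaning.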

> > ...and the else case forms an infinite loop.
>
> If `\Firstλ` was not an active character, would this still be true? Since I store `\Firstλ` in `\lambda@first@oct` before it's declared an active character.

You are correct: if the first byte weren't already an active character (e.g. in plain TeX), then it wouldn't loop. It wouldn't expand to anything particularly useful, but that wouldn't be any worse than without the definition, so it would be "correct".

> I suppose I should read up on xparse.

Normally `xparse` is preloaded and no longer a separate package, so its documentation has been moved into usrguide3. In this case you still need the package, though, since the `u` argument type has not been added to the kernel (and therefore not to usrguide3 either), because delimited arguments are not recommended for LaTeX commands. It's still documented in the old `xparse` manual. Just in case you're wondering about the split.
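For comparison, a package-free variant might look like the sketch below. It assumes a 2020-10 or later kernel and replaces the until-a-delimiter argument (`u{.}` in the code above) with a classic delimited `\def`, while keeping the kernel's `r()` type for the body. `\LambdaBody` is a hypothetical helper name introduced here; this is one possible workaround, not the recommended interface.

```latex
\documentclass{article}

% r() is a kernel argument type, so the body needs no extra package.
% \LambdaBody is a hypothetical helper name for this sketch.
\NewDocumentCommand \LambdaBody { m r() } {[arg:(#1) body:(#2)]}

% A plain delimited macro grabs everything up to the first dot
% and hands it on as a normal braced argument.
\def\LambdaCalc#1.{\LambdaBody{#1}}

\DeclareUnicodeCharacter{03BB}{\LambdaCalc}

\begin{document}
λx.(2x)
\end{document}
```

The trade-off is that the delimited `\def` has none of the error checking that the document-command interface gives the other argument.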

MayeulC · 3y ago

A bit disappointing. It's mostly about parsing, not usage.

Half of the post is about handling UTF-8, which AFAIK both LuaTeX and XeTeX (you really should use either) do natively.

miguelmurca (OP) · 3y ago

Sorry you feel like that. I tried to counteract this by including the TL;DR at the top. OTOH this did come from an effort to implement the lambda calculus I discuss at the beginning, but I thought it was reasonable enough to make a small post.

LuaTeX and XeTeX usually aren't an option where LaTeX comes up, i.e., in academic submissions. This is a common discussion, see the comments under my previous post [0].

[0]: https://news.ycombinator.com/item?id=31490136

xeonmc · 3y ago

Combine this with that paper on a C compiler written entirely in lambda terms[0], and now LaTeX is a compilation platform as well!

[0] https://news.ycombinator.com/item?id=33115719

smitty1e · 3y ago

The obvious next project is to port emacs.

c7b · 3y ago

Sums up the LaTeX experience quite well imho:

> LaTeX is great for typesetting math.

Q: Ok, great! So how do I typeset this bit of common math?

A: a 20-line barrage of import statements, makeatletter's and definitions that you copy-paste into your preamble and cross your fingers that it won't conflict with the half-dozen other barrages that you copied there to do other bits of common math, often hidden between other Google results with wildly different answers.

svat · 3y ago

I'll agree that your comment, though not the posted article, does sum up the way many users experience LaTeX (without taking a few minutes to understand what's going on), and it's interesting to think about what factors of TeX/LaTeX and its documentation/ecosystem lead to this outcome.

About the posted article: if all one wanted to do was "typeset this bit of common math", one can just type "\lambda x.(2x)" in math mode. Or, if not constrained to keep it old school i.e. pdfTeX, use XeTeX/LuaTeX with \usepackage{unicode-math}, to type "λ x.(2x)" directly.
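For reference, the plain typesetting route described above is about as short as a document gets. This is a minimal sketch for pdfLaTeX; the comment at the end notes the change assumed for XeLaTeX or LuaLaTeX.

```latex
\documentclass{article}

\begin{document}
% Classic pdfLaTeX: spell the Greek letter as a math-mode command.
The doubling function is $\lambda x.(2x)$.

% With XeLaTeX or LuaLaTeX, add \usepackage{unicode-math} to the
% preamble and the formula can be typed directly as $λ x.(2x)$.
\end{document}
```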

The posted article is actually about doing some parsing using TeX, namely the author wants to type "λ x.(2x)" into their .tex file and have it be parsed into, say, [arg:(x) body:(2x)] to be used later for whatever they're building. This is not related to typesetting at all, so why do they want to do such a thing in TeX, instead of doing it outside and using TeX just for typesetting? The motivation seems to be, as their footnote 2 indicates, that some people just enjoy being perverse. That's fine!

Even there, if you compare the author's approach with that in the comment here https://news.ycombinator.com/item?id=33296527 (by someone who knows what they're doing; cf. https://www.latex-project.org/about/team/), you'll see how the "right" way is less forbidding-looking, and also less breakage-prone. What's going on is that the author has just learned something new (how Unicode is handled in pdfTeX even though it only works with 8-bit bytes), become excited at the possibilities, and hacked their own solution using the primitives, without bothering to integrate with the broader ecosystem of other packages and conventions — which is also fine; TeX will let you do that and not get in the way.

The real interesting question raised by your comment IMO is not at all about the posted article but about experiences such as those in your comment: I can easily imagine many people doing what you did (not understanding the context, and possibly even copying ad-hoc code like this into one's document and crossing one's fingers): here we start to get into the actual problems with the LaTeX ecosystem and the mismatch between users' mental models and that of the (too many!) pieces of software involved, but I've exceeded the time limit I set to comment here so I'll stop :)

miguelmurca (OP) · 3y ago

It's true, I do enjoy being perverse. However, I think it's a bit unfair to say I have "picked up a few pieces of knowledge, become excited, and tried to hack their way through by themselves, without understanding the broader ecosystem of other packages and conventions". I'm aware of xparse, and other tools in the ecosystem (etoolbox, ifthenelse). This doesn't mean that I shouldn't try to get a given result with the provided primitives, rather than use an existing codebase. Very analogous to writing your own X programming lib rather than using an existing one: good for experimenting and learning, but maybe not something you should go for in production. (Of course, I'm certainly not at the level of a LaTeX team member.)

bradrn · 3y ago

I tend to disagree with this. In my experience most common constructs are supported pretty well, especially for maths. The complex stuff comes when you want to do slightly less common stuff — like, for instance, turning the character ‘λ’ into a command for custom typesetting of lambda expressions (as in the article).

ngcc_hk · 3y ago

Is it possible to do something like a Jupyter notebook, so you can demonstrate some code (Common Lisp) and then show its result? Notebooks already support some TeX.

Too much to ask for I guess. Continue waiting.

titzer · 3y ago

Having spent most of the day stepping on LaTeX rakes, I am really at a loss how anyone could think that making it more programmable is a good idea.

There are very few bits of software that are more arcane and broken by default than this absolute crapstraction of a platform.

bradrn · 3y ago

> There are very few bits of software that are more arcane and broken by default than this absolute crapstraction of a platform.

And that is exactly why a more programmable platform would be good! These issues arise in the first place largely because TeX is not easily programmable, so people have to find arcane workarounds to do anything complex in it.

slavapestov · 3y ago

The weirdness is totally expected for something originally designed in 1983. What is unexpected is that there is still no good replacement for writing math…

jcelerier · 3y ago

wish granted, advanced LaTeX macro programming will now have to be done in ksh

dhosek · 3y ago

I’ve occasionally contemplated the idea of creating a TeXasm target that one could compile programs in higher-level languages to. Then, one could bootstrap a C compiler that compiles to TeX and runs in TeX.

b215826 · 3y ago

"Your scientists were so preoccupied with whether or not they could, they didn't stop to think if they should."
