Towards LaTeX in the Browser

(hackernoon.com)

110 points · jxxcarlson · 8y ago · 67 comments

jdleesmiller · 8y ago

(I'm a co-founder at Overleaf.com, which does collaborative 'LaTeX in the browser' in a different sense.)

I like the idea of a 'sane' subset of LaTeX that is easy to publish to the web. There are tools like LaTeXML and TeX4ht that try to convert general LaTeX documents to (X)HTML, but it's a very hard problem.

Some difficulties arise from the fact that TeX is just very hard to parse in general. Even the first stage of parsing TeX is Turing complete [1]. This makes it hard to write tooling e.g. for linting (though tools exist, e.g. chktex) or creating a WYSIWYG editor backed by LaTeX [2]. (edit: or creating a good LaTeX auto-complete [4])
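A minimal sketch of why even tokenizing is hard (an illustrative example, not taken from the linked answer): the document itself can change how later characters are read, so a parser cannot classify a character without executing everything before it. Here `*` is an arbitrary character chosen for the demonstration.

```latex
\documentclass{article}
\begin{document}
% Here * is an ordinary character (category code 12).
a*b

% From this point on the tokenizer reads * as an "active" character
% (category code 13), which behaves like a macro.
\catcode`\*=\active
\def*{\textbf{ star }}
a*b
\end{document}
```

The two `a*b` lines are identical in the source but typeset differently, which is the kind of thing a linter or auto-completer has to cope with.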

Others arise from TeX's extensibility --- there are many thousands of packages that define their own commands and environments for different types of documents and different disciplines. This extensibility is on the one hand one of the main reasons that TeX and LaTeX are still actively used some 40 years after TeX's initial release, but on the other hand a major challenge for conversion to HTML. The LaTeXML project has many custom bindings [3] for these packages, but it's far from complete.

I guess the main question is whether we can find the right subset, and this project looks like a great start.

[1] https://tex.stackexchange.com/questions/4201/is-there-a-bnf-...

[2] https://www.overleaf.com/blog/81 --- my first attempt at rich text on Overleaf, many years ago

[3] http://dlmf.nist.gov/LaTeXML/manual/customization/customizat...

[4] https://www.overleaf.com/blog/523-a-data-driven-approach-to-...

moultano · 8y ago

I really like the idea of adding math support to Unicode via combining characters. It's more complicated than anything Unicode currently deals with, but not that much more complicated, and the idea of being able to put math into anything that currently accepts strings is just so enticing. We should treat math as its own language, and render it as we would any other human language with an unusual way of laying out characters.

Bromskloss · 8y ago

It's an interesting idea. At what point, though, do we draw the line between what a character set (like Unicode) should handle, and what should be handled by a higher-level layer? I'm thinking that things like boldness, italicisation, and superscript aren't really the job of a character set.

cryptonector · 8y ago

Hmmm, certainly Unicode ought to be able to represent mathematics as a script like any other. However, the complexity involved is non-trivial. To make things easier, whatever Unicode might do for math should have a mapping to and from TeX or MathAJAX. In any case, Unicode is rather complex as it is; I'm not sure I look forward to this extra level of complexity :(

jxxcarlson OP · 8y ago

Hi, I am the developer of MiniLatex. Overleaf looks like a fantastic tool.

bhrgunatha · 8y ago

> I like the idea of a 'sane' subset of LaTeX that is easy to publish to the web.

That's probably the only approach that really makes sense:

> During the past decade I was surprised to learn that the writing of programs for TeX and Metafont proved to be much more difficult than all the other things I had done (like proving theorems or writing books). The creation of good software demands a significantly higher standard of accuracy than those other things do, and it requires a longer attention span than other intellectual tasks.

kovariance · 8y ago

I have found KaTeX to be the best currently available solution. In particular, it can be rendered without client-side JavaScript.

lilgreenland · 8y ago

Agreed. KaTeX is much much faster than MathJax.

https://www.intmath.com/cg5/katex-mathjax-comparison.php

ajkjk · 8y ago

Yeah, I'm really surprised it's not mentioned here.

It also causes much less page-reflowing than MathJax, and is much more lightweight.

jxxcarlson OP · 8y ago

I'm looking into KaTeX to see if it will work with my setup (Elm) & give better results. Thanks!

marvy · 8y ago

Just a bit of historical correction. The article/post says:

"Ten years later, in 1978, his work bore fruit"

This gets things pretty wrong. He got the idea in 1977, and his estimate of "this will take 6 months" was pretty close, in that the initial version was finished sometime in 1978. It then took about another ten years to be "actually done". (Rewrite, add features, fix bugs, create Metafont, create WEB, etc...)

jxxcarlson OP · 8y ago

Thanks very much for the correction!

marvy · 8y ago

Much better :)

DavidSJ · 8y ago

Completely beside the point, but that integral evaluates to sqrt(2pi), not sqrt(pi).

curiousgal · 8y ago

I concur. It's easy to verify using the probability density function (pdf) of the standard normal distribution.
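For reference, a short worked check, assuming the integral in question is the Gaussian integral of $e^{-x^2/2}$ over the real line (the article's exact formula is not reproduced in this thread):

```latex
\begin{align*}
\int_{-\infty}^{\infty} \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}\, dx &= 1
  && \text{(a pdf integrates to 1)} \\
\Longrightarrow\quad \int_{-\infty}^{\infty} e^{-x^2/2}\, dx &= \sqrt{2\pi} \\
\int_{-\infty}^{\infty} e^{-x^2}\, dx &= \sqrt{\pi}
  && \text{(the unscaled Gaussian integral)}
\end{align*}
```

The two results are linked by the substitution $x = \sqrt{2}\,u$, which contributes the extra factor of $\sqrt{2}$.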

jxxcarlson OP · 8y ago

Oops! My bad. Thanks!!

applecrazy · 8y ago

I wonder if somebody has taken TeX and compiled it to wasm for the browser using Emscripten. That would be easier than a port, but heavy on load times.

Edit: it exists! https://github.com/manuels/texlive.js/ is a limited port of LaTeX to JS, rendered to PDF

TheRealPomax · 8y ago

Classic TeX would be damn near useless in the age of Unicode, so you're looking at something like XeLaTeX or LuaTeX. The problem is that it's really easy to implement a really basic form of TeX, but unless you already planned for the really hard cases, maintaining your implementation is going to become intractable. TeX's real text typesetting is almost always woefully ignored even though _everything_ has to typeset beautifully, not just math. And in modern versions of TeX, that has to happen without insane syntax: we can already "just write" a Unicode character rather than needing all kinds of dedicated macros just for diacritics, or for something as simple as mixing two writing scripts that necessitate two entirely different fonts.

badsectoracula · 8y ago

> Classic TeX would be damn near useless in the age of Unicode

Unless you want to write in English, which I am going to bet still has a somewhat large audience :-P

yodon · 8y ago

Compiling to wasm certainly seems like the right place to start, though it might be non-trivial to get the output to render into a canvas.

svat · 8y ago

I wonder whether this is the right approach. TeX itself is one of the most heavily documented programs in existence. Not only are its workings documented in detail in The TeXbook (and a host of other books by other authors, such as Eijkhout's TeX by Topic) but even the program itself has been written in a “literate programming” style, with pretty formatted source code (with profuse comments) available in print (Vol B of Computers and Typesetting) and as a PDF (http://texdoc.net/texmf-dist/doc/generic/knuth/tex/tex.pdf), there's a detailed history/retrospective and log of every change that went into the program (see Chapters 10 and 11 of the book Literate Programming, though the log without explanation is also available online http://texdoc.net/texmf-dist/doc/generic/knuth/errata/errorl...), and there are even 12 hours of video of Knuth talking about the internals of the program (https://www.youtube.com/watch?v=bbqY1mTwrj8&index=12&list=PL...).

So when the article says:

> To reproduce all of LaTeX in the browser is too much to ask

I wonder why? The file tex.web is less than 25000 lines long, much of it comments, so I'd estimate that TeX itself is only about 20000 sloc (in fact tangle on tex.web generates a Pascal file tex.p which is only 6115 lines long). This is not a lot IMO, and it would be a lot better to actually re-implement this, with additional support for things like getting the parse tree etc.

patte · 8y ago

I was wondering recently if/how it would be possible to piggyback on LaTeX's gorgeous typesetting (the placement of the letters) to bring justified text to websites. I want to do a PoC that absolutely positions all letters of a basic document, as placed by TeX for my screen size.

Did anyone ever see such an approach?

gravypod · 8y ago

Are there any other solutions to document typesetting with latex-like features? TeX is very obtuse for someone who hasn't been using it for a long time.

flother · 8y ago

A common solution is to use LaTeX, but to use it indirectly: write in Markdown and convert to PDF using Pandoc [1], which uses LaTeX in the background. This is (part of) the process used in RMarkdown [2], for example. That way, you get all the benefits of TeX and LaTeX but without most of the pain.

[1]: https://pandoc.org/index.html

[2]: http://rmarkdown.rstudio.com/

curiousgal · 8y ago

I just use Atom with the markdown-preview-plus package for live preview.

gravypod · 8y ago

I've seen some people do org-mode -> TeX -> research paper. It's very impressive. I just wish there was something like that with a more GUI/polished feel.

globuous · 8y ago

I've been using org-mode and exporting it to HTML, then making an @media print style sheet and exporting the HTML plus print CSS to PDF through PrinceXML.

It's been amazing. LaTeX equations are exported as PNGs (for PDF export, because I don't think Prince does MathJax, but org-mode can export to MathJax). I have my bibliography with bibtex2html. And templating my PDFs becomes so much easier than with LaTeX. It's just HTML and CSS!! My figures are numbered, captioned, and referenced throughout the text; same for tables. My table of contents is generated. Code is highlighted. I have access to ditaa for ASCII flow charts and a bunch of other stuff (for making UML in ASCII with PNG export for the PDF, for example). It also handles Excel-like tables with formulae (Lisp formulae are possible!! So cool!!) in text mode. And of course, you can plot your table through gnuplot inside your org file: you tell it which columns and rows, the type of graph, etc. :)

It's also easy to include other org files, or to go down to raw HTML for the export (rather than org-mode -> HTML) if need be (for a picture that spans over 2 pages, for instance).

Give it a try, you might like it ;) In the end it's just an org mode export to HTML to PDF with the print CSS media query. But it works remarkably well and you have all the org mode features.

lorenzhs · 8y ago

Any particular reason why you don't use org-mode's latex export (org-latex-export-to-pdf / C-c C-e l p) directly? It will render math nicely, not as embedded images, etc.

gsnedders · 8y ago

> LaTeX equations are exported as PNGs (for PDF export, because I don't think Prince does MathJax, but org-mode can export to MathJax).

Prince does MathML, at least, if you want to avoid images.

pdm55 · 8y ago

GeoGebra (https://www.geogebra.org/) might suit your purpose. See it in action: https://www.youtube.com/watch?v=GjPakjpEAXs. You can use it to produce docs such as https://www.geogebra.org/m/M4nBYbbG#material/c3wwdgD5

applecrazy · 8y ago

For lightweight stuff, there's vanilla Markdown, but you have no control over formatting. For more serious work using Markdown, you can try out Ulysses[0] or Scribus[1].

And, if you feel like spending an obscene amount of money, on the order of $10k, there's Arbortext APP[2]. (I don't know why this even exists?)

[0]: https://ulyssesapp.com/

[1]: https://www.scribus.net/

[2]: https://en.wikipedia.org/wiki/Arbortext_Advanced_Print_Publi...

colejohnson66 · 8y ago

> but you have no control over formatting

Can’t CSS make up for that to some extent?

funkaster · 8y ago

There was Lout[1], but it seems to be abandoned. I really liked it, especially the simplified syntax (compared to LaTeX). It was also Unicode-safe by design.

[1]: https://en.m.wikipedia.org/wiki/Lout_(software)

beefhash · 8y ago

UNIX has been doing that for the past 40 years until AT&T ripped troff out of standard UNIX installations.

Look into groff and possibly heirloom doctools. It's fairly difficult to learn and the default macro packages on most installations may be somewhat difficult to come to terms with/adjust for your own needs. You're definitely expected to learn basic troff macros to hack up a macro package if needed. See also: http://www.schaffter.ca/mom/ and https://utroff.org/

ufo · 8y ago

You might want to check out LyX. It is a GUI editor that generates beautiful TeX documents, but it is designed to be a user-friendly document processor instead of just a TeX GUI.

http://www.lyx.org/

martyalain · 8y ago

What do you think of this project {lambda way} as an alternative to LaTeX in a browser: http://lambdaway.free.fr

For instance, from this wiki page http://lambdaway.free.fr/workshop/?view=oxford I could directly generate a PDF paper, http://lambdaway.free.fr/workshop/data/lambdatalk_20170728.p..., and slides, http://lambdaway.free.fr/workshop/?view=oxford_slides

Some other pages in this workshop: http://lambdaway.free.fr/workshop/?view=factory http://lambdaway.free.fr/workshop/?view=NIL http://lambdaway.free.fr/workshop/?view=teaching http://lambdaway.free.fr/workshop/?view=lambdacode

Your comments are welcome.

Alain Marty

etaioinshrdlu · 8y ago

I used https://github.com/phfaist/pylatexenc to convert LaTeX to Unicode text, with math symbols, superscripts, etc.

It's of course never going to be as good looking as MathJax or something like that -- but it may be more appropriate to be able to treat it as plain Unicode text in some cases.

For instance, it works in title fields across the web and search engines will understand it better than anything else.
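To make the idea concrete, here is a minimal standard-library sketch of LaTeX-to-Unicode conversion. It is not pylatexenc and does not use its API; the function name `latex_to_unicode` and the small lookup tables are hypothetical, chosen only for illustration, and cover a tiny subset of commands.

```python
import re

# Hypothetical, deliberately tiny lookup table; a real converter such as
# pylatexenc covers far more commands.
SYMBOLS = {
    r"\alpha": "α",
    r"\beta": "β",
    r"\pi": "π",
    r"\infty": "∞",
    r"\leq": "≤",
    r"\int": "∫",
}

# Maps digits, +, - and n to their Unicode superscript forms.
SUPERSCRIPTS = str.maketrans("0123456789+-n", "⁰¹²³⁴⁵⁶⁷⁸⁹⁺⁻ⁿ")


def latex_to_unicode(src: str) -> str:
    """Convert a small subset of LaTeX math to plain Unicode text."""

    def superscript(match: re.Match) -> str:
        # Group 1 is the braced form x^{10}; group 2 is the bare form x^2.
        body = match.group(1) or match.group(2)
        return body.translate(SUPERSCRIPTS)

    # Handle exponents made of digits, signs, or n.
    out = re.sub(r"\^(?:\{([0-9+\-n]+)\}|([0-9n]))", superscript, src)
    # Replace known commands; unknown commands are left untouched.
    out = re.sub(
        r"\\[A-Za-z]+",
        lambda m: SYMBOLS.get(m.group(0), m.group(0)),
        out,
    )
    # Drop the dollar-sign math fences.
    return out.replace("$", "")


print(latex_to_unicode(r"$\alpha^2 + \beta^{10} \leq \pi$"))
```

The result is an ordinary string, so it can go anywhere plain text is accepted, at the cost of handling only what Unicode has characters for.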

emeryberger · 8y ago

There is not really a need to modify LaTeX at all to make it run in the browser. It already exists. Without modifying a single line of code, we have implemented a full browser-based port of LaTeX as part of our Browsix project, which makes it possible to run full, unmodified Unix applications inside the browser. See http://browsertex.org and http://browsix.org (and http://bpowers.net and https://jvilk.com/ and http://plasma.cs.umass.edu).

djuerges · 8y ago

I actually did 'LaTeX in the browser' as a master's thesis in 2014, but never went on to develop it further, either as an open-source project or with commercial intent. At the time, though, I thought it was at least on par with the few solutions that were out there, and it solved the task of instant updates and real-time collaborative work on a document pretty gracefully.

Some neat improvements would have been versioning and so on, but you know, I never made it that far after picking up a job. Kind of a shame...

https://github.com/djuerges/cotex

jessriedel · 8y ago

I read the post but I still don't understand: is it possible to define new commands using \def or \newcommand? At first I thought these were what the author meant by "macro", but later he says:

> We are exploring ways for users to define non-default environment behaviors in the browser. The same goes for macros used outside the dollar and double-dollar fences.

But I can't use \def or \newcommand to define things that appear inside dollar signs either.

jxxcarlson OP · 8y ago

Here is an example:

$$ \newcommand{\bra}{\left<} \newcommand{\ket}{\right>} $$

$$ \bra a | b \ket $$

If you go to https://jxxcarlson.github.io/app/minilatex/src/index.html, press the "Clear" button, then paste the above text, then press "Render", you should see the macros \bra and \ket properly rendered.

jessriedel · 8y ago

Oh I see, thanks. For what it's worth, I would definitely include this example in the demo; it's basically the first thing I wanted to use. Given your pipeline, it makes sense that the \newcommand definitions themselves have to appear inside dollar signs (not just where they are used), but for people with a TeX background it's pretty unintuitive.

Also, you should definitely use \langle and \rangle in place of < and > for bra-ket notation :)
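A sketch of the earlier example with that substitution, assuming MiniLatex handles these definitions the same way as the `<` and `>` version (not verified here):

```latex
$$
\newcommand{\bra}{\left\langle}
\newcommand{\ket}{\right\rangle}
$$

$$
\bra a | b \ket
$$
```

Because `\left` and `\right` only meet after both macros are expanded, `\bra` and `\ket` must always be used as a pair within the same formula.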

angarg12 · 8y ago

Just for fun here is a little web game I made to look like a maths paper using MathJax.

https://angarg12.github.io/TrueExponential/

jimhefferon · 8y ago

PreTeXt from http://mathbook.pugetsound.edu/ has gotten some mindshare.

abritinthebay · 8y ago

I love the output of LaTeX but the language itself (and its dependencies and packages) is an absolute horror show.

I’ve never understood how people can bear it: writing it is painful, its tooling is abysmal, and it rarely seems to work except on the machine of the person who wrote it.

We’ve got to be able to do better.

mkl · 8y ago

> its tooling is abysmal

It seems like you haven't tried many editors. Have you tried TeXStudio (https://www.texstudio.org/)?

> it rarely seems to work except on the machine of the person who wrote it.

I and many others edit the same documents at the university where I work, without significant issues. Distributions like TeX Live (https://www.tug.org/texlive/) provide a consistent, all-inclusive, cross-platform solution.

abritinthebay · 8y ago

TeXStudio would be a perfect example of its abysmal tooling. It’s better than the CLI tools but it’s an awful editor and highlights how incompatible with a good writing experience LaTeX is.

Yes, many people produce good work in it - its output is fantastic after all - but an editor that would have been a substandard user experience in the 90s is the best LaTeX has in tooling.

That’s exactly what I mean!

notthemessiah · 8y ago

A task easier said than done.

Also, it should be considered that it's impossible to make breaking changes in the LaTeX language; otherwise you lose the ability to compile a paper from 30 years ago.

But if you're trying to do something simple, I would say go for pandoc and use whatever format you're comfortable with, then convert it to TeX: https://pandoc.org/

abritinthebay · 8y ago

At that point... why use TeX at all?

