Relational is more than SQL (opens in new tab)

iamsam1232y ago

Does it SELECT * by default if I never define a SELECT below my FROM? ... Continuing to encourage folks by allowing them to SELECT * easier is would not be fun for me... I could be wrong?

Agreed, just parsing out the formatting so its "fewer lines" than traditional SQL soured me.

The expressions example is ridiculous, in Redshift I can do this all day?? SELECT 1 + 2 AS num1 , num1 * 2 AS num2 -- Literally no difference

Just learn SQL...

https://www.malloydata.dev/

Izkata2y ago

> Putting the FROM first isn't sufficiently compelling on its own.

Personally I see that as not even neutral, it's a downside. Optimizing for autocomplete is an antipattern, code is read far more often that it's written and the SELECT clause is the interface to the following code. It should be easy to find when skimming, not buried in the query.

The SELECT clause is also akin to an assignment and it's extremely rare I see anyone advocating flipping the order of those to match what they say they want in SQL.

Edit: Since I'm sure someone is going to jump on it, yes, I'm conflicted about the WITH clause: It's extremely useful and I like what it does, so I do use it, but I don't like where it's positioned. I've been toying with indentation to work around it so SELECT is still just as visible as otherwise.

anon848736282y ago

You might find Malloy interesting as it makes a greater departure from SQL syntax. Queries are first class objects which can be nested within each other in order to do trellising. It still compiles to SQL because that is the only language accepted by DBMSs today; however it will automatically write symmetric aggregate calculations and do those nestings that are hard for a human to write.

[PRQL dev here]

I strongly think we should have the best examples of SQL to compare against. I've ironically made this complaint for other libraries, so I'm alarmed that folks think we might have done the same.

We would take PRs for any improvements to the SQL that make it a better comparison.

remram2y ago

It's just syntax, it compiles to SQL and runs on today's DBMS. It has no difference in speed or functionality.

* https://github.com/totalhack/zillion

snthpy2y ago

For those interested who want to learn more, we have a number of presentations coming up at conferences on three continents:

- [QCon SF, October 2nd, San Francisco, USA: ](https://qconsf.com/presentation/oct2023/prql-simple-powerful...)

- [PyconZA, October 5th, Durban, South Africa: ](https://za.pycon.org/)

- [Community over Code (ApacheCon), October 9th, Halifax, Canada: ](https://communityovercode.org/schedule-list/#FT005)

- [data2day, October 12th, Karlsruhe, Germany: ](https://www.data2day.de/veranstaltung-21353-0-prql-a-modern-...)

lolinder2y ago

SQL will never die for the same reason that JavaScript will never die: because it's built in to all major database engines.

In both cases, any other language will be starting as a second class citizen that has to compile to SQL/JS. During this phase of a new language's lifetime, it is either a surface-level syntactic change (a la Coffeescript) that provides no objective improvement, or it has to compile its simple semantic structures into opaque SQL/JS structures that will be off the beaten path and therefore not highly optimized by the runtime. Neither will reach sufficient adoption to become a first-class citizen in a major existing platform.

TypeScript succeeded where others failed because it provided much-needed static analysis while keeping the changes minimal enough that it's completely obvious what the runtime code will look like, so there are no unexpected performance gotchas. SQL, on the other hand, doesn't really need a TypeScript because SQL is highly statically analyzable by nature.

It's not that I don't believe we could do with an improvement on SQL, but I really don't see a realistic path forward for a replacement.

totalhack2y ago

IMO a semantic layer is a nice UX/DX improvement over plain SQL in a business/analytics setting. I use a semantic layer* for >95% of use cases and fall back to SQL when needed. This balance will be different for each business of course.

paulddraper2y ago

While I mostly agree, there is a bit of Stockholms syndrome.

A lot of people don't know what they even could be missing.

For example, there is no succinct way of writing an antijoin in SQL .

The MERGE command has only been implemented by some engines due to (IIRC) concurrency concerns/ambiguities.

ANSI SQL JSON operations have improved but are still clunky.

Boolean NULL and IN is a clusterf of footguns.

Etc.

[PRQL dev here]

I agree with the sentiments, even if not the conclusion. SQL is omnipresent and is "fine" in a lot of cases.

TypeScript is indeed a great example of the case; Kotlin too. I'd also add that databases are already adding PRQL support — ClickHouse has native support, there's a DuckDB extension, and folks are working on a Postgres extension.

One thing I'll respectfully disagree with — "SQL is highly statically analyzable by nature":

As a really basic example: `SELECT <expr> FROM tbl` — can we tell what shape the result is? In SQL, shapes / types require a lot of context — the result could be a single row in the case of `SUM(foo)`, or it could be every row in the case of `foo, bar`. More in https://prql-lang.org/faq/...

anon848736282y ago

Well, Malloy is developed within Google by the founder of Looker, so there is a chance it could be natively integrated into BigQuery. At that point you have a next gen SQL replacement available on one of the most widely used analytics and transformation engines.

Scarbutt2y ago

It's hard to say, if microsoft or google where behind prql and promoting it, it may as well become a typescript. There's a reason many developers use query builders, while not exactly the same, they want programming language features and familiarity of modern programming languages. Not saying prql is the correct approach here since I don't know it.

frogulis2y ago

Pretty cool, your description got my click. I particularly enjoy that a filter is a filter before and after grouping.

One thing, the "showcase" section is not usable for me on mobile. The code box does not fit on the screen horizontally and I can't scroll right to see the remainder of it.

snthpy2y ago

Thank you for the feedback. I'll let the team know.

We definitely want people on all devices to be able to learn about the project.

ledauphin2y ago

I've been excited in the abstract about PRQL for quite a while. But something FQL seems to have a much better handle on is the value of document-orientedness, or what you might alternatively call "gradual schematization".

This problem has been solved (if not beautifully, at least acceptably) by modern SQL databases that support a JSON storage format and associated "secondary query language".

I know PRQL has had an open issue on this subject for a while. I just want to note that I think this is truly one of the critical "missing pieces" to PRQL, without which it may never be able to break out into common usage.

anon848736282y ago

FQL is interesting because it focuses on transactional systems and eliminating the need for an ORM in applications. I feel many of the SQL replacement projects like PRQL and Malloy instead come from the analytics side of the house, which doesn't really help application developers at all. (But does raise the question, how do I do analytics in Fauna? Do I ETL to a traditional warehouse system?)

roenxi2y ago

Great project, wish you all the best. Anything to try and unseat SQL from common use (we can all wish for the day we run PostgrespostSQL in production). At the moment the project is probably going to lose people because it isn't obvious how to get started - many SQL beginners don't know what a compiler is and will get confused by the docs.

For the sake of their sanity, it'd be worth considering putting an example of using the compiler on a local text file somewhere prominent on that site. That way beginners can go in, write some PSQL, compile it and use it against real SQL databases.

Or if not the compiler, make it clear how beginners are supposed to engage with this. There is a big need out there for something dplyr-like that works. There are a dizzying array of options and that isn't going to help some good people who need a bit of handholding.

snthpy2y ago

Thank you for your feedback. That's really valuable!

We have the [PRQL Playground](https://prql-lang.org/playground/) exactly for that purpose.

We'll try and make it more prominent on the front page. I've also felt that we should have a "Getting started" page and will push that as a priority.

[0] https://www.postgresql.org/docs/13/sql-altertable.html

psacawa2y ago

Is there any intention of eventually supporting DML or DDL statements? That's when the COBOL-like nature of SQL syntax is most frustrating. For example, in order to run "ALTER COLUMN ..." I have to parse a ridiculous BNF like this[0] almost every time. I'll never remember it.

Usually, the error is a gotcha built into the language syntax (e.g. forgot the keyword "TO").

seanhunter2y ago

SQL is based on relational calculus rather than relational algebra, which is why it's declarative. Relational algebra is built on fundamental relational operators (select, project, filter, product etc) which are imperatively applied. You can find out more about it here https://techdifferences.com/difference-between-relational-al...

jug2y ago

At least superficially this looks a lot like C# LINQ to me in terms of structure and database independence (as for EF Core + LINQ). It’s in my top 3 features of that language.

https://www.tutorialsteacher.com/linq/sample-linq-queries

Edit: Shortened to link due to formatting issues

snthpy2y ago

LINQ is definitely one of the big influences along with many other great projects that form the prior art in this space.

See this section in our FAQ: https://prql-lang.org/faq/#:~:text=Something%20here%20remind...

ilyt2y ago

> Iterating through that would use lazy evaluation by default, returning row by row from the db as needed.

That's... not an advantage in most cases

intrasight2y ago

Also reminds me of the Power Query M language - semantically anyway

biglyburrito2y ago

Yep, it reads very easily like C# LINQ method & query syntax.

danielvaughn2y ago

That first PRQL code sample is wonderfully readable.

Timon32y ago

It is! One suggestion to make it even more convincing: I'd love to see the SQL statement it compiles to.

snthpy2y ago

Awesome! That's what we're hoping for. Great to hear that you find it wonderfully readable!

raverbashing2y ago

> we believe that SQL is a combination of two things:

> 1. Relational Algebra, which is eternal because it's just maths, and 2. A language designed in the 70s that looks like COBOL.

Your belief is as real as my belief that it rains too much in London ;) (that is, it is correct)

But why people have such hold on to such a quirky syntax beats me

iamcreasy2y ago

How would you compare prql with dbt?

snthpy2y ago

dbt integration was one of our major goals early on but we found that the interaction wasn't as straightforward as we had hoped.

There is an open PR in the dbt repo: https://github.com/dbt-labs/dbt-core/pull/5982#issuecomment-...

I have some ideas about future directions in this space where I believe PRQL could really shine. I will only be able to write those down in a couple of hours. I think this could be a really exciting direction for the project to grow into if anyone would like to collaborate and contribute!

anon848736282y ago

dbt is just an orchestration tool. It uses SQL because that's what you need to pass to the target database. There is a python plugin if you prefer to use that for your models instead. Theoretically dbt could wrap any language your target system accepts. The actual configuration of the dbt runtime itself is done with yaml files.

deburo2y ago

That looks awesome. Does it support directly querying against databases (PostgreSQL, SQL Server, ...)? ie. is there a "Run" command in vscode that takes care of compiling & running the compiled sql?

hiAndrewQuinn2y ago

This is tremendous. I'm curious to know if a CLI `prqlite3` exists which wraps around the `sqlite3` CLI many of us know and love.

snthpy2y ago

Thank you.

The CLI usability was one of the aims behind [prql-query (pq)](https://github.com/prql/prql-query/). sqlite integration was on the roadmap but unfortunately that project has been largely unmaintained by me for the past 6 months. (This is just referring to prql-query and not PRQL which is under very active development.)

I'm working on a new project which will do exactly this (and a lot more!) which I hope to release next week. I'll drop the link here when that's ready.

spion2y ago

How is the language server support?

snthpy2y ago

We do have grammars for many editors. I'm not sure about LSP off the top of my head but you should find something in the docs under "Integrations".

There is also a VSCode extension: https://marketplace.visualstudio.com/items?itemName=prql-lan...

https://en.wikipedia.org/wiki/Codd%27s_12_rules

[PRQL dev here]

We don't have LSP support yet, but it's on the Roadmap. We've designed the language to be very LSP-friendly — one of the benefits of starting with `from` and pipelining each function.

continuational2y ago

In that first example, is the last line superfluous? It doesn't seem to be used.

laerus2y ago

hey, is compile time verification of queries supported for PRQL in Rust?

snthpy2y ago

Not yet, but looking at what sqlx does, I think we should be able to do something similar.

It's been a small team of core contributors so far but in the last three months we've seen more people making their first PR and then going on to contribute more over time so the momentum is growing.

We'd definitely be open to contributions in this space.

dagss2y ago

Nitpick, but relational does not mean joins, it means tables/rows of tuples. A "relational document database" which is the slogan of Fauna it seems is a contradiction in terms.

contrast2y ago

That’s technically correct, and I think the author would say he’s aware of that definition.

The article as I read it is trying to make a broader point, that there are underlying mathematical principles that inspired Codd’s relational model.

I’ve never had cause to explore it, but my understanding is that there’s nothing in those principles that require tables/rows of tuples.

One goal of the article seems to be to inspire a curiosity in knowledgeable readers: what happens if you build a document database that also supports the same mathematical principles that inspired the relational model?

gregjor2y ago

> there’s nothing in [Codd’s] principles that require tables/rows of tuples.

Have you read Codd’s Rules #1 and #2? Pretty clear on this point.

Technically the relational model uses the term relation to refer to an unordered set of tuples, where every tuple has a key (one or more elements) to uniquely identify it, and every tuple has the same number of items, of the same type. Tables are relations. So are the results of a query, which can include joins.

dragonwriter2y ago

> The article as I read it is trying to make a broader point, that there are underlying mathematical principles that inspired Codd’s relational model.

The relational model is a direct product of a set of mathematical principles Codd put together called relational algebra, which deals with sets of tuples called relations.

Nothing in the article addresses any of the mathematical underpinnings of the relational model. Its blowing smoke at an audience that it expects to know next to nothing about the topic.

> One goal of the article seems to be to inspire a curiosity in knowledgeable readers: what happens if you build a document database that also supports the same mathematical principles that inspired the relational model

The features of RDBMSs that they seem to be suggesting FQL supports are ACID transactions. While that's an important feature of RDBMSs, it isn’t the same thing as the mathematical principles addresses by the relational model, whether relational algebra or the more general set theory that inspires it. The article isn't directed at knowledgable readers.

bazoom422y ago

A relation is by definition a set of tuples (informally called a table where the tuples are the rows).

Codds relational database model adds the further constraint that nested tables are not allowed (first normal form), instead representing relationships through foreign keys.

Codds motivation for disallowing nested tables is that it makes query languages much simpler. He develops relational algebra which is the foundation behind SQL, which is why SQL does not allow nested tables.

Document databases does not follow first normal form and allows nested structures, so they cannot be queried with relational algebra, since it doesnt have a way to “drill down” into nested structures.

It is unclear to me what “mathematical principles” remain if you remove the notion of relations from the relational model.

[1]: https://ako.github.io/blog/2023/08/25/json-transformations.h...

marcosdumay2y ago

Whatever you want to point from theory, the one single distinctive feature of the relational model is the "mostly free" interdependency between the relations. AKA, the fks and joins.

crabbone2y ago

Worthless article. Zero useful description of what it's trying to sell. A bunch of disjoint historical facts about relational databases that have nothing to do with the product being sold take about 2/3 of the article.

Also the author seems to be very proud of associating themselves with Microsoft's products (w/o even a hint of doubt that that may not show them in favorable light)...

Also, marketing-inspired use of pseudo-programming terminology (eg. "dynamic languages"). Ewww.

ako2y ago

Seems like a lot of what fauna does by storing documents isn’t really new, oracle, Postgres and others have provided this for a long time. I was really surprised by the performance of json queries [1], opens the doors to using Postgres as a client api cache, storing the payload in a table, and doing deserialization using (materialized) views.

Difference seems to be the approach to minimize number of calls from your application, get all require session data in one call, similar to what graphql is doing for api calls. They’re also using http as the protocol for database connectivity.

default-kramer2y ago

Using FQL instead of SQL seems to be a pretty big difference too.

ako2y ago

Postgres has procedural languages and enables you to return complex json structures combined of relational data and json documents both with its procedural languages and regular sql. Sure, the syntax is different, but not sure if the difference makes a big impact.

robertlagrant2y ago

Yes - the difference you mention seems to be the main difference.

ttfkam2y ago

CTEs and query pipelining are not sufficient?

gigatexal2y ago

Re PRQL … I see it like my text editor. I’ll stick with vi because it has solved text editing. It’s done. Same with SQL. I’ve not seen anything yet ready to replace it. It’s not perfect. But for what I need from it it’s perfectly serviceable.

robertlagrant2y ago

On strong schemas and flexibility:

1. You still have a schema in your code. With weak schemas it's now just harder to know if every record in your database conforms to it.

2. An ORM is a great tool for prototyping. R.g. have SQLAlchemy objects in code, run a command to generate a database migration; run the migration, and you have all your data guaranteed to be compatible with your latest code, and you didn't write any SQL.

m_mueller2y ago

If you program defensively you can save on certain common Schema updates in e.g. a document based data model (e.g. adding more fields). But strong schemas definitely make sense when you’re dealing with relational data from my experience. Earlier in my career I built a relational model on top of CouchDB (due to its strong replication capabilities, including on mobile devices), but it was definitely painful (and less performant) compared to building it in a relational DB.

roenxi2y ago

> If you program defensively you can save on certain common Schema updates in e.g. a document based data model (e.g. adding more fields)

ALTER TABLE whatever ADD COLUMN new_field type DEFAULT NULL;

I've seen a lot of people claim that they don't want to waste time clarifying their schema and I'm sure there are edge cases where that is clever. But, in the majority of cases, they are literally risking data integrity for a saving smaller than the time it takes to write a HN comment.

Making schema implicit doesn't "save" anything. The schema is still there, now just only insiders who are completely familiar with the code know what it is. And they're going to have a few extra bugs because they'll forget too.

ghusbands2y ago

> Most importantly, SQL databases made supporting highly consistent ACID transactions easy.

The default transaction isolation level for every major database is not ACID. Enabling the required serializability tends to make performance terrible, and so most don't.

_a_a_a_2y ago

> Enabling the required serializability...

is trivial, no?

> ...tends to make performance terrible

I've heard this a lot but never seen any figures - anyone have any numbers/experience?

(edit: and most apps I've worked with didn't need serialisability, either because they were working with a snapshot of data or absolutely precise answers weren't needed)

ghusbands2y ago

I've heard multiple accounts of people being taken by surprise by this and by how transaction isolation actually works in databases and not finding it at all easy to correct it. A famous one is https://blog.codinghorror.com/deadlocked/

(On your edit: The problem is not knowing when you're being hit by it. Even just maintaining a limit on total size of uploaded files or such, for example, is nontrivial under default isolation levels.)

iudqnolq2y ago

that's presumably why the author said "made supporting... easy", not "is"?

ghusbands2y ago

Most people believe that databases are ACID by default, so it's worth bringing up when an article even subtly implies otherwise.

xwowsersx2y ago

I have to say this was written extremely well. Quite cogent and I feel I learned a little something. Bookmarking this as a pretty decent intro to this area that I can refer people to.

slotrans2y ago

Fixed schemas are good. Document stores are bad. SQL is good.

Stop doing this nonsense. It's a step backwards. As the intro points out, hierarchical and graph DBs came first, and relational was built in part to solve their problems. Document DBs just bring those problems back.

bob10292y ago

> Fixed schemas are good.

I recall getting into an argument recently (perhaps on HN) wherein the central thesis for why SQL is bad is because the schema is "difficult" to change relative to a document store or other no-SQL abstraction.

If you don't have a clear idea of what the representative SQL schema might be for your problem or business (say, within ~80%+ certainty), one may argue you should not be writing any software until you've further clarified things with business stakeholders.

I strongly believe that virtually all evil which emerges from practical software engineering comes out of this "flexible schema" bullshit. If the business is certain of the shape of their problem, there is almost certainly a fixed schema that can accommodate. There are very few problem domains which cannot be coaxed into a strict SQL schema.

totalhack2y ago

There are also ways to add some flexibility into a "fixed" schema when you need it. Entity-attribute-value tables, views, JSON columns (as a last resort), or a semantic layer like https://github.com/totalhack/zillion

Scarbutt2y ago

Business requirements change over time, specially at the beginning, you may have 80% certainty of the schema today but not in four months.

xtracto2y ago

I am tech advisor to a bunch of startups. One of them doing stock buy/sell came to me with their MongoDB based system. The first thing I told them is that using a document based db for oltp for their use case was going to give them problems.

I saw it first hand 10 years ago, and had to do a migration.

Their justification for using mongo was that their system is very dynamic so their data changes a lot and sql based DBs dont allow that. I told them about DBA migrations and whatnot, but I just haven't been able to convince them.

It's sad seeing how they are digging into the same hole I had to digg out myself from a decade ago.

lcnPylGDnU4H9OF2y ago

At this point, one chooses the solution for their problem. The reason the fads occur is that a person who vaguely understands both the problem and the solution will write a blog post which happens to go viral talking about how the solution will solve all problems.

NoSQL databases aren’t unilaterally worse than relational ones. They just solve different problems.

pphysch2y ago

> NoSQL databases aren’t unilaterally worse than relational ones. They just solve different problems.

I can't prove this, but I assert that a relational database that has solid JSON+text support (e.g. Postgres) is on much better footing than a NoSQL DB that attempts to implement a true relational model.

One is a adding a special new datatype, the other is trying to add an entire paradigm.

Just use Postgres. If you do need to migrate to Mongo for some reason, dumping your tables into JSON isn't the end of the world.

scott_meyer2y ago

what is a document? How is an ORANUM or a bignum not a document?

One motivation for creating documents is that modeling document contents as relations requires the creation of a bunch of primary keys which no natural definition. A simple document might be an ordered collection of paragraphs, [p23, p57, ...]

Modifying such things is difficult. In fact, the most effective way of structuring modification seems to be OTs based on document offsets. What Google docs does.

j / k navigate · click thread line to collapse

172 comments

herodoturtle2y ago

This is a very interesting way to promote a product, credit to the author (who is an industry veteran it seems).

I had no idea what Fauna was. I just clicked the link here because the title caught my eye (I work with databases quite a bit).

The opening paragraph immediately grabbed my attention - "My first deep dive into SQL was in 1987, just before I became the first technical person at Microsoft to work on SQL Server." - woah!

So I read this entire article, which is very well written and easy to read but mostly affirms what I already know.

And then I get to the final section where they promote Fauna - and so now I know about Fauna too.

Kudos to these folks, in my humble opinion, this is marketing done right.

probablypower2y ago

This is interesting, because I have the exact opposite response to these sorts of articles.

Obviously this method must resonate with people, like yourself, otherwise it wouldn't become so common. I guess I'm just the 'B' in the A/B testing that results in this type of marketing.

xwowsersx2y ago

naasking2y ago

> It really rubs me the wrong way when an article ends with a bait-and-switch, where you realise the entire article was manufactured to make you relate to their product's business case.

zzzeek2y ago

agree, I have to look at the domain name, the title / sidebar , etc. to see up front, "OK this is yet another 'we think we have a better SQL' startup", then I skip the whole thing.

A site that's about "here's our product and why you might like it!" without getting into some "SQL, well you know, it has shortcomings" which is just unnecessary.

austhrow7432y ago

Why would your original assumption not be that the Fauna website exists to promote Fauna?

snthpy2y ago

Disclaimer: I'm a core contributor to PRQL [1] and post about it a lot on HN. Apologies for jumping in on other people's threads, but for people interested in the headline, PRQL might be of interest.

At PRQL[1] we believe that SQL is a combination of two things:

1. Relational Algebra, which is eternal because it's just maths, and 2. A language designed in the 70s that looks like COBOL.

1: https://prql-lang.org/

ttfkam2y ago

It may be typical of many SQL users and formatters, but it leaves a poor taste in the mouth that you aren't interested in an actual comparison but in marketing.

After 49 years of SQL, more than syntax has to change; you need an engine that supports this natively and can actually improve planner behavior over existing engines.

I will grant that if you are limiting your target audience to primarily analytics, it's probably sufficient. The marketing of PRQL doesn't always appear to do this however.

wackget2y ago

Yeah the syntax comparison is deliberately misleading.

They style it as "4 lines vs 10 lines!" when it's actually 4 lines vs 4 lines.

   # PRQL
   from employees
   select {id, first_name, age}
   sort age
   take 10

   # Misleading SQL
   SELECT
     id,
     first_name,
     age
   FROM
     employees
   ORDER BY
     age
   LIMIT
     10

   # Actual SQL
   SELECT id, first_name, age
   FROM employees
   ORDER BY age
   LIMIT 10

The join example is similarly deceptive:

   # PRQL
   from employees
   join b=benefits (==employee_id)
   join side:left p=positions (p.id==employees.employee_id)
   select {employees.employee_id, p.role, b.vision_coverage}

   # Misleading SQL
   SELECT
     employees.employee_id,
     p.role,
     b.vision_coverage
   FROM
     employees
     JOIN benefits AS b ON employees.employee_id = b.employee_id
     LEFT JOIN positions AS p ON p.id = employees.employee_id

   # Actual SQL
   SELECT employees.employee_id, p.role, b.vision_coverage
   FROM employees
   JOIN benefits b USING employee_id
   LEFT JOIN positions p USING employee_id

Nonsense.

iamsam1232y ago

Does it SELECT * by default if I never define a SELECT below my FROM? ... Continuing to encourage folks by allowing them to SELECT * easier is would not be fun for me... I could be wrong?

Agreed, just parsing out the formatting so its "fewer lines" than traditional SQL soured me.

The expressions example is ridiculous, in Redshift I can do this all day?? SELECT 1 + 2 AS num1 , num1 * 2 AS num2 -- Literally no difference

Just learn SQL...

https://www.malloydata.dev/

Izkata2y ago

> Putting the FROM first isn't sufficiently compelling on its own.

The SELECT clause is also akin to an assignment and it's extremely rare I see anyone advocating flipping the order of those to match what they say they want in SQL.

anon848736282y ago

[PRQL dev here]

I strongly think we should have the best examples of SQL to compare against. I've ironically made this complaint for other libraries, so I'm alarmed that folks think we might have done the same.

We would take PRs for any improvements to the SQL that make it a better comparison.

remram2y ago

It's just syntax, it compiles to SQL and runs on today's DBMS. It has no difference in speed or functionality.

* https://github.com/totalhack/zillion

snthpy2y ago

For those interested who want to learn more, we have a number of presentations coming up at conferences on three continents:

- [QCon SF, October 2nd, San Francisco, USA: ](https://qconsf.com/presentation/oct2023/prql-simple-powerful...)

- [PyconZA, October 5th, Durban, South Africa: ](https://za.pycon.org/)

- [Community over Code (ApacheCon), October 9th, Halifax, Canada: ](https://communityovercode.org/schedule-list/#FT005)

- [data2day, October 12th, Karlsruhe, Germany: ](https://www.data2day.de/veranstaltung-21353-0-prql-a-modern-...)

lolinder2y ago

SQL will never die for the same reason that JavaScript will never die: because it's built in to all major database engines.

It's not that I don't believe we could do with an improvement on SQL, but I really don't see a realistic path forward for a replacement.

totalhack2y ago

paulddraper2y ago

While I mostly agree, there is a bit of Stockholms syndrome.

A lot of people don't know what they even could be missing.

For example, there is no succinct way of writing an antijoin in SQL .

The MERGE command has only been implemented by some engines due to (IIRC) concurrency concerns/ambiguities.

ANSI SQL JSON operations have improved but are still clunky.

Boolean NULL and IN is a clusterf of footguns.

Etc.

[PRQL dev here]

I agree with the sentiments, even if not the conclusion. SQL is omnipresent and is "fine" in a lot of cases.

One thing I'll respectfully disagree with — "SQL is highly statically analyzable by nature":

anon848736282y ago

Scarbutt2y ago

frogulis2y ago

Pretty cool, your description got my click. I particularly enjoy that a filter is a filter before and after grouping.

One thing, the "showcase" section is not usable for me on mobile. The code box does not fit on the screen horizontally and I can't scroll right to see the remainder of it.

snthpy2y ago

Thank you for the feedback. I'll let the team know.

We definitely want people on all devices to be able to learn about the project.

ledauphin2y ago

This problem has been solved (if not beautifully, at least acceptably) by modern SQL databases that support a JSON storage format and associated "secondary query language".

anon848736282y ago

roenxi2y ago

snthpy2y ago

Thank you for your feedback. That's really valuable!

We have the [PRQL Playground](https://prql-lang.org/playground/) exactly for that purpose.

We'll try and make it more prominent on the front page. I've also felt that we should have a "Getting started" page and will push that as a priority.

[0] https://www.postgresql.org/docs/13/sql-altertable.html

psacawa2y ago

Usually, the error is a gotcha built into the language syntax (e.g. forgot the keyword "TO").

seanhunter2y ago

jug2y ago

At least superficially this looks a lot like C# LINQ to me in terms of structure and database independence (as for EF Core + LINQ). It’s in my top 3 features of that language.

https://www.tutorialsteacher.com/linq/sample-linq-queries

Edit: Shortened to link due to formatting issues

snthpy2y ago

LINQ is definitely one of the big influences along with many other great projects that form the prior art in this space.

See this section in our FAQ: https://prql-lang.org/faq/#:~:text=Something%20here%20remind...

ilyt2y ago

> Iterating through that would use lazy evaluation by default, returning row by row from the db as needed.

That's... not an advantage in most cases

intrasight2y ago

Also reminds me of the Power Query M language - semantically anyway

biglyburrito2y ago

Yep, it reads very easily like C# LINQ method & query syntax.

danielvaughn2y ago

That first PRQL code sample is wonderfully readable.

Timon32y ago

It is! One suggestion to make it even more convincing: I'd love to see the SQL statement it compiles to.

snthpy2y ago

Awesome! That's what we're hoping for. Great to hear that you find it wonderfully readable!

raverbashing2y ago

> we believe that SQL is a combination of two things:

> 1. Relational Algebra, which is eternal because it's just maths, and 2. A language designed in the 70s that looks like COBOL.

Your belief is as real as my belief that it rains too much in London ;) (that is, it is correct)

But why people have such hold on to such a quirky syntax beats me

iamcreasy2y ago

How would you compare prql with dbt?

snthpy2y ago

dbt integration was one of our major goals early on but we found that the interaction wasn't as straightforward as we had hoped.

There is an open PR in the dbt repo: https://github.com/dbt-labs/dbt-core/pull/5982#issuecomment-...

anon848736282y ago

deburo2y ago

That looks awesome. Does it support directly querying against databases (PostgreSQL, SQL Server, ...)? ie. is there a "Run" command in vscode that takes care of compiling & running the compiled sql?

hiAndrewQuinn2y ago

This is tremendous. I'm curious to know if a CLI `prqlite3` exists which wraps around the `sqlite3` CLI many of us know and love.

snthpy2y ago

Thank you.

I'm working on a new project which will do exactly this (and a lot more!) which I hope to release next week. I'll drop the link here when that's ready.

spion2y ago

How is the language server support?

snthpy2y ago

We do have grammars for many editors. I'm not sure about LSP off the top of my head but you should find something in the docs under "Integrations".

There is also a VSCode extension: https://marketplace.visualstudio.com/items?itemName=prql-lan...

https://en.wikipedia.org/wiki/Codd%27s_12_rules

[PRQL dev here]

We don't have LSP support yet, but it's on the Roadmap. We've designed the language to be very LSP-friendly — one of the benefits of starting with `from` and pipelining each function.

continuational2y ago

In that first example, is the last line superfluous? It doesn't seem to be used.

laerus2y ago

hey, is compile time verification of queries supported for PRQL in Rust?

snthpy2y ago

Not yet, but looking at what sqlx does, I think we should be able to do something similar.

It's been a small team of core contributors so far but in the last three months we've seen more people making their first PR and then going on to contribute more over time so the momentum is growing.

We'd definitely be open to contributions in this space.

dagss2y ago

Nitpick, but relational does not mean joins, it means tables/rows of tuples. A "relational document database" which is the slogan of Fauna it seems is a contradiction in terms.

contrast2y ago

That’s technically correct, and I think the author would say he’s aware of that definition.

The article as I read it is trying to make a broader point, that there are underlying mathematical principles that inspired Codd’s relational model.

I’ve never had cause to explore it, but my understanding is that there’s nothing in those principles that require tables/rows of tuples.

gregjor2y ago

> there’s nothing in [Codd’s] principles that require tables/rows of tuples.

Have you read Codd’s Rules #1 and #2? Pretty clear on this point.

dragonwriter2y ago

> The article as I read it is trying to make a broader point, that there are underlying mathematical principles that inspired Codd’s relational model.

The relational model is a direct product of a set of mathematical principles Codd put together called relational algebra, which deals with sets of tuples called relations.

Nothing in the article addresses any of the mathematical underpinnings of the relational model. Its blowing smoke at an audience that it expects to know next to nothing about the topic.

bazoom422y ago

A relation is by definition a set of tuples (informally called a table where the tuples are the rows).

Codds relational database model adds the further constraint that nested tables are not allowed (first normal form), instead representing relationships through foreign keys.

It is unclear to me what “mathematical principles” remain if you remove the notion of relations from the relational model.

[1]: https://ako.github.io/blog/2023/08/25/json-transformations.h...

marcosdumay2y ago

Whatever you want to point from theory, the one single distinctive feature of the relational model is the "mostly free" interdependency between the relations. AKA, the fks and joins.

crabbone2y ago

Also the author seems to be very proud of associating themselves with Microsoft's products (w/o even a hint of doubt that that may not show them in favorable light)...

Also, marketing-inspired use of pseudo-programming terminology (eg. "dynamic languages"). Ewww.

ako2y ago

default-kramer2y ago

Using FQL instead of SQL seems to be a pretty big difference too.

ako2y ago

robertlagrant2y ago

Yes - the difference you mention seems to be the main difference.

ttfkam2y ago

CTEs and query pipelining are not sufficient?

gigatexal2y ago

robertlagrant2y ago

On strong schemas and flexibility:

1. You still have a schema in your code. With weak schemas it's now just harder to know if every record in your database conforms to it.

m_mueller2y ago

roenxi2y ago

> If you program defensively you can save on certain common Schema updates in e.g. a document based data model (e.g. adding more fields)

ALTER TABLE whatever ADD COLUMN new_field type DEFAULT NULL;

ghusbands2y ago

> Most importantly, SQL databases made supporting highly consistent ACID transactions easy.

The default transaction isolation level for every major database is not ACID. Enabling the required serializability tends to make performance terrible, and so most don't.

_a_a_a_2y ago

> Enabling the required serializability...

is trivial, no?

> ...tends to make performance terrible

I've heard this a lot but never seen any figures - anyone have any numbers/experience?

(edit: and most apps I've worked with didn't need serialisability, either because they were working with a snapshot of data or absolutely precise answers weren't needed)

ghusbands2y ago

iudqnolq2y ago

that's presumably why the author said "made supporting... easy", not "is"?

ghusbands2y ago

Most people believe that databases are ACID by default, so it's worth bringing up when an article even subtly implies otherwise.

xwowsersx2y ago

I have to say this was written extremely well. Quite cogent and I feel I learned a little something. Bookmarking this as a pretty decent intro to this area that I can refer people to.

slotrans2y ago

Fixed schemas are good. Document stores are bad. SQL is good.

bob10292y ago

> Fixed schemas are good.

totalhack2y ago

Scarbutt2y ago

Business requirements change over time, specially at the beginning, you may have 80% certainty of the schema today but not in four months.

xtracto2y ago

I saw it first hand 10 years ago, and had to do a migration.

It's sad seeing how they are digging into the same hole I had to digg out myself from a decade ago.

lcnPylGDnU4H9OF2y ago

NoSQL databases aren’t unilaterally worse than relational ones. They just solve different problems.

pphysch2y ago

> NoSQL databases aren’t unilaterally worse than relational ones. They just solve different problems.

One is a adding a special new datatype, the other is trying to add an entire paradigm.

Just use Postgres. If you do need to migrate to Mongo for some reason, dumping your tables into JSON isn't the end of the world.