The answer is "one". If you have less than 10k req/s you shouldn't even start to think about multiple DB servers or migrating from bog-standard MySQL/MariaDB or Postgres.
I will never understand this obsession with "scaling". Modern web dev seriously over-complicates so many things, it's not even funny anymore.
What happens when that database loses some data? Do you want an up-to-the-second backup, or point-in-time recovery? Or are you OK restoring last night's backup? Distribution isn't only about scale, it's also about durability.
What happens when you need to run an expensive business process ad-hoc? Do you want it to be easy to scale out reads, or to export that data to an analytics system? Or are you OK building something else to handle that case? Distribution isn't only about scale, it's also about flexibility.
What happens when you want to serve customers in one market, and make sure that their data stays local for regulatory compliance reasons or latency? Are you OK with having separate databases? Distribution isn't only about scale, it's also about locality.
When you are big enough to worry about the other issues, you surely are big enough to handle the requirements in-house. I see the dependence on some specific companies as the bigger threat to reliability.
I find that people don't often grasp that distribution is about availability. It's obvious once you say it, but for a long time my own intuition was that distribution was mostly about durability, or about consensus protocols providing a total order across multiple machines. Yet these build together into availability.
In fact, I first noticed this distinction when reading Brian M. Oki's seminal 1988 paper on Viewstamped Replication, the work that would pioneer the field of consensus: a year before Paxos, with an intuitive protocol essentially identical to Raft. The surprising thing is that many of us today might have titled the paper something about "consensus" or "total order" (which it practically invented, and which was the major breakthrough, at least in how to do this in the presence of network partitions), but he titled it "Viewstamped Replication: A New Primary Copy Method to Support Highly-Available Distributed Systems".
I gave a short intro talk on Viewstamped Replication (and in particular why FTP, nightly backups, or manual failover are not a solution): https://www.youtube.com/watch?v=_Jlikdtm4OA
The talk is followed by interviews with Brian M. Oki and James Cowling (authors of the 1988 and 2012 papers respectively).
Serverless is not just about auto scaling up from 1 to n, it's about autoscaling down from 1 to 0.
If Cockroach provides a robust SQL DB at a marginal cost of ~$0/mo for small request volumes, that is a real value add over running your own pg server.
Not having to deal with administration or backups is another big value add.
This offering looks like it compares very nicely to, say, running an RDS instance with auto backups enabled.
In your k8s example, the cost you pay for running a k8s cluster when you don't need one is the operational overhead.
In the Cockroach serverless case, the costs that come to mind include vendor lock-in once you evolve a pattern of production traffic that is hard to migrate to other solutions, and security and compliance challenges due to the virtualized instances running on shared clusters. In many cases these tradeoffs may be worthwhile. My point is that looking at it only along the dimension of scaling up and down doesn't tell the whole story. The OP doesn't talk about tradeoffs, so in the comment section we must.
But with any service without a constant workload (I’d wager almost all services besides prototypes that get no users), you’re going to have to scale that one machine by replacing it with a bigger machine. When you have 50 users you’re not going to be paying for some yy.24xlarge. You’ll start with something much more affordable. When the service grows to 50,000 users, you certainly won’t be at “Facebook scale”, but that t3.small isn’t going to cut it. Should your service ever decline, it’d be nice to scale that machine down to save on costs.
At a previous job, we spent many human hours continually ratcheting up the size of our Postgres machine a few times a year. Not only did this take non-trivial engineering hours and mind-space, it also caused maintenance downtime due to the limitations of traditional DBMSs.
Self-managed CockroachDB eliminates the downtime needed to scale. To handle a more intense workload, add machines. If you want to vertically scale each machine, that can be done without downtime too.
CockroachDB Serverless takes this a step further by scaling up and down to suit the demands of a highly dynamic workload, while minimizing costs.
Maybe what looks like a mega-scale obsession to you is actually a bunch of people trying to avoid the common headaches of managing a moderately sized, dynamic service.
The interviewer responded "When that matters, I won't even be managing the person whose problem that is."
The following questions are likely to come up:
1) My t3.xl DB is down, how much bigger can I make it?
2) My r3.24xl DB can only handle 100 TPS and now my site is down, what can I do?
3) My 2x r3.24xl DB cluster costs a lot of money. Are other solutions cheaper?
4) My latency is high, are other solutions faster?
For someone who hasn't dealt with these questions before, these will become long and painful lessons with massive material impacts to the business.
It's appealing to use Dynamo as it takes the magic out of scaling. It's appealing to use serverless RDBMS as you don't have to think about it anymore unless it has high costs/latency.
The answer is very clear-cut:
Work with professionals.
[0]: https://github.com/hasura/graphql-engine/issues/678
[1]: https://github.com/cockroachdb/cockroach/issues/28296
[2]: https://www.cockroachlabs.com/docs/v21.1/stream-data-out-of-...
[0] https://web.archive.org/web/20090306191715/http://www.my-idc...
A lot of successful businesses start with things that are not scalable, and it is a strength, not a weakness. If you start a social network for instance, you can't beat Facebook at its own game. You have to do something that Facebook can't do because it is too big. Scalability problems will be tackled as you grow.
Among the many things Facebook can't do is run its service on a single database. That makes things much harder for them. Thankfully, you are much smaller than Facebook, and you can. Take that advantage.
Now, if you set up your DB using a public cloud's flavor of PostgreSQL... that's a different story.
I launched a test cluster, and the RUs are continuously increasing without me having connected to the cluster yet. At this rate of RU climb, the cluster would use over 8mil of the available 10mil RUs in a month without me touching it.
Coming from AWS, one of the most difficult aspects of using Aurora is guessing how many I/Os will be used for different workloads. It would be a shame to introduce this complexity for CockroachDB Serverless, especially if the RUs are impacted by internal cluster operations that aren't initiated by the user.
One thing that may not be clear - you get 10M RUs for free, up front, but you also get a constant accumulation of 100 RU/s for free throughout the month. That adds up to >250M free RUs per month. This ensures that your cluster is always accessible, and that you never truly "run out" of RUs - at most you get throttled to 100 RU/s.
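To make that arithmetic concrete (assuming a 30-day month):

```python
# Free RU budget for CockroachDB Serverless, using the numbers above:
# 10M RUs up front, plus a steady 100 RU/s accrued for free.
UPFRONT_FREE_RUS = 10_000_000
ACCRUAL_RATE_RU_PER_S = 100

seconds_per_month = 30 * 24 * 60 * 60                # 2,592,000 s in a 30-day month
accrued = ACCRUAL_RATE_RU_PER_S * seconds_per_month  # 259,200,000 RUs

total_free = UPFRONT_FREE_RUS + accrued
print(f"{total_free:,} free RUs in the first month")  # 269,200,000 (>250M)
```

So the continuous accrual alone clears 250M RUs per month, before the up-front grant.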
I hear you on the difficulty of understanding how your queries map to RUs. SQL queries can be enormously complex and differ by multiple orders of magnitude from one another in terms of their compute cost. That's why we built a near real-time dashboard that shows you how quickly you're consuming RUs. You can run your workload for a few minutes and then check back on the dashboard to see how many RUs that workload consumed.
Nice use of K8s here, and overall a great post! This isn't related to CockroachDB specifically, but to kube and cgroups: I'm wondering if you faced this infamous CPU throttling issue[0] when doing the metering and limiting.
First of all, CockroachDB Serverless is available on AWS, and should integrate quite well with that ecosystem, including with Serverless functions offered by AWS Lambda.
Here are a few advantages of CockroachDB Serverless that Aurora will struggle to match (note that we're still working on Serverless multi-region support):
1. Free-forever tier. We offer a generous "free forever" tier that doesn't end after a month or a year. As the blog post outlines, our architecture is custom-built to make this economical.
2. No ceiling on write scalability. Even non-Serverless Aurora runs into increasing trouble as the number of writes / second increases past what a single machine can handle. CockroachDB just keeps going. We've had multiple high-scale customers who hit Aurora limits and had to move over to Cockroach to support business growth.
3. True multi-region support. Aurora only allows read-only, stale replicas in other regions, while CRDB allows full ACID SQL transactions. If you want to move into other regions of the world and have latency concerns or GDPR concerns, CRDB is custom-built to make the full SQL experience possible.
4. No Cloud lock-in. Perhaps this is not a concern for your company, but many companies don't like getting completely locked in to a single Cloud provider. CockroachDB works on multiple cloud providers and doesn't have a monetary interest in locking you in to just one.
5. Online schema changes. CockroachDB supports operations like adding/removing columns, renaming tables, and adding constraints without any downtime. You can perform arbitrary schema changes without disturbing your running application workloads. SQL DDL "just works".
6. Cold start in an instant. CockroachDB clusters automatically "scale to zero" when they're not in use. When traffic arrives, they resume in a fraction of a second. Compare that to Aurora, where you need to either have a minimum compute reservation, or you need to endure multi-second cold starts.
7. Great support. We've got a friendly Slack room where you can get free support and rub shoulders with fellow CockroachDB users, as well as CockroachDB folks like myself. We also have 24/7 paid support for deeper problems you might encounter.
Taken altogether, CockroachDB can go wherever your business needs it to go, without all the constraints that traditional SQL databases usually have. Do you want thousands of clusters for testing/development/tiny apps at a reasonable cost? Could your business take off and need the scale that CRDB offers? Could your business need to expand into multiple geographic regions? Are some of your workloads erratic or periodic, but still should start up instantly when needed? It's not just about what you need now, but what you may need in the future. It makes sense to plan ahead and go with a database that has "got you covered" wherever you need to go.
What would be your company's reasons for wanting this available in self-hosted CRDB? What kinds of use cases would it address for you?
Or would users face connection limits at some upper bound until the old function connections get spun down?
[1] https://www.cockroachlabs.com/docs/v21.1/example-apps.html
[1] https://www.cockroachlabs.com/docs/cockroachcloud/quickstart...
Is the open source version viable for small hobby projects?
It’s amazing that this is a killer feature and not a standard one, but here we are!
My question is similar: is CockroachDB going to offer an equivalent of RDS Proxy, so that apps can handle traffic spikes without running into DB connection pool problems?
At least if cost and scaling to zero are important to you. DynamoDB addresses the cost issue, but it's so painful to use, and is unsuitable for a lot of use cases. Aurora Serverless takes 30 seconds to cold-start, so it's also usually a non-starter, other than for batch-type workloads.
So great work, but I have a couple of questions:
Any plans to support AWS privatelink? I played around with this a bit, and it seems you need to connect to it over the internet which isn't always ideal.
Will there be a limit on how many free databases you can create? I think there are some valid use-cases where one might want to create a bunch of them, but I would be scared to do this at the risk of being kicked off your platform for abuse.
Regarding a cluster limit, we currently allow up to 5 clusters per customer account. I'd like to hear what kind of use-cases you have in mind for having a lot more clusters. One I've thought about is CI runs, where you'd want dozens or hundreds of temporary clusters running at once, in order to run your testing in parallel.
In a similar vein to how you've made CockroachDB multi-tenant: not too long ago I worked on building a multi-tenant SaaS version of a business intelligence app. The app uses a relational DB. Initially we used separate schemas on the same DB cluster, but we had problems with noisy neighbors, as well as concerns about its security.
We later opted to run a dedicated database cluster for each tenant; however, that greatly increases the marginal cost and makes it difficult to provide a free tier of service, which is a valuable way to gain new customers.
What I can't really understand is why would someone use the dedicated cluster in AWS at that price.
Google says that Cloud Spanner should get you around 7,000 read queries per second and 1,800 write queries per second for $650/mo. If a simple indexed read is 1 RU, $650 on Cockroach Serverless would get you around 2,500 reads/second. Of course, I think it's completely reasonable for a Serverless option to cost a bit more given that you'd need to over-provision Cloud Spanner (even if you smartly increased/decreased the amount of compute allocated based on demand).
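Back-of-the-envelope, assuming a simple indexed read is 1 RU and $1 buys 10M RUs (both assumptions on my part, not published numbers), over a 30-day month:

```python
# Rough reads/second that $650/mo buys on CockroachDB Serverless,
# assuming (hypothetically) 1 RU per simple indexed read and $1 per 10M RUs.
monthly_budget_usd = 650
rus_per_dollar = 10_000_000
seconds_per_month = 30 * 24 * 60 * 60

total_rus = monthly_budget_usd * rus_per_dollar   # 6.5 billion RUs/month
reads_per_second = total_rus / seconds_per_month
print(round(reads_per_second))                    # ~2508, i.e. "around 2,500"
```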
Planet Scale charges $15 per 100M rows read and $15 per 10M rows written. If an RU is a row read, then Cockroach Serverless would be $10 per 100M rows read. If a write takes 10 RUs, Cockroach would cost $10 per 10M rows written. Both of those would be less than Planet Scale's cost - but it's possible that a row read will cost more than 1 RU. Let's say that an indexed lookup of a row costs 5 RUs. Then Cockroach Serverless starts costing 3.3x more than Planet Scale.
AWS DynamoDB charges $1.25 per million write request units and $0.25 per million read request units. If I can get 10M reads from Cockroach Serverless for $1 and 4M reads from DynamoDB for $1, Cockroach's pricing looks pretty good. Of course, if I need 5 RUs to do an indexed read, the pricing doesn't look as good anymore.
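The sensitivity to the RUs-per-read assumption is easy to see. Here are the two hypothetical cases (1 RU and 5 RUs per indexed read), priced at the assumed $1 per 10M RUs, against DynamoDB's quoted read price:

```python
# Cost per million reads under two hypothetical RUs-per-read assumptions,
# with RUs priced at the assumed $1 per 10M ($0.10 per million RUs).
usd_per_million_rus = 0.10
dynamo_usd_per_million_reads = 0.25   # DynamoDB's quoted read price

for rus_per_read in (1, 5):
    usd_per_million_reads = rus_per_read * usd_per_million_rus
    verdict = "cheaper" if usd_per_million_reads < dynamo_usd_per_million_reads else "pricier"
    print(f"{rus_per_read} RU/read: ${usd_per_million_reads:.2f} per 1M reads ({verdict} than DynamoDB)")
```

The entire comparison flips on a detail that isn't documented.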
I do respect Cockroach Labs' somewhat ambiguous description here. Planet Scale's $15 per 10M rows written feels like something that could become bad. What if I define hundreds of indexes on the table? What if I'm inserting very large blob/text columns that are 50MB in size? Likewise, what if I index no columns and end up forcing a full table scan, but only 10 rows are returned? Do they consider that I "read" 10 rows or "read" all the rows in the table? If it's the former, I'm putting a lot of strain on their system without paying for it. If it's the latter, I'm going to just define indexes that might not be worth it if I were paying for the IO needed to do all that writing.
Still, it would be nice if Cockroach Labs offered some indication of what could be accomplished with 1 RU. "An indexed read of 1 row or an indexed read of a few rows in sequence; for example, 'SELECT * FROM people WHERE age > 18 ORDER BY age, name LIMIT 10' where there exists an index on (age, name)." That would let me know what to expect. "A write of a row under 4KB in size with no secondary indexes will cost 3 RUs; expect secondary indexes to increase the cost by 1 RU each" would give me an idea of what's going on.
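A rubric like that would even let you sketch a toy cost estimator. To be clear, the per-operation costs below are the hypothetical numbers from my examples (1 RU per indexed read, 3 RUs per sub-4KB write plus 1 per secondary index), not anything Cockroach Labs has published:

```python
def estimate_rus(indexed_reads=0, row_writes=0, secondary_indexes=0):
    """Toy RU estimator using a hypothetical rubric:
    1 RU per indexed row read; 3 RUs per sub-4KB row write,
    plus 1 RU per secondary index maintained by each write."""
    read_rus = indexed_reads * 1
    write_rus = row_writes * (3 + secondary_indexes)
    return read_rus + write_rus

# 1M indexed reads plus 100k writes to a table with 2 secondary indexes:
print(f"{estimate_rus(1_000_000, 100_000, 2):,} RUs")  # 1,500,000
```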
I think there are definitely cases that one can't easily enumerate. For example, "UPDATE people SET age = 18 WHERE EXISTS (SELECT * FROM legacy_info WHERE people.id = legacy_info.people_id AND legacy_info.is_adult = true)". That's potentially going to require lots of stuff that's harder to predict. However, at this point I don't know whether an indexed read of a row costs 1 RU or 10 RUs. If an indexed read of a single row costs 1 RU and I read 10 rows sequentially, will that mean 10 RUs, or will 1 RU have enough IO to cover that since they're sequential (or will the billing just over-charge since there are tangibly more rows and that's easy to explain)?
I think a decent amount of the value depends on the pricing and it's hard to judge that right now.
One thing I will note is that the storage seems expensive. It's slightly cheaper than Planet Scale, but a lot more than the $0.30/GB of Cloud Spanner, the $0.23/GB of FaunaDB, or the $0.25/GB of DynamoDB. I've been wondering a bit about Planet Scale's storage pricing, since $1.25/GB seems expensive. CockroachDB Serverless comes in at $1/GB, which also seems expensive compared to alternatives. If Cloud Spanner is basically offering a third the price, is the "Serverless" flexibility worth it, given that Spanner can be scaled up/down pretty easily in very granular increments?
Actually, one thing that could be useful might be noting how many request units per month one of the dedicated instances would have. A 2 vCPU CockroachDB instance costs $350-400. Would that be 4 billion request units per month (assuming you were fully utilizing the box)? Would it be more like 15 billion request units per month since you're presumably paying a premium for the Serverless flexibility?
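Under the same assumed $1-per-10M-RU serverless price (again, my assumption, not a published rate), the equivalence works out roughly like this:

```python
# RUs that the price of a dedicated 2-vCPU instance ($350-400/mo) would buy
# on Serverless, assuming (hypothetically) $1 per 10M RUs with no premium.
rus_per_dollar = 10_000_000

for monthly_price_usd in (350, 400):
    rus = monthly_price_usd * rus_per_dollar
    print(f"${monthly_price_usd}/mo -> {rus / 1e9:.1f}B RUs")
# 3.5B-4.0B RUs/month: close to the 4 billion figure, nowhere near
# 15 billion, unless Serverless carries a hefty premium over dedicated.
```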
Being open on how we solve hard problems is the way to build our collective knowledge as a developer community. Certainly CockroachDB itself has benefited enormously from all that has gone before and been published in the open.
We have developed just this[1], except using a MariaDB storage engine (MyTile) we've written. You can run serverless queries against TileDB arrays without spinning up a MariaDB instance. You can run any type of query MariaDB supports (joins, aggregates, CTEs, etc.). I've linked the basic documentation and an example notebook below[2]. You can run SQL queries from Python/R, or even JS or curl. We support a number of data return formats, e.g. Arrow and JSON, to facilitate different use cases.
I'll also mention that we have a number of public example datasets[3] in TileDB Cloud, such as the NYC taxi data used in this notebook[4], which you can explore!
[1] https://docs.tiledb.com/cloud/api-reference/serverless-sql
[2] https://cloud.tiledb.com/notebooks/details/TileDB-Inc/Quicks...
[3] https://cloud.tiledb.com/explore/arrays
[4] https://cloud.tiledb.com/notebooks/details/TileDB-Inc/tutori...
I have a database now with 40 TB - this would cost $40k a month just to store!