Someday, hopefully, some genius will propose "hyper-sidecar-ification", where you take microservices and package them together in a sophisticated way to avoid the limitations and latency of an HTTP barrier. As long as it's new & complicated & buzzwordy (even if it's just a monorepo again), it can catch on.
> That dumpster fire over there is not my problem
Without having to say:
> I want to rewrite the whole thing from scratch, you'll have to deal with it being down for a few months
Services decoupled over IPC (networks, FIFOs, pipes, or whatever) solve problems mostly on the ops side, from binary compatibility to resource segregation. They do very little on the dev side.
I want to make people do Bart Simpson style writing on the chalkboard “Microservices don’t make things easier” 200 times
1. Create protobuf (or similar) messages that wrap the requests and responses, e.g. CheckSubscriptionPaidRequest and CheckSubscriptionPaidResponse.
2. Refactor your code so that the fields in the messages defined in #1 are your only means of passing/retrieving information.
3. If need be, expose the service through gRPC or similar.
4. Repeat for each service/endpoint.
This way, you don't incur any overhead apart from the negligible cost of creating the protobuf message instances.
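The steps above can be sketched in a few lines. This is a minimal illustration, not real generated code: plain Python dataclasses stand in for the classes protoc would generate from a .proto file, and BillingModule is a hypothetical in-process module whose boundary is crossed only via the message types (step 2), so it could later be exposed over gRPC (step 3) without touching callers.

```python
from dataclasses import dataclass

# Stand-ins for generated protobuf message classes (hypothetical names
# taken from step 1 of the comment above).
@dataclass
class CheckSubscriptionPaidRequest:
    subscription_id: str

@dataclass
class CheckSubscriptionPaidResponse:
    paid: bool

# Hypothetical in-process "billing" module: the request/response
# messages are the only way information crosses its boundary.
class BillingModule:
    def __init__(self, paid_subscriptions: set):
        self._paid = paid_subscriptions

    def check_subscription_paid(
        self, request: CheckSubscriptionPaidRequest
    ) -> CheckSubscriptionPaidResponse:
        return CheckSubscriptionPaidResponse(
            paid=request.subscription_id in self._paid
        )

billing = BillingModule({"sub-42"})
resp = billing.check_subscription_paid(CheckSubscriptionPaidRequest("sub-42"))
print(resp.paid)  # True
```

The point is that callers are already programming against a serializable contract, so swapping the in-process call for a network stub later is a deployment decision, not a refactor.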
Essentially you have:
Modules = Microservices
Wrapper = Terraform/Helm Charts
And practically they work the same way.
This is another thing that used to make sense but no longer does. Back in the day we had pets and not cattle. You couldn't just roll someone's servers and expect everything to be fine. But today we write stateless cattle that can be killed at any moment.
If your concern is code health, a compile-time dependency works great.
If your concern is resource management, then you need a runtime dependency.
The end of the article covers why they felt complete isolation was worth the network costs. Maybe this is true for their organization.
From working on one of the largest Ruby code bases for the last 5+ years, I see the massive benefits of isolation without introducing the HTTP barrier. Yes, service isolation can let you ship faster. In practice, the network barrier will make some kinds of rollbacks easier and others far more difficult.
The reliability of the whole system is a lot more costly to maintain with the added network failure modes.
Over time, the proliferation of services imposes an ever-growing maintenance tax to keep libraries up to date and mitigate security vulnerabilities across an organization.
Tl;dr there are no free lunches.
In this particular case it's worse due to how Rails works. Usually people deploy one Rails thread per core and that core is blocked by the thread.
If that's the case, then when Service A calls Service B, Service A is blocking a core while waiting for Service B to complete. That effectively doubles resource consumption during the call: you have one server waiting on another server to do work it could have done itself.
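A toy model makes the doubling concrete. This is an illustrative sketch, not Rails: each "service" is modeled as one worker that stays occupied for the duration of its call, and we track how many workers are busy at once while A waits synchronously on B.

```python
import threading
import time

# Count how many "workers" (one per core, in the comment's model) are
# busy at the same time.
busy = 0
peak = 0
lock = threading.Lock()

def track(delta):
    global busy, peak
    with lock:
        busy += delta
        peak = max(peak, busy)

def service_b():
    track(+1)          # Service B's worker is occupied...
    time.sleep(0.05)   # ...doing the actual work
    track(-1)

def service_a():
    track(+1)          # Service A's worker is occupied
    service_b()        # "synchronous HTTP call": A just sits here waiting
    track(-1)

service_a()
print(peak)  # 2: two workers held for one unit of work
```

If the same logic lived in one process, peak occupancy for that request would be 1 worker instead of 2.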
This doesn't follow, but everyone always seems to think that it does. They even demonstrate that it doesn't in Phase 1! "Decouple billing logic within the monolith"
It makes the system brittle and slow, and it forces strong commitments that dependent services remain up (rolling releases with non-breaking migrations, etc.).
If I were to do it again, then I would first ensure that the infrastructure is there for inter-service communication to be done asynchronously, and that changes are eventually consistent. Maybe using a workflow manager like Camunda or Temporal. Or even just event choreography between services - either of those is better than a synchronous HTTP call chain of what will become 7 dependent services.
Other than async, what would make it less of a “distributed monolith”, can you say?
I guess distributed monolith is a nebulous term, and I'm sure people have their own criteria. To me, the defining characteristic IS the size of that synchronous call chain. If serving some of the public operations of your system requires a synchronous HTTP call that spans more than one service, then I consider those services too tightly coupled, and the system is closer to being a distributed monolith than a set of independent services. (I'd make an exception if the first service is an API gateway or is very explicitly a kind of middleware service, not one defining business logic.)
The degree to which the system is a distributed monolith, and how much one should care about that fact or invest effort to steer away from it is a function of how big the biggest one of those call chains is. I don't have a binary definition, more of a sliding scale. The way to avoid sliding more into the direction of a distributed monolith (at least the way I reckon it) is to avoid making those call chains from the get go.
This is the hardest bit, where if the monolith is relying on a shared db transaction between the client and service the network boundary makes them separate. Even without explicit rollbacks, at any high scale/load there can be timeouts/failures leaving an inconsistent data state.
The article suggests migrating a less critical client first, then developing such consistency mechanisms before migrating the more critical clients.
Mainly because depending on another repo can be flaky. If you depend on another repo's tag, e.g. “submodule@v1.3”, that tag's commit could be changed, which could break the build.
If you depend on another repo's commit hash, e.g. “submodule@hash”, then you're depending on nobody rebasing or running “git push -f”, which could remove that hash from the git history.
All these problems disappear if you use a monorepo rather than a git submodule…