Benchmarking Codswallop: Node.js vs. PHP (opens in new tab)

(philsturgeon.co.uk)

163 pointsthere412y ago82 comments

82 comments

Node benchmark is flawed though. Add something like

require('http').globalAgent.maxSockets = 64;

at the top of node script if you want a fair comparison with async php version. The bottleneck is bandwidth here. Not the runtime.

On my laptop, original script from the author took 35 seconds to complete.

With maxAgents = 64, it took 10 seconds.

Edit: And who is downvoting this? I just provided actual numbers and a way to reproduce them. If you don't like how the universe works, don't take it out on me.

STRML12y ago

This is the "insane default" that substack goes on a rant about in the hyperquest README: https://github.com/substack/hyperquest#rant

  There is a default connection pool of 5 requests. 
  If you have 5 or more extant http requests, any 
  additional requests will HANG for NO GOOD REASON.

This one trips up people on #node.js constantly and hopefully will be removed very soon.

IsaacSchlueter12y ago

It's fixed in master already. Test with the latest 0.11 to verify.

dspillett12y ago

Unless this is what has changed I'd say the default is sane and has a good reason. The doc (http://nodejs.org/docs/latest/api/http.html#http_agent_maxso...) states:

> agent.maxSockets: By default set to 5. Determines how many concurrent sockets the agent can have open per host.

This stops you accidentally overloading a single host that you are scraping. It would not (assuming it works as described) affect your app if you are making requests to many hosts to collate data. Many applications (scrapers like httrack for instance) implement similar limits by default. If you are piling requests onto a single host but you either know the host is happy for you to do that (i.e. it is your own service or you have a relevant agreement) or have put measures in place yourself to not overload the target then by all means increase the connection limit.

1 more reply

philstu12y ago

Nobody should be downvoting you, you raise an excellent point and you are of course right.

NodeJS v0.10.21 + Cheerio

real 0m47.986s user 0m7.252s sys 0m1.080s

NodeJS v0.10.21 + Cheerio + 64 connections

real 0m14.475s user 0m8.853s sys 0m1.696s

PHP 5.5.5 + ReactPHP + phpQuery

real 0m15.989s user 0m11.125s sys 0m1.668s

Considerably quicker! As I said I was sure NodeJS could go faster, but the point of the article was that PHP itself is not just magically 4 times slower, it is in fact almost identical when you use almost identical approaches. :)

philstu12y ago

Proof: https://pbs.twimg.com/media/BY4jok7IEAADGmE.png:large

dalore12y ago

They mention this in the update:

> Update: A few people have mentioned that Node by default will use maxConnections of 5, but setting it higher would make NodeJS run much quicker. As I said, im sure NodeJS could go faster - I would never make assumptions about something I don't know much about - and the numbers reflect that suggestions. Removing the blocking PHP approach (because obviously it's slow as shit) and running just the other three scripts looks like this:

CJefferson12y ago

It think the problem is you are feeding into the same loop. There are also probably ways of making the PHP script go faster still.

Also, that seems like a bit of a magic flag to add / tune, why is that not the default, and would I have to keep tuning it for each of my apps?

dspillett12y ago

The article starts the loop though. If tuning one setting is a black are that an inexperienced dev would miss (which it of course is, I don't disagree at all there), then drawing in a new library to work around the limits (in this circumstance) of file_get_contents() is very much so too.

Neither the original benchmark nor the response were well researched IMO. This is the Apache vs IIS wars again, where good benchmarks that reveal useful information were drowned out by the noise of a great any poorly executed (or sometimes completely biased and deliberately poorly constructed), with bad test resulting in a bad result for one side being followed by an equally bad test to try prove the opposite.

1 more reply

k3n12y ago

This is already why the author did though; the original benchmark that he replied to used Cheerio vs. phpQuery, but he rationalized that that was a losing battle anyways, and decided to test Cheerio vs. ReactPHP -- which requires a non-standard PHP extension called libevent.

1 more reply

filipedeschamps12y ago

I didn't know about this maxSockets limitation.

Is this something safe to raise?

eknkc12y ago

Default is 5. Should be just fine if you don't have a specific use case that would require higher limits.

If so, just crank it up, should be safe unless you assign Infinity or something like that and push it too much (then you have another problem though). We use 15 in production where our server parses a lot of external web pages.

1 more reply

iaminsane12y ago

http://nodejs.org/docs/latest/api/http.html#http_agent_maxso...

If you use it smart its safe enought

STRML12y ago

It's pretty much always safe to set to Infinity or to turn the agent off. This is an anti-feature.

1 more reply

dubcanada12y ago

If we really want to get into benchmarks LuaJIT with multithreading is almost 2x faster then both, it took 21 seconds to complete on my computer. And I'm willing to bet that multithreaded C would be even faster.

However you want to know what this benchmark proves? Absolutely nothing as it has to query a website. So the response time of the website matters more then this test.

gopalv12y ago

Long term php guy (I maintained APC for years, slowly given up now), so I've worked a lot with ~2k/3k request-per-second PHP websites.

The real trick here is async processing. A lot of the slow bits of PHP code is people not writing async data patterns.

If you use synchronous calls in PHP - mc::get or mysql or curl calls, then PHP absolutely sucks in performance.

Nodejs automatically trains you around this with a massive use of callbacks for everything. That is the canonical way to do things - while in PHP blocking single-threaded calls is what everyone uses.

The most satifying way to actually get PHP to perform well is to use async PHP with a Future result implementation. To be able to do a get() on a future result was the only sane way to mix async data flows with PHP.

For instance, I had a curl implementation which fetched multiple http requests in parallel and essentially lets the UI wait for each webservices call at the html block where it was needed.

https://github.com/zynga/zperfmon/blob/master/server/web_ui/...

There was a similar Memcache async implementation, particularly for the cache writebacks (memcache NOREPLY). Memcache multi-get calls to batch together key fetches and so on.

The real issue is that this is engineering work on top of the language instead of being built into the "one true way".

So often, I would have to dig in and rewrite massive chunks of PHP code to hide latencies and get near the absolute packet limits of the machines - getting closer to the ~3500 to 4000 requests per-second on a 16 core machine (sigh, all of that might be dead & bit-rotting now).

Osiris12y ago

What are best practices for writing async code in PHP?

gopalv12y ago

A lot of extensions expose async modes, use them.

On the extension APIs

curl-multi - http://php.net/manual/en/function.curl-multi-select.php

memcached-getdelayed - http://us2.php.net/manual/en/memcached.getdelayed.php

mysqli-reap_async - http://us2.php.net/manual/en/mysqli.reap-async-query.php

postgres-send_query - http://www.php.net/manual/en/function.pg-send-query.php

gearman doBackground - http://www.php.net/manual/en/gearmanclient.dobackground.php

Something like gearman queues basically take the asynchronous processing out of the web layer into a different daemon. There were things like S3 uploads and fb API calls which were shoved into gearman tasks instead of holding up the web page.

Some of the stuff is very design oriented, for instance in most of my memcache code, there are no mc-lock calls at all - all of them are mc-cas calls. A lot of the atomicity is done by using add/delete/cas which involve no sleep timeouts. A bit of it was done using atomic append, increment and decrement as well.

SQL queries are another place where PHP doing actual work sucks for the web apps. A bunch of the mysql/postgresql functionality within a lock is actually moved onto stored procedures, instead of being driven by PHP.

https://github.com/zynga/zperfmon/blob/master/server/schemas...

So the code above is horribly written because you can't parameterize table names or column names in PL/SQL. But that essentially cuts down the involvement PHP has with the backend's locked sections.

Also a lot of the stats data was flooded onto apache log files instead of being written out from the PHP code directly using an fwrite.

https://github.com/zynga/zperfmon/blob/master/client/zperfmo...

This uses apache_note() function in PHP to log stuff after the request is done & the connections are closed. That gets into the log files as %(<name)n fields in the access log.

You can see there that every single access log has an associated user, the HMAC of the request and peak memory usage. All collected at zero latency to the actual HTTP call.

The thing to avoid though is pcntl - it absolutely messes up all of apache/fastcgi process management code.

This is not all of what I've done. I am sorry to say some of my best work in this hasn't been open-sourced & has perhaps been killed since I left Zynga.

PHP backends I built using these methods were handling approx ~6-7 million users a day on 9 web servers (well, we kept 16 running - 8 on each UPS).

Ah, fun times indeed - too bad I didn't make any real money out of all that.

senorcastro12y ago

I get sick of these language wars, especially the constant stream of PHP ridicule that just never seems to end. The positives I try to take away from all of it is that there are a lot of people that are extremely passionate about software development and are striving for better tools and ways to express themselves. I want to believe that through the vitriol encountered in some of these articles that there are people really trying to improve the technologies at heart instead of taking part of some kind of programing language apologetics. In regards to PHP, I think that the ridicule has led to improvements in the language, but the overall tone in some of these articles is still a turn off for me.

tegeek12y ago

I'm sick of all of these generic SPEED benchmarks. Let me tell you some BIGGEST & REAL benefits of NodeJS where PHP SUCKS.

1. Takes 1 minute to install on any platform (*nix, windows etc.)

2. A modern Package Manager (NPM) works seamlessly with all platforms.

3. All libraries started from 0 with async baked in from day 0.

4. No need to use any 3rd party JSON serialize/deserialize libs.

5. And above all, its Atwood's law

"any application that can be written in JavaScript, will eventually be written in JavaScript".

http://www.codinghorror.com/blog/2009/08/all-programming-is-...

Wilya12y ago

Not meaning to troll but:

1. apt-get install php5 ? Seriously, that's it. On the other hand, neither Debian stable nor Ubuntu LTS have any usable version of node in their package repository (Debian has nothing, Ubuntu has 0.6)

4. json_decode() ?

5. If Atwood's law ever becomes reality, it will be a consequence, not a source of benefit.

(I don't use either Node or PHP as my main language)

tegeek12y ago

I've developed software in Java,.NET, Php & NodeJS. I ll rate JVM higher than any other platform. But when it comes to simplicity, easy scalability, Node ranks way higher. Php isnt even closer at all.

NodeJS literally takes 5 minutes to get started writing scalable apps without even thinking about concurency at all.

Write a software in Php yourself and let people download and run themselves, there are endless pain. Actually Php sucks in many areas which I dont want to touch right now.

RRRA12y ago

Actually Debian testing has : nodejs 0.10.21~dfsg1-1

debaserab212y ago

I think the only valid point here is your 3rd one.

1. You have to be kidding, right? PHP's popularity is precisely because of this.

2. getcomposer.org

4. json_decode/json_encode have been a part of PHP since PHP 5.2 (2006)

5. That's not a benefit.

bpicolo12y ago

No, it's popularity is because it was the simplest a long time ago, and it's supported on lots of virtual boxes by default.

Composer is terrible compared to NPM.

2 more replies

RossM12y ago

I'm not 100% on whether this is serious or not, but what the hell:

1. My recent installs of node have required compiling from source to get anything remotely up-to-date, however there are packages for both,

2. Composer with the Packagist registry is comparable here - you might be thinking of PEAR.

3. JS certainly has much better async support - it being JavaScript after all.

4. PHP has JSON encoding/decoding bundled, no third party lib required.

5. For better or worse

octo_t12y ago

Yet there are no decent XML libraries for node.js.

I'd trade decent JSON support for decent XML support every single day of the week.

And Scala/Java/JVM have already solved the problems you mention above.

simonw12y ago

"I'd trade decent JSON support for decent XML support every single day of the week."

Just out of interest, why is that?

I work with JSON at least once a week, but it can be months between moments when I need to work with any XML.

glenjamin12y ago

I quite like https://npmjs.org/package/sax

What is it you think is missing XML-wise in Node.js?

tegeek12y ago

1. libxmljs can take you pretty far with NodeJS.

2. Whatever we're saying here includes only NodeJS vs PHP.

jbeja12y ago

You seriously don't know what are you talking about.

balac12y ago

All your points are good apart from 4, don't the native json_encode() / json_decode() functions do what you need?

girvo12y ago

Actually, only 3 is good. The rest are incorrect, entirely, and 5 is debatable as to whether that's a good thing.

ohwp12y ago

Benchmarking is very hard because even the same language could show different results.

For example:

  for($i = 0; $i < count($list); $i++)

  $count = count($list);
  for($i = 0; $i < $count; $i++)

Most of the time benchmarks prove how capable a programmer is, not the speed of the language used.

humanrebar12y ago

Any decent compile-time optimizer will transform your first snippet into the second one (or better). Some languages preclude that optimization at compile time, but I presume that a JIT would also have little problem performing that optimization.

That is, one could argue that a good language is one that lets developers ignore trivial changes like this without hurting performance.

Jach12y ago

More as a reference for others, this is a fun page that highlights a few interesting optimizations GCC 4.2 is capable of making: http://ridiculousfish.com/blog/posts/will-it-optimize.html Maybe the optimizer can deduce guarantees about a function, but the writers may have just included an optimization for a specific built-in because the unoptimized form is such a common mistake...

likeclockwork12y ago

I don't see how that would work. I'd say that the two snippets describe different intentions and using one when you want the other is a case of not saying what you mean.

A function call in a loop condition might have side effects or do something very unorthodox.

1 more reply

disdev12y ago

Good test.

At some point, you'd expect these arbitrary this vs. that comparisons to die off. They haven't, and I'm guessing they won't.

Basically, it comes down to picking the tool that best supports your use case, or being okay with a compromise. Like the SQL/NoSQL discussions recently... Use it poorly and you get poor results.

idProQuo12y ago

I was always under the impression that PHP hate comes from design flaws that make it easy to make mistakes, not from it being necessarily slower.

CmonDev12y ago

Node.js is also based around a subpar language, so it's not necessarily the main cause either.

aaronem12y ago

One might argue that Javascript is considerably less sub-par than PHP, though; speaking purely from my own experience, I've found that writing Javascript involves a significantly lower probability of the language attempting, at random intervals, to shatter my kneecaps with a crowbar.

2 more replies

geerlingguy12y ago

I've made a similar observation to the original post — in my case, moving a bit of functionality from PHP to Node.js gave me 100x better performance: https://servercheck.in/blog/moving-functionality-nodejs-incr...

But the reason for this wasn't that Node/JS is faster than PHP; it was because I was able to write the Node.js app asynchronously, but the PHP version was making hundreds of synchronous requests (this is the gist of the OP).

The issue I have is that Node.js makes asynchronous http calls relatively easy, whereas in PHP, using curl_multi_exec is kludgy, and few libraries support asynchronous requests.

The situation is changing, but the fact remains that asynchronous code is the norm in Node.js, while blocking code is the norm in PHP. This makes it more difficult (as of this writing) to do any non-trivial asynchronous work in PHP.

lukeholder12y ago

I agree that the comparisons are often unfair between languages/frameworks, and agree with everything phil says, but there is a lot to be said for language level non-blocking constructs.

I am really enjoying reading Go code and seeing how people use concurrency etc; and they are all doing it the same. When I would read ruby, I would have to know the particulars of a library like Celluloid or EventMachine which made it harder.

joeblau12y ago

The "Thoughts" section was the most informative part of the benchmark which underscores the way I, when I was working with PHP, operated. When I started with PHP(2005), the frameworks were terrible, I would cobble together many random coding examples from stuff I found on the web and just make my own Framework up. I don't think PHP from a performance standpoint is any better or worse, but the default examples that you generally see in the ecosystem provide significantly worse performance. The one thing that Node clearly has an upper hand on PHP with is the ecosystem. It's a lot easier for a developer new to the Node ecosystem to hit that Node target than it would be for someone of the same skill to hit the PHP target in terms of hours spent.

One funny thing is that the ReactPHP[1] site is visually similar to the Node[2] homepage.

[1] - http://reactphp.org/ [2] - http://nodejs.org/

girvo12y ago

React was originally called Node.PHP or something similar, if I recall correctly!

phpnode12y ago

http://nodephp.org originally

prima-facie12y ago

Doing async in PHP still feels like strapping a dildo to a horse and calling it an unicorn.

erikig12y ago

I'm a PHP user/fan and I find this as funny as it is true.

jbeja12y ago

Waow i never tough that i would actually read the word "fan" along side php, it make my day.

1 more reply

dude312y ago

I have used RollingCurl (non blocking CURL) to fetch multiple API requests at once using PHP. Really easy to implement using a simple class. The example shows how you could build a simple efficient scraper.

alextingle12y ago

Finally, proof that you can put lipstick on a pig.

ausjke12y ago

ReactPHP does look great, hope it will be in stable enough state for production usage. Also it seems it's only done by one person?

philstu12y ago

Two core devs with a few contributors.

A lot of the components are in production already, it was built by the original developers to be used in production. It's on 0.3.0 for many parts, which is no further behind where Node was when people started flapping about it :)

dubcanada12y ago

I've used ReactPHP in production for a while, it's stable enough.

hugofirth12y ago

Only a brit would use Codswallop :)

onion2k12y ago

I read the entire article in the accent of a West Country farmer. And it still made sense. :)

philstu12y ago

http://i.telegraph.co.uk/multimedia/archive/01603/wurzels_16...

alexyoung12y ago

Node isn't a framework.

jlebrech12y ago

so how does php fare when scraping javascript apps?

denysonique12y ago

Node.js for web scraping usually is the obvious choice:

Scraping using jQuery syntax such as:

  $('table tr').each(function(ix, el) {
    names   .push($(el).find('td').eq(0));
    surnames.push($(el).find('td').eq(1));
  })

is more familiar to most web developers as opposed to the PHP syntax.

Even if Node was 5x slower than PHP I would still go for Node because of its easy jQuery syntax.

jqueryin12y ago

Did you bother to read the post? It pitted two different DOM traversal libraries against each other:

* cheerio (https://github.com/MatthewMueller/cheerio)

* PhpQuery (https://code.google.com/p/phpquery/wiki/jQueryPortingState)

Both of these use a jQuery-esque syntax, so your comment regarding DOM traversal in PHP is a moot point.

tehwebguy12y ago

Yeah the CSS style selectors and methods are the same, I assumed he was referring to the fact that it's all JS.

When you are scraping it's great to be able to do a test run in the browser console and then just paste the code into your node script without any language porting.

It's not an argument that it's better or faster or anything than PHP, just that some find it easier to hack a scraper together in this way.

wldlyinaccurate12y ago

> Even if Node was 5x slower than PHP I would still go for Node because of its easy jQuery syntax

That "jQuery syntax" has nothing to do with the language itself. jQuery uses Sizzle[0], which is a CSS selector library for JavaScript. There are plenty of PHP libraries which provide CSS selectors, such as the Symfony CssSelector component[1].

[0] https://github.com/jquery/sizzle

[1] https://github.com/symfony/CssSelector

deanclatworthy12y ago

The argument you really should be making is that the Javascript syntax is familiar. jQuery and it's methods for traversing the DOM can trivially be implemented in any langauge. e.g. PHP:

http://symfony.com/doc/current/components/dom_crawler.html#n...

j / k navigate · click thread line to collapse

82 comments

eknkc12y ago

Node benchmark is flawed though. Add something like

require('http').globalAgent.maxSockets = 64;

at the top of node script if you want a fair comparison with async php version. The bottleneck is bandwidth here. Not the runtime.

On my laptop, original script from the author took 35 seconds to complete.

With maxAgents = 64, it took 10 seconds.

Edit: And who is downvoting this? I just provided actual numbers and a way to reproduce them. If you don't like how the universe works, don't take it out on me.

STRML12y ago

This is the "insane default" that substack goes on a rant about in the hyperquest README: https://github.com/substack/hyperquest#rant

  There is a default connection pool of 5 requests. 
  If you have 5 or more extant http requests, any 
  additional requests will HANG for NO GOOD REASON.

This one trips up people on #node.js constantly and hopefully will be removed very soon.

IsaacSchlueter12y ago

It's fixed in master already. Test with the latest 0.11 to verify.

dspillett12y ago

Unless this is what has changed I'd say the default is sane and has a good reason. The doc (http://nodejs.org/docs/latest/api/http.html#http_agent_maxso...) states:

> agent.maxSockets: By default set to 5. Determines how many concurrent sockets the agent can have open per host.

1 more reply

philstu12y ago

Nobody should be downvoting you, you raise an excellent point and you are of course right.

NodeJS v0.10.21 + Cheerio

real 0m47.986s user 0m7.252s sys 0m1.080s

NodeJS v0.10.21 + Cheerio + 64 connections

real 0m14.475s user 0m8.853s sys 0m1.696s

PHP 5.5.5 + ReactPHP + phpQuery

real 0m15.989s user 0m11.125s sys 0m1.668s

philstu12y ago

Proof: https://pbs.twimg.com/media/BY4jok7IEAADGmE.png:large

dalore12y ago

They mention this in the update:

CJefferson12y ago

It think the problem is you are feeding into the same loop. There are also probably ways of making the PHP script go faster still.

Also, that seems like a bit of a magic flag to add / tune, why is that not the default, and would I have to keep tuning it for each of my apps?

dspillett12y ago

1 more reply

k3n12y ago

1 more reply

filipedeschamps12y ago

I didn't know about this maxSockets limitation.

Is this something safe to raise?

eknkc12y ago

Default is 5. Should be just fine if you don't have a specific use case that would require higher limits.

1 more reply

iaminsane12y ago

http://nodejs.org/docs/latest/api/http.html#http_agent_maxso...

If you use it smart its safe enought

STRML12y ago

It's pretty much always safe to set to Infinity or to turn the agent off. This is an anti-feature.

1 more reply

dubcanada12y ago

However you want to know what this benchmark proves? Absolutely nothing as it has to query a website. So the response time of the website matters more then this test.

gopalv12y ago

Long term php guy (I maintained APC for years, slowly given up now), so I've worked a lot with ~2k/3k request-per-second PHP websites.

The real trick here is async processing. A lot of the slow bits of PHP code is people not writing async data patterns.

If you use synchronous calls in PHP - mc::get or mysql or curl calls, then PHP absolutely sucks in performance.

Nodejs automatically trains you around this with a massive use of callbacks for everything. That is the canonical way to do things - while in PHP blocking single-threaded calls is what everyone uses.

For instance, I had a curl implementation which fetched multiple http requests in parallel and essentially lets the UI wait for each webservices call at the html block where it was needed.

https://github.com/zynga/zperfmon/blob/master/server/web_ui/...

There was a similar Memcache async implementation, particularly for the cache writebacks (memcache NOREPLY). Memcache multi-get calls to batch together key fetches and so on.

The real issue is that this is engineering work on top of the language instead of being built into the "one true way".

Osiris12y ago

What are best practices for writing async code in PHP?

gopalv12y ago

A lot of extensions expose async modes, use them.

On the extension APIs

curl-multi - http://php.net/manual/en/function.curl-multi-select.php

memcached-getdelayed - http://us2.php.net/manual/en/memcached.getdelayed.php

mysqli-reap_async - http://us2.php.net/manual/en/mysqli.reap-async-query.php

postgres-send_query - http://www.php.net/manual/en/function.pg-send-query.php

gearman doBackground - http://www.php.net/manual/en/gearmanclient.dobackground.php

https://github.com/zynga/zperfmon/blob/master/server/schemas...

So the code above is horribly written because you can't parameterize table names or column names in PL/SQL. But that essentially cuts down the involvement PHP has with the backend's locked sections.

Also a lot of the stats data was flooded onto apache log files instead of being written out from the PHP code directly using an fwrite.

https://github.com/zynga/zperfmon/blob/master/client/zperfmo...

This uses apache_note() function in PHP to log stuff after the request is done & the connections are closed. That gets into the log files as %(<name)n fields in the access log.

You can see there that every single access log has an associated user, the HMAC of the request and peak memory usage. All collected at zero latency to the actual HTTP call.

The thing to avoid though is pcntl - it absolutely messes up all of apache/fastcgi process management code.

This is not all of what I've done. I am sorry to say some of my best work in this hasn't been open-sourced & has perhaps been killed since I left Zynga.

PHP backends I built using these methods were handling approx ~6-7 million users a day on 9 web servers (well, we kept 16 running - 8 on each UPS).

Ah, fun times indeed - too bad I didn't make any real money out of all that.

senorcastro12y ago

tegeek12y ago

I'm sick of all of these generic SPEED benchmarks. Let me tell you some BIGGEST & REAL benefits of NodeJS where PHP SUCKS.

1. Takes 1 minute to install on any platform (*nix, windows etc.)

2. A modern Package Manager (NPM) works seamlessly with all platforms.

3. All libraries started from 0 with async baked in from day 0.

4. No need to use any 3rd party JSON serialize/deserialize libs.

5. And above all, its Atwood's law

"any application that can be written in JavaScript, will eventually be written in JavaScript".

http://www.codinghorror.com/blog/2009/08/all-programming-is-...

Wilya12y ago

Not meaning to troll but:

4. json_decode() ?

5. If Atwood's law ever becomes reality, it will be a consequence, not a source of benefit.

(I don't use either Node or PHP as my main language)

tegeek12y ago

NodeJS literally takes 5 minutes to get started writing scalable apps without even thinking about concurency at all.

Write a software in Php yourself and let people download and run themselves, there are endless pain. Actually Php sucks in many areas which I dont want to touch right now.

RRRA12y ago

Actually Debian testing has : nodejs 0.10.21~dfsg1-1

debaserab212y ago

I think the only valid point here is your 3rd one.

1. You have to be kidding, right? PHP's popularity is precisely because of this.

2. getcomposer.org

4. json_decode/json_encode have been a part of PHP since PHP 5.2 (2006)

5. That's not a benefit.

bpicolo12y ago

No, it's popularity is because it was the simplest a long time ago, and it's supported on lots of virtual boxes by default.

Composer is terrible compared to NPM.

2 more replies

RossM12y ago

I'm not 100% on whether this is serious or not, but what the hell:

1. My recent installs of node have required compiling from source to get anything remotely up-to-date, however there are packages for both,

2. Composer with the Packagist registry is comparable here - you might be thinking of PEAR.

3. JS certainly has much better async support - it being JavaScript after all.

4. PHP has JSON encoding/decoding bundled, no third party lib required.

5. For better or worse

octo_t12y ago

Yet there are no decent XML libraries for node.js.

I'd trade decent JSON support for decent XML support every single day of the week.

And Scala/Java/JVM have already solved the problems you mention above.

simonw12y ago

"I'd trade decent JSON support for decent XML support every single day of the week."

Just out of interest, why is that?

I work with JSON at least once a week, but it can be months between moments when I need to work with any XML.

glenjamin12y ago

I quite like https://npmjs.org/package/sax

What is it you think is missing XML-wise in Node.js?

tegeek12y ago

1. libxmljs can take you pretty far with NodeJS.

2. Whatever we're saying here includes only NodeJS vs PHP.

jbeja12y ago

You seriously don't know what are you talking about.

balac12y ago

All your points are good apart from 4, don't the native json_encode() / json_decode() functions do what you need?

girvo12y ago

Actually, only 3 is good. The rest are incorrect, entirely, and 5 is debatable as to whether that's a good thing.

ohwp12y ago

Benchmarking is very hard because even the same language could show different results.

For example:

  for($i = 0; $i < count($list); $i++)

  $count = count($list);
  for($i = 0; $i < $count; $i++)

Most of the time benchmarks prove how capable a programmer is, not the speed of the language used.

humanrebar12y ago

That is, one could argue that a good language is one that lets developers ignore trivial changes like this without hurting performance.

Jach12y ago

likeclockwork12y ago

I don't see how that would work. I'd say that the two snippets describe different intentions and using one when you want the other is a case of not saying what you mean.

A function call in a loop condition might have side effects or do something very unorthodox.

1 more reply

disdev12y ago

Good test.

At some point, you'd expect these arbitrary this vs. that comparisons to die off. They haven't, and I'm guessing they won't.

Basically, it comes down to picking the tool that best supports your use case, or being okay with a compromise. Like the SQL/NoSQL discussions recently... Use it poorly and you get poor results.

idProQuo12y ago

I was always under the impression that PHP hate comes from design flaws that make it easy to make mistakes, not from it being necessarily slower.

CmonDev12y ago

Node.js is also based around a subpar language, so it's not necessarily the main cause either.

aaronem12y ago

2 more replies

geerlingguy12y ago

The issue I have is that Node.js makes asynchronous http calls relatively easy, whereas in PHP, using curl_multi_exec is kludgy, and few libraries support asynchronous requests.

lukeholder12y ago

I agree that the comparisons are often unfair between languages/frameworks, and agree with everything phil says, but there is a lot to be said for language level non-blocking constructs.

joeblau12y ago

One funny thing is that the ReactPHP[1] site is visually similar to the Node[2] homepage.

[1] - http://reactphp.org/ [2] - http://nodejs.org/

girvo12y ago

React was originally called Node.PHP or something similar, if I recall correctly!

phpnode12y ago

http://nodephp.org originally

prima-facie12y ago

Doing async in PHP still feels like strapping a dildo to a horse and calling it an unicorn.

erikig12y ago

I'm a PHP user/fan and I find this as funny as it is true.

jbeja12y ago

Waow i never tough that i would actually read the word "fan" along side php, it make my day.

1 more reply

dude312y ago

alextingle12y ago

Finally, proof that you can put lipstick on a pig.

ausjke12y ago

ReactPHP does look great, hope it will be in stable enough state for production usage. Also it seems it's only done by one person?

philstu12y ago

Two core devs with a few contributors.

dubcanada12y ago

I've used ReactPHP in production for a while, it's stable enough.

hugofirth12y ago

Only a brit would use Codswallop :)

onion2k12y ago

I read the entire article in the accent of a West Country farmer. And it still made sense. :)

philstu12y ago

http://i.telegraph.co.uk/multimedia/archive/01603/wurzels_16...

alexyoung12y ago

Node isn't a framework.

jlebrech12y ago

so how does php fare when scraping javascript apps?

denysonique12y ago

Node.js for web scraping usually is the obvious choice:

Scraping using jQuery syntax such as:

  $('table tr').each(function(ix, el) {
    names   .push($(el).find('td').eq(0));
    surnames.push($(el).find('td').eq(1));
  })

is more familiar to most web developers as opposed to the PHP syntax.

Even if Node was 5x slower than PHP I would still go for Node because of its easy jQuery syntax.

jqueryin12y ago

Did you bother to read the post? It pitted two different DOM traversal libraries against each other:

* cheerio (https://github.com/MatthewMueller/cheerio)

* PhpQuery (https://code.google.com/p/phpquery/wiki/jQueryPortingState)

Both of these use a jQuery-esque syntax, so your comment regarding DOM traversal in PHP is a moot point.

tehwebguy12y ago

Yeah the CSS style selectors and methods are the same, I assumed he was referring to the fact that it's all JS.

When you are scraping it's great to be able to do a test run in the browser console and then just paste the code into your node script without any language porting.

It's not an argument that it's better or faster or anything than PHP, just that some find it easier to hack a scraper together in this way.

wldlyinaccurate12y ago

> Even if Node was 5x slower than PHP I would still go for Node because of its easy jQuery syntax

[0] https://github.com/jquery/sizzle

[1] https://github.com/symfony/CssSelector

deanclatworthy12y ago

The argument you really should be making is that the Javascript syntax is familiar. jQuery and it's methods for traversing the DOM can trivially be implemented in any langauge. e.g. PHP:

http://symfony.com/doc/current/components/dom_crawler.html#n...

j / k navigate · click thread line to collapse