“You meant to install ripgrep”

ncmncm3y ago

Sorry, how does ripgrep save you tens of hours? I get that it is faster than regular grep, but that doesn't really answer the question; I don't find myself stalled waiting for grep. The only reasonable explanation would be something ripgrep does that grep actually doesn't. I could try to guess, but have no confidence I would guess right.

wincy3y ago

I use ripgrep on git bash for Windows, and my team members act like I’ve got searching superpowers. Searching on windows is such a pain and this makes it easy. Thanks so much for making this fantastic tool!

stjohnswarts3y ago

I use ripgrep and "Everything" all the time on Windows to find files and where I put stuff :) . My filing system leaves something to be desired, as do my many idea test folders.

burntsushi3y ago

Hah, w00t. Have your teammates figured out that your superpowers are teachable? :-)

locusofself3y ago

doesn't vscode use ripgrep under the hood when you search your open files/repos?

pjmlp3y ago

Maybe it has been Stockholm syndrome, but findstr/PowerShell have served perfectly alright.

BossingAround3y ago

Would you consider taking the "rg" package and redirecting people to ripgrep? I mean, asking the current owner to kindly donate it to you.

burntsushi3y ago

Yes, I'd be fine with that. I just spent a few minutes looking for their contact info, but I can't find it.

EDIT: Found their email via git. Always forget about that one.

https://github.com/junegunn/fzf/blob/master/ADVANCED.md#ripg...

skunkworker3y ago

`ripgrep` is my favorite of the "new" linux utilities, it makes searching for a single string across all of my cloned repos extremely easy, and especially for diving into multiple versions of vendored dependencies.

stinos3y ago

`ripgrep` is my favorite of the "new" linux utilities

Yeah, but only when used together with fzf, the other favorite new cross-platform shell utility. I mean, after rg spits out a list you do want to narrow it down and then do something with the files in that list, right?

ivanche3y ago

I love HN because of this :) Thank you so much for making ripgrep! You made people's lives easier.

darksaints3y ago

Just want to thank you for ripgrep and your other rust contributions. Ripgrep is insanely powerful and is absolutely indispensable for me.

nicce3y ago

It is smart, in multiple ways. For some guys who don’t know why, it prevents supply chain attacks.

laughingbovine3y ago

Love your tool. Use it all the time. Thanks!!

salawat3y ago

You should not have done this unless you want to further normalize the practice of namespace squatting. This is the same type of behavior that leads to domain squatting. While arguably being slightly more benign in the sense of hedging against typosquatting, if everyone started doing things like that, we'd quickly begin to run into namespace exhaustion problems as people started ballooning their package namespace footprint.

Before you do something like that, always ask yourself: "What if everyone else started doing this?"

If the result feels like a nightmare in the making, don't do it.

burntsushi3y ago

> Before you do something like that, always ask yourself: "What if everyone else started doing this?"

No, I don't think so. There is no universality implied in my comment or in the specific practice here. You can make value judgments based on specific circumstances. For example:

* How many people try 'cargo install rg' and have it do the wrong thing? I'd say "probably a lot."

* Is 'rg' on its own something that is a likely useful or desirable name on its own? No, I don't think so.

This doesn't have to mean that everyone should do it for every possible alias of every crate out there. You can say things like "yeah I think it makes sense to squat a name here to improve failure modes for folks."

Other than that, I have squatted a few names before. I don't see anything wrong with the practice in and of itself. It's when it gets abused that it starts to become a problem.

woodruffw3y ago

I've worked on various package management ecosystems for close to a decade now, and I wouldn't qualify this (if 'burntsushi had done it) as namespace squatting. It's clearly not an attempt to reserve a name for unspecified future use (or as a potential typosquatting target); it's the name of the binary installed by the crate and an obvious mistake for an installing user to make.

Even flat namespaces are virtually infinite; a couple of extra names that correct user error do not pose a serious exhaustion risk.

maxbond3y ago

I'd like to note there are three perverse incentives that lead to abuses of public namespaces (that I am aware of - please tell me if I've missed any):

1.) The use of names as a speculative financial instrument (in all shades of grey, up to and including extortion for lapsed or stolen names)

2.) The use of names as vectors of attack, such as by exploiting typos or homographs (such as malicious packages)

3.) The reserving of names you don't have a sincere or immediate intention to use (hoarding/FOMO)

This isn't very much like the situation with domains, which is primarily a result of #1 (there is no market for crates.io names, as far as I'm aware). #3 is a problem to some degree on crates.io, my understanding is that they basically treat this as a human moderation problem. #2 is endemic to all package managers.

By putting a helpful instead of malicious package here, the community (and Richard Dodd in particular) are able to mitigate the hazard of #2 (unless this account is compromised or turns malicious - a better but imperfect situation). If a project called `rg` comes around, they can appeal to moderators to get this name, and probably succeed (as if this were a #3 problem).

This isn't a perfect way to do things by any means, but it seems like a decent balance of concerns to me.

meowface3y ago

I think this is akin to saying nytimes.com buying nyt.com and redirecting it to nytimes.com is domain squatting.

https://wiki.archlinux.org/title/PKGBUILD#provides

Dylan168073y ago

> Before you do something like that, always ask yourself: "What if everyone else started doing this?"

Seems fine to me. Something like one tenth of packages reserving a second name? Not a big deal.

0xbadcafebee3y ago

I'm fully in support of exhausting namespaces in programming languages. It's really annoying that people keep making one-off projects with weird names that reinvent the wheel.

In CPAN, you create a module with a hierarchical name (Net::LDAP), and people inherit from it and extend the namespace to add new functionality (Net::LDAP::Batch). Finding a package that does what you want is [relatively] easy. Old code gets maintained rather than somebody reinventing it for the 72nd time with a hodge-podge of functionality.

woodruffw3y ago

The classic version of this in the Ruby ecosystem is "bundle"[1], which helpfully installs `bundler` for you. Of the 6.7 million downloads it has, I'm probably in there a dozen or so times.

[1]: https://rubygems.org/gems/bundle

steveklabnik3y ago

There’s also “nokogirl”

willlll3y ago

I appreciate each and every download.

richdodd3y ago

Hi - author of `rg` here :'). I've transferred it over to BurntSushi, which will give people a bit more assurance that `rg` won't become malware in the future.

I also squatted `memap` and `memap2` for the same reasons.

I wonder if there is an algorithmic way to decide when two crate names are 'near' each other. Then, if you added a crate with `cargo add` and there is another similarly-named crate with much higher usage, a warning could be emitted.

*EDIT* I know there's already https://en.wikipedia.org/wiki/Levenshtein_distance, but I wonder if there is a better measure that looks at e.g. keyboard layouts and likely typos. I'm sure there will have been research done on this.
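As a rough illustration of the "nearness" check described above, here is a minimal Rust sketch of plain Levenshtein distance plus a hypothetical warning filter. The names `levenshtein` and `near_popular`, the 100x download ratio, and the idea of passing in a list of popular crates are all assumptions made for illustration; nothing here is part of cargo or crates.io, and keyboard layouts are not modelled.

```rust
/// Classic dynamic-programming Levenshtein (edit) distance,
/// computed over Unicode scalar values.
fn levenshtein(a: &str, b: &str) -> usize {
    let a: Vec<char> = a.chars().collect();
    let b: Vec<char> = b.chars().collect();
    // prev[j] holds the distance between the previous prefix of `a`
    // and the first j characters of `b`.
    let mut prev: Vec<usize> = (0..=b.len()).collect();
    for i in 1..=a.len() {
        // cur[0] = i: deleting i characters turns the prefix into "".
        let mut cur = vec![i; b.len() + 1];
        for j in 1..=b.len() {
            let cost = if a[i - 1] == b[j - 1] { 0 } else { 1 };
            cur[j] = (prev[j] + 1) // deletion
                .min(cur[j - 1] + 1) // insertion
                .min(prev[j - 1] + cost); // substitution or match
        }
        prev = cur;
    }
    prev[b.len()]
}

/// Hypothetical warning filter: return the popular crates that are within
/// `max_dist` edits of `requested` and have at least 100x its downloads.
/// The 100x ratio is an arbitrary illustrative threshold.
fn near_popular<'a>(
    requested: &str,
    requested_downloads: u64,
    popular: &'a [(&'a str, u64)],
    max_dist: usize,
) -> Vec<&'a str> {
    popular
        .iter()
        .filter(|(name, downloads)| {
            *name != requested
                && *downloads >= requested_downloads.saturating_mul(100)
                && levenshtein(requested, name) <= max_dist
        })
        .map(|(name, _)| *name)
        .collect()
}
```

Under these assumptions, `levenshtein("memap", "memmap")` is 1, so a threshold of 1 would flag `memap`. `levenshtein("rg", "ripgrep")` is 5, so an edit-distance check alone would not catch the `rg` case, which is a binary-name confusion rather than a typo.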

burntsushi3y ago

Thank you! I've updated the crate to use dtolnay's suggestion (a compilation error), added a short README and created a repo for it with a small FAQ: https://github.com/BurntSushi/rg-cratesio-typosquat

samatman3y ago

If a package manager is starting from zero, and wants to have a privileged namespace such that a short name has a canonical value, it would make sense for those packages to be able to include a list of strings which the package should also reserve.

That way "ripgrep" could include "rg", searching cargo for "rg" brings back "ripgrep", not a second package named "rg", and an install could tell the user the correct name for any attempt to install it.

This also covers typo-squats, so there would be no need for packages like "memap".

Obviously this represents a low-effort vector for massive squatting, so maintainers would need to be responsible for preventing that, and could add some typos themselves, being the ones which see the request for the mis-typed packages.
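The reserved-names idea above can be sketched in a few lines of Rust. This is a toy in-memory model under stated assumptions: `Registry`, `Lookup`, `publish`, and `lookup` are hypothetical names invented here, and no real package manager is claimed to work this way.

```rust
use std::collections::{HashMap, HashSet};

/// Result of looking a name up in the hypothetical registry.
#[derive(Debug, PartialEq)]
enum Lookup {
    /// The name is a real, installable package.
    Package(String),
    /// The name is only reserved; the user should install the owner instead.
    ReservedBy(String),
    /// Nobody has published or reserved this name.
    NotFound,
}

/// Toy registry: real package names plus reserved name -> owning package.
#[derive(Default)]
struct Registry {
    packages: HashSet<String>,
    reserved: HashMap<String, String>,
}

impl Registry {
    /// Publish `name` and reserve every string in `aliases` for it.
    /// Fails without changing anything if any of the names is already taken.
    fn publish(&mut self, name: &str, aliases: &[&str]) -> Result<(), String> {
        for candidate in std::iter::once(name).chain(aliases.iter().copied()) {
            if self.packages.contains(candidate) || self.reserved.contains_key(candidate) {
                return Err(format!("name `{}` is already taken", candidate));
            }
        }
        self.packages.insert(name.to_string());
        for alias in aliases {
            self.reserved.insert(alias.to_string(), name.to_string());
        }
        Ok(())
    }

    /// Resolve a requested name to a package, a reservation, or nothing.
    fn lookup(&self, name: &str) -> Lookup {
        if self.packages.contains(name) {
            Lookup::Package(name.to_string())
        } else if let Some(owner) = self.reserved.get(name) {
            Lookup::ReservedBy(owner.clone())
        } else {
            Lookup::NotFound
        }
    }
}
```

With `publish("ripgrep", &["rg"])`, a later `lookup("rg")` yields `ReservedBy("ripgrep")`, which an installer could turn into a "did you mean `ripgrep`?" message, and a later attempt to publish a separate `rg` package is rejected. The sketch deliberately leaves out the moderation question raised above, namely who decides which reservations are acceptable.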

karmanyaahm3y ago

While (afaik) this is not supposed to be used for typos, Arch Linux' provides enables 'synonyms' to be registered.

6keZbCECT2uB3y ago

I use fzf with a text file which lists all files installed by a package with the package name. That way if I know the header name, I can get the package name. If I know the package name, I can get all its files.

mherdeg3y ago

Wow, it's been years now since I typed "sl" at a terminal and got an ascii steam locomotive.

rsr3y ago

A friend of mine in college installed this on my laptop when I had my back turned. For a month or so, I wondered why anyone thought this feature was a good idea, and why no one I knew who used MacOS seemed to complain about it.

lifthrasiir3y ago

It was a helpful reminder to finally learn Ctrl-\ for SIGQUIT.

__henil3y ago

TIL!

jimjimjim3y ago

it's a nice touch that sl includes a man page and command line flags

micouayOP3y ago

`rg` has only one version, and one line of code:

    println!("You meant to install ripgrep: type `cargo uninstall rg` followed by `cargo install ripgrep`");

dtolnay3y ago

Seems like it would be better to contain:

    compile_error!("You meant to …");

so that the install would fail and `cargo uninstall rg` wouldn't be needed.

taink3y ago

Can always-failing-to-compile crates be deployed to the registry?

burntsushi3y ago

Yeah that sounds much nicer.

jfk133y ago

Huh - the same author also has https://crates.io/crates/memap and memap2, which explicitly say that they're "squatting to prevent a malicious typo package".

Not sure how to feel about this... on an individual-package level, it seems a sensible enough idea, but if it becomes a widespread practice, the namespace could get really cluttered.

sedatk3y ago

I guess "owner/packagename" convention could solve such issues as it's common with other package ecosystems.

burntsushi3y ago

Right. So then you add burnsushi/ripgrep instead. See the problem?

Namespaces are a solution or mitigation to some problem, but that problem is not malicious typo-squatting.

Macha3y ago

Just moves the problem to packagename/packagename looking like a more "official" source.

KMnO43y ago

> but if it becomes a widespread practice, the namespace could get really cluttered.

Crates.io is incredibly cluttered with namesquatting. It’s probably the worst package registry for it, even surpassing NPM.

Part of the problem is that they explicitly say name squatting isn’t against the rules.

filereaper3y ago

Can't tell you how often I've run: `pip install aws`

This installs a library by some authors not affiliated with AWS.

Instead of: `pip install awscli`

Which is what you expect.

RulerOf3y ago

I'm frequently worried that I'm going to install malware on my machine doing this one of these days.

noswi3y ago

What does one do if they wish to see the actual contents of this crate? The web interface I'm looking at contains no hints at peeking inside, not even direct archive download links, nothing.

I can't believe that a good way to see what's inside is to make a rust project, add the crate and then go searching around the local filesystem.

ripley123y ago

crates.io is a little bare-bones sometimes.

I usually use lib.rs instead: https://lib.rs/crates/rg

That has a link to source: https://docs.rs/crate/rg/0.1.0/source/

And here's the Rust code: https://docs.rs/crate/rg/0.1.0/source/src/main.rs

pie_flavor3y ago

The source is hosted alongside the documentation at https://docs.rs. But far simpler than that is just going to the prominent GitHub link.

maxbond3y ago

In this case, there isn't a GitHub link, as there's no repository in the Cargo.toml: https://docs.rs/crate/rg/0.1.0/source/Cargo.toml

remram3y ago

Similar in the Python world: https://pypi.org/project/sklearn/

This one just depends on the correct `scikit-learn` package though.

learndeeply3y ago

Same for: https://pypi.org/project/pytorch/

> You tried to install “pytorch”. The package named for PyTorch is “torch”

walthamstow3y ago

Also bs4 / beautifulsoup4

OJFord3y ago

These are both like numpy & pandas in always documenting with `import longname as ln` right? I think they bring it on themselves.

https://github.com/fregante/npm-helpful-typosquatting

fregante3y ago

Back when npm didn’t have any “similar name” restrictions I did the same for some popular packages. My redirects also helped me a couple of times when I wondered whether a package name had a dash or not.

Here’s what it looks like: https://www.npmjs.com/package/webext

typon3y ago

Neovim + Telescope + ripgrep. It's taken 30 years, but we finally have the perfect code navigation solution.

aidos3y ago

Amen to that!

rpigab3y ago

I never run cargo install or any other package manager download commands without checking the website of the package manager for the right name first, ensuring the author is right, and the commit/update history looks right.

I love Python but pip/pypi and imports always felt weird to me because of namespaces, package names, special imports "as", etc. Maybe this is a bias because I started using them when I was younger; now that I'm more experienced, I already know how to use most package managers.

BTW Ripgrep is awesome, I'm learning Rust and it's an inspiration to me, thanks burntsushi!

chlorion3y ago

It would be nice if crates supported being signed with GPG or minisign or whatever.

I can imagine for example, importing keys from only the authors that I think I can trust, and passing a flag to cargo that only allows using those packages for cargo install or cargo add.

In this case I think just checking the top level crates signature (and not dependencies) would be enough to mitigate a lot of issues including typo squatting.

burntsushi3y ago

'cargo crev' makes this kind of workflow possible: https://github.com/crev-dev/cargo-crev

richdodd3y ago

Can't recommend `cargo crev` enough!!! The more people use it, the more powerful it becomes.

ashishbijlani3y ago

Tools like Packj[1] that check for typosquatting and install packages under a sandbox can help in avoiding accidental installation of malicious packages. Disclaimer: I’m one of the devs.

1. https://github.com/ossillate-inc/packj

underyx3y ago

I maintain a Python package that parks names like this. There's a Python library called pypi-parker[0] that makes it really easy to do this via CI.

[0]: https://pypi.org/project/pypi-parker/

woodruffw3y ago

For what it's worth: using a tool like pypi-parker technically violates PEP 541[1], since it uploads projects with no functionality solely to reserve parts of the namespace. You may or may not get away with using it, depending on how you use it, but PyPI's admins (who I do not speak for) would be within their enumerated rights to ban any account that uses it to squat names.

[1]: https://peps.python.org/pep-0541/#invalid-projects

underyx3y ago

Thanks for flagging this, I was unaware! I agree with your assessment; I just hope that this is considered to not be in breach of the spirit of the PEP. It seems like the PEP intended to disallow squatting in terms of pre-emptively reserving and hogging names, the way domain squatters do it. So hopefully typosquatting prevention for the sake of security is considered fine by the admins; especially since our project was designated a 'critical project' and stricter security measures apply to our maintainers.

worewood3y ago

Why not just add ripgrep as a dependency, effectively making it an alias of the original package?

burntsushi3y ago

I wouldn't sign off on this personally. It makes auditing harder. You see `cargo install rg` somewhere, but you also see that `cargo install ripgrep` is what's listed in ripgrep's README. So now you wonder, is `cargo install rg` correct? Then maybe ripgrep has to add a note about this to the README, and maybe you see it, maybe you don't.

Better to just make `cargo install rg` fail so that it never worked in the first place. `cargo install ripgrep` is also more self-describing and gives you a better search engine query.

Longwelwind3y ago

Maybe it's only me, but I've never liked this kind of too-smart solution.

Let people make the mistake once and learn the correct package name, instead of relying on a hack and potentially introducing confusion later.

yellowapple3y ago

Not to mention adding a juicy target for malicious shenanigans.

kevincox3y ago

Would this install the binaries of the dependency to your $PATH? I would expect that only the top-level package would be "installed" that way.

3a2d293y ago

This would work, but I think hiding a package behind an alias is never a good idea.

stjohnswarts3y ago

Nah KISS

thombles3y ago

To my knowledge this is the only wrong crate I've ever installed due to my own error. It's... not a good feeling to read this message, even though the author turned out to be doing me a favour. :)

benreesman3y ago

In spite of some tense conversations with the author I am still a rg super fan: it’s fantastic, reliable, performant, well-maintained software and I would recommend it to anyone. It’s best in class.

seanw4443y ago

Is there no way to have a package mirror, or alias or something? Unless I'm missing a joke or something, this seems like an easily solvable problem.

woodruffw3y ago

Aliases turn a flat namespace into a potentially cyclical graph, and introduce all kinds of permission considerations (Should a non-owner be able to alias a project? If so, should they be allowed to update it?).

The solutions here are non-flat namespacing (which has worse UX, since `cargo install some-tool` now becomes `cargo install whats-their-handle-again/some-tool`) or some kind of content addressing (which is similarly bad for UX, if not worse). Most package indices choose neither, and "solve" the problem by playing whac-a-mole with abuse instead.
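To make the "potentially cyclical graph" point concrete, here is a small Rust sketch of alias resolution with cycle detection. It assumes a hypothetical alias table mapping each alias to exactly one target; `resolve` and the table layout are illustrative only and do not describe any real package index.

```rust
use std::collections::{HashMap, HashSet};

/// Follow alias edges starting at `name` until reaching a name that is not
/// itself an alias. Returns `Err` with the name at which a cycle was detected.
fn resolve<'a>(
    aliases: &'a HashMap<&'a str, &'a str>,
    name: &'a str,
) -> Result<&'a str, &'a str> {
    let mut seen = HashSet::new();
    let mut current = name;
    while let Some(&target) = aliases.get(current) {
        // `insert` returns false if `current` was already visited,
        // which means the alias chain loops back on itself.
        if !seen.insert(current) {
            return Err(current);
        }
        current = target;
    }
    Ok(current)
}
```

In this model, a table containing only `rg -> ripgrep` resolves `rg` to `ripgrep`, while a table containing `a -> b` and `b -> a` makes `resolve` return an error instead of looping forever. The permission questions raised above (who may create or update an alias) are a separate problem that this sketch does not address.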

tomjakubowski3y ago

Clojars has the right idea for namespacing: some-tool is an alias for some-tool/some-tool.

This means the first package to squat on the name can use the shorthand version, while allowing other packages with the same name in other namespaces. (which may be forks or entirely different packages)
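The shorthand rule described here is easy to state in code. The following Rust sketch is an assumption-laden illustration of the idea as the comment describes it, not Clojars' actual implementation; `canonical_coordinate` is a hypothetical helper name.

```rust
/// Expand a bare name such as `some-tool` to `some-tool/some-tool`.
/// A name that already contains a group (`group/name`) is left unchanged.
fn canonical_coordinate(input: &str) -> String {
    match input.split_once('/') {
        Some((group, name)) => format!("{}/{}", group, name),
        None => format!("{}/{}", input, input),
    }
}
```

Here `canonical_coordinate("some-tool")` gives `some-tool/some-tool`, and `canonical_coordinate("burntsushi/ripgrep")` is returned as-is, so only the holder of the `some-tool` group gets the short form.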

VWWHFSfQ3y ago

I think you would rather have explicitly named dependencies. I don't want a bunch of aliased dependencies redirecting to wherever

stjohnswarts3y ago

I'm the same, I'd rather it be broken so I can figure out what's going on rather than bounced around all over the place.

kevincox3y ago

It is likely better to get the error and fix the mistake than to rely on a redirect owned and operated by who-knows-who indefinitely.

bmn__3y ago

Similar: http://p3rl.org/install

debacle3y ago

What is ripgrep?

Edit: Because I'm on a Zoom call that will never end.

"ripgrep is a line-oriented search tool that recursively searches the current directory for a regex pattern. By default, ripgrep will respect gitignore rules and automatically skip hidden files/directories and binary files."

kibwen3y ago

A grep alternative that optimizes for performance: https://github.com/BurntSushi/ripgrep . There are detailed performance comparisons and discussions in the readme there.

tmtvl3y ago

A file searcher akin to grep, ack, or ag (aka the silver searcher). It's programmed in Rust, so it is decently fast with good support for UTF-8.

Unfortunately it defaults to parsing a git tree's gitignore file and skipping over files listed in it.

burntsushi3y ago

It also defaults to ignoring hidden and binary files. It's also simultaneously the thing folks cite as their favorite part about ripgrep.

The idea behind it is that it acts as a heuristic for reducing false positives from your search results. For example, ripgrep replaced several little grep wrapper scripts I had in ~/bin.

And fortunately the default behavior is easy to disable. `rg -uuu foo` will search the same stuff as `grep -r foo ./`, but will do it faster.

Sohcahtoa823y ago

> Unfortunately it defaults to parsing a git tree's gitignore file and skipping over files listed in it.

That's a feature.

Like, it's the entire point of ripgrep. It's designed to search through the things a developer actually cares about searching through.

If you actually want to search everything, just use grep.

kevincox3y ago

"decently fast" is a significant understatement. It is likely the fastest similar tool. (`git grep` may win due to not listing the directory tree and packed files and GNU grep is very fast if you don't use Unicode, but other than that ripgrep wins).

stjohnswarts3y ago

just use --hidden, people shouldn't be afraid of typing an additional word. I prefer the defaults to keep things clean.

enriquto3y ago

> it defaults to parsing a git tree's gitignore file and skipping over files listed in it

Is that true? How could anybody think that this non-orthogonal monstrosity would make any sense?


c7DJTLrn3y ago

Crates should be namespaced by user. This is a disaster waiting to happen.

remram3y ago

Do you change the name every time there is a change in the maintainers' team?

If you have `ripgrep-team/ripgrep` rather than `ripgrep`, it doesn't help at all with people typing the wrong thing, like `rg-team/rg`. I fail to see how it helps.

It's even worse with packages that are (currently) authored by a single person, how many people know the name of ripgrep's author? Or rand? Or bevy?

pornel3y ago

Then you'd have people installing "burnedsushi/ripgrep" instead of "burntsushi/ripgrep". It only kicks the problem one step down without fixing it.

jrochkind13y ago

worse, if the correct one was `burntsushi/ripgrep`, someone else would just squat `ripgrep/ripgrep`.

stjohnswarts3y ago

I don't think so. ripgrep could easily become super cluttered if any john-joe-jimmy-larry could namespace it

PartiallyTyped3y ago

I agree, same for PyPI and all package repositories.

totorovirus3y ago

Why no alias linking or clone? Is it technically impossible?

low_tech_punk3y ago

What is in the mysterious `rg` crate? There is no doc.

secondcoming3y ago

Is it faster than Silver Searcher (ag)?

burntsushi3y ago

Yes. And less buggy.

If someone can find a meaningful case where ag is faster than ripgrep, then I'm happy to accept a bug report. I'll do my best at that point to give an analysis of the benchmark, and if it's correct, I'll either try to fix it or say why it's hard to fix.

By "meaningful" I mean "something that is noticeable to humans." So for example, reporting a bug because ripgrep took 9ms and ag took 7ms on a tiny repo is one I would consider not meaningful. :)

(Sorry about the verbose caveats, but just trying to head off responses I've got in the past.)

Lapsa3y ago

long time silver searcher user here. made a switch to rg and haven't looked back. although ag is still a tool of that rare breed I really have nothing bad to say about.

notorandit3y ago

Life is too short to spend time just to know what ripgrep is/does. I mean, yes, you all look so cool and I look so dumb. But, c'mon, is this software so complex that its description doesn't fit 42 words? It's like having a shop with no sign and no window. Everyone is saying it's great and worth shopping. But still no sign. Ridiculous.

burntsushi3y ago

What are you talking about? If you bothered to search for it and go to the shop, you'd see there is not only a sign, but a huge fucking window with several signs helpfully telling you what the shop offers and even going so far as telling you when you might not find the shop useful.

There's even a huge sign with only 12 words pithily explaining what the shop has inside.

Lapsa3y ago

you search stuff with it

harry83y ago

So it's grep like i already have installed and know its quirks?


159 comments

burntsushi3y ago

Hah! TIL. I had no idea someone did this. But it's smart. I should have thought of it!

(I'm the author of ripgrep.)

aryik3y ago

Unrelated, but thank you for your work!

burntsushi3y ago

w00t! Thanks for the kind words. :-)

ncmncm3y ago

wincy3y ago

stjohnswarts3y ago

I use ripgrep and "everything" all the time on windows to find files and where I put stuff :) . My filing system leaves something to be desired, as does my many idea test folders.

burntsushi3y ago

Hah, w00t. Have your teammates figured out that your superpowers are teachable? :-)

locusofself3y ago

doesn't vscode use ripgrep under the hood when you search your open files/repos?

pjmlp3y ago

Maybe it has been Stockholm syndrom, but findstr/PowerShell have served perfectly alright.

BossingAround3y ago

Would you consider taking the "rg" package and redirecting people to ripgrep? I mean, asking the current owner to kindly donate it to you.

burntsushi3y ago

Yes, I'd be fine with that. I just spent a few minutes looking for their contact info, but I can't find it.

EDIT: Found their email via git. Always forget about that one.

https://github.com/junegunn/fzf/blob/master/ADVANCED.md#ripg...

skunkworker3y ago

stinos3y ago

`ripgrep` is my favorite of the "new" linux utilities

ivanche3y ago

I love HN because of this :) Thank you so much for making ripgrep! You made people's lives easier.

darksaints3y ago

Just want to thank you for ripgrep and your other rust contributions. Ripgrep is insanely powerful and is absolutely indispensable for me.

nicce3y ago

It is smart, in multiple ways. For some guys who don’t know why, it prevents supply chain attacks.

laughingbovine3y ago

Love your tool. Use it all the time. Thanks!!

salawat3y ago

Before you do something like that, always ask yourself: "What if everyone else started doing this?"

If the result feels like a nightmare in the making, don't do it.

burntsushi3y ago

> Before you do something like that, always ask yourself: "What if everyone else started doing this?"

No, I don't think so. There is no universality implied in my comment or in the specific practice here. You can make value judgments based on specific circumstances. For example:

* How many people try 'cargo install rg' and have it do the wrong thing? I'd say "probably a lot."

* Is 'rg' on its own something that is a likely useful or desirable name on its own? No, I don't think so.

Other than that, I have squatted a few names before. I don't see anything wrong with the practice in and of itself. It's when it gets abused that it starts to become a problem.

woodruffw3y ago

Even flat namespaces are virtually infinite; a couple of extra names that correct user error do not pose a serious exhaustion risk.

maxbond3y ago

I'd like to note there are three perverse incentives that lead to abuses of public namespaces (that I am aware of - please tell me if I've missed any):

1.) The use of names as a speculative financial instrument (in all shades of grey, up to and including extortion for lapsed or stolen names)

2.) The use of names as vectors of attack, such as by exploiting typos or homographs (such as malicious packages)

3.) The reserving of names you don't have a sincere or immediate intention to use (hoarding/FOMO)

This isn't a perfect way to do things by any means, but it seems like a decent balance of concerns to me.

meowface3y ago

I think this is akin to saying nytimes.com buying nyt.com and redirecting it to nytimes.com is domain squatting.

https://wiki.archlinux.org/title/PKGBUILD#provides

Dylan168073y ago

> Before you do something like that, always ask yourself: "What if everyone else started doing this?"

Seems fine to me. Something like one tenth of packages reserving a second name? Not a big deal.

0xbadcafebee3y ago

I'm fully in support of exhausting namespaces in programming languages. It's really annoying that people keep making one-off projects with weird names that reinvent the wheel.

woodruffw3y ago

This classic version of this in the Ruby ecosystem is "bundle"[1], which helpfully installs `bundler` for you. Of the 6.7 million downloads it has, I'm probably in there a dozen or so times.

[1]: https://rubygems.org/gems/bundle

steveklabnik3y ago

There’s also “nokogirl”

willlll3y ago

I appreciate each and every download.

richdodd3y ago

Hi - author of `rg` here :'). I've transferred over to BurntSushi which will give people a bit more assurance that `rg` won't become malware in the future.

I also squatted `memap` and `memap2` for the same reasons.

burntsushi3y ago

samatman3y ago

This also covers typo-squats, so there would be no need for packages like "memap".

karmanyaahm3y ago

While (afaik) this is not supposed to be used for typos, Arch Linux' provides enables 'synonyms' to be registered.

6keZbCECT2uB3y ago

mherdeg3y ago

Wow, it's been years now since I typed "sl" at a terminal and got an ascii steam locomotive.

rsr3y ago

lifthrasiir3y ago

It was a helpful reminder to finally learn Ctrl-\ for SIGQUIT.

__henil3y ago

TIL!

jimjimjim3y ago

it's a nice touch that sl includes a man page and command line flags

micouayOP3y ago

`rg` has only one version, and one line of code:

    println!("You meant to install ripgrep: type `cargo uninstall rg` followed by `cargo install ripgrep`");

dtolnay3y ago

Seems like it would be better to contain:

    compile_error!("You meant to …");

so that the install would fail and `cargo uninstall rg` wouldn't be needed.

taink3y ago

Can always-failing-to-compile crates be deployed to the registry?

burntsushi3y ago

Yeah that sounds much nicer.

jfk133y ago

Huh - the same author also has https://crates.io/crates/memap and memap2, which explicitly say that they're "squatting to prevent a malicious typo package".

Not sure how to feel about this... on an individual-package level, it seems a sensible enough idea, but if it becomes a widespread practice, the namespace could get really cluttered.

sedatk3y ago

I guess "owner/packagename" convention could solve such issues as it's common with other package ecosystems.

burntsushi3y ago

Right. So then you add burnsushi/ripgrep instead. See the problem?

Namespaces are a solution or mitigation to some problem, but that problem is not malicious typo-squatting.

Macha3y ago

Just moves the problem to packagename/packagename looking like a more "official" source.

KMnO43y ago

> but if it becomes a widespread practice, the namespace could get really cluttered.

Crates.io is incredibly cluttered with namesquatting. It’s probably the worst package registry for it, even surpassing NPM.

Part of the problem is that they explicitly say name squatting isn’t against the rules.

filereaper3y ago

Can't tell you how often I've run: `pip install aws`

This installs a library by some authors not affiliated with AWS.

Instead of: `pip install awscli`

Which is what you expect.

RulerOf3y ago

I'm frequently worried that I'm going to install malware on my machine doing this one of these days.

noswi3y ago

What does one do if they wish to see the actual contents of this crate? The web interface I'm looking at contains no hints at peeking inside, not even direct archive download links, nothing.

I can't believe that a good way to see what's inside is to make a rust project, add the crate and then go searching around the local filesystem.

ripley123y ago

crates.io is a little bare-bones sometimes.

I usually use lib.rs instead: https://lib.rs/crates/rg

That has a link to source: https://docs.rs/crate/rg/0.1.0/source/

And here's the Rust code: https://docs.rs/crate/rg/0.1.0/source/src/main.rs

pie_flavor3y ago

The source is hosted alongside the documentation at https://docs.rs. But far simpler than that is just going to the prominent GitHub link.

maxbond3y ago

In this case, there isn't a GitHub link, as there's no repository in the Cargo.toml: https://docs.rs/crate/rg/0.1.0/source/Cargo.toml

remram3y ago

Similar in the Python world: https://pypi.org/project/sklearn/

This one just depends on the correct `scikit-learn` package though.

learndeeply3y ago

Same for: https://pypi.org/project/pytorch/

> You tried to install “pytorch”. The package named for PyTorch is “torch”

walthamstow3y ago

Also bs4 / beautifulsoup4

OJFord3y ago

These are both like numpy & pandas in always documenting with `import longname as ln` right? I think they bring it on themselves.

https://github.com/fregante/npm-helpful-typosquatting

fregante3y ago

Here’s what it looks like: https://www.npmjs.com/package/webext

typon3y ago

Neovim + Telescope + ripgrep. It's taken 30 years, but we finally have the perfect code navigation solution.

aidos3y ago

Amen to that!

rpigab3y ago

BTW Ripgrep is awesome, I'm learning Rust and it's an inspiration to me, thanks burntsushi!

chlorion3y ago

It would be nice if crates supported being signed with GPG or minisign or whatever.

I can imagine for example, importing keys from only the authors that I think I can trust, and passing a flag to cargo that only allows using those packages for cargo install or cargo add.

In this case I think just checking the top-level crate's signature (and not its dependencies) would be enough to mitigate a lot of issues, including typosquatting.

burntsushi3y ago

'cargo crev' makes this kind of workflow possible: https://github.com/crev-dev/cargo-crev

richdodd3y ago

Can't recommend `cargo crev` enough!!! The more people use it, the more powerful it becomes.

ashishbijlani3y ago

Tools like Packj[1] that check for typosquatting and install packages under a sandbox can help in avoiding accidental installation of malicious packages. Disclaimer: I’m one of the devs.

1. https://github.com/ossillate-inc/packj
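
For a rough idea of what a name-similarity check can look like, here is a minimal sketch in Rust. It is not Packj's actual method; it simply flags a requested name that is within a small edit distance of a known one. The function names, the `known` list, and the distance threshold are all assumptions made for illustration.

```rust
/// Classic dynamic-programming Levenshtein (edit) distance between two strings.
fn levenshtein(a: &str, b: &str) -> usize {
    let a: Vec<char> = a.chars().collect();
    let b: Vec<char> = b.chars().collect();
    // prev[j] is the distance between the first i-1 chars of `a` and the first j chars of `b`.
    let mut prev: Vec<usize> = (0..=b.len()).collect();
    for i in 1..=a.len() {
        // curr[0] = i: turning i chars into the empty string takes i deletions.
        let mut curr = vec![i; b.len() + 1];
        for j in 1..=b.len() {
            let cost = if a[i - 1] == b[j - 1] { 0 } else { 1 };
            curr[j] = (prev[j] + 1) // deletion
                .min(curr[j - 1] + 1) // insertion
                .min(prev[j - 1] + cost); // substitution or match
        }
        prev = curr;
    }
    prev[b.len()]
}

/// Hypothetical helper: returns the known names that `candidate` is close to
/// (but not identical to), using `max_distance` as an arbitrary threshold.
fn possible_typosquats<'a>(
    candidate: &str,
    known: &[&'a str],
    max_distance: usize,
) -> Vec<&'a str> {
    known
        .iter()
        .copied()
        .filter(|name| {
            let d = levenshtein(candidate, name);
            d > 0 && d <= max_distance
        })
        .collect()
}

fn main() {
    // Illustrative list only; a real tool would use registry popularity data.
    let known = ["ripgrep", "memmap", "memmap2"];
    for candidate in ["memap", "rg", "ripgrep"] {
        let hits = possible_typosquats(candidate, &known, 1);
        println!("{} -> {:?}", candidate, hits);
    }
}
```

Note that a check like this would flag `memap` but not `rg`: an abbreviation such as `rg` is five edits away from `ripgrep`, so edit distance alone does not catch it.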

underyx3y ago

I maintain a Python package that parks names like this. There's a Python library called pypi-parker[0] that makes it really easy to do this via CI.

[0]: https://pypi.org/project/pypi-parker/

woodruffw3y ago

[1]: https://peps.python.org/pep-0541/#invalid-projects

underyx3y ago

worewood3y ago

Why not just add ripgrep as a dependency, effectively making it an alias of the original package?

burntsushi3y ago

Better to just make `cargo install rg` fail so that it never worked in the first place. `cargo install ripgrep` is also more self-describing and gives you a better search engine query.

Longwelwind3y ago

Maybe it's only me, but I've never liked this kind of too-smart solution.

Let people make the mistake once and learn the correct package name, instead of relying on a hack and potentially introducing confusion later.

yellowapple3y ago

Not to mention adding a juicy target for malicious shenanigans.

kevincox3y ago

Would this install the binaries of the dependency to your $PATH? I would expect that only the top-level package would be "installed" that way.

3a2d293y ago

This would work, but I think hiding a package behind an alias is never a good idea.

stjohnswarts3y ago

Nah KISS

thombles3y ago

To my knowledge this is the only wrong crate I've ever installed due to my own error. It's... not a good feeling to read this message, even though the author turned out to be doing me a favour. :)

benreesman3y ago

seanw4443y ago

Is there no way to have a package mirror, or alias or something? Unless I'm missing a joke or something, this seems like an easily solvable problem.

woodruffw3y ago

tomjakubowski3y ago

Clojars has the right idea for namespacing: some-tool is an alias for some-tool/some-tool.

VWWHFSfQ3y ago

I think you would rather have explicitly named dependencies. I don't want a bunch of aliased dependencies redirecting to wherever.

stjohnswarts3y ago

I'm the same, I'd rather it be broken so I can figure out what's going on rather than bounced around all over the place.

kevincox3y ago

It is likely better to get the error and fix the mistake than to rely indefinitely on a redirect owned and operated by who-knows-who.

bmn__3y ago

Similar: http://p3rl.org/install

debacle3y ago

What is ripgrep?

Edit: Because I'm on a Zoom call that will never end.

kibwen3y ago

A grep alternative that optimizes for performance: https://github.com/BurntSushi/ripgrep . There are detailed performance comparisons and discussions in the readme there.

tmtvl3y ago

A file searcher akin to grep, ack, or ag (aka the silver searcher). It's programmed in Rust, so it is decently fast with good support for UTF-8.

Unfortunately it defaults to parsing a git tree's gitignore file and skipping over files listed in it.

burntsushi3y ago

It also defaults to ignoring hidden and binary files. It's also simultaneously the thing folks cite as their favorite part about ripgrep.

The idea behind it is that it acts as a heuristic for reducing false positives in your search results. For example, ripgrep replaced several little grep wrapper scripts I had in ~/bin.

And fortunately the default behavior is easy to disable. `rg -uuu foo` will search the same stuff as `grep -r foo ./`, but will do it faster.
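
To make the three levels concrete, here is a simplified model in Rust of the default filters and how each extra `-u` relaxes one of them (`-u` stops respecting ignore files, `-uu` also searches hidden files, `-uuu` also searches binary files). This is only a sketch, not ripgrep's implementation: the `Candidate` struct, the helper names, and the "contains a NUL byte" binary test are assumptions made for illustration, and the gitignore decision is passed in as a precomputed flag rather than parsed.

```rust
/// Hypothetical description of a file the searcher is considering.
struct Candidate<'a> {
    path: &'a str,
    /// Whether some ignore rule (for example in .gitignore) matches this path.
    ignored_by_gitignore: bool,
    contents: &'a [u8],
}

/// A path counts as hidden if any component starts with a dot
/// (other than the special "." and ".." components).
fn is_hidden(path: &str) -> bool {
    path.split('/')
        .any(|part| part.starts_with('.') && part != "." && part != "..")
}

/// Simplified binary check: treat any file containing a NUL byte as binary.
fn looks_binary(contents: &[u8]) -> bool {
    contents.contains(&0)
}

/// `unrestricted` is the number of `-u` flags given (0 to 3).
fn should_search(file: &Candidate, unrestricted: u8) -> bool {
    if unrestricted < 1 && file.ignored_by_gitignore {
        return false; // default: respect ignore files
    }
    if unrestricted < 2 && is_hidden(file.path) {
        return false; // default and -u: skip hidden files
    }
    if unrestricted < 3 && looks_binary(file.contents) {
        return false; // everything short of -uuu: skip binary files
    }
    true
}

fn main() {
    let files = [
        Candidate { path: "src/main.rs", ignored_by_gitignore: false, contents: b"fn main() {}" },
        Candidate { path: "target/out.txt", ignored_by_gitignore: true, contents: b"foo" },
        Candidate { path: ".env", ignored_by_gitignore: false, contents: b"foo=1" },
        Candidate { path: "assets/logo.bin", ignored_by_gitignore: false, contents: b"\x00foo" },
    ];
    for level in 0..=3u8 {
        let searched: Vec<&str> = files
            .iter()
            .filter(|f| should_search(f, level))
            .map(|f| f.path)
            .collect();
        println!("{} x -u: {:?}", level, searched);
    }
}
```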

Sohcahtoa823y ago

> Unfortunately it defaults to parsing a git tree's gitignore file and skipping over files listed in it.

That's a feature.

Like, it's the entire point of ripgrep. It's designed to search through the things a developer actually cares about searching through.

If you actually want to search everything, just use grep.

kevincox3y ago

stjohnswarts3y ago

just use --hidden, people shouldn't be afraid of typing an additional word. I prefer the defaults to keep things clean.

enriquto3y ago

> it defaults to parsing a git tree's gitignore file and skipping over files listed in it

Is that true? How could anybody think that this non-orthogonal monstrosity would make any sense?

c7DJTLrn3y ago

Crates should be namespaced by user. This is a disaster waiting to happen.

remram3y ago

Do you change the name every time there is a change in the maintainers' team?

If you have `ripgrep-team/ripgrep` rather than `ripgrep`, it doesn't help at all with people typing the wrong thing, like `rg-team/rg`. I fail to see how it helps.

It's even worse with packages that are (currently) authored by a single person. How many people know the name of ripgrep's author? Or rand's? Or bevy's?

pornel3y ago

Then you'd have people installing "burnedsushi/ripgrep" instead of "burntsushi/ripgrep". It only kicks the problem one step down without fixing it.

jrochkind13y ago

worse, if the correct one was `burntsushi/ripgrep`, someone else would just squat `ripgrep/ripgrep`.

stjohnswarts3y ago

I don't think so. ripgrep could easily become super cluttered if any john-joe-jimmy-larry could namespace it

PartiallyTyped3y ago

I agree, same for PyPI and all package repositories.

totorovirus3y ago

Why no alias linking or clone? Is it technically impossible?

low_tech_punk3y ago

What is in the mysterious `rg` crate? There is no doc.

secondcoming3y ago

Is it faster than Silver Searcher (ag)?

burntsushi3y ago

Yes. And less buggy.

By "meaningful" I mean "something that is noticeable to humans." So for example, reporting a bug because ripgrep took 9ms and ag took 7ms on a tiny repo is one I would consider not meaningful. :)

(Sorry about the verbose caveats, but just trying to head off responses I've got in the past.)

Lapsa3y ago

long time silver searcher user here. made a switch to rg and haven't looked back. although ag is still a tool of that rare breed I really have nothing bad to say about.

notorandit3y ago

burntsushi3y ago

There's even a huge sign with only 12 words pithily explaining what the shop has inside.

Lapsa3y ago

you search stuff with it

harry83y ago

So it's grep like i already have installed and know its quirks?