People obviously still see value in discussing it
Google Photos in 2015: https://www.wired.com/story/when-it-comes-to-gorillas-google...
Flickr in 2015: https://www.independent.co.uk/life-style/gadgets-and-tech/ne...
Facebook, like a lot of tech companies, has long had problems with diversity in engineering. Here's an article from April that discusses specific incidents and the broader background: https://www.washingtonpost.com/technology/2021/04/06/faceboo...
As a person of color myself, I've struggled with people telling me that these FAANG companies have "diversity problems." A majority of software engineers are female and male immigrants from East Asia and South Asia, and those population centers are some of the most diverse regions of the world. The engineers who were hired by preparing for and passing these companies' selective, merit-based coding tests had to overcome adverse conditions in their home countries as well, including extreme poverty, starvation, and totalitarian regimes.
Why do they not count toward diversity, to some white and white-adjacent critics? What message are we sending to people who are ethnic minorities from certain groups, who earned their spots through merit and have also been targeted in recent newsworthy attacks, just as others have, when we make these kinds of accusations? What does a non-problematic ethnic composition look like? What are these companies doing right toward some minority groups and wrong toward others?
If that is the case, why is it that Google voice nav routinely butchers the names of places and roads in India in spite of having thousands of Indian engineers on staff?
Could we blame the intractability of the problem, or just plain old incompetence, before we blame every single problem in the world on racism and lack of 'diversity'?
Silly Google TTS, the proper pronunciation is obviously "Malcolm the Tenth" there.
Once you search for these:
https://www.google.com/search?q=human+female+face&tbm=isch
https://www.google.com/search?q=human+male+face&tbm=isch
You can see that 'human face' has a bit of post-hoc tuning.
There's no super reliable way to prevent this (with current tech) other than forbidding that output entirely.
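For what it's worth, "forbidding that output entirely" usually just means a post-hoc filter sitting between the model and the user. A minimal sketch, assuming a classifier that returns (label, confidence) pairs (the label set here is hypothetical, not Google's actual list):

    # Hypothetical post-hoc guard: strip sensitive labels outright
    # rather than trusting the model's confidence on them.
    BLOCKED_LABELS = {"gorilla", "chimpanzee", "monkey", "primate"}

    def filter_labels(predictions):
        """predictions: iterable of (label, confidence) pairs."""
        return [(label, conf) for label, conf in predictions
                if label.lower() not in BLOCKED_LABELS]

The model still makes the mistake internally; the filter just makes sure nobody ever sees it.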
https://i.ibb.co/Mf6rVdf/Screenshot-20210907-002516-Photos.j...
Nobody who has traveled at all would mistake my wife and child for Japanese. And doing so is especially insidious considering the Bataan Death March.
They probably are, but not good enough. These things can be surprisingly hard to detect. Post hoc, it is easy to see the bias; it isn't so easy before you deploy the models.
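One sketch of the kind of pre-deployment check that can surface this, assuming you have demographic annotations on a held-out set (which is itself often the hard part):

    import numpy as np

    def subgroup_error_rates(y_true, y_pred, groups):
        """Error rate per annotated subgroup on a held-out set."""
        return {g: float(np.mean(y_true[groups == g] != y_pred[groups == g]))
                for g in np.unique(groups)}

    # A large gap between subgroups is a red flag before you ship,
    # even if the aggregate accuracy looks fine.

Aggregate accuracy hides exactly the failures that matter here.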
If we take the racial connotations out of it, then we could say the algorithm is doing quite well, because it got the larger hierarchical class, primate, correct. The algorithm doesn't know the racial connotations; it just knows the data and the metric you were optimizing. BUT considering the racial and historical context this is NOT an acceptable answer (not even close).
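To make that "larger hierarchical class" point concrete, here's a toy sketch with a made-up label tree (the hierarchy and labels are illustrative, not any real system's taxonomy):

    # Toy label hierarchy: a hierarchy-aware metric scores "primate"
    # for a human as coarse-but-consistent; a flat metric scores it wrong.
    PARENT = {"human": "primate", "gorilla": "primate", "primate": "animal"}

    def hierarchical_match(pred, truth):
        """True if pred equals truth or any ancestor of truth."""
        node = truth
        while node is not None:
            if pred == node:
                return True
            node = PARENT.get(node)
        return False

    print(hierarchical_match("primate", "human"))  # True (a flat metric says wrong)

Which is exactly how a purely technical metric can look fine while the output is socially unacceptable.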
I've made a few comments in the past about bias and how many machine learning people are deploying models without understanding them. This is what happens when you don't try to understand statistics, and particularly long-tail distributions. gumboshoes mentioned that Google just removed the primate-type labels. That's a solution, but honestly not a great one (technically speaking). But this solution is far easier than technically fixing the problem (I'd wager that putting a strong loss penalty on misclassifying a Black person as an ape is not enough). If you follow the links from jcims then you might notice that a lot of those faces are white. Would it be all that surprising if Google trained on the FFHQ (Flickr) dataset?[0] A dataset known to have a strong bias towards white faces. We actually saw this when PULSE[1] turned Obama white (do note that if you didn't know the left picture was of a Black person, and who it was, the output is a decent (key word) representation). So it is pretty likely that _some_ problems could simply be fixed by better datasets (this was part of the LeCun controversy last year).
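As a sketch of what that "strong loss penalty" would even look like (the class indices and penalty value are made up; per the above, I'm not claiming this fixes anything):

    import torch

    NUM_CLASSES = 1000
    HUMAN, GORILLA = 0, 1  # hypothetical class indices

    # cost[i][j] = price of predicting class j when the truth is class i.
    cost = torch.ones(NUM_CLASSES, NUM_CLASSES)
    cost.fill_diagonal_(0.0)
    cost[HUMAN, GORILLA] = 100.0  # make this particular confusion very expensive

    def cost_sensitive_loss(logits, targets):
        """Expected misclassification cost under the predicted distribution."""
        probs = torch.softmax(logits, dim=-1)
        return (probs * cost[targets]).sum(dim=-1).mean()

Even with this, the model only avoids the confusions you anticipated and enumerated, which is the whole problem with long tails.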
Though datasets aren't the only problem here. ML can algorithmically highlight bias in datasets. Often research papers are metric hacking, going for the highest accuracy that they can get[2]. This leaderboardism undermines some of the usage, and often there's a disconnect between researchers and those in production. With large and complex datasets we might be chasing leaderboard scores until we reach a sufficient accuracy on that dataset before we start focusing on bias in that dataset (or, more often, we sadly just move to a more complex dataset and start the whole process over again). There are not many people working on the bias aspects of ML systems (both data bias and algorithmic bias), but as more people put these tools into production we're running into walls. Many of these people are not thinking about how these models were trained or the bias that they contain. They go to the leaderboard, pick the best pre-trained model, and hit go, maybe tuning on their own dataset. Tuning doesn't eliminate the bias from the pre-training (it can actually amplify it!). ~~Money~~Scale is NOT all you need, as GAMF often tries to sell (and some try to sell augmentation as all you need).
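That "pick the best pre-trained model and hit go" workflow, sketched with torchvision for concreteness (the point being that a frozen backbone keeps whatever the pre-training data baked in):

    import torch.nn as nn
    import torchvision.models as models

    # The typical production shortcut:
    model = models.resnet50(pretrained=True)   # features learned on someone else's data
    for p in model.parameters():
        p.requires_grad = False                # backbone frozen: its biases come along for free
    model.fc = nn.Linear(model.fc.in_features, 10)  # only this new head gets tuned

Nothing in that loop ever examines what the backbone learned, which is how pre-training bias rides into production unexamined.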
These problems won't be solved without significant research into both data and algorithmic bias. They won't be solved until those in production also understand these principles and robust testing methods are created to find these biases, and until people understand that a good ImageNet (or even JFT-300M) score doesn't mean your model will generalize well to real-world data (though there is a correlation).
So with that in mind, I'll make a prediction: rather than seeing fewer of these mistakes, we're going to see more (I'd actually argue that a lot of this is already happening that you just don't see). The AI hype isn't dying down, and more people are entering the field who don't want to learn the math. "Throw a neural net at it" is not and never will be the answer. Anyone saying that is selling snake oil.
I don't want people to think I'm anti-ML. In fact I'm a ML researcher. But there's a hard reality we need to face in our field. We've made a lot of progress in the last decade that is very exciting, but we've got a long way to go as well. We can't just have everyone focusing on leaderboard scores and expect to solve our problems.
[0] https://github.com/NVlabs/ffhq-dataset
[1] https://twitter.com/Chicken3gg/status/1274314622447820801
[2] https://twitter.com/emilymbender/status/1434874728682901507
I wonder how testing for that looks and sounds in a corporate environment. It may well be an area similar to patents: you pretend that you never heard of it, never discussed it, and God forbid there's any mention of it in corporate email/chat/etc., or a click on such a link from inside the corporate network...
Have we considered that AI and ML as a general brain replacement is a failed idea? That we humans feel we are so smart we can recreate or exceed millions of years of evolution of the human brain?
I'd never call AI a waste, it's not. But getting it to do human things just may be.
Even a child can tell the difference between a human of any color and an ape. How many billions have been spent trying, and failing, to exceed the bar of the thoughts of a human child?
I took a photo of the water pump from a car windscreen wiper and google was able to correctly identify what it was. I took a photo of a generic PCB which showed the back of a driver board for an LCD and google was able to bring up the exact type of board it was with the names of the ICs on it.
In these examples, Google Photos' AI has far exceeded what the average human can achieve. We just have to keep in mind that these systems are not perfect and only make a best guess, which should be verified by a person later.
The problem here is not that the mistake was very costly or disruptive to the function of the feature, but that the mistake was highly offensive, which is something very hard to avoid.
The problem it's solving is that it can do things that somebody with zero experience cannot. If you had an auto parts pro, or an EE, they probably could have done the same for you.
So, in general, AI is helpful because it has a much larger breadth of knowledge. Granted.
But I want examples of it doing depth, too.
My wife uses Lens when we fish. It's way, way worse than a fisherman with any experience at all.
Yes. It is currently known to fail at this prospect. It is an open research question as to whether current methods can be merely "scaled up" using more compute to achieve "general brain replacement". I personally am skeptical about that considering basic problems such as concept drift (but I am by no means an expert).
You define what counts as valuable to be arbitrarily difficult or inconceivable with current methods (because it's an area of open research), and then say we should divert course merely because we don't know it's possible?
> I'd never call AI a waste, it's not. But getting it to do human things just may be.
It already can do things previously thought to be exclusively "human" (such as beating top players at Go). Recently it also helped make significant advances in protein folding, which are sure to yield benefits to medical science at least indirectly. I believe this statement is either incorrect, or you're expecting people to have some strange definition of "exclusively human", which is of course also open research and unanswered.
Humans and machines are so different today. Of course machines beat us at number calculations and such. But we have organs that computers don't and can't have, and our brains are much more in tune with using those than with power-of-2 bit twiddling.
As we ourselves don't understand how it works, how can we ever write a machine that does?
Taken to the extreme, AI code is essentially something like:
    #include <stdlib.h>

    int add(int m, int n) {
        return m + n + rand();  /* the "right" answer, plus noise */
    }
In addition, it's being tested with a very small set of input data (relative to the complete set).

Maybe to your typical SGD-trained model, working off a dataset filled with mostly light-skinned people, skin tone just looks like a real solid first-order way to distinguish humans from primates, and picking up the Black people / primate distinction seems much more marginal and second-order in terms of impact on the cost function.
If most of the people in the dataset were black, I predict you wouldn't see this.
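A toy version of that intuition, with purely synthetic "features" (nothing here resembles a real vision pipeline; it just shows how imbalance steers the decision boundary):

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    rng = np.random.default_rng(0)
    n = 10_000

    # Made-up features [skin_tone, face_shape]; labels: 1 = human, 0 = primate.
    # 95% of the human examples are light-skinned, so skin tone alone
    # separates most of the training data.
    humans = np.column_stack([rng.random(n) < 0.95, np.ones(n)])
    primates = np.column_stack([np.zeros(n), rng.random(n) < 0.7])
    X = np.vstack([humans, primates]).astype(float)
    y = np.concatenate([np.ones(n), np.zeros(n)])

    clf = LogisticRegression().fit(X, y)
    print(clf.coef_)  # skin tone carries most of the weight

    # A dark-skinned human (skin_tone=0, face_shape=1) now lands on the
    # wrong side of the boundary, despite the human-like second feature.
    print(clf.predict_proba([[0.0, 1.0]]))

Flip the base rates and, as the parent says, the failure mode flips with them.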
I don't know Facebook's TOS sufficiently to know whether they are using private groups as source material, but if you're utilizing bigoted content to train pattern recognition, you will replicate bigoted content.
The AI is not that smart and these examples show it.
Humans are primates. It's weird that it selected such a broad label, but it didn't select an incorrect label.
e: I assume something similar has been done before by training a model on brown/black bears then throwing polar bears at it. Anyone know the outcome?
When I was quite young, I referred to some firefighters as robots.
which says a lot about the state of our allegedly human-outperforming AI
And I'd like to see a gorilla in any pose that's really hard (for a human) to differentiate from a person.
The truth is: the recognition algorithm is not very sophisticated after all.
Primates and humans are similar labels. This was almost certainly not intentional. Video classifiers are going to make mistakes - sometimes crude or offensive ones. I don't get outrage over labeling errors like this. Facebook should fix the issue - but they shouldn't apologize. It only encourages grievance seekers.
In every aspect of your life
No, I think it's racist because racists have a long history of calling black people primates, and because an automated system doesn't get to escape scrutiny and critique just because someone didn't specifically put in a line of code that emulates the actions of racists.
I understand that FB operates at a much bigger scale, but that's all the more reason to have a much more diverse set of eyes test their models before they go live.
If you want to avoid this, hire more black people, seriously.
I guess first step might be to "hire more black QA people".
"Oh, maybe we should look into that"
AI models are deterministic in a purely technical sense, but practically speaking, they are non-deterministic black boxes. It’s not as if you can write a unit test which generates all possible videos of black people and makes sure it never outputs “gorilla”.
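The closest practical substitute is probably a regression gate over a curated set of known-sensitive examples, something like this sketch (the file paths and the model.classify interface are hypothetical):

    # Curated cases where certain labels must never appear, checked
    # before every deploy. Not a proof of correctness, just a tripwire.
    SENSITIVE_CASES = [
        ("fixtures/person_001.jpg", {"gorilla", "chimpanzee", "primate"}),
        ("fixtures/person_002.jpg", {"gorilla", "chimpanzee", "primate"}),
    ]

    def check_forbidden_labels(model):
        failures = []
        for path, forbidden in SENSITIVE_CASES:
            predicted = {label for label, _ in model.classify(path)}
            if predicted & forbidden:
                failures.append((path, sorted(predicted & forbidden)))
        return failures  # an empty list means the gate passes

It only catches the failures you thought to enumerate, which is rather the parent's point.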
On the other hand, imagine a world where these labels were applied by a massive team of humans instead of a deep learning algorithm. At Facebook's scale, would the photos end up with more or less racist labels on average over time? My guess is that the model does a better job, but this is just another example of why we should be wary about trusting ML systems with important work.
One worries that the corporate overlords are preparing the legal system for complete impunity for manufacturers of self-driving cars. "Sorry your child is dead; the car did it, so there's no one to sue or convict."
I would say it's both. It's embarrassing for Facebook because it looks racist even though it really isn't. The system might be emotionless but the people who interact with it aren't, and we don't expect them to be.
Instead, I want to talk about pareidolia. Humans are social creatures. We have evolved to identify others of our kind and read their expressions. This was important to us, as we evolved alongside gorilla analogues as well, and the few of us that couldn't discern one face from another didn't usually last long.
I think we're trying to place too much of a human expectation onto these machines. I think that human features and primate features are strikingly similar, and it's our specialized brains that let us so easily discern them. Yes, with enough data and training we could have more accurate models, but we can't cry foul every time an algorithm doesn't behave like a human does.
Reference: https://www.reddit.com/r/Pareidolia/
So, this is going to happen.
Humans with a lot of experience are. Would kids be? I once referred to firefighters as robots as a kid.
Please do not trivialize acts that have the potential to cut humans so deeply with handwavy substantiations. Facebook should have known better, and done better.
When you have an automated system that has irregular behavior for a given input, we call that a bug. Bugs exist in all software, not always unique, but always present. This software is no different from any other. It will have errors. Because the software is categorizing faces, its errors will result in miscategorized faces. The only relevant questions are how frequent these errors are and how disparate they are across racial lines.
Another reference: this one is a Tool-Assisted Speedrun of a game that relies on basic image recognition software. While not entirely related, it does show how error-prone these algorithms can be. It's also fun to watch. https://youtu.be/mSFHKAvTGNk
Nobody likes the stories. No reasonable person is celebrating them. You’re not in disagreement with anyone.
These stories are about how we also deeply care about labels and categorization. Aren't we just looking at the natural selection (making them not "last long") of these way-too-rough AIs that step on boundaries that are pretty important to a lot of people?
Oh well, it's the times we live in.
If people simply laughed at the results and fixed the problems they'd miss all the endorphin rush of outrage.
From what I can tell the only fix here is a hardcoded workaround outside the net, or a substantially more powerful architecture.
I think the conversation can be made a lot simpler.
AI isn't ready for anything important. Done. That's it. If one of the pioneers in the field can't distinguish Black people from primates, it isn't ready for driving or war or legal matters or really anything of importance.
I think we (colloquial) made something kinda cool and jumped the gun on when and where to use it.
Facebook disabled Thai-to-English translation back in April because it translated the queen as “slut” and it’s been disabled since.
Maybe we should learn to accept non-fatal errors from applications instead of forcing things to stop entirely.
I find it ridiculous that my Photos app suggests I change monkey to “lemur” while I have plenty of photos of monkeys and zero of lemurs.
If you shine enough light on it, apparently the brand does. If a human were to do this, the company would immediately fire the employee and cut all ties with them. But as the article points out, 'fixing' an AI mistake isn't really a fix at all:
> [Google] said it was "appalled and genuinely sorry", though its fix, Wired reported in 2018, was simply to censor photo searches and tags for the word "gorilla".