AI Can Transform Anyone Into a Professional Dancer

(news.developer.nvidia.com)

222 points | tuckermi | 7y ago | 168 comments

I could see this being used in the same way that auto-tune type effects are added to singers' voices. The editing of music videos could include a tool like this to make sure all of the dancers behind the main one are moving just right.

And just like auto-tuned voices, it will come off as janky and fake.

valtism7y ago

The autotune you refer to as "janky and fake" is autotune used intentionally to create that sound as an artistic style. Autotune is used across most pop (and other) music, but in such a way that you never realise it is there.

It is similar to special effects. People complain about how bad and fake they look, but this is only for the effects that are bad enough to be noticeable. People don't realise the sheer amount of special effects used in scenes where they never notice them.

hyperpallium7y ago

But similarly, it could be used as a "janky and fake" artistic style. Because no one's ever seen it before, it will look wrong in unexpected ways, and therefore somewhat fascinating.

Stylus noise, fret noise and mp3 compression artefacts are other "mistakes" deliberately introduced.

sarreph7y ago

You just described T-Pain’s reasoning for using auto tune (he doesn’t actually need it but instead uses it stylistically).

I think you’re onto something here as well about how it may well get adopted as a ‘style’ in someone’s music video.

kevin_thibedeau7y ago

It is still noticeable in normal performances. You can hear unnatural overtones from autotune used as intended. Even more so when supposed singers use it live and have to stay glued to their mic to isolate any outside noise.

gpvos7y ago

Interesting! Do you have examples where it is possible to hear these unnatural overtones?

ssijak7y ago

There is nothing "artistic" in overusing autotune. It is just a tool to "mask" that the person can't really sing at all, and it is overused in popular/commercial genres where there is an abundance of untalented musicians. I can't imagine anyone saying that obvious autotuned songs sound good.

throwaway88797y ago

I don't understand this attitude at all. So something that doesn't seem artistic to you simply isn't artistic for anybody else because you decided that it isn't? Autotune is just another processing like reverb, delay, whatever. You think they didn't use any processing on Beatles' records? You think non-autotuned singers don't use a TC-helicon or whatever when they're performing live?

Autotuned songs can sound good, just as non-autotuned songs can sound bad. I'm as much a music elitist as anybody, but thinking that you have the one true objective idea of what music sounds good or bad is just childish.

dzjkb7y ago

>I can't imagine anyone saying that obvious autotuned songs sound good.

Well just look at the status quo of popular rap music then - tons of artists are aiming for the obvious autotune style and tons of people are listening to it (so I assume they think it sounds good). I'm not sure what you mean by "artistic", but it's definitely a sound people try to achieve on purpose in their music, so I think it does deserve some recognition as "art".

Krakadero7y ago

T-Pain is a great singer, and I'm pretty sure he's the first person a lot of people think about when they think about autotune.

https://www.youtube.com/watch?v=CIjXUg1s5gc

puranjay7y ago

Let me introduce you to Bon Iver's "22, A Million"

https://www.youtube.com/playlist?list=PLN61gg9VNXPomdZu0UY_w...

apengwin7y ago

Kanye West used autotune to great effect in 808s and Heartbreak, one of the seminal albums of this century

chillee7y ago

Practically all singers use "pitch correction", which is essentially "autotune that's not done for artistic effect".

If done properly, it's not really janky or fake sounding at all.
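To make the "pitch correction" idea concrete, here is a minimal sketch of only the target-pitch arithmetic: find the nearest equal-tempered semitone and move the sung frequency some fraction of the way toward it. It assumes A4 = 440 Hz, and the `strength` parameter is a hypothetical knob for illustration. A real tool also has to detect the pitch and resynthesize the audio, which is not shown here.

```python
import math

A4_HZ = 440.0  # assumed reference tuning


def snap_to_semitone(freq_hz: float) -> float:
    """Return the nearest equal-tempered pitch to freq_hz."""
    semitones_from_a4 = 12 * math.log2(freq_hz / A4_HZ)
    return A4_HZ * 2 ** (round(semitones_from_a4) / 12)


def correct(freq_hz: float, strength: float = 1.0) -> float:
    """Move freq_hz toward the nearest semitone.

    strength is a hypothetical parameter in [0, 1]:
    0 leaves the note alone, 1 snaps it fully.
    """
    target = snap_to_semitone(freq_hz)
    # Interpolate in log-frequency so the shift is even in musical terms.
    return freq_hz * (target / freq_hz) ** strength
```

With a low `strength` the note is only nudged, which roughly corresponds to the subtle use described above; a full snap applied instantly is closer to the obvious effect.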

nineteen9997y ago

Maybe for recorded mainstream pop music that is true, but we can tell because the performance then sounds bowdlerised and dull.

It can't really help with a live performance, and for a good singer, recording multiple takes is going to be faster and more economical. Punching in/out is so easy, and modern DAWs like Pro Tools (which does this by default) keep all your takes for you anyway, so there's no need to waste another track on your 24-track tape or tape over the previous one.

Here's another viewpoint:

https://www.quora.com/Do-all-most-singers-use-pitch-correcti...

Fantastic vocal performances were captured all throughout the last century without the parachute of pitch correction/autotune. I'd rather listen to an imperfect take with flaws than to a machine assisted correction any day. Each to their own I guess.

djaychela7y ago

I don't think it's always the case that you -can- tell. I think it falls nearly into 'Rumsfeld classification':

Known Knowns - done for the clear effect, which is audible by everyone (as popularised, but not invented, by Cher, T-Pain, etc.).

Known Unknowns - ones where most people wouldn't notice, but if you've listened to autotune a lot you'd swear it's been done (sustained notes with vibrato added seem to be clear contenders here to me, such as one in 'Angels' by Robbie Williams).

Unknown Unknowns - There are lots of recordings I've worked on for people (mixing, mostly) where the vocal sounds perfect, and it's actually been autotuned/processed. Subtly, but still processed. I only know this because the person who's done it has confirmed it. In the context of a mix there's little evidence, if any at all. And if you listen to backing vocals of the last decade versus BVs from say 30 years ago, you really hear the difference - not because modern ones necessarily sound like they've been worked on, but because the old ones sound just a little bit out of tune in comparison.

I've done work on singers' recordings where I've fixed the pitch and they haven't even noticed it on their own voice, in isolation (which really is the sound everyone knows best). If done well (and appropriately), it doesn't turn it into awful processed rubbish, it can tweak an otherwise brilliant performance and make it near-perfect. I say this as a recording engineer who loves the sound of a band playing together, and would always sacrifice separation / absolute recording quality for the communication and feel that you get with a live band playing together how they normally do - I wouldn't sacrifice personality for pitch, but I think you can improve on nearly everyone's performance in some places.

Having said that, the flip side of this is that people think 'they can fix that in the mix' when they've not given a great performance... and that's never the case!

puranjay7y ago

As someone who likes to write and produce songs but can't sing, I'd say that auto-tune and pitch correction are a Godsend. It means I can try out new ideas without having to rely on a vocalist.

Modern production tools have mostly eliminated the need to work with a band. I can create music that I want to create with complete artistic freedom. Auto-tune is just another step in that direction.

throwaway88797y ago

Except you have no idea how any of those "fantastic vocal performances" were processed. Vocal effects go back a good 50 years.

Siemer7y ago

Recording is not the problem. Cutting and splicing the best bits of all those recordings is very labor intensive.

aczerepinski7y ago

I want to plug Jacob Collier though, who doesn't use pitch correction because it's less in tune than where he wants to place the notes naturally, up or down a few cents depending on the note's relationship to the prevailing chord.
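As a rough illustration of the "few cents" point, the sketch below compares a handful of just-intonation intervals with their equal-tempered counterparts. The choice of intervals and the helper names are assumptions for illustration, not a description of how any particular singer actually tunes.

```python
import math


def cents(ratio: float) -> float:
    """Size of a frequency ratio in cents (1200 cents per octave)."""
    return 1200 * math.log2(ratio)


# Just-intonation ratios and the matching equal-tempered semitone counts.
JUST_RATIOS = {
    "major third": 5 / 4,
    "perfect fifth": 3 / 2,
    "harmonic seventh": 7 / 4,
}
EQUAL_SEMITONES = {
    "major third": 4,
    "perfect fifth": 7,
    "harmonic seventh": 10,
}


def offset_from_equal(name: str) -> float:
    """Cents by which the just interval differs from equal temperament."""
    return cents(JUST_RATIOS[name]) - 100 * EQUAL_SEMITONES[name]


for name in JUST_RATIOS:
    print(f"{name}: {offset_from_equal(name):+.2f} cents")
```

A pure major third comes out about 14 cents flat of the equal-tempered one, so a corrector that snaps every note to the equal-tempered grid would pull such a note away from where the singer intended it.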

noddy1w7y ago

He's a maestro, love his #IHarmU vids especially.

justonepost7y ago

Yeah, I suspect the closer you are to the optimal voice/movement the better the effect is.

hrktb7y ago

Good parallel.

And the same way some song makers have used autotune to adjust synthetic voices like Hatsune Miku’s, would this have any use as an external filter to smooth out synthesized videos?

tommoor7y ago

Oh man, the first company to put this in a mobile app is going to blow up.

(I guess it might take a few years for the performance to get there)

compute_me7y ago

Totally what I thought too after seeing this: https://twitter.com/smeddinck/status/1032970885148364800 And, yes of course, the models should easily run in the cloud. Could be a whole application series of "make your friends do X", where X is a hilariously remapped activity ... bonus here: it probably does not hurt if the results are somewhat crappy at times.

abledon7y ago

Isn’t there a patent / copyright or something covering all the algorithms / structures in the tech stack that this relies on (GANs, DensePose, etc.)?

Is any tech that is published on arXiv just fair game immediately? Seems unfair to the researchers.

MasterScrat7y ago

It's their choice to publish.

flattone7y ago

Why wouldn't this be possible now?

pcthrowaway7y ago

Probably because phones aren't powerful enough to render the video yet.

StephenMelon7y ago

Wouldn't the solution to that be to render in the cloud?

tachyonbeam7y ago

Funnily enough, I think this technology would be better demoed by people doing more natural motions like walking around and doing basic gestures, or dancing in a style that is more fluid. This type of dancing is made to look intentionally "unnatural" (i.e. you rarely see people moving like this in your daily life), which makes it a bit difficult to tell how much of the strangeness/uncanniness comes from the dancing style vs imprecision in the algorithm.

phalangion7y ago

That might be intentional. We're used to seeing people walk, so it would be easy to point out flaws in transformed video of people walking. With the unusual movements, we attribute it to the moves because we're not used to seeing them.

space007y ago

I would like to see the moon walk of MJ :)

bsenftner7y ago

Nice safe way not to say "New Deeper Fakes - now full body!". This is hardly just about dancing. This can be used to show anyone doing anything. The quality will improve far beyond the current quality of the hands and the other interpolated body parts, and far past the quality necessary for games.

learnstats27y ago

Yep.

The real consequence of this is that video footage is no longer a reliable source.

TheCoreh7y ago

Very impressive!

I'm a bit disappointed though that they didn't also include results for a synthetic source video with "impossible" poses (e.g. joints bending backwards, stretching, separating from the body or performing full rotations). That would have been pretty interesting (though perhaps a bit unsettling) to see.

StephenMelon7y ago

I'm really impressed that the shadows behind in the window are reasonably realistic too.

richdougherty7y ago

I loved watching the movements in the "Detected Pose" corner. I felt like I could see the forms of the dance more clearly. I wonder if ML could learn aesthetically pleasing dance forms, then perhaps we could get some generative choreography!

flattone7y ago

Interesting. If we could grab audience-wide fMRI at the next Nutcracker for training, right?

flattone7y ago

Edit for labeling

RhysU7y ago

Even capturing the detected poses in written choreographic form, or using them for iterative feedback while learning a variation, would be interesting.

schaefer7y ago

The title seems poorly worded.

Using AI to transform anyone into a professional dancer might include using AI to process live video (webcam) of someone dancing and then giving them some feedback for improvement. In a word: coaching.

However this is using AI to produce composite videos of people dancing.

cowpewter7y ago

We actually kinda already have the first, in the form of the Dance Central games for the Kinect. You dance, the software detects which of your limbs aren't moving the right way, and it displays visual feedback (highlighting the limb in red, reducing your score, etc.) to show you what you're doing wrong, so you can perform the dance more correctly next time.

It's not good enough to produce professional dancers, but it has definitely improved my dancing as someone who just dances for fun.
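The "detect which limbs aren't moving the right way" step could be sketched roughly as below: compare the joint angle of each limb in the player's pose with the same angle in a reference pose and flag limbs that differ by more than a tolerance. This is an assumed, simplified approach for illustration, not how Dance Central actually works; the keypoint names, the `LIMBS` table and the 20-degree tolerance are all hypothetical.

```python
import math

Point = tuple[float, float]
Pose = dict[str, Point]

# Hypothetical limb table: the angle is measured at the middle joint.
LIMBS = {
    "left_arm": ("left_shoulder", "left_elbow", "left_wrist"),
    "right_arm": ("right_shoulder", "right_elbow", "right_wrist"),
}


def joint_angle(a: Point, b: Point, c: Point) -> float:
    """Angle in degrees at b between the segments b->a and b->c."""
    raw = math.degrees(
        math.atan2(c[1] - b[1], c[0] - b[0])
        - math.atan2(a[1] - b[1], a[0] - b[0])
    )
    angle = abs(raw) % 360
    return 360 - angle if angle > 180 else angle


def limbs_off(user: Pose, reference: Pose, tolerance_deg: float = 20.0) -> list[str]:
    """Return the limbs whose joint angle differs from the reference."""
    flagged = []
    for limb, (a, b, c) in LIMBS.items():
        user_angle = joint_angle(user[a], user[b], user[c])
        ref_angle = joint_angle(reference[a], reference[b], reference[c])
        if abs(user_angle - ref_angle) > tolerance_deg:
            flagged.append(limb)  # e.g. highlight this limb in red
    return flagged
```

Comparing angles rather than raw positions means the check does not depend on where the player stands or how tall they are, though a real game would also need to handle timing and depth.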

monsieurbanana7y ago

For what it's worth, I got the correct meaning from the title. An AI coach makes sense, but for dance? Seems weirdly specific.

Meanwhile composite videos really blend in with all the augmented reality phone apps that teenagers use nowadays.

I'm half surprised there isn't already something like this for smartphones (with inferior quality).

ajmurmann7y ago

Having just watched the movie Upgrade I had something very different (and scarier) in my head...

deegles7y ago

An AI dance coach-as-a-service would be amazing and have a real impact!

mc327y ago

So this is a cool demo --and it has applications in cinema, MVs, etc. But, this is being presented as something which could allow Jane Q User to portray herself as an accomplished dancer --just transfer a style onto herself.

Maybe I'm in the minority, but I think if we take this idea and walk with it, it has the potential to trivialize actual accomplishment. Maybe I'm overthinking it.

djsumdog7y ago

I think that's all it's intended to be right? A cool demo. And it looks pretty amazing for what it is really.

We're not going to see The Running Man style/quality fake videos any time soon, and the media kinda runs with this and exaggerates, making people wonder if camera footage may one day no longer be considered evidence.

We're far from that. At the most, the quality of the transfers here is about the same as what you'd see with Deep Fakes (celebrities superimposed on top of pornographic models using computer vision and AI algorithms).

colordrops7y ago

Are we really that far?

bsenftner7y ago

We ARE that far. Any deep fakes you see in the wild are the hobbyists applying the tech to the lowest common denominator. There are professional state-sponsored groups creating serious fake video, easily manipulating the presence of people at newsworthy incidents.

codewithcheese7y ago

Interesting thought. One upside might be people learn accomplishments are to be enjoyed as a personal achievement rather than requiring social acknowledgement to be validated.

tuckermiOP7y ago

I suspect you are overthinking it, or at least should acknowledge that one needs to take the idea and walk a long, long way!

But, with that said, this work is dependent on having a "source" that the user is using as an input for pose detection. The actual accomplishment must still be performed and recorded, though I suppose this opens the door to the dance equivalent of "lip syncing" even beyond what might be done today with a body double.

tuckermiOP7y ago

Or perhaps you don't even need a human at all, according to this work: https://medium.com/syncedreview/busting-moves-with-dancenet-...

adrianN7y ago

Just because calculators allow anyone to do complex arithmetic, that doesn't lessen the accomplishment of someone who can do it unassisted.

bcheung7y ago

It will just shift it.

They said the same thing with any new artistic medium. Digital cameras, Photoshop, Instagram filters, MIDI music instruments, etc.

swaggyBoatswain7y ago

So, things AI can do now:

- Mimic a target's body motions (this link)

- Mimic a target's facial expressions (deepfakes)

- Mimic a target's voice (Lyrebird AI, etc.)

related video, digital animation puppeteering

https://www.youtube.com/watch?v=YiOByO8J7xg&t=2s&list=LLI462...

It's not perfect by any means, but we're seeing a new age of CGI. Once perfected, I wonder how the entertainment industry will change as a result (faster rendering times, less time to make scenes, puppeteering, not needing expensive famous actors or stunt doubles, digital identity copyrights, etc.)

piyh7y ago

I'm worried about the social ramifications and moving us one step closer to "post-truth". Not much we can do to stop it at some point, but if a certain pee tape came out in the next year, there'd always be a reasonable doubt as to its veracity.

SeanAppleby7y ago

Yeah, I don't think people are lending enough weight to how big of a deal that is.

We're heading into a world where it would not be very hard to bombard the public with a large number of highly convincing long-form videos of anyone in the world ranting on any topic and acting out anything the creators want, and we would have borderline no idea if they were legitimate.

Combining that with our media climate and already runaway problem with monetary and political incentives for fabricated stories seems really dangerous.

You could make a video of Neil Armstrong and NASA execs talking about how they faked the moon landing, or even much more nefarious fake content confirming conspiracy theories for political ends.

What will we use as a scalable filter to know what is actually going on, and how will we keep that content from manipulating public discussion?

swaggyBoatswain7y ago

We've had digital and video manipulation for years. It leaves behind pixelated artifacts though, and can be spotted (e.g. see Captain Disillusion on YouTube).

But yes there's going to be a big market for tracking fake data sources in the future. We're already seeing tools to track fake twitter accounts, fake instagram followers, fake amazon purchase reviews, this is an ongoing trend.

earenndil7y ago

I disagree that it is a big deal. There'll be a brief period when many people don't know about it, or don't know how mainstream it is, while it is already being done. Then people will realise and there will be a cultural shift where video evidence is significantly less trusted.

ggm7y ago

It's a bungee jump into the uncanny valley: exciting, but how many times would you pay, and who has more fun? You, or your friends watching?

lovingdancer7y ago

As a dancer & instructor of over 18 years, I think this technology is fascinating. I actually think it would be most effective as a teaching tool for my students. Oftentimes, since the kids are so focused on the physicality of the steps, I find a disconnect between the visual and physical experience as they train, i.e. the kids don't realize that the steps/movements they make are an attempt to create visual shapes and lines. They run around the studio 'feeling themselves' (precious), but at the end of the year on stage, the choreography suffers from this visual disconnect.

I appreciate that the detected poses and motions create clear pictures for what different parts of the body are doing. Particularly for ballet, if I had access to this technology (in a way that was user friendly), I'd love to see the difference between ballet styles (Vaganova, Cecchetti, ABT, etc.). I think it would be much clearer, from a student's perspective, to see the stylistic differences in lines, shapes and movement.

This AI reminds me of Happy Feet, where they took Savion Glover's movement and choreography and applied it to the animated penguin. It doesn't seem too far-fetched. And lastly, for those who say this seems unnatural--dancing is unnatural to the body, hence the training and years put into it. So having an AI applied to it will only make it look more unnatural.

Artistically, this can be debated (as it has been), but in search for 'real life application,' I'd love to get my hands on this as a teaching tool.

sorry for the long post--this is my first time on this site--my boyfriend sent this to me & warned me that if i blabbed too long, this post would not be successful.

pelario7y ago

As many other comments have said, the title is misleading; the key quotation is:

"(...) allows anyone to portray themselves as a world-class ballerina (...)"

Moreover, after AlphaGo took Go away from us, I started to wonder "what is left" for humans, and I believe that we are centuries away from having machines that achieve world-class dancing level. My reasoning is that in things like Go, image or speech recognition, it is easier to "encode" the information for the ML to actually learn. On the other hand, encoding the movements of professional dancers is already quite difficult. Consider, for example, that in the video linked here the whole human body is mapped onto ~20 points. Sure, this may be enough to portray someone as a dancer. But good luck making a dancing robot.

So, maybe I quit my programming career to become a dancer, it is less likely to be a job that the machines will take away ;-)

edit: grammar

PakG17y ago

I tried joining a dance class at my gym. I looked ridiculous. I have no confidence that I'd look as good as the coach even after a year of practice. Well, I think it helps that everyone else in the class is in much better shape than me, their moves are more... nice to look at.

Yeah, it doesn't matter if machines can't dance if I can't either. Still no job for me. :)

Hendrikto7y ago

> I believe that we are centuries away from having machines that achieve world-class dancing level

People said the same about Go. There are AIs that can compose enjoyable music already.

goatlover7y ago

Robotics is a bit harder than board games.

tjr2257y ago

AI can make anyone appear to be a professional dancer. There is a difference between what is real and what is fake.

flattone7y ago

Good thing we often accept versions of reality as far away as snap filters.

paraschopra7y ago

Here's an unpopular opinion: such applications aren't going to trivialize art.

Like competitive sports, art is all about display of human ability under constraints. This is why even in the age of photographs, we still value hand-painted canvases. Such techniques are simply going to make people more discerning between real effort vs. automated means of generating the same outcome.

Rather than thinking AI-assisted style transfers are the end of art, we should think that these are new tools for artists to do even more interesting stuff. See this upcoming tool for example: https://runwayml.com/

monsieurbanana7y ago

An interesting parallel could be made with chess. How did Deep Blue affect the interest of humans in the game of chess? I'm not a chess player, but I seem to recall the effect was at least neutral, if not positive.

And more recently with AlphaGo. Now that humans have no chance of ever beating AI again in the game of Go, what will change?

I'm a Go player so I'm more interested in this question. Professional Go players said that AlphaGo is positive for Go, that they will be able to learn from it and reach new levels of play.

Although of course their livelihood depends on the popularity of Go, so it would be bad press for them to say the opposite.

hellofunk7y ago

I entirely agree with you. It's one thing for computers/AI to emulate the creative work that humans have already achieved, essentially copying, or porting, or manipulating prior art, but it's something else entirely to genuinely create something new and fresh that connects with people emotionally, and I have yet to see any evidence that AI is close to this.

krapp7y ago

Most modern art isn't made to create something new and fresh... it's mass-produced pastoral stuff like Thomas Kinkade or connected to a multimedia franchise (book covers, movie posters, game art, etc.) and a lot of that is certainly formulaic and derivative.

Maybe AI isn't able to copy human technique well enough yet but whether it succeeds or fails will have little to do with whether or not it creates work that resonates emotionally like classic art, because that's no longer the purpose of the vast majority of art that people encounter.

And I would argue that human beings, for the most part, copy other human beings anyway. Working within a "genre" and using cultural references and even recognizable techniques are all essentially copying or at least adapting what came before.

hellofunk7y ago

I guess it depends on how broad a definition of "art" we are using.

rm_-rf_slash7y ago

The same argument was made about the Mona Lisa when copies of it began appearing in books and newspapers and such. Instead of obsolescence, it made Da Vinci’s work more popular than ever.

MPSimmons7y ago

I'm sure that this will, at no point, be used for evil.

trukterious7y ago

Indeed. It's still in the uncanny valley, but then evil is uncanny valley too. Goodness has always been linked in our minds with beauty and grace, including grace of movement.

dangerface7y ago

Need to send this to Theresa May

robaato7y ago

Exactly my thought!

https://www.theguardian.com/politics/2018/aug/30/theresa-may...

DrNuke7y ago

We should think of something like a blockchain to mark all this sh*t as fake though, because in five years' time there will be no way to distinguish reality from invention and we will all be under constant blackmail from malicious agents and rogue governments showing up at our door with whatever made-up accusation they want.

Raphmedia7y ago

> blackmail from malicious agents and rogue governments showing up at our door with whatever made-up accusation they want.

I think the opposite. I believe that this will kill blackmail. Why care if someone has a leaked sex tape featuring you in an age where anyone can fake them? Simply say it's fake. In a few years, I bet there will simply be apps where you can point to a person's social network accounts and have the app generate whatever you want. Blackmail will die once everyone has access to those videos with a few clicks.

eivarv7y ago

How would blockchain solve the issue?

IshKebab7y ago

It lets you prove a file was not modified after a certain point in time. So if you have two similar videos that are both timestamped you can prove which one is the original.

This idea dates back to way before bitcoin.

https://en.m.wikipedia.org/wiki/Trusted_timestamping
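A minimal sketch of the hashing half of that idea: record a fingerprint of the file together with a time, and later check whether a file still matches the record. The function names and the record layout are hypothetical. In a real trusted timestamping scheme (for example RFC 3161) a time-stamping authority signs the hash and the time, which is what makes the record trustworthy; that signing step is omitted here.

```python
import hashlib


def fingerprint(data: bytes) -> str:
    """SHA-256 digest of the file contents, as hex."""
    return hashlib.sha256(data).hexdigest()


def make_record(data: bytes, timestamp: str) -> dict:
    """Build an unsigned record (hypothetical layout).

    A real authority would sign the digest and timestamp together.
    """
    return {"sha256": fingerprint(data), "timestamp": timestamp}


def matches_record(data: bytes, record: dict) -> bool:
    """True if data is byte-for-byte what was recorded."""
    return fingerprint(data) == record["sha256"]


record = make_record(b"original video bytes", "2018-08-30T12:00:00Z")
print(matches_record(b"original video bytes", record))
print(matches_record(b"edited video bytes", record))
```

Any change to the bytes changes the digest, so the record shows the file existed in exactly this form when it was stamped. It says nothing about whether the content was genuine to begin with.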

eivarv7y ago

You only prove which was timestamped first, though - not which is the original.

DrNuke7y ago

Link a blockchain app made with IOTA (for example, but this is the best and most manageable protocol I can see out there just now) to your unique ID in the form of something like your heartbeat plus your physical location plus your actual body shape, then perform a mini-transaction every 30 seconds or every minute or the granularity you need, finally store the transactions linked to that data as an alibi against any accusation that you were somewhere else doing something fake an AI created to blackmail you.

eivarv7y ago

Or you could just be critical, and be aware of the limitations of media as it pertains to representation of truth and reality.

Besides, techniques for identifying fakes are likely to lag closely behind the techniques for producing them.

The first time we talked about "DeepFakes", someone I know pointed out that there is nothing inherently new in this issue. Media has been faked to manipulate the truth as long as there has been media.

Whether you trust the medium, a person, or a blockchain, trust is only as good as the information you base it upon – and there are always ways to circumvent it, or otherwise deceive you.

Also: There are some pretty big privacy issues (from what I understand) with what you describe.

xg157y ago

> blockchain

> perform a mini-transaction every 30 seconds

I guess if you don't want to be blackmailed, you better have money.

gitgud7y ago

> unique ID in the form of something like your heartbeat plus your physical location plus your actual body shape, then perform a mini-transaction every 30 seconds

Extreme surveillance, for what? To give evidence that you weren't in a fake dancing video?

I can't see this happening any time soon ...

browsercoin7y ago

Why do you need a blockchain database specifically? What properties of it are relevant here?

Krasnol7y ago

At that point nobody will believe anything so nobody will show up at your door.

lainga7y ago

Oh boy, those hands at 0:14 are a no for me.

taneq7y ago

Technically impressive, this is. Canny, it is not.

bryanrasmussen7y ago

Is it uncanny then?

taneq7y ago

That... would be the implication, yes.

alexcnwy7y ago

This is incredible!

I wonder if seeing yourself dance like this might speed up learning to actually dance like this...

skookumchuck7y ago

I doubt it. Almost nobody learns to dance without a coach. It's because what you think you see is not what the movement actually is. Much of dance is playing with your perception of the movement (the moonwalk is the most obvious example, and very, very few moonwalkers can pull off the illusion).

nikkwong7y ago

I'm itching to make this. Would this be a good intro ML project for a solid software engineer (with a decently strong math background) or would it likely be far over my head? Seems like reverse engineering it from the paper would be tough, but maybe doable :p

currymj7y ago

it would be expensive in terms of hardware, you’d need to shell out for quite a few GPUs or else budget a lot for expensive cloud instances.

plus this is just a very complicated thing, in that it’s gluing together multiple new techniques to do various things.

some of the pieces that went into this work (like GANs) have lots of tutorials online and might be a more manageable, and budget-friendly, place to start. you could do something interesting on Google Colab with free GPU time.

mirimir7y ago

No, not "transform into". It's "make look like, in a video".

I mean, who cares what you look like in some video? When you actually meet people, they'll know that it's bullshit.

Now, if you could manage it in meatspace, that would be cool!

onion2k7y ago

> who cares what you look like in some video?

Everyone who watches television, movies, YouTube, etc. I know that's only a few people, but hey, it's a start.

mirimir7y ago

I suppose. But then, what would be the hiring criteria? Lowest bid? Nice ass?

And the focus here is "anyone", not professionals.

Jaruzel7y ago

The biggest worry here is that instead of hiring 20 dancers for a music video, they only hire one, and use AI to AutoDance[1] a bunch of low-cost actors instead. This could destroy the jobbing dancer-for-hire market.

[1] AutoDance™ - Like AutoTune. I hereby claim it as a term. ;)

onion2k7y ago

> But then, what would be the hiring criteria?

Literally anything apart from someone's ability to dance. Instead of needing someone who can sing, act, look good, and dance, you now only need someone who can sing, act, and look good. And AI will eventually replace all the other talents too I guess.

This is why some people worry about the effect of AI on jobs.

yedawg7y ago

Anyone want to get rich developing this for Android w/ me? haha

abledon7y ago

Can’t NVIDIA and the paper's authors sue you or claim a takedown / charge licensing or something?!? How is it legal for people to take cutting edge tech from universities for free and use it for their own software/profit?

currymj7y ago

same as it’s legal to use open source licensed software to profit. a lot of academics don’t care, and their work might be built very strongly on other work. there might or might not be a patent on this research, but I would guess probably not?

abledon7y ago

I thought GitHub projects were only ok to use if the author provided a LICENSE file, e.g. MIT or Apache 2.0. Do these papers include a license clause at the bottom? Usually no, right? So it seems to be a ‘grey’ area.

nikkwong7y ago

me. let's do it

browsercoin7y ago

Does this mean that we can manipulate videos programmatically in the future? I don't see why not. Maybe we'll see games that are literally indistinguishable from reality.

yummybear7y ago

My kids would pay any amount of my money to play this as a game.

bryanrasmussen7y ago

So would my kid, but that's because she does not seem to understand money.

amelius7y ago

Some time ago there was a submission saying that the adult industry uses similar algorithms to put arbitrary faces on videos of actors.

thefounder7y ago

The title is clickbait... It doesn't turn anyone into a better dancer. It's just about CGI stuff. A kind of future of emoji.

goatlover7y ago

Or the future of fake news.

ratsimihah7y ago

What is going on here? The world is looking more and more like a movie! Apocalypse soon?

rm_-rf_slash7y ago

Massively underwhelming read from the headline. This is basically just deepfakes for dancing. When machine learning can actually teach someone how to dance, then we’ll have something interesting on our hands.

revskill7y ago

AI can't be used for critical tasks, in which mistakes are not allowed at all. What are its real use cases?

m0ew7y ago

Very impressive!

ch4ck7y ago

Looks as fake as dinosaurs in 1993.

mraison7y ago

> Using NVIDIA TITAN Xp and GeForce GTX 1080 Ti GPUs, with the cuDNN-accelerated PyTorch deep learning framework for both training and inference

> the team based their algorithm on the pix2pixHD architecture developed by NVIDIA researchers

Is it me, or is NVIDIA trying very hard to take credit for this UC Berkeley paper? (They're almost taking credit for PyTorch as well.) Sure, this kind of work wouldn't be possible without their hardware, but in that case Intel could probably take credit for most of the science of the last few decades.

twtw7y ago

It's a company blog. It exists to highlight applications of the company's products. I don't see why you are bothered, both of those statements are factually true.

mraison7y ago

Normally I wouldn't be bothered, but in this case I saw people being misled into thinking Nvidia did this work. Given Nvidia publishes a lot at computer vision conferences, there's a higher than usual potential for confusion.

anjc7y ago

How does that seem like taking credit? They aren't saying they're using NVIDIA hardware, they're saying the team developed it from NVIDIA work, i.e. pix2pixHD, no?

It also seems that UC Berkeley and NVIDIA collaborated on pix2pixHD, judging by the paper.

mraison7y ago

> They aren't saying they're using NVIDIA hardware

They are, see quote above. They're also going out of their way to mention that PyTorch is using cuDNN, which is true but off-topic.

boomlinde7y ago

I think he meant that they aren't just saying that they're using NVIDIA hardware

mattmanser7y ago

It's on the NVIDIA site, so either it's been edited by them, or they supplied the cards or part-funded the work.

mraison7y ago

There's no mention of it in the paper. The acknowledgements section says:

> This work was supported, in part, by NSF grant IIS-1633310 and research gifts from Adobe, eBay, and Google

The fact that people are thinking "it's on the nvidia site, they must have participated somehow" is precisely the reason I wanted to bring this up.

xiphias7y ago

It looks impressive, I just don't understand why computer games don't use these techniques to feature realistic human bodies/faces.

TheCoreh7y ago

I don't think this can run in real time on current hardware yet, so it would only work for pre-rendered cutscenes.

anonytrary7y ago

Did you notice the hands in the video? I don't think computer games can accept that level of error. I would ask again in a decade, when this technique is more viable. Until then, I feel game developers will still need humans to do the work.

saagarjha7y ago

You appear to be shadowbanned. Most of your (reasonable-looking) comments are showing up as dead to me.

xiphias7y ago

Sounds interesting, thanks! I had a few informative but controversial comments in the past... that may be the reason.

Jaruzel7y ago

Seconded, I just had to vouch for you in the above comment to un-dead it.

Aha, it was when you said this:

https://news.ycombinator.com/item?id=15725493

1 more reply

m0ck7y ago

This issue was already solved with the invention of booze and soft drugs.

WA7y ago

No, that only changed one's self-perception of being a great dancer.


cheschire7y ago

And just like auto-tuned voices, it will come off as janky and fake.

valtism7y ago

hyperpallium7y ago

But similarly, it could be used as a "janky and fake" artistic style. Because no one's ever seen it before, it will look wrong in unexpected ways, and therefore somewhat fascinating.

Stylus noise, fret noise and mp3 compression artefacts are other "mistakes" deliberately introduced.

sarreph7y ago

You just described T-Pain’s reasoning for using auto tune (he doesn’t actually need it but instead uses it stylistically).

I think you’re onto something here as well about how it may well get adopted as a ‘style’ in someone’s music video.

kevin_thibedeau7y ago

gpvos7y ago

Interesting! Do you have examples where it is possible to hear these unnatural overtones?

2 more replies

ssijak7y ago

throwaway88797y ago

1 more reply

dzjkb7y ago

> I can't imagine anyone saying that obvious autotuned songs sound good.

Krakadero7y ago

T-Pain is a great singer, and I'm pretty sure he's the first person a lot of people think about when they think about autotune.

https://www.youtube.com/watch?v=CIjXUg1s5gc

puranjay7y ago

Let me introduce you to Bon Iver's "22, A Million"

https://www.youtube.com/playlist?list=PLN61gg9VNXPomdZu0UY_w...

apengwin7y ago

Kanye West used autotune to great effect in 808s & Heartbreak, one of the seminal albums of this century.

2 more replies

chillee7y ago

Practically all singers use "pitch correction", which is essentially "autotune that's not done for artistic effect".

If done properly, it's not really janky or fake sounding at all.

nineteen9997y ago

Maybe for recorded mainstream pop music that is true, but we can tell because the performance then sounds bowdlerised and dull.

Here's another viewpoint:

https://www.quora.com/Do-all-most-singers-use-pitch-correcti...

djaychela7y ago

I don't think it's always the case that you -can- tell. I think it falls nearly into 'Rumsfeld classification':

Known Knowns - done for the clear effect, which is audible by everyone (as popularised, but not invented by Cher, T-Pain, etc).

Having said that, the flip side of this is that people think 'they can fix that in the mix' when they've not given a great performance... and that's never the case!

1 more reply

puranjay7y ago

As someone who likes to write and produce songs but can't sing, I'd say that auto-tune and pitch correction are a Godsend. It means I can try out new ideas without having to rely on a vocalist.

Modern production tools have mostly eliminated the need to work with a band. I can create music that I want to create with complete artistic freedom. Auto-tune is just another step in that direction.

1 more reply

throwaway88797y ago

Except you have no idea how any of those "fantastic vocal performances" were processed. Vocal effects go back a good 50 years.

1 more reply

Siemer7y ago

Recording is not the problem. Cutting and splicing the best bits of all those recordings is very labor intensive.

1 more reply

aczerepinski7y ago

noddy1w7y ago

He's a maestro, love his iharmu vids especially

justonepost7y ago

Yeah, I suspect the closer you are to the optimal voice/movement the better the effect is.

hrktb7y ago

Good parallel.

And in the same way that some song makers have used autotune to adjust synthetic voices like Hatsune Miku’s, would this have any use as an external filter to smooth out synthesized videos?

tommoor7y ago

Oh man, the first company to put this in a mobile app is going to blow up.

(I guess it might take a few years for the performance to get there)

compute_me7y ago

abledon7y ago

Isn’t there a patent / copyright or something on all the algorithms / structures in the tech stack that this relies on (GAN, DensePose, etc.)?

Is any tech that is published on arXiv just fair game immediately? Seems unfair to the researchers.

MasterScrat7y ago

It's their choice to publish.

flattone7y ago

Why wouldn't this be possible now?

pcthrowaway7y ago

Probably because they're not powerful enough to render the video yet.

StephenMelon7y ago

Wouldn't the solution to that be to render in the cloud?

1 more reply

tachyonbeam7y ago

phalangion7y ago

space007y ago

I would like to see the moon walk of MJ :)

bsenftner7y ago

learnstats27y ago

Yep.

The real consequence of this is that video footage is no longer a reliable source.

TheCoreh7y ago

Very impressive!

StephenMelon7y ago

I'm really impressed that the shadows behind in the window are reasonably realistic too.

richdougherty7y ago

flattone7y ago

Interesting. What if we could grab audience-wide fMRI at the next Nutcracker for training, right?

flattone7y ago

Edit for labeling

RhysU7y ago

Even capturing pose detection in written choreography form, or using it for iterative feedback while learning a variation, would be interesting.

schaefer7y ago

The title seems poorly worded.

However this is using AI to produce composite videos of people dancing.

cowpewter7y ago

It's not good enough to produce professional dancers, but it has definitely improved my dancing as someone who just dances for fun.

monsieurbanana7y ago

For what it's worth, I got the correct meaning from the title. An AI coach makes sense, but for dance? Seems weirdly specific.

Meanwhile composite videos really blend in with all the augmented reality phone apps that teenagers use nowadays.

I'm half surprised there isn't already something like this for smartphones (with inferior quality).

ajmurmann7y ago

Having just watched the movie Upgrade I had something very different (and scarier) in my head...

deegles7y ago

An AI dance coach-as-a-service would be amazing and have a real impact!

mc327y ago

Maybe I'm in the minority, but I think if we take this idea and walk with it, it has the potential to trivialize actual accomplishment. Maybe I'm overthinking it.

djsumdog7y ago

I think that's all it's intended to be right? A cool demo. And it looks pretty amazing for what it is really.

colordrops7y ago

Are we really that far?

bsenftner7y ago

2 more replies

codewithcheese7y ago

Interesting thought. One upside might be people learn accomplishments are to be enjoyed as a personal achievement rather than requiring social acknowledgement to be validated.

tuckermiOP7y ago

I suspect you are overthinking it, or at least should acknowledge that one needs to take the idea and walk a long, long way!

tuckermiOP7y ago

Or perhaps you don't even need a human at all, according to this work: https://medium.com/syncedreview/busting-moves-with-dancenet-...

adrianN7y ago

Just because calculators allow anyone to do complex arithmetic, that doesn't lessen the accomplishment of someone who can do it unassisted.

bcheung7y ago

It will just shift it.

They said the same thing with any new artistic medium. Digital cameras, photoshop, Instagram filters, MIDI music instruments, etc.

swaggyBoatswain7y ago

So, things AI can do now:

- Mimic a target's body motions (this link)

- Mimic a target's facial expressions (deepfakes)

- Mimic a target's voice (lyrebird AI, etc)

related video, digital animation puppeteering

https://www.youtube.com/watch?v=YiOByO8J7xg&t=2s&list=LLI462...

piyh7y ago

SeanAppleby7y ago

Yeah, I don't think people are lending enough weight to how big of a deal that is.

Combining that with our media climate and already runaway problem with monetary and political incentives for fabricated stories seems really dangerous.

You could make a video of Neil Armstrong and NASA execs talking about how they faked the moon landing, or even much more nefarious fake content confirming conspiracy theories for political ends.

What will we use as a scalable filter to know what is actually going on, and how will we keep that content from manipulating public discussion?

swaggyBoatswain7y ago

We've had digital and video manipulation for years. It leaves behind pixelated artifacts, though, and can be spotted (e.g. see Captain Disillusion on YouTube).

earenndil7y ago

ggm7y ago

It's a bungee jump into the uncanny valley: exciting, but how many times would you pay, and who has more fun? You, or your friends watching?

lovingdancer7y ago

Artistically, this can be debated (as it has been), but in search for 'real life application,' I'd love to get my hands on this as a teaching tool.

Sorry for the long post. This is my first time on this site; my boyfriend sent this to me and warned me that if I blabbed too long, this post would not be successful.

pelario7y ago

As many other comments have said, the title is misleading; the key quotation is:

"(...) allows anyone to portray themselves as a world-class ballerina (...)"

So maybe I should quit my programming career to become a dancer; it is less likely to be a job that the machines will take away ;-)

edit: grammar

PakG17y ago

Yeah, it doesn't matter if machines can't dance if I can't either. Still no job for me. :)

Hendrikto7y ago

> I believe that we are centuries away to have machines that achieve world class dancing level

People said the same about Go. There are AIs that can compose enjoyable music already.

goatlover7y ago

Robotics is a bit harder than board games.

tjr2257y ago

AI can make anyone appear to be a professional dancer. There is a difference between what is real and what is fake.

flattone7y ago

Good thing we often accept versions of reality as far away as snap filters.

paraschopra7y ago

Here's an unpopular opinion: such applications aren't going to trivialize art.

monsieurbanana7y ago

And more recently with AlphaGo. Now that humans have no chance of ever beating AI again in the game of Go, what will change?

I'm a Go player so I'm more interested in this question. Professional Go players said that AlphaGo is positive for Go, that they will be able to learn from it and reach new levels of play.

Although of course their livelihood depends on the popularity of Go, so it would be bad press for them to say the opposite.

hellofunk7y ago

krapp7y ago

hellofunk7y ago

I guess it depends on how broad a definition of "art" we are using.

rm_-rf_slash7y ago

The same argument was made about the Mona Lisa when copies of it began appearing in books and newspapers and such. Instead of obsolescence, it made Da Vinci’s work more popular than ever.

MPSimmons7y ago

I'm sure that this will, at no point, be used for evil.

trukterious7y ago

Indeed. It's still in the uncanny valley, but then evil is uncanny valley too. Goodness has always been linked in our minds with beauty and grace, including grace of movement.

dangerface7y ago

Need to send this to Theresa May

robaato7y ago

Exactly my thought!

https://www.theguardian.com/politics/2018/aug/30/theresa-may...

DrNuke7y ago

Raphmedia7y ago

> blackmail from malicious agents and rogue governments showing up at our door with whatever made-up accusation they want.

eivarv7y ago

How would blockchain solve the issue?

IshKebab7y ago

It lets you prove a file was not modified after a certain point in time. So if you have two similar videos that are both timestamped you can prove which one is the original.

This idea dates back to way before bitcoin.

https://en.m.wikipedia.org/wiki/Trusted_timestamping
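A rough sketch of the hash-then-timestamp idea, under stated assumptions: `TimestampLog` and `Receipt` are hypothetical names made up for illustration, and the in-memory dictionary merely stands in for a trusted timestamping authority or a public ledger. Only the SHA-256 digest of the file is registered, so the file itself never has to be disclosed.

```python
import hashlib
import time
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class Receipt:
    digest: str       # SHA-256 hex digest of the file contents
    timestamp: float  # time at which the digest was first registered


class TimestampLog:
    """Hypothetical stand-in for a timestamping authority (illustration only)."""

    def __init__(self) -> None:
        self._entries: Dict[str, Receipt] = {}

    def register(self, data: bytes, now: Optional[float] = None) -> Receipt:
        digest = hashlib.sha256(data).hexdigest()
        # Keep only the earliest registration for a given digest.
        if digest not in self._entries:
            stamp = time.time() if now is None else now
            self._entries[digest] = Receipt(digest, stamp)
        return self._entries[digest]

    def verify(self, data: bytes, receipt: Receipt) -> bool:
        # True only if the bytes are unchanged and the receipt is the logged one.
        digest = hashlib.sha256(data).hexdigest()
        return digest == receipt.digest and self._entries.get(digest) == receipt


log = TimestampLog()
original = log.register(b"original video bytes", now=100.0)
edited = log.register(b"edited video bytes", now=200.0)
print(log.verify(b"original video bytes", original))  # unchanged bytes verify
print(log.verify(b"edited video bytes", original))    # modified bytes do not
print(original.timestamp < edited.timestamp)          # registration order
```

All this shows is that a given sequence of bytes existed when it was registered and has not changed since.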

eivarv7y ago

You only prove which was timestamped first, though - not which is the original.

DrNuke7y ago

eivarv7y ago

Or you could just be critical, and be aware of the limitations of media as it pertains to representation of truth and reality.

Besides, techniques for identifying fakes are likely to lag closely behind the techniques for producing them.

Whether you trust the medium, a person, or a blockchain, trust is only as good as the information you base it upon – and there are always ways to circumvent it, or otherwise deceive you.

Also: There are some pretty big privacy issues (from what I understand) with what you describe.

1 more reply

xg157y ago

> blockchain

> perform a mini-transaction every 30 seconds

I guess if you don't want to be blackmailed, you better have money.

1 more reply

gitgud7y ago

> unique ID in the form of something like your heartbeat plus your physical location plus your actual body shape, then perform a mini-transaction every 30 seconds

Extreme surveillance, for what? To give evidence that you weren't in a fake dancing video?

I can't see this happening any time soon ...

1 more reply

browsercoin7y ago

Why do you need a blockchain database specifically? Which of its properties are relevant here?

1 more reply

Krasnol7y ago

At that point nobody will believe anything so nobody will show up at your door.

lainga7y ago

Oh boy, those hands at 0:14 are a no for me.

taneq7y ago

Technically impressive, this is. Canny, it is not.

bryanrasmussen7y ago

Is it uncanny then?

taneq7y ago

That... would be the implication, yes.

1 more reply

alexcnwy7y ago

This is incredible!

I wonder if seeing yourself dance like this might speed up learning to actually dance like this...

skookumchuck7y ago

nikkwong7y ago

currymj7y ago

it would be expensive in terms of hardware, you’d need to shell out for quite a few GPUs or else budget a lot for expensive cloud instances.

plus this is just a very complicated thing, in that it’s gluing together multiple new techniques to do various things.

mirimir7y ago

No, not "transform into". It's "make look like, in a video".

I mean, who cares what you look like in some video? When you actually meet people, they'll know that it's bullshit.

Now, if you could manage it in meatspace, that would be cool!

onion2k7y ago

> who cares what you look like in some video?

Everyone who watches television, movies, YouTube, etc. I know that's only a few people, but hey, it's a start.

mirimir7y ago

I suppose. But then, what would be the hiring criteria? Lowest bid? Nice ass?

And the focus here is "anyone", not professionals.
