AutoML-Zero: Evolving machine learning algorithms from scratch (opens in new tab)

(github.com)

260 pointslainon6y ago46 comments

46 comments

manually6y ago

- Autosuggest database tables to use

- Automatically reserve parallel computing resources

- Autodetect data health issues and auto fix them

- Autodetect concept drift and auto fix it

- Auto engineer features and interactions

- Autodetect leakage and fix it

- Autodetect unfairness and auto fix it

- Autocreate more weakly-labelled training data

- Autocreate descriptive statistics and model eval stats

- Autocreate monitoring

- Autocreate regulations reports

- Autocreate a data infra pipeline

- Autocreate a prediction serving endpoint

- Auto setup a meeting with relevant stakeholders on Google Calendar

- Auto deploy on Google Cloud

- Automatically buy carbon offset

- Auto fire your in-house data scientists

neximo646y ago

Would be funny but most of those things are already on AutoML Tables, including the carbon offset

https://cloud.google.com/automl-tables

westurner6y ago

> Would be funny but most of those things are already on AutoML Tables, including the carbon offset

GCP datacenters are 100% offset with PPAs. Are you referring to different functionality for costing AutoML instructions in terms of carbon?

...

I'd add:

- Setup a Jupyter Notebook environment

> Jupyter Notebooks are one of the most popular development tools for data scientists. They enable you to create interactive, shareable notebooks with code snippets and markdown for explanations. Without leaving Google Cloud's hosted notebook environment, AI Platform Notebooks, you can leverage the power of AutoML technology.

> There are several benefits of using AutoML technology from a notebook. Each step and setting can be codified so that it runs the same every time by everyone. Also, it's common, even with AutoML, to need to manipulate the source data before training the model with it. By using a notebook, you can use common tools like pandas and numpy to preprocess the data in the same workflow. Finally, you have the option of creating a model with another framework, and ensemble that together with the AutoML model, for potentially better results.

https://cloud.google.com/blog/products/ai-machine-learning/u...

1 more reply

jiofih6y ago

I guess this is a job-safety type comment?

xiaodai6y ago

Autodetect data health issues and auto fix them

Funy you say that cos my company is actually developing something along those lines

otagekki6y ago

Poor data scientists, now whose heads get cut when things go wrong and companies lose billions?

manually6y ago

In the days when Sussman was a novice, Minsky once came to him as he sat hacking at the PDP-6.

“What are you doing?”, asked Minsky.

“I am training a randomly wired neural net to play Tic-Tac-Toe” Sussman replied.

“Why is the net wired randomly?”, asked Minsky.

“I do not want it to have any preconceptions of how to play”, Sussman said.

Minsky then shut his eyes.

“Why do you close your eyes?”, Sussman asked his teacher.

“So that the room will be empty.”

At that moment, Sussman was enlightened.

3 more replies

deepsun6y ago

- Auto-negotiate proper metrics to use with stakeholders.

TaylorAlexander6y ago

Shouldn’t this link directly to the Readme?

https://github.com/google-research/google-research/blob/mast...

lokimedes6y ago

Reminds me of https://www.nutonian.com/products/eureqa/ which I used quite productively to model multivariate distributions from data back in the 2000’s. Funny how everything stays the same, but with a new set of players on the bandwagon.

jmmcd6y ago

Not really similar. Nutonian did straight-up genetic programming symbolic regression. This does genetic programming to discover ML algorithms.

lokimedes6y ago

Actually it is somewhat similar as both find the model, and obliviously fits the data to that model in the process. My use was finding a parameterization that could be reused through regular regression fitting.

joe_the_user6y ago

AutoML-Zero aims to automatically discover computer programs that can solve machine learning tasks, starting from empty or random programs and using only basic math operations.

If this system is not using human bias, who is it choosing what good program is? Surely, human labeling data involves humans adding their bias to the data?

It seems like AlphaGoZero was able to do just end-to-end ML because it was able to use a very clear and "objective" standard, whether a program wins or loses at the game of Go.

Would this approach only deal with similarly unambiguous problems?

Edit: also, AlphaGoZero was one of the most ML ever created (at least at the time of its creation). How much computing resources would this require for more fully general learning? Will there be a limit to such an approach?

darawk6y ago

> It seems like AlphaGoZero was able to do just end-to-end ML because it was able to use a very clear and "objective" standard, whether a program wins or loses at the game of Go.

Just a fun note: winning or losing at the game of Go is actually surprisingly subjective:

https://en.wikipedia.org/wiki/Go_(game)#Scoring_rules

pmontra6y ago

The game ends by agreement of the players. If they don't agree on the result ("those stones are alive!") they must keep playing. Chinese rules are much better at this than Japanese ones especially (IMHO) the old ones with the group tax. There are no ambiguities there. Unfortunately the group tax is unpleasant and Chinese rules are a pain to score manually. Japanese rules are full of flaws but are such a nice shortcut that almost everybody except China use them or some variant of them.

Btw, if any Chinese player is reading this, how do you count the score while playing? Do you count territory and remember the number of captured stones or do you count both stones and territory? Thanks.

1 more reply

Recursing6y ago

Different scoring rules agree on the winner in >99% of cases

westurner6y ago

Is the question "Does AutoML-Zero minimize or maximize a cost function with error as a primary component, instead of using a binary win/lose classifier like AlphaGoZero?"

https://en.wikipedia.org/wiki/AlphaZero

mark_l_watson6y ago

This reminds me of John Koza’s Genetic Programming, a technique for evolving small programs. There is an old Common Lisp library to play with it.

drongoking6y ago

My reaction too. They've reinvented genetic/evolutionary programming. They should probably read some of the decades of work that have already been done on it.

vidarh6y ago

The paper [1] cites Koza among a total of 102 citations.

"An early example of a symbolically discovered optimizer is that of Bengio et al. [8], who represent F as a tree: the leaves are the possible inputs to the optimizer (i.e. the xi above) and the nodes are one of {+, −, ×, ÷}. F is then evolved, making this an example of genetic programming [36]. Our search method is similar to genetic programming but we choose to represent the program as a sequence of instructions—like a programmer would type it—rather than a tree. "

"[36]" is "Koza, J. R. and Koza, J. R. Genetic programming: on the programming of computers by means of natural selection. MIT press, 1992."

[1] https://arxiv.org/pdf/2003.03384.pdf

sadness26y ago

I wonder whether you have some reason to think they haven't read that work, and this isn't them building on it

imvetri6y ago

Same here. When I studied genetic programming, I was hoping that's where problem solving evolve from as it was flawless. But recent events prove otherwise which made me believe we are using the wrong tool for the wrong problem. Here is why.

When AI gets to 100% accuracy, the equation to find the answer becomes 100% accurate. We no longer have to run the AI with heavy resources and equation can be converted to an executable program. This modal of AI will save computing power, and uses resources smartly.

Example.

AI tries to find right equation to add two numbers.

AI finds the equation to add two numbers.

AI outputs the equation as an executable program.

AI discards itself.

2 more replies

tmpmov6y ago

For those interested AutoML-Zero cites "Evolving neural networks through augmenting topologies" (2002) among other "learning to learn" papers and is worth a read if you have time and inclination.

For those with more background and time, would any mind bridging the 18 year gap succinctly? A quick look at the paper reveals solution space constraints (assuming for speed), discovering better optimizers, and specific to the AutoML-Zero paper: symbolic discovery.

ypcx6y ago

Now, can we evolve a ML algorithm that would in turn produce a better AutoML? Ladies and gentlemen, the Singularity Toolkit v[quickly changing digits here].

jxcole6y ago

Interesting, but how does it perform on standard benchmarks like image net and MNIST?

JulianWasTaken6y ago

I am way out of my depth so maybe this is pure nonsense, but presumably the goals aren't so much to evolve better performing models directly for some dataset specifically, but to see what kinds of model families it evolves and whether we've thought of all of them?

Once we've got one we can then presumably train specific models in a more targeted way.

p1esk6y ago

They have some cifar10 results in the paper, but only very small networks.

manthideaal6y ago

if AutoML-Zero is going to be more than a grid-like method then I think it should try to learn a probabilistic distribution over (method, problem, efficiency) and use it to discover features for problems using an auto-encoder in which the loss function is a metric over the (method,efficiency) space. That means using transfer-learning from related problems in which the similarity of problems is based of the (method,efficiency) differency.

Problem P1 is locally similar to P2 if (method,efficiency,P1) meassured in computation time is similar to (method,efficiency,P2) for method in a local space of methods. The method should learn to classify both problem and methods, that's similar to learning words and context words in NLP or matrix factorization in recommendation systems. To sample the (space,method,efficiency) space one need huge resources.

Added: To compare a pair of (method,problem) some stardardization should be used, for linear problems related to solving linear systems the condition number of the coefficiency matrix should be used as a feature for standardization and, for example in SAT an heuristic using the number of clauses and variables should be used for estimating the complexity and normalization of problems. So the preprocessing step should use the best known heuristic for solving the problem and estimating its complexity as both a feature and a method for normalization. Heuristic and DL for TSP is approaching SOTA (but concord is better yet).

Finally perhaps some encoding about how the heuristic was obtained could be used as a feature of the problem (heuristic from minimum spanning tree, branch and bound, dynamic programming, recurrence, memoization, hill climbing, ...) as an enumerative type.

So some problems for preprocessing are: 1) What is a good heuristic for solving this problem. 2) What is a good heuristic for bounding or estimating its complexity. 3) How can you use those heuristics to standardize or normalize its complexity. 4) How big should be the problem so that the assymptotic complexity takes over the noise of small problems. 5) How do you encode the different types of heuristics. 6) How do you value the sequential versus parallel method for solving the problem.

Finally, I wonder if once a problem is autoencoded then if some kind of curvature could be defined, that curvature should be related to the average complexity of a local space of problems, also transitions like in graph problems should be feautured. The idea is using gems of features to allow the system to combine those or discover new better features. Curvature could be used for clustering problem that is for classification of types of problems. For example all preprocessed problems for solving a linear system should be normalize to have similar efficiency when using the family F of learning methods otherwise a feature is introduced for further normalization. For example some problems could require to estimate the number of local extrema and the flat (zero curvature extend of those zones)

no_identd6y ago

Very insightful comment, thank you. There's one other related thing I also find worthy of exploring further namely the population based training used by AutoML-Zero at the moment seems extremely simplistic, and there exist a lot of bleeding edge methods in that area which can tremendously improve outcomes of evolutionary algorithms, I've tweeted about them here (and at the AutoML-Zero people):

https://twitter.com/no_identd/status/1238565087675330560

And it doesn't seem unlikely that tweaking these would tremendously improve the outcomes. Combining that with what you've just described would… well, I'll leave that to the readers imagination. ;)

nobodywillobsrv6y ago

Basic Tech Bros still don't get it. This is cool but real problem is finding/defining the problem. And you don't get a million guesses.

Here is a simple test: get me data to predict the future. Can an algo like this learn to read APIs, build scripts, sign up and pay fees, collect data (laying down a lineage for causal prediction), set up accounts, figure out how account actions work and then take actions profitably without going bust?

If it can even do the first part of this I am in. But I doubt it. This is still just at the level of "cool! Your dog can play mini golf."

dang6y ago

Can you please omit name calling from your comments here? I'm sure you can make your substantive points without that.

This is in the site guidelines: https://news.ycombinator.com/newsguidelines.html.

ajconway6y ago

Can it learn to learn to play Go on a human level? Not yet, but someday it likely will.

j / k navigate · click thread line to collapse

46 comments

manually6y ago

- Autosuggest database tables to use

- Automatically reserve parallel computing resources

- Autodetect data health issues and auto fix them

- Autodetect concept drift and auto fix it

- Auto engineer features and interactions

- Autodetect leakage and fix it

- Autodetect unfairness and auto fix it

- Autocreate more weakly-labelled training data

- Autocreate descriptive statistics and model eval stats

- Autocreate monitoring

- Autocreate regulations reports

- Autocreate a data infra pipeline

- Autocreate a prediction serving endpoint

- Auto setup a meeting with relevant stakeholders on Google Calendar

- Auto deploy on Google Cloud

- Automatically buy carbon offset

- Auto fire your in-house data scientists

neximo646y ago

Would be funny but most of those things are already on AutoML Tables, including the carbon offset

https://cloud.google.com/automl-tables

westurner6y ago

> Would be funny but most of those things are already on AutoML Tables, including the carbon offset

GCP datacenters are 100% offset with PPAs. Are you referring to different functionality for costing AutoML instructions in terms of carbon?

...

I'd add:

- Setup a Jupyter Notebook environment

https://cloud.google.com/blog/products/ai-machine-learning/u...

1 more reply

jiofih6y ago

I guess this is a job-safety type comment?

xiaodai6y ago

Autodetect data health issues and auto fix them

Funy you say that cos my company is actually developing something along those lines

otagekki6y ago

Poor data scientists, now whose heads get cut when things go wrong and companies lose billions?

manually6y ago

In the days when Sussman was a novice, Minsky once came to him as he sat hacking at the PDP-6.

“What are you doing?”, asked Minsky.

“I am training a randomly wired neural net to play Tic-Tac-Toe” Sussman replied.

“Why is the net wired randomly?”, asked Minsky.

“I do not want it to have any preconceptions of how to play”, Sussman said.

Minsky then shut his eyes.

“Why do you close your eyes?”, Sussman asked his teacher.

“So that the room will be empty.”

At that moment, Sussman was enlightened.

3 more replies

deepsun6y ago

- Auto-negotiate proper metrics to use with stakeholders.

TaylorAlexander6y ago

Shouldn’t this link directly to the Readme?

https://github.com/google-research/google-research/blob/mast...

lokimedes6y ago

jmmcd6y ago

Not really similar. Nutonian did straight-up genetic programming symbolic regression. This does genetic programming to discover ML algorithms.

lokimedes6y ago

joe_the_user6y ago

AutoML-Zero aims to automatically discover computer programs that can solve machine learning tasks, starting from empty or random programs and using only basic math operations.

If this system is not using human bias, who is it choosing what good program is? Surely, human labeling data involves humans adding their bias to the data?

It seems like AlphaGoZero was able to do just end-to-end ML because it was able to use a very clear and "objective" standard, whether a program wins or loses at the game of Go.

Would this approach only deal with similarly unambiguous problems?

darawk6y ago

> It seems like AlphaGoZero was able to do just end-to-end ML because it was able to use a very clear and "objective" standard, whether a program wins or loses at the game of Go.

Just a fun note: winning or losing at the game of Go is actually surprisingly subjective:

https://en.wikipedia.org/wiki/Go_(game)#Scoring_rules

pmontra6y ago

1 more reply

Recursing6y ago

Different scoring rules agree on the winner in >99% of cases

westurner6y ago

Is the question "Does AutoML-Zero minimize or maximize a cost function with error as a primary component, instead of using a binary win/lose classifier like AlphaGoZero?"

https://en.wikipedia.org/wiki/AlphaZero

mark_l_watson6y ago

This reminds me of John Koza’s Genetic Programming, a technique for evolving small programs. There is an old Common Lisp library to play with it.

drongoking6y ago

My reaction too. They've reinvented genetic/evolutionary programming. They should probably read some of the decades of work that have already been done on it.

vidarh6y ago

The paper [1] cites Koza among a total of 102 citations.

"[36]" is "Koza, J. R. and Koza, J. R. Genetic programming: on the programming of computers by means of natural selection. MIT press, 1992."

[1] https://arxiv.org/pdf/2003.03384.pdf

sadness26y ago

I wonder whether you have some reason to think they haven't read that work, and this isn't them building on it

imvetri6y ago

Example.

AI tries to find right equation to add two numbers.

AI finds the equation to add two numbers.

AI outputs the equation as an executable program.

AI discards itself.

2 more replies

tmpmov6y ago

For those interested AutoML-Zero cites "Evolving neural networks through augmenting topologies" (2002) among other "learning to learn" papers and is worth a read if you have time and inclination.

ypcx6y ago

Now, can we evolve a ML algorithm that would in turn produce a better AutoML? Ladies and gentlemen, the Singularity Toolkit v[quickly changing digits here].

jxcole6y ago

Interesting, but how does it perform on standard benchmarks like image net and MNIST?

JulianWasTaken6y ago

Once we've got one we can then presumably train specific models in a more targeted way.

p1esk6y ago

They have some cifar10 results in the paper, but only very small networks.

manthideaal6y ago

no_identd6y ago

https://twitter.com/no_identd/status/1238565087675330560

And it doesn't seem unlikely that tweaking these would tremendously improve the outcomes. Combining that with what you've just described would… well, I'll leave that to the readers imagination. ;)

nobodywillobsrv6y ago

Basic Tech Bros still don't get it. This is cool but real problem is finding/defining the problem. And you don't get a million guesses.

If it can even do the first part of this I am in. But I doubt it. This is still just at the level of "cool! Your dog can play mini golf."

dang6y ago

Can you please omit name calling from your comments here? I'm sure you can make your substantive points without that.

This is in the site guidelines: https://news.ycombinator.com/newsguidelines.html.

ajconway6y ago

Can it learn to learn to play Go on a human level? Not yet, but someday it likely will.

j / k navigate · click thread line to collapse