Why tensors? A beginner's perspective

(mfaizan.github.io)

173 points · mfn · 4y ago · 105 comments

That was an explanation from the perspective of someone acquainted with modern physics. As such, it will make sense to physicists, but no sense to most everyone else, including mathematicians who don’t know modern physics.

For example, in the beginning, author describes tensors as things behaving according to tensor transformation formula. This is already very much a physicist kind of thinking: it assumes that there is some object out there, and we’re trying to understand what it is in terms of how it behaves. It also uses the summation notation, which is rather foreign to non-physicist mathematicians. Then, when it finally reaches the point where it is all related to tensors in the TensorFlow sense, we find that there is no reference made to the transformation formula, purportedly so crucial to understanding tensors. How come?

The solution here is quite simple: what author (and physicists) call tensors is not what TensorFlow (and mathematicians) call tensors. Instead, author describes what mathematicians call “a tensor bundle”, which is a correspondence that assigns each point of space a unique tensor. That’s where the transformation rule comes from: if we describe this mapping in terms of some coordinate system (as physicists universally do), the transformation rule tells you how this description changes under a change of coordinates. This setup, of course, has little to do with TensorFlow, because there is no space that its tensors are attached to; they are just standalone entities.

So what are the mathematician’s (and TensorFlow) tensors? They’re actually basically what the author says, after a very confusing and irrelevant introduction talking about change of coordinates of the underlying space. That introduction is irrelevant because TensorFlow tensors are not attached as a bundle to some space (manifold) as they are in physics, so no change of space coordinates ever happens. Roughly, tensors are a sort of universal objects representing multi linear maps: bilinear maps V x W -> R correspond canonically one-to-one to regular linear maps V (x) W -> R, where V (x) W is a vector space called tensor product of V and W, and tensors are simply vectors in this tensor product space.

Basically, the idea is to replace weird multi linear objects with normal linear objects (vectors), that we know how to deal with, using matrix multiplication and stuff. That’s all there is to it.
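
A minimal numerical sketch of this correspondence, assuming V = R^2 and W = R^3 with their standard bases and using NumPy (which the comment does not mention). The matrix `M`, the vectors, and the function names are made up for illustration. A bilinear map B(v, w) = v^T M w becomes an ordinary linear functional on the 6-dimensional space V (x) W, where v (x) w is represented by `np.kron(v, w)`:

```python
import numpy as np

# Hypothetical bilinear map B: R^2 x R^3 -> R, given by B(v, w) = v^T M w.
M = np.array([[1.0, 2.0, 0.0],
              [0.0, -1.0, 3.0]])

def bilinear(v, w):
    return v @ M @ w

# The same data viewed as a linear functional on V (x) W = R^6.
# np.kron(v, w) lists the products v_i * w_j in row-major order,
# which matches the order of M.ravel().
functional = M.ravel()

def linear_on_tensor_product(t):
    return functional @ t

v = np.array([2.0, -1.0])
w = np.array([1.0, 0.5, 4.0])

lhs = bilinear(v, w)
rhs = linear_on_tensor_product(np.kron(v, w))
assert np.isclose(lhs, rhs)
```

Not every vector in R^6 has the form `np.kron(v, w)`; general elements are sums of such products, which is why the tensor product space is larger than the set of pairs (v, w).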

mbbutler4y ago

Why are you complaining that the author didn't talk about tensors as they are used in tensorflow? Tensorflow is never even mentioned in the piece.

The author is perfectly clear in the first sentence that the piece's focus is about the usefulness of tensors in a physics context.

xyzzyz4y ago

Huh, you’re right, not sure why I thought it’s TensorFlow related.

ABeeSea4y ago

> Instead, author describes what mathematicians call “a tensor bundle”, which is a correspondence that assigns each point of space a unique tensor.

Technically, that’s a tensor field which is a section of the tensor bundle. Similarly, a vector field is a section of the tangent bundle (the collection of all the tangent spaces of the points on the manifold). A vector field is just a choice of a tangent vector for each point from that point’s tangent space.

hinkley4y ago

> author describes tensors as things behaving according to tensor transformation formula

In grade school it drove me nuts when the homework required us to describe a word without using the word (or its Latinate siblings). And yet as an adult there are few enough weeks that go by where some grownup doesn’t try to pull that same trick.

If you think developers are guilty of circular logic, check out some of the math pages on Wikipedia. You can get lost in moments.

jhokanson4y ago

Speaking of math pages on Wikipedia ... and math text more generally

Is it just me or are we horrible at teaching advanced math? Where are the examples (with actual numbers)? Where is the motivation? Where are the pictures?

hinkley4y ago

Randall Munroe has a comic about how most people need enough math to be able to handle a birthday dinner where the guests split the bill for the birthday boy/girl evenly and pay for their meals and tip separately.

That’s a pretty good bar and I wonder if we could just cut to that chase earlier. But I also believe that people need enough math to see when they’re being cheated, and I feel like you could just tell middle schoolers that and they would pay attention. Maybe even primary school.

You told Billy he could have three apples, and now there are two left. Did Billy take more apples than he should have?

It’s always how do you share your cookies fairly with your friends and if they’re my cookies why do I have to share them at all? Screw “fairly” I’m keeping the extras at least. That sort of sharing is a socially advanced concept they don’t entirely get just yet.

GoatOfAplomb4y ago

> Where are the examples (with actual numbers)?

In upper-level undergraduate math, I made a game of seeing how many pages I would go before seeing 7 printed anywhere. It was usually 10 pages, if I included the page numbers.

ravi-delia4y ago

What are we calling advanced math? There comes a point where I personally find it much easier to avoid examples until I'm problem-solving, since otherwise I'll get stuck in a loop of wondering if the thing I noticed generalizes. Could just be that my working memory is poor, but when I see a real honest number I know I'm in for a grueling day.

joppy4y ago

Wikipedia is a terrible place to learn advanced mathematics, for the reasons you raise (and more). There are lots of terrific short books, and many terrific lectures online.

chobytes4y ago

This is definitely a problem! Having a large set of interests and problems to draw examples and intuition from is how I deal with it. I suspect this is why so many mathematicians are also into physics.

xyzzyz4y ago

It's not just you, we are horrible at teaching advanced math. However, the reason for it is that advanced math is, as far as we can tell, just really, really, really hard. It's not that mathematicians don't care about teaching others (they very much do, and they try their best to get their understanding across to others), or that Wikipedia authors are particularly bad at clear exposition (they are, if anything, above average). Quite simply, we know of no royal road to understanding mathematics, you have to put in many hours to bite it in very small pieces.

Here's an example:

https://en.wikipedia.org/wiki/Homology_(mathematics)

It has motivation, examples, and even actual numbers (though they're really just 0 and 1 most of the time). In my opinion, it's a very good and clear exposition, for an encyclopedic article. However, I strongly suspect that people without enough mathematical knowledge (and "enough" in this case is something in the neighborhood of "enough to obtain an undergraduate degree in Mathematics") will simply not get anything about it beyond "it's about the number of holes" (and that's not even remotely close to the whole picture: homology theories are important and useful in the context of things with no "holes" to speak of). If you think otherwise, but do not know what a quotient group is, you're just fooling yourself.

This is something I observe on HN a lot: people don't understand advanced mathematics, and are dumbfounded by the fact, trying to blame weird notation mathematicians insist on, or lack of motivation/examples/pictures etc. I never see people here do the same with advanced physics ("if the Standard Model is so standard, why can't they briefly and clearly describe what it is" is not something I ever see), molecular biology, or material science. People seem to know their limits and understand that really grokking these fields requires many years of deep study.

I think it's because many people on HN had a good experience learning mathematics at school: it was something they always grasped really easily, and they were easily able to figure out how to calculate derivatives, integrals, get matrices into normal forms etc. I don't want to rain on anyone's parade, because these things are still relatively difficult, and they do require intellectual ability and effort that probably 3/4ths of the population aren't capable of. However, relative to advanced mathematics, undergraduate calculus is really rather trivial stuff.

Point is, if you don't understand modern advanced mathematics, you shouldn't get any more disappointed than you are about not being able to play violin. These things just don't come easy.

chobytes4y ago

I think a lot of circularity occurs in mathematics because we don't typically qualify our utterances when it can be implicitly understood.

Eg "Numbers (formal) are those objects which behave like numbers (informal)."

dboreham4y ago

Bertrand Russell turns over in grave.

CatsAreCool4y ago

I'm working on a language MathLingua (www.mathlingua.org) whose goal is to precisely describe mathematics using a format that is easy to read and understand to help address ambiguity in mathematical texts written using natural language.

It is still a work in progress, but does it help address some of the problems you see in learning mathematics? Any feedback is greatly appreciated. Thanks.

tagrun4y ago

It has nothing to do with tensor fields, uniform/constant tensors still obey the proper coordinate transformations, that's the defining property of any tensor. (With non-uniform tensor fields, covariant derivatives also pick up a correction, but that's a separate thing.)

TensorFlow "tensor" (and most other uses of "tensor" in programmer jargon) is not a tensor at all, it's just a multidimensional array.

contravariant4y ago

Mathematicians would disagree with you there. There are no coordinates to transform in an ordinary tensor space and therefore no way for a tensor to be affected by such a transformation.

Matrices (or linear transformations in general) are important examples of tensors. There's a nice adjunction between tensor spaces A(x)B and the space of linear transformations B=>C given by:

Hom(A(x)B, C) = Hom(A, B=>C)

In the case of Tensorflow I think they do actually still talk about linear transformations of some kind so it's perfectly fine to call them tensors.
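
The adjunction can be checked numerically in finite dimensions. This is a sketch under assumptions: A = R^2, B = R^3, C = R^4 with standard bases, NumPy arrays for components, and an arbitrary 3-index array `T` invented for the example. Reading `T` as a linear map out of A (x) B, or as a map sending each `a` to a linear map B => C, gives the same result:

```python
import numpy as np

rng = np.random.default_rng(0)
T = rng.normal(size=(4, 2, 3))   # components T[c, a, b], chosen arbitrarily

a = rng.normal(size=2)
b = rng.normal(size=3)

# View 1: an element of Hom(A (x) B, C), a 4 x 6 matrix acting on a (x) b.
as_map_on_product = T.reshape(4, 6)
out1 = as_map_on_product @ np.kron(a, b)

# View 2: an element of Hom(A, B => C): feed in a, get back a 4 x 3 matrix.
def curried(a_vec):
    return np.einsum('cab,a->cb', T, a_vec)

out2 = curried(a) @ b

assert np.allclose(out1, out2)
```

In programming terms this is currying: a function of a pair is the same data as a function that returns a function.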

tagrun4y ago

Which mathematician disagrees with what exactly?

Tensors are introduced by physicists to ensure various physical quantities (which involve coordinates and their derivatives) do not depend on the arbitrarily chosen coordinate system. This is ensured through the transformation properties of tensors.

The name tensor itself comes from the theory of elasticity, Cauchy stress tensor, which BTW is uniform in many practical cases, and obeys the following tensor transformation rule:

https://en.wikipedia.org/wiki/Cauchy_stress_tensor#Transform...

like any other (contravariant) tensor must.

Matrices are not examples of tensors. Matrices can be used for representation of tensors, in which case tensor product becomes Kronecker product, but matrices in general don't have to represent tensors. You can put anything, including your favorite colors or a list of random numbers, in a matrix, and it won't be a tensor in general, not unless it must transform like a tensor under coordinate system changes.

Similarly, TensorFlow "tensor" is just a multidimensional data array, with no transformation rules enforced on it, and therefore is not a tensor.

cygx4y ago

> There are no coordinates to transform in an ordinary tensor space and therefore no way for a tensor to be affected by such a transformation.

Sure there are: Any basis of the underlying vector space(s) induces a basis of the tensor space. Components respective to some basis are coordinates. You can then investigate what happens to the induced basis (or rather, the respective components) under a basis transformation of the underlying vector space(s), which is where the "physicist's" definition of tensors originates.
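
A small sketch of that statement, assuming two finite-dimensional spaces with NumPy arrays as components; the change-of-basis matrices `P` and `Q` are arbitrary invertible matrices chosen for illustration. If the columns of `P` are the new basis vectors of V written in the old basis, the components of v change by P^{-1}, and the components in the induced basis of V (x) W change by the Kronecker product of the two inverse matrices:

```python
import numpy as np

P = np.array([[2.0, 1.0],
              [0.0, 1.0]])          # new basis of V (columns), invertible
Q = np.array([[1.0, 0.0, 1.0],
              [0.0, 2.0, 0.0],
              [0.0, 0.0, 1.0]])     # new basis of W (columns), invertible

v = np.array([3.0, -1.0])
w = np.array([1.0, 2.0, 0.5])

P_inv, Q_inv = np.linalg.inv(P), np.linalg.inv(Q)

# Components of v (x) w in the old induced basis.
t_old = np.kron(v, w)

# Route 1: change the components of v and w first, then take the product.
t_new_direct = np.kron(P_inv @ v, Q_inv @ w)

# Route 2: transform the 6 tensor components with the induced matrix.
t_new_induced = np.kron(P_inv, Q_inv) @ t_old

assert np.allclose(t_new_direct, t_new_induced)
```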

catgary4y ago

What do you think the tensor product of finite dimensional vector spaces looks like?

ummonk4y ago

The post was also a poor explanation for someone doing modern physics. [edit: not true actually I should have read the rest of the post - it’s a good post]

Wald's approach in General Relativity is much better - he treats Tensors as a multilinear map from vectors and dual vectors to scalars.

He then derives the underlying coordinate transformation rules, for the vector spaces used in differential geometry. But

mfnOP4y ago

That’s the approach I used as well in the second half of the article - I just mentioned the transformation law in the beginning since that’s what most physics students encounter first.

Most of the article tries to provide some intuition behind why multilinear maps, which sound like a fairly abstract concept, might be relevant in physics. The key link being the importance of coordinate invariance.

I didn’t go into deriving the coordinate transforms from the multilinear map definition as I didn’t feel that it’d provide much better intuition, but I did mention the equivalence near the end.
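
For a bilinear form, the derivation is short enough to check numerically. The sketch below assumes a 2-dimensional space, a made-up bilinear map `T`, and a change-of-basis matrix `P` whose columns are the new basis vectors; none of these come from the article. Components are defined as `T` evaluated on basis vectors, and the transformation law T' = P^T T P then follows from multilinearity:

```python
import numpy as np

def T(u, v):
    # A hypothetical bilinear map on R^2, defined without choosing a new basis.
    return 2.0 * u[0] * v[0] + u[0] * v[1] - 3.0 * u[1] * v[1]

def components(basis):
    # basis: matrix whose columns are basis vectors; entry (i, j) is T(e_i, e_j).
    n = basis.shape[1]
    return np.array([[T(basis[:, i], basis[:, j]) for j in range(n)]
                     for i in range(n)])

old_basis = np.eye(2)
P = np.array([[1.0, 1.0],
              [0.0, 2.0]])          # columns: new basis vectors in old coordinates

T_old = components(old_basis)
T_new = components(P)

# The component transformation law drops out of the definition.
assert np.allclose(T_new, P.T @ T_old @ P)
```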

ummonk4y ago

Yeah sorry you’re right - I should have read the rest of your post, which is excellent and describes precisely why the coordinates/transformations focused definition is bad for one’s intuition.

kaashif4y ago

> author describes tensors as things behaving according to tensor transformation formula

Yeah, the idea that there are pre-existing things that we're trying to describe is somewhat weird to me when we're trying to come up with a definition of a tensor. The whole point of mathematics is that you come up with the definitions and theorems fall out.

In particular, this comment is funny and speaks to some difference in how I and the author view what we're doing when defining a tensor:

> But why that specific transformation law - why must tensors transform in that way in order to preserve whatever object the tensor represents?

Because we defined it like that! When you make the definition "a tensor is a thing that follows X laws", you don't get to ask why, you just defined it!

Just a funny bit of phrasing, I get what is meant :)

edflsafoiewq4y ago

> The whole point of mathematics is that you come up with the definitions and theorems fall out.

That's just how it's presented in textbooks. It's obviously not how math is actually done.

ravi-delia4y ago

It's presented that way in textbooks because it's way easier to learn it that way (or it's Stockholm syndrome, which I won't deny is possible). The motivated way would require way, way, way more background knowledge.

hansen4y ago

To be a bit pedantic: the identification of tensors with multilinear forms requires finite dimensions (or reflexive topological spaces).

l33t23284y ago

I think it’s a little funny you said that

> [the explanation in the OP] will make sense to physicists, but no sense to most everyone else, including mathematicians

And then went on to describe tensors in a way that is unfriendly to non mathematicians by saying

> tensors are a sort of universal objects representing multi linear maps: bilinear maps V x W -> R correspond canonically one-to-one to regular linear maps V (x) W -> R, where V (x) W is a vector space called tensor product of V and W, and tensors are simply vectors in this tensor product space.

bigger_cheese4y ago

Engineering usage seems to match the physics usage. In classic engineering fashion however we were always taught just to 'plug them in' without learning all the minutiae that go with them.

For example, the stress and strain calculations used for calculating deformation (say, if you were rolling a sheet of steel in a mill) make use of tensors and also something called an "invariant". I assume this also comes from the physics/mathematics world.
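
The invariants mentioned here are quantities built from the stress tensor that do not change when the coordinate axes are rotated. A sketch with made-up numbers (the stress values and the rotation angle are purely illustrative), using NumPy:

```python
import numpy as np

# Hypothetical symmetric stress tensor components (units such as MPa are assumed).
sigma = np.array([[50.0, 10.0,   0.0],
                  [10.0, 20.0,   5.0],
                  [ 0.0,  5.0, -30.0]])

def invariants(s):
    i1 = np.trace(s)
    i2 = 0.5 * (np.trace(s) ** 2 - np.trace(s @ s))
    i3 = np.linalg.det(s)
    return np.array([i1, i2, i3])

# Rotate the coordinate axes by 30 degrees about the z axis.
theta = np.deg2rad(30.0)
R = np.array([[np.cos(theta), -np.sin(theta), 0.0],
              [np.sin(theta),  np.cos(theta), 0.0],
              [0.0,            0.0,           1.0]])

# Transformation rule for a rank-2 tensor under a rotation
# (whether R or R.T comes first depends on the convention in use).
sigma_rotated = R @ sigma @ R.T

assert not np.allclose(sigma, sigma_rotated)                      # components change
assert np.allclose(invariants(sigma), invariants(sigma_rotated))  # invariants do not
```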

mr_mitm4y ago

Thanks for this summary.

Even as a physicist I found it highly confusing when I got told in physics classes that a tensor is "just a thing (or object) that behaves like so under coordinate transformation". Like, what do you mean by "thing"? I have no intuition for this yet, I need concise definitions! Fortunately I took a differential geometry class at the same time, which was really helpful.

ericphanson4y ago

I was happy to see that this article is actually talking about tensors, not just multidimensional arrays (which for some reason are often called tensors by machine learning folks).

lalaithion4y ago

This is mostly a semantic argument, but I find this to be a very annoying perspective. Given a basis, there is a natural isomorphism between tensors of a certain type and multidimensional arrays of certain dimensions.

edflsafoiewq4y ago

Of course there is, but if you perform an operation on a multidimensional array, there is no guarantee it corresponds to an operation on tensors, ie. the resulting tensor may depend on the basis.
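
One way to see the distinction, as a sketch with arbitrary matrices and NumPy: take an array `A` holding the components of a linear map, and a change of basis `P`. Matrix multiplication commutes with the change of basis, so it defines an operation on the underlying linear map; squaring each entry does not.

```python
import numpy as np

A = np.array([[1.0, 2.0],
              [3.0, 4.0]])          # components of a linear map in the old basis
P = np.array([[1.0, 1.0],
              [0.0, 1.0]])          # columns: new basis vectors (arbitrary choice)
P_inv = np.linalg.inv(P)

def change_basis(m):
    # Transformation rule for the components of a linear map.
    return P_inv @ m @ P

# Matrix product: do it before or after the basis change, same result.
matmul_then_change = change_basis(A @ A)
change_then_matmul = change_basis(A) @ change_basis(A)
assert np.allclose(matmul_then_change, change_then_matmul)

# Elementwise square: the result depends on which basis it was done in.
square_then_change = change_basis(A ** 2)
change_then_square = change_basis(A) ** 2
assert not np.allclose(square_then_change, change_then_square)
```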

lalaithion4y ago

Sure, if you perform an arbitrary operation on a multidimensional array. But the same is true of any representation of any mathematical object. It makes no physical sense to take the sine of a mass, or two to the power of a length. But that doesn't mean that whenever someone says "oh, the mass of an object is a real number" I need to nitpick them.

mhh__4y ago

But it's like calling my table a cat because they both have four legs.

zarzavat4y ago

If your table is furry and has a tail it may actually be a cat (from a machine learning perspective).

The machine learning packages have an einsum function / tensor contraction, etc. What more do you need for it to be called a tensor?
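
For readers who have not met it, `einsum` expresses contractions by naming indices. A short sketch using NumPy's `np.einsum` (TensorFlow and PyTorch offer similarly named functions; the arrays here are arbitrary):

```python
import numpy as np

A = np.arange(6.0).reshape(2, 3)
B = np.arange(12.0).reshape(3, 4)
S = np.arange(9.0).reshape(3, 3)

# Contract the shared index j: this is ordinary matrix multiplication.
product = np.einsum('ij,jk->ik', A, B)
assert np.allclose(product, A @ B)

# Contract an index with itself: the trace.
trace = np.einsum('ii->', S)
assert np.isclose(trace, np.trace(S))

# No contraction at all: the outer (tensor) product of two vectors.
u, v = np.array([1.0, 2.0]), np.array([3.0, 4.0, 5.0])
outer = np.einsum('i,j->ij', u, v)
assert np.allclose(outer, np.outer(u, v))
```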

jstx14y ago

Same word, different contexts and meaning. In ML tensors are multidimensional arrays (and nothing more). Neither physicists nor ML researchers/developers are confused about what it means.

cloogshicer4y ago

> Neither physicists nor ML researchers/developers are confused about what it means.

I'm sure this has confused a lot of people, especially beginners. Clashing terminology is one of the main difficulties in interdisciplinary work, in my experience. I don't think it's good to shrug it off like that.

russellbeattie4y ago

I'm literally learning this, or at least being reminded of something I had totally forgotten, as I read this thread. So yeah.

Words can have multiple meanings, but I think we can all agree it's preferable if they don't have multiple slightly different, context-dependent meanings. That's just confusing.

lanstin4y ago

I'm pretty sure that a multidimensional array was the original implementation of tensors and the more fancy linear forms formalism came later in the twentieth century. Then the question was how do these multidimensional arrays operate on vectors and how do they transport around a manifold. (Source: the first book I read on general relativity in the 1970s had a lot of pages about multidimensional arrays, and also this good summary of the history: https://math.stackexchange.com/questions/2030558/what-is-the...)

(Sort of like how vectors kind of got going via a list of numbers and then they found the right axioms for vector spaces and then linear algebra shifted from a lot of computation to a sort of spare and elegant set of theorems on linearity).

westurner4y ago

AFAIU, Matrices are categorically subsets of Tensors where the product operator, at least, is not the tensor product but the Dot product.

Dot product: https://en.wikipedia.org/wiki/Dot_product

Matrix multiplication > Dot product, bilinear form and inner product: https://en.wikipedia.org/wiki/Matrix_multiplication#Dot_prod...

> The dot product of two column vectors is the matrix product

Tensor > Geometric objects https://en.wikipedia.org/wiki/Tensor :

> The transformation law for a tensor behaves as a functor on the category of admissible coordinate systems, under general linear transformations (or, other transformations within some class, such as local diffeomorphisms.) This makes a tensor a special case of a geometrical object, in the technical sense that it is a function of the coordinate system transforming functorially under coordinate changes.[24] Examples of objects obeying more general kinds of transformation laws are jets and, more generally still, natural bundles.[25][26]

hnarayanan4y ago

I was confused by this when I first got to ML.

lysecret4y ago

I am a bit confused

kaashif4y ago

It's not clear to me what you're annoyed about exactly. The way I see it, there are a few options:

You're getting annoyed that people are confusing the map with the territory [1]. Multidimensional arrays with certain properties can be used to represent tensors, but aren't tensors. In the same way, a diagram of a torus isn't a topological space, a multiplication table isn't a group, and a matrix isn't a linear map. Isomorphic but not literally the thing.

Or you're annoyed that people forget an array representing a tensor needs to satisfy some transformation law and can't just be any big array with some numbers in it.

Or maybe you're a fan of basis-free linear algebra!

Which one is it?

1: https://en.wikipedia.org/wiki/Map%E2%80%93territory_relation

ravi-delia4y ago

I mean, if I wanted to refer to the reals exclusively as a vectorspace I wouldn't be wrong, but if you aren't actually using what makes it a vectorspace why would you choose to call it that? Hell, call 7 a tensor. There's more than a map-territory distinction (I'd argue formal mathematics is perhaps the only realm where the two are one and the same, but I see what you're saying), it's a convention of language more generally. You typically use the most necessary term, rather than a random also accurate label. If you don't care about invariance under coordinate transformations (and most machine learning does not), why would you call it a tensor?

ummonk4y ago

I personally am a fan of basis free linear algebra.

More importantly though, “tensors” as commonly used in machine learning seem to rely on a single special basis, so they really are just multidimensional arrays. A machine learning algorithm isn’t really invariant under a change of basis. For example, the ReLU activation function is not independent of a change of basis.
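
The ReLU point can be checked directly. A sketch, assuming a 2-dimensional activation vector and a 45-degree rotation as the change of basis (both chosen only for illustration):

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

theta = np.pi / 4
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])   # rotation, so R^{-1} = R.T

x = np.array([1.0, -1.0])

# Apply ReLU in the original basis, then express the result in the rotated basis.
relu_then_rotate = R.T @ relu(x)

# Express x in the rotated basis first, then apply ReLU there.
rotate_then_relu = relu(R.T @ x)

# A basis-independent operation would give the same vector both ways.
assert not np.allclose(relu_then_rotate, rotate_then_relu)
```

A linear layer, by contrast, can be rewritten consistently in any basis, which is why the nonlinearity is the part that singles out a preferred one.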

The_rationalist4y ago

What is the difference?

icapybara4y ago

Tensors have additional properties that arrays don't necessarily have. For example, the coordinate system transform rule that the author describes in the beginning of the post.

One of my old physics professors taught us to think of tensors as "arrays with units." If it's a vector/matrix/higher dimensional array but has physical units, it's probably a tensor. The fact that it has units means it represents something physical which must obey additional constraints (like the coordinate system transformation rule).

uoaei4y ago

This is a pretty tired line, I gotta say.

Obviously they are not. One is a linear operator, the other is a data structure for implementing computations using that operator. This description extends to all tensors.

It's like saying "queues are not just lists". That is true and also neither insightful nor helpful.

I don't see it as mystifying or complicated, what am I missing?

omarhaneef4y ago

Since I also thought Tensors were just higher dimension arrays, isn't this really what ML folks think Tensors are, since they (we?) do attach units to the Tensors most of the time?

kgwgk4y ago

Something like the difference between a “vector” of length N which is just a collection of N numbers and one that is a representation of an N-dimensional geometric algebra object.

7256864y ago

A wonderful little video to understand what tensors are, by Daniel Fleisch:

https://www.youtube.com/watch?v=f5liqUk0ZTw

Very simple and basic.

Edit: incorrectly wrote vectors instead of tensors.

mettamage4y ago

Wow, that's such a good video. Thanks! Haha, mind blown really. And other than graph theory, I never took a college level math course (I artfully skipped almost all math during my CS degree), I'm doing pre-calculus at the moment, because I want to get better at it.

saberience4y ago

This doesn't seem like it's for beginners.

VeninVidiaVicii4y ago

> Most commonly, a tensor is defined as being anything that transforms like a tensor.

Definitely not beginner level.

brummm4y ago

Hmm, this is stuff physicists learn in their first year undergrad classes for mathematical foundations. Seems to me it's the very definition of beginner.

cyber_kinetist4y ago

I don't know what undergraduate program you have gone through, but this is definitely second-year or third-year course material for most physics degrees in universities. Maybe if you've already taken lots of AP classes in high school then you might be able to skip some stuff, but we're talking about the standard curriculum here.

Normally, you first study the distinction between vectors (which can be expanded to tensors) and scalars in second-year Analytical Mechanics class. You also get a taste of tensors toward the later material in Electromagnetism (which is also probably second-year). And you finally arrive at a rigorous definition of tensors when you take Mathematical Physics (second-year or third-year depending on your skills).

bmitc4y ago

Anyone interested in a visual exploration should check out Geometrical Vectors by Gabriel Weinreich.

https://www.maa.org/press/maa-reviews/geometrical-vectors

billfruit4y ago

Is there any book that treats the whole of geometry using vectors?

bmitc4y ago

I’m not sure I understand the question enough to answer. Do you mean something like differential geometry? There, the theory is built upon vectors and covectors (i.e., differential forms) that are associated with tangent spaces and cotangent spaces, respectively. But that is modern differential geometry and not classical geometry.

billfruit4y ago

I was asking for a classical geometry book that uses a vector-based treatment. Usually classical geometry is treated without resorting to vectors.

beaconstudios4y ago

OK that helps me to understand why tensorflow is called what it is - if a tensor turns a set of vectors into a scalar that's exactly what an artificial neuron does with weights and inputs, and they are linked up to form a data flow graph.

Koshkin4y ago

Here is a really good resource for a beginner:

https://grinfeld.org/books/An-Introduction-To-Tensor-Calculu...

ok1234564y ago

Is that you Pavel?

Beldin4y ago

The way I think of it: you have 0-dimensional arrays of numbers (plain numbers or scalars). You have 1-dimensional arrays of numbers (a list of N numbers or an N-vector). You have 2-dimensional arrays of numbers (an NxM matrix). We can extend this concept to 3- and 4-dimensional arrays and even further.

The kicker? All of them are tensors. Tensor is just a generalisation of the concept.

I am no licensed mathematician, so this could be off. However, every time I dive into this topic, I have to wade through way too complex mathnobabble to arrive at that notion. So let's keep it simple: tensors are a mathematician's template for arrays of any dimension.
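
In array-library terms, this picture looks like the following sketch with NumPy, where `ndim` counts the number of indices. As other comments in the thread point out, this captures only the arrays of components, not the transformation behaviour:

```python
import numpy as np

scalar = np.array(3.0)                  # 0-dimensional: a plain number
vector = np.array([1.0, 2.0, 3.0])      # 1-dimensional: an N-vector
matrix = np.ones((2, 3))                # 2-dimensional: an N x M matrix
cube = np.zeros((2, 3, 4))              # 3-dimensional array

# Print the number of indices and the shape of each array.
for name, arr in [("scalar", scalar), ("vector", vector),
                  ("matrix", matrix), ("cube", cube)]:
    print(name, arr.ndim, arr.shape)
```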

mkehrt4y ago

A (d_0 * d_1 * ... * d_{k-1} * d_k) tensor is just a linear map from a (d_0 * d_1 * ... * d_{m-1} * d_{m+1} * ... * d_{k-1} * d_k) tensor to a (d_0 * d_1 * ... * d_{n-1} * d_{n+1} * ... * d_{k-1} * d_k) tensor, where a () tensor is a scalar, right?

(I kid, but I think this is true, right?)

chobytes4y ago

My version is just: Tensors allow us to write data and operations on data in a way which does not depend on how we chose to represent them.

For example, if I have a vector x in V and a map T from V to W, then I would like the truth of T(x)=y to be independent of how I represent T and x.
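
A sketch of that requirement with NumPy, assuming V = R^2, W = R^3, and arbitrary invertible change-of-basis matrices `P` (for V) and `Q` (for W):

```python
import numpy as np

T = np.array([[1.0, 2.0],
              [0.0, 1.0],
              [3.0, -1.0]])         # components of a map V -> W (3 x 2)
x = np.array([2.0, 5.0])
y = T @ x                           # T(x) = y in the original bases

P = np.array([[1.0, 2.0],
              [1.0, 3.0]])          # columns: new basis of V
Q = np.array([[2.0, 0.0, 0.0],
              [1.0, 1.0, 0.0],
              [0.0, 0.0, 4.0]])     # columns: new basis of W

# Each object changes its components according to its own rule ...
x_new = np.linalg.solve(P, x)       # P^{-1} x
y_new = np.linalg.solve(Q, y)       # Q^{-1} y
T_new = np.linalg.solve(Q, T @ P)   # Q^{-1} T P

# ... and the statement T(x) = y is still true in the new components.
assert np.allclose(T_new @ x_new, y_new)
```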

zardo4y ago

I like the concrete example from when I first used tensors in school. Stress in a block of concrete. You can choose any basis you like to represent the stresses and transform between them.

Whether or not the concrete block breaks under that stress obviously does not depend on your choice of basis or units, so your transformation rules had better reflect that reality.

j / k navigate · click thread line to collapse

105 comments

xyzzyz4y ago

Basically, the idea is to replace weird multi linear objects with normal linear objects (vectors), that we know how to deal with, using matrix multiplication and stuff. That’s all there is to it.

mbbutler4y ago

Why are you complaining that the author didn't talk about tensors as they are used in tensorflow? Tensorflow is never even mentioned in the piece.

The author is perfectly clear in the first sentence that the piece's focus is about the usefulness of tensors in a physics context.

xyzzyz4y ago

Huh, you’re right, not sure why I thought it’s TensorFlow related.

ABeeSea4y ago

> Instead, author describes what mathematicians call “a tensor bundle”, which is a correspondence that assigns each point of space a unique tensor.

hinkley4y ago

> author describes tensors as things behaving according to tensor transformation formula

If you think developers are guilty of circular logic, check out some of the math pages on Wikipedia. You can get lost in moments.

jhokanson4y ago

Speaking of math pages on Wikipedia ... and math text more generally

Is it just me or are we horrible at teaching advanced math? Where are the examples (with actual numbers)? Where is the motivation? Where are the pictures?

hinkley4y ago

You told Billy he could have three apples, and now there are two left. Did Billy take more apples than he should have?

1 more reply

GoatOfAplomb4y ago

> Where are the examples (with actual numbers)?

In upper-level undergraduate math, I made a game of seeing how many pages I would go before seeing 7 printed anywhere. It was usually 10 pages, if I included the page numbers.

ravi-delia4y ago

joppy4y ago

Wikipedia is a terrible place to learn advanced mathematics, for the reasons you raise (and more). There are lots of terrific short books, and many terrific lectures online.

chobytes4y ago

1 more reply

xyzzyz4y ago

Here's an example:

https://en.wikipedia.org/wiki/Homology_(mathematics)

Point is, if you don't understand modern advanced mathematics, you shouldn't get any more disappointed than you are about not being able to play violin. These things just don't come easy.

chobytes4y ago

I think a lot of circularity occurs in mathematics because we don't typically qualify our utterances when it can be implicitly understood.

Eg "Numbers (formal) are those objects which behave like numbers (informal)."

dboreham4y ago

Bertrand Russell turns over in grave.

CatsAreCool4y ago

It is still a work in progress, but does it help address some of the problems you see in learning mathematics? Any feedback is greatly appreciated. Thanks.

tagrun4y ago

TensorFlow "tensor"(and most other use of "tensor" in programmer jargon) is not a tensor at all, it's just a multidimensional array.

contravariant4y ago

Mathematicians would disagree with you there. There are no coordinates to transform in an ordinary tensor space and therefore no way for a tensor to be affected by such a transformation.

Matrices (or linear transformations in general) are important examples of tensors. There's a nice adjunction between tensor spaces A(x)B and the space of linear transformations B=>C given by:

Hom(A(x)B, C) = Hom(A, B=>C)

In the case of Tensorflow I think they do actually still talk about linear transformations of some kind so it's perfectly fine to call them tensors.

tagrun4y ago

Which mathematician disagrees with what exactly?

The name tensor itself comes from the theory of elasticity, Cauchy stress tensor, which BTW is uniform in many practical cases, and obeys the following tensor transformation rule:

https://en.wikipedia.org/wiki/Cauchy_stress_tensor#Transform...

like any other (contravariant) tensor must.

Similarly, TensorFlow "tensor" is just a multidimensional data array, with no transformation rules enforced on it, and therefore is not a tensor.

cygx4y ago

There are no coordinates to transform in an ordinary tensor space and therefore no way for a tensor to be affected by such a transformation.

catgary4y ago

What do you think the tensor product of finite dimensional vector spaces looks like?

ummonk4y ago

The post was also a poor explanation for someone doing modern physics. [edit: not true actually I should have read the rest of the post - it’s a good post]

Wald's approach in General Relativity is much better - he treats Tensors as a multilinear map from vectors and dual vectors to scalars.

He then derives the underlying coordinate transformation rules for the vector spaces used in differential geometry. But

mfnOP4y ago

That’s the approach I used as well in the second half of the article - I just mentioned the transformation law in the beginning since that’s what most physics students encounter first.

I didn’t go into deriving the coordinate transforms from the multilinear map definition as I didn’t feel that it’d provide much better intuition, but I did mention the equivalence near the end.

ummonk4y ago

Yeah sorry you’re right - I should have read the rest of your post, which is excellent and describes precisely why the coordinates/transformations focused definition is bad for one’s intuition.

kaashif4y ago

> author describes tensors as things behaving according to tensor transformation formula

In particular, this comment is funny and speaks to some difference in how I and the author view what we're doing when defining a tensor:

> But why that specific transformation law - why must tensors transform in that way in order to preserve whatever object the tensor represents?

Because we defined it like that! When you make the definition "a tensor is a thing that follows X laws", you don't get to ask why, you just defined it!

Just a funny bit of phrasing, I get what is meant :)

edflsafoiewq4y ago

> The whole point of mathematics is that you come up with the definitions and theorems fall out.

That's just how it's presented in textbooks. It's obviously not how math is actually done.

ravi-delia4y ago

hansen4y ago

To be a bit pedantic: the identification of tensors with multilinear forms requires finite dimensions (or reflexive topological spaces).

l33t23284y ago

I think it’s a little funny you said that

>[the explanation in the OP] will make sense to physicist, but no sense to most everyone else, including mathematicians

And then went on to describe tensors in a way that is unfriendly to non mathematicians by saying

bigger_cheese4y ago

Engineering usage seems to match the physics usage. In classic engineering fashion, however, we were always taught just to 'plug them in' without learning all the minutiae that go with them.

mr_mitm4y ago

Thanks for this summary.

ericphanson4y ago

I was happy to see that this article is actually talking about tensors, not just multidimensional arrays (which for some reason are often called tensors by machine learning folks).

lalaithion4y ago

edflsafoiewq4y ago

Of course there is, but if you perform an operation on a multidimensional array, there is no guarantee it corresponds to an operation on tensors, i.e. the resulting tensor may depend on the basis.
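A small sketch of that point, assuming a 2x2 array read as a linear map and a hypothetical change-of-basis matrix `P` (chosen with an integer inverse so the arithmetic is exact). Composition commutes with the change of basis, while an elementwise square does not. The helper names are made up for illustration.

```python
# Sketch: some array operations respect a change of basis, others do not.

def matmul(X, Y):
    """Plain matrix product of two nested-list matrices."""
    return [
        [sum(X[i][k] * Y[k][j] for k in range(len(Y))) for j in range(len(Y[0]))]
        for i in range(len(X))
    ]

def change_basis(M, P, P_inv):
    """Components of the same linear map in the new basis: P^-1 M P."""
    return matmul(matmul(P_inv, M), P)

def elementwise_square(M):
    """Square every entry; a typical 'array' operation."""
    return [[x * x for x in row] for row in M]

M = [[1, 2], [3, 4]]
P = [[1, 1], [0, 1]]        # hypothetical new basis, columns in old coordinates
P_inv = [[1, -1], [0, 1]]   # its inverse

M_new = change_basis(M, P, P_inv)

# Composition: same answer whichever basis the work is done in.
compose_then_change = change_basis(matmul(M, M), P, P_inv)
change_then_compose = matmul(M_new, M_new)

# Elementwise square: the answer depends on the basis used.
square_then_change = change_basis(elementwise_square(M), P, P_inv)
change_then_square = elementwise_square(M_new)

print(compose_then_change == change_then_compose)
print(square_then_change == change_then_square)
```

The elementwise square is still a perfectly good array operation; it just does not define an operation on the underlying linear map.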

lalaithion4y ago

mhh__4y ago

But it's like calling my table a cat because they both have four legs.

zarzavat4y ago

If your table is furry and has a tail it may actually be a cat (from a machine learning perspective).

The machine learning packages have an einsum function / tensor contraction, etc. What more do you need for it to be called a tensor?

jstx14y ago

Same word, different contexts and meaning. In ML tensors are multidimensional arrays (and nothing more). Neither physicists nor ML researchers/developers are confused about what it means.

cloogshicer4y ago

> Neither physicists nor ML researchers/developers are confused about what it means.

russellbeattie4y ago

I'm literally learning this, or at least being reminded of something I had totally forgotten, as I read this thread. So yeah.

Words can have multiple meanings, but I think we can all agree it's preferable if they don't have multiple, slightly different meanings depending on context. That's just confusing.

lanstin4y ago

westurner4y ago

AFAIU, Matrices are categorically subsets of Tensors where the product operator, at least, is not the tensor product but the Dot product.

Dot product: https://en.wikipedia.org/wiki/Dot_product

Matrix multiplication > Dot product, bilinear form and inner product: https://en.wikipedia.org/wiki/Matrix_multiplication#Dot_prod...

> The dot product of two column vectors is the matrix product

Tensor > Geometric objects https://en.wikipedia.org/wiki/Tensor :

hnarayanan4y ago

I was confused by this when I first got to ML.

lysecret4y ago

I am a bit confused

kaashif4y ago

It's not clear to me what you're annoyed about exactly. The way I see it, there are a few options:

Or you're annoyed that people forget an array representing a tensor needs to satisfy some transformation law and can't just be any big array with some numbers in it.

Or maybe you're a fan of basis-free linear algebra!

Which one is it?

1: https://en.wikipedia.org/wiki/Map%E2%80%93territory_relation

ravi-delia4y ago

ummonk4y ago

I personally am a fan of basis free linear algebra.

The_rationalist4y ago

What is the difference?

icapybara4y ago

Tensors have additional properties that arrays don't necessarily have. For example, the coordinate system transform rule that the author describes in the beginning of the post.

uoaei4y ago

This is a pretty tired line, I gotta say.

Obviously they are not. One is a linear operator, the other is a data structure for implementing computations using that operator. This description extends to all tensors.

It's like saying "queues are not just lists". That is true and also neither insightful nor helpful.

I don't see it as mystifying or complicated, what am I missing?

omarhaneef4y ago

Since I also thought Tensors were just higher dimension arrays, isn't this really what ML folks think Tensors are, since they (we?) do attach units to the Tensors most of the time?

kgwgk4y ago

Something like the difference between a “vector” of length N which is just a collection of N numbers and one that is a representation of an N-dimensional geometric algebra object.

7256864y ago

A wonderful little video to understand what tensors are, by Daniel Fleisch:

https://www.youtube.com/watch?v=f5liqUk0ZTw

Very simple and basic.

Edit: incorrectly wrote vectors instead of tensors.

mettamage4y ago

saberience4y ago

This doesn't seem like it's for beginners.

VeninVidiaVicii4y ago

> Most commonly, a tensor is defined as being anything that transforms like a tensor.

Definitely not beginner level.

brummm4y ago

Hmm, this is stuff physicists learn in their first year undergrad classes for mathematical foundations. Seems to me it's the very definition of beginner.

cyber_kinetist4y ago

bmitc4y ago

Anyone interested in a visual exploration should check out Geometrical Vectors by Gabriel Weinreich.

https://www.maa.org/press/maa-reviews/geometrical-vectors

billfruit4y ago

Is there any book that treats the whole of geometry using vectors?

bmitc4y ago

billfruit4y ago

I was asking for a classical geometry book which is using a treatment using vectors. Usually classical geometry is treated without resorting to vectors.

beaconstudios4y ago

Koshkin4y ago

Here is a really good resource for a beginner:

https://grinfeld.org/books/An-Introduction-To-Tensor-Calculu...

ok1234564y ago

Is that you Pavel?

Beldin4y ago

The kicker? All of them are tensors. Tensor is just a generalisation of the concept.

mkehrt4y ago

(I kid, but I think this is true, right?)

chobytes4y ago

My version is just: Tensors allow us to write data and operations on data in a way which does not depend on how we chose to represent them.

For example, if I have a vector x in V and a map T from V to W, then I would like the truth of T(x)=y to be independent of how I represent T and x.
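A minimal sketch of that statement, assuming V and W are both two-dimensional and picking hypothetical change-of-basis matrices `P` (for V) and `Q` (for W) with integer inverses. The components of x, T, and y all change, but the equation T(x) = y still holds in the new components.

```python
# Sketch: T(x) = y holds in every basis once each object is re-expressed
# by its own rule: x -> P^-1 x, y -> Q^-1 y, T -> Q^-1 T P.

def matmul(X, Y):
    """Plain matrix product of two nested-list matrices."""
    return [
        [sum(X[i][k] * Y[k][j] for k in range(len(Y))) for j in range(len(Y[0]))]
        for i in range(len(X))
    ]

def mat_vec(M, v):
    """Matrix-vector product."""
    return [sum(row[j] * v[j] for j in range(len(v))) for row in M]

T = [[1, 2], [3, 4]]        # components of the map in the old bases
x = [5, 6]                  # components of x in the old basis of V
y = mat_vec(T, x)           # components of y = T(x) in the old basis of W

P = [[1, 1], [0, 1]]        # hypothetical new basis of V (columns, old coordinates)
P_inv = [[1, -1], [0, 1]]
Q = [[2, 1], [1, 1]]        # hypothetical new basis of W
Q_inv = [[1, -1], [-1, 2]]

x_new = mat_vec(P_inv, x)
y_new = mat_vec(Q_inv, y)
T_new = matmul(matmul(Q_inv, T), P)

print(mat_vec(T_new, x_new) == y_new)
```

If T were re-expressed by some other rule, or left untouched as a bare array, the final check would generally fail.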

zardo4y ago

I like the concrete example from when I first used tensors in school. Stress in a block of concrete. You can choose any basis you like to represent the stresses and transform between them.

Whether or not the concrete block breaks under that stress obviously does not depend on your choice of basis or units, so your transformation rules had better reflect that reality.
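To make the concrete-block example a bit more tangible, here is a sketch with a made-up 2D stress state (the numbers and the 3-4-5 rotation are assumptions chosen for exact arithmetic, not real material data). Rotating the axes changes every component, but the trace and determinant, and hence the principal stresses a simple failure criterion would look at, stay the same.

```python
from fractions import Fraction
import math

# Sketch: a 2D stress tensor under a rotation of the coordinate axes,
# transformed as sigma' = R sigma R^T.

def matmul(X, Y):
    """Plain matrix product of two nested-list matrices."""
    return [
        [sum(X[i][k] * Y[k][j] for k in range(len(Y))) for j in range(len(Y[0]))]
        for i in range(len(X))
    ]

def transpose(M):
    return [list(col) for col in zip(*M)]

def trace(M):
    return M[0][0] + M[1][1]

def det(M):
    return M[0][0] * M[1][1] - M[0][1] * M[1][0]

def principal_stresses(M):
    """Eigenvalues of a symmetric 2x2 matrix from its trace and determinant."""
    t, d = trace(M), det(M)
    root = math.sqrt(t * t - 4 * d)
    return ((float(t) + root) / 2, (float(t) - root) / 2)

sigma = [[Fraction(50), Fraction(30)],
         [Fraction(30), Fraction(-20)]]   # hypothetical stress components

c, s = Fraction(3, 5), Fraction(4, 5)     # exact rotation: cos and sin of one angle
R = [[c, -s], [s, c]]

sigma_rot = matmul(matmul(R, sigma), transpose(R))

print(sigma_rot != sigma)                          # components changed
print(trace(sigma_rot) == trace(sigma))            # first invariant unchanged
print(det(sigma_rot) == det(sigma))                # second invariant unchanged
print(principal_stresses(sigma_rot) == principal_stresses(sigma))
```

Using `Fraction` keeps the invariants exactly equal rather than equal up to floating-point error; with floats one would compare using `math.isclose` instead.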
