Can view polynomial regression example as a set of ``hand built'' multiplicative units, and a single output unit. Can train via linear algebra.
Generally, can create an entire network of units, each of which computes the (weighted) sum of its inputs.
What happens when we string a bunch of these together?