Distributed representation

In machine learning, data with a local representation typically has 1 unit per element. A 5-word vocabulary might be defined by a 5-dimensional vector, with $[1, 0, 0, 0, 0]^{T}$ denoting the first word, $[0, 1, 0, 0, 0]^{T}$ denoting the second word, and so forth.

Distributed representations are the opposite, instead of concentrating the meaning of a data point into one component or one “element”, the meaning of the data is distributed across the whole vector.

The word that is $[1, 0, 0, 0, 0]^{T}$ in a local representation might look like $[- 0.150, - 0.024, - 0.233, - 0.253, - 0.183]^{T}$ in a distributed representation.

Search Results