Jump to content

User:Simply Another Editor/sandbox

From Wikipedia, the free encyclopedia

In mathematics, the softmax function, softargmax function,[1] or normalized exponential function,[2]: 198  frequently denoted by , takes a vector of real numbers and normalizes it into a probability distribution. That is, after applying softmax, the order(??) of the vector elements will be the same, but each element will be between 0 and 1 and the sum of all the elements will be 1. The term softmax comes from the fact that it is a continuous, or "soft", varient of the argmax function, which... The standard (unit) softmax function is given by the standard exponential function on each element, divided by the sum of all the exponentiated elements, as a normalizing constant:

   for j = 1, …, K.

For example, the vector becomes after applying the standard softmax function. Softmax is often used in machine learning to map the output of a neural network to a vector of probabilities for each output classes.

Domain and range

[edit]

Bases

[edit]

Instead of e, a different base b can be used, for any positive b > 0. This is written using or . Positive β means the highest score will have the highest probability, called the "maximum convention", and is usual in machine learning. Negative β corresponds to the minimum convention, and is conventional in thermodynamics, corresponding to the lowest energy state having the highest probability; this matches the convention in the Gibbs distribution, interpreting β as coldness. (for any real β), the notation β is for the thermodynamic beta, which is inverse temperature: , , yielding the expression:

   or       for j = 1, …, K.[a]

In some fields, the base is fixed, corresponding to a fixed scale,[b] while in others the parameter β is varied.

  1. ^ Goodfellow, Bengio & Courville 2016, p. 184.
  2. ^ Cite error: The named reference bishop was invoked but never defined (see the help page).


Cite error: There are <ref group=lower-alpha> tags or {{efn}} templates on this page, but the references will not show without a {{reflist|group=lower-alpha}} template or {{notelist}} template (see the help page).