LogSoftmax - 11 vs 13

Next section compares an older to a newer version of the same operator after both definition are converted into markdown text. Green means an addition to the newer version, red means a deletion. Anything else is unchanged.

Files changed (1) hide show
  1. LogSoftmax11 → LogSoftmax13 +13 -18
LogSoftmax11 → LogSoftmax13 RENAMED
@@ -1 +1 @@
1
+ The operator computes the log of softmax values for the given input:
2
+ LogSoftmax(input, axis) = Log(Softmax(input, axis=axis))
3
+
1
- The operator computes the logsoftmax (log of softmax) values for each layer in the batch
4
+ The "axis" attribute indicates the dimension along which LogSoftmax
2
- of the given input.
3
- The input does not need to explicitly be a 2D vector; rather, it will be
4
- coerced into one. For an arbitrary n-dimensional tensor
5
- input in [a_0, a_1, ..., a_{k-1}, a_k, ..., a_{n-1}] and k is
6
- the axis provided, then input will be coerced into a 2-dimensional tensor with
7
- dimensions [a_0 * ... * a_{k-1}, a_k * ... * a_{n-1}]. For the default
8
- case where axis=1, this means the input tensor will be coerced into a 2D tensor
9
- of dimensions [a_0, a_1 * ... * a_{n-1}], where a_0 is often the batch size.
10
- In this situation, we must have a_0 = N and a_1 * ... * a_{n-1} = D.
11
- Each of these dimensions must be matched correctly, or else the operator
12
- will throw errors. The output tensor has the same shape
5
+ will be performed. The output tensor has the same shape
13
- and contains the logsoftmax values of the corresponding input.
6
+ and contains the LogSoftmax values of the corresponding input.
14
7
  ### Attributes
15
- * **axis - INT** (default is '1'):
8
+ * **axis - INT** (default is '-1'):
16
- Describes the axis of the inputs when coerced to 2D; defaults to one because the 0th axis most likely describes the batch_size. Negative value means counting dimensions from the back. Accepted range is [-r, r-1] where r = rank(input).
9
+ Describes the dimension LogSoftmax will be performed on.
10
+ Negative value means counting dimensions
11
+ from the back. Accepted range is [-r, r-1] where r = rank(input).
17
12
  ### Inputs
18
13
  - **input** (heterogeneous) - **T**:
19
- The input tensor that's coerced into a 2D matrix of size (NxD) as described above.
14
+ The input tensor of rank >= axis.
20
15
  ### Outputs
21
16
  - **output** (heterogeneous) - **T**:
22
- The output values with the same shape as input tensor (the original size without coercion).
17
+ The output values with the same shape as the input tensor.
23
18
  ### Type Constraints
24
- * **T** in ( tensor(double), tensor(float), tensor(float16) ):
19
+ * **T** in ( tensor(bfloat16), tensor(double), tensor(float), tensor(float16) ):
25
20
  Constrain input and output types to float tensors.