Generates a Blackman window as described in the paper https://ieeexplore.ieee.org/document/1455106.

#### Version

This version of the operator has been available since version 17 of the default ONNX operator set.

#### Attributes

<dl>
<dt><tt>output_datatype</tt> : int (default is 1)</dt>
<dd>The data type of the output tensor. Must be one of the values of the DataType enum in TensorProto that correspond to T2. The default value is 1 = FLOAT.</dd>
<dt><tt>periodic</tt> : int (default is 1)</dt>
<dd>If 1, returns a window to be used as a periodic function. If 0, returns a symmetric window. When 'periodic' is 1, the operator computes a window of length size + 1 and returns the first size points. The default value is 1.</dd>
</dl>

#### Inputs

<dl>
<dt><tt>size</tt> (non-differentiable) : T1</dt>
<dd>A scalar value indicating the length of the window.</dd>
<dd>Constrain output types to numeric tensors.</dd>
</dl>
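The effect of the `periodic` attribute can be illustrated with a short NumPy sketch. This is not the ONNX reference implementation: the function name `blackman_window` is hypothetical, the coefficients 0.42, 0.5 and 0.08 are the classic Blackman coefficients (an assumption, since the text above does not list them), and `size` is assumed to be greater than 1.

```python
import numpy as np


def blackman_window(size: int, periodic: int = 1) -> np.ndarray:
    """Illustrative Blackman window (hypothetical helper, assumes size > 1)."""
    # Periodic: evaluate the window over size + 1 points and keep the first
    # `size` of them. Symmetric: evaluate it over exactly `size` points.
    n_points = size + 1 if periodic else size
    n = np.arange(n_points, dtype=np.float64)
    denom = n_points - 1
    # Classic Blackman coefficients (assumed, not taken from the spec text).
    window = (0.42
              - 0.5 * np.cos(2.0 * np.pi * n / denom)
              + 0.08 * np.cos(4.0 * np.pi * n / denom))
    # Cast to float32 to mirror the default output_datatype of 1 = FLOAT.
    return window[:size].astype(np.float32)


periodic = blackman_window(8, periodic=1)
symmetric = blackman_window(8, periodic=0)
```

With these assumptions, the symmetric result agrees with `np.blackman(8)` and the periodic result with the first 8 points of `np.blackman(9)`.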

### <a name="DFT-17"></a>**DFT-17**

Computes the discrete Fourier transform of the input.

#### Version

This version of the operator has been available since version 17 of the default ONNX operator set.

#### Attributes

<dl>
<dt><tt>axis</tt> : int (default is 1)</dt>
<dd>The axis on which to perform the DFT. By default this value is set to 1, which corresponds to the first dimension after the batch index.</dd>
<dt><tt>inverse</tt> : int (default is 0)</dt>
<dd>Whether to perform the inverse discrete Fourier transform. By default this value is set to 0, which corresponds to false.</dd>
<dt><tt>onesided</tt> : int (default is 0)</dt>
<dd>If onesided is 1, only values for w in [0, 1, 2, ..., floor(n_fft/2)] are returned (floor(n_fft/2) + 1 values in total), because the real-to-complex Fourier transform satisfies conjugate symmetry, i.e., X[m, w] = X[m, n_fft-w]*. Note that if the input or window tensors are complex, onesided output is not possible. Enabling onesided with real inputs performs a real-valued fast Fourier transform (RFFT). When invoked with real or complex valued input, the default value is 0. Values can be 0 or 1.</dd>
</dl>

#### Inputs (1 - 2)

<dl>
<dt><tt>input</tt> (non-differentiable) : T1</dt>
<dd>For real input, the following shape is expected: [batch_idx][signal_dim1][signal_dim2]...[signal_dimN][1]. For complex input, the following shape is expected: [batch_idx][signal_dim1][signal_dim2]...[signal_dimN][2]. The first dimension is the batch dimension. The following N dimensions correspond to the signal's dimensions. The final dimension represents the real and imaginary parts of the value, in that order.</dd>
<dd>The length of the signal. If greater than the axis dimension, the signal will be zero-padded up to dft_length. If less than the axis dimension, only the first dft_length values will be used as the signal. It's an optional value.</dd>
</dl>
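The attributes and input layout described above can be mimicked with NumPy's FFT routines. The following is a minimal sketch rather than the operator's actual implementation: the function `dft` and its Python signature are hypothetical, and it relies on `np.fft.fft` padding or truncating along `axis` when `n` is given, which matches the `dft_length` behaviour described above.

```python
import numpy as np


def dft(x, dft_length=None, axis=1, inverse=0, onesided=0):
    """Illustrative DFT on a tensor whose last dimension is [real] or [real, imag]."""
    x = np.asarray(x, dtype=np.float64)
    is_complex = x.shape[-1] == 2
    if onesided and is_complex:
        raise ValueError("onesided output requires real input")
    # Collapse the trailing real/imaginary dimension into a complex array.
    signal = x[..., 0] + 1j * x[..., 1] if is_complex else x[..., 0]
    n = signal.shape[axis] if dft_length is None else int(dft_length)
    # np.fft zero-pads or truncates along `axis` to length n.
    transform = np.fft.ifft if inverse else np.fft.fft
    spectrum = transform(signal, n=n, axis=axis)
    if onesided:
        # Keep only the floor(n/2) + 1 non-redundant bins.
        spectrum = np.take(spectrum, np.arange(n // 2 + 1), axis=axis)
    # Restore the trailing dimension: index 0 is real, index 1 is imaginary.
    return np.stack([spectrum.real, spectrum.imag], axis=-1)


x = np.random.default_rng(0).standard_normal((2, 8, 1))  # [batch][signal][1]
full = dft(x)               # shape (2, 8, 2)
half = dft(x, onesided=1)   # shape (2, 5, 2)
```

Passing `full` back through `dft(full, inverse=1)` recovers the original signal in the real channel, up to floating-point error.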

#### Outputs

<dl>
<dt><tt>output</tt> : T1</dt>
<dd>The Fourier Transform of the input vector. If onesided is 0, the following shape is expected: [batch_idx][signal_dim1][signal_dim2]...[signal_dimN][2]. If axis=1 and onesided is 1, the following shape is expected: [batch_idx][floor(signal_dim1/2)+1][signal_dim2]...[signal_dimN][2]. If axis=2 and onesided is 1, the following shape is expected: [batch_idx][signal_dim1][floor(signal_dim2/2)+1]...[signal_dimN][2]. If axis=N and onesided is 1, the following shape is expected: [batch_idx][signal_dim1][signal_dim2]...[floor(signal_dimN/2)+1][2]. The signal_dim at the specified axis is equal to the dft_length.</dd>
</dl>

Generates a Hamming window as described in the paper https://ieeexplore.ieee.org/document/1455106.

#### Version

This version of the operator has been available since version 17 of the default ONNX operator set.

#### Attributes

<dl>
<dt><tt>output_datatype</tt> : int (default is 1)</dt>
<dd>The data type of the output tensor. Must be one of the values of the DataType enum in TensorProto that correspond to T2. The default value is 1 = FLOAT.</dd>
<dt><tt>periodic</tt> : int (default is 1)</dt>
<dd>If 1, returns a window to be used as a periodic function. If 0, returns a symmetric window. When 'periodic' is 1, the operator computes a window of length size + 1 and returns the first size points. The default value is 1.</dd>
</dl>
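To make the `output_datatype` attribute concrete, the sketch below maps a few TensorProto DataType enum values to NumPy dtypes and casts a Hamming window accordingly. The helper names are hypothetical, only floating-point types are listed, and the window itself comes from `np.hamming`, which uses the coefficients 0.54 and 0.46; the coefficients used by an ONNX implementation may differ slightly, so treat the values as illustrative.

```python
import numpy as np

# A few TensorProto DataType enum values and matching NumPy dtypes.
ONNX_TO_NUMPY = {
    1: np.float32,   # FLOAT (the default)
    10: np.float16,  # FLOAT16
    11: np.float64,  # DOUBLE
}


def hamming_window(size: int, periodic: int = 1, output_datatype: int = 1):
    """Illustrative Hamming window cast to the requested output type."""
    # Periodic: build size + 1 symmetric points and drop the last one.
    n_points = size + 1 if periodic else size
    window = np.hamming(n_points)[:size]  # NumPy's 0.54 / 0.46 coefficients
    return window.astype(ONNX_TO_NUMPY[output_datatype])


default_window = hamming_window(8)                      # float32
double_window = hamming_window(8, output_datatype=11)   # float64
```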

#### Inputs

<dl>
<dt><tt>size</tt> (non-differentiable) : T1</dt>
<dd>A scalar value indicating the length of the window.</dd>
</dl>

Generates a Hann window as described in the paper https://ieeexplore.ieee.org/document/1455106.

#### Version

This version of the operator has been available since version 17 of the default ONNX operator set.

#### Attributes

<dl>
<dt><tt>output_datatype</tt> : int (default is 1)</dt>
<dd>The data type of the output tensor. Must be one of the values of the DataType enum in TensorProto that correspond to T2. The default value is 1 = FLOAT.</dd>
<dt><tt>periodic</tt> : int (default is 1)</dt>
<dd>If 1, returns a window to be used as a periodic function. If 0, returns a symmetric window. When 'periodic' is 1, the operator computes a window of length size + 1 and returns the first size points. The default value is 1.</dd>
</dl>
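One practical reason to prefer the periodic form is overlap-add processing: copies of a periodic Hann window shifted by half its length sum to a constant, whereas copies of the symmetric window do not. The sketch below checks this numerically. The helpers `hann_window` and `overlap_sum` are hypothetical names introduced for illustration, and the formula 0.5 - 0.5 cos(2πn/N) is the standard Hann definition rather than something quoted from the text above.

```python
import numpy as np


def hann_window(size: int, periodic: int = 1) -> np.ndarray:
    """Illustrative Hann window (standard formula, assumes size > 1)."""
    # Periodic windows divide by `size`, symmetric windows by `size - 1`.
    denom = size if periodic else size - 1
    n = np.arange(size, dtype=np.float64)
    return 0.5 - 0.5 * np.cos(2.0 * np.pi * n / denom)


def overlap_sum(window: np.ndarray, hop: int, copies: int = 6) -> np.ndarray:
    """Sum `copies` shifted copies of `window`, each advanced by `hop` samples."""
    total = np.zeros(hop * (copies - 1) + len(window))
    for k in range(copies):
        total[k * hop:k * hop + len(window)] += window
    return total


size, hop = 8, 4
# Drop the first and last `hop` samples, which are covered by one window only.
periodic_sum = overlap_sum(hann_window(size, periodic=1), hop)[hop:-hop]
symmetric_sum = overlap_sum(hann_window(size, periodic=0), hop)[hop:-hop]
```

Here `periodic_sum` is constant at 1.0 across the fully overlapped region, while `symmetric_sum` ripples around 1.0.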

#### Inputs

<dl>
<dt><tt>size</tt> (non-differentiable) : T1</dt>
<dd>A scalar value indicating the length of the window.</dd>
</dl>

Generate a MelWeightMatrix that can be used to re-weight a tensor containing linearly sampled frequency spectra (from DFT or STFT) into num_mel_bins bins of frequency information, based on the [lower_edge_hertz, upper_edge_hertz] range on the mel scale.
This function defines the mel scale in terms of a frequency in hertz according to the following formula:

mel(f) = 2595 * log10(1 + f/700)
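As a quick illustration of the formula, the conversion and its algebraic inverse can be written in a few lines of NumPy. The function names `hz_to_mel` and `mel_to_hz` are hypothetical, and the inverse is derived from the formula above rather than stated in the text.

```python
import numpy as np


def hz_to_mel(f_hz):
    """mel(f) = 2595 * log10(1 + f / 700), applied elementwise."""
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=np.float64) / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel: f = 700 * (10 ** (mel / 2595) - 1)."""
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=np.float64) / 2595.0) - 1.0)


freqs = np.array([0.0, 700.0, 1000.0, 4000.0])
mels = hz_to_mel(freqs)
```

The mapping sends 0 Hz to 0 mel, places 1000 Hz very close to 1000 mel, and compresses higher frequencies logarithmically.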

In the returned matrix, all the triangles (filterbanks) have a peak value of 1.0.

The returned MelWeightMatrix can be used to right-multiply a spectrogram S of shape [frames, num_spectrogram_bins] of linear scale spectrum values (e.g. STFT magnitudes) to generate a "mel spectrogram" M of shape [frames, num_mel_bins].
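The sketch below shows one way such a matrix could be built and applied. It is a sketch under stated assumptions, not the operator's reference implementation: the function name and Python signature are hypothetical, the parameter name `sample_rate` is assumed, and snapping each filter edge to a DFT bin index is a design choice made here so that every triangle peaks at exactly 1.0. It also assumes the chosen parameters give each filter a distinct centre bin.

```python
import numpy as np


def mel_weight_matrix(num_mel_bins, dft_length, sample_rate,
                      lower_edge_hertz, upper_edge_hertz):
    """Illustrative [num_spectrogram_bins, num_mel_bins] triangular filterbank."""
    num_spectrogram_bins = dft_length // 2 + 1
    # num_mel_bins + 2 filter edges, equally spaced on the mel scale.
    mel_lo = 2595.0 * np.log10(1.0 + lower_edge_hertz / 700.0)
    mel_hi = 2595.0 * np.log10(1.0 + upper_edge_hertz / 700.0)
    mel_edges = np.linspace(mel_lo, mel_hi, num_mel_bins + 2)
    edges_hz = 700.0 * (10.0 ** (mel_edges / 2595.0) - 1.0)
    # Assumption: snap each edge to an integer DFT bin index.
    edges = np.floor((dft_length + 1) * edges_hz / sample_rate).astype(int)

    weights = np.zeros((num_spectrogram_bins, num_mel_bins))
    for m in range(num_mel_bins):
        lo, center, hi = edges[m], edges[m + 1], edges[m + 2]
        if center > lo:  # rising slope from 0 at `lo` towards the centre
            weights[lo:center, m] = (np.arange(lo, center) - lo) / (center - lo)
        if hi > center:  # falling slope from the centre to 0 at `hi`
            weights[center:hi, m] = (hi - np.arange(center, hi)) / (hi - center)
        weights[center, m] = 1.0  # peak of the triangle
    return weights


W = mel_weight_matrix(num_mel_bins=4, dft_length=64, sample_rate=8000,
                      lower_edge_hertz=0.0, upper_edge_hertz=4000.0)
S = np.abs(np.random.default_rng(0).standard_normal((5, 33)))  # [frames, bins]
M = S @ W  # mel spectrogram of shape [frames, num_mel_bins]
```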

#### Version

This version of the operator has been available since version 17 of the default ONNX operator set.

#### Attributes

<dl>
<dt><tt>output_datatype</tt> : int (default is 1)</dt>
<dd>The data type of the output tensor. Must be one of the values of the DataType enum in TensorProto that correspond to T3. The default value is 1 = FLOAT.</dd>
<dd>The size of the original DFT. The size of the original DFT is used to infer the size of the onesided DFT, which is understood to be floor(dft_length/2) + 1, i.e. the spectrogram only contains the nonredundant DFT bins.</dd>
<dd>Samples per second of the input signal used to create the spectrogram. Used to figure out the frequencies corresponding to each spectrogram bin, which dictates how they are mapped into the mel scale.</dd>
</dl>

Computes the Short-time Fourier Transform of the signal.

#### Version

This version of the operator has been available since version 17 of the default ONNX operator set.

#### Attributes

<dl>
<dt><tt>onesided</tt> : int (default is 1)</dt>
<dd>If onesided is 1, only values for w in [0, 1, 2, ..., floor(n_fft/2)] are returned (floor(n_fft/2) + 1 values in total), because the real-to-complex Fourier transform satisfies conjugate symmetry, i.e., X[m, w] = X[m, n_fft-w]*. Note that if the input or window tensors are complex, onesided output is not possible. Enabling onesided with real inputs performs a real-valued fast Fourier transform (RFFT). When invoked with real or complex valued input, the default value is 1. Values can be 0 or 1.</dd>
<dd>Input tensor representing a real or complex valued signal. For real input, the following shape is expected: [batch_size][signal_length][1]. For complex input, the following shape is expected: [batch_size][signal_length][2], where [batch_size][signal_length][0] represents the real component and [batch_size][signal_length][1] represents the imaginary component of the signal.</dd>
<dd>A tensor representing the window that will be slid over the signal. The window must have rank 1 with shape: [window_shape]. It's an optional value.</dd>
<dd>The Short-time Fourier Transform of the signals. If onesided is 1, the output has the shape: [batch_size][frames][dft_unique_bins][2], where dft_unique_bins is frame_length // 2 + 1 (the unique components of the DFT). If onesided is 0, the output has the shape: [batch_size][frames][frame_length][2], where frame_length is the length of the DFT.</dd>
</dl>
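The input and output shapes described above can be reproduced with a small NumPy sketch for real-valued signals. Several details are assumptions made for illustration: the function name `stft`, the `frame_step` parameter name, taking frame_length from the window length, and dropping trailing samples that do not fill a whole frame (no padding).

```python
import numpy as np


def stft(signal, frame_step, window, onesided=1):
    """Illustrative STFT for real input of shape [batch_size][signal_length][1]."""
    x = np.asarray(signal, dtype=np.float64)[..., 0]
    window = np.asarray(window, dtype=np.float64)
    frame_length = window.shape[0]  # assumption: the window sets the frame length
    num_frames = (x.shape[1] - frame_length) // frame_step + 1
    starts = np.arange(num_frames) * frame_step
    # frames has shape [batch_size][frames][frame_length]
    frames = np.stack([x[:, s:s + frame_length] for s in starts], axis=1)
    frames = frames * window
    # One-sided output keeps frame_length // 2 + 1 unique bins.
    fft = np.fft.rfft if onesided else np.fft.fft
    spectrum = fft(frames, axis=-1)
    # Trailing dimension: index 0 is real, index 1 is imaginary.
    return np.stack([spectrum.real, spectrum.imag], axis=-1)


signal = np.random.default_rng(0).standard_normal((1, 16, 1))
window = 0.5 - 0.5 * np.cos(2.0 * np.pi * np.arange(8) / 8)  # periodic Hann
one_sided = stft(signal, frame_step=4, window=window, onesided=1)  # (1, 3, 5, 2)
two_sided = stft(signal, frame_step=4, window=window, onesided=0)  # (1, 3, 8, 2)
```

For real input, the first frame_length // 2 + 1 bins of the two-sided result coincide with the one-sided result, which is why the one-sided form loses no information.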