
LSTM: data2pkl #8

Open
xiaojian10 opened this issue Sep 11, 2019 · 3 comments

@xiaojian10

In your code, I found that the lstm folder does not define how to dump the data into the .pkl file format, so I think this is something I need to implement myself. How should I organize my spectrogram images when converting speech spectrograms to the .pkl format?
Is the naming convention for the images similar to the one used when converting the data to binary files?

@zhr1201
Owner

zhr1201 commented Sep 11, 2019

Sorry for the missing piece of code. It was written a long time ago, so I can only refer to the current code to try to remember the details.

So this line should explain the format of the pkl files:

        self.ref_data = np.reshape(
            self.ref_data,
            [self.batch_size, self.epoch_size, self.num_steps, self.NEFF])

Basically, you should store the forward beamformer output spectrograms (and likewise for the backward BF and the reference) as one big NumPy array of shape total_time_steps * NEFF (the number of effective FFT points, e.g. 256 / 2 + 1 = 129). By total_time_steps I mean all samples concatenated together, so all the data should be stored in one array rather than stored separately. As for how to convert the spectrograms into the pkl files, you just store their log magnitudes and reshape into that shape. One more thing: the pkl files for the forward BF, backward BF, and reference should be aligned, meaning the same time index should correspond to the same sample's forward BF, backward BF, and reference, sampled at the same time.
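To make that concrete, here is a minimal sketch of the conversion (the helper name, the epsilon, and the output file names are assumptions for illustration; only the log-magnitude, concatenate-along-time, (total_time_steps, NEFF) layout comes from the description above):

    import pickle

    import numpy as np

    NEFF = 129  # effective FFT points for a 256-point FFT: 256 / 2 + 1

    def spectrograms_to_pkl(spectrograms, out_path):
        # Log magnitude of each utterance's spectrogram; the small
        # constant avoids log(0) on silent frames.
        log_mags = [np.log(np.abs(s) + 1e-8) for s in spectrograms]
        # Concatenate every utterance along time into one big
        # (total_time_steps, NEFF) array, as described above.
        data = np.concatenate(log_mags, axis=0).astype(np.float32)
        assert data.shape[1] == NEFF
        with open(out_path, 'wb') as f:
            pickle.dump(data, f)

    # Build the three lists from the same utterances in the same order so
    # the resulting pkl files stay time-aligned:
    # spectrograms_to_pkl(fw_bf_specs, 'fw_bf.pkl')
    # spectrograms_to_pkl(bw_bf_specs, 'bw_bf.pkl')
    # spectrograms_to_pkl(ref_specs, 'ref.pkl')

The training code can then load each pkl and reshape it to [batch_size, epoch_size, num_steps, NEFF] as in the snippet above.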

@xiaojian10
Author

Your suggestion is really helpful; I will try to implement it as you describe. I am truly grateful for your help.

@xiaojian10
Author

I found that in your shared project, the onset times of the male and female voices in the test audio are different. Does this mean we have to do the same when making the dataset? For example, let the interfering speech start first; after a few seconds' delay, the target speech starts, and from then on the audio is a mixture of the interfering and target speech.
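If it helps, here is a minimal sketch of that kind of mixing, assuming both signals are mono NumPy arrays at the same sample rate (the sample rate, delay, and names below are my own assumptions, not from the repo):

    import numpy as np

    FS = 16000          # assumed sample rate (Hz)
    DELAY_SEC = 2.0     # target starts a few seconds after the interferer

    def mix_with_delay(interferer, target, fs=FS, delay_sec=DELAY_SEC):
        # The interferer plays alone at first; the target is added in
        # after the delay, so the tail is a mixture of both.
        delay = int(delay_sec * fs)
        length = max(len(interferer), delay + len(target))
        mixture = np.zeros(length, dtype=np.float32)
        mixture[:len(interferer)] += interferer
        mixture[delay:delay + len(target)] += target
        return mixture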
