Header Image

This page showcases sample Mel spectrogram images and their related audio files from the Sonic Diffusion project trained on melodic techno sound files. Feel free to explore the samples and listen to the audio files. For more details and source code, visit the GitHub repository.


Please note that the included files have been deliberately reduced in quality. Due to the utilization of mel spectrograms for image representation, the inherent "phase" component of the original audio files was lost during the conversion process. As a result, the "phase" information was reconstructed using interpolation techniques, which may introduce a noticeable "combing" effect in the audio playback. It is important to understand that this compromise was necessary to effectively work with the high-dimensional data inherent in mel spectrograms.

Model Generated Samples


Sample 1

Sample 1

Sample 2

Sample 2

Sample 3

Sample 3

Sample 4

Sample 4

Sample 5

Sample 5

Human Created Samples


Sample 1

Sample 1

Sample 2

Sample 2

Sample 3

Sample 3

Sample 4

Sample 4

Sample 5

Sample 5