Audio

From TSG Doc
Jump to navigation Jump to search

When using audio in your experiment, especially when presenting time-critical stimuli, special care should be taken to optimize the audio settings on multiple levels (hardware, OS, script), as many things can go wrong along the way.

This page outlines some best practices, however we advise to always consult a TSG member if you plan to run an audio experiment in the labs.

Recording

When recording audio for stimuli material or as input for your experiment, please:

  • Use a high quality microphone, with a polar pattern suitable for your application.
  • Use a high quality recorder or audio interface, capable of recording at 24bit and 48kHz or higher.
  • Place the microphone at an appropriate distance from your subject. Set the levels so the audio does not clip (exceeding maximum volume).
  • Record in a quiet environment.

You can use our Sound Recording Labs for high quality voice recording.

Editing

We recommend using Audacity for editing and converting audio files. Audacity is open-source and fairly easy to use, available here: https://www.audacityteam.org/

Export Settings

We recommend using the following export settings:

  • File format: .wav (PCM).
  • Sample Frequency: 44.1kHz.
  • Bit depth: 16 bit.

The Lab Computer audio output is also set to 16 bit, 44.1kHz. We found that this is good enough for most applications; higher settings will increase file size with limited perceivable quality gains.

When using multiple audio files in your experiment, make sure they all use the same settings for consistent playback in your experiment.

In Audacity, you can set up Macros to automate processing and exporting your audio files: https://manual.audacityteam.org/man/macros.html

Windows Settings

Windows 10 has a habit of automatically enabling audio enhancements when connecting new speakers or headphones. These "enhancements" can distort your audio and cause timing issues. Therefore, please make sure they are turned off:

  1. Right click sound icon on taskbar (next to clock) -> Sounds
  2. Goto Playback tab. Select your audio output device and click "Properties"
  3. Goto Enhancements tab. Make sure "Disable all enhancements" is checked.
  4. Click Apply.

Playback

Psychopy

This is an example of a Python script that plays a .wav file with high time accuracy.

 1from psychopy import sound, core
 2from psychopy import prefs
 3prefs.hardware['audioLib'] = ['PTB']
 4
 5# Path to audio file
 6audio_file = "voice.wav"
 7
 8# Load audio
 9# preBuffer – integer to control streaming/buffering -1 means store all 
10audio = sound.Sound(audio_file,preBuffer=-1)
11
12# Play audio
13audio.play()
14
15# Wait for audio to finish playing
16core.wait(audio.getDuration())
17
18# Close audio
19audio.stop()
20audio.close()