Japanese Sitemap Toppage
           
Research Areas

OUTLINE
1.1 Super Hi-Vision research
1.1.1 Super Hi-Vision format
1.1.2 Super Hi-Vision equipment
1.1.3 Super Hi-Vision satellite transmission
1.1.4 Super Hi-Vision optical time-division multiplexed transmission
1.1.5 SHV coding

1.2 Three-dimensional television research
1.2.1 Integral 3D television
1.2.2 Capture and reproduction of 3D images

1.3 High-presence audio systems research
1.3.1 3D Audio
Contents

1.3.1

3D Audio




It is assumed SHV will be viewed on screens that are larger than today's Hi-Vision screens, so it will be important to ensure that sound and video are consistent in the vertical and horizontal directions. As such, we are conducting R&D on a 22.2 multichannel audio format (Figure 1) for use with SHV, which expands the earlier 5.1 channel sound system in the vertical direction and enhances the sense of being enveloped in sound.


Figure 1. Speaker arrangement for 22.2 channel audio system


Standardization

During FY2009, we continued to promote standardization efforts for codings and formats related to 22.2 multichannel sound. For coded transport, we revised the MPEG-2 AAC standard to accommodate 22.2 multichannel sound. 22.2 multichannel sound has also been adopted as one of the recommended audio modes for the ARIB STD-B32 standard. We also proposed a "3D Multichannel Studio Sound Format Standard" for use in studios to the ITU-R. At the IEC, proposals for channel naming and transmission procedures for multichannel audio systems have moved to the international voting stage. SMPTE has also issued a standard (SMPTE299-2009) for transmission of 96 kHz sampled audio. Finally, AES has issued a standard (AES11-2009) for synchronization of UHDTV video and digital audio.


22.2 Multichannel sound production systems

We are conducting R&D on a system to simplify and increase the sophistication of 22.2 multichannel sound production. In FY2009, we developed a mixing system able to control the directionality of up to 1000 individual tracks of audio, a one-point microphone making it easy to capture 22.2 multichannel sound in the field, a 22.2-multichannel-sound speaker system for use in production, and an automatic equalizer method for correcting the frequency characteristics for the speaker system (Figure 2). We also developed a new 3D panning method and verified its capabilities in subjective evaluation tests. As a result, we found that the sound localization produced by the system matches human hearing closely in the horizontal direction, but that if the opening in the speakers exceeds 30 in the vertical direction, it begins to diverge significantly from human hearing. Also, in live multi-channel SHV satellite relay tests using the Kizuna Wideband satellite, we conducted the first ever live relay of all channels of 22.2 multichannel sound captured in the field.


Figure 2. 22.2-Channel production systems under development


22.2-channel sound for homes

With the goal of introducing 22.2 multichannel sound into homes, we are advancing research on signal processing that will allow sound reproduction with fewer loudspeakers while maintaining the sound's spatial impression. In FY2009, we developed a method to automatically convert 22 channels into 8 channels, while maintaining sound pressure and directionality at the listening point. We also developed a method for reproducing 22-channel sound using only three forward speakers, by using the Head-Related Transfer Function, which represents the propagation characteristics of sound arriving at both ears from various directions. We also performed experiments to investigate the perception of the apparent sound source's elevation when reproducing sounds on loudspeakers and headphones. This research was in order to improve the spatial reproduction capabilities of the 22.2 multichannel headphone processor. We found that for sound coming from directly in front, the perceptual resolution of the sound's elevation was degraded through loudspeakers when the elevation angles exceeded 70 degrees or through headphones when the angle exceeded 40 degrees .


Psychology of hearing

We are also advancing research related to the processes contributing to the senses of presence and feeling ("Kandoh": deep emotional feelings) in order to evaluate the sound reproduction of high-presence audio systems in terms of a variety of factors. Conventionally, bipolar adjective pairs such as "clarity" and "muffledness" have been used to express impressions about sound, but words like muffled, for example, have negative nuance, so the impression given by a sound can be swayed by emotional preferences. Thus, we have proposed a two-dimensional metric for evaluating sound, with a sonic axis expressing impressions in terms of physical characteristics, and a preference axis expressing level of like and dislike. We have also investigated changes in impression due to differences in intensity when listening to music. As a result, we found that an optimum intensity can be estimated from the maximum value on the preference axis. We also conducted experiments to evaluate changes in the position of loudspeakers on impressions of expansiveness while listening to music. These impressions have a strong correlation with kandoh. Scores evaluating kandoh, together with those evaluating "sense of expansiveness", also have a strong correlation with the interaural cross correlation, which can be measured physically. This shows that kandoh is promoted by the expansiveness of the sound, and scores for kandoh can be estimated from a physical quantity.






Copyright NHK (Japan Broadcasting Corporation) All rights reserved.

NHK STRL
ANNUAL REPORT 2009
NHK STRL