List of accepted papers in session order
List of demonstrations
List of Authors
Instructions for preparing presentations, posters and demos can be found in the
Author's guide.
Please note! Paper number 199
has been updated and the new version is
available here in PDF-format.
Due to an unfortunate error in the process for mastering the
conference proceedings, an older version of this paper was included in the
proceedings. We apologize for the inconveniece caused by this. Please refer
to this updated version of the paper instead of the one in the proceedings.
A limited number of the proceedings is available
from the AES Webstore.
Invited papers
Jens Blauert, professor
Institut für Kommunikationsakustik, Ruhr-Universtität Bochum, Germany
Instrumental Analysis and Synthesis of Auditory Scenes: Communication Acoustics
Abstract:
We propose in the following to use the cover label Communication
Acoustics for those branches of acoustics which are closely related to
the information technologies and computer sciences. After a short
review of the relevant research fields of the Institute of
Communication Acoustics at Bochum, Germany, two areas are dealt with in
more detail, namely, instrumental analysis and instrumental synthesis
of auditory scenes. In both areas it shows up that cognitive and
multi-modal phenomena have to be taken into account. Thus, future
communication-acoustical systems shall most probably increasingly
contain knowledge-based and multi-modal components - and, further,
future communication-acoustical system will be embedded as subsystems
into more complex systems. This technological development trend will
coin the future of Communication Acoustics in the context of the
information technologies.
Jürgen Herre, Dr.-Ing.
FhG-IIS A, Audio & Multimedia, Erlangen, Germany
Audio Coding - An All-round Entertainment Technology
Abstract:
Audio compression technology has become a rapidly developing core
technology within the recent decade. Starting out from early basic
research, its potential has received increasing recognition and has led
to the wide spread deployment of this technology in numerous novel
applications, and many more exciting applications are still to come in
the near future. This paper attempts to provide an overview of the most
relevant aspects of the audio coding (r)evolution and illustrates how
audio coding has evolved over time to accommodate the needs of a wide
range of entertainment applications.
Xavier Rodet, professor
IRCAM, Paris, France
Present state and future challenges of synthesis and processing of the
singing voice
Abstract:
The synthesis of the singing voice has been a topic of study for more
than 35 years. Recent work shows that the musical and natural quality
of singing voice synthesis has evolved enough for high fidelity
commercial applications to be realistically envisioned. This paper
begins by presenting a brief historical perspective of synthesis
methods, control strategies and research in this field. The synthesis
methods themselves, i.e. the synthesizers which compute the sound
signal samples, are then presented. These range from models of the
physics of the vocal apparatus to models of the signal produced by
human singers and voice processing techniques. The next section of the
paper presents control strategies, rules and a variety of recorded
data which are employed to compute the parameter values for
synthesizers. Different levels of such rules are needed, from the low
level rules describing details of articulation, to those of a higher
level which implement various aspects of musical interpretation. Some
aspects of choir singing synthesis are also considered. Recent research
and accomplishments are presented as well. Future challenges include
synthesizer models improvements, automatic estimation of model
parameter values from recordings, learning techniques for automatic
rule construction and, last but not least, gaining a better
understanding of the technical, acoustical and interpretive aspects of
the singing voice. Sound examples will be played at the conference.
Peter Svensson, professor
NTNU, Trondheim, Norway
Computational Modeling and Simulation of Acoustic Spaces
Abstract (with Ulf. R. Kristiansen):
The computational modeling of acoustic spaces is fundamental to many
applications in auralization/virtual acoustics. The demands vary
widely, from real-time simulation in multimedia and computer games, to
non-real time situations with high accuracy needs, such as prediction
of room acoustic conditions in music performance spaces. Acoustic
spaces include single room or multi-room spaces, with simple or complex
geometries and boundary conditions. Outdoor spaces can range from city
environments to open landscapes. Sound transmission through partitions
is an important issue in some cases. This presentation gives an
overview of techniques used in the various auralization applications.
Aspects of accuracy and computational efficiency will be discussed, as
well as which acoustical phenomena can and cannot be modelled with
various techniques.
A total of 57 full paper submissions was sent to the conference. Based on
the results of a scientific review process, 44 papers were
accepted to be presented at the conference and published in
the conference proceedings.