AES22 June 15-17, 2002, Espoo, Finland
International Conference on Virtual, Synthetic and Entertainment Audio

Site Navigation Map

Conference papers

Get paper details by paper number or session id:

List of accepted papers in session order

List of demonstrations

List of Authors

Instructions for preparing presentations, posters and demos can be found in the Author's guide.

Please note! Paper number 199 has been updated and the new version is available here in PDF-format. Due to an unfortunate error in the process for mastering the conference proceedings, an older version of this paper was included in the proceedings. We apologize for the inconveniece caused by this. Please refer to this updated version of the paper instead of the one in the proceedings.

A limited number of the proceedings is available from the AES Webstore.

Invited papers

Jens Blauert, professor

Institut für Kommunikationsakustik, Ruhr-Universtität Bochum, Germany

Instrumental Analysis and Synthesis of Auditory Scenes: Communication Acoustics


We propose in the following to use the cover label Communication Acoustics for those branches of acoustics which are closely related to the information technologies and computer sciences. After a short review of the relevant research fields of the Institute of Communication Acoustics at Bochum, Germany, two areas are dealt with in more detail, namely, instrumental analysis and instrumental synthesis of auditory scenes. In both areas it shows up that cognitive and multi-modal phenomena have to be taken into account. Thus, future communication-acoustical systems shall most probably increasingly contain knowledge-based and multi-modal components - and, further, future communication-acoustical system will be embedded as subsystems into more complex systems. This technological development trend will coin the future of Communication Acoustics in the context of the information technologies.

Jürgen Herre, Dr.-Ing.

FhG-IIS A, Audio & Multimedia, Erlangen, Germany

Audio Coding - An All-round Entertainment Technology


Audio compression technology has become a rapidly developing core technology within the recent decade. Starting out from early basic research, its potential has received increasing recognition and has led to the wide spread deployment of this technology in numerous novel applications, and many more exciting applications are still to come in the near future. This paper attempts to provide an overview of the most relevant aspects of the audio coding (r)evolution and illustrates how audio coding has evolved over time to accommodate the needs of a wide range of entertainment applications.

Xavier Rodet, professor

IRCAM, Paris, France

Present state and future challenges of synthesis and processing of the singing voice


The synthesis of the singing voice has been a topic of study for more than 35 years. Recent work shows that the musical and natural quality of singing voice synthesis has evolved enough for high fidelity commercial applications to be realistically envisioned. This paper begins by presenting a brief historical perspective of synthesis methods, control strategies and research in this field. The synthesis methods themselves, i.e. the synthesizers which compute the sound signal samples, are then presented. These range from models of the physics of the vocal apparatus to models of the signal produced by human singers and voice processing techniques. The next section of the paper presents control strategies, rules and a variety of recorded data which are employed to compute the parameter values for synthesizers. Different levels of such rules are needed, from the low level rules describing details of articulation, to those of a higher level which implement various aspects of musical interpretation. Some aspects of choir singing synthesis are also considered. Recent research and accomplishments are presented as well. Future challenges include synthesizer models improvements, automatic estimation of model parameter values from recordings, learning techniques for automatic rule construction and, last but not least, gaining a better understanding of the technical, acoustical and interpretive aspects of the singing voice. Sound examples will be played at the conference.

Peter Svensson, professor

NTNU, Trondheim, Norway

Computational Modeling and Simulation of Acoustic Spaces

Abstract (with Ulf. R. Kristiansen):

The computational modeling of acoustic spaces is fundamental to many applications in auralization/virtual acoustics. The demands vary widely, from real-time simulation in multimedia and computer games, to non-real time situations with high accuracy needs, such as prediction of room acoustic conditions in music performance spaces. Acoustic spaces include single room or multi-room spaces, with simple or complex geometries and boundary conditions. Outdoor spaces can range from city environments to open landscapes. Sound transmission through partitions is an important issue in some cases. This presentation gives an overview of techniques used in the various auralization applications. Aspects of accuracy and computational efficiency will be discussed, as well as which acoustical phenomena can and cannot be modelled with various techniques.

A total of 57 full paper submissions was sent to the conference. Based on the results of a scientific review process, 44 papers were accepted to be presented at the conference and published in the conference proceedings.

The Audio Engineering Society AES22 Conference Site

Last modified: 2002-08-19 10:12:44 About these pages

The Helsinki University of Technology Nokia Conference Venue Papers Contact Information Author's Guide Conference Main Page Registering Details Important Dates and Deadlines Travel Guide Conference Program Conference Program Conference Venue Papers Author's Guide Contact Information Travel Guide Important Dates and Deadlines Registering Details Conference Main Page