Working Report, 1/2002 Laboratory Evaluation of Annoyance of Low Frequency Noise
Table of contents
Foreword and acknowledgementsThe project described in the present report was initiated by the Danish Environmental Protection Agency and made possible by a collaboration contract between the Agency and The Department of Acoustic Technology at The Technical University of Denmark in Lyngby, Denmark. We would like to thank all test subjects for collaboration and patience during the time consuming and sometimes boring - listening tests. Especially the four persons who are suffering from low frequency noise in their daily life are thanked for their willingness to participate in the investigation. DELTA, Lyngby, provided sound recordings for the listening tests. The Danish Environmental Protection Agency has followed the investigation with great and encouraging interest during all parts of the project. Tina Hjort is thanked for providing contact to the special group of test subjects and for a continued interest in the investigation and the results.
Lyngby, May 2001
Torben Poulsen
Introduction. Objective of the ProjectSeveral different assessment methods for the annoyance due to low frequency noise have been proposed during the later years. Also the noise limits or criteria values used with the different assessment methods differ. As a consequence of the ambiguity the Danish Environmental Protection Agency has asked for an investigation, where the subjectively assessed annoyance due to a number of practical examples of low frequency noise was compared with the predicted annoyance using different assessment methods. Such an investigation could indicate the best suitable method for assessment of low frequency noise. It could probably also indicate a need for an adjustment or a revision of the Danish assessment method that was published in Information No. 9/1997 from the Danish Environmental Protection Agency, "Low frequency noise, infrasound and vibration in the environment" [1]. The project reported here was made in cooperation between the Danish Environmental Protection Agency and the Department of Acoustic Technology at The Technical University of Denmark. M. Sc. Frank Rysgaard Mortensen prepared, conducted and analysed the listening tests, and Associate Professor Torben Poulsen was responsible for the quality control and for the major part of the report. Preliminary results from the project was presented at the conference "Low Frequency noise and Vibration" in Aalborg, Denmark, May 2000 [2]. The views that are expressed in this report do not necessarily reflect the attitude of the Danish Environmental Protection Agency. It is found however that the project represents a valuable contribution to the investigation about noise and its effect on people.
SummaryThe present report documents a project that was carried out in cooperation between The Department of Acoustic Technology, Technical University of Denmark and the Danish Environmental Protection Agency in year 2000. The purpose of the project was to investigate the subjective annoyance due to a number of low frequency noise examples. The main part of the project consisted of a series of listening tests, where a group of test persons listened to different examples of low frequency noise and gave their evaluation of the loudness of the noises and the annoyance of the noises. The main group of test persons (reference group) was eighteen young persons with normal hearing. The test persons evaluated eight different noise examples, where one example was road traffic noise and the other examples were realistic samples of environmental low frequency noise. The noise examples included both stationary noise with and without tones, intermittent noise (from a passing ferry), music, and impulsive low frequency noise from a punch press. The noises were presented in an IEC listening room at three different LAeq levels between 20 and 35 dB, and each presentation was repeated once in a random order. The subjective assessments made by the test persons were compared to objective results from a number of recent methods for assessment of low frequency noise. A total of seven different assessment methods were used. It was found that the Danish assessment method, published in Information No. 9/1997 from the Danish Environmental Protection Agency, "Low frequency noise, infrasound and vibration in the environment" gave the best relation to the subjective assessments made by the test persons. There was no obvious indication in the present results of how to further improve the Danish method. Apparently one important property of this method is that it includes a 5 dB penalty in case of impulsive noise. In addition to the main investigation with 18 young normal hearing test persons, listening tests were also made with a small group of four test persons, the special group. The special group consisted of four persons that had reported annoyance due to low frequency noise in their homes. It was found that the special group assessed the noise examples in a different way than the reference group. Generally the annoyance was much higher, especially the annoyance at night. It was also found that the two groups evaluated the noise samples differently. It was not the same types of noise that the two groups found most and least annoying. Due to the limited number of persons in the special group, no explanations of these differences can be indicated within the frame of the present investigation.
Udvidet resuméDenne rapport dokumenterer et projekt, som blev udført i samarbejde mellem Institut for Akustisk Teknologi på Danmarks Tekniske Universitet og Miljøstyrelsen i år 2000. Projektets formål var at undersøge genevirkningen af en række eksempler på lavfrekvent støj. Baggrunden for projektet er, at der de seneste år er offentliggjort flere forskellige vurderingsmetoder, som alle har til formål at sætte mål på genevirkningen af lavfrekvent støj. Metoderne adskiller sig en del fra hinanden, og de tilhørende kriterie- eller grænseværdier er også temmelig forskellige. Der er således nogen usikkerhed om, hvordan en objektiv vurdering af genevirkningen fra lavfrekvent støj kan udføres. Resultatet af projektet kunne muligvis pege på en velegnet objektiv metode til vurdering af lavfrekvent støj, eller på behovet for en revision eller en justering af den nuværende vurderingsmetode, som er beskrevet i Orientering fra Miljøstyrelsen nr. 9/1997 "Lavfrekvent støj, infralyd og vibrationer i eksternt miljø" [1]. Projektets hoveddel udgøres af lytteforsøg, hvor forsøgspersoner vurderede en række eksempler på lavfrekvent støj med hensyn til lydstyrke og oplevet gene under forudsætning af, at støjen kunne høres om dagen og aftenen eller om natten. Hovedgruppen af forsøgspersoner bestod af 18 unge mennesker med normal hørelse. Forsøgspersonernes hørelse blev kontrolleret både ved normal audiometri (i frekvensområdet 125 Hz - 8 kHz) og i lavfrekvensområdet ned til 31 Hz. Der blev benyttet 8 forskellige støjeksempler, hvor et eksempel var vejtrafikstøj, som skulle bruges som reference idet netop genen af vejstøj er godt belyst i litteraturen. De syv andre støjeksempler var realistiske lydoptagelser af lavfrekvent ekstern støj. Der indgik stationær støj både med og uden toner, intermitterende støj (fra en forbisejlende færge), musik, og impulsagtig støj fra en smedehammer, hvor lyden var transmitteret igennem jorden. Alle støjeksemplerne blev præsenteret via højttalere i et IEC lytterum, og de blev afspillet ved tre forskellige LAeq niveauer mellem 20 dB og 35 dB. Hver præsentation blev gentaget to gange. Forsøgspersonernes subjektive vurdering af støjens genevirkning blev sammenholdt med objektive målinger på de samme støjtyper. Syv nyere vurderingsmetoder til brug for lavfrekvent støj blev benyttet. Ud over den danske vurderingsmetode blev den svenske (SOSFS 1996:7) og den tyske metode (DIN 45 680) benyttet tillige med et forslag til en polsk og to hollandske vurderingsmetoder. Endelig blev det forsøgt at bruge støjens C-vægtede lydtrykniveau som parameter. Det blev fundet (rapportens afsnit 6.8) at den danske vurderingsmetode (Miljøstyrelsens orientering nr. 9/1997) giver den bedste sammenhæng med den subjektive vurdering. Undersøgelsen har ikke indikeret hvordan metoden eventuelt kan forbedres eller justeres. Det tyder på, at en vigtig egenskab ved metoden er, at grænseværdien skærpes med 5 dB når der er tale om impulsagtig lavfrekvent støj; den skærpede grænseværdi er i undersøgelsen også brugt for den støjtype som bestod af musik. Ud over hovedundersøgelsen blev der gennemført tilsvarende lytteundersøgelser med en mindre gruppe af forsøgspersoner, specialgruppen. Der er tale om fire personer, som oplever at være generet af lavfrekvent støj i deres hjem. Det var ikke et kriterium, at deltagerne i specialgruppen skulle have normal hørelse; nogle af forsøgspersonerne havde et markant høretab ved høje frekvenser. Specialgruppen havde ikke bedre hørelse ved lave frekvenser end referencegruppen. Undersøgelserne viste, at specialgruppen havde en anderledes vurdering af støjeksemplerne end referencegruppen. Først og fremmest angav specialgruppen en tydeligt kraftigere gene for de samme støjeksempler, især genen om natten var meget højere end referencegruppens vurdering. Forskellen mellem de to gruppers vurdering svarer omtrent til, at støjens styrke blev forøget med 15 dB. Dernæst blev det fundet, at det ikke var de samme støjtyper som de to grupper af forsøgspersoner fandt henholdsvis mest og mindst generende. På grund af det lille antal forsøgspersoner har det ikke været muligt at gå i detaljer for at finde en forklaring på de meget forskellige vurderinger.
1. Background
1.1 InfrasoundSound in the frequency range below 20 Hz is defined as infrasound. Previously it was believed that infrasound was inaudible, and the three decades 20 Hz 20 kHz was often called the audible frequency range. We now know that infrasound can be heard provided it is strong enough. The threshold of hearing is determined at least down to 4 Hz, and there is a reasonable agreement between the recently measured thresholds. The G-weighting function has been standardised in ISO 7196 [3] and has a close relation to the shape of the hearing threshold in the infrasound region but also includes a sharp cut off at frequencies above 20 Hz. A determination of equal loudness level contours in the infrasound region at 20, 40, and 60 Phones was reported as early as in 1984 [4]. The slope of the lowest loudness level contour was comparable to the slope of the threshold curve, while the higher loudness level contours had slightly less slope. As a first approximation it may thus be expected that the G-weighted infrasound level would predict the strength or loudness of the infrasound, at least when the level is not too high above the threshold. Infrasound is usually not perceived as a tonal sound but rather as a pulsating sensation, pressure on the ears or chest, or other less specific phenomena. The loudness and annoyance due to infrasound increase very quickly with increasing level. The hearing threshold for single tones is usually about 95 dB(G), and tones with a 20 dB higher level are expected to be sensed as very loud. It can be assumed that infrasound below the hearing threshold is not annoying. The recommended limit value for infrasound in dwellings in Denmark is 85 dB(G). The Danish Environmental Protection Agency explains that the purpose is to secure that even persons with a very low hearing threshold are not annoyed by infrasound. After the Information No 9 was published in 1997 a number of measurements of environmental infrasound has been made in dwellings. Only at rare occasions levels have been seen near the recommended limit value. In most situations the level is at least 20 dB lower than the 85 dB limit. 1.2 Low Frequency NoiseWhereas infrasound is a well-defined concept, low frequency noise is not. In the Information No. 9/1997 [1] low frequency noise is defined to comprise the frequency range 10 Hz 160 Hz, but other references including other assessment methods use other frequency ranges usually within the interval 8 250 Hz. Low frequency noise is noise with a strong content at low frequencies, giving the noise a rumbling character different to e.g. road traffic noise. The concept of low frequency noise was introduced as this type of noise was expected to be more annoying than noise without the strong low frequency content. This expectation is mentioned in Guideline on environmental noise from industry [5]. Here it is also mentioned that there is no universally accepted assessment method for low frequency noise at present. Since then, several countries have published measurement and assessment methods for low frequency noise; reference is made to Chapter 4 in the present Report. Many assessment methods use the spectrum of the noise (1/3-octave spectrum measured indoors), and compare this spectrum to a criterion curve, which must not be exceeded in any 1/3-octave band. In the Danish assessment method the noise is evaluated by the A-weighted level of the noise in the frequency range consisting of all the 1/3-octave bands from 10 Hz to 160 Hz. Depending on the actual conditions, many types of noise can be regarded as low frequency noise. The firing rate of many diesel engines is usually below 100 Hz, so road traffic noise can be regarded as low frequency noise as well as (diesel) train noise or noise from ferries. Similar considerations can be made for engines or compressors in industries or co-production plants. Burners can emit broadband low frequency flame roar. Low frequency noise can be noise or vibration from traffic or from industries, totally or partly transmitted through the ground as vibration and re-radiated from the floor or the walls in the dwelling. By this transmission, frequencies above approximately 20 Hz are attenuated. It is a general observation that indoor noise is perceived as more low-frequency-like than the same noise heard out of doors. 1.3 Motivation for the InvestigationThe Danish Environmental Protection Agency has the opinion that hearing and assessment of infrasound is reasonably well described. In addition it is extremely rare that infrasound in the environment is anywhere near the recommended limit value. Contrary to this there is an obvious need for investigations where the subjective annoyance due to typical examples of low frequency noise is compared to different objective measures of the strength of the same noises. There are several different features of the different assessment methods presently in use, and the corresponding limits or criteria values differ. The fundamental assumptions for the assessment methods are, however, largely the same. (I.e. the same investigations and the theories
2. Description of the Listening Tests
2.1 Noise ExamplesEight different noise examples were chosen for the listening tests. Example No. 1 was the noise from a densely trafficked six-lane highway. The traffic noise served as a reference noise, because there is a well-described relation between the level of road traffic noise and the annoyance of this type of noise. The other noise examples had all a strong low frequency content. The noise examples were: Table 1.
2.2 Noise duration, levels and presentationThe duration of all the noises was 2 minutes. The noise examples were either recorded indoor or filtered to simulate indoor noise. They were taken from DAT recordings to the hard disk of a PC where they were edited digitally. The examples are presented to the test persons at nominal levels of 20 dB, 27.5 dB, and 35 dB (A-weighted levels). In the listening room the sound was measured at the listening position and subsequently analysed to obtain the objective measurements of the noises. The noise examples were played directly from the PC via a D/A converter to a crossover filter and via four amplifiers to two broadband loudspeakers (KEF 105) and two subwoofers (Amadeus Sub). A detailed description of the set-up can be found in [7]. 2.3 Filtering of the Noise SignalsThe noise examples were filtered with two types of filtering prior to the presentation. One type of filter compensates for resonance modes in the listening room. The listening position was chosen so that only one mode at 45.5 Hz influenced the sound level. At this mode the room mode increases the sound pressure level by about 18 dB, which is compensated for by a narrow band notch filter as illustrated in Figure 1. This filter was used in all presentations. Figure 1.
Figure 2. The other type of filter was used only with those noise examples that were recorded outdoors. Based on the information in [8], [9] and [10] an outdoor-to-indoor filter was created that represents the reduction index of ordinary building materials and construction principles. The filter was defined in the range 16 Hz 4000 Hz and is illustrated in Figure 2. The level was further rolled off below 15 Hz to suppress wind noise and other errors in the recordings that would otherwise be enhanced. Also the level above 4000 Hz was attenuated, as this frequency range was not considered in the project. From a subjective evaluation, the noise examples sounded natural in the listening room. All the examples including the traffic noise had a pronounced low frequency characteristic. The outdoor-to-indoor filtering of the noise examples recorded outdoors changed also the characteristic of the traffic noise. Although the noise was easily recognised as traffic noise, the idea of using this noise as a kind of reference became less obvious. It seems that the test subjects regarded the traffic noise as one of the low frequency noises. 2.4 Test SubjectsEighteen young persons (9 males and 9 females) with normal hearing were chosen as reference group for the listening tests. The age of the test persons was between 19 and 25 years. This group of listeners is called Reference group. In addition, listening tests were made with a small group of people that have reported annoyance due to low frequency noise in their homes. This group (called Special group) consisted of four persons aged between 41 and 57 years, two of each gender. The inclusion of the special group was regarded as a pilot experiment only because of the small number of test persons and the significantly higher age range. The main results of the project will be based on the listening tests with the reference group. The results from the pilot experiment with the special group are described in Chapter 7. Pure tone audiometry was made in the frequency range 125 Hz to 8000 Hz with a Madsen Midimate 602 audiometer, equipped with Sennheiser HDA 200 earphones. The calibration of the audiometer was made with the values from [11] which are practically identical to ISO 389-8 [12]. The results of the audiometry are given directly as hearing threshold level in dB HL. Hearing threshold levels at or below 15 dB HL was accepted in the frequency range 125 Hz to 4000 Hz, and a hearing threshold level at 20 dB at a single frequency (including 8 kHz) was also accepted. The average hearing threshold is shown in Figure 3. In addition to the conventional audiometry, the hearing threshold in the low frequency range was determined. The tests were made with pure tones at 31 Hz, 50 Hz, 80 Hz, and 125 Hz, and the signal length was 800 ms. A computer controlled Tucker Davis system with Sennheiser HDA 200 earphones was used to measure the thresholds according to the Two Alternative Forced Choice method. The present implementation of the procedure determines the 79.4 % point of the psychometric function. A detailed description of the procedure can be found in [13]. The results of these hearing threshold measurements were given as sound pressure level (SPL). No reference values are available at present for the HDA 200 earphone at frequencies below 125 Hz and therefore the measured SPL-thresholds were compared to the standard hearing thresholds given in ISO 389-7 [14]. In this way the deviation from the standardised ISO threshold could express approximate, artificial dB HL values. The average of these artificial hearing levels are also shown in Figure 3. It can be seen in Figure 3 that the two sets of curves differ by 5 dB at 125 Hz. This is due to the artificial reference for the low frequency curves and also because the definition of threshold differs in the two methods. Figure 3. From Figure 3 it is seen that the threshold of the reference group has a slight decrease at the highest frequencies (6 kHz and 8 kHz). The special group show a hearing loss at the highest frequencies above 2 kHz. It should be noted though, that the data for the special group are average values over just four persons. Some of the persons had a considerable hearing loss, partly due to a higher age. It is assumed that the high frequency hearing loss does not influence the subjective evaluations of the low frequency noises used in the present investigation. The low frequency part of the hearing thresholds (Figure 3, 31 Hz 125 Hz) shows that the special group is about 5 dB less sensitive than the reference group. At 31 Hz the (average) thresholds are almost the same. The threshold data show that the special group of people who have reported annoyance due to low frequency noise in their homes do not have higher (better) hearing sensitivity at the low frequencies. The hypothesis that the special group should be able to hear the low frequency sounds more easily than the reference group is not supported by these hearing threshold measurements. 2.5 Subjects TaskThe test persons were given a written introduction to the tests, and they could ask about the procedure throughout the tests. A full training session was made prior to the listening tests. Information about the sound examples was given after all the tests were finalized. The tests persons were asked to answer four questions after each presentation:
All the scales were 10 cm long, and the response was measured in cm with a ruler and thus given as a figure between 0 and 10. An example of one of the annoyance scales is given here:
3. Primary Results of the Listening Tests
Table 2.
3.1 Statistical Analysis of Primary ResultsIn order to investigate the structure lying behind the average data, the raw data from each listening test were typed into a spreadsheet. For each data the following information was also recorded: the age and gender of the test person, the repetition number (round 1 or 2), the sound example number, the nominal presentation level, the measured A-weighted level (dB(A)), and the A-weighted level of the sound in the frequency range 10 Hz 160 Hz, LpA,LF. All data were subsequently transformed to a statistical analysis program (Statgraphics 4.0). It was found that if one disregarded data near the endpoints of the scales, the responses almost followed a normal distribution curve. However since many of the responses were near one or the other of the endpoints, the primary results were not normal distributed. Despite this lack of normality in the distribution of the data it was decided to perform an analysis of variance. The analysis was made for each parameter separately. Table 3 shows the significance levels of the influence from a number of different factors upon the evaluations of the reference group. If the number is less than 0.05, this factor has a significant effect on the evaluation on a 95% level or above (this means less than 5 % probability for drawing a wrong conclusion). If the number is above 0.05 it cannot be proved that this factor has a significant effect upon the relevant evaluation. Table 3.
The repetition number (round 1 or round 2 with the same presentation) has no significant influence, which shows the absence of a training effect. The gender of the test persons has influence on the evaluation of annoyance during the day and during the night but not on the evaluation of loudness and on the yes/no question about whether the noise is annoying or not. 3.2 Statistical Analysis of the Results from the Special GroupA corresponding analysis was made with the data from the special group. Since this group has only four persons the data are very uncertain and highly dependent on random variations. The result of the analysis is shown in Annex A. It is found that the noise level influences the evaluations. The influence from noise example on the annoyance evaluations is just at the limit of being significant.
4. Assessment Methods for Low Frequency Noise
4.1 Danish MethodThe Danish method [1] is described in "Information No. 9/1997 from the Danish Environmental Protection Agency". It also gives the recommended limit values for low frequency noise and infrasound. The recommended measurement method is specified. The noise is measured in several positions indoor, and is analysed in 1/3-octave bands. The nominal A-weighting corrections are added to the spectra, and the weighted spectrum is summed to form the A-weighted level of the noise in the frequency range 10 Hz 160 Hz. The resulting level is called LpA,LF. It would not be possible make a direct measurement of the A-weighted level, LpA,LF, since the minimum limit of the tolerance for the A-weighting filter is undefined (i.e. minus infinity) below 20 Hz. The nominal A-weighting is shown in Table 4. Table 4.
4.2 German MethodIn the German method [15, DIN 45 680 / 1997] low frequency noise is defined as noise where the C weighted noise level is at least 20 dB higher than the A-weighted level, based on either equivalent levels or maximum levels. If the noise is evaluated to be low frequency, a 1/3-octave frequency analysis is made. The method considers the frequency range 10 Hz 80 Hz, but in special situations the 8 Hz and / or the 100 Hz band can be included. The method applies to rooms in dwellings where people stay or rests. In an Annex to the method a range of limit or criteria values are given for the day period (06 22) and for the night period (22 06). In the German method, distinction is made between tonal noise and noise without tones. If the level in a particular 1/3-octave band is 5 dB or more above the level in the two neighbouring bands, the noise is said to be tonal.
The hearing threshold, LHS, used in the German assessment method is given in Table 5: Table 5.
4.3 Swedish methodThe recommendations from the Swedish National Board of Health and Welfare (SOSFS 1996:7) [16] give guidance to an assessment of whether noise under different conditions may have health effects. The recommendation comprise a criteria curve of recommended maximum levels of low frequency noise in rooms used for living (see Table 6). The curve covers the frequency range 31.5 Hz 200 Hz and applies to the equivalent level of the noise. A measurement method is specified and is described in a report from the Swedish Testing Institute [17]. If the noise level exceeds the criteria curve in any 1/3-octave band, the health and environmental authorities may characterise the noise as a sanitary nuisance. Table 6.
4.4 Polish MethodAlso the Polish method applies a threshold curve. This is defined in the frequency range 10 Hz 250 Hz, and corresponds to 1/3-octave levels each giving an A-weighted level of 10 dB (i.e. 10 dB above the inverse A-weighting correction). The criterion curve is called LA10 [18]. The noise is considered annoying if both of these conditions are met:
It is mentioned in [18] that usually the background noise is somewhat higher than the criterion curve at the highest frequencies, above 100 Hz. 4.5 Dutch Proposed Method, SlovenThis proposed method [19] is intended for use along with granting environmental permission to industries and enterprises. Also this method uses a criterion curve. It is defined in the frequency range 10 Hz 200 Hz. In the upper part of the frequency range the criterion curve agree well with the Swedish criterion curve. At the lowest frequencies, where the Swedish curve is not defined, it corresponds with the hearing threshold as specified in the German method. It is expected that annoying low frequency noise will occur if the criterion curve is exceeded in one or more 1/3-octave bands. 4.6 Dutch Criterion for AudibilityThis method is described in [20]. It is intended for use in cases where people complain about low frequency noise to decide if audible low frequency noise occurs. The aim of the method is not to verify whether the noise is annoying or not. The method employs a hearing threshold based on the best 10 % of a non-selected population aged 50 60 years. The threshold curve is used in the frequency range 20 Hz 100 Hz. 4.7 C-weighted Sound Pressure LevelIn the German method the difference between the C-weighted and the A-weighted sound pressure level is used to determine if low frequency noise is present. Similar rules of thumb have regularly been mentioned in the literature. 4.8 Comparison of Criterion CurvesIn Figure 4 the criterion curves from the different assessment methods are compared. The order is arranged after the criterion value at 100 Hz. The curves Sloven, Swedish, and Polish are criterion curves directly aimed at assessing if the noise is annoying. The curves Sloven and Swedish differ only in the frequency range 50 Hz 80 Hz, where the Swedish curve is clearly lower than Sloven. In the entire frequency range the Polish curve is lower than the two other curves. Here it must be remembered that the background noise is also part of the Polish criterion, which will often have a relieving influence on the criterion curve at high frequencies, but this part of the method is not considered here.
Figure 4. The curve German is a hearing threshold curve and is used as a criterion for tones in the noise. It is allowed for tones to exceed the curve by 5 dB during daytime, and higher a higher exceeding is allowed at higher frequencies. The curve Dutch is used in cases with complaints in order to decide whether there is audible noise in the relevant frequency range. The curve is not used to determine if the noise is annoying. It can be seen that the German and the Dutch threshold curves are almost identical, and that they almost coincide with the curve Sloven below 40 Hz. The Polish criterion curve is clearly lower than the threshold curves in the frequency range below ca. 30 Hz.
5. Objective Analysis of the Noise Examples
5.1 One-third octave spectraBased on the 1/3-octave analysis of the noise at the listening position in the listening room, a range of objective measurements have been made for each noise example. All the levels and spectra are given in Annex B. As an example, the spectrum of the traffic noise is shown in Figure 5 below. The spectrum is shown for the three nominal presentation levels: 35 dB(A), 27,5 dB(A) and 20 dB(A). The spectrum is a typical low frequency spectrum with a maximum around 20 Hz. The spectra of the other noise signals are shown in Annex B.
Figure 5. 5.2 Comparison to criterion curvesIn this section the different objective measures, resulting from the different assessment methods are discussed. The discussion is limited to the results from the presentations at the highest level 35 dB, since there are no systematic differences between these results and the results from the two presentations of the same noises at lower levels (apart from the change in level). The objective results are used as parameters in the comparison with the subjective evaluation, Chapter 6. In Table 7 it is shown for each noise example how much each of the criterion curves is exceeded, and at which frequency the highest excess occurred. Also the Danish measure LpA,LF is given, and the German A-weighted level (of the frequency bands exceeding the hearing threshold) Table 7.
It is generally seen that the Swedish and / or the Sloven method gives less excess than the German method, the Polish method, or the Dutch method (for audibility). With a view to the Polish method, it has to be remembered that the background noise criterion, which normally will relieve the criterion curve at higher frequencies, has not been considered here. Furthermore the German criterion curve has not been used correctly in this comparison: firstly it only applies to tones in the noise, and secondly the German method specifically mentions a relief of the criterion curve in the 80 and 100 Hz bands which was not applied here. Finally, the Dutch method (for audibility) is not a method for assessing annoyance, so it is not surprising that it gives a stronger assessment than some of the other methods.
6. Comparison of Subjective Evaluation of Annoyance with Objective Measures
In every evaluation the test person was asked to assess the annoyance for two different situations: if the noise was heard in the day and the evening, and if the noise was heard at night. It has been found that there is a very close relation between these two assessments, as is illustrated in Figure 6. Generally the annoyance at night is slightly larger than annoyance in the day / evening.
Figure 6. The relation between the pair of evaluations can be described by the correlation coefficient, which is as high as 0.9885. This indicates that it is not necessary to perform a complete analysis on both the assessments, as they are not independent. In the following only the subjective assessment of the annoyance in the night period is considered. 6.1 Danish MethodIn this section a thorough description of the analysis process is given. The same analysis method is applied to the other assessment methods, but for the other assessment methods only the results and some comments are given. In Table 8 below all the noise examples are shown, the names are given in the first column. In the second column the excess of the Danish limit is shown. For the night period the limit value is LpA,LF = 20 dB, but since the drop forge as well as the discotheque are considered as impulsive noises, the limit for these noises is 15 dB. The third column gives the average assessment made by the test persons. Column 2 and 3 are from the presentation with a nominal level of 20 dB, the lowest presentation level. Column 4 and 5 are form the presentations with the middle presentation level, nominally 27.5 dB, and column 6 and 7 are from the loudest presentation with 35 dB nominal level. Table 8.
Figure 7. The Figure illustrates the subjective evaluation as a function of the excess of the Danish criteria. It is seen that a straight line (not shown in the figure) can represent the group of points. This line is found by linear regression (least squares method, in practise the function LINEST of an Excel spreadsheet is used). The regression line has the formula: y = 1.61 + 0.26 * x Where y represents the (average) subjective evaluation made by the test subjects and x represents the excess of the limit in dB. The regression line does not explain how well it represents the group of points. Here we can use the residuals, which are the vertical distance between the points and the line. We also use the average of all y-values and the y-values determined by the regression method for each x-value. The determination coefficient or degree of explanation (r2) is defined as:
where SSe is the residual sum of squares, and SSr is the regression sum of squares. In practice r2 is calculated by the Excel function LINEST. If r2 equals 1.00 there is a perfect linear relationship between the points. If r2 is close to zero, the regression line cannot be used to explain the relation between x and y. In other words, the r2 value indicates how well the points can be described by a straight line. The closer the value is to 1, the better the description. The relation between x and y can also be described by the correlation coefficient, r . This is calculated as the ratio between the covariance of x- and y-values, and the product of the x-variance and the y-variance:
The covariance is calculated as the deviation between the x-value and the x-average, multiplied by the deviation between the y-value and the y-average. The x-variance, s x, is the deviation between the x-value and the x-average squared. Similarly the y-variance, s y, is the deviation between the y-value and the y-average squared. The correlation coefficient is calculated by use of the Excel function CORREL. It explains the degree of relation between the x and the y values; it gives a coarse indication of the shape of the swarm of points. If r is close to 1 (or 1) the shape of the group of points has the shape of a cigar around the regression line. If r is close to zero, the points lie in a diffuse cloud and there is no obvious relation between x and y. There is an important difference between the degree of explanation and the correlation coefficient. The degree of explanation, r2, assumes a functional relationship between y and x (e.g. the subjective evaluation is caused by the noise). The correlation coefficient does not assume such causality and can be calculated from any two datasets. E.g. in chapter 7 a correlation coefficient is calculated for the annoyance evaluations made by the reference test subject group and the special test subject group. In this situation it is obvious that the ratings made by one of the groups are not caused by the ratings made by the other group. The values calculated for the Danish method are summarized in the following table.
6.2 German A-levelFigure 8 shows the A-weighted level of the noise examples, calculated according to the German standard DIN 45 680. The level is calculated as the sum of the A-weighted levels of those 1/3-octave bands that exceed the hearing threshold. All the noise examples are used in this calculation, including those examples where the noise contains tones.
Figure 8. It is seen that the points fall in two groups (two lines); the upper points are the noise examples from drop forge, discotheque, and compressor. The German method (in the present interpretation) obviously cannot give a sufficient assessment of impulsive noise (drop forge and discotheque). The degree of explanation is not too convincing, r2 being 0.54, though r is 0.73.
6.3 German Tonal MethodFigure 9 shows the relation between the subjective evaluation and the tone level above the hearing threshold for those noises that contain tones. The tone/no tone decision was made according to the German assessment method. In the figure only one point per noise is shown. This point corresponds to the tone with the greatest excess of the threshold. The Gas Turbine has two tones above threshold at the highest presentation level (triangle) and one at the intermediate level (square). There is no tone above threshold at the lowest presentation level. The Generator has one tone above threshold at the highest level (triangle), at the intermediate level (square) and at the lowest level (diamond). The tone is at 80 Hz and thus the level has been reduced by 5 dB according to the German assessment method. The Cooling Compressor has two tones (50 Hz and 100 Hz) above threshold at the highest presentation level (triangle) and at the intermediate level (squares). As one of the tones is at 100 Hz the level for this tone must be reduced by 10 dB according to the German assessment method. This makes the other tone (at 50 Hz) the greatest. There is only one tone above threshold at the lowest presentation level (diamond).
Figure 9. The calculations gave the following results that are very similar to the results obtained from the German A-level method (see section 6.2).
6.4 Swedish MethodThis method uses a criterion curve that must not be exceeded in any 1/3-octave band. Figure 10 shows the subjective assessment as a function of the greatest excess.
Figure 10. The result of the regression and the correlation analysis is
It may be seen from Figure 10 that three points fall to the left of the rest of the points. These three points are from the discotheque. Obviously this type of noise should have been assessed about 10 dB higher for the points to fit into the rest of the points. The relative low values of r2 and r are mainly caused by these points. An analysis where the discotheque points have been discarded gave the following result
6.5 Polish MethodFigure 11 shows the excess of the Polish criterion curve, which is a curve of 1/3-octave band levels each of which corresponds to an A-weighted level of 10 dB. The other part of the Polish method, which deals with the excess of the background noise level, has not been considered.
Figure 11. It may be seen that the noise examples fall in two rather distinct groups, where (again) the discotheque, the drop forge and the cooling compressor are to the left of the remaining points. They should have been assessed at least 5 dB higher by the method to align with the remaining points. If one looks at the three groups of points from each nominal level separately (diamonds, squares and triangles), it appears as if the points in each group shows a tendency to a downwards slope to the right; that is, the subjective evaluation decreases as the max excess increases within each group. The degree of explanation is not convincing, r2 being 0.50.
6.6 Dutch Proposal, SlovenThe proposed assessment for use with environmental approval of industries also employs a criterion curve. As is seen in Figure 12 the points spread a lot, and no clear picture can be seen. If one again focuses at the three groups of points from the same nominal level, it appears as if these points slope the wrong way as it was seen with the Polish method. As an example (filled squares) it can be seen that when the excess increases from 1 dB to 12 dB, the subjective evaluation decreases from 5.5 to 2.5.
Figure 12. Both the degree of explanation and the correlation are rather poor.
6.7 C-weighted LevelFigure 13 shows the relation between the C-weighted level of the noise examples and the subjective evaluation. It can be seen that the spread of the points is very large and the correlation is poor. Only the frequency range 10 Hz to 160 Hz is included in the calculation of the C-weighted level.
Figure 13.
6.8 Overview of the ResultsThe results of the analysis of the relation between the subjective evaluation and the various objective measures are summarized in table 9 below. Table 9.
It can be seen from the results that the assessment method with far the best correlation between the subjective and the objective assessment is the Danish method, using the A-weighted level in the frequency range from 10 Hz to 160 Hz. The degree of explanation is high, and there is no obvious indication in the results obtained here of ways to further improve the method. The second best method is either the Swedish method, based on a criterion curve, or the German method using the A-weighted level. It is seen from the rightmost column that a noise that would exceed the Danish criterion by 1.5 dB will give rise to a subjective annoyance of 2 (on a scale from 0 to 10). The same noise would be 0.5 dB below the Swedish criterion and would give rise to a German A-weighted level as low as 15.7 dB. Thus it can be stated in broad terms, that when the assessment methods are compared in this way with representative noise examples and calibrated to the same subjective annoyance
These results cannot be used to decide which of the methods that are unnecessarily strict or too relaxed, since the results are based on a limited number of noise examples only.
7. Results obtained with the Special Group
7.1 Comparison of the two groups of test personsIn the first series of analysis, the evaluations made by the two groups of test persons are compared. This is illustrated in the following four Figures (14-17), showing how the reference group and the special group evaluates the same noise examples with respect to:
Figure 14. There is a good correlation between the assessments of loudness between the two groups, and the correlation coefficient is calculated to be 0.82. The special group generally finds the noise examples somewhat louder than the reference group does. The points are rather close to a line that would be offset from but parallel to the line indicated in Figure 14 (showing a fictive 1:1 relationship).
Figure 15. The relation between the assessments of annoyance (day / evening) of the two groups, Figure 15, is less clear. The correlation coefficient drops to 0.75, and especially the group of points from the highest nominal level (triangles) shows a considerable scatter. In every case the special group finds the noise example more annoying than the reference group does. On the average the special group rate the annoyance at day/evening about 2 to 3 scale units higher that the reference group. An increase in the rating of 2 to 3 units corresponds roughly to an increase in level of about 10 dB.
Figure 16. For the assessment of annoyance at night, Figure 16, the picture is shifted. The special group finds the noises much more annoying at night than at day (or evening), and the difference between the assessments of the two groups increases significantly. Figure 16 shows a saturation phenomenon, that is, one or more of the test persons in the special group uses the maximum indication of the annoyance scale, and this will break the linear relation between the points. The correlation coefficient drops to 0.73. On the average the special group rate the annoyance at night about 4 to 5 scale units higher that the reference group. Such an increase in the rating of 4 to 5 units corresponds roughly to an increase in the level of about 17 dB.
Figure 17. Finally the assessments annoying yes / no of the two groups are shown in Figure 17. The scales show how many percent of the group that have marked the noise as annoying. Here the saturation is obvious, where all (four) persons in the special group have marked several noise examples as annoying. The Yes/No parameter has not been used in the previous analysis of the data from the reference group. 7.2 Comparison of the assessments of loudness and annoyance by the special groupThe relation between the assessments by the special group of loudness and annoyance (day / evening) of the same noise examples is illustrated in Figure 18. It is seen that the relation is almost linear, and the correlation coefficient is as high as 0.96. The loudness scalings are less than the annoyance scalings and thus the noises are perceived more annoying than loud.
Figure 18. The relation between the assessments of annoyance at day / evening and at night is illustrated in Figure 19. The non-linear relation due to saturation is clearly seen.
Figure 19. This indicates the need (in this case) for a longer response line or rather the need for a stronger assessment than very annoying. The group of points from the middle level (filled squares) is evaluated 2 3 units more annoying when they occur at night than at daytime, but the points from the loudest presentation are only indicated 1 2 units more annoying. For the special group the annoyance generally increase by two units from day to night corresponding roughly to a 10 dB change in the noise level. This can be compared to the results from the reference group, Figure 6 (in the beginning of Chapter 6) where the annoyance at night generally was rated about one unit higher than at day at all presentation levels. Such a one-unit change in the rating corresponds approximately to a 5 dB change in the noise level and supports thus the 5 dB penalty in the noise limits at night. 7.3 Comparison of subjective evaluation of annoyance with objective measuresIn the same way as in Chapter 6 the subjective evaluation of annoyance in the night period was compared to a number of objective measures, but in this case only the subjective evaluations from the special group were used. Table 10 below shows the subjective evaluation by the special group, and Table 11 shows the results of the statistical analysis made for each of the different objective assessment methods. Table 10
A very interesting result that is obtained by comparing Table 8 and 10 is that it is not the same noise examples that are evaluated as most annoying by the two groups. The reference group clearly found the drop forge, the discotheque, and the cooling compressor the most annoying. This rank would hold at any of the three presentation levels. In contrast, the special group found the generator the most annoying (at the lowest presentation level) and the discotheque as one of the lesser annoying sounds. Table 11.
Figure 20.
Figure 21. The other assessment methods show results without any particular trend like it is seen with the Danish assessment method. Obviously there is no strong connection between the subjective assessment made by the special group and the objective results found by the objective measuring methods. Since only four persons have made these assessments, it was evaluated that no more effort should be paid to optimise an objective assessment method to fit the special groups evaluations. However, the results obviously give rise to a number of questions about how low frequency noise in the environment is experienced and how it can be assessed. 7.4 Discussion of results from the special groupThe various scaling results show clearly that the special group made the annoyance evaluations differently from the reference group. The overall scaling value (averaged over all annoyance evaluations, presentation levels and noises) was 3,5 for the reference group and 6,7 for the special group. In other words, the special group evaluated the noises to be almost double as annoying as the reference group did. This may also be illustrated by ordering the noises from the most annoying to the least annoying. This is done in Table 12. Table 12.
For the special group the Generator is on top of the list whereas the Discotheque and the Traffic noise are evaluated as the least annoying. It is interesting that Traffic noise gives the lowest overall scaling. The value 5,6 is well below the next one (Discotheque) at 6,2. The order of the noise signals could indicate that the special group put more attention to those noises, which resemble the typical low frequency noises that they are complaining about.
8. General discussion
8.1 The Experimental MethodThe present investigation has been performed as a typical laboratory experiment in contrast to a field investigation. The advantage of a laboratory experiment is that it is possible to control almost all the experimental conditions (noises, levels, duration, presentation sequence, test subjects, etc). The disadvantage is that the presentations may not be realistic enough. The test subjects made the scaling by a mark on a 10 cm long horizontal line. The results from the special group of subjects showed in some cases a saturation effect that made the results more difficult to interpret. A way to alleviate the saturation effect could be to change the layout of the response line. An example of one of the annoyance scales was
but the saturation effect might have been less pronounced if the scale was made like this:
Another way of reducing the saturation effect could be to exchange the word very with a stronger adjective like e.g. extremely. The response sheet contained all the questions on one sheet. It has been argued that the different questions should have been on separate sheets in order to avoid a smudging effect from one question to the next. Also the wording of the questions might be revised even though no test subjects reported any difficulties with the questions. The amount of subjects (18 in the reference group; 4 in the special group) could be increased in order to obtain more certainty in the results. For the reference group it is believed that an increase in the amount of test subjects would not change the general results dramatically. For the special group, an increase in the amount of test subjects would certainly improve the validity of the findings as group results. On the other hand it may be difficult to handle the persons with low frequency problems as a homogeneous group of subjects. The problems they report are very different and thus it might be more relevant to handle the results from these test persons individually. Such an individual analysis has not been done in the present report. The noises constitute a reasonable broad selection of low frequency sounds. The noises were selected to represent typical low frequency noise known to produce community claims. In retrospect it would have been an improvement to include more noises with an impulsive character in order to better test the impulse penalty in the Danish method. All noises had clearly a low frequency character partly because of an outdoor-to-indoor filtering of the noises recorded out of doors. Traffic noise was included in order to serve as a reference noise, but due to the outdoor-to-indoor filtering the traffic noise was converted into another low frequency noise. In possible future research a real reference noise should be included. 8.2 Low Frequency Hearing ThresholdsThe low frequency pure-tone hearing threshold was measured for both subject groups. The result showed that the special group was less sensitive to low frequency sounds than the reference group. Although this is an interesting result, it might be more informative to measure the loudness growth curve for the test subjects. It is believed that the loudness growth curve would be a much better predictor for annoyance than just the hearing threshold. Measurement of loudness growth is very time consuming and is certainly not straightforward at low frequencies. 8.3 Criteria and Measurement MethodsThe results from the two groups seem to be different. All the subjects in the reference group evaluate the noises in almost the same way whereas more variance is seen in the special group. This could be caused by a bias in the special group who tend to put more emphasis on the type of noises that they complain about. There is a good agreement between the annoyance evaluations from the reference group and the Danish measurement/calculation method (including the impulse noise penalty). The same good agreement is not found for the special group. This raises a question about the aim or objectives of a measuring method. Shall such an evaluation method be made for the average person (the general population) or shall a method be made with special emphasis on the persons who react more pronounced to low frequency noise? The criteria and evaluation methods used in this investigation are all based on some kind of measurement of the noise level. For the reference group there is a clear connection between the noise level and the experienced annoyance and thus it makes sense to use such criteria and evaluation methods. For the special group this connection between level and annoyance is less clear and thus an evaluation method based on noise level measurements may be of little value for this subject group. 8.4 Statistical analysisIn the statistical analysis it was seen that the data deviated somewhat from a normal distribution partly caused by the saturation effect from the fixed endpoints of the scale. Despite this deviation in the distribution of the data the statistical analysis showed the expected effects and thus no attempt was made to correct for the saturation effect in the data. For the special group of subjects the saturation effect was pronounced for the annoyance ratings at night. In this case a simple linear regression may not be suited for a description of the data. A logarithmic or polynomial model would probably give a better fit to the data. Due to the relatively small amount of data in this subject group, no attempt has been made to do this. 8.5 Noise limits, Criteria curves and equal loudness level contoursIn chapter 5 and in chapter 6 a comparison with the noise limits and the criteria curves was made and as input data for the analysis the maximum excess of a criterion curve or the excess of a noise limit was used. It was decided to do it in this way because this is the way the criteria curves and the noise limits are used in practise. For the use of noise limits there is no problem because the level of the noise is calculated according to some rule and compared to the limit. The calculation of the level is based on the low frequency spectrum of the noise within a certain frequency range. For the criteria curves, on the other hand, the procedure may constitute a problem as only a single frequency band of the noise is used in the comparison and not the whole spectrum. Only the band where the maximum excess occur is taken into account and the excess at other frequency bands are neglected. From Figure 4 (in chapter 4) it is seen that the course of the different criteria curves differ somewhat above 40 Hz and this means that the various criteria curves will give very different results if the excess occur in this frequency range. It also means that the excess decision will be very dependent on the inherent measurement uncertainty in the measurement of the spectrum. The calculation of a level based on a spectrum is much less sensitive to the measurement uncertainty as the uncertainties are averaged in the calculation process. The measurement uncertainty is inversely proportional to the bandwidth of the analysing filter and also inversely proportional to the duration of the measurement (i.e. the integration time). This means that a one-third-octave analysis of a low frequency noise must be extended over a long period of time in order to keep the uncertainty below a certain limit. It is common practise to require that the standard deviation of repeated measurements shall be less then 0,2 dB. This corresponds to an integration time (in seconds) greater than 471/B where B is the bandwidth in Hertz of the analysing filter. For the one-third-octave filter at 10 Hz this means an integration time of almost five minutes. At 40 Hz a one-minute integration time is necessary and at 1000 Hz two seconds are needed. The noise signal shall be stable over this period of time but this is not always the case in practise. Uncritical use of criteria curves may be misleading. Some of the curves (e.g. the German and the Dutch) are hearing threshold curves and can therefore only be used to predict whether a nose is audible or not. If the curve is exceeded, the amount of excess cannot predict neither the loudness of the noise or the annoyance. This will depend on the shape of the spectrum of the noise. The use of weighting functions (such as G- and A-weighting) will not automatically give a loudness or annoyance measure. In the conventional audible frequency range it is well known that neither the loudness level contours nor the A-weighting can predict the loudness of complex sounds. Loudness can only be predicted by the loudness level contours when a pure tone (no harmonics) is heard in isolation. The A-weighting resemble the hearing sensitivity at low levels but should not be used for loudness ratings. It is believed that loudness is a major component of annoyance. Loudness is related to the level and the spectrum of the noise. Annoyance is therefore also dependent on level and spectrum but annoyance is also influenced by (or dependent on) many other factors and these factors cannot be described by physical measurements of the noise.
9. ConclusionA laboratory investigation of the annoyance of low frequency noises has been performed. Eighteen normal hearing test subjects listened to eight different noises and evaluated the loudness, the annoyance at day/evening and the annoyance at night. All noises had a considerable low frequency content. The evaluations were compared to the noise limits and criteria curves for low frequency noise used in the European countries. As a pilot project, a group of four persons who were known to experience problems with low frequency noises participated in the same listening test. The results show that the Danish measuring method describes the subjectively experienced annoyance better than the measuring methods used in other countries. This result relies on the 5 dB impulse noise penalty included in the Danish method. The decision about whether or not a 5 dB penalty shall be applied to a specific noise is based on a purely subjective judgment and therefore the Danish method could be improved at this point. The Swedish method is almost as good as the Danish method if the (impulsive) discotheque sound is omitted from the analysis. The Swedish method is based on a specified criterion curve (in contrast to the Danish noise level calculation) and as such more sensitive to random measurement uncertainties. Because the traffic noise was outdoor-to-indoor filtered, the traffic noise did not serve as a reference noise as originally intended. This also prevented a comparison of the present findings to ordinary traffic noise annoyance investigations from the literature. An almost perfect correlation was found between the annoyance at day/evening and the annoyance at night. The annoyance at night is slightly lager than the annoyance at day/evening. The difference in the annoyance ratings between day and night corresponds to a level change of about 5 dB. The low frequency hearing threshold of the four special test subjects was not found to be better than the hearing threshold of the ordinary test subjects. The annoyance evaluations made by the four special test subjects were clearly different from the evaluations made by the ordinary test subjects. The ratings were systematically higher. Especially at night the annoyance was rated as close to maximum and thus not dependent on the level of the noise. The four special test subjects were not annoyed by the impulsive noises to the same degree as the ordinary test subjects were. The Danish evaluation method seems to be suitable for predicting the annoyance experienced by the general population. It seems to be is less suitable for predicting the annoyance experienced by the special subject group where the influence from the noise level is less pronounced and where individual reactions to the quality of the sounds becomes more important.
10. References
Annex A
|
Loudness |
Presentation level, dB(A) |
||
Ref. Group |
20 |
27,5 |
35 |
Traffic noise |
1,0 |
2,5 |
4,4 |
Drop forge |
1,9 |
3,2 |
4,8 |
Gas turbine |
0,5 |
1,4 |
3,5 |
Fast ferry |
0,7 |
1,8 |
3,7 |
Steel factory |
0,5 |
1,7 |
3,4 |
Generator |
1,0 |
1,9 |
3,3 |
Cooling compr. |
1,4 |
2,5 |
4,5 |
Discotheque |
1,1 |
2,6 |
4,1 |
Avg. loudness |
1,0 |
2,2 |
4,0 |
Average rating of annoyance day/evening
Annoyance |
Presentation level, dB(A) |
||
Ref. Group |
20 |
27,5 |
35 |
Traffic noise |
1,0 |
2,8 |
4,2 |
Drop forge |
3,0 |
4,5 |
5,9 |
Gas turbine |
1,0 |
2,2 |
4,4 |
Fast ferry |
0,8 |
2,5 |
4,6 |
Steel factory |
0,9 |
2,2 |
4,3 |
Generator |
1,7 |
2,7 |
4,1 |
Cooling compr. |
2,3 |
3,8 |
5,4 |
Discotheque |
2,2 |
4,4 |
5,9 |
Avg. day |
1,6 |
3,1 |
4,9 |
Average rating of annoyance at night
Annoyance |
Presentation level, dB(A) |
||
Ref. Group |
20 |
27,5 |
35 |
Traffic noise |
1,6 |
3,4 |
5,2 |
Drop forge |
4,3 |
5,9 |
6,9 |
Gas Turbine |
0,9 |
2,5 |
5,2 |
Fast Ferry |
0,9 |
3,2 |
5,4 |
Steel factory |
1,0 |
2,7 |
4,9 |
Generator |
1,7 |
3,2 |
5,0 |
Cooling compr. |
2,7 |
4,4 |
6,0 |
Discotheque |
3,0 |
5,4 |
6,7 |
Avg. night |
2,0 |
3,8 |
5,7 |
Average of percentage of yes responses
Percentage |
Presentation level, dB(A) |
||
Ref. Group |
20 |
27,5 |
35 |
Traffic noise |
5,6 |
41,7 |
72,2 |
Drop forge |
63,9 |
88,9 |
94,4 |
Gas turbine |
11,1 |
36,1 |
77,8 |
Fast ferry |
11,1 |
44,4 |
80,6 |
Steel factory |
11,1 |
41,7 |
69,4 |
Generator |
19,4 |
41,7 |
80,6 |
Cooling compr. |
30,6 |
69,4 |
83,3 |
Discotheque |
47,2 |
80,6 |
94,4 |
Avg. |
25,0 |
55,6 |
81,6 |
All data (i.e. subjective evaluation, sound example number, nominal presentation level, measured A-weighted level (dB(A)), A-weighted level in the frequency range 10 Hz 160 Hz (LpA,LF ), repetition number, gender and age) were used as input to a statistical analysis program (Statgraphics 4.0). The analysis was made for each parameter separately.
The table below shows the significance levels for the different factors. If the number is less than 0.05, this factor has a significant effect on the evaluation on a 95% level or above (this means less than 5 % probability for drawing a wrong conclusion). If the number is above 0.05 it cannot be proved that this factor has a significant effect upon the relevant evaluation.
Table A-1.
The significance level of different factors that may influence the evaluation made by
the reference group of listeners.
|
Loudness |
Annoyance day |
Annoyance night |
Annoying ? (Y/N) |
Noise example |
0.0000 |
0.0000 |
0.0000 |
0.0000 |
Nominal level |
0.0000 |
0.0000 |
0.0000 |
0.0000 |
dB (A) |
0.0000 |
0.0000 |
0.0000 |
0.0000 |
LpA,LF |
0.0000 |
0.0000 |
0.0000 |
0.0000 |
Repetition no. |
0.5814 |
0.6123 |
0.6804 |
0.1533 |
Gender |
0.1888 |
0.0001 |
0.0001 |
0.0654 |
Age |
0.0000 |
0.0000 |
0.0000 |
0.0000 |
As expected, the noise example, the nominal level, the dB(A) level and the low-frequency
level (LpA,LF), all have a significant influence upon the evaluations from the
test persons.
The repetition number (round 1 or round 2) is not significant and thus no training effect is seen in the data.
The gender of the test persons has influence on the evaluation of annoyance during the day and during the night but not on the evaluation of loudness and on the yes/no question about whether the noise is annoying or not.
The age of the test subjects is a significant effect. An inspection of the analysis shows that this is caused by one subject, age 21, with responses in the low end of the scale and another subject, age 24, with responses at the high end of the scale. The remaining subjects, age 19 to 25, gave responses in between.
The same 10 cm long scale was used in the pilot experiment with the special group. Each of the four test subject indicated his/her response by setting a mark on a scale consisting of a 10 cm long line. The following tables show the evaluation of the various noises averaged over the four test subjects in the special group.
Average rating of loudness
Loudness |
Presentation level, dB(A) |
||
Special group |
20 |
27,5 |
35 |
Traffic noise |
2,1 |
3,3 |
4,5 |
Drop forge |
2,4 |
4,5 |
6,0 |
Gas turbine |
1,4 |
4,7 |
7,6 |
Fast ferry |
1,5 |
3,7 |
4,9 |
Steel factory |
2,7 |
3,9 |
6,1 |
Generator |
2,6 |
4,2 |
5,7 |
Cooling compr. |
2,6 |
4,1 |
6,6 |
Discotheque |
1,6 |
3,2 |
5,3 |
Avg. |
2,1 |
3,9 |
5,8 |
Average rating of annoyance day/evening
Annoyance |
Presentation level, dB(A) |
||
Special group |
20 |
27,5 |
35 |
Traffic noise |
3,0 |
4,2 |
5,7 |
Drop forge |
4,4 |
6,1 |
7,1 |
Gas turbine |
3,4 |
6,3 |
9,0 |
Fast ferry |
3,5 |
5,7 |
7,5 |
Steel factory |
4,0 |
5,7 |
7,7 |
Generator |
5,1 |
6,0 |
7,2 |
Cooling compr. |
4,3 |
6,3 |
7,6 |
Discotheque |
4,0 |
4,5 |
6,3 |
Avg. day |
4,0 |
5,6 |
7,3 |
Average rating of annoyance at night
Annoyance |
Presentation level, dB(A) |
||
Special group |
20 |
27,5 |
35 |
Traffic noise |
4,7 |
7,2 |
8,5 |
Drop forge |
7,5 |
8,3 |
8,9 |
Gas turbine |
5,0 |
8,1 |
9,8 |
Fast ferry |
6,6 |
8,8 |
9,3 |
Steel factory |
5,9 |
8,2 |
9,3 |
Generator |
8,4 |
8,3 |
9,0 |
Cooling compr. |
7,4 |
8,5 |
9,1 |
Discotheque |
6,0 |
7,9 |
8,6 |
Avg. night |
6,4 |
8,2 |
9,1 |
Average of percentage of yes responses
Percentage |
Presentation level, dB(A) |
||
Special group |
20 |
27,5 |
35 |
Traffic noise |
25,0 |
25,0 |
75,0 |
Drop forge |
75,0 |
75,0 |
100,0 |
Gas turbine |
37,5 |
62,5 |
100,0 |
Fast ferry |
62,5 |
100,0 |
100,0 |
Steel factory |
50,0 |
100,0 |
100,0 |
Generator |
75,0 |
87,5 |
100,0 |
Cooling compr. |
87,5 |
100,0 |
100,0 |
Discotheque |
50,0 |
87,5 |
100,0 |
Avg. |
57,8 |
79,7 |
96,9 |
All data from the special group (i.e. subjective evaluation, sound example number, nominal presentation level, measured A-weighted level (dB(A)), A-weighted level in the frequency range 10 Hz 160 Hz (LpA,LF ), repetition number, gender and age) were used as input to a statistical analysis program (Statgraphics 4.0).
As the special group of listeners consists of only four persons, it is almost irrelevant to test for normal distribution of the data. Despite of this it was decided to perform an analysis of variance on the data from the special group. The analysis was made for each parameter separately.
The table below shows the significance levels for the different factors. If the number is less than 0.05, this factor has a significant effect on the evaluation on a 95% level or above (this means less than 5 % probability for drawing a wrong conclusion). If the number is above 0.05 it cannot be proved that this factor has a significant effect upon the relevant evaluation.
Table A-2.
The significance level of different factors that may influence the evaluation by the
special group.
|
Loudness |
Annoyance day |
Annoyance night |
Annoying? (Y/N) |
Noise example |
0.2118 |
0.0364 |
0.0592 |
0.0005 |
Nominal level |
0.0000 |
0.0000 |
0.0000 |
0.0000 |
dB (A) |
0.0000 |
0.0000 |
0.0000 |
0.0000 |
LpA,LF |
0.0000 |
0.0000 |
0.0000 |
0.0000 |
Repetition no. |
0.7107 |
0.4968 |
0.6794 |
0.9984 |
Gender |
0.1749 |
0.3651 |
0.3651 |
0.2722 |
Age |
0.0018 |
0.0625 |
0.4447 |
0.0003 |
Noise examples: The annoyance evaluations are found to be close to the limit for being
significantly different. The Annoyance day is just below the limit (i.e. the noises are
evaluated as different) whereas Annoyance night is just above the limit. This may mean
that all the noises are evaluated almost equally annoying by the special group. The
evaluation of loudness is not significant. The result of the yes/no question is a
significant effect, i.e. some noises are found to be annoying others are not. This is
somewhat in contradiction to the findings from the annoyance scaling. The limited number
of test subjects in the special group probably causes the contradiction.
The nominal level, the dB(A) level and the low-frequency level (LpA,LF) are significant in all cases.
The repetition number (round 1 or round 2 with the same presentation) has no significant influence, which shows the absence of a training effect.
The gender of the test persons has no influence on the evaluations. The age is significant for loudness evaluation and for the response to the yes/no question but not for the annoyance evaluations.
The 1/3-octabe levels of the noises are shown in the table below. The levels are equivalent levels averaged over the duration of the noise examples, 2 minutes, and are given in dB re 20 mPa. The data are from a nominal presentation level of 35 dB(A).
The A-weighted levels (in dB re 20 mPa) calculated in the frequency range 10 Hz to 160 Hz are given in the following table for the three presentation levels.
Presen- tation level |
Traffic |
Drop |
Gas |
Fast |
Steel |
Gene- rator |
Cooling |
Disco- theque |
35 |
34,5 |
36,5 |
34,8 |
35,0 |
33,1 |
36,1 |
34,0 |
33,1 |
27,5 |
27,0 |
29,2 |
27,3 |
27,5 |
25,6 |
28,6 |
26,5 |
25,7 |
20 |
19,7 |
21,9 |
19,8 |
20,1 |
19,0 |
21,3 |
19,6 |
18,7 |
The C-weighted levels (in dB re 20 mPa ) calculated in the
frequency range 10 Hz to 160 Hz are given in the following table for the three
presentation levels.
Presen- tation level |
Traffic |
Drop |
Gas |
Fast |
Steel |
Gene- rator |
Cooling |
Disco- theque |
35 |
63,0 |
67,4 |
73,7 |
61,8 |
66,0 |
58,7 |
58,8 |
57,8 |
27,5 |
55,4 |
60,0 |
66,1 |
54,3 |
58,4 |
51,2 |
51,4 |
50,5 |
20 |
48,1 |
52,5 |
58,6 |
47,0 |
51,3 |
44,5 |
44,8 |
44,2 |
The 1/3-octave spectra of the noises are shown in the following figures. All measurements were made acoustically at the test subjects position. The spectra are show for the three nominal presentation levels: 35 dB(A) [solid line], 27,5 dB(A) [thick dashed line] and 20 dB(A) [thin dashed line].
Traffic noise:
Drop forge:
Gas turbine:
Fast ferry:
Steel factory:
Generator:
Cooling compressor
Discotheque:
The background noise at the listening position is shown in the figure below.