
Lee, Yun, Hwang, Kim, and Han: The Study on the Identification of Musical Passages for an Emotion Perception Scale for People With Developmental Disabilities

Abstract

Background

Emotion recognition is essential to the social adjustment and social interaction of people with intellectual and developmental disabilities (IDDs). Given that music is a medium for expressing and conveying human emotion, we conducted this preliminary study to identify musical passages representing the basic human emotions of happiness, sadness, and anger, with the goal of developing a music-based emotion perception scale for IDDs.

Methods

To identify musical passages for emotion perception, 20 certified music therapists evaluated 100 selected musical passages and established 60 pieces that yielded the highest agreement for each emotion category. During the second phase of this study, 300 neurotypical participants rated 60 passages in terms of the perceived type and intensity of emotions expressed.

Results

The 60 passages showed high reliability and were statistically classified into three factors: happiness, sadness, and anger. The k-means cluster analysis yielded a cut-off score of 41 for the low emotion perception group (F = 1120.63, P < 0.001). The hierarchical logistic regression analysis revealed that only model 3 (musical passages) was significantly associated with low emotion perception (step χ2 = 227.8, P < 0.001).

Conclusion

The selected musical passages demonstrated high reliability and established three factors for identifying perceptions of happiness, sadness, and anger. Neither psychological status nor individual demographic characteristics affected the emotion perception results.


INTRODUCTION

Emotion recognition is essential to the social adjustment and social interaction of people with intellectual and developmental disabilities (IDDs),1,2 and it is believed to affect many other learning areas. Planalp3 emphasized the role of facial recognition in human interaction, and most studies have focused on visual stimuli like facial expressions when assessing emotion recognition.4 However, the use of visual stimuli alone to assess emotion recognition has several limitations. Specifically, people with intellectual disabilities or autism spectrum disorders often face challenges from their limited ability to detect and process emotions in social situations.5,6 Those with IDDs show diminished capacity in emotion recognition because they have slower processing skills for social cues.7 Additionally, they experience difficulties integrating multilayered visual and auditory information related to emotion,8 and they have structural abnormalities or dysfunction in brain regions associated with emotion processing, particularly in the temporal lobes.9,10
Beyond visual stimuli, auditory information also plays a substantial role in emotion detection and processing. Considering that there are challenges to relying solely on visual stimuli for emotion recognition and that most studies have focused on facial emotion recognition, we concentrated on developing an alternative approach to assessing auditory emotion recognition through music.
Major functions of music include the “enhancement, communication, and regulation of emotion.”11 In other words, people use music to experience and express emotions fully and to regulate their affective states. Given the direct relationship between music and emotion, over 300 studies have investigated the subjective perception of emotions expressed in music.12 This extensive literature supports the use of music to assess and promote emotion recognition. Experts have argued that music can stimulate emotion recognition as accurately as facial or verbal expressions can.13,14,15 Consequently, music can be effective in assessing and promoting emotion recognition abilities.16,17,18
Most studies of music emotion recognition (MER) have evaluated the emotions either perceived or felt from music.19 In the former case, the listener identifies the type of emotion that music expresses or conveys. In the latter case, the listener describes personally experienced, induced, or evoked emotions. The consensus is that the listener perceives emotions in music because of music’s structural components, such as tone quality and other musical elements.20,21 In contrast, an individual’s physiological, personal, cultural, and psychological characteristics significantly influence the music-induced emotions felt.22,23,24,25 Therefore, music for an emotion recognition test needs to be selected and evaluated according to the “perceived” emotion type.
In an extensive review of musical stimuli used for emotion recognition, 74% of studies investigated perceived emotion in music.12 In addition, 55% of studies relied on passages instead of the entire musical work, and 78% of those studies used selections that were less than a minute long. The most frequently assessed emotions were sadness, happiness, and anger, as observed in more than half of the reviewed studies. The researcher or an expert usually selected the music, although 37% of studies employed previously used stimuli for MER. The most frequent musical genres were popular, Western classical, and film music.12 Although these results provide a helpful guide for musical passage selection in future MER studies, they emphasize the need for a more precise rationale and selection criteria, and for the implementation of measures to reach a consensus on choosing music for each emotion category.
MER involves multiple factors associated with each type of music.26 Scherer27 outlined variables that can influence the affective appraisal of music, including physiological arousal, motor expression, subjective feeling, behavioral preparation, and cognitive processes, and explained why variance occurs when evaluating emotion perception in music. Yang and Chen28 noted that it is difficult to reach a consensus on the emotional content of music. One way to overcome this problem is to involve multiple perspectives in selecting stimuli for emotion recognition. Similarly, in our previous project, chosen images were rated by a group of healthy participants. Then, we used the pictures with the highest agreement rates to develop the facial emotion perception test.29
Another solution is to measure the intensity of expressed emotion, which allows for differences in the degree of expressiveness and makes the question less deterministic.30 When presented with scales, listeners feel less pressure to make difficult dichotomous decisions about whether a musical stimulus falls under one predetermined emotion.
Familiarity is a major topic of discussion in MER studies. It is often considered a confounding variable in the emotional appraisal of music.31 For instance, people have shown stronger emotional reactions to familiar music than to unfamiliar music. Consequently, studies investigating induced emotion tend to use familiar music more frequently, whereas studies of perceived emotion rely on unfamiliar music.12
Familiarity has been observed to play a different role in IDDs, however. A recent neuroimaging study reported that familiar music did not show the same level of impact on people with autism as on neurotypical individuals.32 The authors postulated that the brain in autism processes information as new each time it is encountered. Moreover, familiar stimuli may be preferable in psychometric tools for people with intellectual disabilities because unfamiliar stimuli often cause unnecessary anxiety in this population.33 Since listening to music can induce emotional wellbeing,34 the use of a familiar musical genre may be appropriate in assessing emotion recognition skills among people with IDDs. Similarly, Pochon and Declercq emphasized the use of familiar cues for emotion recognition when working with children with Down syndrome, as familiarity improves the children’s confidence and task understanding.
Our study aimed to evaluate participants’ subjective perception of emotions expressed in music by asking them to classify each music type as happy, sad, or angry and to use a 7-point Likert-type scale to rate the expressive intensity of each musical passage. The purpose of this preliminary study was to compile a pool of musical passages needed to develop a music-based emotion perception test through evaluation by experts and neurotypical adults. In accordance with the predetermined methods of music selection for MER studies,12 we utilized researcher-chosen music verified by expert and non-expert groups through pilot testing.
We postulated that researchers, experts, and non-experts would reach a notable agreement in identifying musical passages for each emotion classification. In addition, we hypothesized that the music perception test using musical passages could effectively assess healthy participants’ ability to detect emotions.

METHODS

Phase 1

Participants

We conducted the study in two phases: composition of the musical passages for emotional perception test (MPEP; phase 1) and MPEP validation (phase 2) (Fig. 1). The participants in the first phase comprised 20 certified music therapists (19 women). They each had a bachelor’s degree in classical or popular music and a master’s degree or higher in music therapy. The participants were recruited using the snowball sampling method; we asked the first group to invite other participants who had also majored in both music and music therapy (Table 1).
Fig. 1

The process of MPEP composition and validation

MPEP = musical passages for emotional perception test.
Table 1

Participants’ demographic characteristics: Phase 1

Item Statistical status
Age, yr 32.65 ± 5.69 (26–44)
Sex
Male 1 (5.0)
Female 19 (95.0)
Years of musical instruction, yr 11.3 ± 4.75 (3–20)
Major of undergraduate study
Composition 8 (40.0)
String 4 (20.0)
Piano 4 (20.0)
Vocal 3 (15.0)
Woodwind 1 (5.0)
Values are presented as mean ± standard deviation (range) or number (%).

Procedure

After obtaining their consent, we sent the expert group a Google survey link with a detailed description of the study. Music therapists rated the appropriateness of emotion classification and the level of expressed intensity for 100 musical passages initially identified and classified by the researchers.

Process of music selection

In accordance with a comprehensive review of MER studies,12 we implemented the most frequently practiced method of selecting musical passages; the experimenter chose music based on expert opinion and within the top three most frequently chosen musical genres—popular, Western classical music, and film music. However, we designed the study to follow a two-step verification process, with a group of experts participating in the first phase, and typically developing participants without musical expertise included in the second phase.
We selected passages representing happiness, sadness, and anger because these are the emotions most often assessed through music.12 Three researchers with a background in music therapy reviewed the relevant literature and determined the duration, type, genre, and texture of the musical passages. Based on multiple discussions about previous MER studies, we limited the selections to instrumental music. Music that included percussion instruments was not considered because rhythmic and percussive sounds are strongly stimulating. Each piece used no more than two instruments to minimize the effect of textural complexity. The performance format was limited to a keyboard or orchestral instrument solo, or an orchestral instrument accompanied by a keyboard.
Initially, two music therapists identified 120 musical pieces representing the three emotions (40 pieces per emotion). Three music therapists from the research team met online three times to examine each piece and reach agreement on its classification. These music therapists explored 30 additional pieces as replacements for some of the initial selections. Finally, they compiled three playlists containing 34 pieces for happiness and 33 each for sadness and anger. These playlists were used in the first phase of the study (Supplementary Tables 1, 2, 3). Among the selected passages, the types of instrumentation were piano solos, string instruments accompanied by piano, and guitar solos. In terms of musical genre, classical music was the most frequent, followed by new age and jazz.

Survey construction

We constructed a survey using Google Forms. Participants were provided with a complete study description and an informed consent form, which required them to insert a checkmark after each itemized statement regarding the availability of information, the extent of the research, and their rights. We obtained demographic characteristics, including age, sex, educational background, and musical background (Table 1).
The participants received three playlists from an online music streaming service in South Korea. Each playlist was consecutively titled happy (1 to 34), sad (1 to 33), and angry (1 to 33), with selected musical tracks numbered. The Google survey had a table with corresponding numbered items listing each song title and a Likert-type scale. Participants listened to each track from the playlist and then marked their answers on the corresponding numbered item on the Google survey. The presented musical passages are listed in Supplementary Tables 1, 2, 3.
The participants were instructed to listen to the first 15 seconds of each piece in a quiet space. We decided to use only the first 15 seconds of music because musical compositions often evolve and express different or multiple emotions as they unfold.28 Moreover, previous studies showed that 15-second musical stimuli are ideal for evaluating perceived emotions.16 A different starting point was specified for five pieces because they either had a dramatic change in expression or no notable expression within the first 15 seconds. After listening to each piece, participants marked 0 on the Likert-type scale if they disagreed with the classified emotion. If they agreed with the classification, they rated the intensity of expressiveness of the classified emotion from 1 (extremely weak) to 7 (extremely strong). The same steps were repeated for all 100 pieces of music, with the entire process completed online. Because of the lengthy process, we suggested that participants take a 10-minute break after filling out the demographic information and rating the first batch of music selections.

Analysis of the survey results

Descriptive statistics were used to derive the frequency of disagreements and the mean intensity scores. For each category, we selected the 20 musical pieces with the highest intensity scores and the fewest occurrences of disagreement. We chose a total of 60 passages for use in the second phase of the study.
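For illustration, this selection rule can be expressed as a short script. The sketch below is not the procedure the authors ran (only descriptive statistics are reported); it assumes a hypothetical long-format file of expert ratings with columns piece_id, emotion, therapist_id, and rating, where 0 denotes disagreement with the assigned emotion and 1 to 7 denotes intensity.

import pandas as pd

# Hypothetical file of phase 1 expert ratings: one row per (therapist, piece).
# A rating of 0 means the therapist disagreed with the assigned emotion;
# ratings of 1-7 give the intensity of the agreed emotion.
ratings = pd.read_csv("expert_ratings.csv")  # columns: piece_id, emotion, therapist_id, rating

summary = (
    ratings
    .assign(disagree=lambda d: d["rating"].eq(0),
            intensity=lambda d: d["rating"].where(d["rating"] > 0))
    .groupby(["emotion", "piece_id"])
    .agg(n_disagree=("disagree", "sum"),
         mean_intensity=("intensity", "mean"))
    .reset_index()
)

# For each emotion, keep the 20 pieces with the fewest disagreements,
# breaking ties by the highest mean intensity, for 60 passages in total.
top20 = (
    summary
    .sort_values(["emotion", "n_disagree", "mean_intensity"],
                 ascending=[True, True, False])
    .groupby("emotion")
    .head(20)
)
print(top20)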

Phase 2

Participants

Participants in the second phase were 300 adults aged 18 to 40 years. The inclusion criteria were voluntary participation in the online survey and the ability to read and understand Korean. Professionally trained musicians, music majors, and people with auditory disorders, developmental disabilities, or a history of mental illness were excluded from the study. Initially, the participants were divided into two age groups (18- to 29-year-olds and 30- to 40-year-olds) so that an equal number of participants would be recruited for each age group. There were 170 and 130 participants in the first and second age groups, respectively. Their demographic information is presented in Table 2.
Table 2

Demographic characteristics: Phase 2

Item Statistical status
Age, yr 29.2 ± 5.95 (18–48)
Sex
Male 124 (41.3)
Female 176 (58.7)
Education
< 12 years (below university) 75 (25.0)
12–16 years (university) 186 (62.0)
> 16 years (graduate school) 39 (13.0)
Occupation
Student 74 (24.7)
Officer 155 (51.7)
Service/freelancer 15 (5.0)
No job 56 (18.6)
Music listening pattern
Mean listening time, hr/day 2.3 ± 1.3 (0.0–5.0)
Genre of music
K-pop 290 (96.7)
Foreign pop 288 (96.0)
New age 234 (78.0)
Classical 179 (59.7)
Psychological scale
CES-D 15.1 ± 11.8 (0–47)
BAI 12.9 ± 11.2 (0–53)
MDQ 4.9 ± 3.3 (0–12)
Values are presented as mean ± standard deviation (range) or number (%).
CES-D = Center for Epidemiologic Studies Depression Scale-Revised, BAI = Beck Anxiety Inventory, MDQ = Mood Disorder Questionnaire.

Procedure

We completed the entire process online, using SurveyMonkey (Momentive, San Mateo, CA, USA) to create the survey and sending invitations via university online bulletin boards, internet cafés, and social media sites such as Facebook. The invitation included a detailed study description, information about participants’ rights, and the limitations of an online survey. Before answering the survey items, all participants provided their consent online to participate in the study.

Survey construction

The survey included questions across four parts: demographic information, musical background and listening habits, current mood, and perception of expressed emotions and their intensity in the selected musical passages. We added a section on current mood given the inseparable relationship between mood and emotion perception.35
Demographic information included age, sex, and education. Musical background and listening habits comprised the type and duration of musical training, time spent listening to music per day, and listening frequency by genre. The section on mood used validated instruments for assessing anxiety, depression, and mood disorder symptoms.
Anxiety levels were measured using the Korean version of the Beck Anxiety Inventory (BAI).36 For this 21-item questionnaire, respondents rate their perceived anxiety level over the past week using a 4-point Likert-type scale. In this study, Cronbach’s α was 0.92.
Depression was measured with the Center for Epidemiologic Studies Depression Scale (CES-D), which comprises twenty 4-point Likert-type questions assessing the perceived frequency of depressive symptoms. We used the Korean version devised by Cho and Kim,37 and Cronbach’s α was 0.88.
We also employed the Korean version of the Mood Disorder Questionnaire (MDQ), which was initially devised by Hirschfeld et al.38 and then translated and validated by Jon et al.39 The Korean version of the MDQ consists of 13 items assessing the presence and severity of mood disorder symptoms. Cronbach’s α was 0.83.
The section asking participants for their perception of the type and intensity of emotion expressed in music was prepared by embedding 60 randomly presented musical passages, one under each question. We purchased MP3 files of the musical pieces selected during the first study phase and used an audio editing tool to trim each file into a 15-second passage. We uploaded all passages to SoundCloud (Berlin, Germany) and linked each segment using the embed function in SurveyMonkey. Participants listened to a passage by clicking the play button next to each item number, selected one of three emotion labels (happy, sad, or angry), and then rated the intensity of the expressed emotion on a Likert-type scale from 1 (extremely weak) to 7 (extremely strong). Participants were not allowed to choose more than one emotion type.
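The trimming step can be batch-scripted. The article does not name the audio editing tool, so the sketch below uses the pydub library purely as an example, with hypothetical file names and start times; the non-zero starting point mirrors the handful of pieces excerpted from a later point, as described in phase 1.

from pydub import AudioSegment  # third-party library; requires ffmpeg for MP3 decoding

# Hypothetical mapping of purchased MP3 files to excerpt start times in seconds.
excerpts = {
    "happy_01.mp3": 0,
    "sad_07.mp3": 0,
    "anger_15.mp3": 32,  # example of a non-zero starting point
}

for filename, start_s in excerpts.items():
    track = AudioSegment.from_mp3(filename)
    clip = track[start_s * 1000:(start_s + 15) * 1000]  # pydub slices in milliseconds
    clip.export("clip_" + filename, format="mp3")       # write the 15-second passage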
In summary, we used the emotion classification verified by 20 music therapy specialists in the first phase as the answer key for the second phase. When a phase 2 response matched the verified answer from phase 1, the item was coded as 1; otherwise, it was coded as 0. Using the factor loadings from an exploratory factor analysis (EFA), we aimed to confirm the similarities among the happy, sad, and angry musical passages.
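A minimal sketch of this scoring rule, assuming hypothetical files holding the phase 2 emotion choices and the phase 1 answer key:

import pandas as pd

# Hypothetical inputs: each phase 2 row holds one participant's emotion choice
# ("happy", "sad", or "angry") for every passage; the key holds the phase 1 label.
responses = pd.read_csv("phase2_choices.csv", index_col="participant_id")
key = pd.read_csv("phase1_key.csv", index_col="passage_id")["emotion"]

# Code 1 when the chosen emotion matches the expert-verified label, otherwise 0.
scored = responses.eq(key, axis="columns").astype(int)

total_scores = scored.sum(axis=1)             # per-participant totals (0-60)
correction_rates = scored.mean(axis=0) * 100  # per-passage correction rates, %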

Statistical analysis of the music emotion perception test

The data were visually inspected to screen for careless response patterns before the analysis. Demographic characteristics, including age, sex, education, music-listening habits, and psychological status, were summarized with means, standard deviations, and minimum and maximum values.
Reliability of the MPEP items was assessed with Cronbach’s α as the internal consistency coefficient. We used EFA to determine whether the MPEP items accurately indicated music emotion perception among the participants. Using participants’ total item scores, a k-means cluster analysis was performed to segregate participants into either a low music emotion perception group or a general music emotion perception group. All analyses were conducted using SPSS (version 24.0; IBM Corp., Armonk, NY, USA).
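For reference, the internal consistency step can be written directly from its definition. The function below is a generic sketch rather than the SPSS procedure used in the study; it operates on an n-participants-by-n-items score matrix such as the hypothetical scored matrix from the scoring sketch above.

import numpy as np

def cronbach_alpha(items):
    """Cronbach's alpha for an (n_respondents x n_items) score matrix."""
    items = np.asarray(items, dtype=float)
    k = items.shape[1]
    sum_item_var = items.var(axis=0, ddof=1).sum()  # sum of per-item variances
    total_var = items.sum(axis=1).var(ddof=1)       # variance of total scores
    return (k / (k - 1)) * (1 - sum_item_var / total_var)

# Example: alpha = cronbach_alpha(scored.to_numpy())  # 'scored' from the scoring sketch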
We controlled for covariates, including participants’ demographic characteristics and psychological status, and adopted a hierarchical logistic regression analysis. Discrete sets of variables were added hierarchically in the logistic regression analysis of emotional perception: model 1, demographic characteristics; model 2, demographic characteristics plus psychological status; and model 3, demographic characteristics plus psychological status plus music perception. The dependent variable was coded as 1 for the low emotion perception group and 0 for the general emotion perception group.
Covariates were selected according to several factors that could influence music perception.27,34 Personal variables, including age, sex, and years of musical training, could be regarded as important factors affecting emotional responses to music. Because music has been reported to influence human emotion, an individual’s emotional state can, in turn, affect emotional reactions to music.40 In addition, because more than 90% of the participants listened to K-pop and Western pop music, we omitted those genres from the regression analysis.
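The hierarchical structure can be illustrated with nested logistic models compared by a likelihood-ratio (step) χ2, as sketched below with statsmodels. The column names are hypothetical, categorical covariates are assumed to be numerically coded, the total item score is omitted because it is the sum of the three subscales, and Nagelkerke’s R2 is computed from the log-likelihoods; this is an outline of the analysis type, not the authors’ SPSS syntax.

import numpy as np
import pandas as pd
import statsmodels.api as sm
from scipy import stats

df = pd.read_csv("phase2_analysis.csv")  # hypothetical analysis file
y = df["low_perception"]                 # 1 = low emotion perception group, 0 = general group

blocks = {
    "model1": ["age", "sex", "education", "job", "listening_time"],
    "model2": ["age", "sex", "education", "job", "listening_time",
               "bai", "cesd", "mdq"],
    "model3": ["age", "sex", "education", "job", "listening_time",
               "bai", "cesd", "mdq", "happy", "sad", "angry"],
}

n = len(y)
null_fit = sm.Logit(y, np.ones((n, 1))).fit(disp=0)  # intercept-only model (model 0)
prev_llf, prev_df = null_fit.llf, 0

for name, cols in blocks.items():
    fit = sm.Logit(y, sm.add_constant(df[cols])).fit(disp=0)
    step_chi2 = 2 * (fit.llf - prev_llf)                          # step chi-square vs. previous model
    step_p = stats.chi2.sf(step_chi2, df=fit.df_model - prev_df)
    r2_cs = 1 - np.exp(2 * (null_fit.llf - fit.llf) / n)          # Cox-Snell R^2
    r2_nagelkerke = r2_cs / (1 - np.exp(2 * null_fit.llf / n))    # rescaled to a 0-1 maximum
    print(name, round(step_chi2, 1), round(step_p, 3), round(r2_nagelkerke, 3))
    prev_llf, prev_df = fit.llf, fit.df_model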

Ethics statements

The study protocol was reviewed and approved by the Institutional Review Board (IRB) of Chung-Ang University (IRB #1041078-202009-HRBM-286-01). Participants from both phases were fully informed about the purpose of the study and the voluntary nature of participation, including their right to withdraw at any time without reprisal. Given that all data were submitted online without any identifiers for the respondents, participants were informed that data could not be deleted once submitted.

RESULTS

Participants

Among the 300 participants, the mean age was 29.2 ± 5.95 years. In addition, 41.3% were male, 51.7% had office jobs, and 62.0% had completed college. The mean duration of music listening was 2.3 ± 1.3 hr/day. The mean scores of the CES-D, BAI, and MDQ were 15.1 ± 11.8, 12.9 ± 11.2, and 4.9 ± 3.3, respectively (Table 2).

Analysis of musical passages for correction rate

The mean score on the happiness scales was 4.25 ± 1.59 (0.40–0.85, skewness = −0.935, kurtosis = −0.239), the mean score on the sadness scales was 4.49 ± 1.58 (0.40–6.95, skewness = −0.998, kurtosis = −0.008), and the mean score on the anger scales was 4.00 ± 1.63 (0.30–7.00, skewness = −0.645, kurtosis = −0.435). Given these skewness and kurtosis values, the data approximated a normal distribution. The mean correction rates for the happy, sad, and angry passages were 83.8%, 85.9%, and 81.6%, respectively. The happy passage with the highest correction rate was “Happy1,” and the passage with the lowest correction rate was “Happy13.” The sad passage with the highest correction rate was “Sad7,” and “Sad1” showed the lowest correction rate. Finally, the anger passage with the highest correction rate was “Anger15,” and the passage with the lowest correction rate was “Anger7” (Table 3).
Table 3

Correction rates for the three perceived emotions

Item Perceived emotion, %
Happy Sad Anger
Happy1 89.7 7.3 3.0
Happy2 87.7 6.3 6.0
Happy3 87.3 8.3 4.3
Happy4 86.7 8.7 4.7
Happy5 88.0 7.3 4.7
Happy6 83.0 8.3 8.7
Happy7 86.3 8.0 5.7
Happy8 86.0 8.0 6.0
Happy9 86.0 7.3 6.7
Happy10 89.0 8.3 2.7
Happy11 87.0 7.7 5.3
Happy12 85.7 8.0 6.3
Happy13 78.3 7.0 14.7
Happy14 88.7 8.7 2.7
Happy15 82.0 9.7 8.3
Happy16 84.3 10.7 5.0
Happy17 86.3 8.0 5.7
Happy18 84.3 11.0 4.7
Happy19 85.3 7.3 7.3
Happy20 85.3 7.7 7.0
Sad1 7.0 79.7 13.3
Sad2 5.3 90.0 4.7
Sad3 7.7 87.3 5.0
Sad4 8.3 87.3 4.3
Sad5 4.0 89.7 6.3
Sad6 8.3 87.0 4.7
Sad7 4.7 90.3 5.0
Sad8 7.7 86.7 5.7
Sad9 9.3 84.0 6.7
Sad10 8.7 86.3 5.0
Sad11 5.7 82.0 12.3
Sad12 7.3 85.7 7.0
Sad13 6.3 84.7 9.0
Sad14 6.3 84.7 9.0
Sad15 7.0 87.0 6.0
Sad16 9.0 86.7 4.3
Sad17 6.0 88.3 5.7
Sad18 7.7 84.0 8.3
Sad19 6.7 87.7 5.7
Sad20 7.3 86.0 6.7
Anger1 9.3 12.0 78.7
Anger2 11.0 14.0 75.0
Anger3 4.3 24.0 71.7
Anger4 8.3 17.7 74.0
Anger5 8.3 11.3 80.3
Anger6 6.3 9.0 84.7
Anger7 8.0 22.0 70.0
Anger8 10.7 13.0 76.3
Anger9 7.3 15.3 77.3
Anger10 5.7 12.0 82.3
Anger11 10.0 9.7 80.3
Anger12 10.0 11.0 79.0
Anger13 6.3 12.7 81.0
Anger14 8.3 13.0 78.7
Anger15 5.7 7.0 87.3
Anger16 6.0 11.0 83.0
Anger17 7.7 19.3 73.0
Anger18 7.0 8.0 85.0
Anger19 15.7 10.3 74.0
Anger20 7.3 10.7 82.0
As an exploratory analysis, we evaluated the musical characteristics of the passages that received the highest correction rates in each affective category. The top three happy passages featured relatively fast tempi ranging from 138 to 168 bpm, with repetitive and predictable rhythmic patterns consisting mainly of equally divided eighth or sixteenth notes and multiple staccatos. Happy passages were all in a major mode, with a distinct melody line in a high register and a narrow range, and supported by simple accompaniment parts in a lower range. The melodic line was predictable, with a recurring melodic sequence and frequent stepwise motion. The dynamic levels were medium to loud with no sudden changes, and all pieces were played on a piano. In terms of musical genre, two of the top three happy passages were new age, and one was a classical pianist’s rendition of the opera Don Giovanni.
The top three sadness passages featured extremely slow tempi ranging from 35 to 66 bpm, with frequent use of long dotted note values and reserved rhythmic divisions. All sad passages were in a minor mode with a distinct melodic line appearing in the middle to lower range, and they had a combination of stepwise and intermittent leap patterns. In contrast, the accompaniment employed simple block-chord or arpeggio patterns. In terms of dynamic levels, sad passages were at the mid to quiet level with gradual changes. Two pieces featured a cello solo accompanied by a piano, and one excerpt was a piano solo. In terms of musical genre, one selection was classical and two were new age.
The anger passages showed extremely fast tempi with irregular beat patterns, making it difficult to detect the recurring pulse. Rhythmic patterns were unpredictable, with sudden changes and frequently used smaller rhythmic units. The melody and accompaniment parts were intermingled, making a clear distinction difficult and the melody unpredictable, with a wide range and abrupt changes. All three pieces were performed on the piano and generally were loud with sudden changes. In terms of genre, two of the selections were classical pieces composed by Rachmaninoff and Mussorgsky, and one was a video game soundtrack.

Reliability analysis and exploratory factor analysis

The reliability of the 60 musical passages was high (Cronbach’s α = 0.978). Cronbach’s α did not change meaningfully when individual items were removed (α = 0.979). Hence, all 60 musical passages were retained for the analysis.
According to the EFA, the Kaiser-Meyer-Olkin (KMO) value was 0.969, and Bartlett’s test of sphericity was significant (χ2 = 12,996.759, P < 0.001). All factor loadings were > 0.4, and the three emotional factors were successfully constructed from the 60 passages (Table 4).
Table 4

Factor loading for the MPEP

Factor Pattern matrix
1 2 3
1 0.680 −0.433 0.510
2 0.791 −0.475 0.648
3 0.884 −0.474 0.403
4 0.843 −0.455 0.568
5 0.805 −0.412 0.489
6 0.711 −0.572 0.437
7 0.824 −0.623 0.407
8 0.856 −0.443 0.479
9 0.711 −0.578 0.410
10 0.649 −0.416 0.479
11 0.630 −0.493 0.506
12 0.692 −0.517 0.413
13 0.438 −0.432 0.463
14 0.601 −0.420 0.551
15 0.530 −0.560 0.484
16 0.678 −0.472 0.467
17 0.634 −0.603 0.444
18 0.446 −0.451 0.641
19 0.661 −0.571 0.607
20 0.569 −0.615 0.505
Name of factor Happy Sad Anger
Eigen value total 27.828 4.016 4.008
Percentage of variance 46.380 6.694 6.678
Cumulative percentage 46.380 53.074 59.752
Extraction method: maximum likelihood; rotation method: Oblimin with Kaiser normalization.
MPEP = musical passages for emotional perception test.
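As a rough outline of how KMO, Bartlett, and pattern-matrix values of this kind could be reproduced outside SPSS, the sketch below uses the third-party factor_analyzer package. The input file is hypothetical, and applying maximum-likelihood EFA with oblimin rotation directly to binary item scores is a simplification of the reported analysis.

import pandas as pd
from factor_analyzer import FactorAnalyzer, calculate_bartlett_sphericity, calculate_kmo

# Hypothetical 300 x 60 matrix of 0/1 item scores (see the scoring sketch in Methods).
scored = pd.read_csv("mpep_item_scores.csv", index_col="participant_id")

chi2, p = calculate_bartlett_sphericity(scored)  # Bartlett's test of sphericity
_, kmo_total = calculate_kmo(scored)             # overall Kaiser-Meyer-Olkin value

fa = FactorAnalyzer(n_factors=3, rotation="oblimin", method="ml")
fa.fit(scored)
loadings = pd.DataFrame(fa.loadings_, index=scored.columns,
                        columns=["factor1", "factor2", "factor3"])
variance, proportion, cumulative = fa.get_factor_variance()  # eigenvalues and explained variance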

Cluster analysis of all participants

In the k-means cluster analysis of the 300 participants, the highest total scores among the general music emotion perception group (n = 60) and the low music emotion perception group (n = 11) were selected as initial seeds (centroids) of the respective groups. The final centroid and mean ± standard deviation of the general perception group’s total music emotion perception scores were 55 and 55.21 ± 13.81, respectively. The corresponding scores of the low perception group were 28 and 28.03 ± 13.81, respectively. Following the k-means cluster analysis, 185 participants were classified into the general music emotion perception group and 115 into the low music emotion perception group (F = 1,120.63, P < 0.001). Based on these results, the cut-off score of the low emotion perception group was 41.
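A compact sketch of this two-cluster split and cut-off derivation is shown below using scikit-learn (the study used SPSS). Here total_scores is the hypothetical per-participant total from the scoring sketch, and the default seeding differs from the score-based seeds described above.

import numpy as np
from sklearn.cluster import KMeans

# 'total_scores' holds per-participant MPEP totals (0-60) from the scoring sketch.
X = np.asarray(total_scores, dtype=float).reshape(-1, 1)

km = KMeans(n_clusters=2, n_init=10, random_state=0).fit(X)
centers = km.cluster_centers_.ravel()
low_cluster = int(np.argmin(centers))  # the cluster with the lower mean total score

low_group = X[km.labels_ == low_cluster].ravel()
general_group = X[km.labels_ != low_cluster].ravel()

# One way to read off a screening cut-off is the boundary between the two clusters,
# e.g., the highest total score observed in the low perception cluster.
cutoff = low_group.max()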

Determining the low music emotion perception group

Of the three models suggested in the current study, only model 3 was significantly associated with the low music emotion perception group. In model 3 (model 2 + music perception), the model χ2 (240.3, P < 0.001) and Nagelkerke’s R2 (0.735; 73.5% of the variance in group membership explained) indicated that the model was adequate for predicting the low music emotion perception group. As mentioned above, the low emotion perception group was defined by an MPEP score below 41. The step χ2 (227.8, P < 0.001) indicated that the music perception items predicted membership in the low music emotion perception group. According to the Wald statistics, the happiness, sadness, anger, and total item scores were significant predictors of low music emotion perception (Table 5).
Table 5

Hierarchical logistic regression analysis

Variables Model 1 Model 2 Model 3
B Wald OR B Wald OR B Wald OR
Demographic characteristics
Age 0.020 0.526 1.020 0.013 0.199 1.013 0.063 1.823 0.177
Sex −0.016 0.003 0.984 0.008 0.001 1.008 −0.173 0.125 0.724
Educational year −0.040 0.098 0.961 0.002 0.000 1.002 0.182 0.896 0.344
Job 0.105 0.689 1.111 0.124 0.926 1.132 0.097 0.213 0.644
Time spent listening to music 0.195 4.699 1.216* 0.209 5.207 1.232* 0.045 0.086 0.770
Psychological status
BAI −0.004 0.063 0.996 −0.023 0.703 0.402
CES-D −0.021 1.769 0.979 −0.004 0.020 0.888
MDQ 0.010 0.072 1.010 0.026 0.186 0.666
Music perception
Happy −4.179 0.653 0.419*
Sad −4.074 0.623 0.430*
Angry −3.868 0.559 0.455*
Total −5.082 0.976 0.323**
Indices Model 0 Model 1 Model 2 Model 3
−2 log likelihood 415.891 408.704 403.379 175.553
Step χ2/P N/A 7.1/0.21 5.34/0.15 227.8/< 0.001
Model χ2/P N/A 7.1/0.21 12.5/0.13 240.3/< 0.001
Nagelkerke’s R2 N/A 0.031 0.054 0.735
Classification accuracya, % 50.7 54.0 61.0 90.3
Model 1: demographic characteristics; Model 2: demographic characteristics + psychological status; Model 3: demographic characteristics + psychological status + music perception.
OR = odds ratio, BAI = Beck Anxiety Inventory, CES-D = Center for Epidemiologic Studies Depression Scale-Revised, MDQ = Mood Disorder Questionnaire, N/A = not applicable.
aClassification accuracy: dependent variable, low music emotion perception scale group.
*P < 0.05, **P < 0.01.

DISCUSSION

The MPEP has high reliability and supports the construction of three factors for identifying emotional perceptions of happiness, sadness, and anger. The cut-off score for low emotion perception was 41. Neither psychological status nor individual demographic characteristics affected emotional perception as assessed by the MPEP.
According to the reliability test and factor analysis, the current results showed that musical passages could assess emotional perception abilities in the general population. Perceived emotions were classified into three categories: happy, sad, and angry. While several trials have used music to investigate perceived emotions,20,21 this is, to the best of our knowledge, the first study to systematically identify musical passages for the development of a music-based emotion perception test. Using musical passages for the three emotions, the MPEP could distinguish between individuals with and without an adequate ability to recognize emotions.
The musical passages with the highest ratings under each affective category shared musical traits that resembled the typical characteristics of musical pieces used in MER studies. For example, the sadness passages in the MPEP also demonstrated the traits of melancholic music specified by Huron41; they were quiet, slow, low in pitch, minor in mode, and legato. Juslin et al.42 utilized cello music to induce sadness in listeners, with the rationale that the cello has the “sound closest to the human voice.” Interestingly, two of our sad passages involved cello solos. The music for anger showed traits reported by Mohn et al.,43 which included loud volumes, rapid tempos, and tempo changes. Likewise, the happy passages shared characteristics like major mode, relatively fast tempi, and use of staccatos.25,43 These resemblances are meaningful as they provide support for the identified musical passages. We plan to conduct an in-depth analysis of all the MPEP passages to investigate whether the degree of such musical traits is associated with the intensity of emotions perceived by the participants.
In a comparison of correction rates for the three types of passages, the mean score for anger was lowest with the greatest variance (78.68 ± 4.73) compared with happiness (85.85 ± 2.62) and sadness (86.26 ± 2.6). This variance can be explained in two ways. Huron attributes such variance to people’s conceptual differences in perceiving anger as either cold or hot. Cold anger refers to an unexpressed furious feeling that is not easily noticeable, whereas hot anger is distinctive and accompanied by outbursts.44 Another explanation for this variance is similarities in valence and arousal levels. According to the arousal and valence dimensions often used to classify emotional stimuli, happiness is located in the top-right corner, indicating a pleasant valence and high arousal,45 whereas sadness is located in the bottom-left corner, indicating an unpleasant valence with low arousal.46 Anger is placed at the top-left corner, with high arousal and an unpleasant valence, meaning that anger shares certain characteristics with happiness and sadness.47 Thus, a listener may only focus on one aspect of emotion appraisal, leading to a higher disagreement rate for anger music.
Moreover, we need to consider alexithymia when interpreting these outcomes. Alexithymia refers to difficulties in acknowledging, explaining, and differentiating emotions, and it affects 5% to 19% of the general population.48,49
Interestingly, the prevalence of alexithymia may partly account for the disagreement rates observed in this study. Given that the MPEP was designed to screen the ability to perceive emotion, members of the low music emotion perception group, with scores below 41, should be regarded as having low sensitivity for emotional perception through music. From this perspective, further studies are needed to evaluate the possibility of implementing the MPEP for screening people with alexithymia and other psychological conditions that interfere with emotion recognition. Furthermore, future research should incorporate both music- and non-music-based emotion recognition tests, comparing neurotypical groups with experimental groups of people who have psychological conditions.
One controversy surrounding MER studies is that some variance exists in the degree or type of emotions perceived through music, which can be explained in different ways. Levinson argued that people often experience more than one emotion at a time.50 As music conveys complex emotions, it is difficult to analyze dichotomously. One’s mental health status and personality traits often affect the level of response to music.12 Emotional responses to music also are related to cognitive status, physiological arousal, motor behavior, and subjective feelings.
However, in this study, the hierarchical logistic regression analysis showed that emotion perception of the musical passages was unaffected by participants’ demographic characteristics and psychological status, suggesting that such factors did not yield significant differences among the analyzed subgroups. In other words, the MPEP could assess emotional perception unaffected by individual emotional conditions or demographic factors. Further studies should be conducted to test the reliability of the musical passages across different age groups and personality traits that were not investigated in this study.
This study had some limitations. First, the possibility of reporting bias should be considered, as the collected data were based on online self-report questionnaires. Based on previous studies, we set the duration of the musical passages to 15 seconds. While the study outcome may support the use of a 15-second passage to elicit emotions in listeners, some could argue that 15 seconds may not be long enough. Thus, further investigation might be helpful in comparing reactions among listeners for different durations.
Second, the phase 1 study population, which consisted almost entirely of women, could have affected the verification of the musical passages. Several studies have suggested that sensitivity in musical emotion recognition could be influenced by age and gender.51 The gender sensitivity of music emotion recognition should be considered in future studies. Third, we did not control for the extent of the participants’ musical training or background. Although we excluded those with music-related degrees or careers, some participants may have had extensive musical training, possibly affecting the study outcomes. Finally, one should be mindful that this was a preliminary study aimed at identifying musical passages that could be used to develop a music-based emotion perception test in the near future. Thus, our study outcomes should not be generalized to musical passages other than those assessed.
In conclusion, the musical passages assessed in this study for emotion perception presented highly viable options for evaluating an individual’s emotional perception without the interference of demographic or psychological status. Carefully selected musical passages verified by a group of experts and healthy adults showed more than sufficient inter-rater agreement. Consequently, we conclude that musical passages can serve as a tool for assessing the ability to recognize happiness, sadness, and anger.
Further studies are needed to test the reliability and validity of the MPEP. First, we developed a Unity-based application to assess the level of emotional perception using the MPEP. Initially, we plan to implement the application for people with IDDs and then expand it to people with other health needs to field-test the MPEP in conjunction with a preexisting facial emotion assessment tool. Second, we propose a thorough analysis of the musical characteristics of the passages included in the MPEP and the identification of musical elements corresponding to each emotion category. Such an evaluation could improve the validity of the MPEP and lead to a better understanding of reliable musical passages for MER studies.

Notes

Funding: This research was funded by the Ministry of Culture, Sports and Tourism and the Korea Creative Content Agency (No. R2020040186).

Disclosure: The authors have no potential conflicts of interest to disclose.

Author Contributions:

  • Conceptualization: Lee JH, Yun J, Han DH.

  • Data curation: Lee JH, Yun J, Hwang H.

  • Funding acquisition: Han DH.

  • Investigation: Hwang H.

  • Project administration: Kim SM.

  • Software: Hwang H.

  • Supervision: Kim SM, Han DH.

  • Writing - original draft: Lee JH, Han DH.

  • Writing - review & editing: Kim SM.

References

1. Baker KF, Montgomery AA, Abramson R. Brief report: perception and lateralization of spoken emotion by youths with high-functioning forms of autism. J Autism Dev Disord. 2010; 40(1):123–129. PMID: 19653088.
2. Philip RC, Whalley HC, Stanfield AC, Sprengelmeyer R, Santos IM, Young AW, et al. Deficits in facial, body movement and vocal emotional processing in autism spectrum disorders. Psychol Med. 2010; 40(11):1919–1929. PMID: 20102666.
3. Planalp S. Varieties of cues to emotion in naturally occurring situations. Cogn Emotion. 1996; 10(2):137–153.
4. de Gelder B. Why bodies? Twelve reasons for including bodily expressions in affective neuroscience. Philos Trans R Soc Lond B Biol Sci. 2009; 364(1535):3475–3484. PMID: 19884142.
5. Cho HJ, Kim S. Emotion perception and multisensory integration in autism spectrum disorder: a review of behavioral and cognitive neuroscience studies. Sci Emot Sensib. 2018; 21(4):77–90.
6. Doikou-Avloidou M, Dadatsi K. Enhancing social integration of immigrant pupils at risk for social, emotional, and/or behavioral difficulties: the outcomes of a small-scale social-emotional learning program. Emot Behav Difficulties. 2013; 18(1):3–23.
7. Tardif C, Lainé F, Rodriguez M, Gepner B. Slowing down presentation of facial movements and vocal sounds enhances facial expression recognition and induces facial-vocal imitation in children with autism. J Autism Dev Disord. 2007; 37(8):1469–1484. PMID: 17029018.
8. Rozga A, King TZ, Vuduc RW, Robins DL. Undifferentiated facial electromyography responses to dynamic, audio-visual emotion displays in individuals with autism spectrum disorders. Dev Sci. 2013; 16(4):499–514. PMID: 23786469.
9. Carper RA, Moses P, Tigue ZD, Courchesne E. Cerebral lobes in autism: early hyperplasia and abnormal age effects. Neuroimage. 2002; 16(4):1038–1051. PMID: 12202091.
10. Zilbovicius M, Meresse I, Chabane N, Brunelle F, Samson Y, Boddaert N. Autism, the superior temporal sulcus and social perception. Trends Neurosci. 2006; 29(7):359–366. PMID: 16806505.
11. Zatorre RJ, Salimpoor VN. From perception to pleasure: music and its neural substrates. Proc Natl Acad Sci U S A. 2013; 110(Suppl 2):10430–10437. PMID: 23754373.
12. Warrenburg LA. Choosing the right tune: a review of music stimuli used in emotion research. Music Percept. 2020; 37(3):240–258.
13. Fritz T, Jentschke S, Gosselin N, Sammler D, Peretz I, Turner R, et al. Universal recognition of three basic emotions in music. Curr Biol. 2009; 19(7):573–576. PMID: 19303300.
14. Juslin PN. Perceived emotional expression in synthesized performances of a short melody: capturing the listener’s judgment policy. Music Sci. 1997; 1(2):225–256.
15. Laukka P, Eerola T, Thingujam NS, Yamasaki T, Beller G. Universal and culture-specific factors in the recognition and performance of musical affect expressions. Emotion. 2013; 13(3):434–449. PMID: 23398579.
16. Eerola T, Vuoskoski JK. A review of music and emotion studies: approaches, emotion models, and stimuli. Music Percept. 2013; 30(3):307–340.
17. Gabrielsson A, Juslin PN. Emotional expression in music performance: between the performer’s intention and the listener’s experience. Psychol Music. 1996; 24(1):68–91.
18. Juslin PN, Sloboda JA. Music and Emotion: Theory and Research. Oxford, UK: Oxford University Press;2001.
19. Juslin PN, Sloboda JA. Handbook of Music and Emotion: Theory, Research, Applications. New York, NY, USA: Oxford University Press;2010.
20. Juslin PN, Laukka P. Communication of emotions in vocal expression and music performance: different channels, same code? Psychol Bull. 2003; 129(5):770–814. PMID: 12956543.
21. Sloboda JA, Lehmann AC. Tracking performance correlates of changes in perceived intensity of emotion during different interpretations of a Chopin piano prelude. Music Percept. 2001; 19(1):87–120.
22. Demos AP, Chaffin R, Begosh KT, Daniels JR, Marsh KL. Rocking to the beat: effects of music and partner’s movements on spontaneous interpersonal coordination. J Exp Psychol Gen. 2012; 141(1):49–53. PMID: 21668129.
23. Gabrielsson A. Emotion perceived and emotion felt: same or different? Music Sci. 2001; 5(1):Suppl. 123–147.
24. Huron D. Sweet Anticipation: Music and the Psychology of Expectation. Cambridge, MA, USA: The MIT Press;2006.
25. Juslin PN. From everyday emotions to aesthetic emotions: towards a unified theory of musical emotions. Phys Life Rev. 2013; 10(3):235–266. PMID: 23769678.
26. Rigg MG. The mood effects of music: a comparison of data from four investigators. J Psychol. 1964; 58(2):427–438.
27. Scherer KR. Which emotions can be induced by music? What are the underlying mechanisms? And how can we measure them? J New Music Res. 2004; 33(3):239–251.
28. Yang YH, Chen HH. Machine recognition of music emotion: a review. ACM Trans Intell Syst Technol. 2012; 3(3):1–30.
29. Bae S, Kang KD, Kim SW, Shin YJ, Nam JJ, Han DH. Investigation of an emotion perception test using functional magnetic resonance imaging. Comput Methods Programs Biomed. 2019; 179:104994. PMID: 31443867.
30. Yang YH, Liu CC, Chen HH. Music emotion classification: a fuzzy approach. In : Proceedings of the 14th ACM International Conference on Multimedia; 2006 Oct 23-27; Santa Barbara, CA, USA. New York, NY, USA: Association for Computing Machinery;2006. p. 81–84.
31. Behrens GA, Green SB. The ability to identify emotional content of solo improvisations performed vocally and on three different instruments. Psychol Music. 1993; 21(1):20–33.
32. Freitas C, Hunt BA, Wong SM, Ristic L, Fragiadakis S, Chow S, et al. Atypical functional connectivity during unfamiliar music listening in children with autism. Front Neurosci. 2022; 16:829415. PMID: 35516796.
33. Dosen A. Applying the developmental perspective in the psychiatric assessment and diagnosis of persons with intellectual disability: part I--assessment. J Intellect Disabil Res. 2005; 49(Pt 1):1–8. PMID: 15634306.
34. Martínez-Castilla P, Gutiérrez-Blasco IM, Spitz DH, Granot R. The efficacy of music for emotional wellbeing during the COVID-19 lockdown in Spain: an analysis of personal and context-related variables. Front Psychol. 2021; 12:647837. PMID: 33897554.
35. Harris DA, Hayes-Skelton SA, Ciaramitaro VM. What’s in a face? How face gender and current affect influence perceived emotion. Front Psychol. 2016; 7:1468. PMID: 27733839.
36. Beck AT, Epstein N, Brown G, Steer RA. An inventory for measuring clinical anxiety: psychometric properties. J Consult Clin Psychol. 1988; 56(6):893–897. PMID: 3204199.
37. Cho MJ, Kim KH. Diagnostic validity of the CES-D (Korean version) in the assessment of DSM-III-R major depression. J Korean Neuropsychiatr Assoc. 1993; 32(3):381–399.
38. Hirschfeld RM, Williams JB, Spitzer RL, Calabrese JR, Flynn L, Keck PE Jr, et al. Development and validation of a screening instrument for bipolar spectrum disorder: the Mood Disorder Questionnaire. Am J Psychiatry. 2000; 157(11):1873–1875. PMID: 11058490.
39. Jon DI, Hong N, Yoon BH, Jung HY, Ha K, Shin YC, et al. Validity and reliability of the Korean version of the Mood Disorder Questionnaire. Compr Psychiatry. 2009; 50(3):286–291. PMID: 19374975.
40. Heath TP, Melichar JK, Nutt DJ, Donaldson LF. Human taste thresholds are modulated by serotonin and noradrenaline. J Neurosci. 2006; 26(49):12664–12671. PMID: 17151269.
41. Huron D. Affect induction through musical sounds: an ethological perspective. Philos Trans R Soc Lond B Biol Sci. 2015; 370(1664):20140098. PMID: 25646521.
42. Juslin PN, Harmat L, Eerola T. What makes music emotionally significant? Exploring the underlying mechanisms. Psychol Music. 2014; 42(4):599–623.
43. Mohn C, Argstatter H, Wilker FW. Perception of six basic emotions in music. Psychol Music. 2011; 39(4):503–517.
44. Biassoni F, Balzarotti S, Giamporcaro M, Ciceri R. Hot or cold anger? Verbal and vocal expression of anger while driving in a simulated anger-provoking scenario. SAGE Open. 2016; 6(3):2158244016658084.
45. Kerr MA, Schneider BH. Anger expression in children and adolescents: a review of the empirical literature. Clin Psychol Rev. 2008; 28(4):559–577. PMID: 17884263.
46. Laukka P, Neiberg D, Forsell M, Karlsson I, Elenius K. Expression of affect in spontaneous speech: acoustic correlates and automatic detection of irritation and resignation. Comput Speech Lang. 2011; 25(1):84–104.
47. Barrett LF. Are emotions natural kinds? Perspect Psychol Sci. 2006; 1(1):28–58. PMID: 26151184.
48. Lyvers M, Holloway N, Needham K, Thorberg FA. Resilience, alexithymia, and university stress in relation to anxiety and problematic alcohol use among female university students. Aust J Psychol. 2020; 72(1):59–67.
49. Mason O, Tyson M, Jones C, Potts S. Alexithymia: its prevalence and correlates in a British undergraduate sample. Psychol Psychother. 2005; 78(Pt 1):113–125. PMID: 15826409.
50. Levinson J. Musical chills and other delights of music. Davidson JW, editor. The Music Practitioner. Aldershot, UK: Ashgate Publishing;2004. p. 335–351.
51. Allgood R, Heaton P. Developmental change and cross-domain links in vocal and musical emotion recognition performance in childhood. Br J Dev Psychol. 2015; 33(3):398–403. PMID: 26080754.

SUPPLEMENTARY MATERIALS

Supplementary Table 1

List of passages for ‘Happy’
jkms-38-e30-s001.doc

Supplementary Table 2

List of passages for ‘Anger’
jkms-38-e30-s002.doc

Supplementary Table 3

List of passages for ‘Sadness’
jkms-38-e30-s003.doc