IMPROVING L2 PRONUNCIATION INSIDE AND OUTSIDE THE CLASSROOM: PERCEPTION, PRODUCTION AND AUTONOMOUS LEARNING OF L2 VOWELS

Spanish/Catalan learners of English as a Foreign Language (EFL) attended a formal instruction (FI) period combined with explicit pronunciation instruction, which consisted of theoretical and practical approaches to English segmental phonetics as well as a contrastive analysis between the participants’ irst languages and the target language, English. he EFL learners’ ability to perceive and produce L2 vowels was assessed before and ater the 8-week instructional treatment. Results show that the EFL learners signiicantly improved their perception of vowel sounds embedded in real and non-words. However, no improvement in production was found as a result of the instruction received. While these results suggest that learners’ perceptual skills can be improved with teacher-led instruction, the quantity and availability of explicit pronunciation instruction was not suicient to modify learners’ speech production. hus, optimal results require learners to continue learning outside the classroom context. With this aim, this paper presents two autonomous activities developed to increase learners’ awareness about phonology: L1-L2 Pronunciation Comparison Task and a Phonological Self-awareness Questionnaire.


Introduction
Pronunciation is a crucial component of the learning of oral skills in a second language (L2 1 ) and according to previous research, oral skills and pronunciation rely on the exposure to good quality input in order to be successfully enhanced (Celce-Murcia et al., 1996;Flege, 1991;Long & Larsen-Freeman, 1991).his might result problematic in the foreign language (FL) setting where authentic input (i.e.native/native-like) tends to be scarce (Celce-Murcia et al., 1996;Muñoz, 2008;Saito, 2015) and where learners have limited opportunities for interaction with the target language (TL).According to Larson-Hall (2008), FL instruction comprises minimal input conditions and is usually delivered in no more than four hours of instruction per week.herefore, the quantity of language input received by learners acquiring a language in these contexts is usually restricted to the teachers' instruction, which might partly be delivered in the learners' native language (Muñoz, 2008).Muñoz (2011) and Derwing and Munro (2015) highlight the need to provide learners not simply with additional exposure but also with greater quality of exposure inside and outside the classroom.Authors (2015) found that Portuguese EFL learners outperformed Catalan learners of English when identifying and discriminating English voiceless stops, despite the comparable amount of years of formal instruction between the two groups and the similarity of the L1s. he authors speculated that, these outcomes might be attributed to the greater quantity, and possibly quality (i.e.authentic), of input received by the Portuguese learners outside the classroom.his might be connected to the fact that Portuguese learners are exposed to native English input through TV programs and ilms on a regular basis rather than being generally exposed to foreign shows and ilms dubbed in the local language, as is the case in Spain.However, further research with more controlled measures would have to be carried out in order to conirm this issue.
Consequently, previous studies investigating the efect of formal instruction (FI) 2 on oral skills and/or pronunciation in the EFL context have failed to provide evidence that the input found in the EFL setting can alter these domains (Fullana & Mora, 2009;García-Lecumberri & Gallardo del Puerto, 2003;Monje-Sangüesa, 2016).One noteworthy reason for this lack of success lies in the fact that English pronunciation has been described as one of the most challenging skills to be learned (Calvo Benzies, 2014;Moyer, 1999;Scovel, 1988).In fact, Moyer (1999) explains that whilst learners may achieve near native-like abilities in other aspects of a foreign language (i.e., grammar and/or lexis); learners rarely achieve native-like abilities in pronunciation, even ater years of experience with the target language.As a consequence, most adult EFL learners are likely to speak with a foreign accent (Scovel, 1988).
Positive results of pronunciation enhancement as a result of FI have previously been reported in one of the four following scenarios: i) when the amount of FI was considerably large (Saito, 2015), ii) when the FI was accompanied by corrective feedback (Saito & Lyster, 2011), iii) when the FI was accompanied by explicit pronunciation instruction (Gordon & Darcy, 2016;Kissling, 2013;homson & Derwing, 2014) and iv) when learners' awareness of the cross-linguistic features between the L1 and the L2 had risen (Alves & Magro, 2011;Kennedy & Troimovich, 2010;Silveira & Alves, 2009;Wrembel, 2005).his last scenario might be a consequence of any of the three previous ones (i, ii or iii) or even, a consequence of autonomous learning.Topics iii and iv, explicit pronunciation instruction and raising language learners' phonological awareness, will be further explored in the review of literature, since they are directly connected with the objectives of this paper.
he present paper builds on two theoretical frameworks: Schmidt's Noticing Hypothesis (1990) and Flege's Speech Learning Model (1995).Schmidt (1990;1995) postulates that learning cannot take place without the learner consciously noticing the target feature.Applied to the ield of L2 speech learning, noticing the phonological form is necessary for its acquisition.Moreover, Schmidt (1990) as well as other researchers (e.g.Ellis, 2005;Long 1991) advocate that noticing can be enhanced with explicit instruction.he Speech Learning Model (SLM), on the other hand provides speciic predictions on how L2 learner can come to notice L2 sounds.Namely, the L2 learner is more likely to notice, and consequently form categories, for L2 sounds that are perceived as dissimilar to the L1 sounds.On the other hand, the learner is more likely to have diiculties in noticing diferences between L1 and L2 vowels that are perceived as similar (equivalence classiication).
he objective of the paper is to examine the efectiveness of FI combined with explicit pronunciation instruction on developing EFL learners' perception and production on L2 vowels and to present consciousness-raising pronunciation activities which could be employed by the language learners autonomously in order to complement the beneits of explicit pronunciation instruction.he paper is structured as follows.We will begin by discussing some previous studies on explicit pronunciation instruction and the employment of consciousness-raising activities in the L2.We will then present the results of a study examining the efect of explicit pronunciation instruction on L2 vowel production and perception by looking into the objectives, participants, method, procedure and the results.We will then turn to presenting two pronunciation activities developed to raise EFL learners' awareness about L2 pronunciation.Finally, we will discuss the role of explicit pronunciation instruction and consciousness-raising in L2 pronunciation teaching and put forward some suggestions for further research.

Literature review
In this section we will review some recent studies on the efectiveness of explicit pronunciation instruction and the role of phonological awareness in the FL classroom.Gordon & Darcy (2016) suggest that explicit L2 pronunciation teaching would be necessary in order to enhance EFL learners' oral abilities.In fact, despite the discouraging results of some early studies on the efect of pronunciation instruction on EFL learners' production of L2 sounds (Purcell & Suter, 1980), recent research evaluating the efect of pronunciation instruction in improving the perception and/or production of L2 target sounds has reported positive results (Gordon & Darcy, 2016;Kissling, 2013;Saito, 2012;homson & Derwing, 2014).For instance, Saito (2012) investigated the efects of a short period of L2 explicit pronunciation teaching (four hours) on the accentedness and comprehensibility of 20 native Japanese learners of English.Instruction targeted English L2 sounds that are commonly mispronounced by Japanese learners.Learners' oral abilities were assessed immediately before and ater the pronunciation instruction and they were judged by four native English speakers (NES).No signiicant reduction in foreign accentedness was observed as a result of the explicit instruction received.However, the learners' comprehensibility, as perceived by the NES, had signiicantly improved.

Explicit pronunciation instruction
In another study that investigated the efects of L2 pronunciation instruction, Gordon, Darcy & Ewert (2013) compared three groups of ESL learners difering on the type of pronunciation instruction received.One group received instruction on segmentals, another group received instruction on suprasegmentals and the third group was taught by a combination of both.Learners' comprehensibility was assessed immediately before and ater a 3-week instruction period by a group of 12 non-native English speakers (NNES).Results revealed that only the learners whose instruction focused on suprasegmentals were perceived signiicantly more comprehensible at post-test.
In a later study, homson and Derwing (2014) reviewed 75 diferent pronunciation studies and concluded that "pronunciation instruction is efective in improving the target form(s)" (p.7), since 82% of the studies reported a signiicant improvement as a result of the instruction received.Moreover, the authors add that "pronunciation research and instruction should be primarily concerned with helping learners become more understandable" (p.2).
In spite of the reported positive efects of pronunciation instruction in improving learners' intelligibility and comprehensibility of L2 speech, pronunciation instruction has not been given the necessary importance in the FL classroom (Piske, 2008).According to Gilbert (2010), pronunciation still is the EFL orphan, since it is either completely neglected in the FL classroom or it is the language aspect given the least attention (Fraser, 2000).Setter and Jenkins (2005) add that neither pronunciation instruction nor training of perceptual and/or production abilities have a secure place in most language curriculums.hat is so, since pronunciation instruction is oten viewed as a complementary activity rather than an essential part of the EFL syllabus (Cenoz & García-Lecumberri, 1999).
Against this lack of attention given to pronunciation in the FL classroom, alternative avenues may be found in High Variability Phonetic Training (HVPT) and/or autonomous learning.On the one hand, HVPT, which provides learners with immediate feedback, might be a good complement to explicit pronunciation instruction in order to explicitly draw the learners' attention to challenging L2 segments.Both perceptive and productive abilities might be enhanced by HVPT even ater a short training regime (e.g., Author, 2017a;Authors, 2014a;Alves & Luchini, 2016;Bradlow, 2008;Logan & Pruitt, 1995;Rato & Rauber, 2015).On the other hand, the creation of autonomous pronunciation activities to learners might also be efective in facilitating L2 phonological learning.According to Holec (1981), autonomous activities promote the individualization of learning by allowing learners to control their own performance and gradual progress.Moreover, previous research indicates that the employment of consciousness-raising activities in FL classroom is beneicial for students' language proiciency (e.g., White & Ranta, 2002), an issue which will be further discussed in the following section.

Raising language learners' phonological awareness
Even though a large number of studies has shown the efectiveness of consciousness raising activities in FL learning, in the ield of phonology, the employment of activities to raise awareness about L2 phonology has not been extensively studied.Phonological awareness is seen to form part of Language Awareness, which can be deined as "explicit knowledge about language, and conscious perception and sensitivity in language learning, language teaching and language use" (Association for Language Awareness, 2012).Extending this deinition to the ield of phonology, phonological awareness in the L2 context can be seen to consist of declarative and procedural knowledge.
Viewing phonological awareness in the L2 as consisting of both verbalizable and non-verbalizable knowledge explains why L2 speech learning might be a complex task.Whereas morphosyntactic and lexical acquisition are more susceptible to conscious learning eforts, the inherent nature of speech makes conscious noticing of phonological features diicult for L2 learners (Jilka, 2009).Furthermore, due to trade-ofs between form and meaning, only the most proicient language learners whose attentional resources are no longer needed in deciphering the meaning of the message can focus on its form (VanPatten, 1996).Following Schmidt (1995, and elsewhere), if a (phonological) feature is not noticed, it will not be acquired.For these reasons, aiding learners to notice features of L2 pronunciation is essential.
Previous studies indicate that language learners who are more aware of their L2 pronunciation and who possess high L2 phonological awareness demonstrate more accurate L2 speech perception and production (Author, 2015;Baker & Troimovich, 2006).Phonological awareness about the L2 can be developed through any activity that brings a speciic aspect into the language learners' consciousness (consciousness-raising; CR).Examples of CR activities in L2 speech learning are: explicit teaching of L2 phonology (e.g. Gordon & Darcy, 2016;Kissling, 2013;Saito, 2012), explicit comparing and contrasting of L1 and L2 phonology (Solé & Estebas, 2000), articulatory and/or perceptual training (e.g.Alves & Magro, 2011;Authors, 2014a), use of enhanced input and the use of feedback (e.g.Saito, 2013), to name but a few.
he vast majority of studies focusing on increasing language learners' awareness about the L2 phonology has taken place in an instructional, teacher-or researcherled setting.Some studies (e.g.Kennedy et al., 2014;Wrembel, 2011) have nevertheless employed CR-activities that with little or no instruction could be applied by the learners autonomously outside the classroom.Kennedy et al. (2014), Kennedy and Blanchet (2014) and Kennedy and Troimovich (2010) taught learners to keep learning journals in order to write down relections on pronunciation learningrelated issues raised while they were attending a TL speaking and listening course.Wrembel (2011) recorded language learners and asked them to elaborate on some metaphonetic questions while the recordings were played back to them (stimulated recall). he results from these studies are positive for L2 speech development: learners seemed to become more aware of the target language phonology, which eventually might lead to more target-like perception and production.
As mentioned in the previous section, increasing learner autonomy is highly beneicial as becoming an autonomous learner means that learning is not limited to the classroom context.Teaching learners how to increase awareness about L2 phonology does not only positively relect on their L2 pronunciation, but also enables them to take control of their pronunciation learning by developing selfmonitoring abilities.his seems especially relevant for L2 speech acquisition when we take into account the already discussed issues of the quality and quantity of L2 input available for learners in the FL classroom (Derwing & Munro, 2015;Larson-Hall, 2008) and the inherent diiculty of noticing L2 phonology (Jilka, 2009;VanPatten, 1996).
Consequently, we part from the idea that the best results for L2 speech development may be obtained by combining explicit pronunciation teaching inside the classroom with helping the learners to continue raising their phonological awareness outside the classroom.Section 2 discusses an empirical study which aimed at improving learners' perception and production of L2 vowels through explicit instruction.Section 3 presents two autonomous CR activities which were designed to help language learners to increase their phonological awareness outside the classroom.

Improving pronunciation inside the classroom
his classroom-based study aimed at further investigating the efects of an instructional treatment on the perception and production of L2 sounds.More speciically, this study sought to determine whether an 8-week FI period combined with explicit pronunciation instruction could positively afect the perception and/or the production of ive standard Southern British English (SSBE) vowels / iː, ɪ, ae, ʌ, ɜː/ by Spanish/Catalan EFL learners.he speciic L2 vowel sounds were carefully selected on the basis of previous research that reports these vowels to be commonly mispronounced by native speakers of Spanish/Catalan (Cebrian, 2006;2015;Rallo-Fabra & Romero, 2012).Regarding the vowel pair /i/-/ɪ/, their acquisition is problematic since neither Spanish nor Catalan has a comparable tense-lax distinction.Catalan learners of English have been found to assimilate English /i/ to their L1 /i/ category, while the lax vowel /ɪ/ has been found to be less consistently categorized, being mapped onto Catalan /e/, and to a lesser extent to /i/ (Cebrian, 2006). he vowel pair /ae/-/ʌ/ is troublesome for these learners as there is no low front-back distinction in Spanish or Catalan, both languages having only one low vowel, /a/.As a consequence, Catalan/Spanish bilinguals have been found not only to perceive, but also to produce both English /ae/ and /ʌ/ in terms of the existing L1 vowel category (Aliaga-García & Mora, 2009;Fullana & MacKay, 2003).Concerning the central vowel /ɜː/, Cebrian et al. (2011) report no clear match for this sound in Spanish/Catalan, being assimilated with low degrees of perceptual assimilation to the Catalan sounds /ɛ, e, o, ɔ/.Moreover, Cebrian (2015) found the SSBE central vowel /ɜː/ to be very dissimilar from Spanish and Catalan vowels and stated that "such vowel is a good candidate for accurate L2 categorization given enough exposure to the target language" (p.4), in accordance with the SLM (Flege, 1995).
In light of the above, the study sought to investigate the following research questions and hypotheses.
RQ1: Does an 8-week FI period combined with explicit pronunciation instruction modify the perception of L2 vowel sounds by Spanish/Catalan learners of English?H1: In line with previous research, it is hypothesized that an 8-week FI period combined with explicit pronunciation instruction will play a role in modifying L2 learners' vowel perception (Gordon et al., 2013;Saito, 2015), due to the combination of TL exposure and the metalinguistic knowledge received.RQ2: Does an 8-week FI period combined with explicit pronunciation instruction modify the production of L2 vowel sounds by Spanish/Catalan learners of English as perceived by native English Speakers?H2: In line with previous research, it is hypothesized that an 8-week FI period combined with explicit pronunciation instruction will not suice in modifying L2 learners' production abilities (Fullana & Mora, 2007).Longer periods of FI would necessarily be undertaken in order to observe signiicant pronunciation gains (e.g.Saito, 2015).

Participants
A group of sixteen Spanish/Catalan bilinguals learning English as an L2 made part of the cohort of the present study.All learners were second-year English majors at a state funded university in Barcelona and were enrolled in the third semester of an English Studies degree.Students were attending ive compulsory courses, which were taught entirely in English 4 either by native or proicient non-native instructors 5 .Importantly, all students were taking an introductory phonetics and phonology course at the time, which consisted of theoretical and practical approaches to English segmental phonetics as well as a contrastive analysis between the participants' L1 and the TL, English.Further details on the introductory phonetics and phonology course and the English exposure learners were receiving are provided in section 2. 1.4.hree SSBE NES, who were currently living in the UK, made part of the study by providing baseline data and validating the materials.An additional group of four SSBE NES judged learners' oral productions for comprehensibility.Due to practical reasons, the NES judges were currently living in Barcelona.hey were all TEFL teachers, who had been in Barcelona for an average of 6 years at the time of the data collection.Tests were performed in a quiet room by making use of good quality headphones (SONY ZX310) connected to a laptop computer.Learners' language proiciency was assessed through the Cambridge Online test (COT), which is an open source proiciency test. 6Table 1 presents demographic information about the participants.As observed in Table 1, the mean score of the COT was 7.6 out of 10, which corresponds to an upper intermediate level of proiciency.Nevertheless, the results ranged from 3.7 to 10, meaning that the level of proiciency among the participants was not homogeneous.Out of the 16 participants, 10 participants were female and six were male.heir mean age was 19.6, ranging from 18 to 23.Learners' irst exposure to English ranged from three to 12 years, with a mean of 5.8.Moreover, the average number of years of English FI was 12.3, ranging from a minimum of ive and a maximum of 18.Additionally, it is relevant to note that learners difered on their self-reported language dominance.hat is, six learners claimed to be dominant in Catalan and ten learners reported being dominant in Spanish. 7

Procedure and testing tasks
he L2 learners were tested previous to (T1) and ater (T2) an eight-week period of FI (including pronunciation instruction (see section 2.1.4))in English.
Testing assessed learners' ability to perceive and produce English vowels and aimed at assessing perceptual and/or production changes emerging as a result of the instructional treatment.
A seven-forced-choice (7FC) identiication task assessed learners' perception. he perceptual testing was delivered by means of the sotware TP 3.1 (Rauber, Rato, Kluge & Santos, 2012) and the task contained the ive target vowel sounds /iː, ɪ, ae, ʌ, ɜː/ and two illers /e, ɑː/ as response options.Participants were given the following instructions, "You will be hearing a series of non-words and some real words.Your task is to click on the sound you think that matches the word/non-word spoken".Learners' production abilities were assessed through a picture naming task, delivered through a PowerPoint presentation and monitored by the irst author and a sound technician. he L2 production data was analysed by means of NES comprehensibility judgments.Comprehensibility in this study was understood as 'the degree of diiculty the listener reports in attempting to understand the utterance' (Munro & Derwing, 2001 p. 454).Production was rated on a 9-point-likert scale, in which 1 meant "easy to recognize as the target sound" and 9 meant "diicult to recognize as the target sound".he NES judges irst identiied the vowel sounds that they heard in a 7FC Identiication task 8 , and subsequently rated the learners' production by indicating the degree of diiculty to recognize the produced sound as the target sound.Each target vowel was recorded in two diferent words twice (2 x 5 x 2), totalling in 20 words per participant.hus, 320 tokens were analysed.Two production measures were adopted: percentage of correct identiication of target sounds by NES and the median of the comprehensibility rating scores.All stimuli were evaluated in one single session.Judges could hear each individual production twice and could not go back to previous stimuli and/or change their previous responses.A reliability analysis using an intra-class correlation coeicient (ICC) with a level of "absolute agreement" was conducted on the rating scores in order to assess whether the raters agreed with one another.he results of the reliability analysis revealed a robust rater-agreement (α=.883), conirming that the four judges agreed on the analysis of the tests performed.
Both tests were done at the speech laboratory at the university where the learners were studying.he perception test took place in a quiet computer room with individual computers and headphones, and the production test took place in a soundproof room at the same institution.At both testing times (T1 and T2), the production testing occurred before the perceptual testing in the attempt to minimize any carry-over efect from the perception task to the production task. he overall duration of the testing session (perception + production) ranged from 30-40 minutes with two time-controlled breaks of 30 seconds, and learners were given course credit for their participation.Prior to the assessment, three SSBE NES performed the perceptual ID task and obtained over 96% accurate performance, thus validating the stimuli.he instructional treatment took place at the same institution and will be described further in section 2.1.4.

Stimuli
Stimuli for the perception task consisted of CVC real and non-words containing the ive target vowel sounds /iː, ɪ, ae, ʌ, ɜː/ produced by two SSBE speakers (a male and a female).hey were both from and had spent most of their lives in the south of England and thus they spoke a homogeneous and standard variety of British English, fulilling the requirement for selection.None of the talkers reported speaking any other languages luently and/or having any knowledge or contact with Spanish/Catalan in their daily routines.Talkers were recorded within a period of a week and were paid for their participation.All participants reported having normal vision and hearing Recordings were carried out at the Speech, Hearing and Phonetic Science Department at University College London (UCL).Stimuli were elicited by means of a sentence reading task which made use of the following carrier sentence: "I say X, I say X now, I say X again".he stimuli was composed by 30 CVC non-words, seven practice non-words and eight non-words illers involving the vowels /e/ and /ɑː/, totalling 150 trials for non-words.Moreover, testing involved ten CVC real words and eight real words as testing illers, totalling 56 trials for real words.he production task, on the other hand, elicited real word stimuli only. he picture naming task contained diferent pictures that were used to elicit participants' production of the ive target sounds.Each word was elicited twice from each participant.he stimuli lists for perception and production can be seen in Appendix 1.

Instructional treatment
As previously mentioned, learners were attending ive courses, which corresponded to the third semester of their English Studies degree.he speciic course list and the amount of hours corresponding to each subject per semester is described in Table 2. Since an entire semester at this given institution corresponds to approximately 16 weeks of FI, the present investigation assessed the efect of half of the total amount of the hours for each subject.In this paper, formal instruction is viewed as the exposure to the TL received during the classes, not necessarily including focus on form (FOF).As observed in Table 2, the total number of hours of FI is divided into two categories: teacher-led and supervised.Teacher-led hours are understood as lecturing time, whereas supervised hours are understood as hours that involve peer interaction with teacher supervision.On these bases, students would be exposed to 240 hours of input through the teacher-lead hours and 117 hours of peer interaction throughout the entire semester.Consequently, 120 hours of good quality input and 58.5 hour of classroom interaction are being evaluated in the present investigation 9 .he explicit pronunciation instruction examined in this study was administered through a compulsory subject titled 'Phonetics and Phonology I' .In this speciic subject, the instruction consisted of providing students with an introduction to phonetics and phonology, an articulatory description of the English sounds and a contrastive analysis between their L1s (Spanish/Catalan) and English.Moreover, some practical approaches, such as phonemic and phonetic transcription, reading practice and computer exercises in the speech laboratory were part of the course.

Results and Discussion
Participants' perception and production of the target vowel sounds was assessed immediately before and ater an instructional treatment, including FI and explicit pronunciation instruction.Perception results were calculated by obtaining the correct percentage scores in the forced choice identiication task.Production results were assessed by means of NES identiication and judgments of learners' productions.Four diferent judges irst identiied learners' productions of L2 vowels embedded in real words and then provided a comprehensibility rating in a 9-point likert scale.he perception results will be presented irst, followed by the production results.ability to perceive the target sounds to a larger extent when embedded in real words (8.3% gain for real words and 4.4% for nonsense words).
he fact that learners identiied the target vowel sounds better when embedded in real words than when embedded in non-words is a noteworthy outcome.his may indicate that learners found it easier to recognize the vowels when they were in words that they recognized and possibly heard during FI.his inding supports previous research indicating that lexical knowledge is essential for vowel category formation (Mora, 2005).It also relates to previous indings from lexical decision studies, which report that learners perform faster and more accurately when identifying real words than non-word stimuli (Vitevitch & Luce, 1999).
As to investigate the efect of FI combined with explicit pronunciation instruction further, the identiication scores for each of the target vowels were examined at both testing times.T-tests were run on the percentage scores and the results can be seen in Table 4 and Figure   As evidenced in Table 4, the vowels which were perceived the poorest from the outset were the vowels /ɜː/, /ae/ and /ʌ/.In turn, the vowels that were perceived the most accurately at the outset were the high front vowels, especially the lax one.hisinitial perception is striking since the SLM would predict that the lax high front vowel, being a dissimilar sound to the L1 inventory, would pose as much diiculty to Spanish/Catalan learners as the vowels /ae-ʌ/.Importantly, learners were able to improve their perception of two vowel sounds embedded in non-words signiicantly, namely /ɪ/ and /ʌ/.Interestingly, these results pattern in line with the predictions of the SLM model, as these two sounds correspond to "dissimilar sounds" from the learners' L1 inventory.he SLM predicts that dissimilar sounds are strong candidates to accurate perception, provided that enough good quality input is received.However, the most dissimilar sound when comparing the L1-L2 vowel inventories (/ɜː/) was not perceived more accurately ater the FI when embedded in non-words.his result is particularly interesting considering the low percentage obtained at both testing times and the fact that learners succeeded in improving their perception of the low-mid central vowel when embedded in real words (Table 3 and Figure 1).

Performance on L2 vowel production
As previously explained, two measures were applied for L2 vowel production analysis: SSBE NES identiication of learners' productions of L2 vowels embedded in real words and NES comprehensibility rating scores.he percentage of correct identiication by the NES and the median rating scores for each token at T1 and T2 were calculated (Table 5.) Table 5.Production performance with target vowel sounds.Percentage of correct identiication and median ratings.As observed in Table 5, 80% of the L2 sounds were correctly identiied at the outset of the study.However, these tokens were poorly rated for comprehensibility (4.5 on a 1-9 scale).he percentage of correctly identiied vowels increased numerically as a result of the FI (3.7%), albeit non-signiicantly, as evidenced by a paired t-test (t = -1.14, df = 15, p =.135, one-tailed).In turn, the rating scores experienced a slight non-signiicant decrease (0.3) from T1 to T2.hese results indicate that the instructional treatment received in this study did not succeed in altering the comprehensibility of the learners' production as perceived by NES.
he rating scores attributed to each individual vowel sound by the NES judges are shown in Table 6.Interestingly, the vowel rated for the highest comprehensibility was the central vowel /ɜː/ followed by the high front vowel /iː/.Recall that the central vowel was the only vowel in whose perception learners improved from T1 to T2, and that the tense high front vowel is the most acoustically similar vowel sound when comparing the L1 and L2 (Cebrian, 2006).In fact, the author describes the L1 high front vowel to be nearly identical to the L2 counterpart.In turn, the most poorly rated vowel was /ʌ/, which seems to pose diiculties to learners both in perception and production.his result goes in line with the predictions of the SLM, since this sound is nonexistent in the learners' L1.
he main objective of this study was to investigate the inluence of formal instruction combined with explicit pronunciation instruction on the perception and production of ive English vowels by Spanish/Catalan bilinguals majoring in English studies.Learners' perception and production skills were assessed before and ater a 8-week period of formal instruction and the analysis revealed a positive perceptual improvement as a result of a semester of formal instruction received, which contained explicit pronunciation instruction as well as additional transcription exercises.Participants mostly enhanced their perception of two vowel sounds from T1 to T2, namely /ɪ/ and /ʌ/.his improvement is probably accounted for by three main factors: the positive impact of explicit pronunciation instruction (Darcy et al., 2012;Kissling, 2013;homson & Derwin, 2014), since it entails focus on form (Long & Larsen-Freeman, 1991), and the consequent increase in phonological awareness of the two novel English sounds by the participants.According to Author (2015), explicit pronunciation instruction might enhance the awareness of the target sounds by making them more noticeable, and as a consequence, more accurate perception and/or production of the speciic target sounds might occur.
No improvement was observed on the production of the L2 sounds as measured by NES judgements immediately ater the instructional treatment, as measure in T2.his result suggests that a period of 8-week of FI, even when including explicit pronunciation instruction did not suice in altering learners' production abilities immediately.Possibly, a larger amount of exposure to the TL would be required in order to modify learners' production of vowel sounds from T1 to T2.hus, this study has provided further evidence to the fact that the quantity and/or quality of input received in the EFL context might not be enough to alter learners' pronunciation (Muñoz, 2011).In the assumption that perception might lead production skills (Flege, 1995), the fact that learners were able to enhance their perception through FI might indicate that production changes might take place at a later phase.However, due to the lack of a delayed post-test in this study, this hypothesis cannot be conirmed.
Given the classroom-based nature of this study, explicit pronunciation instruction took place during the compulsory semester, which makes impossible to separate the two types of instruction.his might indicate that some gains in L2 perception found in the current study may be attributable to the L2 input received from general language classes or to the explicit pronunciation instruction received.It becomes diicult to assess whether pronunciation instruction or classroom interactions led to perceptual improvement.Ideally, the performance of a control group receiving no explicit pronunciation instruction would be assessed.However, due to fact that all subjects are mandatory in the third semester at this given institution, this scenario was not possible.

Improving pronunciation outside the classroom: raising learners' phonological awareness
he results of the experimental study described in the preceding section suggest that being exposed to target language FI and explicit pronunciation instruction is not necessarily enough to result in substantial gains in the perception and production of L2 speech as far as short term efects are concerned.Taken into account the low amount of input language learners are exposed to in a FL instructional setting, learners should be encouraged to seek learning opportunities outside the classroom.Albeit relevant for improving all the areas of L2, this is especially true for the case of pronunciation due to the diiculty in noticing phonetic details in the speech stream.Noticing is a pre-requisite for learning (Schmidt, 1995), and a large body of research attests for the beneits of activities designed to promote noticing (CR activities).Moreover, learning outside the classroom develops learner autonomy.Autonomous learning increases motivation, makes learning more meaningful, and allows individualization and tracking of one's learning (Holec, 1981).
With this in mind, the present section presents two tested and validated autonomous activities designed to raise language learners' phonological awareness.he presented activities require very little or no guidance from the instructor and can be carried out by learners of any proiciency level and with no prior knowledge in phonetics and phonology.However, learners with background in phonetics and phonology will also beneit from the activities.Both activities are designed to increase noticing by raising learners' awareness of their own pronunciation, and can be applied to both, segmental and suprasegmental levels.
he irst proposed activity is an L1-L2 pronunciation comparison task.In this task, learners will irst record themselves reading L2 sentences aloud and will subsequently compare their recording to native speaker productions of the same sentences.he ability to perceive diferences between one's own speech and the L2 auditory targets might lead the speaker to modify their speech to correspond to the targets more closely, facilitating the development of accurate articulatory and auditory representations, as hypothesized by Baker and Troimovich (2006).Comparison can be carried out orally (think aloud) or in writing, and the learner should engage into as much detail as possible, listening to both samples several times and stopping when necessary.Lower-level learners could reduce the playback speed in order to facilitate the task (aural enhancement).When native speakers are not readily available, online learning materials (e.g.podcasts, videos) could be used alternatively.
Learners could be asked to focus on a speciic feature (e.g.vowels) or to listen to the whole sample and elaborate on any features noticed.For example, in Author (forthcoming), Spanish/Catalan learners of English recorded L2 sentences with challenging phones.Participants were then divided into three groups that received diferent instructions for the L1-L2 pronunciation comparison.Learners in Group 1 were told to pay special attention to the vowels, learners in Group 2 were told to pay special attention to the consonants and learners in Group 3 were not given a speciic focus and were asked to comment on any diferences they could notice.he analysis of the groups' performance hopes to shed light on which focus (narrow/ broad) would lead to more noticing.
Unless speciically required, the learners should not be required to use technical vocabulary but they should be allowed to explain in their own words, as declarative knowledge about phonology is not a requirement for L2 speech development and its use could be seen intimidating.hat being said, learners with background in phonetics and phonology could beneit from employing the appropriate terminology or even trying to phonetically transcribe the productions.Furthermore, for students who are familiar with phonetics and phonology, using a speech analysis sotware and accompanying the auditory comparisons with visual spectrogram and waveform analysis could be interesting.Ramírez Verdugo (2006) aided L1-Spanish learners of English to analyse TL pitch contours auditorily and visually during a 10-week training program.Her indings showed that learners' awareness about English intonation had risen as a result of the training.Moreover, learners were able to transfer the newly gained awareness to production, and manifested a more target-like prosodic performance.
Another task that can help language learners to increase their awareness of L2 phonology is to encourage them to relect on pronunciation through thinkaloud protocols or questionnaires, for example.Due to the very nature of speech, which unravels nonstop, and to the supremacy of meaning over form (VanPatten, 1996), language learners rarely stop to think about pronunciation unless they are asked to do so in a pronunciation instruction class, for instance.Foreign language learners, and native speakers, for that matter, are largely unaware of the movements of their articulators and the constant low of stimuli their cognitive and auditory systems are processing.Asking language learners to contemplate their views on pronunciation, to think about sounds that are diicult to pronounce or to all the diferent ways a sentence can be uttered (taking into account regional and social variation, for example) is a valuable way to raise learners' awareness about L2 phonology.In order to aid the learners, the instructor can provide some questions or a speciic topic (e.g.regional vowel variants) for discussion or to employ an already existing questionnaire.
Previous research indicates that engaging in self-relection can be beneicial for developing phonological awareness.Kennedy and Blanchet (2014) asked participants taking a 15-week course in connected speech to relect upon their learning in written journal entries.Comments about language awareness were positively related to the participants' listening comprehension.Wrembel (2011), on the other hand, employed think-aloud protocols, and asked participants questions about pronunciation while they were listening to recordings of their own speech.Participants reported that the activity helped them to become more aware of their own pronunciation and to monitor their speech.
In another study (Author, 2015), Brazilian Portuguese learners of English answered a phonological self-awareness questionnaire at the end of a testing session, which measured the learners' phonological awareness.he learners were asked to provide their opinion on pronunciation-related statements and to evaluate their diiculty in perceiving L2 speech on a 5-point scale.he statements focused on self-perception (e.g."I can hear I have a foreign accent when I speak in English") and on the perception of L1-accented L2 speech (e.g."here are some speciic English sounds that are diicult for Brazilians").he self-evaluations asked learners to estimate their abilities to notice and explain (cf.Schmidt, 1995: awareness at the level of understanding) speech phenomena.For example, learners were asked to think about how easy they ind it to identify a regional accent, to determine whether a given intonation pattern is adequate in English or to explain why a given sound they hear is not pronounced in a target-like manner.Behaviour in this questionnaire was positively related to the participants' L2 phonological awareness at the segmental level (r=.46, p<.001, See Author (2015) for further details), indicating that higher degrees of selfawareness were positively related to the learners' overall knowledge about the L2 segmental domain.his questionnaire asked learners to choose from existing options, but the questions could easily be let open-ended, encouraging a more elaborate relection.Furthermore, the studies reviewed in this section report on self-awareness on L2 pronunciation abilities as whole.Instructors of a course on segmental phonetics and phonology such as the course reported in Section 2.1.4might wish to limit the scope and bring learners' attention to the vowel and consonant sounds of the L2. he aim of this section has been to provide some ideas on how phonological awareness can be fostered by language learners at home.Encouraging language learners to develop their phonological awareness outside the classroom through autonomous activities can be an efective way to aid the noticing of L2 speech features.Combined with FI and explicit phonetics instruction, this is likely to help learners with the aim of aiding noticing and helping them to reach optimal results in their L2 speech development.

General discussion
he present paper had two objectives.Firstly, it presented the results of an empirical study examining the efects of FI combined with explicit pronunciation instruction on the perception and production of ive English vowels.he indings showed that only the learners' perception skills improved immediately ater the 8-week period of formal instruction and that the improved perception was not carried to production.In other words, the learners' pronunciation of the target vowels was not more comprehensible, as judged by NES, ater the testing period.Taken these results into account, the second objective of the paper was to present two autonomous activities that learners could employ outside the classroom in order to enhance their awareness about L2 vowels or L2 phonology in general.
he study reported in this paper is not without limitations, which suggest directions for further exploration.First, the lack of a control group is a limitation of this study.he fact that all students were enrolled in the same mandatory courses made it impossible to have a control group.Another limitation of the present study is the lack of a delayed post-test, which could have revealed production results at a later stage.his question will remain unanswered and it suggests that further research is needed.he fact that the NES judges were currently living in Spain and were TEFL teachers is also a limitation of this paper, since their familiarity with Spanish accented English might have played a role in their ratings.Furthermore, production data was analysed by means of NES ratings only.An acoustic analysis of the stimuli would be necessary to fully evaluate the efect of FI on production, since it might be possible that they have modiied their F1 and F2 values.In addition, contrasting the acoustic data with the NES ratings would allow an analysis of what aspects of production play a more relevant role in NES perception.Another issue that might have afected the outcomes of the present investigation is the fact that the participants were enrolled in a phonetics and phonology course and thus acquired meta-linguistic knowledge during the FI regime.Ideally, it would be interesting to know the extent to which the knowledge of phonetics and phonology may have afected the results, and further test students without this knowledge.
Previous research suggests, on the one hand, that the quantity and quality of the L2 input might not be ideal in FL classroom setting (Derwing & Munro, 2015;Muñoz, 2008) and that noticing phonetic detail from the speech stream is challenging (Jilka, 2009), on the other hand.Consequently, we propose that obtaining optimal results in L2 speech learning would be best achieved by combining classroom-based learning with CR activities carried out autonomously outside the classroom.Whereas language learners can beneit from explicit instruction and the instructor's expertise within the class time, promoting autonomous learning by aiding learners to increase noticing of phonetic detail on their own is likely to lead to superior results.Future research should compare the efects of classroom only vs. classroom and autonomous learning on pronunciation development in order to determine which method would lead to higher gains and which activities within and outside the classroom are the most beneicial.
4. he English variety adopted in the courses was SSBE, however students' accuracy of SSBE English vowel production was only controlled for/assessed during the Phonetics and Phonology classes.
5.Not all instructors were speakers of the SSBE variety of English and as previously mentioned, some instructors were non-native speakers of English.
6. http://www.cambridgeenglish.org/test-your-english/general-english/ 7.For the purpose of the present study, the learners' language dominance was not expected to play role as Catalan and Spanish do not difer in terms of their L1 vowels in relation to the ive vowels under study.
8. All perception tasks used in this study were validated by three SSBE NES.
9. Outside classroom exposure to the TL was not controlled for in the present investigation. 2.

Table 1 .
Demographic characteristics of the participants.

Table 2 .
Compulsory subjects during the third semester of the English Studies degree.

Table 4 .
Identiication performance for individual sounds embedded in real and non-words.