Abstract
In this article, we discuss the development and use of vocabulary lists aimed at learners of French as a foreign language. Such lists are commonly based on corpora, which are ideally representative, relevant and large. For English, corpora of this kind have long been available (e.g. the COCA and the BNC), and vocabulary lists based on them are often used in learning contexts. The situation is less favorable for French: fewer corpora meet these criteria, and there are thus fewer possibilities to create vocabulary lists that are useful for learners. In this contribution, we present the work done to create a vocabulary list, Riksprovsordlistan, which contains about 4,000 words and is used at all Swedish universities. The discussion focuses on methodological challenges such as the choice of counting unit (lemma vs. word family), the role of frequency, thematic vocabulary, and the characteristics of written vs. spoken corpora.
License
Copyright (c) 2022 Christina Lindqvist
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.