Abstract
While corpus linguistics has existed since the 1960s, Forensic Linguistics is
a relatively new discipline, involving both linguistic evidence in court and wider applications of linguistics to legal texts and discourses. Computer corpora of natural language may be marked up in various ways, grammatically tagged, parsed, lemmatised and analysed with concordance, collocation and other specialist soft ware. In the relatively short history of forensic linguistics, its exponents have oft en employed corpus linguistics techniques in order
to throw light on questions like disputed authorship. However, the corpora employed have been general ones such as the Cobuild “Bank of English”, rather than purpose-built databases of language used in legal contexts, with the result that such research sometimes raises more questions than it answers. Conversely, corpus linguists have from time to time incorporated
data from legal settings into their collections; but they have tended to use these resources as the basis for sociolinguistic or historical linguistic research rather than as a means of exploring topics in language and law. This paper makes a plea for these two fi elds, which are both already cross-disciplinary, to join forces and create a purpose-built corpus for forensic linguistics. It illustrates how corpus techniques may be successfully applied to questions of disputed authorship, citing both hypothetical and actual examples. It ends with an outline of the kinds of texts which a proposed new corpus for Forensic Linguistics should contain and the tools required to exploit it eff ectively.
References
Berk-Seligson, S., 1990, The Bilingual Courtroom: Court Interpreters in the Judicial Process. Chicago: University of Chicago Press.
Blackwell, 2000, “Looking up look: Discourse Markers in the Bank of English”. In Kirk, J. ed., Corpora Galore: Analyses and Techniques in Describing English. Amsterdam: Rodopi.
Botley, S., McEnery, A. and Wilson, A. (eds). 2000, Multilingual Corpora: Teaching and Research. Amsterdam: Rodopi.
Burnard, L. ed., 1995, Users’ Reference Guide to the British National Corpus. Oxford: Oxford University Computing Service.
Collins Cobuild English Dictionary for Advanced Learners, 2001. London: Harper Collins.
Coulthard, M., 1994, “Powerful evidence for the defence: an exercise in forensic discourse analysis”. In Gibbons, ed., Language and the Law. London and New York: Longman.
Coulthard, M., 1996, “The official version: Audience manipulation in police records of interviews with suspects”. In Caldas-Coulthard and Coulthard (ed.s), Texts and Practices: Readings in Critical Discourse Analysis. London: Routledge.
Eades, D., 1994, “A case of communicative clash: Aboriginal English and the legal system.” in Gibbons, ed., Language and the Law, London: Longman.
Fox, G., 1993, "A Comparison of 'Policespeak' and 'Normalspeak': a Preliminary Study". In Sinclair, Hoey and Fox (ed.s), Techniques of Description: Spoken and Written Discourse. A festschrift for Malcolm Coulthard. London: Routledge.
Garside, Leech and Sampson ed.s, 1987, The computational analysis of English: a corpus-based approach. London: Longman.
Goutsos, D., 1995, Review article: “Forensic Stylistics”. In Forensic Linguistics vol. 2 no. 1.
Granger, S. (1994) “The Learner Corpus: A Revolution in Applied Linguistics”. English Today 39(10/3), pp. 25-29.
Greenbaum, S., ed., 1996, Comparing English worldwide: the International Corpus of English. Oxford: Clarendon Press.
Hockey, S. and Martin, J., 1987, "The Oxford Concordance Program Version 2", in Literary and Linguistic Computing, 2, pp. 125-131.
Huber, M., 2007, 'Playing tag with Old Bailey. Creating a corpus of 18th-century spoken English' . In Meurman-Solin, A. and Nurmi, A. (eds.): eVARIENG 1: Proceedings of the workshop on corpus annotation at ICAME 27, Helsinki, 24-28 May 2006. Helsinki: VARIENG.
Hymes, D., 1974, "Ways of Speaking", in R. Bauman and J. Sherzer (ed.s), Explorations in the Ethnography of Speaking. Cambridge: C.U.P.
Ihalainen, O., M. Kytö and M. Rissanen, 1987, "The Helsinki Corpus of English Texts: Diachronic and Dialectal: Report on work in progress", in Corpus Linguistics and Beyond. Proceedings of the Seventh International Conference on English Language Research on Computerized Corpora, ed. W. Meijs. Amsterdam: Rodopi.
Kirk, J., 1994, “Taking a Byte at Corpus Linguistics”, in Entering Text, edited by Flowerdew, L. and Tong, A.K.K., pp. 18-43.
Klemola, J. and Jones, M.J., 1999, “The Leeds corpus of English dialects – project”. In Leeds Studies in English 30: 17-30.
Kniffka, H., 2000, “Anonymous Authorship Analysis without Comparison Data? A Case Study with methodological impact”. In: Linguistische Berichte 182, 179-198.
Kucera, H. and Francis, W.N., 1967, Computational analysis of present-day American English. Providence, R.I.: Brown Univ. Press.
Kytö, M. 1994, Manual to the Diachronic Part of the Helsinki Corpus of English Texts: Coding Conventions and Lists of Source Texts, 2nd ed. Helsinki: Helsinki University Press for Department of English, University of Helsinki.
Lindsay, J. and O’Connell, D.C., 1995, “How do transcribers deal with audio recordings of spoken discourse?”, in Journal of Psycholinguistic Research, 2:101-115.
MacWhinney, B., 1995, The CHILDES-Project: Tools for Analyzing Talk. Second edition. Hillsdale, NJ: Lawrence Erlbaum
Nelson, Gerald (1996) The Design of the Corpus. In S. Greenbaum (ed.), pp. 27-35.
Okawara, M.H., 2006, A linguistic analysis of Some Japanese trademark cases. Ph.D. thesis, University of Sydney.
Okawara, M.H., forthcoming, “Legal Japanese viewed through the Unfair Competition Prevention Law”. In Gibbons, Prakasham and Tirumalesh, ed.s, Justice and Language. Delhi: Longman Orient.
Pajzs, J., 1991, “The Use of a Lemmatized Corpus for Compiling the Dictionary of Hungarian”, in Using Corpora: proceedings of the 7th annual conference of the UW Centre for the New OED and Text Research. Waterloo: UW Centre for the New OED and Text Research.
Renouf, A., 1987, “Corpus Development”. In Sinclair, ed.
Shapero, J.J., forthcoming, Ph.D. thesis, University of Birmingham.
Shuy, R., 1993, Language Crimes: the Use and Abuse of Language Evidence in the Courtroom. Cambridge, MA and Oxford: Blackwell.
Shuy, R., 2005, Creating Language Crimes: How Law Enforcement Uses (and Misuses) Language. Oxford: Oxford University Press.
Sinclair, J. 1982. “Reflections on computer corpora in English language research.” In Computer corpora in English language research, ed. Johansson, S.: 1-6. Bergen.
Sinclair, J.M., ed., 1987, Looking Up: An Account of the COBUILD Project in lexical computing. London: HarperCollins.
Solan, L., 1993, The Language of Judges. Chicago: U. of Chicago Press.
Stenström, Anna-Brita, Andersen, G., Hasund, K., Monstad, K. and Aas, H., 1998, User's manual to accompany the Bergen Corpus of London Teenage Language (COLT). Department of English, University of Bergen, Norway.
Svartvik, J. (1968) The Evans Statements: A case for forensic linguistics, Gothenburg: Gothenburg University Press
Tiersma, P.M., 1999, Legal language. Chicago: University of Chicago Press.
Walker, A.G., 1999, Handbook on Questioning Children: a Linguistic Perspective. Washington: American Bar Association.
License
When submitting a paper the author agrees to the following publishing agreement and processing personal data.
PUBLICATION AGREEMENT, COPYRIGHT LICENSE, PERSONAL DATA PROCESSING CONSENT
This is a publication agreement and copyright license (“Agreement”) regarding a written manuscript currently submitted via Pressto platform
(“Article”) to be published in Comparative Legilinguistics International Journal for Legal Communication (“Journal”).
The parties to this Agreement are:
the Author or Authors of the submitted article (individually, or if more than one author, collectively, “Author”) and Comparative Legilinguistics International Journal for Legal Communication (“Publisher”), address al. Niepodległości 4, 61-874 Poznań, represented by its editor in chief Aleksandra Matulewska.
§1. LICENSE OF COPYRIGHT
a) The Author and the Publisher agree that the Author grants a Creative Commons Attribution 4.0 International License, which is incorporated herein by reference and is further specified at Creative Commons — Attribution 4.0 International — CC BY 4.0 copyright license in the Article to the general public.
b) The Author grants to the Publisher a royalty-free, worldwide nonexclusive license to publish, reproduce, display, distribute, translate and use the Article in any form, either separately or as part of a collective work, including but not limited to a nonexclusive license to publish the Article in an issue of the Journal, copy and distribute individual reprints of the Article, authorize reproduction of the entire Article in another publication, and authorize reproduction and distribution of the Article or an abstract thereof by means of computerized retrieval systems (such as Westlaw, Lexis and SSRN). The Author retains ownership of all rights under copyright in the Article, and all rights not expressly granted in this Agreement.
c) The Author grants to the Publisher the power to assign, sublicense or otherwise transfer any and all licenses expressly granted to the Publisher under this Agreement.
d) Republication. The Author agrees to require that the Publisher be given credit as the original publisher in any republication of the Article authorized by the Author. If the Publisher authorizes any other party to republish the Article under the terms of paragraphs 1c and 1 of this Agreement, the Publisher shall require such party to ensure that the Author is credited as the Author.
§2. EDITING OF THE ARTICLE
a) The Author agrees that the Publisher may edit the Article as suitable for publication in the Journal. To the extent that the Publisher’s edits amount to copyrightable works of authorship, the Publisher hereby assigns all right, title, and interest in such edits to the Author.
§3. WARRANTIES
a) The Author represents and warrants that to the best of the Author’s knowledge the Article does not defame any person, does not invade the privacy of any person, and does not in any other manner infringe upon the rights of any person. The Author agrees to indemnify and hold harmless the Publisher against all such claims.
b) The Author represents and warrants that the Author has full power and authority to enter into this Agreement and to grant the licenses granted in this Agreement.
c) The Author represents and warrants that the Article furnished to the Publisher has not been published previously. For purposes of this paragraph, making a copy of the Article accessible over the Internet, including, but not limited to, posting the Article to a database accessible over the Internet, does not constitute prior publication so long as the as such copy indicates that the Article is not in final form, such as by designating such copy to be a “draft,” a “working paper,” or “work-in-progress”. The Author agrees to hold harmless the Publisher, its licensees and distributees, from any claim, action, or proceeding alleging facts that constitute a breach of any warranty enumerated in this paragraph.
§4. TERM
a) The agreement was concluded for an unspecified time.
§5. PAYMENT
a) The Author agrees and acknowledges that the Author will receive no payment from the Publisher for use of the Article or the licenses granted in this Agreement.
b) The Publisher agrees and acknowledges that the Publisher will not receive any payment from the Author for publication by the Publisher.
§6. ENTIRE AGREEMENT
a) This Agreement supersedes any and all other agreements, either oral or in writing, between the Author and the Publisher with respect to the subject of this Agreement. This Agreement contains all of the warranties and agreements between the parties with respect to the Article, and each party acknowledges that no representations, inducements, promises, or agreements have been made by or on behalf of any party except those warranties and agreements embodied in this Agreement.
b) In all cases not regulated by this Agreement, legal provisions of Polish Copyright Act and Polish Civil Code shall apply.
c) Any disputes arising from the enforcement of obligations connected with this Agreement shall be resolved by a court competent for the headquarters of the Publisher.
d) Any amendments or additions to the Agreement must be made in writing and signed by authorised representative of both parties, otherwise being ineffective.
e) This Agreement is signed electronically and the submission of the article via the PRESSto platform is considered as the conclusion of the Agreement by the Author and the Publisher.
f) Clause for consent to the processing of personal data - general
g) The Author shall give his or her consent to the processing of their personal data in accordance with the Act of 10 May 2018 on the protection of personal data and Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of persons physical in connection with the processing of personal data and on the free movement of such data, and repealing Directive 95/46 / EC (General Data Protection Regulation) for the purpose and in connection with making publications available on the PRESSto scientific journals platform and DeGruyter platform, guaranteeing the security of services rendered, and improving them.
I HAVE READ AND AGREE FULLY WITH THE TERMS OF THIS AGREEMENT.
The Author The Publisher