Abstract
The aim of the article is to present several problems encountered by the editor compiling the Corpus of Wrocław University of Technology. The problems are connected with the application of the method of disambiguation of meanings of words in text (WSD – Word Sense Disambiguation) during the work on material of phraseological character. Any researcher using this method confronts the problem of the lack of semantic independence of words entering into the composition of fixed word combinations. The necessity to develop a unique procedure of conduct for this type of words, inclined the author of the article to experiment in the isolation of independent meanings of words co-forming phraseologisms and to make an attempt at their typologization. The isolation of such meanings has proved possible, although the accuracy of their description is very unstable: in some cases the motivation of semantic changes—which the meaning of a word undergoes under the influence of “being petrified” in a set phrase—is transparent, in other cases, however, we can speak of a hypothetical source of changes rather than a definite modification of meaning. It is worth mentioning here about the rigour imposed by the adopted WSD marking method: a set of all meanings which a given word form may assume in the analysed text must be entered into the database. That is why the problem of providing the definition of words forming phraseologisms is vital from the point of view of a linguist dealing with machine processing of language.References
D. Bas, B. Broda, M. Piasecki, Towards Word Sende Disambiguation of Polish, w: W Proceedings of the International Multiconference on Computer Science and Information Technology --- 3rd International Symposium Advances in Artificial Intelligence and Applications, Wisła 2008.
A. Pajdzińska, A. M. Lewicki, Frazeologia, w: Współczesny język polski, Jerzy Bartmiński (red.) , Lublin 2001.
K. Siekierska, Frazeologia związana z rzeczownikiem głowa. Rys historyczny, „Prace Filologiczne”, nr 2.
S. Skorupka, Frazeologia i semantyka, „Poradnik Językowy”, z. 7, nr 8.
M. Stevenson, Y. Wilks, Word-Sense Disambiguation, w: The Oxford Handbook of Computional Linguistics, Ruslan Mitkov (red.), Oxford 2009.
License
Authors
Authors of texts accepted for publication in „Poznańskie Studia Polonistyczne. Seria Językoznawcza” are required to complete, sign and return to the editor's office the Agreement for granting a royalty-free license to works with a commitment to grant a CC sub-license.
Under the agreement, the authors of texts published in „Poznańskie Studia Polonistyczne. Seria Językoznawcza” grant the Adam Mickiewicz University in Poznań a non-exclusive, royalty-free license and authorize the use of Attribution-NoDerivatives 4.0 International (CC BY-ND 4.0) Creative Commons sub-license.
The authors retain the right to continue the free disposal of the work.
Users
Interested Internet users are entitled to use works published in „Poznańskie Studia Polonistyczne. Seria Językoznawcza” since 2016 under the following conditions:
- attribution - obligation to provide, together with the distributed work, information about the authorship, title, source (link to the original work, DOI) and the license itself.
- no derivatives - the work must be preserved in its original form, without the author's consent it is not possible to distribute the modified work, such as translations, publications, etc.
Copyrights are reserved for all texts published before 2016.
Miscellaneous
Adam Mickiewicz University in Poznań retains the right to magazines as a whole (layout, graphic form, title, cover design, logo etc.).