Abstract
This study addresses the acquisition of digital literary data, specifically prose texts of Czech literature, that can serve independent scientific research in digital humanities (DH) and computational literary studies. In the first part, we survey selected foreign textual databases and characterize them with respect to our goal: a digital data collection that is internally structured and machine-readable. We then turn to the Czech environment and present an emerging database of Czech prose texts. We describe its basic structure, the advantages of that structure, and concrete examples of how the database can be used in the statistical analysis of literary texts. We conclude that, given the current development of DH, demand is likely to grow not only for specialized web applications built on digital literary corpora but especially for direct access to such databases, since these allow highly variable and individualized research.
License
Copyright (c) 2025 Richard Změlík

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
