Összesen 1 találat.
#/oldal:
Részletezés:
Rendezés:

1.

001-es BibID:BIBFORM122768
Első szerző:Gál Zoltán (informatikus)
Cím:Deep Learning-Based Analysis of Ancient Greek Literary Texts in English Version: A Statistical Model Based on Word Frequency and Noise Probability for the Classification of Texts / Gál Zoltán, Tóth Erzsébet
Dátum:2024
ISSN:2061-2079
Megjegyzések:In our paper we intend to present a methodology that we elaborated for clustering texts based on the word frequency in the English translations of selected old Greek texts. We used the classification system of the ancient Library of Alexandria, devised by the prominent Greek scholar-poet, Callimachus in the 3rd century BC., as a basis for categorizing literary masterpieces. In our content analysis, we could determine a triplet of a, b, c values for describing a power function that appropriately fits a curve determined by the word frequencies in the texts. In addition, we have discovered 16 special features of the different texts that correspond to various token categories investigated in each text, such as part of speech of the word in the context, numerals, subordinate conjunction, symbols, etc. We have developed a cognitive model in which several hundred different subtexts were utilized for supervised learning with the aim of subtext class recognition. Concerning 200 subtexts, the triplet of a, b, c values, the classes of the subtexts, and their 16-dimensional feature vectors were learnt for the Recurrent Neural Network (RNN). It turned out that the Long-Short Term Memory RNN could efficiently predict which class a chosen subtext could be categorized into without considering the interpretation of the content. The influence of the non-zero error rate of new communication services on the meaning of the transferred texts was also investigated. The impact of the noise on the classification accuracy was found to be linear, dependent on the character error rate.
Tárgyszavak:Műszaki tudományok Informatikai tudományok idegen nyelvű folyóiratközlemény hazai lapban
folyóiratcikk
mélytanulás
ókori görög irodalmi szövegek
szövegklaszterezés
zajos szövegek
Pinakes
text classification
automatic content analysis
Recurrent Neural Network (RNN)
Long-Short Term Memory
Megjelenés:Infocommunications Journal. - 16 : Joint Special Issue on Cognitive Infocommunications and Cognitive Aspects of Virtual Reality (2024), p. 2-11. -
További szerzők:Tóth Erzsébet (1972-) (informatikus könyvtáros)
Pályázati támogatás:TKP2021-NKTA-34
Egyéb
Internet cím:Szerző által megadott URL
DOI
Intézményi repozitóriumban (DEA) tárolt változat
Borító:
Rekordok letöltése1