Facebook Instagram Twitter RSS Feed PodBean Back to top on side

Text Segmentation Using Roget-Based Weighted Lexical Chains

In: Computing and Informatics, vol. 32, no. 2
D. Tatar - D. Inkpen - G. Czibula
Detaily:
Rok, strany: 2013, 393 - 410
Kľúčové slová:
Lexical chains, text segmentation, topic boundaries, Roget's thesaurus, segmentation evaluation
O článku:
In this article we present a new method for text segmentation. The method relies on the number of lexical chains (LCs) which end in a sentence, which begin in the following sentence and which traverse the two successive sentences. The lexical chains are based on Roget's thesaurus (the 1987 and the 1911 version). We evaluate the method on ten texts from the DUC 2002 conference and on twenty texts from the CAST project corpus, using a manual segmentation as gold standard.
Ako citovať:
ISO 690:
Tatar, D., Inkpen, D., Czibula, G. 2013. Text Segmentation Using Roget-Based Weighted Lexical Chains. In Computing and Informatics, vol. 32, no.2, pp. 393-410. 1335-9150.

APA:
Tatar, D., Inkpen, D., Czibula, G. (2013). Text Segmentation Using Roget-Based Weighted Lexical Chains. Computing and Informatics, 32(2), 393-410. 1335-9150.