Development of an Information System for Reducing the Volume of Text Information in the Process of Information Search
DOI:
https://doi.org/10.22213/2410-9304-2017-3-94-99
Keywords:
word processing, information system, search words, text compression, information search
Abstract
The paper considers the use of specialized algorithms in an information system that compresses the text a user has to analyze during information retrieval. The work is motivated by the difficulty of retrieving information for a particular user task and by the need to process large amounts of text data. The goal is to reduce the volume of Russian-language text to be analyzed while preserving its semantic content. The main functional units of the developed information system are identified. The match search module generates a text consisting of several paragraphs that contain user-defined search phrases. This text is much smaller than the original and reflects the information the user is looking for. The compression module is an iterative procedure that further reduces the amount of text the user has selected for analysis. In the proposed approach, each word of a sentence is assigned a score determined from a number of criteria. A compact graphical user interface with a convenient layout of elements has been developed. The described approach significantly reduces the amount of text the user has to process during information retrieval. To reduce the amount of information further, it is proposed to develop a text compression module and implement it in practice.
References
MCR.DLL // Morphological analysis of the Russian language. URL: http://macrocosm.narod.ru/madown.html (accessed: 12.04.2017) (in Russian).
Blednov A. M. Development and study of models and information technology for semantic-syntactic analysis of Russian-language text: Cand. Sci. (Eng.) dissertation. Izhevsk, 2007. 120 p. (in Russian).
Mochenov S. V., Blednov A. M., Lugovskikh Yu. A. Application of statistical methods for semantic analysis of text. Izhevsk: Research Center "Regular and Chaotic Dynamics", 2005 (in Russian).
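Illustrative sketch
The search step described in the abstract, which keeps only the paragraphs containing user-defined search phrases, can be illustrated roughly as follows. This is a minimal sketch and not the authors' implementation: the function names, the blank-line paragraph delimiter, and the plain case-insensitive substring match are all assumptions made for illustration. In particular, the sketch performs no morphological analysis, so inflected Russian word forms would not be matched.

```python
import re


def extract_matching_paragraphs(text, phrases):
    """Return the paragraphs of `text` that contain at least one search phrase.

    Hypothetical helper: the name and behavior are illustrative only.
    """
    # Assumption: paragraphs are separated by one or more blank lines.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", text) if p.strip()]
    # Assumption: case-insensitive substring matching, no morphology.
    needles = [ph.casefold() for ph in phrases if ph.strip()]
    return [p for p in paragraphs if any(n in p.casefold() for n in needles)]


def reduction_ratio(original, reduced_paragraphs):
    """Fraction of the original character count removed by the filter."""
    if not original:
        return 0.0
    kept = sum(len(p) for p in reduced_paragraphs)
    return 1.0 - kept / len(original)


if __name__ == "__main__":
    sample = (
        "Text compression reduces the volume of data.\n\n"
        "The weather is nice today.\n\n"
        "Information search takes time."
    )
    kept = extract_matching_paragraphs(
        sample, ["text compression", "information search"]
    )
    for paragraph in kept:
        print(paragraph)
    print(f"reduction: {reduction_ratio(sample, kept):.2f}")
```

The word-scoring compression module is not sketched here because the abstract does not state the criteria used to score words.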
Published
02.10.2017
How to Cite
Vtyurin M. V., Yastrebov A. I., & Mochenov S. V. (2017). Development of an Information System for Reducing the Volume of Text Information in the Process of Information Search. Intellekt. Sist. Proizv., 15(3), 94–99. https://doi.org/10.22213/2410-9304-2017-3-94-99
Section
Informatics, Computer Science and Control